
Aye I can make a decision about the encoding. My guess is i could work with UTF-8 encoding for the file at first and see how that works, at first. I know there are a lot of entries that need to go into the gutindex-2009 file as well as the gutindex.all file. But I have a lot of time on my hands. I know the entries typically are divided by month, I can stick with the same format that's been used in prior gutindex-year files. Jared On Fri, Jun 26, 2009 at 9:12 PM, Andrew Sly <sly@victoria.tc.ca> wrote:
One thing you might want to do in that case, is make a clear decsion about the character encoding for that file.
The last time I asked, titles and author names were just being pasted into the gutindex files, regardless of what the encoding was, hoping that it would just all work out somehow. The result is a gutindex file which is mostly Latin-1, but with a few odd blips once in a while, for instance in Hungarian texts, where the wrong character can come out.
--Andrew
On Fri, 26 Jun 2009, Jared Buck wrote:
I'm perfectly willing to help update these files if the person who usually does the gutindex file updating doesn't have time to do it. I get all the latest postings of new books and it would be an easy thing to do to look through the posted-list archives to grab all the listings, and insert them into the gutindex file.
Jared
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/mailman/listinfo/gutvol-d
-- Bill Cosby <http://www.brainyquote.com/quotes/authors/b/bill_cosby.html> - "A word to the wise ain't necessary - it's the stupid ones that need the advice."