Aye I can make a decision about the encoding. My guess is i could work with UTF-8 encoding for the file at first and see how that works, at first. I know there are a lot of entries that need to go into the gutindex-2009 file as well as the gutindex.all file. But I have a lot of time on my hands.
I know the entries typically are divided by month, I can stick with the same format that's been used in prior gutindex-year files.
Jared
One thing you might want to do in that case, is make a clear
decsion about the character encoding for that file.
The last time I asked, titles and author names were just being
pasted into the gutindex files, regardless of what the encoding
was, hoping that it would just all work out somehow. The result
is a gutindex file which is mostly Latin-1, but with a few odd
blips once in a while, for instance in Hungarian texts, where
the wrong character can come out.
--Andrew
_______________________________________________
On Fri, 26 Jun 2009, Jared Buck wrote:
> I'm perfectly willing to help update these files if the person who usually
> does the gutindex file updating doesn't have time to do it. I get all the
> latest postings of new books and it would be an easy thing to do to look
> through the posted-list archives to grab all the listings, and insert them
> into the gutindex file.
>
> Jared
>
>
gutvol-d mailing list
gutvol-d@lists.pglaf.org
http://lists.pglaf.org/mailman/listinfo/gutvol-d