Aye I can make a decision about the encoding. My guess is i could work with UTF-8 encoding for the file at first and see how that works, at first.  I know there are a lot of entries that need to go into the gutindex-2009 file as well as the gutindex.all file.  But I have a lot of time on my hands. 

I know the entries typically are divided by month, I can stick with the same format that's been used in prior gutindex-year files. 

Jared

On Fri, Jun 26, 2009 at 9:12 PM, Andrew Sly <sly@victoria.tc.ca> wrote:
One thing you might want to do in that case, is make a clear
decsion about the character encoding for that file.

The last time I asked, titles and author names were just being
pasted into the gutindex files, regardless of what the encoding
was, hoping that it would just all work out somehow. The result
is a gutindex file which is mostly Latin-1, but with a few odd
blips once in a while, for instance in Hungarian texts, where
the wrong character can come out.

--Andrew

On Fri, 26 Jun 2009, Jared Buck wrote:

> I'm perfectly willing to help update these files if the person who usually
> does the gutindex file updating doesn't have time to do it.  I get all the
> latest postings of new books and it would be an easy thing to do to look
> through the posted-list archives to grab all the listings, and insert them
> into the gutindex file.
>
> Jared
>
>
_______________________________________________
gutvol-d mailing list
gutvol-d@lists.pglaf.org
http://lists.pglaf.org/mailman/listinfo/gutvol-d



--

Bill Cosby  - "A word to the wise ain't necessary - it's the stupid ones that need the advice."