Has anyone noticed that the gutindex files for this year haven't been updated since January?

I'm perfectly willing to help update these files if the person who usually does the gutindex file updating doesn't have time to do it. I get all the latest postings of new books and it would be an easy thing to do to look through the posted-list archives to grab all the listings, and insert them into the gutindex file. Jared -- Yogi Berra <http://www.brainyquote.com/quotes/authors/y/yogi_berra.html> - "I never said most of the things I said."

One thing you might want to do in that case, is make a clear decsion about the character encoding for that file. The last time I asked, titles and author names were just being pasted into the gutindex files, regardless of what the encoding was, hoping that it would just all work out somehow. The result is a gutindex file which is mostly Latin-1, but with a few odd blips once in a while, for instance in Hungarian texts, where the wrong character can come out. --Andrew On Fri, 26 Jun 2009, Jared Buck wrote:
I'm perfectly willing to help update these files if the person who usually does the gutindex file updating doesn't have time to do it. I get all the latest postings of new books and it would be an easy thing to do to look through the posted-list archives to grab all the listings, and insert them into the gutindex file.
Jared

Aye I can make a decision about the encoding. My guess is i could work with UTF-8 encoding for the file at first and see how that works, at first. I know there are a lot of entries that need to go into the gutindex-2009 file as well as the gutindex.all file. But I have a lot of time on my hands. I know the entries typically are divided by month, I can stick with the same format that's been used in prior gutindex-year files. Jared On Fri, Jun 26, 2009 at 9:12 PM, Andrew Sly <sly@victoria.tc.ca> wrote:
One thing you might want to do in that case, is make a clear decsion about the character encoding for that file.
The last time I asked, titles and author names were just being pasted into the gutindex files, regardless of what the encoding was, hoping that it would just all work out somehow. The result is a gutindex file which is mostly Latin-1, but with a few odd blips once in a while, for instance in Hungarian texts, where the wrong character can come out.
--Andrew
On Fri, 26 Jun 2009, Jared Buck wrote:
I'm perfectly willing to help update these files if the person who usually does the gutindex file updating doesn't have time to do it. I get all the latest postings of new books and it would be an easy thing to do to look through the posted-list archives to grab all the listings, and insert them into the gutindex file.
Jared
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/mailman/listinfo/gutvol-d
-- Bill Cosby <http://www.brainyquote.com/quotes/authors/b/bill_cosby.html> - "A word to the wise ain't necessary - it's the stupid ones that need the advice."

Uhhh !???? Either a file is in an encoding or not. Of course I can read and edit in any encoding I want, but that screws everything. So what is the file suppose to be. So the first step would decide what encoding, then save in this encoding, check to make sure everything is correct and then start updating. regards Keith. Am 27.06.2009 um 06:12 schrieb Andrew Sly:
One thing you might want to do in that case, is make a clear decsion about the character encoding for that file.
The last time I asked, titles and author names were just being pasted into the gutindex files, regardless of what the encoding was, hoping that it would just all work out somehow. The result is a gutindex file which is mostly Latin-1, but with a few odd blips once in a while, for instance in Hungarian texts, where the wrong character can come out.
--Andrew

I decided on using UTF-8 encoding. The text editor I used for the editing test, Notepad++, has an option to convert to UTF-9 encoding. Saving the file then with the encoding seems to preserve the special characters in the file when I view it in Notepad. However, any characters that were entered before but saved in Latin-1 will not show up or will show up as funny characters that don't display correct, so I would need to re-enter said characters and save again in UTF-8 in Notepad++. Will this be acceptable to everyone? Jared On Sat, Jun 27, 2009 at 5:26 AM, Keith J. Schultz <schultzk@uni-trier.de>wrote:
Uhhh !????
Either a file is in an encoding or not. Of course I can read and edit in any encoding I want, but that screws everything. So what is the file suppose to be.
So the first step would decide what encoding, then save in this encoding, check to make sure everything is correct and then start updating.
regards Keith.
Am 27.06.2009 um 06:12 schrieb Andrew Sly:
One thing you might want to do in that case, is make a clear
decsion about the character encoding for that file.
The last time I asked, titles and author names were just being pasted into the gutindex files, regardless of what the encoding was, hoping that it would just all work out somehow. The result is a gutindex file which is mostly Latin-1, but with a few odd blips once in a while, for instance in Hungarian texts, where the wrong character can come out.
--Andrew
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/mailman/listinfo/gutvol-d
-- Alfred Hitchcock<http://www.brainyquote.com/quotes/authors/a/alfred_hitchcock.html> - "Television has brought back murder into the home - where it belongs."

Has anyone discussed the maintenance of the gutindex files with Greg Newby? If not, it would be advisable. Al ----- Original Message ----- From: Jared Buck To: Project Gutenberg Volunteer Discussion Sent: Saturday, June 27, 2009 7:28 PM Subject: [gutvol-d] Re: Has anyone noticed that the gutindex files for thisyear haven't been updated since January? I decided on using UTF-8 encoding. The text editor I used for the editing test, Notepad++, has an option to convert to UTF-9 encoding. Saving the file then with the encoding seems to preserve the special characters in the file when I view it in Notepad. However, any characters that were entered before but saved in Latin-1 will not show up or will show up as funny characters that don't display correct, so I would need to re-enter said characters and save again in UTF-8 in Notepad++. Will this be acceptable to everyone? Jared On Sat, Jun 27, 2009 at 5:26 AM, Keith J. Schultz <schultzk@uni-trier.de> wrote: Uhhh !???? Either a file is in an encoding or not. Of course I can read and edit in any encoding I want, but that screws everything. So what is the file suppose to be. So the first step would decide what encoding, then save in this encoding, check to make sure everything is correct and then start updating. regards Keith. Am 27.06.2009 um 06:12 schrieb Andrew Sly: One thing you might want to do in that case, is make a clear decsion about the character encoding for that file. The last time I asked, titles and author names were just being pasted into the gutindex files, regardless of what the encoding was, hoping that it would just all work out somehow. The result is a gutindex file which is mostly Latin-1, but with a few odd blips once in a while, for instance in Hungarian texts, where the wrong character can come out. --Andrew _______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/mailman/listinfo/gutvol-d -- Alfred Hitchcock - "Television has brought back murder into the home - where it belongs." ------------------------------------------------------------------------------ _______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/mailman/listinfo/gutvol-d

Here is a little background, for anyone following the conversation, wondering what this is about. Years ago, the gutindex files used to be the _only_ way to locate a text at project gutenberg. If you care to search you can still find old "newbie instructions" written by Michael Hart, describing how to search the gutindex files, find a "base file name", enter command line ftp instructions to retrieve the file, etc. This purpose has been for the most part superceded with the use of online catalogs. (An old one on the promo.net site, and then the current incarnation in use.) However, the gutindex files have been maintained until relativly recently, and could still be useful in some situations. They are basically, plain text files, with each PG text identified on its own line. Over time, more various conventions and additional comments, cross-referencing, and so forth were getting added and making it very bulky. Having a catalog-type record for the definitive information source for each text began to make much more sence. (And thankfully made the distinction between the old "base-file-name" system, and the current numbered system invisible to the average user.) On our current website, Marcello has identified them as "offline catalogs" and they are linked to from: http://www.gutenberg.org/wiki/Gutenberg:Offline_Catalogs Andrew On Sat, 27 Jun 2009, Al Haines (shaw) wrote:
Has anyone discussed the maintenance of the gutindex files with Greg Newby? If not, it would be advisable.
Al

We certainly wouldn't mind if anyone would like to take them over and keep tham up to date. . . . Michael On Sat, 27 Jun 2009, Andrew Sly wrote:
Here is a little background, for anyone following the conversation, wondering what this is about.
Years ago, the gutindex files used to be the _only_ way to locate a text at project gutenberg. If you care to search you can still find old "newbie instructions" written by Michael Hart, describing how to search the gutindex files, find a "base file name", enter command line ftp instructions to retrieve the file, etc.
This purpose has been for the most part superceded with the use of online catalogs. (An old one on the promo.net site, and then the current incarnation in use.)
However, the gutindex files have been maintained until relativly recently, and could still be useful in some situations.
They are basically, plain text files, with each PG text identified on its own line. Over time, more various conventions and additional comments, cross-referencing, and so forth were getting added and making it very bulky.
Having a catalog-type record for the definitive information source for each text began to make much more sence. (And thankfully made the distinction between the old "base-file-name" system, and the current numbered system invisible to the average user.)
On our current website, Marcello has identified them as "offline catalogs" and they are linked to from: http://www.gutenberg.org/wiki/Gutenberg:Offline_Catalogs
Andrew
On Sat, 27 Jun 2009, Al Haines (shaw) wrote:
Has anyone discussed the maintenance of the gutindex files with Greg Newby? If not, it would be advisable.
Al
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/mailman/listinfo/gutvol-d

Which I wouldn't mind doing, being the official keeper of these files :) I have already stated that I find UTF-8 to be the best encoding to use and will save them in that format. I will have to go back and fix the special characters previously inserted via Latin-1 format so they display correctly. Then someone can upload the files (since I don't have FTP upload access to the PG servers) and I can keep them updated once per week like they used to be. Jared On Sat, Jun 27, 2009 at 10:08 PM, Michael S. Hart <hart@pglaf.org> wrote:
We certainly wouldn't mind if anyone would like to take them over and keep tham up to date. . . .
Michael
On Sat, 27 Jun 2009, Andrew Sly wrote:
Here is a little background, for anyone following the conversation, wondering what this is about.
Years ago, the gutindex files used to be the _only_ way to locate a text at project gutenberg. If you care to search you can still find old "newbie instructions" written by Michael Hart, describing how to search the gutindex files, find a "base file name", enter command line ftp instructions to retrieve the file, etc.
This purpose has been for the most part superceded with the use of online catalogs. (An old one on the promo.net site, and then the current incarnation in use.)
However, the gutindex files have been maintained until relativly recently, and could still be useful in some situations.
They are basically, plain text files, with each PG text identified on its own line. Over time, more various conventions and additional comments, cross-referencing, and so forth were getting added and making it very bulky.
Having a catalog-type record for the definitive information source for each text began to make much more sence. (And thankfully made the distinction between the old "base-file-name" system, and the current numbered system invisible to the average user.)
On our current website, Marcello has identified them as "offline catalogs" and they are linked to from: http://www.gutenberg.org/wiki/Gutenberg:Offline_Catalogs
Andrew
On Sat, 27 Jun 2009, Al Haines (shaw) wrote:
Has anyone discussed the maintenance of the gutindex files with Greg
Newby? If not, it would be advisable.
Al
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/mailman/listinfo/gutvol-d
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/mailman/listinfo/gutvol-d
-- Yogi Berra <http://www.brainyquote.com/quotes/authors/y/yogi_berra.html> - "I never said most of the things I said."

Hi, Could someone with admin privelges give Jared access to the server, or at least to the directory(ies) to keep the files properly update. regards Keith. Am 28.06.2009 um 07:13 schrieb Jared Buck:
Which I wouldn't mind doing, being the official keeper of these files :) I have already stated that I find UTF-8 to be the best encoding to use and will save them in that format. I will have to go back and fix the special characters previously inserted via Latin-1 format so they display correctly.
Then someone can upload the files (since I don't have FTP upload access to the PG servers) and I can keep them updated once per week like they used to be.
Jared
On Sat, Jun 27, 2009 at 10:08 PM, Michael S. Hart <hart@pglaf.org> wrote:
We certainly wouldn't mind if anyone would like to take them over and keep tham up to date. . . .
Michael
On Sat, 27 Jun 2009, Andrew Sly wrote:
Here is a little background, for anyone following the conversation, wondering what this is about.
Years ago, the gutindex files used to be the _only_ way to locate a text at project gutenberg. If you care to search you can still find old "newbie instructions" written by Michael Hart, describing how to search the gutindex files, find a "base file name", enter command line ftp instructions to retrieve the file, etc.
This purpose has been for the most part superceded with the use of online catalogs. (An old one on the promo.net site, and then the current incarnation in use.)
However, the gutindex files have been maintained until relativly recently, and could still be useful in some situations.
They are basically, plain text files, with each PG text identified on its own line. Over time, more various conventions and additional comments, cross-referencing, and so forth were getting added and making it very bulky.
Having a catalog-type record for the definitive information source for each text began to make much more sence. (And thankfully made the distinction between the old "base-file-name" system, and the current numbered system invisible to the average user.)
On our current website, Marcello has identified them as "offline catalogs" and they are linked to from: http://www.gutenberg.org/wiki/Gutenberg:Offline_Catalogs
Andrew
On Sat, 27 Jun 2009, Al Haines (shaw) wrote:
Has anyone discussed the maintenance of the gutindex files with
Greg Newby? If not, it would be advisable.
Al
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/mailman/listinfo/gutvol-d
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/mailman/listinfo/gutvol-d
--
Yogi Berra - "I never said most of the things I said." _______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/mailman/listinfo/gutvol-d

On Sat, Jun 27, 2009 at 10:13:41PM -0700, Jared Buck wrote:
Which I wouldn't mind doing, being the official keeper of these files :) I have already stated that I find UTF-8 to be the best encoding to use and will save them in that format. I will have to go back and fix the special characters previously inserted via Latin-1 format so they display correctly.
Then someone can upload the files (since I don't have FTP upload access to the PG servers) and I can keep them updated once per week like they used to be.
Jared
Hi, Jared. As you saw, these files haven't been updated in 2009. Any reasonable format is good for us...it doesn't need to be exactly the same as George used to do them. My main strong desire is to include changes -- we often have catalog changes in the first few days after posting, and we also do periodic updates to older items. These can be tough to automate. Depending on how regularly you want to do this, I could set up a location for you to upload via FTP or whatever. In the meantime, I suggest you could just email me updated files, and I can get them posted. THANKS for taking this on. It is much harder than it seems at first glance. But feel encouraged to try things in different ways...not necessarily backward-compatible. -- Greg
On Sat, Jun 27, 2009 at 10:08 PM, Michael S. Hart <hart@pglaf.org> wrote:
We certainly wouldn't mind if anyone would like to take them over and keep tham up to date. . . .
Michael
On Sat, 27 Jun 2009, Andrew Sly wrote:
Here is a little background, for anyone following the conversation, wondering what this is about.
Years ago, the gutindex files used to be the _only_ way to locate a text at project gutenberg. If you care to search you can still find old "newbie instructions" written by Michael Hart, describing how to search the gutindex files, find a "base file name", enter command line ftp instructions to retrieve the file, etc.
This purpose has been for the most part superceded with the use of online catalogs. (An old one on the promo.net site, and then the current incarnation in use.)
However, the gutindex files have been maintained until relativly recently, and could still be useful in some situations.
They are basically, plain text files, with each PG text identified on its own line. Over time, more various conventions and additional comments, cross-referencing, and so forth were getting added and making it very bulky.
Having a catalog-type record for the definitive information source for each text began to make much more sence. (And thankfully made the distinction between the old "base-file-name" system, and the current numbered system invisible to the average user.)
On our current website, Marcello has identified them as "offline catalogs" and they are linked to from: http://www.gutenberg.org/wiki/Gutenberg:Offline_Catalogs
Andrew
On Sat, 27 Jun 2009, Al Haines (shaw) wrote:
Has anyone discussed the maintenance of the gutindex files with Greg
Newby? If not, it would be advisable.
Al
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/mailman/listinfo/gutvol-d
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/mailman/listinfo/gutvol-d
--
Yogi Berra <http://www.brainyquote.com/quotes/authors/y/yogi_berra.html> - "I never said most of the things I said."
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/mailman/listinfo/gutvol-d

On 28 Jun 2009, at 06:13, Jared Buck wrote:
Which I wouldn't mind doing, being the official keeper of these files :) I have already stated that I find UTF-8 to be the best encoding to use and will save them in that format. I will have to go back and fix the special characters previously inserted via Latin-1 format so they display correctly.
Then someone can upload the files (since I don't have FTP upload access to the PG servers) and I can keep them updated once per week like they used to be.
Jared
I have just uploaded two files to: http://textual.net/glist.txt http://textual.net/gbiglist.txt The first is an attempt to create a gutindex file for 2009. The second should have all the emails of posted for 2009. Both are in reverse order by reference number - largest first. The first file will need to be edited by reference to the 2nd. There are some gaps - the most recent emails, and a big hole - 27825 to 28013 where I do not have the posted emails. Where is the archive? GUTINDEX.ALL goes up to 27930. I have not dealt with the encoding issue. Philip Baker

On Tue, Jun 30, 2009 at 4:12 PM, Philip Baker <phil@thalasson.com> wrote:
There are some gaps - the most recent emails, and a big hole - 27825 to 28013 where I do not have the posted emails. Where is the archive? GUTINDEX.ALL goes up to 27930.
The archive is at http://lists.pglaf.org/pipermail/posted/ However, there is a gap due to technical problems earlier this year. I have the posted emails from that period and can send them to you, if you want. /me rummages through mail folders. I have the posted emails for #28566 - 19744. Mid-April of this year through early november of '06. Someone else probably has a more complete archive. R C

On 30 Jun 2009, at 22:05, Robert Cicconetti wrote:
On Tue, Jun 30, 2009 at 4:12 PM, Philip Baker <phil@thalasson.com> wrote: There are some gaps - the most recent emails, and a big hole - 27825 to 28013 where I do not have the posted emails. Where is the archive? GUTINDEX.ALL goes up to 27930.
The archive is at http://lists.pglaf.org/pipermail/posted/
However, there is a gap due to technical problems earlier this year.
I have the posted emails from that period and can send them to you, if you want.
/me rummages through mail folders.
Thanks for the offer but don't do anything, at least for the moment. I am going to look at the possibility of using the RDF file for what I want. Philip Baker
participants (8)
-
Al Haines (shaw)
-
Andrew Sly
-
Greg Newby
-
Jared Buck
-
Keith J. Schultz
-
Michael S. Hart
-
Philip Baker
-
Robert Cicconetti