William Waites wrote:
On 10-05-26 16:22, Marcello Perathoner wrote:
catalog.rdf.bz2 gets updated every night.
Yes it does, but the information in there is very different from the individual files.
Not at all. The information is quite the same. (There's also author birth and death dates in the individual files, but otherwise its the same.)
I attach records from the catalogue and from the individual file.
The biggest problem is,
<http://www.gutenberg.org/feeds/catalog.rdf#etext27676> <http://www.gutenberg.org/ebooks/27676>
are completely different. And there's no way (other than hand coding special cases to make it from one to the other). So even if we were to initially populate with catalog.rdf.bz2 we couldn't then go and pull the detailed records.
Why should you do that? The big file contains the same information as the individual ones. -- Marcello Perathoner webmaster@gutenberg.org