
9 Oct 2012, 4:50 a.m.
From decomposing the PG catalog, here is what appears to be the structure of the data.
The basic unit, of course, is the etext, which is identified by its ID number. Each etext has the following associated fields, with the cardinality shown:

1. Download count - exactly one value.
2. Date created - exactly one value.
3. An indicator of whether it's an audio file.
4. Zero or more titles.
5. Zero or more "creators" (authors).
6. Zero or more contributors.
7. Zero or more descriptions.
8. One or more languages.
9. Zero or more subjects (about 1/4 of the etexts have no subject).
10. Zero or more "alternatives" - probably alternate titles.
11. Zero or one "friendly title".

Plus one indicator for each output format in which a file is provided.
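To make the cardinalities concrete, here is a rough sketch of how one etext record could be held in memory. It is only an illustration of the structure described above, not the catalog's actual schema: the class name, the field names, and the choice of a set of strings for the format indicators are all my own assumptions.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import List, Optional, Set


@dataclass
class Etext:
    """One catalog entry. All names here are hypothetical, not PG's own."""

    etext_id: int                      # the identifying ID number
    download_count: int                # exactly one value
    date_created: date                 # exactly one value
    languages: List[str]               # one or more (checked below)
    is_audio: bool = False             # audio-file indicator
    titles: List[str] = field(default_factory=list)        # zero or more
    creators: List[str] = field(default_factory=list)      # zero or more
    contributors: List[str] = field(default_factory=list)  # zero or more
    descriptions: List[str] = field(default_factory=list)  # zero or more
    subjects: List[str] = field(default_factory=list)      # zero or more
    alternatives: List[str] = field(default_factory=list)  # zero or more
    friendly_title: Optional[str] = None                   # zero or one
    # One indicator per output format provided; modeled here as a set
    # of format labels (an assumption about representation).
    formats: Set[str] = field(default_factory=set)

    def __post_init__(self) -> None:
        # Languages is the only multi-valued field that may not be empty.
        if not self.languages:
            raise ValueError("an etext needs at least one language")
```

A record with made-up values would then be built as `Etext(etext_id=1, download_count=0, date_created=date(2012, 10, 9), languages=["en"])`, with every optional list starting out empty.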