
I remember, because I use this scheme to name the files that are used in the Encyclopasdia Britannica project on DP. Divided pages have a character appended to maintain order and distinction: 001a, 001b, 002a, 002b, etc. This also gives me the happy distinction of having the only image files that can be submitted more or less directly to PG without renaming the files. And these files are consequently inherently cross-indexed with the texts. Other DP projects, otoh, can at best give PG thousands of stacks of images uncorrelated with the texts, whether they have page numbers or not. Now there's a crowd-source opportunity - renaming the image files. On Mon, Feb 13, 2012 at 5:49 AM, Carlo Traverso <traverso@posso.dm.unipi.it>wrote:
I remember that there was some suggested naming scheme, but I don't remember it since it was complicated. 12973 was a happy exception, since numbering from 001 the titlepage and going on, the pages get the same number as the page number.
I remember that the number in the filenames should match the page number, but there was a series of rules for inserted pages, for frontmatter, for initial pages numbered in roman, and for prefixes to keep them in order.
I have two books that are really weird: one in which the page numbers are out of order in the original, and another in which a signature repeats the numbers of a previous one (after 1...208 we have 197a....208a, then 209...285).
Carlo
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/mailman/listinfo/gutvol-d