I remember, because I use this scheme to name the files that are used in the 
Encyclopasdia Britannica project on DP.

Divided pages have a character appended to maintain order and distinction:
001a, 001b, 002a, 002b, etc.

This also gives me the happy distinction of having the only image files
that can be submitted more or less directly to PG without renaming the files.
And these files are consequently inherently cross-indexed with the texts.

Other DP projects, otoh, can at best give PG thousands of stacks of images 
uncorrelated with the texts, whether they have page numbers or not.

Now there's a crowd-source opportunity - renaming the image files.


On Mon, Feb 13, 2012 at 5:49 AM, Carlo Traverso <traverso@posso.dm.unipi.it> wrote:

I remember that there was some suggested naming scheme, but I don't
remember it since it was complicated. 12973 was a happy exception,
since numbering from 001 the titlepage and going on, the pages get the
same number as the page number.

I remember that the number in the filenames should match the page
number, but there was a series of rules for inserted pages, for
frontmatter, for initial pages numbered in roman, and for prefixes to
keep them in order.

 I have two books that are really weird: one in which the page numbers
are out of order in the original, and another in which a signature
repeats the numbers of a previous one (after 1...208 we have
197a....208a, then 209...285).

Carlo



_______________________________________________
gutvol-d mailing list
gutvol-d@lists.pglaf.org
http://lists.pglaf.org/mailman/listinfo/gutvol-d