
Jon Noring wrote:
It is common in a DP book scan job to scan the pages at one resolution sufficient for text, then return and redo all the illustrations at a higher resolution. (There can be multiple illustrations per page, and an iluustration can be embedded within text.)
So the page scan filename system has to include this possibility.
Did you actually *read* my RFC before commenting on it? I ask, because if you had read it, you would have noticed this section: ------ A filename for a single-page djvu file containing an illustration scanned in a different resolution or color depth MUST follow this pattern: <prefix><page number>-<image position on the page>.djvu The <image position on the page> is "1" for the first image, "2" for the second, etc. --------
0003857-00035-28
Where '0003857' is the decimal identifier for the source book which was scanned -- 7 digits gives us 10,000,000 books (hexadecimal would be slightly more compact but not as human friendly) -- '00035' is the sequential page as appears in the source book (independent of any page numbering scheme which includes unnumbered blank pages), and "28" is the page number (or 'string') the publisher/author actually printed on the page to identify it.
This is more complicated and less robust than my proposal. 1. You don't need the ebook number because the ebook number will be in the filename of the multi-page djvu. 2. You don't want the ebook number because at the time of scanning, the ebook number is unknown. 3. You don't want the sequence number in the filename because it increases the probability that links to the page image break. If you have to insert a page all subsequent files will have to be renamed, and all links to them will break. In my proposal no link will break if you insert or remove pages (except a link to the removed page). 4. Who wants to know about "unnumbered blank pages"? You are not going to cite a blank page, are you? -- Marcello Perathoner webmaster@gutenberg.org