Re: [gutvol-d] RFC: Posting Page Scans in DJVU Format

18 Jul 2005

      Jon Noring wrote:
...
It is common in a DP book scan job to scan the pages at one
resolution sufficient for text, then return and redo all the
illustrations at a higher resolution. (There can be multiple
illustrations per page, and an iluustration can be embedded within
text.)
So the page scan filename system has to include this possibility.
Did you actually *read* my RFC before commenting on it? I ask, because 
if you had read it, you would have noticed this section:

------

A filename for a single-page djvu file containing an illustration 
scanned in a different resolution or color depth MUST follow this pattern:

   <prefix><page number>-<image position on the page>.djvu

The <image position on the page> is "1" for the first image, "2" for the 
second, etc.

--------
...
0003857-00035-28
Where '0003857' is the decimal identifier for the source book which 
was scanned -- 7 digits gives us 10,000,000 books (hexadecimal would 
be slightly more compact but not as human friendly) -- '00035' is the
 sequential page as appears in the source book (independent of any
page numbering scheme which includes unnumbered blank pages), and
"28" is the page number (or 'string') the publisher/author actually
printed on the page to identify it.
This is more complicated and less robust than my proposal.

1. You don't need the ebook number because the ebook number will be in 
the filename of the multi-page djvu.

2. You don't want the ebook number because at the time of scanning, the 
ebook number is unknown.

3. You don't want the sequence number in the filename because it 
increases the probability that links to the page image break. If you 
have to insert a page all subsequent files will have to be renamed, and 
all links to them will break. In my proposal no link will break if you 
insert or remove pages (except a link to the removed page).

4. Who wants to know about "unnumbered blank pages"? You are not going 
to cite a blank page, are you?

-- 
Marcello Perathoner
webmaster@gutenberg.org