
On 7/17/05, Marcello Perathoner <marcello@perathoner.de> wrote:
DP scans already are single-page per file.
But my originals aren't. That's not a big deal, but missplit pages can be a pain. So can pages the OCR badly despeckled.
They just have to implement some small code to rename the files while compressing them to djvu.
It's not a matter of code; it's a matter of human effort for this on every project. Most of my projects can't be described as page number + offset; if nothing else, illustrations aren't numbered in with the rest of the book.
Your assumptions are that only humans need to find the right page in the scans and that this will be a rare task. I don't agree with either of them.
Computers will have a hard time using page numbers considering how inconsistent they are.
I don't want to weep over the lost page numbers as we are weeping today over the lost accents.
We aren't losing anything; we just aren't recording it today.
- user reporting of errata - validation of user reported errata
Flip through the pages.
- scholarly online citation of PG texts
Read the page number off the bottom of the screen or off the HTML file.
Suppose we post a multi-page djvu, where the filenames of the single-page djvu files it contains are not in sync with the real page numbers. Some scholar will then cite a work like this:
See "Foo" by Bar, Page 13 http://www.gutenberg.org/files/12345/12345.djvu#19.djvu
Note: she links to internal file 19.djvu because this file just happens to contain page 13.
And the autoredirect code will break the link. Recently, I linked to http://www.gutenberg.org/files/12345/images/gothic.png in an email message, and it refused to load for some of my recipients. In any case, page 13 is enough to find the section. I don't know that we can guarentee that the page numbers will be stable, given that we may need to rename files to insert a missing page or illustration, and in practice if they insist on using the internal djvu files, they will usually stay the same. At least as likely as any URL you hand around.