
ok then, let's go to work...
greg asked for actual examples of how online scans might be used to good effect by project gutenberg. one good set of scans is that created by jon noring, for his "my antonia" demo. those scans are here:
i subjected the scans to o.c.r. using finereader v7. one of the options in finereader is to output a .pdf. i did that, and have uploaded that .pdf for your perusal:
if you look at this .pdf, you will find that finereader does a rather amazing job of retaining the book's formatting. of course, there are scannos in the text, which make it unusable as is, but from a _formatting_ standpoint, it is fine. if the .pdf we create at the end of our digitization looked as good as this one finereader makes automatically, just from the scans, we could feel proud of ourselves...
that's enough for today... tomorrow we'll go on to look at the o.c.r. output itself...
-bowerbird