
jon noring said:
Unfortunately I do not have OCR output
did you do o.c.r. on it? if you can retrieve the output, that would be good. it would allow people to do research on assessing/improving o.c.r. quality, and assist programmers in developing post-o.c.r. text-cleanup programs. (but, from later posts, it looks like you grabbed the text from elsewhere. so what you've done is "blessed" somebody else's work as "trustworthy", presumably after checking it, and maybe correcting it. you could also have done that same thing using project gutenberg's version of the text, since my comparison of the two files shows them to be very similar, so much so that i expect they were indeed based on the same version.)
I'll zip up the 600 dpi 2-color (B&W) scans which have already gone through a clean-up stage (they will be PNG files, and occupy, if memory serves me right, about 50 megs of space)
those are too big for my purposes, and for me to download. but could i reimburse you for sending them to me on a cd? or the 120-dpi versions would work just fine for my project, the same ones that are on the website, just zipped together.
Remind me if I don't answer anytime soon.
sure thing.
Thanks. I look forward to it! (Really, I do.)
great. -bowerbird