
as i said, one of the first steps in doing our little mash-up is to sync up the paragraphs. in doing so, i found a half-dozen mistakes that jim had made in the paragraphing... for others who want to verify these errors, i suggest you look directly at jim's #29452:
in addition, i'll give you the u.r.l. to see the actual scans for each page up on my site... *** in 3 cases, jim missed a paragraph break:
Her response, when it came, was cold but http://z-m-l.com/go/wotdjp026.html
She put it as to his caring to know http://z-m-l.com/go/wotdjp263.html
There was a finer http://z-m-l.com/go/wotdjp317.html
*** in another 3 cases, jim incorrectly broke an existing paragraph into two paragraphs.
This was, fortunately for her http://z-m-l.com/go/wotdjp123.html
What queerer consequence http://z-m-l.com/go/wotdjp181.html
It just faintly rankled in her http://z-m-l.com/go/wotdjp201.html
*** 6 paragraphing mistakes is not bad performance. with a book containing some 330 pages, like this, i would say that it's probably about an average job. *** besides, my point is never to say "gotcha! errors!" as super-proofer jose menendez has proven, i make my fair share of book-digitizing errors. so that's not the point. there are several big issues that _are_ the point: 1. comparing digitizations is a great way to pinpoint errors so that they can be corrected. i have made this point in repeated examples. 2. most of the books in the library have errors. even the best ones, which were done recently... if you're convinced there are no errors there, you just don't know how to find them, and i strongly suggest you return to the first point. 3. most of the p.g. e-texts will be used only to proof scan-sets that retain the book's structure, and then the p.g. e-text will simply be discarded, since it doesn't contain that important structure. 4. the p.g. plain-text format has a lot of power and beauty inside it, if it's merely extended a bit, which is precisely what i did when i created z.m.l. -bowerbird