here's your huck finn update for today...

***

i reduced the diffs from ~3,000 to ~2,000.

so now all of the words/letters are "correct".

(the job of standardizing word-forms over
the entire book becomes very problematic
when dialect words get phonetic spelling.)

so now the ~2,000 diffs are on punctuation.

based on my memory of punctuation in
resolving ~1,000 diffs so far, i'd say that
approximately 95% was right in the o.c.r.

if that were to hold up, it'd mean we now
have roughly 100 o.c.r. punctuation errors.

i'm not sure it is worth it to resolve those
~2,000 diffs to fix 100 punctuation errors.

(although, to restate the obvious again,
that would be much more efficient than
checking _all_ the punctuation manually,
which is the workflow used over at d.p.)

so this experiment confirms my earlier
experience that different editions _can_
make comparison less worthwhile to do.
(depends, of course, on how many edits
were done to the text in the new edition.)

it's especially true if you have the option
to use an independent digitization of the
_same_ edition, even if it'd be some work.

as i _do_ have an independent digitization
of the same edition, i will change course
to execute "the final push" for this book.

-bowerbird