
I've looked at this some more and I've come to the conclusion that working at the page-level is sufficient but inefficient, and that working at the book-level is efficient but insufficient. Here's what I base that on. In my latest experiement on a new book, I concatenated all the text files first and did the global corrections. A lot of improvements were made quickly. When I was convinced I had done all I could, I burst that back into individual pages and loaded it into PPE, the page-at-a-time editor which presents the image, and edit window, and an analysis window simultaneously. I found two types of errors that I could not have found at the book level and which would have also likely been missed by the smoothies. The first was where the text of two paragraphs was combined into one, without an intervening blank line. This happened in the middle of two separate pages. Abbyy doesn't preserve the start-of-paragraph indent, so especially if the first paragraph ends its last line near the ight margin, this error is invisible unless you are looking at the image. The other are words incorrectly marked as italic. Typically they are short words ("I" and "we" most commonly), but not always. If you can't see the image, you can't expect to get these right. I had to undo about 8 of these. For the next book, likely I'll do this same procedure: full text and then page at at time in PPE. The first makes it easier; the second makes it right. --Roger