this thread explains how to digitize a book quickly and easily.
search archive.org for "booksculture00mabiuoft" for the book.
***
ok, i ran a test on the end-of-line hyphenates...
> http://zenmarkuplanguage.com/grapes119.py
> http://zenmarkuplanguage.com/grapes119.txt
behold, up popped "first-hand" and "pre-eminently"...
although my dictionary currently accepts these as
fully-joined, non-hyphenated words, _this_ book
treated them both as _hyphenates_, even mid-line,
so i changed them appropriately in my newest text.
that "pre-" made me look at a few other common
compounds, like half-whatever and co-whatever...
that lead me to make a change to "co-ordination".
my dictionary considers it a non-hyphenate, but
this book had a case of "co-ordinating" midline,
so i reinserted the hyphen, just to be consistent...
***
so, our newest version of the text is now here:
> http://zenmarkuplanguage.com/grapes007.txt
***
plus i did final spellcheck with the book's dictionary...
> http://zenmarkuplanguage.com/grapes120.py
> http://zenmarkuplanguage.com/grapes120.txt
we're good to go...
***
now we can get down to an evaluation of our work...
given that we are now done with the text-cleaning,
i ran a check against my existing text of this book,
which has been _thoroughly_ cleaned many times...
and, all in all, the results were very good.
recall the book is over 200k, and 279 pages.
***
16 lines with o.c.r. errors going uncaught,
most of which were involving _punctuation_.
2 were on a name, which we shoulda checked.
error> plete expressions, in that concrete
right> plete expressions. In that concrete
change> =================^=^===============
error> and exhausts while he instructs; the
right> and exhausts while he instructs, the
change> ===============================^====
error> bought Cary's crib, and took it with
right> bought Gary's crib, and took it with
change> =======^============================
error> read my Cary's Plato. It so hap-
right> read my Gary's Plato. It so hap-
change> ========^=======================
error> The latest of them, Count Tolstoi's
right> The latest of them. Count Tolstoi's
change> ==================^================
error> mass the facts about any given period;
right> mass the facts about any given period,
change> =====================================^
error> The reality of this element of per-
right> The reality of this, element of per-
change> ===================^----------------
error> all they saw and knew a part of them-
right> ail they saw and knew a part of them-
change> =^===================================
error> process, and culture and genius stand
right> process, and culture and genius stand,
change> =====================================^
error> there has always been not only a
right> there has always been, not only a
change> =====================^-----------
error> stories and constantly touched upon
right> stones and constantly touched upon
change> ===^-------------------------------
error> exploration, travel, and discovery; he
right> exploration, travel, and discovery; ha
change> =====================================^
error> There are, it is true, a few men and
right> There are. It is true, a few men and
change> =========^=^========================
error> from the days of the earliest Greek
right> from the days of the earliest Greek,
change> ===================================^
error> sion, thought, impulse, which never
right> sion, thought. Impulse, which never
change> =============^=^===================
error> grance, and growth which lie enfolded
right> grance, and growth which he enfolded
change> =========================^^^^^^^^^^^^
***
4 new paragraphs that were missed, which shows
you really can't skip this check like i did...
> This faculty of draining all the
> This contact with the richest per-
> This deepest and most vital of all
> The myth-makers endeavoured to
***
4 "errors" that were present in the p-book...
> XIV. Racial Experience ......... 165
> vironment, and the history of races
> man. This in not true of secondary
> selves throug all fields of thought,
***
...plus i found two errors in the "criterion"...
...which just goes to show perfection is hard...
***
again, all in all, a great performance for
the amount of work we put into cleaning,
a sturdy example of the cost-benefit ratio.
-bowerbird