don said:
>   Al, Roger, Bird, anyone else,
>   For your list of scannos,
>   do you examine every instance
>   of every word on the list?
>   Many of those words
>   would be more frequently correct
>   than a misscan.

i recommend you not even consider testing for a specific scanno
unless you suspect that it will return more hits than false-alarms.

then i would collect data from every test, to _ensure_ that it does,
and stop using any that do not. that's a very high bar to clear, but
if a test returns more false-alarms than hits, it's wasting my time.
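
as a rough sketch of that bookkeeping (python, purely illustrative): the
class name, the method names, and the min_checked threshold are all my own
assumptions here, not part of any existing tool. the idea is simply to tally
every flagged instance as a hit or a false-alarm, and to retire a test once
enough data shows that it does not return more hits than false-alarms.

```python
from collections import defaultdict


class ScannoLog:
    """tally, per suspect word, how many flagged instances were real misscans."""

    def __init__(self):
        # word -> [hits, false_alarms]
        self.counts = defaultdict(lambda: [0, 0])

    def record(self, word, was_misscan):
        """log one examined instance: True for a real misscan, False for a false-alarm."""
        self.counts[word][0 if was_misscan else 1] += 1

    def keep(self, word, min_checked=20):
        """return True if the test for this word should stay on the list.

        min_checked is a hypothetical cutoff: below it there is too little
        data to judge, so the test stays in use for now.
        """
        hits, false_alarms = self.counts.get(word, (0, 0))
        if hits + false_alarms < min_checked:
            return True
        # the bar from the text: strictly more hits than false-alarms
        return hits > false_alarms

    def retired(self, min_checked=20):
        """list every word whose test has failed the bar, in sorted order."""
        return sorted(w for w in self.counts if not self.keep(w, min_checked))


# illustrative data only: "arid" (usually a misscan of "and") versus
# "he" (sometimes a misscan of "be", but far more often correct).
log = ScannoLog()
for verdict in (True, True, True, False):
    log.record("arid", verdict)
for verdict in (False, False, False, False, False, True):
    log.record("he", verdict)

print(log.retired(min_checked=4))
```

with the made-up tallies above, "arid" stays on the list and "he" is
dropped. in practice the threshold would want tuning against real data.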

it is more important and valuable to build an infrastructure which
acts quickly and efficiently on error-reports from your community
than to waste the time and energy of your digitization volunteers.

sadly, p.g. never gave itself the opportunity to learn that lesson...

-bowerbird