
19 Jul
2005
19 Jul
'05
5:36 p.m.
What I am about to say applies to Abbyy Finereader; I am not sure how well it applies to other OCR engines. When Abbyy receives grayscale data, before recognizing anything, it converts the image to a B/W image by applying a thresholding algorithm. (It chooses its own threshold based on what it thinks will give the best recognition.)
I believe ScanSoft works the same way. This is usually not a problem, but it can be with unevenly lit pages, pages which have faded at different rates, or pages where the printer goofed and didn't make an even impression.