
On 23 February 2010 15:47, Jim Adcock <jimad@msn.com> wrote:
Begs the question why DP doesn't just institute a quality hosted OCR and let people just submit the page images. Ask people to test run a couple pages by the hosted OCR before settling on their digitization settings in order to make sure they know what they are doing.
This was discussed on DP back in 2003. If you have a DP login see here: http://www.pgdp.net/phpBB2/viewtopic.php?t=5840 And the flaw? I quote from from a post about finereader in that thread: ------------------------------ I asked the price of Linux development kit. It is 9000 Euro, plus some more money to get a licence for a fixed number of page/month (500 euro for 25k pages/month) (Tesseract might be the way to go, but there's still the chronic shortage of programmers to implement new DP features.) Malcolm Malcolm