
On 11/11/2011 12:42 AM, Alex Buie wrote:
On Wed, Nov 9, 2011 at 12:28 AM, Lee Passey<lee@novomail.net> wrote:
In the IA output, I'm discovering that that data has been preserved. I think with some effort, it would be possible to use this data to build a web interface substantially identical to the proofreading interface provided by FineReader.
So Alex, all that talk a while back about how I wanted a "leaner, meaner" file? Forget about it. I think I like it just the way it is. I can select out what I need, and it has some potential.
Here's something someone at the archive is working on (after hours, since it's not an official project yet). He'd love to hear your thoughts.
I'm working on a similar project but I've opted for a desktop application. I think that every proofing application should do 99% of the work automatically and pass only the 1% of dubious cases to the human proofer. My app needs lots of processing power to give aids to the proofer and so a desktop app is the more natural choice. Collaborative proofing will be achieved either by passing only actual edits to a central server or by XML replication. I wonder if putting a white-on-transparent OCR overlay on top of the scanned layer lets you find errors just by looking at the black spots. -- Marcello Perathoner webmaster@gutenberg.org