
----- Original Message ----- From: "Steve Thomas" <stephen.thomas@adelaide.edu.au> To: "Project Gutenberg Volunteer Discussion" <gutvol-d@lists.pglaf.org> Sent: Friday, October 22, 2004 5:47 AM Subject: [gutenberg] Re: [gutvol-d] Re: barriers to XML posting
A question (possibly better put over on the DP list):
Is it possible to OCR a scan directly to XML? Or is the output from OCR always going to be text? Abbyy Finereader 7.0 has the capability of saving each page of OCR as Microsoft Word XML format. I have not experimented with it, and am not even knowlegable about XML yet, but if at some point PGDP wanted to use XML as a source format, it could be done, if the project manager has this software to work with. Abbyy 7 can also output its OCR as HTML, Excel spreadsheet, and many other formats.
Ronald Holder PGDP volunteer