
22 Oct
2004
22 Oct
'04
10:47 a.m.
A question (possibly better put over on the DP list): Is it possible to OCR a scan directly to XML? Or is the output from OCR always going to be text? If the first, then we need two processes -- one to deal with new scans (OCR to XML), one to deal with existing plain texts (to convert them to XML). But if the output of OCR is still going to be plain text, then we can use the same process to convert both existing and new books to XML. Steve -- Stephen Thomas, Senior Systems Analyst, Adelaide University Library ADELAIDE UNIVERSITY SA 5005 AUSTRALIA Tel: +61 8 8303 5190 Fax: +61 8 8303 4369 Email: stephen.thomas@adelaide.edu.au URL: http://staff.library.adelaide.edu.au/~sthomas/