11 Nov 2004, 3:45 p.m.
----- Original Message -----
From: Jon Noring <jon@noring.name>
Date: Thursday, November 11, 2004 10:28 am
Subject: Re: [PGCanada] James website and more news
3) Scan these texts, collect the metadata/catalog-info, and place the page scans online. (Optionally, OCR can be done on these scans, and the raw, uncorrected OCR text can be used to enable a "temporary" full-text-search capability of the collection of page scans.)
This last -- use of the raw, uncorrected OCR output -- is what drives projects like canadiana.org, ourroots.ca, newspaperarchive.com, and the cold north wind/paperofrecord.com family of products.
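
To make the "temporary" full-text search idea concrete, here is a rough sketch of how raw, uncorrected OCR text could be indexed so that a query points back to page scans. It is not how any of the projects named above actually work; the file names, sample OCR text, and function names are all made up for illustration.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens.

    OCR errors are left untouched: a misread word is indexed as misread.
    """
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(ocr_pages):
    """Map each token to the set of page-scan ids whose raw OCR text contains it."""
    index = defaultdict(set)
    for page_id, raw_text in ocr_pages.items():
        for token in tokenize(raw_text):
            index[token].add(page_id)
    return index


def search(index, query):
    """Return the sorted page-scan ids that contain every token in the query."""
    tokens = tokenize(query)
    if not tokens:
        return []
    hits = set(index.get(tokens[0], set()))
    for token in tokens[1:]:
        hits &= index.get(token, set())
    return sorted(hits)


# Hypothetical raw OCR output, keyed by a hypothetical page-scan file name.
# "Tlie" stands in for a typical OCR misreading of "The".
ocr_pages = {
    "scan_0001.png": "Tlie history of Upper Canada, printed at Toronto",
    "scan_0002.png": "A history of the fur trade in Canada",
}
index = build_index(ocr_pages)
```

With this sample data, a search for "history canada" finds both scans, while a search for "the history" finds only scan_0002.png, because the misread "Tlie" on the first page never matches "the". That is the trade-off of searching uncorrected OCR: the page scans stay authoritative, and the search is only as good as the OCR.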