
21 Apr
2010
21 Apr
'10
12:27 a.m.
>I'm currently working through the very first Britannica project ever - The Project Gutenberg Encyclopedia, Volume 1 of 28. It's etext # 200, dated "1995-01-01". It's in sad shape. Text only, many errors apparent to the casual eye. I'd like to reprocess it. I can't tell you how to get the scans but I have tools that will help you recover the original lines breaks and match the PG text against a new OCR, helping identify errors in both the OCR and the existant PG text. Let me know if you find the scans. Yes I have tried this on a couple texts already.