I've previously completed a reasonable number of proof-readings of book content and now I've begun preparing the content of a large annual of weekly magazines - I have already received my copyright clearance.
My annual was published in 1905-6 and is in fair condition, but not suitable for flat-bed scanning; the pages are somewhat yellowed. I'm using a LG LSM-100 Scanner Mouse and I've completed initial scanning of the first issue of this magazine. I'm creating separate PNG files for each complete page and of each distinct illustration; these need further work. Based on my previous proof-reading, I'm working on correcting the initial OCR text in plain text files. I'm using the folder structure as follows...
Annual
pngs (of annual only)
text (of annual only)
Issue 1
pngs (of Issue 1 only)
text (of Issue 1 only)
(etc)
Each weekly magazine comprises fictional and non-fictional material.
Initially, I plan to treat each issue of
about 16 pages as a project and the annual as a separate project. I'm
also thinking that where articles are episodic, I feel it is worth
considering side projects in which the episodes for a given article are
assembled as a unit. The annual has a good table of contents using page
numbers; in the ebook format, the page numbers might usefully be
converted into issue number and article-in-issue number (like 1395.1).
In the first issue, I've seen only one page reference to an issue
pre-dating this annual.
In the Project Gutenberg catalog, I see there are quite a number of
other magazines, so I can use them as a guide though I realize there are experienced final-formatters out there. I would welcome pointers to relevant information to help me proceed and, if possible, contact with someone who has worked on this kind of material.
How do I go about creating the project structure for others to help with proof-reading, etc, in due course?
Where can I find advice on cleaning up illustrations?
Thanks for your patience,
Roger