
On 7/6/06, Andrew Sly <sly@victoria.tc.ca> wrote:
However, getting copyright clearance and reformatting these is perhaps not as "glamorous" as Distributed Proofreading, so it does not attract as many people. :)
It's not just that it's unglamorous, it's hard. You have to go through all the work of finding a specific edition that may not be well identified in the ebook. You have to dump the text in a way that doesn't lose all the formatting information, which may range from easy to hard but will certainly require custom code and massaging. You have to work with a text that is unlikely to match the quality of what DP can produce after five rounds, and could turn out to be pretty bad. And it requires some tedious comparison. A lot of the time, I'd actually rather rescan, reprocess, and compare than try to reformat existing material. If we can't get the information needed for clearance from the source, or at least handle the texts as a group, they're pretty hard to do.