
does every conversation here _have_ to be tedious? *** ok, let me 'splain you... first, if you're gonna wait for users to report errors, instead of proactively seeking them and fixing them, you might as well _give_ your lunch to someone else and then trot off to your mat so you can take a nap. if the tech world has taught us anything at all lately, it's that you'd better improve your product offerings _yourself_, before any of your competitors do it first. project gutenberg e-texts have one powerful benefit -- they are cleaner than many other texts out there. (but don't look back, because google's coming fast.) the e-texts also have 2 significant related liabilities -- they have no provenance, so the accuracy isn't verifiable. this means a competitor can knock you out quite easily, and even use your own text to do it. they simply take it, hook it up to a scan-set, and do a proof via comparison. voila. they have _cleaner_ text. which _has_ provenance. then they point to your absence of any provenance, and draw attention to your errors -- which they've fixed! -- and boom, they've done a number on your reputation... do that a couple dozen times, with high-profile classics, in a way that goes viral, and the blow would be crushing. crushing. that's if they beat you to the punch. they eat your lunch. there's your top-down seminar... and now for the bottom-up part. if you're going to build a system capable of pulling up an arbitrary page from an arbitrary book in response to an error-report regarding some of the text on that page, then you might as well blossom it into a system that will serve to correct all the text on every page of every book. you dig? -bowerbird