
On 01/24/2012 11:08 PM, Joshua Hutchinson wrote:
So, if someone were to start "refactoring" old PG texts into TEI or RST and working with a WWer to repost them ... is this a workable idea?
More than a technical challenge it would be a political one. I can convert a novel the size of Pride and Prejudice into RST in about an hour. More if there is formatting or images to recover. But I'd prefer to avoid the riot that will ensue if we start to reformat DP texts. We could start redoing the top 100 list excluding everything that is too hard and everything made by DP.
Maybe we start this process on a semi-private mirror of the PG corpus and only when it reaches a critical mass of some sort it gets moved over. But an official notice that this project has some backing is necessary or we'll just keep seeing everything running around in ten different directions and nothing ever getting done.
A semi-official branch would be a good occasion to ditch the old WWer workflow in favor of a source repository (git or mercurial) that holds all the masters. Should we reserve a range of ebook nos. or shadow the existing ones? -- Marcello Perathoner webmaster@gutenberg.org