
On Wed, Feb 03, 2010 at 08:01:40AM +0100, Karl Eichwalder wrote:
Greg Newby <gbnewby@pglaf.org> writes:
On Tue, Feb 02, 2010 at 05:33:01PM -0800, Jim Adcock wrote:
It doesn't. In fact, "extracting" works from DP earlier was a big push I made a couple of years ago. At that time, such two stage (or other great-than-one stage) output was something that didn't fit well with the workflow. Maybe that's something that could be revisited.
I'm all for it. In the DP forum, I proposed this several times.
It's important to not double the effort involved at the final posting phase (whitewashing) through such a two stage process. But there are several good ways of insuring this, which could be incorporated with the process.
Could we give this a try with manually selected books first? How can we make sure that we do not waste the whitewashers' time?
Definitely. On a trial basis, the extra (or different) workload isn't such a big concern...we don't need to streamline while we're trying to experiment.
From the ww'er side, all you really need is a note with the upload that mentions "HTML will be forthcoming later," and then reference the .txt eBook # when the HTML is finally uploaded.
From the DP side, it seems that all this takes is an early extraction of formatted, proofread text, prior to going to HTML.
I'm sure it's somewhat more complicated than that, due to various cascading effects and perhaps some hard-coded policy on workflow, but I hope we all could accommodate some minor upheaval in the interest of exploration. -- Greg