jim said:
> In general, if derived formats including ePUB and MOBI from HTML,
> also HTML from txt, also unwrapping txt from wrapped txt,
> are to work “correctly” then there needs to be *some* degree of
> expectation on the formatting of the incoming texts. Otherwise
> these tasks cannot be successfully automated.
that's true. but i'm not talking just about "derivative formats",
because there's no need to create a "derivative" if you'd rather
just use the .txt file itself to drive the display, a la "eucalyptus".
however, the .txt file does have to be formatted "correctly" if it is
to be _displayed_ correctly. that's what's driving my motivation...
> Going the other way, the automated wrapping of txt has
> built-in support by most (all?) modern text tools, including
> web browsers, e-book readers, text editors, etc.
stop trying to derail the thread, jim.
there's no way that project gutenberg is going to mount files
that don't have mid-paragraph hard linebreaks... _no_way_...
so that's not what we're talking about here.
and we aren't _going_ to talk about that here,
no matter how many times you try to bring it up.
so stop trying.
what we _are_ talking about now is formatting the .txt files
_correctly_, so that they can be unwrapped automatically...
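
to make "unwrapped automatically" concrete, here is a minimal sketch in python. it assumes one formatting rule that is not spelled out in this thread: a blank line, and only a blank line, marks a paragraph boundary. the function name unwrap is just an illustration, not anybody's actual tool.

```python
def unwrap(text: str) -> str:
    """Join hard-wrapped lines within each paragraph into one long line.

    Assumption (not from the thread): paragraphs are separated by one or
    more blank lines, and every other linebreak is a mid-paragraph wrap.
    """
    paragraphs = []
    current = []
    for line in text.splitlines():
        if line.strip():
            # non-blank line: part of the current paragraph
            current.append(line.strip())
        elif current:
            # blank line: close the paragraph collected so far
            paragraphs.append(" ".join(current))
            current = []
    if current:
        # flush the last paragraph if the text does not end with a blank line
        paragraphs.append(" ".join(current))
    return "\n\n".join(paragraphs)


wrapped = (
    "call me ishmael. some years\n"
    "ago, never mind how long.\n"
    "\n"
    "there now is your\n"
    "insular city.\n"
)
print(unwrap(wrapped))
```

a sketch like this would likely mangle anything that doesn't follow the rule (verse, tables, indented blocks), since their linebreaks would get joined like any other. that is the point: the unwrapping can only be automated if the incoming .txt file is formatted predictably.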
-bowerbird