Seriously, I rhink you'rre on the right track.

Interesting to note:

Your list has significant commonality with the standard DP stylesheet.

Many of the elements you enumerate are notionally and/or notationally identifed during the DP process, but are overtly discarded on the way from OCR to PG. The conscious intention, I suppose, is to give PG what it looks like, not what it is.

Esstentially what we start with is a single big long string of characters produced by OCR software. Even at that point many of your artifacts are identified or inferrable. Along the way we discard and add artifact knowledge strictly to accomplish narrowly defined local tasks. Finally, we get rid of almost everything except what makes it look nice.

We don't 3ven have explicitly identified chapter titles.

I've often seen "class="poetry" for stuff that had nothing to do with poetry.


On Wed, Jan 25, 2012 at 2:11 PM, Jim Adcock <jimad@msn.com> wrote:
Don> Or the mail server is reflecting back to you the collective resistance
of the recipients to overzealous pedantry.

Seriously: if anyone wanted to improve any of the current situation, how
would anyone do so?



_______________________________________________
gutvol-d mailing list
gutvol-d@lists.pglaf.org
http://lists.pglaf.org/mailman/listinfo/gutvol-d