
Hi James, The question is what is on a page. What DP has done is add information which is not actually of the page of the book itself. Much of what they mark up is inferred from information not on the page. As an example I give you a chapter title. Many chapter headers do not say they are chapter titles. A human recognizes the chapter title/beginning by how it is represented on the page. This knowledge is in the human, not on the page. If you ever had the pleasure to read an original latin text, you would understand the importance of the knowledge a human requires in order to identify such structures as a sentence. the roman had no problem with this all so simply task. Yet, it takes years of experience of the students and experts of latin to master the written roman language. If you are a linguist and would love to discuss your opinion on if language constitutes meaning! To come back to formatting. I can live with fact that you and others consider our approach as a dumbing downing. But, for processing physical books through OCR to creating an actual ebook you do not need any semantic mark up at all from a computational view. You would be surprised what you can with language and a computer. It would make a linguistic proper toes curl. But the result is something he can not imagine to be possible or even figure out. regards Keith. Am 06.02.2012 um 23:31 schrieb James Adcock:
Don>Except that's DPs current markup. That's specifically what we need to remedy, including introducing syntactic markup.
I certainly don’t understand this statement. The variety of things one can find in the original book which are differentiated continue to need to be differentiated. Otherwise you are dumbing down the formatting, and throwing away information which was in the original book. You might want to rename the DP tags, but not sure how that is supposed to make a contribution?
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/mailman/listinfo/gutvol-d