
----- Original Message ----- From: Scott Lawton <scott_bulkmail@productarchitect.com>
My feedback on PGTEI is too long for email, so I posted it here: http://Classicosm.com/xml/feedbackonpgtei.html
Feedback welcome!!!
quote is used in an example but apparently isn't part of TEI Lite (it's not in link_outAppendix A). What's the story? It is part of the full TEI spec. Thanks for pointing it out. I meant to have it in my test.xml, but I forgot. The test.xml should have <quote rend="display"> for blockquotes (and will on the next update.) TEI-Lite is the starting point, but we will probably pull in other stuff from the full spec where we need it. ** q: in cases where the quotation marks don't balance, it may be difficult to automatically convert quotation marks to the appropriate q.../q form, and time consuming to manually proof. Accordingly, I suggest this step be left as optional. I actually agree here. I prefer using " instead of <q>. Can any of the experts explain why this is a "bad idea"? ** pgHeader looks like it's contains information that should be described in teiHeader (though I'm new to TEI so may be wrong). alice.tei and lmiss.tei both contain pgHeader; the generated PGTEI does not. Assuming I understand this part right ... The teiHeader contains all the information. pgHeader is the call out to the part that takes the info in teiHeader and formats into a standard display header when you convert to HTML or TEXT. Marcello is probably the guy to explain it more fully. ** Having separate index tags for TOC, PDF and PDB strikes me as unnecessary and prone to error. Shouldn't the TOC one suffice for all? In fact, the tag itself seems redundant. Shouldn't the head itself suffice? (If TEI requires it, that's another example of where I think TEI is too complex.) Well, the reason they are separate is for the occasion where you have a header, but you don't want that header to appear in the Tabel of Contents. HTML requires an anchor and <h1> markup both ... this is the TEI equivalent. As for the multiple index entries, I wondered about the need myself, but I haven't gotten around to asking Marcello about it (or digging through documentation to try to understand the need). ** alice.tei: reg="Carroll, Lewis" should use the complete "authority" form, which I believe is "Carroll, Lewis, 1832-1898". Note that unlike the PG website, there are no parens around the dates. Here's an illustration of paren usage: "Baum, L. Frank (Lyman Frank), 1856-1919". I'm hoping consistency in format will be achieved when we have 1) some examples in place and 2) a web form for generating the, admittedly confusing, teiHeader section. ** There appear to be two validation errors, e.g. in the link_outPGTEI documentation: Error (7/117): <SPAN> must not contain block level elements like <H1>. Error (379/1): The start tag for </P> can't be found. Marcello knows about these and they will be fixed. ** In the documentation, why is "Versprich mir, Heinrich" repeated in the output, the second time in white? This one confused me for a minute, too... Then I realized, it is the only way a HTML browser will be able to space over the right amount. In effect, Marcello is trying to make the text invisible. There may be a better way to hide the spacing text, but I haven't given it much thought yet. It works now, if not in an "elegant solution" manner. ** The lack of space between paragraphs goes against Web conventions. (It's fine as an option but a poor choice for the default.) Agreed. I promise it will be changed. ** Thanks again for your analysis! Josh