
Continuing my evaluation of Marcello's PGTEI setup on the Gutenberg website (http://www.gutenberg.org/tei/)... I used the same Declaration of Independence file that I used last week to comment on the XML markup itself. This time I converted that XML file to HTML and TEXT using the online services section. Below are the items that *I* believe need some improvement.

If anyone wants to duplicate my conversions, see my post from last week that contained the XML I used (or send me a quick e-mail and I'll forward the file).

Josh

***

HTML conversion items:

1 - The first thing that jumps out is the need for bigger left and right margins. This is a simple CSS change. Currently, DP has *mostly* standardized on 10% margins on the left and right. This gives some nice white space for easier reading and leaves room for things like original source page numbers and sidenotes in the margin area.

2 - If the author field is left blank, the conversion shouldn't put a "by" out there all by itself. Both the HTML and the TEXT versions have this dangling word.

3 - The publication and edition dates are both printed, but it isn't clear which is which. Maybe put an "Original publication date:" label before the date itself?

4 - Since the title, author, etc. are already listed in the first few lines, the second listing below the Gutenberg disclaimer line is redundant. Also, in that same spot, the language code is printed, which is nice, but I would suggest changing the format slightly: put the language code in brackets after the written-out language, e.g.:

    English-United States [en-us]

For most of us normal humans, the language codes are not intuitive.

5 - In the CONTENTS section, if there are no footnotes/endnotes, don't list a NOTES section.

6 - Use standard HTML paragraph spacing. Right now, the CSS specifies no blank line between paragraphs and an indent at the beginning of each paragraph. While this matches the original paper source, for me at least, it is jarring to read on a computer screen. This type of formatting would make perfect sense in the PDF conversion, since that one is geared toward printing on paper.

7 - A horizontal rule is needed (75% width seems right to me) between the CONTENTS section and the first section of the text. Right now, they run together.

8 - A horizontal rule is needed between major divisions of the text. Currently, the large type header gives a visual indication, but I don't believe it is enough.

9 - There is no need for the extra horizontal rule that marks off the FOOTNOTES section if the text has no footnote section. Currently, this produces two horizontal rules in a row in a text with no footnotes.

***

TEXT conversion items:

1 - It lists "The Project Gutenberg EBook of" twice.

2 - It has a dangling "by" line even when no author is specified.

3 - It has the same redundant title/author info as the HTML conversion.

4 - The Notes section appears whether there are any footnotes or not.
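
***

To make HTML items 2, 4, and 5 (and TEXT items 2 and 4) a bit more concrete, here is a rough Python sketch of the kind of checks I mean. It is only an illustration and is not taken from the PGTEI conversion code: the function names and the language lookup table are hypothetical, and the real conversion may be structured quite differently.

```python
# Illustrative only: these helper names are hypothetical and are not
# part of the PGTEI conversion.

# Assumed lookup table from language code to a written-out name.
LANGUAGE_NAMES = {"en-us": "English-United States"}


def byline(author):
    """Return a 'by ...' line, or None when the author field is blank."""
    author = (author or "").strip()
    return f"by {author}" if author else None


def language_label(code):
    """Format a language as 'Name [code]', falling back to the bare code."""
    name = LANGUAGE_NAMES.get(code.lower())
    return f"{name} [{code}]" if name else f"[{code}]"


def contents_entries(sections, notes):
    """List the CONTENTS entries, adding NOTES only when notes exist."""
    entries = list(sections)
    if notes:
        entries.append("NOTES")
    return entries
```

With something like this, a blank author yields no "by" line at all, "en-us" comes out as "English-United States [en-us]", and a text without footnotes gets no NOTES entry in its CONTENTS.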