
don kretz <dakretz@gmail.com> writes:
And though each project's final phase involves a great deal of manual work resulting in a polished text that is the basis for both the versions released to PG, it's interesting to note that there is no facility provided for actually preserving this foundation text version. Oversimplifying a bit, the text version removes a bunch of information, and the html version adds a bunch of stuff. Arguably the most valuable text (for content and metadata) is, at best, on someone's PC somewhere.
Probably, but not necessarily. I produced an XML master file (TEI) where e. g., I kept corrections as follows (German example, Handbuch der deutschen Kunstdenkmäler, Bd.1, Mitteldeutschland, 1914 by Georg Dehio, http://www.gutenberg.org/etext/19460): Die <corr sic="Mormarstatuen">Marmorstatuen</corr> und Vasen des Gartens (einst 150) in Rom und Venedig bestellt, Or dubious words or phrases with simple sic! statements: <sic>wagerechte</sic> I even kept page-breaks inbetween words and references to the original scans (Jon Noring once recommended something along these lines for every hyphenated word!): Besonders <reg orig='merk-|würdig'>merkwürdig</reg>'> <pb n='105' id='i114.png'/> <!-- [P: 105] --> ein bogenförmiger rom. <hi rend="g">Altaraufsatz</hi> aus Stuck, ====================================================================== You wont notice all this in the offered TXT and HTML, but if wanted you can customize the XSL style-sheet and produce special purpose output files in HTML or PDF. My output files are by no means perfect, but good enough for demonstration. -- Karl Eichwalder