
Joshua wrote:
Lee Passey wrote:
It appears that the file is latin-1 encoded, despite the fact that the DTD claims that it is utf-8 encoded. This caused Firefox some grief as it tried to utf-8-decode some latin-1 accented vowels.
Ok, I tried to see what grief you are talking about ... all the accented vowels I looked at are appearing correctly. Which ones are you having trouble with? (This is looking at the XML directly in Firefox)
I thought everything in Latin-1 encoding would be the same under a UTF-8 encoding, but evidently I'm mistaken there (which wouldn't be surprising; my knowledge of character encodings is often shaky at best).
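For what it's worth, the two encodings agree only on the ASCII range (bytes 0x00 to 0x7F). Above that, Latin-1 uses one byte per character while UTF-8 uses two or more, so Latin-1 accented vowels are generally not valid UTF-8. A minimal Python sketch, using a made-up sample word rather than anything from the file under discussion:

```python
# "é" (U+00E9) is one byte in Latin-1 but two bytes in UTF-8.
text = "caf\u00e9"  # hypothetical sample word: "café"

latin1_bytes = text.encode("latin-1")  # b'caf\xe9'
utf8_bytes = text.encode("utf-8")      # b'caf\xc3\xa9'

# Pure ASCII text is byte-for-byte identical in both encodings.
ascii_same = "cafe".encode("latin-1") == "cafe".encode("utf-8")

# Strictly decoding the Latin-1 bytes as UTF-8 fails on the accented vowel.
try:
    latin1_bytes.decode("utf-8")
    decoded_ok = True
except UnicodeDecodeError:
    decoded_ok = False

# A lenient decoder substitutes U+FFFD (the replacement character) instead.
lenient = latin1_bytes.decode("utf-8", errors="replace")

print(latin1_bytes, utf8_bytes, ascii_same, decoded_ok, repr(lenient))
```

How a given browser reacts to such bytes depends on its own error handling and encoding detection, which may be why the problem shows up for one person and not another.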
Hmmm, I notice in the PG-TEI documentation (version 0.3, at http://www.gutenberg.org/tei/marcello/0.3/doc/20000-h/20000-h.html#toc_12 ) that the "template" has the following XML declaration:

<?xml version="1.0" encoding="iso-8859-1" ?>

Why isn't it the following?

<?xml version="1.0" encoding="utf-8" ?>

Is this the issue Lee observed, or is this a different issue?

Jon
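One way to tell the two cases apart is to check whether a file's bytes actually decode under the encoding its XML declaration names. The sketch below is only an illustration: the helper names `declared_encoding` and `matches_declaration` and the sample documents are hypothetical, and the regex is a simplification rather than a full XML parser.

```python
import re

def declared_encoding(data: bytes, default: str = "utf-8") -> str:
    """Return the encoding named in the XML declaration, or the default."""
    match = re.match(
        rb'<\?xml[^>]*?encoding\s*=\s*["\']([A-Za-z0-9._-]+)["\']', data
    )
    return match.group(1).decode("ascii").lower() if match else default

def matches_declaration(data: bytes) -> bool:
    """True if the bytes decode cleanly under the declared encoding."""
    try:
        data.decode(declared_encoding(data))
        return True
    except (UnicodeDecodeError, LookupError):
        return False

# Hypothetical sample: the same Latin-1 body under two different declarations.
body = "<p>caf\u00e9</p>".encode("latin-1")
honest = b'<?xml version="1.0" encoding="iso-8859-1" ?>' + body
mislabeled = b'<?xml version="1.0" encoding="utf-8" ?>' + body

print(matches_declaration(honest))      # Latin-1 bytes, Latin-1 declaration
print(matches_declaration(mislabeled))  # Latin-1 bytes, UTF-8 declaration
```

Note that the check is only informative in one direction: every byte sequence decodes under iso-8859-1, so a file that passes with that declaration may still be mislabeled UTF-8.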