
Andrew Sly wrote:
If you want some history, basically you can blame microsoft. They developed their own character sets for use with Windows, which were _close_ to already-established standards, but not quite identical.
No, you cannot blame Microsoft. This is one of the few cases were they did right: They registered their character sets with IANA, and this makes them as standard as any other character set, ISO or UNICODE or whatever. The blame lies with the whitewasher who mislabeled the file as ISO-8859-1 when it really is WINDOWS-1252. Whatever. I fixed this by overriding the PG header in the database. Somebody should check all books by http://www.ebooksgratuits.com or all books with RTF files and see if they are correctly labelled. -- Marcello Perathoner webmaster@gutenberg.org