
Phil Hitchcock wrote:
I am currently preparing e-text versions of "W H Sleeman's, Rambles and Recollections of an Indian Official". The book describes life and customs in India in the 1830's.
Many of the place names, personal names, and various other words have a dash - placed over an a, e, i, or u, to indicate a long vowel.
The 'dash' over the vowel is known as a macron. A lower-case a with macron can be coded in Unicode with U+0101. As you say, you could code this in HTML as ā -- or you could specify UTF-8 encoding for your document and just paste the character in. (E.g. in Windows, go to "Start/Programs/Accessories/System tools and choose Character Map, and copy-paste the appropriate characters from there.) If you use UTF8 encoding, you can create your document as text, html, RTF, PDF, ... whatever, and you'll have the correct character in all cases. Note that the macron is not the same as a circumflex, so if you're aiming for accurate repro of the original edition, you'll want to use the macron. -- Stephen Thomas, Senior Systems Analyst, Adelaide University Library ADELAIDE UNIVERSITY SA 5005 AUSTRALIA Tel: +61 8 8303 5190 Fax: +61 8 8303 4369 Email: stephen.thomas@adelaide.edu.au URL: http://staff.library.adelaide.edu.au/~sthomas/