
On Tue, Jul 26, 2005 at 11:13:27PM -0700, Andrew Sly wrote:
Any PG volunteers here with experience representing characters in old Greek using unicode? Or perhaps someone at DP?
I worked on a book at PGDP-EU a few weeks ago with a lot of such things. Title Histoire des Grecs (Tome 1 sur 3) Author Victor Duruy http://dp.rastko.net/tools/proofers/proof.php?project=projectID41e6e8e1a3a9c&proofstate=avail_2 I dit not want to learn how to "type" Greek in a standard way so I developed my own coding scheme in ASCII/latin1 and a Perl script to make the transformation then. Example: I typed "A" for "CAPITAL ALPHA" which got transformed into the right Unicode character. Most of the time the OCR got it right (when the letters looked like latin letters). I tagged Greek between {{...}} for my script to recognize where it started and ended. When there were "accents" (tones) I used a LaTeX notation (à la \'A) or latin1 characters if easier to read (Á). I checked everything after transformation to HTML in a web browser.
I have a book I'm preparing which has about 12 places where a couple Greek words are quoted, and a few letters I'm uncertain of.
For this little I probably can (try to) do it for you. I guess you know your document will have to be coded in some Unicode format in the end (for example: utf-8). Caveat: I never studied ancient Greek, I just "recognize" shapes. Greek friends of mine can (try to) validate the final product if you like. I guess we can continue this conversation in private mail, unless other people on the list are interested.