
Hi Robert, As far as a markup is concerned I would suggest using TeX or XeTeX. For one you can encode all the information we want as you want. Such as \entry, \pronunciation, \meaning, \synonym, etc, you name it. Then either write comands for formating or a TeX script to produce the desired output, or use any other language to process the data. Another way to go is use XML to encode the data and take it from there. Eitherway you have full control of the input data and output. regards Keith Am 12.02.2010 um 20:45 schrieb Robert Cicconetti:
On Fri, Feb 12, 2010 at 2:22 PM, David Starner <prosfilaes@gmail.com> wrote:
Dictionary. We've had scans of the OED for years; no one has been willing to attack it. We can probably come up with a dozen usable
Not exactly true. I have a clearance on it, and have a fascicle prepped and at DP. The holdup is that I have yet to come up with a good markup for proofing that can be machine transformed into various dictionary formats. Straight TEI is too big, and likely to lead to inconsistencies. I refuse to start something this big without a decent plan for the final output.
Granted, once started, it will probably take decades to work through DP...