
I wanted to do this for Dutch in the pre-1947 orthography, but it is a tremendous amount of work, so I have not yet completed it. I've collected about 100 megabytes of text in this orthography and made an initial word list out of it, discarding anything that appears fewer than five times. Next I have to match this against a modern word list to fill in the gaps, then go through the entire list again to add all regular (grammatical) variants of each word and to filter out unwanted words, such as common misspellings and rare words that coincide with common scannos or typos. I think I would like to set up a distributed word-list creator website to share out this work. Getting from a complete word list to a spelling checker is the easier part, especially with OpenOffice. After that, I have to repeat the exercise for the pre-1865 Dutch orthography.

Jeroen.

Darren Burnhill wrote:
Hi,
Has anyone ever developed dictionaries to suit the checking of texts from different ages?