
james said:
Let's go with these as is for now
sure, no problem.

so, now you have to decide how to proceed. i see two routes.

1a. fix the names, using the "scrabble" script, and then do a global change to fix the incorrect ones, or
1b. fix the names using some other methodology, and then do the global change to fix the bad ones, or
2. go immediately to the following step, which is the page-by-page edit, where each instance of a bad name will be flagged individually and corrected individually...

i've been thinking in terms of route 1, mostly route 1a -- as my focus is a methodology at the book-level -- and there are many reasons to think that _is_ the best. after all, why not do a global change? on the face of it, it would seem to be much more efficient, without doubt.

but as i've looked at this, i'm not so sure of it any more.

either way, you'll see each name at least once, to verify it. but once you've done that for a name which is _correct_, in the page-by-page process, it will be in the dictionary, so that'll be the end of it -- it will not get flagged again. so you end up dealing with a correct name only the once.

an incorrect name is also fixed the first time you see it. and thus, if it only occurred _once_, which is typical, it'll take the same amount of time, whichever route you use.

so the only words which will end up taking extra time will be (a) incorrect ones (b) which occur multiple times. correct words and singleton errors take the same time, no matter which route you go -- you look at 'em once...

perhaps i need to see how many names occur only once. i'll go write that routine right after i send this.

***

the other thing that i noticed is that many of the names occur on those family-tree pages, and thus if you check those pages, and all the names occurring on those pages, it might go faster than checking each name individually.

considering that you did most of those pages manually, i can even mark your existing diacritics on those pages as "correct", which will give you a fairly big head start... that's another routine i'll go and write right after this...

***

finally... it also might be the case that the whole "scrabble" step takes you extra time all by itself, as a separate process. in which case you'd want to opt for the page-by-page... _or_ it could be the case that it _saves_ you time, since you have an attentional focus on names with diacritics. i have found that attentional focus is a big time-saver. so... it could go either way... i haven't done the research to know which holds true...

basically, it boils down to how you want to do it. :+)

what do you think?

-bowerbird
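
for illustration only, here is a minimal sketch of what the "how many names occur only once" routine could look like. it is not the actual routine: the function names, the flat list of name occurrences, and the set of known-incorrect names are all assumptions made up for this example. it counts each name, separates singletons from repeats, and then estimates the extra individual fixes the page-by-page route would need compared with a global change, on the reasoning above that only incorrect names occurring more than once cost extra.

```python
from collections import Counter


def summarize_name_counts(occurrences):
    """Count each name and split singletons from names that repeat.

    `occurrences` is a hypothetical flat list with one entry per
    appearance of a name in the book.
    """
    counts = Counter(occurrences)
    singletons = sorted(name for name, n in counts.items() if n == 1)
    repeats = {name: n for name, n in counts.items() if n > 1}
    return singletons, repeats


def extra_fixes_page_by_page(repeats, incorrect_names):
    """Estimate the extra corrections the page-by-page route requires.

    A repeated incorrect name is fixed once under a global change but
    once per occurrence page-by-page, so it costs (count - 1) extra.
    Correct names and singletons cost nothing extra.
    """
    return sum(n - 1 for name, n in repeats.items() if name in incorrect_names)


# Made-up sample data, purely to show the calls.
occurrences = ["Dvořák", "Dvorak", "Dvorak", "Dvorak", "Janáček", "Čapek", "Čapek"]
incorrect_names = {"Dvorak"}  # hypothetical: spellings judged wrong on review

singletons, repeats = summarize_name_counts(occurrences)
extra = extra_fixes_page_by_page(repeats, incorrect_names)

print("names occurring once:", singletons)
print("names occurring more than once:", repeats)
print("extra page-by-page fixes:", extra)
```

if the extra-fixes number comes out small relative to the total number of names, the two routes should take roughly the same time, and the choice comes down to preference.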