
just as one example, there's this:
{ [Illustration: } { The } { Bakery } { and } { Shops } { of } { the } { Russians--Later } { the | property. }
that's a line of text from one of the two comparison files, which got chopped into its individual words. i could only imagine the mess if a whole paragraph or section was added in the second comparison file.
Absolutely. This is what I have been telling you. If you make a tool that works good for small changes, it doesn't work good for big changes. You said you wanted a tool for small changes, I provided you with the output format that you wanted for small changes, and having provided you with that, now you say you want an output format for big changes. I can try to guess what format you want, and you will always come back and complain. OR you could take my source code, which I posted, and you could modify it to any output format you want. Since I didn't want to deal with this kind of mess when using the output with a regex editor, I generated an output format that doesn't contain this kind of mess. But when I published that format you didn't like that format either. Again, post YOUR source code, and working code examples, and then let's talk about it. You will find that you your tool whatever it is you eventually post someday will also have to deal with these problems, because they are fundamental to the INPUT domain -- how two different files match or do not match and why and what you want to do about it.