
I actually agree with the complaints if you don't mind my input (no flaming please). I try to do some sort of correction for my ebook reader, but its very primitive (and breakable) if the first alphabetic character in the new line is uppercase, keep the line, otherwise join them. First i tried if the last character of the previous line before a alphanumeric is a punctuation, keep the line, otherwise join it, but hey, more false positives. The one i uses at least corrects normal errors (Noun names non-withstanding) while keeping things like Chapter headings mostly intact (except lowercase off course). They can't be both applied i think. If some has a better algorithm, please share hey? This is one of the reasons i prefer html formats. A space is a space is not dozens of spaces and \n is nothing at all and <p> is king.