
Hi David, Am 25.09.2012 um 00:37 schrieb David Starner <prosfilaes@gmail.com>:
On Mon, Sep 24, 2012 at 8:02 AM, Mark Swofford <mark@romanization.com> wrote:
Yes. Seriously.
In the vast majority of PG works, probably between 98 and 100 percent of quotation marks and apostrophes could be reliably corrected without the need for human eyeballing.
The problem is, if they can be automatically corrected, then they aren't carrying an information load. It's exactly those case where we have nested quotes and "'an't 'ee, ya' 'ee?" that they carry information load, and that's exactly where they'll get it wrong, and that's exactly where readers will slam into a brick wall and have to stop and wonder whether the curved quotes or their reading is wrong. Oh, fiddle sticks. Got something harder to solve and get done properly. Naturally, reg-ex is the wrong way to go! Anyone, who has just started parsing can handle this easily.
regards Keith