
Hi there,

On 15.09.2009 at 11:20, Marcello Perathoner wrote:
Keith J. Schultz wrote:
We need a format that is not based on an existing format, ...
Why not?

Very simple. Most existing formats have a particular output in mind, and they are far too complex. The idea is to mark up the book text in a way that lets us extract its structure and features. The output is then generated from that markup, depending on the output format wanted.
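To illustrate the idea, here is a rough sketch in Python. It is not a format anyone in this thread has proposed: the `Book` and `Chapter` classes and the two renderer functions are hypothetical names, and a real format would have to capture far more features than headings and paragraphs. The point is only that the text is stored once as structure, and each output is derived from it.

```python
from dataclasses import dataclass, field
from html import escape
from typing import List


@dataclass
class Chapter:
    """One structural unit of the book (hypothetical, for illustration)."""
    heading: str
    paragraphs: List[str] = field(default_factory=list)


@dataclass
class Book:
    """Output-neutral representation: structure only, no layout decisions."""
    title: str
    chapters: List[Chapter] = field(default_factory=list)


def to_plain_text(book: Book) -> str:
    """Render the structure as plain text, with blank lines between blocks."""
    lines = [book.title.upper(), ""]
    for chapter in book.chapters:
        lines += [chapter.heading, ""]
        for paragraph in chapter.paragraphs:
            lines += [paragraph, ""]
    return "\n".join(lines).rstrip() + "\n"


def to_html(book: Book) -> str:
    """Render the same structure as an HTML fragment, escaping the text."""
    parts = [f"<h1>{escape(book.title)}</h1>"]
    for chapter in book.chapters:
        parts.append(f"<h2>{escape(chapter.heading)}</h2>")
        parts.extend(f"<p>{escape(p)}</p>" for p in chapter.paragraphs)
    return "\n".join(parts)


book = Book(
    "A Sample Book",
    [Chapter("Chapter 1", ["It was a dark & stormy night."])],
)
print(to_plain_text(book))
print(to_html(book))
```

Adding another output format here means writing one more function over the same `Book` object; the stored text itself does not change.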
... but we want a representation that contains as much information as possible. It should only take about a month to create such a format.
ROTFL
I said to create such a format. I did not say to create the tools for generating the output formats, which is the actual crux, if you have been following this thread. You also need tools for getting the scans into this format, and that should be done mostly by computer in order to save time.

Regards,
Keith