
"Metadata" is a loaded word in this context. In particular, DP has it's own legacy set of definitions. It's worth taking the time to explain what you think the metadata is in pretty complete and precise terms, lest you be misunderstood and differing assumptions be made particularly about the difficulty of collecting it. On Wed, Feb 8, 2012 at 1:58 PM, Keith J. Schultz <schultzk@uni-trier.de>wrote:
Hi Lee,
Well, I would go further and say that the metadata should be part of the master format.
Something I should put into ANA.
Thanks
regards Keith.
On 08.02.2012 at 21:11, Lee Passey wrote:
On Wed, February 8, 2012 1:03 pm, Jimmy O'Regan wrote:
That's not so much a problem in converting HTML to EPUB -- most EPUB files are just HTML in a zip file, with some metadata -- the problem is inferring this metadata from HTML. Inferring semantic information of any kind from presentation-level details is, at best, unreliable.
One of the first things that Ms. Lofstrom suggested in response to Mr. Hutchinson's original proposal was updates to the metadata associated with a text. I would think that any automated generation process should be extracting the metadata associated with a text - and if the metadata is incorrect in the resulting file then obviously we need to improve the master metadata.
And I think this raises an interesting issue: not only do we need a master document format, we also need a master /metadata/ format.
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/mailman/listinfo/gutvol-d