William Waites wrote:
* Different subject URI <-- very important, could kludge with owl:sameAs but shouldn't have to
The DCMI changed their recommendations. The RDF files follow what recommendations where current at the time I wrote the scripts. The syntax may be different but the semantic is the same.
* Different layout for dc:subject (uses a rdf:Bag in one, a simple bunch of bnodes in the other)
A Bag *is* just a bunch of nodes. The syntax may be different but the semantic is the same.
* Creator/Contributor/Publisher has a URI in the individual files but a text string in the catalog.rdf.gz. Using a URI is the right way to do it.
Thats the only difference. Actually using an URL is quite the wrong way. I did that only to make it possible for somebody to create an exact replica of our dataset (ie. containing the exact same set of (wrong?) assumptions we made.) The semantic of the string literal is: the author of this book is spelled 'John Doe'. The semantic of the URL is: the author of this book is spelled 'John Doe' *and* the authors of two books are the same person if they share the same url. Now the second statement is a very bold statement, especially if you don't find any LoC record for the book you are cataloguing or the LoC doesn't know either. (This happens quite often.)
* Links to downloadable resources are absent in the catalog
Look further down. -- Marcello Perathoner webmaster@gutenberg.org