[gutvol-p] Re: Gutenberg Catalogue RDF

Marcello Perathoner marcello at perathoner.de
Wed May 26 13:36:07 PDT 2010


William Waites wrote:

>  * Different subject URI <-- very important, could kludge with
> owl:sameAs but
>     shouldn't have to

The DCMI changed their recommendations. The RDF files follow what 
recommendations where current at the time I wrote the scripts.

The syntax may be different but the semantic is the same.

>  * Different layout for dc:subject (uses a rdf:Bag in one, a simple bunch of
>     bnodes in the other)

A Bag *is* just a bunch of nodes.

The syntax may be different but the semantic is the same.

>  * Creator/Contributor/Publisher has a URI in the individual files but a
> text string
>     in the catalog.rdf.gz. Using a URI is the right way to do it.

Thats the only difference.

Actually using an URL is quite the wrong way. I did that only to make it 
possible for somebody to create an exact replica of our dataset (ie. 
containing the exact same set of (wrong?) assumptions we made.)

The semantic of the string literal is: the author of this book is 
spelled 'John Doe'.

The semantic of the URL is: the author of this book is spelled 'John 
Doe' *and* the authors of two books are the same person if they share 
the same url.

Now the second statement is a very bold statement, especially if you 
don't find any LoC record for the book you are cataloguing or the LoC 
doesn't know either. (This happens quite often.)


>  * Links to downloadable resources are absent in the catalog

Look further down.



-- 
Marcello Perathoner
webmaster at gutenberg.org



More information about the gutvol-p mailing list