[gutvol-p] catalog.rdf invalid xml

Marcello Perathoner webmaster at gutenberg.org
Wed Mar 23 14:34:09 PST 2005


Conrad Parker wrote:

> Unfortunately it seems the catalog.rdf file is missing some lines, and
> as a result cannot be parsed by strict parsers such as those in libxml2
> (which is very widely used by many platforms).

I just parsed it successfully using perl 5.8.0 and libxml 2.5.10.


> After some brief googling I came across Grahame Bowland's site, which
> includes a simple unix shell script which he developed recently:
> 
> http://angrygoats.net/svn/gutenberg/fix-catalog.sh
> 
> This inserts the missing entities into the DOCTYPE declaration at the
> top of catalog.rdf. Of course it would be better if these entities could
> be included in the original catalog.rdf published by Project Gutenberg :)

We do not use HTML entities in the database any more, so the generated 
RDF/XML and RSS should not contain any.



-- 
Marcello Perathoner
webmaster at gutenberg.org




More information about the gutvol-p mailing list