
On Thu, February 9, 2012 1:42 pm, don kretz wrote:
Is it possible to imagine poaching off the metadata other people have?
For one example, could TIA be mined to acquire a list of books whose images are harvestable? And could some of them be determined clearable to build a list of projects that could be begun with lower-than-usual overhead?
For WEM metadata, Open Library is probably the best open-source repository. Interestingly, Open Library contains some references to Project Gutenberg works, but only those which have been independently posted to IA. The OL people have consistently expressed complete indifference to the PG corpus. There is perhaps an opportunity here to find a way to do an automated upload of references to the PG corpus to OL, and at the same time recording the OLID for each work back into the PG metadata.