
The gallica pdf's are very low resoloution mostly. Where there are diagrams they hardly come out at all, especially mathematical ones with small lettters on them. I t may he helpful to have a copy of the book nearby. OCR'ing pdf's is not for the faint hearted, as they are ot designed for this purpose. However they are good for layout of the original publications and for copyright use as the date of publication is usually given. Also shows the title page often omitted from other pdf files. I believe some gallica are available in text format if you push the "text" button. nwolcott2@post.harvard.edu ----- Original Message ----- From: "Karl Eichwalder" <ke@gnu.franken.de> To: <gutvol-d@lists.pglaf.org> Sent: Wednesday, May 31, 2006 4:03 PM Subject: Re: !@!Re: [gutvol-d] Kevin Kelly in NYT on future of digitallibraries Michael Hart <hart@pglaf.org> writes:
No one seems to thinks Gallica is really an eBook collection, raw scans seems to be most of what is available, and even those are a set of low-res versions that is not really suitable for OCRing.
OCRing is important, but OCR without the scans nearby is often not enough. I think gallica is one of the best e-book collections. Their PDF are very useful (you can download complete books as PDFs pretty easily and they are readable)! This way I can access the Bulletin Monumental.
I must admit that I am relying on my friends here, as my Francias is not really good enough to know if I didn't miss something that would have provided better results on their site.
Sure, you must know the way to create and download PDFs: www.gallica.fr -> Recherche -> "Mots du titre" - enter the title, for example "Bulletin Monumental" In the "Résultat de la recherche: click on "Bulletin Monumental" Select the volume, you are interested in, for example "1861 (Sér. 2)" Now "Télécharger" and "ok" if you are interested in the complete book Then wait, PDF preparation takes time. Click Vous pouvez le télécharger "en cliquant ici." or use the supplied FTP address. I hope this helps. -- http://www.gnu.franken.de/ke/ | ,__o | _-\_<, | (*)/'(*) Key fingerprint = F138 B28F B7ED E0AC 1AB4 AA7F C90A 35C3 E9D0 5D1C _______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/listinfo.cgi/gutvol-d