
On Wed, 31 May 2006, Karl Eichwalder wrote:
Michael Hart <hart@pglaf.org> writes:
No one seems to think Gallica is really an eBook collection; raw scans seem to be most of what is available, and even those are a set of low-res versions that are not really suitable for OCRing.
OCRing is important, but OCR without the scans nearby is often not enough. I think Gallica is one of the best e-book collections. Their PDFs are very useful (you can download complete books as PDFs pretty easily and they are readable)! This way I can access the Bulletin Monumental.
I must admit that I am relying on my friends here, as my Français is not really good enough to know whether I missed something that would have provided better results on their site.
Sure, you must know the way to create and download PDFs:
Each .pdf file seemed to just hold a .gif file. . .or is there something else going on there that was missed?
www.gallica.fr -> Recherche -> "Mots du titre" - enter the title, for example "Bulletin Monumental"
In the "Résultat de la recherche": click on "Bulletin Monumental"
Select the volume you are interested in, for example "1861 (Sér. 2)"
Now "Télécharger" and "ok" if you are interested in the complete book
Then wait, PDF preparation takes time. Click Vous pouvez le télécharger "en cliquant ici." or use the supplied FTP address.
And this is supposed to prepare the book as a single .pdf file? Searchable?
Thanks!!!
Give the world eBooks in 2006!!!
Michael S. Hart
Founder
Project Gutenberg
Blog at http://hart.pglaf.org