Re: [gutvol-d] book of james -- 001

carlo said:
You seem to be unaware that you can get the text for a single page from the Internet Archive djvu files, through the djvutxt command and the -page option.
holy crap... i did not know that... i figured that there _had_to_ be a way to do it, since the .djvu knows the text on a page, but i didn't know how. how/where exactly does one issue this command? -bowerbird p.s. james, if you could still zip up your files, that would let me splice in your corrections...

Even simpler, you can do http://www.archive.org/stream/aliceinwonderlan00carriala#page/23 Instructions here: http://openlibrary.org/dev/docs/bookurls On Tue, Dec 20, 2011 at 4:24 PM, <Bowerbird@aol.com> wrote:
carlo said:
You seem to be unaware that you can get the text for a single page from the Internet Archive djvu files, through the djvutxt command and the -page option.
holy crap... i did not know that... i figured that there _had_to_ be a way to do it, since the .djvu knows the text on a page, but i didn't know how.
how/where exactly does one issue this command?
-bowerbird
p.s. james, if you could still zip up your files, that would let me splice in your corrections...
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/mailman/listinfo/gutvol-d

On 12/20/2011 8:05 PM, don kretz wrote:
Instructions here:
Wow! It's like they think a book is a sequence of page images. Which I guess is OK if you just want to be an archive of page images. But there's so much value they could add!

If you read the whole page it discusses pulling djvu text further down. On Tue, Dec 20, 2011 at 6:15 PM, Da Badger <badger@nyc.rr.com> wrote:
On 12/20/2011 8:05 PM, don kretz wrote:
Instructions here:
http://openlibrary.org/dev/docs/bookurls
Wow! It's like they think a book is a sequence of page images. Which I guess is OK if you just want to be an archive of page images. But there's so much value they could add!
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/mailman/listinfo/gutvol-d

"Bowerbird" == Bowerbird <Bowerbird@aol.com> writes:
Bowerbird> carlo said: >> You seem to be unaware that you can get the text for a single >> page from the Internet Archive djvu files, through the djvutxt >> command and the -page option. Bowerbird> holy crap... i did not know that... i figured that Bowerbird> there _had_to_ be a way to do it, since the .djvu knows Bowerbird> the text on a page, but i didn't know how. Bowerbird> how/where exactly does one issue this command? djvutxt --page=123 djvufile.djvu 123.txt does the job for the 123th page. Just read the manual, e.g. google djvutxt (first hit, http://djvu.sourceforge.net/doc/man/djvutxt.html) for more details. Search the DP forum for djvutxt for a few simple shell scripts using djvutxt. Carlo
participants (4)
-
Bowerbird@aol.com
-
Da Badger
-
don kretz
-
traverso@posso.dm.unipi.it