happy birthday project gutenberg!

michael said:
Considering the 40th Anniversary of eBooks
happy birthday, project gutenberg!
40 years ago...one title available at Project Gutenberg.
perhaps even more amazing: after its first 20 years, project gutenberg only had 100 e-texts in its library -- still more "proof of concept" than anything else... but it took only 12 more years to expand to 10,000, and gain the heft to merit the appellation "library", with this growth fueled by _individual_digitizers_... all this got the attention necessary for larger efforts, such as internet archive and google, and thus we saw the possibility for electronic-books to finally emerge, after a decades-long period of quite painful gestation. and thus, in 2011, we have the "coming of age" time for e-books, when amazon reports that it sells more e-books than p-books, while self-published authors -- always insulted previously with the "vanity" slur -- discovered (quite by accident) that a direct connection to their audience enabled them to make _big_ money, hundreds of thousands of dollars, and even millions... just as shocking, a wide range of e-book hardware is available in all the digital-gadget big-box stores, and even being touted in slick commercials on television. my word, even barnes&noble is in the e-reader game! so there is definitely cause for celebration! :+) at the same time, however, the news is not all good... and the ugly part of the equation can be summed up in one word: greed. unsurprisingly, leading this side are the corporations, this time in the form of the boys at the publishing houses, who are extremely reluctant to give up their gravy train, so are draggin' their feet... you know all the tactics... copyright legislation, d.r.m., an "open standard" called ".epub" that is a piece of shit, and every fear/uncertainty/doubt issue they can muster. but, hey, we _expected_ those selfish kids to be greedy; after all, it's the object of the capitalist game they play. what really hurt is that they were joined by a group of _authors_, who couldn't see the vast potential in being free of their corporate masters, and thus joined them in suing the google scanning project... and then what _really_ brought the house crumbling down was when google decided to turn evil, and buy out of the lawsuit by throwing some chump change to their "opposition" from extraction via onerous extortion of you and me... google deduced it could get the public and libraries to pay for access to google e-books... so essentially the "global library" we envisioned has been morphed into a "global bookstore", a monopoly with too-high prices. and just today, google announced "iriver story", some e-book tablet "integrated" with its e-book platform... "through which you can buy and read google e-books over wi-fi". thus the bait-and-switch is fait accompli. will future generations even know what a "library" is? or will they have access only to the books they pay for? what about the rest of the world, much of which cannot begin to afford the ridiculous rates google will charge? just how hard will the greedy capitalists fight this fight? i don't know the answers to these questions, but i can say that, at my age, i'm less optimistic than i once was. it is entirely possible that we won the battle for e-books, but lost the war for a global cyberlibrary accessible to all. so again, happy birthday, project gutenberg! but now starts the hard part... -bowerbird

I should add that Barnes and Noble also reports more eBooks sold than paper books. . .but I THINK that is just from online sales, not sure, but soundls like it doesn't include over the counter. mh

I don't know if the google library it charges access to is usable by screen readers or not, but the two ebooks I downloaded from them, both of which were free, were simply scanned images of the paper book nicely bundled into a (I'm sure) pretty looking pdf file. Unfortunately, that kind of thing is completely useless to visually impaired folks who depend on screen readers for their text, since there's no text on the page, simply an image of text. Thus, Google's claim that they're creating a library for "all" is erroneous. I doubt this sort of thing will change, but it sure would be nice if they'd actually include a readable version of the book, since obviously they have it in a real text form, or else their search engine couldn't index the silly thing. Stupid if you ask me, but since nobody did. ...

I don't know if the google library it charges access to is usable by screen readers or not, but the two ebooks I downloaded from them, both of which were free, were simply scanned images of the paper book nicely bundled into a (I'm sure) pretty looking pdf file.
Not to defend that which Google is doing or not doing, but to point out a "correction" -- not all the books in google library are simply scanned images bundled into a "pretty looking pdf file." On the left side of the google books page that displays a book you have clicked through on because you think you are interested in that book is a tab choice labeled "Read on Your Device." About half the books I see there have the option "eReaders and other devices" which will have the choice you described earlier namely "Download PDF" which indeed is simply a bundle of pretty pictures -- bitmap representations of "Xerox Copies" of each page in the book. But the other option is sometimes "Download EPUB" -- which still has the problem that introductory pages, table of contents continue to only be represented as bitmap "pretty pictures", but the body text has been automagically Optical Character Recognized and converted to electronic text form, which screen readers can hopefully handle -- especially since one can get ePub plug-ins to allow one to read ePub format directly in several common web browsers -- and modern screen readers had better be able to handle modern web browsers. Google has put a lot of work into their in-house OCR engine the last couple years, and while the results are not even up to that of the worse PG books, I find them surprisingly useful for a raw OCR effort. Still, truth be told, I find it easier to read the "pretty picture" bitmap scanned "Xerox Copies" of google books than their somewhat mangled "OCR EPUB" versions. In either case one has to be somewhat motivated to actually read the book in question -- which presumably is something old and obscure -- otherwise one can generally find a "clean copy" of things that are old and *not* obscure already in existence in the PG library. Below find a snippet example of the performance of the google OCR efforts which one can compare to PG books human efforts (compare to William Winter "Shakespeare's England" PG# 35105) -- not all google OCR efforts are this successful, especially when they try to OCR an even older text: ===== PEEFACE. Beautiful and storied scenes which have soothed and elevated the mind, naturally inspire a feeling of gratitude. Prompted by this feeling, the present author has written this record of his rambles in England. It was his wish, in dwelling thus upon the rural loveliness and the literary and historical associations of that delightful realm, to afford sympathetic guidance and useful suggestion to other American travellers who, like himself, might be attracted to roam among the shrines of the mother land. There is no pursuit more fascinating, or in a high intellectual sense more remunerative; since it serves to define and regulate the stores of knowledge which have been acquired by reading, to correct misapprehensions of fact, * to broaden the mental vision, to ripen and refine the judgment and the taste, and to fill the memory with ennobling recollections. These papers, accordingly, since they aim to encourage this pursuit, are at least creditable in design, however defective they may be in execution. They were addressed more particularly to American than to European readers. They commemorate two separate visits to England, the first made in 1877, tJte second in 1882; they occasionally touch upon the same place or scene as observed at different times; and especially they describe two distinct journeys, separated by an interval of five years, through the region associated with the great name of Shakespeare. Repetitions of the same reference, which now and then occur, were found unavoidable by the writer, but it is hoped that they will not be found tedious by the reader. Those who walk twice in the same pathways should be pleased, and not pained, to find the same wild-flowers growing beside them. The American edition of this work is comprised i> two volumes, published at Boston, called "The Trip to England" and "English Rambles." ....

On Aug 13, 2011, at 12:13 PM, Jim Adcock wrote:
Not to defend that which Google is doing or not doing, but to point out a "correction" -- not all the books in google library are simply scanned images bundled into a "pretty looking pdf file." On the left side of the google books page that displays a book you have clicked through on because you think you are interested in that book is a tab choice labeled "Read on Your Device." About half the books I see there have the option "eReaders and other devices" which will have the choice you described earlier namely "Download PDF" which indeed is simply a bundle of pretty pictures -- bitmap representations of "Xerox Copies" of each page in the book. But the other option is sometimes "Download EPUB" -- which still has the problem that introductory pages, table of contents continue to only be represented as bitmap "pretty pictures", but the body text has been automagically Optical Character Recognized and converted to electronic text form, which screen readers can hopefully handle -- especially since one can get ePub plug-ins to allow one to read ePub format directly in several common web browsers -- and modern screen readers had better be able to handle modern web browsers.
Heh. Yeah, they sure can. (well, most of them) However, I generally find it easier (until recently, I didn't have a device that could handle epub books) simply to extract the contents of the epub book, and read the html thereby revealed in my regular web browser. It works a treat, and doesn't require me to do anything to make it accessible. :) I wasn't aware google books were available in epub formats. I'll take another look then, and see if there's epub versions next time I perform a search and they turn up. Thanks for that info.
participants (4)
-
Bowerbird@aol.com
-
Jim Adcock
-
Michael S. Hart
-
Travis Siegel