New subject: Kindle DX, etc.

22 Jun 2009

      jim said:
...
the Google Books approach of simply digitizing the page images of a 
book 
   is a perhaps reasonable approach where the readership for that book is
...
small compared to the effort required to take the Gutenberg approach 
of 
   producing a clean digital text.  If one is going to simply digitize 
the 
   page images of a book then one needs a reader with a screen big enough
...
to display that page image legibly -- unless one imagines a technology
...
to slice and re-dice the page image in an intelligent manner so that 
   snippets of the page image fits readably onto an iPhone.
...
http://z-m-l.com/go/myant/myantp123.png
ok, where do i begin here?

just pick a spot and jump in, i guess...

i believe the google book "settlement" should be rejected.
it gives far too much to google -- a virtual monopoly --
and would thus shut down experimentation prematurely...

but what you have said here, jim, doesn't do google justice.

you seem to think that google is just scanning the books,
and displaying those scans to people.   that's not the case.
google is doing o.c.r., and is using the results of that o.c.r.

further, they show indications they'll be cleaning that o.c.r.

contrary to what you have implied here, it does _not_ take
a great deal of time or effort to create "a clean digital text".

i have shown here -- repeatedly -- that the vast majority of
o.c.r. errors can be corrected with little or no human effort,
using rigorously-applied computerized correction routines.

and google knows how to create, test, and refine those routines.

i have also shown here -- again, repeatedly -- that one attains
high accuracy by comparing different digitizations to each other,
and minimizes the human attention required in the process...

google can use this, as it is scanning lots of books multiple times.

so why does distributed proofreaders work so hard to do its books?
because it's stupid, with its head in the sand, ignores what i prove,
and doesn't mind wasting the energy and time of its volunteers...

next, one doesn't have to "imagine" a technology that will slice
and re-dice a page-image to fit it onto a certain display-size...
bill janssen over at parc demonstrated such a system long ago.
and if you look, google is currently using its own variant of that.

finally, there are a lot of page-images that fit nicely on an iphone
even without being sliced and diced.   for instance, navigate here:
this page-image displays quite nicely on an iphone, thank you.

-bowerbird

**************
Make your summer sizzle with fast and easy recipes for the 
grill. (http://food.aol.com/grilling?ncid=emlcntusfood00000004)

Re: Kindle DX, etc.

Bowerbird＠aol.com

Jim Adcock

tags

participants (2)