project gutenberg was hit in the middle of the road,
bloody and bruised, and with a ton of broken bones.

so what do you guys do?

you make a circle-jerk, trying to create a dialog on
how you should draw up plans to build a hospital...

it's all very amusing.

except, you know, for the guy in the middle of the road.

***

marc d'hooghe said:
>   All those who are comma fucking and
>   mosquito sifting right now - what does
>   all that mean to the general public?

fantastic question, marc.  thanks for asking it.

i answered you, and turned it into an e-book...

you can access that book online, here:
>   http://z-m-l.com/go/ansmd/ansmdp001.html

i'll also append the text of the book here, but since
my approach was an examination of _screen-shots_
from a couple p.g. kindle files, it'll probably be better
to view the thing online, so you see the screen-shots.

if you want to review the screen-shots separately,
they all can be found in that very same directory:

>   http://z-m-l.com/go/ansmd/ansmdp001.png
>   http://z-m-l.com/go/ansmd/ansmdp002.png
...
>   http://z-m-l.com/go/ansmd/ansmdp025.png

once you actually _look_ at these derivative formats,
like i do here, you realize that there are a few flaws
in the converter that account for a large number of
the goofy stuff that most users find so objectionable,
meaning that it wouldn't be hard to fix the problems,
and thereby increase end-user-satisfaction immensely.

which is the right way to _start_ treating this patient
who is bloody and bruised in the middle of the road.

-bowerbird

{{ansmdp001.png}}







watching alice


by bowerbird



[[1]]
{{ansmdp002.png}}







table of contents


watching alice ......... 1
table of contents ......... 2
the object of this drill ......... 3
so what do we see right away? ......... 4
the title-page screen ......... 5
divorce is not good for the children ......... 6
and now the corpse... i mean the body ......... 7
a tail of woe ......... 8
paragraphs set adrift in the sea ......... 9
shall we dance? ......... 10
maybe we need an update of the input ......... 11
did you ever do collage in school? ......... 12
i majored in collage at college ......... 13
blue plate special coming right up ......... 14
more paragraphs set adrift at sea ......... 15
this screen hit an iceberg and is now sinking ......... 16
use extreme caution due to wide load ahead ......... 17
another tail of woe ......... 18
getting a head of your self ......... 21
a much nicer dance ......... 22
followed by vomit on the dance-floor ......... 23
left to die all alone by yourself ......... 24
so that does it for the kindle versions ......... 25



[[2]]
{{ansmdp003.png}}







the object of this drill


marc d'hooghe said:
>   All those who are comma fucking
>   and mosquito sifting right now -
>   what does all that mean to the
>   general public? please enlighten?

excellent questions, marc!

i'm not too sure i know what the
other people here have in mind...

but my intentions with these issues
are _to_root_out_the_huge_failure._

the .epubs and .mobis generated
by project gutenberg are _flawed._

they often do not work right, and
even when they "do", they're ugly.

so i'll point out some of the flaws,
so you see what i'm talking about.

i'm gonna work with a classic text:
alice's adventures in wonderland...

many e-book aficionados consider
"alice" to be the flagship e-book...

michael hart, for instance, loved it.

so let's take a look at this trophy...

as befitting the legendary status of
this book, its p.g. e-text # is 11...

>   http://gutenberg.org/ebooks/11

so let's take a look at the .mobi and
the .epub which p.g. auto-generates,
shall we?

today we'll do the .mobi version...

this is not an _exhaustive_ review
-- i won't point to all the flaws --
but rather just an illustrative one.

so let's get right to it, shall we?



[[3]]
{{ansmdp004.png}}







so what do we see right away?


this is what we see right away...

legalese is ugly.  make it go away.



[[4]]
{{ansmdp005.png}}







the title-page screen


here's the "title-page" screen.

it has the title, but that's all.
there's room for more stuff
-- like the author's name --
but nothing more is here...



[[5]]
{{ansmdp006.png}}







divorce is not good for the children


we have to wait for the second screen
to learn the author.  and then, again,
that's all we get.  a big screen, with
nothing but the author's name on it
and a line informing us of the edition.



[[6]]
{{ansmdp007.png}}







and now the corpse...  i mean the body


now we get to the actual text.  good.

except that's not a very nice header.
the word "chapter" is all-uppercase,
which gives it unnecessary gravitas,
compared to the title of the chapter,
which is entirely more informative...

and the way the chapter-number and
the chapter-title run together is ugly.

the original p-book had the number
on one line, and the title beneath it;
that's a much better way to display it.

also, we see that this book still has
the all-uppercase version of _italics,_
so we sigh at the antiquated practice.



[[7]]
{{ansmdp008.png}}







a tail of woe


one of the famous things in "alice" is
the formatting of "the mouse's tale".

it plays off the "tail/tale" homonym,
as the type was set in the shape of
the curled tail of a mouse, complete
with the nice touch that the type got
_smaller_ toward the end of the tail.

e-book viewer-programs can't manage
small type as nicely as the printed page,
but we should at least get the _shape_
of the tail right.  but alas, the formatting
here simply pulls all the lines to the left.

that has to be regarded as a _total_fail._

you cannot mangle something so iconic...

(and if you _are_ gonna mangle it, then
at least show the people a _picture_ of
what you _should_ have done for 'em.)



[[8]]
{{ansmdp009.png}}







paragraphs set adrift in the sea


another problem with this file is that
the paragraphs have been set adrift...
the spacing between paragraphs is
too large.  it's as if there are _two_
blank lines between each paragraph.

one would be sufficient.

indeed, since these paragraphs are
also _indented,_ some people would
"inform" you that there should be _no_
blank space between the paragraphs.

they'll tell you that you have to use
_either_ block-style or indent-style.

i don't know about that.  i kinda like
the hybrid style, especially with only
half a blank line between paragraphs.

but two blank lines between paragraphs?
that's clearly too much, and it's a big flaw
on pages like this, with many paragraphs.
(pages with a lot of dialog are the worst.)

also note that the last few paragraphs
on this page are not tagged correctly...
they're in a different font, monospaced,
and with the formatting being improper.



[[9]]
{{ansmdp010.png}}







shall we dance?


this page contains "the lobster dance",
yet another famous piece in the book.

as you see, it's not formatted very nicely.

the monospaced font is particularly ugly...
but even in a regular font, it wouldn't work.
the long-line/short-line pacing is faulty...



[[10]]
{{ansmdp011.png}}







maybe we need an update of the input


ok, so we see that venerable #11 is _not_
looking all that good when p.g. updates it
with generated formats for the modern age.

but maybe a more-recent rendering will?

especially one which started life with .html,
rather than as a .txt file like good old #11?

so let's look at #28885, dated may of 2009,
done by -- among others -- jana, who just
posted here recently, so might provide input.

the first thing to notice, again, is the legalese
-- an awful way to present a book to a person.
like a good dish, the presentation _matters..._

and yet another thing to notice here is that
the table of contents -- in the left panel --
is defective.  none of the chapters are listed.
oops.  and ok, yes, if you click on "contents",
you are taken to the table of contents, where
the individual chapters _are_ listed and linked.

the purpose of the contents in the left-panel,
however, is precisely so you see the contents
_without_ having to jump to the contents page,
so you don't have to leave the page you are on.

so right off the bat, we have a major flaw here,
with the fact of that missing table-of-contents.



[[11]]
{{ansmdp012.png}}







did you ever do collage in school?


this next screenshot is the "title-page" screen.

it's "interesting", that's for sure, kind of a collage,
with the title-type rendered at the screen-bottom.

i'm not sure it's too impressive as a "title-page",
but i give it credit for being artsy and creative...



[[12]]
{{ansmdp013.png}}







i majored in collage at college


more creativity via collage on the next screen...



[[13]]
{{ansmdp014.png}}







blue plate special coming right up


next up, we have a "list of the plates", hotlinked.

this is really nice.  people like to look at pictures,
and this gives them a fairly easy way to do that...

so bonus points here for convenience for the user.

unfortunately, before i can even add in the bonus,
i will have to subtract points for some major flaws.

first up, we have the chapter starting mid-screen.
this is very bad, and serves as a frightening omen.

moreover, the chapter-title isn't made prominent.
it's not in bigger type.  it's not bold.  not centered.
none of the trappings that would mark it as a title
have been included here.  again, this is a big flaw.

meanwhile, as before, "chapter" is all-caps.  ugly.

but then, these problems are compounded further.

we see the postprocessor tried to "get fancy" here,
and installed a drop-cap.  yes, it's very "decorative",
but .mobi doesn't handle images very well right now,
so the drop-cap -- the letter "a" -- goes on one line,
and the rest of the word/sentence continues _below._

so instead of starting with "alice was beginning to",
the line now says "lice was beginning to", and i don't
think _lice_ is something the reader should conjure
at this point in time, right at the start of this story...

alice has quite beautiful hair, and we don't want to
have to wonder if she's been infested with head-lice.
that's not what the "wonder" in wonderland is about.



[[14]]
{{ansmdp015.png}}







more paragraphs set adrift at sea


this screen reveals this version, too, has the glitch of
too much space between paragraphs.  plus, in addition,
there is a _ton_ of space on top of "down, down, down".

i don't know what the problem is, but this is a problem.



[[15]]
{{ansmdp016.png}}







this screen hit an iceberg and is now sinking


on the next screenshot, we see we have some troubles
with orphans.  many reader-programs have this flaw,
and there isn't a whole lot we can do to _avoid_ it, but
what we see here is that bad omen i mentioned earlier.

chapter 2 starts right beneath it, and thus _exacerbates_
the orphan.  a new chapter should start on a new screen,
to wash away any memory of an orphan right before it.

and, of course, we have the chapter-title which doesn't
_look_ like it's a title, the overbearing all-caps "chapter",
and the embarrassing drop-cap too.  not to mention the
spacey paragraphs.  in addition, we have an _invitation_
which is not formatted nicely, as it was in the p-book...

so all in all, this screen is pretty much a disaster.



[[16]]
{{ansmdp017.png}}







use extreme caution due to wide load ahead


the next screenshot shows us a page containing a picture.

the postprocessor floated the picture to the right and put
the caption to the left of it.  which works very nicely here,
because i have a large monitor.  but it should be obvious
that such a design won't work as well on a smaller screen.
there, it works better to put a caption _below_ its image...

also, given that the picture takes up so much of the screen,
the spacey paragraphs below seem even more isolated and
the screen ends up having far too much white-space on it...



[[17]]
{{ansmdp018.png}}







another tail of woe


here we have the mouse-tail.  once again, a total failure...


[[18]]
{{ansmdp019.png}}


...complete utter failure...


[[19]]
{{ansmdp020.png}}


...all the way to the end...

also, here's yet another page with lots of dialog,
which causes our spacey paragraphing to grow
discomforting to what is an unbearable extreme.



[[20]]
{{ansmdp021.png}}







getting a head of your self


and now that bad omen morphs into a bad dream,
what with a chapter-head separated from its body.

when you start each chapter on a new screen, you
avoid the possibility of decapitations like this one.



[[21]]
{{ansmdp022.png}}







a much nicer dance


the lobster dance got formatted better in this version
than it was formatted in the .mobi for the earlier one.

seems a bit weird that the last line of each verse is
separated, but maybe that's how the p-book had it.
i don't remember it that way, but, uh... alzheimer's.

i'd also wanna check indentation against the p-book,
but that's also an issue of postprocessor prerogative,
and i don't have any objections to how it looks now.



[[22]]
{{ansmdp023.png}}







followed by vomit on the dance-floor


the verse on this page, however, is done very badly...



[[23]]
{{ansmdp024.png}}







left to die all alone by yourself


it's really embarrassing when your whole book ends
on an orphan.  embarrassing for you, and the company
who programmed the app that did such a stupid thing.
(i'm looking at you, amazon.  looking squarely at you.)

and yeah, the transcriber's note, which instructs you to
hover the mouse over a correction (with its dotted line)
in order to get the original text, will be quite amusing to
people reading on their iphones and tablets and all the
other form-factors which don't happen to have a mouse.
it just goes to show the mindset of the post-processor...

they think they're making e-books for the ages, and they
seem to have no idea how limited their vision actually is...
if there's anything we should have learned by now, it's that
whatever our idea of the future, the future will be not-that.

oh yeah, and now we have the legalese at the book's end...
it assaulted us on the way in, and now again on the way out.
please just make it go away.  put it in an "about" box that
end-users can totally and completely and utterly _ignore._
i'll accept one line -- a link to the legalese on the p.g. site.
anything more than that is _too_much._  make it go away...



[[24]]
{{ansmdp025.png}}







ok, so that does it for the kindle versions


so, we've looked at two versions of "alice in wonderland",
and found that the kindle edition which is generated is,
to put it mildly, flawed.  many of the problems _could_ be
solved by simple improvements to the converter routine,
having the appeal of fixing other auto-generated versions.

some of the other problems might require some easy edits
to the input-files, so that they will be converted correctly...

but there appears to be no reason that this book cannot
display nicely on the kindle, befitting its status as legend.

sometime later this week, we'll take a look at the .epubs
which are generated by the project gutenberg converter.

-bowerbird



[[25]]