
Everyone, I just posted a TeleRead blog article, which in turn links to the blog article posted by Catherine Hodge at DPP Store, about the World eBook Fair (WeBF). My blog article: http://www.teleread.org/blog/?p=5230 Catherine's blog article: http://dppebookstore.blogspot.com/2006/07/world-ebook-fair-12-million-downlo... Both Catherine and I are perplexed by the lack of public discussion about the WeBF on the various ebook-related forums such as this one. What are your thoughts? Jon Noring

On Wed, 26 Jul 2006, Jon Noring wrote:
Both Catherine and I are perplexed by the lack of public discussion about the WeBF on the various ebook-related forums such as this one. What are your thoughts?
Ok, since you ask, I'll share my viewpoint. I think that most PG volunteers are aware that PG texts are widely reused, reformatted, and re-presented, on many websites (and sometimes in print, as well). The WEBF can be seen as just one more of these instances. And yes, I know that it also contains many texts from other collections. For years there has been thousands of other texts online that cannot be found in PG. Again, the WEBF is just one more of these instances. What I do give it credit for is good marketing. It's like putting up a big sign saying: "Free for a limited time only!" and giving away material which you can find freely any time you want. However, for some people, that might be the best way to get their interest. It certainly does fit into Michael Hart's vision of giving away as many eBooks as possible to as many people as possible in as many ways as possible. Andrew

For my part, I volunteered to help Greg with readingroo.ms administration and then promptly forgot that you can't all see the graphs I can about the amount of throughput that server generated. :)

On Thu, Jul 27, 2006 at 12:00:11AM -0600, joey wrote:
For my part, I volunteered to help Greg with readingroo.ms administration and then promptly forgot that you can't all see the graphs I can about the amount of throughput that server generated. :)
They're here (though maybe someday they should be password-protected): http://ibis.riseup.net/munin/ms/readingroo.ms.html We maxed at 100Mbps last week, and have been pushing 40-60Mbps daily, with peaks during the daytime in Europe/Asia. -- Greg

According to worldebookfair.com they serve 1 million ebooks / day. gutenberg.org serves 60.000 ebooks / day. According to alexa¹ worldebookfair.com gets less traffic than gutenberg.org and still they manage to serve 16 times as many ebooks. I wonder how they do that? ¹) http://www.gutenberg.org/internal/stats/alexa user: internal pass: books On the plus side gutenberg.org gets some traffic from worldebookfair.com. This is where people came from in July: Listing the top 30 referring sites by the number of requests, sorted by the number of requests. reqs %reqs site 236070 19.04% http://www.google.com/ 125094 10.09% http://en.wikipedia.org/ 107354 8.66% http://worldebookfair.com/ 57974 4.68% http://search.yahoo.com/ 31210 2.52% http://www.google.co.uk/ 25132 2.03% http://www.promo.net/ 18850 1.52% http://www.google.co.in/ 17347 1.40% http://www.google.ca/ 16011 1.29% http://www.google.de/ 15762 1.27% http://www.stumbleupon.com/ 13664 1.10% http://profile.myspace.com/ 13238 1.07% http://www.google.com.au/ 12807 1.03% http://my.yahoo.com/ 12650 1.02% http://www.google.fr/ 12649 1.02% http://64.233.179.104/ 11854 0.96% http://www.digg.com/ 11694 0.94% http://digg.com/ 9228 0.74% http://www.google.com.ph/ 8621 0.70% http://search.msn.com/ 7801 0.63% http://www.ovelho.com/ 7671 0.62% http://www.worldebookfair.com/ 6568 0.53% http://66.249.93.104/ 6487 0.52% http://oldfashionededucation.com/ 6475 0.52% http://www.google.es/ 6023 0.49% http://www.google.it/ 5894 0.48% http://www.google.pl/ 5854 0.47% http://librivox.org/ 5824 0.47% http://www.google.com.br/ 5751 0.46% http://www.google.nl/ 5106 0.41% http://luminis1.wright.edu/ 413228 33.33% [not listed: 20,347 sites] -- Marcello Perathoner webmaster@gutenberg.org

Marcello Perathoner wrote on 27/07/2006, 6:15 AM:
According to worldebookfair.com they serve 1 million ebooks / day.
gutenberg.org serves 60.000 ebooks / day.
According to alexa¹ worldebookfair.com gets less traffic than gutenberg.org and still they manage to serve 16 times as many ebooks. I wonder how they do that?
¹) http://www.gutenberg.org/internal/stats/alexa user: internal pass: books
On the plus side gutenberg.org gets some traffic from worldebookfair.com. This is where people came from in July:
Listing the top 30 referring sites by the number of requests, sorted by the number of requests.
reqs %reqs site 236070 19.04% http://www.google.com/ 125094 10.09% http://en.wikipedia.org/ 107354 8.66% http://worldebookfair.com/ 57974 4.68% http://search.yahoo.com/ 31210 2.52% http://www.google.co.uk/ 25132 2.03% http://www.promo.net/ 18850 1.52% http://www.google.co.in/ 17347 1.40% http://www.google.ca/ 16011 1.29% http://www.google.de/ 15762 1.27% http://www.stumbleupon.com/ 13664 1.10% http://profile.myspace.com/ 13238 1.07% http://www.google.com.au/ 12807 1.03% http://my.yahoo.com/ 12650 1.02% http://www.google.fr/ 12649 1.02% http://64.233.179.104/ 11854 0.96% http://www.digg.com/ 11694 0.94% http://digg.com/ 9228 0.74% http://www.google.com.ph/ 8621 0.70% http://search.msn.com/ 7801 0.63% http://www.ovelho.com/ 7671 0.62% http://www.worldebookfair.com/ 6568 0.53% http://66.249.93.104/ 6487 0.52% http://oldfashionededucation.com/ 6475 0.52% http://www.google.es/ 6023 0.49% http://www.google.it/ 5894 0.48% http://www.google.pl/ 5854 0.47% http://librivox.org/ 5824 0.47% http://www.google.com.br/ 5751 0.46% http://www.google.nl/ 5106 0.41% http://luminis1.wright.edu/ 413228 33.33% [not listed: 20,347 sites]
-- Marcello Perathoner webmaster@gutenberg.org
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/listinfo.cgi/gutvol-d
I do say, that's good word of mouth to get over 100,000 requests from the book fair site. Of course indeed, a lot of our traffic comes from wikipedia (which has a nice article on PG) and from people searching for books or for PG itself. Any word of mouth is great, no matter where it comes from :) Jared -- .

Has it been about the same (gutenberg downloads) while said fair has been on? Quoting Marcello Perathoner <marcello@perathoner.de>:
According to worldebookfair.com they serve 1 million ebooks / day.
gutenberg.org serves 60.000 ebooks / day.
According to alexa¹ worldebookfair.com gets less traffic than gutenberg.org and still they manage to serve 16 times as many ebooks. I wonder how they do that?
------------------------------------------------------------ This email was sent from Netspace Webmail: http://www.netspace.net.au

rnmscott@netspace.net.au wrote:
Has it been about the same (gutenberg downloads) while said fair has been on?
I'd say we got about 20% more book downloads in the first two weeks ... its back to normal now. http://www.gutenberg.org/browse/scores/books-downloaded.png -- Marcello Perathoner webmaster@gutenberg.org

On Thu, Jul 27, 2006 at 03:15:49PM +0200, Marcello Perathoner wrote:
According to worldebookfair.com they serve 1 million ebooks / day.
gutenberg.org serves 60.000 ebooks / day.
According to alexa¹ worldebookfair.com gets less traffic than gutenberg.org and still they manage to serve 16 times as many ebooks. I wonder how they do that?
My first guess is that since Alexa is based on sampling, their estimate is incorrect. I've watched traffic from wef since it started, and we've been pushing anywhere from 20Mbps to as high as 100Mbps (with typical daily peaks of 40-60Mbps). That's a lot of data. The last time I heard, UNC (where iBiblio is based) has 600Mbps total capacity, and about 1/3 of that (200Mbps) is allocated to iBiblio, where gutenberg.org lives. Those numbers might have increased in the last few years, however. On the other hand, maybe I'm counting wrong. I'll be looking at the 7GB access_log (currently) in detail once the WEF is over, and maybe Marcello can help so we can compare apples to apples. I have tried to only include successful/completed downloads, and also to only include eBooks (not stuff like front page images and the catalog page), but the count is based on a simple "grep" so could be off. One other factoid: We are using iptables to limit the number of simultaneous connections from a single IP address. (This might make for some unhappy proxy users, unfortunately.) The download total as of right now is just over 19 million. -- Greg
¹) http://www.gutenberg.org/internal/stats/alexa user: internal pass: books
On the plus side gutenberg.org gets some traffic from worldebookfair.com. This is where people came from in July:
Listing the top 30 referring sites by the number of requests, sorted by the number of requests.
reqs %reqs site 236070 19.04% http://www.google.com/ 125094 10.09% http://en.wikipedia.org/ 107354 8.66% http://worldebookfair.com/ 57974 4.68% http://search.yahoo.com/ 31210 2.52% http://www.google.co.uk/ 25132 2.03% http://www.promo.net/ 18850 1.52% http://www.google.co.in/ 17347 1.40% http://www.google.ca/ 16011 1.29% http://www.google.de/ 15762 1.27% http://www.stumbleupon.com/ 13664 1.10% http://profile.myspace.com/ 13238 1.07% http://www.google.com.au/ 12807 1.03% http://my.yahoo.com/ 12650 1.02% http://www.google.fr/ 12649 1.02% http://64.233.179.104/ 11854 0.96% http://www.digg.com/ 11694 0.94% http://digg.com/ 9228 0.74% http://www.google.com.ph/ 8621 0.70% http://search.msn.com/ 7801 0.63% http://www.ovelho.com/ 7671 0.62% http://www.worldebookfair.com/ 6568 0.53% http://66.249.93.104/ 6487 0.52% http://oldfashionededucation.com/ 6475 0.52% http://www.google.es/ 6023 0.49% http://www.google.it/ 5894 0.48% http://www.google.pl/ 5854 0.47% http://librivox.org/ 5824 0.47% http://www.google.com.br/ 5751 0.46% http://www.google.nl/ 5106 0.41% http://luminis1.wright.edu/ 413228 33.33% [not listed: 20,347 sites]
-- Marcello Perathoner webmaster@gutenberg.org
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/listinfo.cgi/gutvol-d

Greg wrote:
On the other hand, maybe I'm counting wrong. I'll be looking at the 7GB access_log (currently) in detail once the WEF is over, and maybe Marcello can help so we can compare apples to apples. I have tried to only include successful/completed downloads, and also to only include eBooks (not stuff like front page images and the catalog page), but the count is based on a simple "grep" so could be off.
One other factoid: We are using iptables to limit the number of simultaneous connections from a single IP address. (This might make for some unhappy proxy users, unfortunately.)
The download total as of right now is just over 19 million.
A more telling statistic would be the number of unique downloaders rather than books downloaded. I hypothesize that a sizable chunk of the downloads for the WeBF are being done by a relatively small number of people who are massively downloading the collection, especially the non-PG stuff. Jon Noring
participants (7)
-
Andrew Sly
-
Greg Newby
-
Jared Buck
-
joey
-
Jon Noring
-
Marcello Perathoner
-
rnmscott@netspace.net.au