
On Thu, Jul 27, 2006 at 03:15:49PM +0200, Marcello Perathoner wrote:
According to worldebookfair.com they serve 1 million ebooks / day.
gutenberg.org serves 60,000 ebooks / day.
According to Alexa¹ worldebookfair.com gets less traffic than gutenberg.org and still they manage to serve roughly 16 times as many ebooks. I wonder how they do that?
My first guess is that since Alexa is based on sampling, their estimate is incorrect. I've watched traffic from wef since it started, and we've been pushing anywhere from 20Mbps to as high as 100Mbps (with typical daily peaks of 40-60Mbps). That's a lot of data. The last time I heard, UNC (where iBiblio is based) has 600Mbps total capacity, and about 1/3 of that (200Mbps) is allocated to iBiblio, where gutenberg.org lives. Those numbers might have increased in the last few years, however.

On the other hand, maybe I'm counting wrong. I'll be looking at the 7GB access_log (currently) in detail once the WEF is over, and maybe Marcello can help so we can compare apples to apples. I have tried to only include successful/completed downloads, and also to only include eBooks (not stuff like front page images and the catalog page), but the count is based on a simple "grep" so could be off.

One other factoid: We are using iptables to limit the number of simultaneous connections from a single IP address. (This might make for some unhappy proxy users, unfortunately.)

The download total as of right now is just over 19 million.

-- Greg
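For comparing counts on both servers, the grep-style filter described above could be approximated along the following lines. This is only a sketch under stated assumptions: it assumes the access_log is in Apache common/combined format, treats a GET with status 200 as a successful download (which does not prove the transfer completed), and uses a hypothetical list of file extensions to separate eBooks from images and catalog pages. None of these details are confirmed for the WEF server.

```python
import re

# Assumption: Apache common/combined log format, where the request line is
# the first quoted field and is followed by the status code and byte count.
LOG_LINE = re.compile(
    r'"(?P<method>[A-Z]+) (?P<path>\S+) [^"]*" (?P<status>\d{3}) (?P<size>\d+|-)'
)

# Hypothetical list of extensions counted as eBook files; adjust to the
# actual layout of the server being measured.
EBOOK_EXTENSIONS = (".txt", ".zip", ".pdf")


def is_ebook_download(line):
    """Return True if the log line looks like a successful eBook download."""
    match = LOG_LINE.search(line)
    if match is None:
        return False
    # Only full, successful GET responses are counted (no 206 partials,
    # no 304 cache hits, no errors).
    if match.group("method") != "GET" or match.group("status") != "200":
        return False
    # Drop any query string before checking the file extension.
    path = match.group("path").split("?", 1)[0].lower()
    return path.endswith(EBOOK_EXTENSIONS)


def count_ebook_downloads(lines):
    """Count the lines of an access log that qualify as eBook downloads."""
    return sum(1 for line in lines if is_ebook_download(line))
```

Passing an open file works because the function only iterates over lines, for example count_ebook_downloads(open("access_log", errors="replace")). Running the same filter on both logs would at least make the two totals comparable, even if the definition of "download" remains debatable.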
¹) http://www.gutenberg.org/internal/stats/alexa user: internal pass: books
On the plus side gutenberg.org gets some traffic from worldebookfair.com. This is where people came from in July:
Listing the top 30 referring sites by the number of requests, sorted by the number of requests.
  reqs   %reqs  site
236070  19.04%  http://www.google.com/
125094  10.09%  http://en.wikipedia.org/
107354   8.66%  http://worldebookfair.com/
 57974   4.68%  http://search.yahoo.com/
 31210   2.52%  http://www.google.co.uk/
 25132   2.03%  http://www.promo.net/
 18850   1.52%  http://www.google.co.in/
 17347   1.40%  http://www.google.ca/
 16011   1.29%  http://www.google.de/
 15762   1.27%  http://www.stumbleupon.com/
 13664   1.10%  http://profile.myspace.com/
 13238   1.07%  http://www.google.com.au/
 12807   1.03%  http://my.yahoo.com/
 12650   1.02%  http://www.google.fr/
 12649   1.02%  http://64.233.179.104/
 11854   0.96%  http://www.digg.com/
 11694   0.94%  http://digg.com/
  9228   0.74%  http://www.google.com.ph/
  8621   0.70%  http://search.msn.com/
  7801   0.63%  http://www.ovelho.com/
  7671   0.62%  http://www.worldebookfair.com/
  6568   0.53%  http://66.249.93.104/
  6487   0.52%  http://oldfashionededucation.com/
  6475   0.52%  http://www.google.es/
  6023   0.49%  http://www.google.it/
  5894   0.48%  http://www.google.pl/
  5854   0.47%  http://librivox.org/
  5824   0.47%  http://www.google.com.br/
  5751   0.46%  http://www.google.nl/
  5106   0.41%  http://luminis1.wright.edu/
413228  33.33%  [not listed: 20,347 sites]
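Some sites appear twice in the listing, once with and once without the "www." prefix (worldebookfair.com and digg.com). A minimal sketch of folding those rows together, using only figures from the listing; the helper names canonical_host and merge_by_host are made up for illustration and are not part of the stats tool that produced the report:

```python
from collections import Counter
from urllib.parse import urlsplit

# Request counts copied from the referrer listing for the sites that
# appear under two host names.
ROWS = [
    (107354, "http://worldebookfair.com/"),
    (7671, "http://www.worldebookfair.com/"),
    (11854, "http://www.digg.com/"),
    (11694, "http://digg.com/"),
]


def canonical_host(url):
    """Return the host name of a URL with a leading 'www.' removed."""
    host = urlsplit(url).hostname or ""
    return host[4:] if host.startswith("www.") else host


def merge_by_host(rows):
    """Sum request counts over rows that share a canonical host name."""
    totals = Counter()
    for reqs, url in rows:
        totals[canonical_host(url)] += reqs
    return totals


merged = merge_by_host(ROWS)
print(merged["worldebookfair.com"])  # 107354 + 7671 = 115025
print(merged["digg.com"])            # 11854 + 11694 = 23548
```

With both host names combined, worldebookfair.com accounts for 115,025 referred requests (8.66% + 0.62% = 9.28% by the listed percentages), which still leaves it in third place behind google.com and en.wikipedia.org.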
--
Marcello Perathoner
webmaster@gutenberg.org
_______________________________________________
gutvol-d mailing list
gutvol-d@lists.pglaf.org
http://lists.pglaf.org/listinfo.cgi/gutvol-d