re: [gutvol-d] entire Gutenberg ebook collection on a flash drive

aaron said:
Not really. A lot more could be on there if we had added ZIP files instead of uncompressed text, but that wasn't really the point.
and the point -- as it was -- was quite phenomenal. although the website says differently, if i remember, i believe i counted some 3,000 e-texts on that c.d. now, since plain-text compresses greatly when zipped, i would love to see a c.d. full of zipped text-files as well. i'd especially love it if they were all in z.m.l. format... ;+)
You're probably right and I imagine that that will be the case eventually.
cool! -bowerbird
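The remark that plain text compresses greatly when zipped is easy to check. Below is a minimal sketch, assuming Python's standard `zipfile` module and a made-up sample string (the `zipped_size` helper and the sample are hypothetical, not part of any Project Gutenberg tooling). The sample is far more repetitive than a real e-text, so it overstates the savings; actual ratios vary from book to book.

```python
import io
import zipfile


def zipped_size(text: str) -> int:
    """Return the size in bytes of an in-memory ZIP archive holding `text`."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        # A single member file, named arbitrarily for this illustration.
        archive.writestr("sample.txt", text)
    return len(buffer.getvalue())


# Hypothetical stand-in for an e-text; real books repeat far less than this.
sample = "It was the best of times, it was the worst of times. " * 2000

raw_size = len(sample.encode("utf-8"))
packed_size = zipped_size(sample)
print(f"raw: {raw_size} bytes, zipped: {packed_size} bytes")
```

Running the same function over a real plain-text file would give a more honest estimate of how many extra titles a zipped disc could hold.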

On Tue, Sep 13, 2005 at 01:20:38AM -0400, Bowerbird@aol.com wrote:
aaron said:
Not really. A lot more could be on there if we had added ZIP files instead of uncompressed text, but that wasn't really the point.
and the point -- as it was -- was quite phenomenal. although the website says differently, if i remember, i believe i counted some 3,000 e-texts on that c.d.
#10802 is a DVD, all zipped, with about 9400 eBooks. From December 2004.

#11802 is a CD, not zipped, with "best of" selections (lot of HTML, too). About 960 eBooks, from August 2003.

There's a new DVD image, but it needs some tweaking still: 5100 eBooks, unzipped, incorporating the Best Of (with updates) plus most of the rest of our HTML, plus lots more favorites. This is not ready for widespread distribution (it mistakenly has some copyrighted stuff on it), but you can grab it here:

ftp://snowy.arsc.alaska.edu/pub/gbn/pgjun05.iso

One goal of the new ISO builder software is for people to save and "publish" their favorite collections. Another is to easily facilitate "updates" to a particular collection.

BTW, the tool is telling me that all zip/txt files add up to 116% of a DVD (about 5.24GB). That will fit on a dual-layer DVD, but won't quite fit on a single-layer.

I'll be working with Craig to speed things up during the selection process (there are some MySQL optimizations he didn't use *). Ultimately, to build the ISO, *all* of the files need to be read from disk, then written to the new ISO file. So, for a large ISO file, this can take a while (i.e., over 20 minutes for a regular DVD image).

We'll be adding some common sense instructions (such as, don't just add every eBook -- you're better off simply using rsync to download what you need), tutorials, examples, etc. before this goes production. I'm planning to demo it as part of the StorCloud competition at the SuperComputing conference (Seattle, November 12-18).

-- Greg

* Everyone who wants to criticize use of MySQL, bite your tongue, get the source from Craig, and do your own implementation. You can even set it up on snowy!
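The single-layer versus dual-layer point above comes down to simple arithmetic. The following is a rough sketch, not the ISO builder's own logic: it assumes the nominal decimal capacities of 4.7 GB and 8.5 GB for single- and dual-layer DVDs and reads "about 5.24GB" as decimal gigabytes. The thread does not say what capacity figure the tool uses, and the function names here are hypothetical.

```python
# Nominal capacities in decimal bytes. Assumption: the thread does not state
# the capacity the ISO builder tool itself works with.
SINGLE_LAYER_DVD = 4_700_000_000
DUAL_LAYER_DVD = 8_500_000_000


def percent_of_disc(total_bytes: int, capacity_bytes: int) -> float:
    """Return the collection size as a percentage of the disc capacity."""
    return 100.0 * total_bytes / capacity_bytes


def smallest_disc(total_bytes: int) -> str:
    """Name the smallest DVD type that can hold the collection."""
    if total_bytes <= SINGLE_LAYER_DVD:
        return "single-layer DVD"
    if total_bytes <= DUAL_LAYER_DVD:
        return "dual-layer DVD"
    return "does not fit on one DVD"


# "about 5.24GB" from the message, read here as decimal gigabytes.
collection = 5_240_000_000

print(smallest_disc(collection))
print(round(percent_of_disc(collection, SINGLE_LAYER_DVD), 1))
```

With the nominal 4.7 GB figure the percentage comes out somewhat below the 116% the tool reports, which suggests the tool counts usable capacity differently; the conclusion (too big for single-layer, fine for dual-layer) is the same either way.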

At 12:47 AM 9/13/2005, you wrote:
#10802 is a DVD, all zipped, with about 9400 eBooks. From December 2004. #11802 is a CD, not zipped, with "best of" selections (lot of HTML, too). about 960 eBooks from August 2003
Actually, the total title count on that one is just above 600, but close enough.
BTW, the tool is telling me that all zip/txt files add up to 116% of a DVD (about 5.24GB). That will fit on a dual layer DVD, but won't quite fit on a single-layer.
If you remove the human genome files, it will. I did it yesterday, but ran into difficulties on my end before I was able to save it; not the system's fault.
I'll be working with Craig to speed things up during the selection process (there are some MySQL optimizations he didn't use *). Ultimately, to build the ISO, *all* of the files need to be read from disk, then written to the new ISO file. So, for a large ISO file, this can take a while (i.e., over 20 minutes for a regular DVD image).
Not to mention if a lot of people are using it at the same time. :) Anyway, I know it's a lot faster than it was. Craig has done an awesome job!
We'll be adding some common sense instructions (such as, don't just add every eBook -- you're better off simply using rsync to download what you need), tutorials, examples, etc. before this goes production. I'm planning to demo it as part of the StorCloud competition at the SuperComputing conference (Seattle November 12-18). -- Greg
I'll hurry up on the documentation then. We're just making some minor edits. You can use what you like and throw out the rest. It will be in an FAQ type format.

Sincerely,
Aaron Cannon

--
E-mail: cannona@fireantproductions.com
Skype: cannona
MSN Messenger: cannona@hotmail.com (Do not send E-mail to the hotmail address.)
participants (3): Aaron Cannon, Bowerbird@aol.com, Greg Newby