
Hi list,
I know that ftp.archive.org is often down for a few days at a time, but it has now been down for almost all of June. Is this permanent? Has FTP access via ftp.archive.org ended? I prefer it over ftp.ibiblio.org for PG files because it is significantly faster. If FTP access is no longer available, can anyone recommend a fast mirror that is updated frequently? I tried snowy.arsc.alaska.edu, but it wasn't as current as I would like. I'm planning to download several thousand zip files, so a fast mirror would be appreciated. I'm sure HTTP is faster, but I would prefer FTP if possible. Besides, http://www.gutenberg.org/dirs/ isn't really much faster than metalab.unc.edu, AKA ftp.ibiblio.org. Is there a chance that ftp.archive.org has moved to a different host or IP address?
I'm running ncftp for Windows, so I don't think it's a caching or DNS problem. I think I tried under Linux as well, with similar results. It tries for about a minute and then times out. I tried with and without passive mode, but it makes no difference since I can't connect at all. I am in California, US.
--
No virus found in this outgoing message. Checked by AVG Anti-Virus. Version: 7.1.394 / Virus Database: 268.9.4/375 - Release Date: 6/25/06

Hi Tony,
I am in California too, Southern California to be exact. I don't know why it's not working for you, because it works fine for me. Maybe your FTP program is not connecting correctly. I use wget (available for Windows, and a default on Linux) for my Gutenberg downloading needs.
I plan to get an external hard drive (preferably an Iomega drive) later, probably for my birthday next Thursday, which I can then use to store the Gutenberg etexts and save some disk space on my current drive. I would use rsync to do that (check the Mirroring FAQ on PG if you don't know what that is). Apparently it's much faster than wget or even FTP because it doesn't spend hours checking every single file for updates: it keeps a list of all files and downloads only the ones that need updating, which saves a couple of hours. Or at least that's what Aaron (Cannon) told me.
Jared
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/listinfo.cgi/gutvol-d

Hi. Yes, I'm vaguely familiar with rsync. The problem is that I don't want each and every file posted. I don't download html and 8-bit files for example. I only download the zipped plain text files. Also I don't want some religious works. Therefore rsync won't help me.
As for the external drive, that's not a bad idea, but I think I prefer DVD instead. Finally, http://www.archive.org/ is fine; it's just FTP that doesn't work. I tried on two different computers, so I don't think it's my settings. I also have wget but prefer ncftp, as it's a dedicated FTP client. I am near San Diego, CA.

> Hi. Yes, I'm vaguely familiar with rsync. The problem is that I don't want each and every file posted. I don't download html and 8-bit files for example. I only download the zipped plain text files. Also I don't want some religious works. Therefore rsync won't help me.
I'm sorry... what? With the right options, you can rsync exactly the files you wish, recursively or not, picking and choosing as you go. I mirror Gutenberg here with rsync, skipping the DVD files, .mp3 files, .rar files and a few others, getting only the useful copies of books. What part of rsync's usage is confusing you?
David A. Desrosiers
desrod@gnu-designs.com
http://gnu-designs.com

On Wed, Jun 28, 2006 at 07:10:43AM -0400, David A. Desrosiers wrote:
> > Hi. Yes, I'm vaguely familiar with rsync. The problem is that I don't want each and every file posted. I don't download html and 8-bit files for example. I only download the zipped plain text files. Also I don't want some religious works. Therefore rsync won't help me.
I have to echo what David said. Rather than chaining yourself to FTP, you should look more deeply at what rsync is capable of. If you need, I could probably help you define an rsync line that gets what you want and ONLY what you want (I myself already have one that pulls ONLY the zip files).

> I have to echo what David said. Rather than chaining yourself to FTP, you should look more deeply at what rsync is capable of. If you need, I could probably help you define an rsync line that gets what you want and ONLY what you want (I myself already have one that pulls ONLY the zip files).
Here's mine...
  rsync -avzprlHtPS --delete --exclude=[0-9]*.txt \
    --exclude=*.iso --exclude=*.rar --exclude=*.ISO \
    --exclude=*.mp3 --exclude=pgdvd* \
    ftp@ftp.ibiblio.org::gutenberg Gutenberg
This gives me ~34GiB of data... enough for me to use as a viable mirror.
David A. Desrosiers
desrod@gnu-designs.com
http://gnu-designs.com
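For a zip-only pull like the one joey describes, the exclude approach can be turned around into an include list. The following is only a sketch, not anyone's actual command from this thread: the function name pg_zip_only is made up, and the -8.zip and -h.zip patterns assume that the 8-bit and HTML editions carry those suffixes, which should be checked against the mirror's real file names. rsync applies filter rules in order and the first match wins, so the two excludes have to come before the general *.zip include.

```shell
#!/bin/sh
# Sketch: copy only plain-text zip files, using rsync filter rules.
# pg_zip_only SRC DEST [extra rsync options...]
# The function name and the -8/-h suffix patterns are assumptions.
pg_zip_only() {
    src=$1     # e.g. the rsync module named in the thread
    dest=$2    # local target directory
    shift 2
    # Rules are tested top to bottom; the first matching rule decides.
    rsync -rtv --prune-empty-dirs \
        --include='*/' \
        --exclude='*-8.zip' \
        --exclude='*-h.zip' \
        --include='*.zip' \
        --exclude='*' \
        "$@" "$src" "$dest"
}
```

A first run with --dry-run appended, for example `pg_zip_only ftp@ftp.ibiblio.org::gutenberg Gutenberg --dry-run`, lists what would be transferred without downloading anything. Leaving out particular works would need additional --exclude rules for their directories, placed ahead of the *.zip include.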

On Mon, Jun 26, 2006 at 11:47:17AM -0700, Tony Baechler wrote:
> Hi list,
> I know that ftp.archive.org is often down for a few days at a time, but it has now been down for almost all of June. Is this permanent? Has FTP access via ftp.archive.org ended? I prefer it over ftp.ibiblio.org for PG files because it is significantly faster. If FTP access is no longer available, can anyone recommend a fast mirror that is updated frequently? I tried snowy.arsc.alaska.edu, but it wasn't as current as I would like. I'm planning to download several thousand zip files, so a fast mirror would be appreciated. I'm sure HTTP is faster, but I would prefer FTP if possible. Besides, http://www.gutenberg.org/dirs/ isn't really much faster than metalab.unc.edu, AKA ftp.ibiblio.org. Is there a chance that ftp.archive.org has moved to a different host or IP address?
I'm surprised you can connect to ftp.archive.org. I can't. We stopped pushing the collection to them several weeks ago. They had a hardware failure, and were unresponsive.
Today, there are three master collections where new eBooks are pushed:
  http://www.gutenberg.org on iBiblio. See this for direct access to the raw files: ftp://ftp.ibiblio.org/pub/docs/books/gutenberg
  http://gutenberg.readingroo.ms same as ftp://readingroo.ms/gutenberg
  http://snowy.arsc.alaska.edu/gutenberg same as ftp://snowy.arsc.alaska.edu/mirrors/gutenberg
They all get new files immediately. The catalog at gutenberg.org is only updated daily, and of course mirrors have their own schedule. You can check "gutenberg.dcs" in the top-level mirror directory to see if they have updated in the past week (we update gutenberg.dcs Sunday mornings EST).
I hope this helps. My guess is the readingroo.ms server will give you the best throughput (though it will have some brief downtime, then possibly be heavily loaded during the world ebook fair, http://www.worldebookfair.com).
Are there any Debian whizzes on this list who might want to help look after the readingroo.ms server with me?
-- Greg
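The gutenberg.dcs check can be scripted. This is a rough sketch that assumes the file's modification time is what shows when a mirror last updated; the helper name is_fresh and the URL path in the usage comment are illustrative guesses, not something stated in this message.

```shell
#!/bin/sh
# Sketch: decide whether a downloaded gutenberg.dcs looks recent.
# is_fresh FILE [DAYS] succeeds if FILE was modified less than DAYS
# days ago (default 7). The helper name is an assumption.
is_fresh() {
    file=$1
    days=${2:-7}
    # find prints the path only when the age test passes;
    # a missing file prints nothing, so the check fails.
    [ -n "$(find "$file" -mtime "-$days" 2>/dev/null)" ]
}

# Hypothetical usage (the path under the mirror root is a guess):
#   wget -q -N ftp://readingroo.ms/gutenberg/gutenberg.dcs
#   if is_fresh gutenberg.dcs; then
#       echo "mirror updated within the past week"
#   else
#       echo "mirror looks stale"
#   fi
```

wget's -N (timestamping) option keeps the server's modification time on the local copy, which is what makes the local age test meaningful. Since the file is refreshed on Sunday mornings, a 7-day window can sit right at the edge, so a slightly larger value such as `is_fresh gutenberg.dcs 8` may be more forgiving.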

I just wanted to add my personal thanks! Thanks!!!
Give the world eBooks in 2006!!!
Michael S. Hart
Founder
Project Gutenberg
Blog at http://hart.pglaf.org

Hi. Thanks very much, the readingroo.ms server seems much faster. When I checked last, snowy.arsc.alaska.edu seemed to be a few hours behind the other master sites. I am no longer able to connect to ftp.archive.org; it just times out.
I am not a Debian expert, but I do run a Debian server and know a reasonable amount about it. What needs doing? I am not really a programmer, but I know how to install packages and set things up for the most part. If there is something that needs to be done, let me know and I'll see what I can do.

On Wed, Jun 28, 2006 at 12:52:27AM -0700, Tony Baechler wrote:
> Hi. Thanks very much, the readingroo.ms server seems much faster. When I checked last, snowy.arsc.alaska.edu seemed to be a few hours behind the other master sites.
This was a little mysterious... it turns out I have two independent copies on snowy.arsc.alaska.edu.
The one at ftp://snowy.arsc.alaska.edu/mirrors/gutenberg is not actually a mirror. It receives a live copy of files as they are posted to readingroo.ms and our main server at ibiblio.org (which runs gutenberg.org's server).
The one at http://snowy.arsc.alaska.edu/gutenberg is just a regular mirror that I pull back from ibiblio daily. That explains why it's not quite current.
I'm in the middle of setting up some additional mirrors, so this will probably continue to change a bit.
-- Greg
participants (6)
- David A. Desrosiers
- Greg Newby
- Jared Buck
- joey
- Michael Hart
- Tony Baechler