New canonical URL for etext directories

The new canonical URL:
http://www.gutenberg.org/files/12345
will redirect to:
http://www.gutenberg.org/dirs/1/2/3/4/12345/
Of course, this works for all ebooks that are stored in the new filesystem. Old ebooks that have not yet been moved to the new filesystem will NOT work.
Note: the redirect takes one extra round trip to the server, so the second URL is faster. If response time is important, use the second URL.
-- Marcello Perathoner webmaster@gutenberg.org
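
To skip the extra round trip, a client can build the second URL itself. The following is a minimal sketch, not the server's actual redirect rule: the function name `ebook_dir_url` is hypothetical, and the rule "every digit except the last becomes one directory level" is only inferred from the example URLs in this thread. Single-digit ebook numbers are not shown here, so the sketch rejects them rather than guess.

```python
def ebook_dir_url(number: int, base: str = "http://www.gutenberg.org") -> str:
    """Build the direct /dirs/ URL for an ebook stored in the new filesystem.

    Assumption: the layout is inferred from the thread's examples
    (12345 -> /dirs/1/2/3/4/12345/); it is not an official specification.
    """
    if number < 10:
        # The thread gives no example for single-digit ebooks, so do not guess.
        raise ValueError("layout for ebook numbers below 10 is not covered here")
    digits = str(number)
    # Every digit except the last becomes one directory level.
    prefix = "/".join(digits[:-1])
    return f"{base}/dirs/{prefix}/{digits}/"


if __name__ == "__main__":
    print(ebook_dir_url(12345))
```

Old ebooks that have not yet been moved to the new filesystem will not be found at the generated address, so a caller may still want to fall back to the /files/ URL.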

Marcello,
I see that you are also redirecting old-style files, e.g.
http://www.gutenberg.org/etext90/getty11h.htm
is redirected to
http://www.gutenberg.org/dirs/etext90/getty11h.htm
Would it not be better to redirect these to the catalog, e.g.
http://www.gutenberg.org/etext90/getty*
could redirect to
http://www.gutenberg.org/etext/4
That would ensure that anyone using old-style links would get directed to the latest version(s) of a work. I'm guessing that you could program the redirect from the old GUTINDEX file.
Steve
Marcello Perathoner wrote:
The new canonical url:
http://www.gutenberg.org/files/12345
will redirect to:
http://www.gutenberg.org/dirs/1/2/3/4/12345/
Of course, this works for all ebooks that are stored in the new filesystem. Old ebooks that have not yet been moved to the new filesystem will NOT work.
Note: the redirect takes one extra round trip to the server, so the second url is faster. If response time is important, use the second url.
-- Stephen Thomas, Senior Systems Analyst, Adelaide University Library ADELAIDE UNIVERSITY SA 5005 AUSTRALIA Tel: +61 8 8303 5190 Fax: +61 8 8303 4369 Email: stephen.thomas@adelaide.edu.au URL: http://staff.library.adelaide.edu.au/~sthomas/

Steve Thomas wrote:
I see that you are also redirecting old-style files, e.g.
http://www.gutenberg.org/etext90/getty11h.htm
is redirected to
http://www.gutenberg.org/dirs/etext90/getty11h.htm
Would it not be better to redirect these to the catalog, e.g.
http://www.gutenberg.org/etext90/getty*
could redirect to
http://www.gutenberg.org/etext/4
That would ensure that anyone using old-style links would get directed to the latest version(s) of a work.
Something like that is already in place experimentally: If a deep link to a file is posted on a web page, I redirect to the bibrec page instead. Some people think this is a bad idea, though. -- Marcello Perathoner webmaster@gutenberg.org
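
The deep-link redirect described above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the code running on the server: it assumes that "posted on a web page" is detected through the HTTP Referer header, that the bibrec page lives at /etext/<number> as in the catalog URL quoted above, and that the ebook number can be read from a new-filesystem path. The names `SITE_HOST`, `ebook_number_from_path` and `deep_link_redirect` are hypothetical.

```python
from typing import Optional
from urllib.parse import urlparse

SITE_HOST = "www.gutenberg.org"  # assumption: links from this host are not redirected


def ebook_number_from_path(path: str) -> Optional[int]:
    """Extract the ebook number from a path like /dirs/1/4/8/3/14837/..."""
    parts = [p for p in path.split("/") if p]
    if len(parts) < 3 or parts[0] != "dirs":
        return None
    for i in range(1, len(parts)):
        candidate = parts[i]
        is_number = candidate.isascii() and candidate.isdigit()
        # The components before the number must spell out all but its last digit.
        if is_number and len(candidate) >= 2 and parts[1:i] == list(candidate[:-1]):
            return int(candidate)
    return None


def deep_link_redirect(path: str, referer: Optional[str]) -> Optional[str]:
    """Return the bibrec URL to redirect to, or None to serve the file as usual."""
    if not referer:
        return None  # typed or bookmarked URL: no redirect
    host = urlparse(referer).hostname
    if host is None or host == SITE_HOST:
        return None  # link followed from within the site itself
    number = ebook_number_from_path(path)
    if number is None:
        return None  # old-style path: this sketch cannot map it to a bibrec
    return f"http://{SITE_HOST}/etext/{number}"


if __name__ == "__main__":
    print(deep_link_redirect("/dirs/1/4/8/3/14837/14837-h/14837-h.htm",
                             "http://example.com/links.html"))
```

Old-style paths such as /dirs/etext90/getty11h.htm return None here; mapping them to a catalog entry would need a lookup table such as the one Steve suggests building from GUTINDEX.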

It seems to me that we are redirecting too much: we are even redirecting gutenberg.org! I have tried ebook n. 14837, to avoid being redirected I typed
http://www.gutenberg.org/dirs/1/4/8/3/14837/
and it is OK; I click on 14837-h and it is still OK; I click on 14837-h.htm and instead of getting
http://www.gutenberg.org/dirs/1/4/8/3/14837/14837-h.htm
that is undoubtedly there, I am redirected to
http://www.gutenberg.org/catalog/world/file?file=1/4/8/3/14837/14837-h/14837...
that gives
I see no such file here! (1/4/8/3/14837/14837-h/14837-h.htm)
Typing
http://www.gutenberg.org/dirs/1/4/8/3/14837/14837-h.htm
sometimes I am redirected as above, sometimes I get:
Page Not Found
Sorry, but the page you tried to access can no longer be found under that url.
In November 2003, Project Gutenberg's Web pages moved from promo.net to our new host ibiblio.org. Not all of the content from promo.net was moved to ibiblio.org, and some of the content was reorganized.
Also, we are gradually updating all eBooks older than #10.000, and in the process, moving them to a new filing system.
Please use the site map to find what you are looking for. We apologize for the inconvenience. Thanks for visiting Project Gutenberg, and happy reading!
That does not make sense at all, and once I was even able to get The Real Thing, showing that it exists!
Carlo Traverso

Carlo Traverso wrote:
It seems to me that we are redirecting too much: we are even redirecting gutenberg.org! I have tried ebook n. 14837, to avoid being redirected I typed
http://www.gutenberg.org/dirs/1/4/8/3/14837/
and it is OK; I click on 14837-h and it is still OK; I click on 14837-h.htm and instead of getting http://www.gutenberg.org/dirs/1/4/8/3/14837/14837-h.htm that is undoubtedly there, I am redirected to
http://www.gutenberg.org/catalog/world/file?file=1/4/8/3/14837/14837-h/14837...
that gives
I see no such file here! (1/4/8/3/14837/14837-h/14837-h.htm)
That is the source of the problem! ibiblio is experiencing serious file server overload, to the point of complete failure. If the web server cannot get an answer from the file server in a reasonable amount of time, it calls the error page (even if the file _is_ on the file server). Tomorrow ibiblio will move the FTP directories to a new file server. That should fix the problems, at least for some time.
-- Marcello Perathoner webmaster@gutenberg.org

Carlo Traverso wrote:
and it is OK; I click on 14837-h and it is still OK; I click on 14837-h.htm and instead of getting http://www.gutenberg.org/dirs/1/4/8/3/14837/14837-h.htm that is undoubtedly there, I am redirected to
That URL is wrong, try this one: http://www.gutenberg.org/dirs/1/4/8/3/14837/14837-h/14837-h.htm Works for me. -- Marcello Perathoner webmaster@gutenberg.org
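
The corrected address follows a pattern: the HTML file sits inside a 14837-h subdirectory rather than directly in the ebook directory. A small sketch of that pattern, with the caveat that the helper name `ebook_html_url` is hypothetical and the layout is inferred only from the one working URL above (not every ebook necessarily has an HTML edition):

```python
def ebook_html_url(number: int, base: str = "http://www.gutenberg.org") -> str:
    """Build the URL of an ebook's HTML file in the new filesystem.

    Assumption: layout inferred from the working example in this thread,
    /dirs/1/4/8/3/14837/14837-h/14837-h.htm.
    """
    if number < 10:
        # No single-digit example appears in the thread, so do not guess.
        raise ValueError("layout for ebook numbers below 10 is not covered here")
    digits = str(number)
    # Directory levels come from every digit except the last.
    prefix = "/".join(digits[:-1])
    # The HTML edition lives in its own "<number>-h" subdirectory.
    return f"{base}/dirs/{prefix}/{digits}/{digits}-h/{digits}-h.htm"


if __name__ == "__main__":
    print(ebook_html_url(14837))
```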