
Thanks, Greg (and, by extension, Jim). So, to clarify: One of the ways is to upload the page images with the new e-book. I'm assuming I just include another folder, next to the images folder and the text and HTML files I submit, called “page-images”, including all the page scans. I can obviously not call it “FILENUMBER-page-images”, since at that point I don't know the e-book number yet. Is that correct? I'll try with my next submission. What if I haven't submitted the page images with a book and want to go about adding them later? How do I send them to the WWers? Do I upload them somewhere? If so, where? Shall I use the ordinary e-book submission form again and just upload the page images without the book, with a note which e-book number they are for? That seems rather inconvenient. (Not inconvenient for _me_, but for the _WWers_. _I_ wouldn't mind.) Or is it still done by FTP? If so, how do I (or anyone else) get an account? And exact instructions on where to put the images and how to notify the WWers? What do I do if there are lots of pages and/or lots of illustrations and the file gets a little larger? The e-book submission web form has an upload limit. I don't know how large it is, but I know some of us have run up against it even for ordinary illustrated e-book submissions without page images. Is there an alternative way in that case? Jana, really happy that she'll hopefully get to store her images somewhere other than on her own machine soon! (I have some that are _not_ in the Internet Archive or Google Books, or at least not as complete sets, so at least those would be valuable to have) On Feb 13, 2012, at 11:04, Greg Newby wrote:
On Mon, Feb 13, 2012 at 09:43:36AM +0100, Jana Srna wrote:
So ? can I get a reply on this from someone? Greg? One of the WWers?
Greg says he's ?disappointed? we aren't providing page scans for each title. He says it's ?part of the established WWer workflow? to post them, either as part of a new book, or separately.
Can someone please explain that ?established workflow?? How exactly does one, as the contributor of a book, either new or already posted, go about submitting page scans for it?
Sorry for not sending this earlier, but I checked with the WWers first, and then neglected to follow up on -d. Details below.
I don't know whether these guidelines are available elsewhere, but I did confirm with the WWers they are able & ready to accept page images. One volunteer was very active at getting these from DP, but that ended a few years ago.
Note that we do NOT need page images only as a zip. They are OK unzipped. (The PIZ extensions are because our old mailing list manager refused messages with the string 'zip')
From: "Jim Tinsley" <jtinsley@pobox.com> To: "Posted Etexts for Project Gutenberg" <posted@listserv.unc.edu> Subject: [posted] Posted (#12973, Butler) ! Date: Tue, 20 Jul 2004 20:24:32 -0700 (PDT)
Personal Recollections of Pardee Butler, by Pardee Butler 12973 [Editor: Mrs. Rosetta B. Hastings] [Contributor: Mrs. Rosetta B. Hastings] [Contributor: Elder John Boggs] [Contributor: Elder J. B. McCleery] [Link: http://www.gutenberg.net/1/2/9/7/12973 ] [Files: 12973.txt; 12973-h.htm; 12973-page-images]
Thanks to Roger for finding and scanning this book.
This is the first PG book to be posted with page images. We are now beginning to accept page images along with the regular postings. Of course, DP has always preserved its page images, and those will eventually be uploaded in a big batch, or series of batches, but non-DP contributions may now begin adding page images.
For now, we're setting the following guidelines for page image postings:
1. PG is now accepting page images of books posted. Page images will be posted _only_ as an addition to an etext posted in the normal way -- we will not post page images without plain text.
2. Page images are an option; they are not and will not be required for the posting of a text.
3. All page images should be good enough to work reasonably well with OCR packages, up to 600 dpi, and should be stored as black-and-white TIFFs with CCITT-4 (aka ITU-G4 or Fax Group 4) compression. This is important, so that we keep the overall file size down to a sustainable level. With this compression, a typical 600dpi page can be stored for about 40KB. Our ability to post these images depends on the file sizes staying fairly reasonable. Pages such as color pictures or greyscale photos that cannot reasonably be stored as black-and-white only should be stored as TIFF or JPEG with the best compression you can get for that image.
(Note: Irfanview for Windows does this nicely individually or in batch. ImageMagick v 6.x: convert myimage.png -compress group4 myimage.tif )
4. Each page image should be a separate file and named with the page number within the set; e.g. 001.tif, 002.tif, etc. Separate, non-page images, such as covers or color images scanned separately from the pages, should have suitable names, such as "cover.jpg" or "072-image.tif" All page images for the book will be zipped into one file, to be called FILENUMBER-page-images, e.g. 12345-page-images.piz (reverse the extension) for etext #12345, and stored in the main directory for that etext. It will unzip to a subdirectory ./page-images, but we will not post separate page images in that directory, since that would double the space used, and we believe that people who want to consult the images will probably want them all. So, for now at least, if you want the images, you download the PIZ (backwards again) file.
jim