
Funny, I would have thought Lee already knew the answers to these questions... Be that as it may, see below... Al
-----Original Message----- From: gutvol-d-bounces@lists.pglaf.org [mailto:gutvol-d-bounces@lists.pglaf.org] On Behalf Of Lee Passey Sent: Monday, January 23, 2012 1:49 PM To: 'Project Gutenberg Volunteer Discussion' Subject: Re: [gutvol-d] Producing epub ready HTML
On Mon, January 23, 2012 12:43 pm, Al Haines wrote:
I've seen Marcello's response to this, and to put it into plain English, a submission can be one, and only one, of the following:
See, I never would have gotten /this/ out of Mr. Perathoner statement.
- one (and only one) TEI file, plus illustrations. No other files are allowed.
This seems bizarre to me. What is the rationale for not allowing, let alone requiring, an impoverished text file when the other submission is TEI?
Because the TEI process creates text (UTF8, Latin1, ASCII) and HTML files from the TEI file. Ditto with an RST submission. Isn't that what a master file is supposed to do? If a supposed master file also requires an accompanying text file, what's the point of the master file? More below...
[snip]
- one (X)HTML file, plus illustrations, plus from one to three text files (UTF8, Latin1, ASCII), depending on the submission's requirements.
The language in the FAQ that one may submit a HTML version without a plain ASCII version (#H.3._Can_I_submit_a_HTML_version_without_a_plain_ASCII_versi on.3F) needs to be updated with this new requirement.
The requirement for a text file isn't new--it's been around at least since I started producing etexts, eight years ago. Much of PG's assorted FAQs might be candidates for update, but the bulk of them have stood the test of time. The questions are, who's going to update them, how much argument from the cheap seats is going to ensue, and since two of the three main complainers in this forum don't submit anything anyway, why should any updater put up with those complaints? More below...
(This kind of HTML+text submission *may* also include such binary formats as doc, rtf, and pdf, but since they have to be handled manually, and PDF files are difficult, if not impossible, to correct, they're not encouraged.)
How in the world can these kinds of documents be included in HTML? Or do you mean you can package them up in a zip file along side the HTML?
They aren't "included in the HTML", whatever that means. A submitted project is usually a zip file containing the submitter-prepared files (text, HTML, images), and the WWers process those into a zip file containing an etext-numbered set of files, e.g. 12345.txt (ASCII), 12345-8.txt (Latin1), 12345-0.txt (UTF8), 12345-h.htm (HTML), plus images. Not all the text files are produced for a given project, and if a submission doesn't include an HTML file, there won't be one for that project, until Marcello's software auto-generates one. If you follow the "Other Files..." link on an ebook's download page, you'll see the contents of the zip file uploaded by the WWers. (Note that files still in the old etextnn folders won't have this link.)
_______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/mailman/listinfo/gutvol-d