>PG has no control over the format submitted.

Nonsense.  I have tried submitting “other things” and have been told repeatedly that the “minimum requirements in practice of PG” is that a TXT and an HTML file be submitted, and that these two files pass through a large number of fitness tests required by PG, which in practice includes restrictions on the choice of char sets used in the internal rep of the TXT and of the HTML file.  So, in fact PG DOES have control over the format submitted, and the way PG asserts that control is by refusing to accept submission of formats and details of those formats that they choose not to support.

As a simple counter-example of the above claim “PG has no control over the format submitted” note that personally I would much rather be submitting TXT files which do not correspond to the PG requirements of including a gratuitous line wrap every 72 chars.  Or if I am required to submit TXT files with line wraps I would much prefer to retain the line wraps of the original text, because it is a royal pain for some future volunteer to have to “fix” the position of line wraps back to the original text in order to do additional processing of the text file in the future, for example because they want to find and include additional semantic information that can be found in the original page scans, but not in the TXT.  And in practice it is impossible to do this visual analysis unless one matches line breaks to the original page scans – as DP well knows.

 

Another example from a couple years ago is I asked PG how I could submit MOBI formatted texts of books they already had in other formats.  I was told that I was not allowed to do so.  So I set up an independent website to distribute PG books in MOBI format to my friends in the MOBI community -- retaining the PG licenses and legalese conditions.  Now, as hoped for, some years later PG has decided to support MOBI after all – at least to some extent. But: what a pain!

 

Why is this important to me?  Well, I happen to like classes of reader machines that the internal mechanizers of PG do not like.  PG likes big teletype like display machines, capable of displaying more than 72 chars per line.  [Your standard PC or Mac still remains fundamentally a teletype emulator] And PG likes tiny machines with extremely limited displays, also known as cell phones.  I personally do not like either of those classes of machines, but rather machines that are middle sized – small enough that I can pick them up and easily read them while lying in bed late at night for example.  But large enough that I can understand in context the ebb-and-flow of what the author wrote in some surrounding context. With these middle-sized machines issues of text reflow become a central issue in the pleasure (or lack thereof) of being able to use the machine.  And yes there are quite a number of tools one can use to help “fix” at least partially “broken” texts re these machines, including Calibre and say Mobipocket Creator.  But I’d rather not have to “fix” a text each time before I can read it.  And I’d rather it not be  “broken” in the first place.

 

So, in summary, as a “volunteer” am I free to do what I want?  Yes, certainly – but not if I want any of my efforts to ever show up on any PG website! As Bowerbird is only too happy to point out: “Please feel free to go somewhere else!”