
I agree as well that the original digitalization should be as lossless as possible. The various outputs of this original can then be as lossy as anyone cares to make them. If the original is lossy, then the information is lost 'forever.' Lossy originals reduce the choices of readers and forces a convention upon them of what is and is not important. The readers of the books should decide what they want/need. Carel -------- Original Message -------- Subject: [gutvol-d] Re: !@!!@!!@!Re: Re: so what is so important about pagination? From: "Keith J. Schultz" <schultzk@uni-trier.de> Date: Wed, February 24, 2010 3:23 am To: Project Gutenberg Volunteer Discussion <gutvol-d@lists.pglaf.org> Hi All, Been following this thread loosely. But the fact remains that the problem is the encoding of the digital text!!! NOT, the output. I have said it time and time again, the the encoding has to encode the information of the original scan or p-text. Thats is a rendering of such an encoding will produce a fairly close copy of the original. (I use rendering here as a way of producing the text; either in text or display) For the output you simply use those parts of the encoding for your output that you need. The page number problem can be handled because out of convention they are placed in the header or footer of a page. So if you do not want the original pagenumbers then just skip them. If you want the original page breaks use them and output it where you want it. I agree with BB, but we have different ways about doing it.