
On Sun, 10 Oct 2004, Karl Eichwalder wrote:
Tony Baechler <tb@baechler.net> writes:
However, it does not create clean ASCII files. Every page I convert has tww blank spaces at the beginning of every line and it inserts junk to mark links and image placeholders.
I appreciate your feedback very much! I guess with a little bit post-processing we can improve the output. Or we should use 'w3m' for creating txt files.
One thing I really like about the current PG model is that I can quickl go to the ftp site, grab a file, unzip it and have readable plain text.
Yes, I don't want you to produce txt files on your own. We should change the way how we create txt files. Doing txt files by hand is too slow. Often it is necessary to improve a text (typos, missing part, random garbage); if you have to apply the same correction to various files manually you must spend more time than necessary and such a procedure is error prone by itself.
When I was faced with these problems, I just wrote macros for my word processor to take out leading and trailing spaces. If there were sections of poetry or songs that looked better indented, then I just changed the spaces in those to @'s and then did a global search and replace [after first searching for @'s already there]. These steps all combined take less time than I spent writing this. Michael Hart