re: [gutvol-d] the end of the line

jon said:
Well, since I sort of initiated this sub-thread
um, you mean since you _hijacked_ the thread... just had to talk about your shiny markup, didn't you? what a debilitating distraction... the need to retain line-breaks has nothing to do with markup. (and your example, which shows the absurd lengths to which a markup mentality will drive a person, was very illuminating, as is all the technoid jargon-jabbering in this "sub-thread".) p.g. introduces its own linebreaks into its plain-ascii e-texts, all without ever entering the markup arena. and i put in my own line-breaks, right here in these posts to this listserve, again without using any markup at all, just the return key. i'll bring this thread back to relevance starting tomorrow... -bowerbird

Bowerbird wrote:
p.g. introduces its own linebreaks into its plain-ascii e-texts, all without ever entering the markup arena.
and i put in my own line-breaks, right here in these posts to this listserve, again without using any markup at all, just the return key.
i'll bring this thread back to relevance starting tomorrow...
Yes, if one doesn't care about internal word breaks in the original, then the markup approach is *equivalent* to the plain text break approach. Using your example above, let's suppose we want to preserve internal word breaks, then we might have (ignore starting spaces, simply used to shift the left margin inward so it's easier to see in this message): and i put in my own line-breaks, right here in these posts to this list- serve, again without using any markup at all, just the return key. So the question is, is the word which is broken "listserve" or is it "list-serve"? Does it matter? Yes, for word searching purposes, and a few other purposes. I surmise you don't think that preserving the actual internal word break in the original is important, just shift the break to the nearest intra-word break. Well, fine, but some people might want to have the information preserved. The markup approach I presented gives the optional capability to mark this up. (I'm evaluating Marcello's feedback, so there are different markup approaches that may be taken.) Btw, if we don't care about internal word breaks, and place the break at the nearest intra-word break, your original example in markup becomes (using the or: namespace): <p>and i put in my<or:lb/> own line-breaks,<or:lb/> right here in these<or:lb/> posts to this listserve,<or:lb/> again without using<or:lb/> any markup at all,<or:lb/> just the return key.</p> If the above is rendered in a web browser, and the end-user does not care about where the line breaks occur and takes no action, the web browser ignores the <or:lb/> tags, and the text is displayed nicely to fit the browser window parameters. But an ebook reading system, as well as simple CSS, can be used to "activate" the <or:lb/> at user demand. Or if there's a conversion script of the markup to plain text, such as to ZML, then we know where the breaks are. (One advantage to using <or:lb/> rather than <br/> is that browsers will ignore the <or:lb/> tag by default -- it would take CSS or a special user agent to activate them on demand.) Another advantage with using <or:lb/> is that the markup document is not restricted to exact plain text formatting. This allows a lot of latitude for document authors to do what they want in their text editor editing the XML document. For example, the above markup could be expressed in the document as: <p>and i put in my<or:lb/> own line-breaks,<or:lb/> right here in these<or:lb/> posts to this listserve,<or:lb/> again without using<or:lb/> any markup at all,<or:lb/> just the return key.</p> Or as: <p>and i put in my<or:lb/> own line-breaks,<or:lb/> right here in these<or:lb/> posts to this listserve,<or:lb/> again without using<or:lb/> any markup at all,<or:lb/> just the return key.</p> Same thing... XML parsing user agents normalize all three to the same thing. But in plain text, if someone happens to edit your text, such as to and i put in my own line-breaks, right here in these posts to this listserve, again without using any markup at all, just the return key. The line breaks are changed and the original line breaks lost forever. What if someone takes a PG text formatted in ZML, and didn't understand it was ZML (see note below), did some line length reformatting, and then redistributed that -- especially if it's Bowerbird poetry? Jon Noring (Note: How would the user know the plain text they are working with is ZML? And how would they know in a particular instance that text line breaks *are* important? Is there going to be machine-readable metadata to say that the document is ZML or that text breaks are important? I recommended that a plain text document which conforms to ZML should have some message or processing-instruction-like thing at the beginning saying it is ZML, and which version, and possibly that line breaks are important in this particular document and why. That's the purpose for <?xml version="1.0"?> at the beginning of an XML document. It identifies it and even assists with determination of the text encoding. Will ZML require UTF-8 or UTF-16? Or will it stick to ASCII? Or will it allow ISO 8859-1? Or will it allow all of them? Will it allow any text encoding? How would a user agent know the text encoding of the ZML document, especially without having to process the whole thing?)

Bowerbird@aol.com wrote:
p.g. introduces its own linebreaks into its plain-ascii e-texts, all without ever entering the markup arena.
and i put in my own line-breaks, right here in these posts to this listserve, again without using any markup at all, just the return key.
Line-breaks are mark-up. They don't add anything whatsoever to the text itself and are completely arbitrarily decided, usually based on the technology that is used to display the actual content. You can deny the difference between structure, content and presentation all you want, but it is perfectly possible to reformat a book using columns instead of lines without changing the actual content. And where will your precious line-breaks go in that case? Greetings, Walter

Hi All, Am 26.06.2006 um 11:12 schrieb Walter van Holst:
Bowerbird@aol.com wrote:
p.g. introduces its own linebreaks into its plain-ascii e-texts, all without ever entering the markup arena.
and i put in my own line-breaks, right here in these posts to this listserve, again without using any markup at all, just the return key.
Line-breaks are mark-up. They don't add anything whatsoever to the text itself and are completely arbitrarily decided, usually based on the technology that is used to display the actual content. You can deny the difference between structure, content and presentation all you want, but it is perfectly possible to reformat a book using columns instead of lines without changing the actual content. And where will your precious line-breaks go in that case?
In normal prose line breaks generally do not effect the actual content, but in poetry in may be very meaningful. Especially in works where the form of the text is important !! But do not take my word for it. Keith.

Schultz Keith J. wrote:
In normal prose line breaks generally do not effect the actual content, but in poetry in may be very meaningful. Especially in works where the form of the text is important !!
But do not take my word for it.
I will take your word for it. In some poetry even the typeface is part of the poem. Think about Paul van Ostaijen's Boem Paukenslag! http://users.pandora.be/gaston.d.haese/paukenslag.html Nonetheless, I wouldn't dare to call poetry each and every e-mail I wrote. Regards, Walter

Walter van Holst wrote:
Bowerbird@aol.com wrote:
p.g. introduces its own linebreaks into its plain-ascii e-texts, all without ever entering the markup arena.
and i put in my own line-breaks, right here in these posts to this listserve, again without using any markup at all, just the return key.
Line-breaks are mark-up. They don't add anything whatsoever to the text itself and are completely arbitrarily decided, usually based on the technology that is used to display the actual content. You can deny the difference between structure, content and presentation all you want, but it is perfectly possible to reformat a book using columns instead of lines without changing the actual content. And where will your precious line-breaks go in that case?
Yes, line-breaks (CR/LF, etc.) are markup. They are text characters used to communicate something besides the content. Paper books don't need to include these characters (they'd be invisible anyway), thus they are characters not part of the content, i.e. markup. Also, using * and _ for highlighting purposes is also markup. Of course, what Bowerbird means by markup is formalized and comprehensive text markup systems such as TeX, SGML/XML, troff, etc., but then his ZML system is another markup system that has kept markup characters to a minimum. This brings up an interesting observation in that using line-breaks in plain text has variable importance, from mildly important (arbitrarily used in paragraphs simply to trim line lengths to something manageable), to quite important (preserving poetry lines, and as Bowerbird would attest, everything he writes even if when prose-like in meaning.) The problem is knowing the relative importance of line-breaks in a plain text document, especially if one does not understand the language to ascertain context. ZML tries to tackle this issue, and I think somewhat succeeds, albeit at a loss of richness like the Model T Ford and black paint. And as noted previously, *how* does one know a particular plain text is ZML and thus falls under strict and unambiguous plain text formatting rules? Jon Noring
participants (4)
-
Bowerbird@aol.com
-
Jon Noring
-
Schultz Keith J.
-
Walter van Holst