Re: [gutvol-d] Annotations for students

17 Aug 2005

      Bowerbird wrote:
...
jon said:
...
...
 How is that? Annotations can be linked to the text using the markup
as "hooks" (e.g., using XPointer.) The more markup there is, the
more hooks to latch onto.
...
please show me -- and the original poster -- an implementation that
actually works, now.
Simply add an 'id' to any XHTML tag, and you can link to it using any
web application. Fragment identifiers are used in many millions of web
pages, if not billions.

The first step to an "annotation application" (which you are driving
at) is to have the underlying standards (the "hooks") worked out so as
to easily allow linking by the annotation application. This is already
worked out in the XML world (e.g., XPointer.) That's what I meant. In
LibraryCity we are now working on such annotation and related social
networking applications (e.g., blogs, wikis) to digital texts using
XPointer (which includes simple 'id' links.) Since the W3C standards
are open and universal, others can likewise build their own
application -- no need to invent a new hooking mechanism.
...
...
 Pointing to 'id' ("fragment identifiers") is the most robust and
can survive various types of document edits. In plain text systems,
where annotations have to hook to the content itself (rather than
markup which is separate from the content), it is more difficult to
prevent link breakage.
...
you are trying to make us believe that the text changes and the
markup doesn't.
Markup can change, but in the case of using 'id', the document
maintainers will be careful to keep 'id's undisturbed as much as
possible during document edits. This actually allows *major* changes
to documents and yet keep existing links unbroken. Can this be done
with plain text? Not as easily (it's not impossible, but requires some
sort of mapping system, or a knowledge of all the known
externally-generated links into the original text document.)

For example, an author may issue some work, and later revise it by
rewording paragraphs, adding new paragraphs, new chapters, etc. If
they are careful, they can assure integrity of existing links into the
updated Work by assuring the 'id's are properly preserved and placed
where they should be.

Here's an example:

======================================================================
[First Edition of a work]

   <p id="1234">First paragraph.</p>
   <p id="1235">Second paragraph.</p>

[Second Edition]

   <p id="1234">First paragraph with some minor edits.</p>
   <p id="4567">Inserted whole new paragraph.</p>
   <p id="1235">Second paragraph with some minor edits.</p>

======================================================================

If I have an external annotation which points to the content of
the second paragraph in the First Edition: id="1235", then in the
Second Edition, the link will remain unbroken even if a new
paragraph was inserted before it *and* the content in that paragraph
was revised but not enough to be topically different.

PSWG discussed the issues of interpublication linking for over a month
for the next generation Open eBook Publication Structure, where we
wanted to enable robust interpublication linking, annotation, etc.,
into OEBPS Publications. There's a *lot* of subtle and not-so-subtle
issues involved, some of which I've outlined in prior messages to
gutvol-d and TeBC a while back.

The original proposal to allow external annotation of digital texts
(like PG texts) may seem like a new idea to many PGers here, but it's
been something several of us have considered for quite a while (I was
thinking of it back in 2000 for Yomu.) It's not new to me. I've even
mentioned it here a few times, but not so explicitly (because we had
not yet publicly announced LibraryCity.)
...
what you've done, though, is merely specified that there is markup
which _cannot_ change (the "fragment identifiers"), so as to assure
link-permanence. if i were to specify content that can not change, i
can guarantee link-permanence as well.
Of course, one can come up with a scheme to link into plain text
documents by character counting, paragraph counting, or a number of
other methods (including Ted Nelson's Project Xanadu approach.) It is
entirely possible someone has even come up with an IETF/RFC or
something else covering this. Have you researched to see what others
have already proposed for such a standard?

(For Ted Nelson's Xanadu, refer to http://www.xanadu.com/ )
...
and in almost all cases, we're more likely to have text-invariance
than to have markup-invariance. (but this is beside the point,
since it's easy enough to specify invariance of text and markup.
it is also very easy to show link breakage in cases of variance.)
Ah, but with XHTML 'id' (and now XML xmlid), it is possible to do
significant text amendment and preserve existing links based on 'id'.
Of course, as noted above link preservation can be achieved for plain
text emendments, but it appears to be much messier, especially at the
authoring level. But then someone smart and motivated (like Bowerbird)
may come up with a clever way to make this work for plain text
documents.

Jon

Re: [gutvol-d] Annotations for students

Jon Noring