<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Metadata Based Annotation Infrastructure offers Flexibility and Extensibility for Collaborative Applications and Beyond</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Marja-Riitta Koivunen</string-name>
          <email>marja@w3.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ralph Swick</string-name>
          <email>swick@w3.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>World Wide Web Consortium, MIT Laboratory for Computer Science</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this position paper, we describe three user scenarios that benefit from metadata based annotation infrastructure. We explain how a basic annotation schema can be extended to support new scenarios. We also describe and evaluate some other features and modifications that are useful when implementing these scenarios. The most laborious part in the scenarios is the design and implementation of new user interfaces; the metadata infrastructure itself easily supports the needs of the different applications and new schemas. 1 http://www.ietf.org/rfc/rfc2812.txt 2 http://www.microsoft.com/windows/netmeeting/ of possibilities that extend beyond basic annotation capabilities are opened. This paper describes a simple collaborative annotation scenario and then broadens the scope of the annotations in a couple of additional scenarios. We briefly explain the basic metadata infrastructure for annotations that is provided by our system, known as Annotea [1], and the features that are needed to support the additional scenarios.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Annotation infrastructure</kwd>
        <kwd>metadata</kwd>
        <kwd>collaboration scenarios</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>The group gathers lists of references on a shared Web page.
The lists include an estimation of the papers' relevance and
a preliminary categorization of the reference. As the
students read each paper they mark the paper as interesting
or uninteresting and refine the categorization. They use
annotations to mark or question unclear text, point out
interesting perspectives, add keywords and share other
general comments with each other.</p>
      <p>Later they dedicate one person to write more detailed
replies to selected research questions pointed out in the
annotations and write a short summary. This starts fruitful
discussions in the context of the reference document and the
new summaries. By using annotations to conduct their
commentary on their reading, the group avoids contention
for write access to a single shared document and potential
loss of data from conflicting updates.
2.2 Scenario: Using Annotations for Shared
Bookmarking
In the first stage of gathering references for their report on
whale communication, the group uses traditional Web
search tools to locate references on the Web. They create
'bookmark' annotations in their dedicated seminar
annotation server to those references that appear relevant.
When they create these bookmarks they also select a
category from a list of categories defined by a shared
ontology or, if no existing category is a good match, they
define new categories, adding each such category to a
special seminar ontology that is stored in their shared Web
space. The classification category is more metadata about
the bookmark annotation, one of a variety of such
extensions that the group can store with their metadata.
When a user goes to a bookmarked page she sees the
existing bookmarks as annotations. The user can also ask
for a list of bookmarks, in which case, a page is
dynamically created showing bookmarks under different
categories. The user may query all the bookmark
annotations on the annotation servers or filter the list to
show only certain bookmarks. The user may also ask for
just the bookmarks that belong to the concepts in a given
ontology.
2.3 Scenario: Using Annotations to Present Evaluation
Results
Kim is a teaching assistant in a collaborative seminar. He
wants to make sure that the students remember that the
readers of their documents may have different physical or
cognitive abilities in receiving and interacting with the
information. Kim uses the Web Accessibility Initiative3
guidelines and some automatic tools for assessing the
markup used within Web pages. These accessibility
assessment tools rely on EARL, a metadata language
expressing what is or may be wrong in a page, citing by
URI the specific guideline that describes the accessibility
issue.</p>
      <p>Kim stores the EARL analysis of each document in the
same annotation server that holds the seminar's other
annotations. Kim also adds to the server some inferencing
rules that represent a transformation from the EARL</p>
    </sec>
    <sec id="sec-2">
      <title>3 http://www.w3.org/WAI/</title>
      <p>vocabulary to the annotation vocabulary. The EARL
vocabulary is a superset of the annotation vocabulary, so
Kim includes some style rules that instruct presentation
clients in the rendering of the extra properties of the EARL
metadata.</p>
      <p>When students view their pages they see the EARL report
items as annotations on the pages as a result of processing
the inferencing rules. Now they can address the
accessibility issues in the pages and add additional metadata
to the annotations to note them as fixed or to request help
from Kim. When Kim helps the group, he sends a mail to
the mailing list explaining the problem and adds a link to
the EARL annotation so that others in the group can benefit
from the example.</p>
      <p>When the work is done the group can run the accessibility
evaluation tools again. The document author can choose to
delete the earlier report annotations at this time or she may
just mark them as obsolete. The group may also freeze a
copy of the evaluated page with the original annotations.
3 ANNOTEA METADATA INFRASTRUCTURE
The metadata infrastructure of the Annotea project makes it
easy to support the annotation scenarios presented above.
The Annotea infrastructure provides flexibility and an easy
framework to extend the annotation capabilities to other
applications. The basic infrastructure and the extensions
needed for the previous scenarios are discussed in the
following sections.
3.1 Basic Annotea Annotations
In the first scenario, the students annotate Web pages and
use reply threads as supported by the Annotea
infrastructure.</p>
      <p>
        Annotea sees annotations as metadata about a whole
document or a part of a document. This metadata is written
in RDF/XML [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], and can be stored in annotation servers
using the HTTP protocol. An annotation client queries
annotations related to a document from one or several
annotation servers and presents them in document context.
The Annotea annotation model uses multiple RDF schemas
e.g. Dublin Core4 (dc:) with the Annotation schema to
define the basic annotation properties (see Figure 1). The
annotates property refers to the annotated document, the
context property refers to the actual place of the annotation
within the document, the body property contains the content
of the annotation, the dc:title property is a descriptive
annotation title. The other properties further describe the
annotation.
      </p>
    </sec>
    <sec id="sec-3">
      <title>4 http://dublincore.org/</title>
      <p>“This is
great”</p>
      <sec id="sec-3-1">
        <title>Ralph</title>
        <p>rdf:type
dc:title
dc:creator</p>
        <p>3ACF6D754
created
dc:date
annotates
context
body
2000-01-10T17:20Z</p>
        <p>
          2000-01-10T17:20Z
With RDF it is also easy to add new properties to the
annotations. The DAML+OIL5 ontology construction
vocabulary [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] provides a framework for describing new
properties with precise semantics and placing those
semantics in the Web.
5 http://www.daml.org/2001/03/daml+oil-index
3.2 Extending the Annotation Schema for Reply
Threads
Annotea has a concept of a reply that relates to an
annotation or another reply. Replies can form discussion
threads that start from an annotation.
        </p>
        <p>The reply schema looks similar to an annotation schema. It
has two new properties, the reply-to property, which defines
which annotation or reply was the previous one in the
thread, and the root-of-thread, which is the first annotation
in the thread. The generic metadata-based design of our
annotation server made it easy to incorporate these
additional properties.
“I totally
agree”</p>
      </sec>
      <sec id="sec-3-2">
        <title>Jose</title>
        <p>Reply</p>
        <p>3ACF6D754
rdf:type root-of-thread
reply-to
dc:title
dc:creator</p>
        <p>2BCA7D661
annotation:
created
annotation:
context
annotation:
body
2000-01-10T17:20Z
3.3 Using Annotea for Shared Bookmark Annotations
Shared bookmarks can be easily seen as annotations of type
bookmark. In addition, they need a category property.
Again, no changes are needed to our annotation server. The
bookmark annotations can be presented as annotations on
the pages with a special icon to visually differentiate them.
For that an icon property can be added to the metadata.
Addition of new properties for annotation schemas
necessitates a user interface change so that the client can
present them. The presentation style for a property can be
described in the same metadata framework as properties of
properties. We expect to work on a schema for describing
presentation characteristics as part of future development.
Existing ontology construction applications provide user
interfaces for ontology definitions and these are well suited
to the definition of categories for classifying bookmarks.
The generic metadata approach to describing bookmarks
naturally lends itself to supporting a variety of views on the
bookmark database. User-customizable queries can select
bookmarks by any criteria desired.
3.4 Accessibility Evaluation Report Items as Annotea
Annotations
Annotations can also be used to present automatically
generated report items, such as accessibility evaluation
items or markup validation items. If the report items are
described in the metadata format it is straight-forward to
map them to an annotation schema. For instance, the EARL
report item reporting an accessibility problem has semantics
that map easily into an annotation of a part or the whole of
the evaluated Web page. This mapping can be expressed as
a collection of inference rules over the properties produced
by the EARL tools.</p>
        <p>The generic metadata framework provides the necessary
flexibility to decide on a case by case basis whether to
archive, delete, or revise annotations when a document is
reprocessed through the evaluation tool. The tool can
maintain state information for successive runs in the same
metadata store.
4 CONCLUSIONS
A metadata based annotation infrastructure such as Annotea
can easily support a broad range of different annotation
needs. The generic property mechanism of RDF allows us
to construct ontology-neutral data stores. Applications can
use several ontologies simultaneously to describe different
aspects of their annotations.</p>
        <p>Most work in the scenarios is needed in the customization
of the user interfaces for the different annotation
applications. More research is needed to ease the
presentation of the metadata, especially new properties
from ontologies the application (or user) may not have
previously seen.</p>
        <p>The RDF model provides a convenient mechanism on
which to layer client-side or server-side inferencing for
mapping between ontologies. Further work to build
effective end-user tools to take advantage of this capability
is in progress.</p>
        <p>ACKNOWLEDGMENTS
We thank Jose Kahan, Eric Prud’hommeaux, Art Barstow,
Eric Miller and other W3C staff for their many ideas that
have contributed to this paper. We also thank Charles
McCathieNevile for his help when we were experimenting
with EARL scenarios.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>José</given-names>
            <surname>Kahan</surname>
          </string-name>
          ,
          <string-name>
            <surname>Marja-Riitta</surname>
            <given-names>Koivunen</given-names>
          </string-name>
          , Eric Prud'Hommeaux,
          <string-name>
            <surname>Ralph R. Swick</surname>
          </string-name>
          ,
          <article-title>Annotea: An Open RDF Infrastructure for Shared Web Annotations</article-title>
          ,
          <source>in Proc. of the WWW10 International Conference</source>
          , Hong Kong, May
          <year>2001</year>
          (http://www10.org/cdrom/papers/488/index.html).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Ora</given-names>
            <surname>Lassila</surname>
          </string-name>
          and Ralph R. Swick, eds.,
          <string-name>
            <surname>Resource Description</surname>
          </string-name>
          <article-title>Framework (RDF) Model and Syntax Specification, W3C Recommendation</article-title>
          ,
          <year>February 1999</year>
          (http://www.w3.org/TR/1999/REC-rdf-syntax19990222).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3. Frank van Harmelen,
          <string-name>
            <surname>Peter F. Patel-Schneider</surname>
          </string-name>
          and Ian Horrocks, eds,
          <source>Reference description of the DAML+OIL (March</source>
          <year>2001</year>
          )
          <article-title>ontology markup language, Joint United States / European Union ad hoc Agent Markup Language Committee (http://www</article-title>
          .daml.org/
          <year>2001</year>
          /03/reference.html).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>