<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Towards the definition of guidelines for RDF and Topic Maps interoperability</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Valentina Presutti</string-name>
          <email>presutti@cs.unibo.it</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lars Marius Garshol</string-name>
          <email>larsga@ontopia.net</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fabio Vitali Steve Pepper</string-name>
          <email>pepper@ontopia.net</email>
          <email>vitali@cs.unibo.it</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nicola Gessa</string-name>
          <email>gessa@cs.unibo.it</email>
        </contrib>
      </contrib-group>
      <fpage>83</fpage>
      <lpage>88</lpage>
      <abstract>
        <p>The Semantic Web relies on the presence of semantic annotations which describe information in a machine readable form. There are two standard formalisms which are suitable for this aim: RDF and Topic Maps. This paper presents an analysis of the issues that need to be addressed in order to define a set of rules for performing automated and consistent translations between RDF and Topic Maps, i.e. semantic mapping issues. The analysis is based on existing approaches to the RDF and Topic Maps translation in the light of the new official formal models released for both RDF and Topic Maps.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Topic Maps in a nutshell We assume the reader is already familiar with RDF,
but probably many readers are not familiar with Topic Maps, and so we will
provide a brief introduction. Topic Maps consist of topics, each of which represent
some thing of interest, known as subjects Associations repesent relationships
between two or more topics, have a type (which is a topic) and consist of a set of
(role type, role player) pairs (where both elements are topics). Occurrences can
be either simple property-value assignments for topics, or references to resources
which contain information pertinent to a topic. Occurrences also have a type
(which is a topic). In addition, topics may have one or more names (which also
may be typed). Further, any construct can be reified (associations, occurrences,
roles, and names), and any construct (except roles) may have a scope, which
is a set of topic representing the context in which the construct is valid. The
technology stack consists of a data model [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], an XML interchange syntax [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], a
constraint language [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] , and a query language [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Analysis of existing approaches</title>
      <p>
        We decided to focus on five existing proposals. Each translation proposal has
been evaluated against the general criteria of: Completeness, and
Naturalness. All the existing approaches fall into two distinct categories that in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] are
called ”object mappings” and ”semantic mappings”. The two approaches can be
summed up as follows: (i) object mappings use the low-level building blocks
of one language to describe the object model of the other; (ii) semantic
mappings start from higher level concepts that carry the semantics of each model
and attempt to find equivalences between them. The analysis of the options
and solutions provided in literature clearly shows the advantages of semantic
mapping.
      </p>
      <p>
        For the sake of brevity a description of the existing approaches is summarized
in a table shown in Table 1. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] describes two proposals which differ in the
approach taken, hence it is evaluated twice.
      </p>
      <p>
        Proposal Approach Basis Direction Complete? Natural?
Moore [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] semantic mapping informal both no yes
Moore [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] object mapping informal both almost no
Lacher and Decker [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] object mapping [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] TM2RDF almost no
Ogievetsky [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] object mapping [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] both almost no
Garshol [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] semantic mapping [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] both almost almost
UniBo [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] hybrid [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] both almost almost
      </p>
      <p>Table 1. Evaluation of existing proposals</p>
      <p>
        It should be noted that [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] are both based on PMTM4[
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], an early data
model for Topic Maps that has since been abandoned. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] predates any published
data model for Topic Maps, and so is based on an informal understanding of
Topic Maps. The following section describes the result of this analysis.
2.1
      </p>
      <p>
        Semantic mapping issues
This section gives an informal description of the mapping behaviour we currently
think is correct, with notes about currently open issues.
– Things and proxies: there is a fundamental equivalence between subjects
and resources. This equivalence may be refined as follows: topics are to
subjects as RDF nodes (excluding literals) are to resources. Subjects and
resources are the things (entities, concepts, documents, whatever) about which
assertions are made. Topics and RDF nodes (excluding literals) are the
corresponding proxies that represent subjects and resources within the Topic
Maps and RDF models respectively.
– Identity: both topics and resources may use URI references (or URIrefs) as
identifiers. However, in Topic Maps there are two ways in which a URIref can
be used to identify a subject: (i) directly, as the actual address (or locator) of
the subject, in which case it is called a subject locator ; (ii) indirectly, as the
locator of an information resource that provides some human-interpretable
indication of the subject, in which case it is called a subject identifier. There
are problems with all of the analyzed proposals. Equating URIs in RDF with
subject locators is problematic in several ways. Firstly it leads to incorrect
semantics (as the description of the Unibo proposal shows). Secondly, the
result is less natural. Finally, the identifiers of occurrence types and
association types (which are typically subject identifiers) could not be used as
the URIs of RDF properties. Equating URIs with subject identifiers also
yields unnatural results, since the identifier of an addressable subject (i.e.,
an information resource) will not become the URI of the corresponding
resource, as would be most natural in RDF. However, this alternative does
not exhibit the other problems that result from favouring subject locators.
The ideal solution would be to allow either subject identifiers or subject
locators to be regarded as URIs (and vice versa), but at the same time to
retain sufficient information when going from Topic Maps to RDF to be
able to perform round-tripping. The recognition in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] tag of the distinction
between resources in general and information resources, and the insights in
[
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], may provide the foundation for such a solution. The issue of
multiple identifiers is treated explicitly by [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] only. His proposal to use
equivalence properties defined in OWL (i.e., owl:sameAs, owl:equivalentClass, and
owl:equivalentProperty) should clearly be investigated in more detail since
such an approach is likely to lead to increased interoperability between RDF
and Topic Maps.
– Names: in RDF the name of a resource is usually represented by a single
statement. RDF Schema defines a property for this purpose (rdfs:label) but
many vocabularies define their own properties (e.g., dc:title, foaf:name,
etc.). An accurate semantic mapping from Topic Maps can be achieved by
translating base names to such properties. Both [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] and [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] take this
approach, differing only in that [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] always maps a base name to rdfs:label (and
vice versa), while [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] allows base names (including scoped base names) to
be mapped to other properties. There are two possible situations going from
Topic Maps to RDF: typed names and untyped names. For typed names the
solution is to map name types to properties and properties to name types.
Instead, there are two basic, alternative approaches to untyped names: (i)
always translate untyped base names to the same property, e.g., rdfs:label,
rdftm:name, tm:basename. This approach leads on one hand to simpler
mapping, and on the other hand to less natural result; (ii) translate untyped base
names to different properties depending on the type of the topic/resource.
With this approach we can obtain more natural result, but the mapping is
much more complex.
– Binary relationships: representations of binary relationships have
somewhat different topographies in RDF and Topic Maps. RDF uses a single
statement , in which the subject and object represent the two resources that
participate in the relationship. In Topic Maps there is no concept of subject
and object in a binary association because the association has no direction.
The nature of the two participating topics’ involvement in the relationship
is stated explicitly through their role types. The challenge is complimentary
to this when going from Topic Maps to RDF. Both [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] solve this by
allowing additional information to be provided that allows the RDF subject
and object to be connected with their respective role types.
– Non-binary relationships: one major difference between the models of
RDF and Topic Maps is that the latter permits non-binary relationships
to be expressed directly: an association may have one, two, or more role
players. In RDF on the other hand the base model permits only binary
relationships. Most of the existing proposals for translating associations with
more than two role-players are unsatisfactory, since they result in a large
number of RDF statements. [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] proposes patterns for representing n-ary
relations in RDF in which the relation is ”re-represented” as a class rather
than a property. Each such pattern requires n statements in order to express
the relationship. If such patterns are adopted in the RDF community it
would seem to be advisable, in the interest of compatibility, to follow them
as closely as possible when translating n-ary associations from Topic Maps
to RDF.
– Occurrences: both [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] recognize that occurrences are most
naturally represented as single RDF statements where the property corresponds
to the occurrence type. Internal and external occurrences correspond to
statements whose objects are literals and resources respectively. Going from
Topic Maps to RDF presents no problems at all; going the other way seems to
require additional information in order to distinguish an internal occurrence
from a name, and an external occurrence from an association or identifier.
– Types and subtypes: [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] agree on the fundamental semantic
equivalence between the concept of type-instance in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and rdf:type, on
the one hand; and between supertype-subtype and rdfs:subClassOf on the
other. In addition, association types and occurrence types are regarded as
equivalent to RDF properties. Role types present particular problems, as
discussed above, and name types, as already noted, did not exist at the time
the proposals were written.
– Reification: only [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] mention reification and neither proposal
regards it as being problematic. In actual fact, [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] only talks explicitly about
the reification of associations, while [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] mentions reified names, occurrences,
and associations. Neither proposal covers the reification of topic maps and
association roles.
– Scope: the concept of scope is peculiar to Topic Maps and has been regarded
as one of the major stumbling blocks for RDF/Topic Maps interoperability.
All the existing proposals discuss the issue in one form or another but only
[
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] do so in terms of its semantics, i.e., as a way to express the
contextual validity of an assertion. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] makes the point that scope is most
properly regarded as a special kind of assertion made about another
assertion. Since assertions about assertions are handled through reification in
both paradigms, and reification translates rather easily, Garshol proposes to
translate scope using reification together with a property that captures the
semantics of contextual validity. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] treats scoped base names as a special
case, however, and allows a base name in a particular scope to be translated
to a specific property. Given that the forthcoming revised Topic Maps
standard will permit typed names, it would be possible to avoid treating scoped
names as a special case and still obtain natural results.
– Other issues: none of the existing proposals discuss how to represent RDF
containers and collections, language tags, XML literals or typed literals in
Topic Maps. Of these issues, the latter two are addressed by recent
datatyping extensions to the Topic Maps model. Language tagging can be seen
as a kind of contextual information akin to scope and treated accordingly.
Containers and collections may or may not require special treatment. Since
they are expressed using the fundamental building blocks of RDF (nodes
and arcs), they may be represented using associations in Topic Maps. The
semantics would not be lost and could be recovered when round-tripping.
However, they would not be ”visible” in terms of some equivalent Topic
Maps construct.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Conclusion and future work</title>
      <p>The main result of this work has been to identify a number of outstanding issues
that need to be considered when providing an approach for the interoperability
between RDF and Topic Maps. The immediate next step is that of providing for
each issue described in section 2.1 a set of mapping rules that allow to perform
automated translation between RDF and Topic Maps, to translate data from
one form to the other without unacceptable loss of information or corruption
of the semantics. Furthermore, it should also be possible to query the results
of a translation in terms of the target model and it should be possible to share
vocabularies across the two paradigms. The overall approach we are following is
based on the definition of a specific vocabulary for the mapping. The guidelines
will provide rules for performing both guided and unguided translations.</p>
      <p>(RDF)</p>
      <p>Family
of</p>
      <p>Specifications.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1. Resource Description Framework http://www.w3.org/RDF,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>L. M.</given-names>
            <surname>Garshol</surname>
          </string-name>
          and
          <string-name>
            <given-names>G.</given-names>
            <surname>Moore</surname>
          </string-name>
          . ISO/IEC 13250-2:
          <string-name>
            <given-names>Topic</given-names>
            <surname>Maps - Data Model</surname>
          </string-name>
          ,
          <year>2005</year>
          . http://www.isotopicmaps.org/sam/sam-model/.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <article-title>Members of the Topic Maps</article-title>
          .Org Authoring Group.
          <source>Xml topic maps (xtm 1.0)</source>
          .
          <source>Technical report, ISO</source>
          ,
          <year>2001</year>
          . http://www.topicmaps.
          <source>org/xtm/1</source>
          .0/.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Moore</surname>
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bogachev</surname>
            <given-names>D.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Nishikawa</surname>
            <given-names>M.</given-names>
          </string-name>
          <article-title>Topic maps constraint language</article-title>
          .
          <source>Technical report</source>
          , TopicMaps.org,
          <year>2005</year>
          . http://www.isotopicmaps.org/tmcl/tmcl-2005
          <source>- 02-12</source>
          .html.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Garshol</surname>
            <given-names>L. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barta</surname>
            <given-names>R.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Nishikawa</surname>
            <given-names>M.</given-names>
          </string-name>
          <article-title>Topic maps query language</article-title>
          .
          <source>Technical report, ISO</source>
          ,
          <year>2005</year>
          . http://www.isotopicmaps.org/tmql/spec.html.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Lacher</surname>
            <given-names>M</given-names>
          </string-name>
          and
          <string-name>
            <surname>Decker S</surname>
          </string-name>
          .
          <article-title>On the integration of Topic Map data and RDF data</article-title>
          .
          <source>In Proccedings of the Extreme Markup Languages 2001 Conference</source>
          , Montreal, Canada,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Moore</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          <article-title>RDF and Topic Maps: An Exercise of Convergence</article-title>
          .
          <source>In Proceedings of the XML Europe 2001 Conference</source>
          , Berlin, Germany,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Biezunski</surname>
            <given-names>M.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Newcomb</surname>
            <given-names>S. R.</given-names>
          </string-name>
          <string-name>
            <surname>Topicmaps</surname>
          </string-name>
          .
          <article-title>net's Processing Model for XTM</article-title>
          . http://topicmaps.net/pmtm4.htm,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Ogievetsky</surname>
            <given-names>N. XML</given-names>
          </string-name>
          <string-name>
            <surname>Topic</surname>
          </string-name>
          <article-title>Maps through RDF glasses</article-title>
          .
          <source>In Proceedings of the Extreme Markup Languages 2001 Conference</source>
          , Montreal, Canada,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Garshol L.M.</surname>
          </string-name>
          <article-title>Living with RDF and Topic Maps</article-title>
          . http://www.ontopia.net/topicmaps/materials/tmrdf.html,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Ciancarini</surname>
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gentilucci</surname>
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pirruccio</surname>
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Presutti</surname>
            <given-names>V.</given-names>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <surname>VitaliF.</surname>
          </string-name>
          <article-title>Metadata on the web: On the integration of RDF and Topic Maps</article-title>
          .
          <source>In Proc. Conf. on Extreme Markup Languages</source>
          , page (to appear), Montreal, Canada,
          <year>August 2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Jacobs</surname>
            <given-names>I.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Walsh</surname>
            <given-names>N.</given-names>
          </string-name>
          <article-title>Architecture of the World Wide Web</article-title>
          .
          <source>Technical report, W3C</source>
          ,
          <year>2004</year>
          . http://www.w3.org/TR/webarch.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Pepper</surname>
            <given-names>S.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Schwab</surname>
            <given-names>S. Curing</given-names>
          </string-name>
          <article-title>the Web's Identity Crisis: Subject Indicators for RDF</article-title>
          .
          <source>Technical report, Ontopia</source>
          ,
          <year>2003</year>
          . http://www.ontopia.net/topicmaps/materials/identitycrisis.html (
          <year>2003</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14. Garshol L.
          <article-title>Topic maps, RDF, DAML, OIL: A Comparison</article-title>
          . http://www.ontopia.net/topicmaps/materials/tmrdfoildaml.html,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Noy</surname>
            <given-names>N.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Rector</surname>
            <given-names>A. Defining</given-names>
          </string-name>
          <article-title>N-ary Relations on the Semantic Web: Use With Individuals</article-title>
          .
          <source>Technical report, W3C</source>
          ,
          <year>2005</year>
          . http://www.w3.org/TR/swbp-naryRelations/ (
          <year>2004</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>