<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Harnessing the power of folksonomies for formal ontology matching on-the-y</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Theodosia Togia</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fiona McNeill</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alan Bundy</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>School of Informatics, University of Edinburgh</institution>
          ,
          <addr-line>EH8 9LE, Scotland</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper is a short introduction to our work on building and using folksonomies to facilitate communication between Semantic Web agents with disparate ontological representations. We briey present the Semantic Matcher, a system that measures the semantic proximity between terms in interacting agents' ontologies at run-time, fully automatically and minimally: that is, only for semantic mismatches that impede communication. The system is designed to allow agents to "understand" the meanings of terms to be matched by comparing their folksonomy-based "mental representations".</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
    </sec>
    <sec id="sec-2">
      <title>Using Folksonomies for Ontology Matching</title>
      <p>
        The architecture of our matcher is inspired by that of search engines. Broad
folksonomies (comparable to "virtual documents" [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] or bags of words) are built
for every candidate term (i.e. name of a relation, class or individual) in the
requester’s ontology with our sense creation algorithm (which extracts information
from databases such as WordNet and SUMO and manipulates it with techniques
such as stemming and stopping 1). During agent interaction, when ORS diagnoses
a semantic mismatch, a sense must be created for the unknown term, which will
act as a query to the search engine. This step must be performed on-the-y
without interrupting normal interaction more than necessary. Our search engine then
takes the sense representing the unknown term and a list of senses representing
the requester’s candidate terms as input and returns as output a ranking of the
candidate terms.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Implementation and Evaluation</title>
      <p>The system briey discussed here has been fully implemented. Evaluation was
performed using dierent versions of the SUMO ontology and its sub-ontologies
from the Sigmakee repository 2. When terms are changed between SUMO
versions (e.g. "Corn" becomes "Maize"), we have an objective way of measuring
the performance of the matcher because we can safely regard terms and their
renamings as synonyms and compare these pairings with our system’s prediction.
Initial results are encouraging, with 57% of correct matches chosen as the best
by the system, 19% as the second-best.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Further Work and Conclusions</title>
      <p>
        This paper briey introduced our work on integrating folksonomies with formal
ontologies to perform matching on-the-y, whenever the need becomes apparent.
We believe these ideas could be a major step forward in the problem of ontology
matching in an agent communication environment, and in providing symbol
grounding for ontology terms. Furthermore, they can provide a framework for
the design of matchers which exploit the vast amount of tag data available on
the web. Full details of the theory on which this work is based, together with
full descriptions of the implementation and evaluation, can be found in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. We
are currently extending this work and evaluating it more fully.
1 Supplementing this data with folksonomies discovered on the web (e.g. tags that
often co-occur with the tag "cat") is also possible though not currently implemented.
2 http://sigmakee.cvs.sourceforge.net/viewvc/sigmakee/KBs/
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>G.</given-names>
            <surname>Frege</surname>
          </string-name>
          (
          <year>1892</year>
          ).
          <article-title>On sense and reference</article-title>
          . In P. Ludlow, editor,
          <source>Readings in the Philosophy of Language</source>
          . MIT,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>F.</given-names>
            <surname>McNeill</surname>
          </string-name>
          and
          <string-name>
            <given-names>Alan</given-names>
            <surname>Bundy</surname>
          </string-name>
          .
          <article-title>Dynamic, automatic, rst-order ontology repair by diagnosis of failed plan execution</article-title>
          .
          <source>IJSWIS (International Journal on Semantic Web and Information Systems) special issue on Ontology Matching</source>
          ,
          <volume>3</volume>
          :
          <fpage>135</fpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>T.</given-names>
            <surname>Togia</surname>
          </string-name>
          .
          <source>Automated ontology evolution: Semantic</source>
          matching.
          <source>Master's thesis</source>
          , University of Edinburgh,
          <year>2010</year>
          . Available online at http://dream.inf.ed.ac.uk/projects/dor/ (unpublished).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>T. Vander</given-names>
            <surname>Wal</surname>
          </string-name>
          .
          <source>Explaining and showing broad and norrow folksonomies</source>
          ,
          <year>2005</year>
          . available online at http://www.vanderwal.net/random/entrysel.php?blog=
          <fpage>1635</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5. W. Hu Y. Qu and G. Cheng.
          <article-title>Constructing virtual documents for ontology matching</article-title>
          .
          <source>In Proceedings of the 15th International WWW Conference</source>
          , pages
          <fpage>2331</fpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>