<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Semantic Matching of Ontologies</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Christoph Quix</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marko Pascan</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pratanu Roy</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>David Kensche</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Informatik 5 (Information Systems), RWTH Aachen University</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Introduction The discovery of semantic relationships such as subsumption and
disjointness is still a challenge in ontology matching [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Existing methods use logical
reasoning over computed equivalence relationships or machine learning based on
lexical and structural features of ontology elements [
        <xref ref-type="bibr" rid="ref3 ref7">7, 3</xref>
        ]. While these methods deliver
good results for some cases, they are limited to the information contained in the input
ontologies to be matched. Therefore, background knowledge in form of an additional
ontology may be useful to detect semantic relationships. In existing approaches, the
identification of an appropriate ontology as background knowledge is often a task left
for the user. We present two enhanced approaches for identifying semantic
relationships. The first one is based on background knowledge; in contrast to other approaches,
it is able to identify a background ontology automatically. The second approach builds
on existing machine learning methods for identifying semantic relationships. First
evaluation results for these methods and combined approaches show that the integration of
these methods is reasonable as more semantic relationships are identified.
Semantic Matching using Background Knowledge It has been shown in previous
works that using an ontology as background knowledge can improve the match result
[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The selection of the background ontology is obviously an important step in such an
approach. While earlier works either relied on the user to provide such an ontology [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ],
or used very general upper ontologies (e.g. SUMO-OWL, [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]), our approach is able to
select the background ontology automatically. The idea is illustrated in fig. 1 and 2. For
the input ontologies S and T, we generate keyword queries for a web search engine (e.g.,
Google or Swoogle), and for our local ontology repository. The external search engine
is only used if the local repository does not contain an appropriate ontology. When a
background ontology O is found it can be used for matching. In addition to the direct
alignment Adir, two alignments AO;S and AO;T , between the input ontologies and
the background ontology, are computed. Then, for each pair of correspondences from
AO;S and AO;T , existence of a relationship (i.e., equivalence, subsumption) between
the model elements from O is determined. If that is the case, a new correspondence
between the concepts from S and T can be inferred. All the correspondences found in
this way are called semantic matches (Asem). Eventually, the final result is created by
building the union of Adir and Asem.
      </p>
      <p>
        Using Machine Learning for Semantic Matching Our second, complementary method
uses machine learning to identify semantic relationships. We implemented an approach
similar to the method presented in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. The computation of subsumption relationships
is considered as a binary classification task, i.e., a concept pair is classified into two
possible classes: subsumption and not-subsumption relationship.
      </p>
      <p>Because ontologies are usually hierarchical structures, the subsumption
relationships found in input ontologies can be used as training examples, making the process
of classifier training independent of alignment computation. Each training example is a</p>
    </sec>
    <sec id="sec-2">
      <title>Ontologies</title>
      <p>(from the web)
O
O
O
O</p>
    </sec>
    <sec id="sec-3">
      <title>Keyword Queries</title>
      <p>Repository</p>
      <p>QT</p>
      <p>Asem
Fig. 2. Matching using a background ontology
1,0 Initial
0,8 ML
0,6 BGK
000,,,240 CCoommbb..12
cmt‐confOf cmt‐ekaw cmt‐iasted cmt‐sigkdd confOf‐ekaw
Fig. 3. Recall for OAEI conference track
concept pair (ci; cj ), where ci v cj and both concepts belong to a single ontology, i.e.,
the source or target ontology of a matching task. We use distinct words, extracted from
both source and target ontologies, as features for the machine learning method. In order
to represent the concept pairs in the feature space, each concept ci of a concept pair is
described by a set of feature space words that can be found in its neighborhood,
constituting the concept’s context document Dci . The notion of the concept’s neighborhood
can be defined in various ways. In our implementation Dci is created from words found
in: name, label, comment, instances, data and object properties of ci, direct sub or super
concepts of ci, concepts in union or intersection definitions of ci and equivalent
concepts of ci. The context documents of concepts are translated to feature vectors which
are used as input for the machine learning method (currently, C4.5 decision tree). We
use an optimization to avoid the classification of all concept pairs.</p>
      <p>
        Conclusion The presented approaches have been integrated into our generic matching
system GeRoMeSuite [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. To evaluate our approach, we used semantic precision and
recall as measures [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and data sets from the oriented track of OAEI 2009. We created
also two combined match configurations using both approaches. The results in fig. 3
show very good results for the combined approaches.
      </p>
      <p>Acknowledgments: This work is supported by the Umbrella Cooperation Programme
(http://www.umbrella-coop.org/).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Z.</given-names>
            <surname>Aleksovski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Klein</surname>
          </string-name>
          , W. ten
          <string-name>
            <surname>Kate</surname>
            ,
            <given-names>F. van Harmelen. Matching</given-names>
          </string-name>
          <string-name>
            <surname>Unstructured</surname>
          </string-name>
          <article-title>Vocabularies Using a Background Ontology</article-title>
          .
          <source>Proc. EKAW</source>
          .
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>J.</given-names>
            <surname>Euzenat</surname>
          </string-name>
          .
          <article-title>Semantic precision and recall for ontology alignment evaluation</article-title>
          .
          <source>Proc. IJCAI</source>
          , pp.
          <fpage>348</fpage>
          -
          <lpage>353</lpage>
          .
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Y. R.</given-names>
            <surname>Jean-Mary</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. P.</given-names>
            <surname>Shironoshita</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. R.</given-names>
            <surname>Kabuka</surname>
          </string-name>
          .
          <article-title>Ontology matching with semantic verification</article-title>
          .
          <source>Journal of Web Semantics</source>
          ,
          <volume>7</volume>
          (
          <issue>3</issue>
          ):
          <fpage>235</fpage>
          -
          <lpage>251</lpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>D.</given-names>
            <surname>Kensche</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Quix</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Li</surname>
          </string-name>
          .
          <article-title>GeRoMeSuite: A System for Holistic Generic Model Management</article-title>
          .
          <source>Proc. VLDB</source>
          , pp.
          <fpage>1322</fpage>
          -
          <lpage>1325</lpage>
          .
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>V.</given-names>
            <surname>Mascardi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Locoro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rosso</surname>
          </string-name>
          .
          <article-title>Automatic Ontology Matching via Upper Ontologies: A Systematic Evaluation</article-title>
          .
          <source>IEEE Trans. on Knowl. &amp; Data Eng.</source>
          ,
          <volume>22</volume>
          :
          <fpage>609</fpage>
          -
          <lpage>623</lpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>P.</given-names>
            <surname>Shvaiko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Euzenat</surname>
          </string-name>
          .
          <article-title>Ten Challenges for Ontology Matching</article-title>
          .
          <source>Proc. ODBASE</source>
          , pp.
          <fpage>1164</fpage>
          -
          <lpage>1182</lpage>
          .
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>V.</given-names>
            <surname>Spiliopoulos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. A.</given-names>
            <surname>Vouros</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Karkaletsis</surname>
          </string-name>
          .
          <article-title>On the discovery of subsumption relations for the alignment of ontologies</article-title>
          .
          <source>Web Semantics</source>
          ,
          <volume>8</volume>
          (
          <issue>1</issue>
          ):
          <fpage>69</fpage>
          -
          <lpage>88</lpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>