<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Comparison of Complex Correspondence Detection Techniques</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Brian Walshe</string-name>
          <email>walshebr@scss.tcd.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rob Brennan</string-name>
          <email>rob.brennan@scss.tcd.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Declan O'Sullivan</string-name>
          <email>declan.osullivan@scss.tcd.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>FAME &amp; Knowledge and Data Engineering Group, School of computer Science and Statistics, Trintiy College Dublin</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>One to one correspondences between entities are not always sufficient to describe the true relationship between related entities in diverse ontologies, and complex correspondences are needed instead. We demonstrate the types of complex correspondence occurring between two LOD sources and compare techniques for discovering these complex correspondences.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Most alignment research focuses on one-to-one correspondences between named
ontology elements [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], but these are not always sufficient for performing many
integration tasks [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Data values, for example, may need some form of translation, or
some form of condition may be required to scope a broader concept to correspond
with a narrower one. These correspondences, which contain conditions or
transformations, are known as complex correspondences.
      </p>
      <p>
        There are many known patterns of complex correspondence [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Conditional
correspondences – where instances of a concept in one ontology are related to a
corresponding concept in the other ontology only if they have a particular value for a given
attribute – include Class by Attribute Type (CAT), Class by Attribute Value (CAV),
and Class by Attribute Existence (CAE). Similarly, Class by Attribute Path
Correspondences (PATH) occur when some path of attributes must be followed before the
scope of the more general concept can be narrowed. Correspondences where the value
of an attribute must be altered in some way are called Attribute Transformation
Correspondences (ATC).
      </p>
      <p>
        In a sample of 50 concepts from YAGO2 [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], six of these concepts corresponded to
equivalent concepts in the DBpedia [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] ontology, and 14 concepts required a Class by
Attribute Value correspondence. Twenty-one concepts from YAGO2 corresponded
with DBpedia concepts with broader scope which could not be narrowed with a
correspondence pattern. Six YAGO2 concepts were aligned with DBpedia instances. We
found no cases of CAT or PATH correspondences.
Pattern Fitting
      </p>
      <p>MRDM
Model Fitting</p>
      <p>Boolean values</p>
      <p>No
Numerical</p>
      <p>
        Detecting Complex Correspondences
Approaches to detecting complex correspondences include a pattern based approach
[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], multi relational data mining (MRDM) [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and our model based approach [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
Each approach differs in the particular types of correspondence it can detect, and
these differences are outlined in table 1. The pattern based approach is the least
flexible. For attribute value based patterns it is only capable of detecting cases where
attributes have Boolean values. Each of the complex correspondences we found
between DBpedia and YAGO2 use non-Boolean attributes, and so it could not detect
these. The MRDM approach is more flexible, and is theoretically capable of finding
most correspondence patterns listed in section 1, except value transformation patterns.
Only the model fitting approach is capable of detecting value transformation
correspondences. The current implementation can detect numerical transformations, but the
approach could be extended to also detect transformations such as string splitting.
      </p>
      <p>Acknowledgement: This research is supported by the Science Foundation Ireland
(Grant 08/SRC/I1403) as part of the FAME Strategic Research Cluster.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Shvaiko</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Euzenat</surname>
          </string-name>
          , J.:
          <article-title>Ontology matching: state of the art and future challenges</article-title>
          .
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          . preprint, (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Scharffe</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fensel</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Correspondence patterns for ontology alignment</article-title>
          .
          <source>Knowledge Engineering: Practice and Patterns</source>
          .
          <volume>83</volume>
          -
          <fpage>92</fpage>
          (
          <year>2008</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Suchanek</surname>
            ,
            <given-names>F.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kasneci</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weikum</surname>
          </string-name>
          , G.:
          <article-title>Yago: A large ontology from wikipedia and wordnet</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web. 6</source>
          ,
          <fpage>203</fpage>
          -
          <lpage>217</lpage>
          (
          <year>2008</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Bizer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kobilarov</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Auer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Becker</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cyganiak</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hellmann</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>DBpedia - A crystallization point for the Web of Data</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          .
          <fpage>154</fpage>
          -
          <lpage>165</lpage>
          (
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Ritze</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meilicke</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sváb-Zamazal</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stuckenschmidt</surname>
          </string-name>
          , H.:
          <article-title>A pattern-based ontology matching approach for detecting complex correspondences</article-title>
          .
          <source>Proc. of Int. Workshop on Ontology Matching (OM)</source>
          (
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Qin</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dou</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lependu</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Discovering Executable Semantic Mappings Between Ontologies. 2007 OTM Confederated international conference on On the move to meaningful internet systems: CoopIS, DOA</article-title>
          , ODBASE, GADA, and IS. pp.
          <fpage>832</fpage>
          -
          <lpage>849</lpage>
          (
          <year>2007</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Walshe</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <source>Identifying Complex Semantic Matches. 9th Extended Semantic Web Conference</source>
          . pp.
          <fpage>849</fpage>
          -
          <lpage>853</lpage>
          (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>