<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Towards a complex alignment evaluation dataset</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Élodie Thiéblin</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ollivier Haemmerlé</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nathalie Hernandez</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cassia Trojahn</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>IRIT &amp; Université de Toulouse 2 Jean Jaurès</institution>
          ,
          <addr-line>Toulouse</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>- simple correspondence when both eo1 and eo2 are single entities: e.g. 8x; o1:Person(x) o2:Human(x) is a simple correspondence. - complex correspondence when at least one of eo1 or eo2 is a construction of entities, i.e. involving at least a constructor or a transformation function. For example, 8x,y, o1:priceInDollars(x,y) 9y1, o2:priceInEuro(x,conversion(y)) is a complex correspondence with a transformation function (conversion that states that y1 = changeRate y). 8x, o1:AcceptedPaper(x) 9y, o2:Paper(x) ^ o2:acceptedBy(x,y) is a complex correspondence with constructors.</p>
      </abstract>
      <kwd-group>
        <kwd>complex alignments</kwd>
        <kwd>evaluation dataset</kwd>
        <kwd>complex dataset</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Motivation and background</title>
      <p>
        Simple ontology alignments, largely studied, link one entity from a source
ontology to one entity of a target ontology. One of the limitations of these alignments
is, however, their lack of expressiveness which can be overcome by complex
alignments. Different approaches for generating complex alignments have emerged in
the literature [
        <xref ref-type="bibr" rid="ref4 ref5 ref6">4,5,6</xref>
        ]. However, there is a lack of datasets on which they can be
evaluated.
      </p>
      <p>
        Ontology matching is the process of generating an alignment. An alignment
A between a source o1 and a target o2 ontologies is a set of correspondences [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
Each correspondence is a triple heo1; eo2; ri. eo1 and eo2 are the members of the
correspondence: they can be single ontology entities or constructions of these
entities using constructors or transformation functions. r is a relation (e.g., ,
, ) between eo1 and eo2. We consider two types of correspondences:
      </p>
      <p>A complex alignment contains at least one complex correspondence.</p>
    </sec>
    <sec id="sec-2">
      <title>The evaluation dataset</title>
      <p>
        The proposed dataset is based on the OntoFarm dataset [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] composed of 16
ontologies on the conference organisation domain and simple reference alignments
between 7 of these ontologies. This dataset has been widely used in the ontology
alignment evaluation domain [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. The dataset proposed here is a first version of
an extension of the OntoFarm dataset including complex correspondences. 3 out
of the 7 ontologies of the reference alignments have been manually aligned (cmt,
conference and edas), resulting in 3 alignments: cmt-conference, cmt-edas and
conference-edas. The methodology applied to create the complex dataset consists
in manually finding an equivalent construction of target entities for each source
entity. All correspondences have a single entity member and an other member
that is either a single entity (simple correspondence) or a construction (complex
correspondence). The correspondences are diverse for they can be classified with
8 different correspondence patterns or compositions of them [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. In the 3
alignments, the dataset contains 51 complex correspondences. The alignments are
expressed in First Order Logic and in EDOAL1. The resulting alignments were
translated into OWL axioms as an ontology merging process. The HermiT
reasoner [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] was used to check the consistency of the merged ontology. The dataset is
available online at http://doi.org/10.6084/m9.figshare.4986368.v4 under
a CC-BY License.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Conclusion and future work</title>
      <p>
        We have proposed a complex coherent dataset with complex correspondences
between 3 ontologies of the OntoFarm dataset. As perspectives, the dataset will
be extended with other ontologies of this dataset. The confidence of a
correspondence (a value associated with a correspondence to express its confidence degree)
could be added to the dataset. This could express, as in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], the consensus level
of experts on each correspondence. Finally, we aim at using this dataset for the
purpose of evaluating complex matchers.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Cheatham</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hitzler</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          : Conference v2.
          <article-title>0: An uncertain version of the OAEI Conference benchmark</article-title>
          .
          <source>In: ISWC</source>
          . pp.
          <fpage>33</fpage>
          -
          <lpage>48</lpage>
          . Springer (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Euzenat</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shvaiko</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          : Ontology Matching. Springer Berlin Heidelberg (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Glimm</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motik</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stoilos</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>HermiT: An OWL 2 reasoner</article-title>
          .
          <source>Journal of Automated Reasoning</source>
          <volume>53</volume>
          (
          <issue>3</issue>
          ),
          <fpage>245</fpage>
          -
          <lpage>269</lpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Jiang</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lowd</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kafle</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dou</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Ontology matching with knowledge rules. In: Transactions on Large-Scale Data-</article-title>
          and
          <string-name>
            <surname>Knowledge-Centered Systems</surname>
            <given-names>XXVIII</given-names>
          </string-name>
          , pp.
          <fpage>75</fpage>
          -
          <lpage>95</lpage>
          . Springer (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Parundekar</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Knoblock</surname>
            ,
            <given-names>C.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ambite</surname>
            ,
            <given-names>J.L.</given-names>
          </string-name>
          :
          <article-title>Linking and building ontologies of linked data</article-title>
          .
          <source>In: ISWC</source>
          . pp.
          <fpage>598</fpage>
          -
          <lpage>614</lpage>
          . Springer (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Ritze</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meilicke</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Šváb Zamazal</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stuckenschmidt</surname>
          </string-name>
          , H.:
          <article-title>A pattern-based ontology matching approach for detecting complex correspondences</article-title>
          .
          <source>In: 4th ISWC workshop on ontology matching</source>
          . pp.
          <fpage>25</fpage>
          -
          <lpage>36</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Scharffe</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Correspondence Patterns Representation</article-title>
          .
          <source>Ph.D. thesis</source>
          , Faculty of Mathematics, Computer Science and University of Innsbruck (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Zamazal</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Svátek</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>The Ten-Year OntoFarm and its Fertilization within the Onto-Sphere</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          <volume>43</volume>
          ,
          <fpage>46</fpage>
          -
          <lpage>53</lpage>
          (
          <year>Mar 2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Šváb</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Svátek</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Berka</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rak</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tomášek</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          : Ontofarm:
          <article-title>Towards an experimental collection of parallel ontologies</article-title>
          .
          <source>Poster Track of ISWC</source>
          <year>2005</year>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>