<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>XMap : Results for OAEI 2017</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Warith Eddine DJEDDI</string-name>
          <email>djeddi@labged.net</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mohamed Tarek KHADIR</string-name>
          <email>khadir@labged.net</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sadok BEN YAHIA</string-name>
          <email>sadok.benyahia@fst.rnu.tn</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Faculty of Sciences of Tunis, University of Tunis El-Manar</institution>
          ,
          <addr-line>LIPAH-LR 11ES14, 2092</addr-line>
          ,
          <country country="TN">Tunisia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>LabGED, Computer Science Department, University Badji Mokhtar</institution>
          ,
          <addr-line>Annaba</addr-line>
          ,
          <country country="DZ">Algeria</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>We describe in this paper the XMap system and the results achieved during the 2017 edition of the Ontology Alignment Evaluation Initiative. XMap aims to tackle the issue of matching large scale ontologies by involving particular parallel matching on multiple cores or machines. Presentation of the system XMap, as for eXtended Mapping, is one of the leading ontology matching systems for large-scale ontology matching relying on the notion of context in order to deal with lexical ambiguity as well as a divide-and-conquer approach to tackle the issue of matching large ontologies. In XMap, the measurement of lexical similarity in ontology matching is performed using a synset, defined in WordNet [1] and UMLS [2]. In our approach, the similarity between two entities of different ontologies is evaluated not only by investigating the semantics of the entities names, but also taking into account the context, through which the effective meaning is described. The translation into many languages is based on the Microsoft ⃝RTranslator. Our system stores locally all translation results from Microsoft ⃝R Translator in dictionary files. The translator will also be queried only when no stored translation are found in order to gain time and avoid overloading the server.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        XMap using an oracle by modifying the validation process of the candidate mappings
according to the quality of the interactive matching in terms of F-measure and number
of required interactions. This process is performed after each round of candidate
retrieving. Our approach is based on semantic techniques and on a parallel execution strategy
adapted from [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], to address the challenge of scalability and efficiency of matching
techniques. One of the main trusts of the introduced approach is the increasing scalability
and speed of ontology alignment by matching linguistic and structural features.
      </p>
      <p>
        At a glance, the mapping process of XMap is depicted in Figure 1. XMap uses
various similarity measures of different categories such as string, linguistic, and structural
based similarity measures, each contributing to some extent to the alignment results.
Afterwards, the alignments from all matchers can be aggregated to obtain a final alignment
through the use of sequential composition [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Finally, a fast repair method is applied
so as to detect and remove the inconsistent classes by ”‘Applying Logical Constraints
on Matching Ontologies”’ (ALCOMO) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The main goal is to try to remove less
unsatisfiable classes (discovering disjointness relationships) without having an impact on
the F-measure score.
3
      </p>
    </sec>
    <sec id="sec-2">
      <title>Results</title>
      <p>In this section, we present the evaluation results obtained by running XMap under the
SEALS client with Anatomy, Conference, Multifarm, Interactive matching evaluation,
Large Biomedical Ontologies and Disease and Phenotype tracks.</p>
      <p>Anatomy The Anatomy track consists of finding an alignment between the Adult
Mouse Anatomy (2744 classes) and a part of the NCI Thesaurus (3304 classes)
describing the human anatomy. XMap achieves a good F-Measure value of 89% in
a reasonable amount of time (37 sec.) (see Table 1). In terms of F-Measure/runtime,
XMap is ranked 2nd among the tools participated in this track.
Conference The Conference track uses a collection of 16 ontologies from the domain
of academic conferences. Most ontologies were equipped with OWL-DL axioms of
various types; this opens a useful way to test our semantic matchers. For each reference
alignment, three evaluation modalities are applied : a) crisp reference alignments, b) the
uncertain version of the reference alignment, c) logical reasoning.</p>
      <p>As depicted in Table 2 and 3, XMap produces fairly consistent alignments when
matching the conference ontologies. Finally, XMap generated only one incoherent
alignment for the evaluation based on logical reasoning.</p>
      <p>Multifarm This track is based on the translation of the OntoFarm collection of
ontologies into 9 different languages. XMap have low performance due to many internal
exceptions. The results are showed in Table 4.</p>
      <p>
        Interactive matching evaluation The goal of this evaluation is to imitate interactive
alignment [
        <xref ref-type="bibr" rid="ref6 ref7">6, 7</xref>
        ], where a oracle user is involved to validate the correspondences found
by the alignment approach by checking the reference alignment, and changing error
values in order to assess their influence on the performance of alignment systems. For the
2017 edition, participating systems are evaluated on the Conference, Anatomy, Large
biomedical and Phenotype datasets using an oracle based on the reference alignment.
      </p>
      <p>XMap uses various similarity measures to generate candidate mappings. It applies
two thresholds to filter the candidate mappings: one for the mappings that are directly
added to the final alignment and another for those that are presented to the user for
validation. The latter threshold is selected to be high in order to minimize the number of
requests and the rejected candidate mappings from the oracle; the requests are mainly
about incorrect mappings. The mappings accepted by the user are moved to the final
alignment. For the two years 2016 and 2017, XMap preserved roughly the same
FMeasure value, and it benefits the least from the interaction with the oracle. All XMap’s
measures differ with less than 0:2% from the non-interactive runs, and performance
does not change at all with the increasing error rates.</p>
      <p>Large biomedical ontologies This track consists of finding alignments between the
Foundational Model of Anatomy (FMA), SNOMED CT, and the National Cancer
Institute Thesaurus (NCI). The results obtained by XMap are depicted by Table 5.</p>
      <p>In general, we can conclude that XMap achieved a good precision/recall values. The
high recall value can be explained by the fact that UMLS thesaurus contains definitions
of highly technical medical terms.</p>
      <p>Disease and Phenotype This track based on a real use case where it is required to find
alignments between disease and phenotype ontologies. Specifically, the selected
ontologies are the Human Phenotype Ontology (HPO), the Mammalian Phenotype Ontology
(MP), the Human Disease Ontology (DOID), and the Orphanet and Rare Diseases
Ontology (ORDO).</p>
      <p>XMap achieved fair results according to the three evaluation (Silver standard,
Manually generated mappings and Manual assessment of unique mappings).
4
4.1</p>
    </sec>
    <sec id="sec-3">
      <title>General comments</title>
      <sec id="sec-3-1">
        <title>Comments on the results</title>
        <p>This is the 5th time that we participate in the OAEI campaign. The official results of
OAEI 2017 show that XMap is competitive with other well-known ontology matching
systems in all OAEI tracks.
4.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Comments on the OAEI 2017 procedure</title>
        <p>As a fifth participation, we found the OAEI procedure very convenient and the
organizers very supportive. The OAEI test cases are various, and this leads to a comparison on
different levels of difficulty, which is very interesting. We found that SEALS platform
is a precious tool to compare the performance of our system with the others.
5</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>In this paper, we presented the results achieved during the 2017 edition of the OAEI
campaign. The used benchmark helped greatly identify the power and weaknesses of
the algorithm. In addition, XMap showed the feasibility of our approach especially on
large-scale biomedical ontologies which was a thriving challenge in ontology matching
domain.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Christiane</surname>
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Fellbaum. WordNet - An Electronic</surname>
          </string-name>
          Lexical Database. MIT Press,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Olivier</given-names>
            <surname>Bodenreider</surname>
          </string-name>
          .
          <article-title>The unified medical language system (UMLS): integrating biomedical terminology</article-title>
          .
          <source>Nucleic Acids Research</source>
          ,
          <volume>32</volume>
          (
          <string-name>
            <surname>Database-Issue</surname>
          </string-name>
          ):
          <fpage>267</fpage>
          -
          <lpage>270</lpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Anika</given-names>
            <surname>Gross</surname>
          </string-name>
          , Michael Hartung, Toralf Kirsten, and
          <string-name>
            <given-names>Erhard</given-names>
            <surname>Rahm</surname>
          </string-name>
          .
          <article-title>On matching large life science ontologies in parallel</article-title>
          .
          <source>In Data Integration in the Life Sciences</source>
          , pages
          <fpage>35</fpage>
          -
          <lpage>49</lpage>
          . Springer,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4. Je´roˆme Euzenat and
          <string-name>
            <given-names>Pavel</given-names>
            <surname>Shvaiko</surname>
          </string-name>
          .
          <source>Ontology Matching</source>
          . Springer-Verlag New York, Inc., Berlin, Heidelberg,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Christian</given-names>
            <surname>Meilicke</surname>
          </string-name>
          .
          <article-title>Alignment incoherence in ontology matching</article-title>
          .
          <source>PhD thesis</source>
          , University of Mannheim,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Heiko</given-names>
            <surname>Paulheim</surname>
          </string-name>
          , Sven Hertling, and
          <string-name>
            <given-names>Dominique</given-names>
            <surname>Ritzei</surname>
          </string-name>
          .
          <article-title>Towards evaluating interactive ontology matching tools</article-title>
          .
          <source>In The Semantic Web: Semantics and Big Data</source>
          , 10th International Conference, ESWC 2013, Montpellier, France, May
          <volume>26</volume>
          -30,
          <year>2013</year>
          . Proceedings, pages
          <fpage>31</fpage>
          -
          <lpage>45</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>Zlatan</given-names>
            <surname>Dragisic</surname>
          </string-name>
          , Valentina Ivanova, Patrick Lambrix, Daniel Faria, Ernesto Jimenez-Ruiz,
          <article-title>and Catia Pesquita. User validation in ontology alignment</article-title>
          .
          <source>In Proceedings of the International Semantic Web Conference</source>
          , volume
          <volume>9981</volume>
          of LNCS,
          <year>October 2016</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>