<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>⋆ Is my ontology matching system similar to yours?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ernesto Jime´nez-Ruiz</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bernardo Cuenca Grau</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ian Horrocks</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, University of Oxford</institution>
          ,
          <addr-line>Oxford</addr-line>
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper we extend the evaluation of the OAEI 2012 Large BioMed track, which involves the matching of the semantically rich ontologies FMA, NCI and SNOMED CT. Concretely, we report about the differences and similarities among the mappings computed by the participant ontology matching systems. The quality of the mappings computed by an ontology matching system in the Ontology Alignment Evaluation Initiative (OAEI) [2, 1] is typically measured in terms of precision and recall with respect to a reference set of mappings. Additionally, the OAEI also evaluates the coherence of the computed mappings [1]. However, the differences and similarities among the mappings computed by different systems have often been neglected in the OAEI.1 In this paper we provide a more fine-grained comparison among the matching systems participating in the OAEI 2012 Large BioMed track;2 concretely (i) we have harmonised (i.e. voted) the computed mapping sets, and (ii) we provide a graphical representation of the similarity of these sets. We have considered the mappings voted (i.e. included in the output) by at least one ontology matching system. Figure 1 shows the harmonization (i.e. voting) results for the FMA-NCI and FMA-SNOMED matching problems. Mappings have received at most 11 and 8 votes (i.e. number of participating systems3), respectively. For example, in the FMA-NCI matching problem, 3,719 mappings have been voted by at least 2 systems. Figure 1 also shows the evolution of F-score, Precision and Recall for the different harmonized mapping sets. As expected the maximum recall (respectively precision) is reached with the minimum (respectively maximum) number of votes. For example, the maximum recall in the FMA-SNOMED problem is 0.81, which shows the difficulty of identifying correct mappings in this matching problem. The harmonized mapping sets with the best trade-off between precision and recall have been selected as the representative mapping sets of the participating ontology matching systems. For the FMA-NCI matching problem we have selected the mappings sets with (at least) 3, 4 and 5 votes, while in the FMA-SNOMED matching problem we have selected the sets with (at least) 2 and 3 votes (see dark-grey bars in Figure 1). ⋆ This research was financed by the Optique project with the grant agreement FP7-318338. 1 As far as we know, only in the 2007 Anatomy track some effort was done in this line: http: //oaei.ontologymatching.org/2007/results/anatomy/ 2 Results available at: http://www.cs.ox.ac.uk/isg/projects/SEALS/oaei/ 3 Systems with several variants have only been considered once in the voting process.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Introduction</p>
      <p>Mapping harmonization</p>
    </sec>
    <sec id="sec-2">
      <title>LLooggMMaapp-noe</title>
      <sec id="sec-2-1">
        <title>ServOMapL YAM++</title>
      </sec>
      <sec id="sec-2-2">
        <title>ServOMap</title>
      </sec>
      <sec id="sec-2-3">
        <title>Vote5</title>
      </sec>
      <sec id="sec-2-4">
        <title>Vote4</title>
      </sec>
      <sec id="sec-2-5">
        <title>Vote3</title>
      </sec>
      <sec id="sec-2-6">
        <title>GOMMA</title>
      </sec>
      <sec id="sec-2-7">
        <title>GOMMA-Bk</title>
      </sec>
      <sec id="sec-2-8">
        <title>UMLSL</title>
      </sec>
      <sec id="sec-2-9">
        <title>UMLSA</title>
        <p>UMLS
spapng
i
m
fr
boeu
m
N
11,160
17,020
3,719 3,074 2,862 2,739 2,563 2,420 2,292 2,077 1,630
812
1 2 3 4 5Number5of votes7 8 9 10 11
7,997
6,592
4,218</p>
        <p>UUMMLSLUSLMALS</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>YSAeMrv+O+MapL</title>
    </sec>
    <sec id="sec-4">
      <title>SVeortveO3Map</title>
      <p>LLooggMMaapp-noe
0 0LogMapLt
0.10
0.20
Jaccard distance
0.30
0.40
0 0 LogMapLt
0.10
0.20
0.30 Jaccard distance 0.50
0.40</p>
      <p>
        Mapping similarity among systems
We have compared the similarity among (i) the representative mapping sets from the
harmonisation (see Section 2), (ii) the UMLS-based reference mappings of the track,
and (iii) the mapping sets computed by the top-8 ontology matching systems in the
FMA-NCI and FMA-SNOMED matching problems [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. To this end we have calculated
the jaccard distance (|MA ∪ MB | − |MA ∩ MB |)/|MA ∪ MB |, which ranges from 0
(the same) to 1 (different), between each pair (MA and MB ) of the mapping sets from
(i)-(iii), and represented such distances in a two-dimensional scatterplot (see Figure 2).
System names which are distant to each other indicate that their computed mappings
differ to a large degree. For example, in Figure 2 (right), the mappings computed by
LogMapLt and GOMMA are very different with respect to the mappings computed by
other systems, as well as with respect to the harmonized and reference mapping sets.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Aguirre</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , et al.:
          <article-title>Results of the Ontology Alignment Evaluation Initiative 2012</article-title>
          . In: Ontology Matching Workshop. Vol-
          <volume>946</volume>
          <source>of CEUR Workshop Proceedings</source>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Euzenat</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meilicke</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stuckenschmidt</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shvaiko</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Trojahn</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Ontology alignment evaluation initiative: Six years of experience</article-title>
          .
          <source>J. Data Sem</source>
          .
          <volume>15</volume>
          ,
          <fpage>158</fpage>
          -
          <lpage>192</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>