<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>OMReasoner: Combination of Multi-matchers for Ontology Matching: results for OAEI 2014</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Guohua Shen</string-name>
          <email>ghshen@nuaa.edu.cn</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yinling Liu</string-name>
          <email>ylliu@nuaa.edu.cn</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fei Wang</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jia Si</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Zi Wang</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Zhiqiu Huang</string-name>
          <email>zqhuang@nuaa.edu.cn</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dazhou Kang</string-name>
          <email>dzkang@nuaa.edu.cn</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>College of Computer Sci. &amp;Tech., Nanjing Univ. of Aeronautics and Astronautics</institution>
          ,
          <addr-line>Nanjing</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Ontology matching produces correspondences between entities of two ontologies. The OMReasoner is unique in that it creates an extensible framework for combination of multiple individual matchers, and reasons about ontology matching by using external dictionary WordNet and description logic reasoner. It handles ontology matching in both literal and semantic level, and it makes use of the semantic part of OWL-DL as well as structure. This paper describes the result of OMReasoner in the OAEI 2014 competition in three tracks: benchmark, conference, and MultiFarm.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>1.1</p>
      <p>State, purpose, general statement
The matching process can be defined as a function f.</p>
      <p>A’=f(O1, O2, A, p, r)</p>
      <p>Where O1 and O2 are a pair of ontologies as input to match, A is the input
alignment between these ontologies and A’ is new alignment returned, p is a set of
parameters (e.g., weight w and threshold τ) and r is a set of oracles and resources.</p>
      <p>Alignments express correspondences between two entities. A correspondence must
express two corresponding entities and the relation that is supposed to hold between
them. Given two ontologies, a correspondence is a 5-tuple:&lt;id,e1,e2,R,n&gt;, where
. id is a unique identifier of the given correspondence;
. e1 and e2 are the entities of the first and the second ontology respectively;
. R is a relation (e.g., equivalence(=), more general(&gt;), less general(&lt;), disjointness
 )) holding between e1 and e2. In OAEI campaign, equivalence is mainly considered;
. n is a confidence measure (typically in the [0 1] range) for the correspondence
between e1 and e2.</p>
      <p>OMReasoner</p>
      <p>O1</p>
      <p>O2
C1’,C2’,
R1’,R2’</p>
      <p>C1,C2,
R1,R2…
p ( w,τ)</p>
      <p>r dictionary
2 multi-matchers
matcher1
.
.
.
matchern</p>
    </sec>
    <sec id="sec-2">
      <title>A literal corresp.</title>
      <p>C1≡C1’
R1⊑R1’
Combination
3 reasoning</p>
    </sec>
    <sec id="sec-3">
      <title>A’ inference corresp.</title>
      <p>C2≡C2’
C2⊒C3’
R2⊓R2’
reference corresp.</p>
      <p>C2≡C2’
C2≡C3’
R2≡R2’
evaluation
results
C1,C2,
R1,R2
…
C1’,C2’,
R1’,R2’
…
1. Parsing: we can achieve the classes and properties of ontologies by using
ontology API: Jena.
2. Combination of multiple individual matchers: the literal correspondences (e.g.
equivalence) can be produced by using multiple match algorithms or matchers,
for example, string similarity measure (prefix, suffix, edit distance) by
stringbased, constrained-based techniques. Meanwhile, some semantic
correspondences can be achieved by using an external dictionary: WordNet.
Then the multiple match results can be combined by using specific strategy.
The framework of multi-matchers combination is supported, which facilitates
inclusion of new individual matchers.
3. Reasoning: the further semantic correspondences can be deduced by using DL
reasoner, which uses literal correspondences produced in step 2 as input.</p>
      <p>Finally, we evaluate the results against the reference alignments, and compute two
measures: precision and recall.</p>
      <p>In OMReasoner, the framework for multi-matchers is flexible, and any new
individual matcher can be included. Now, the instances of multi-matchers include
EditDistance and WordNet (see Fig.2).
1.2</p>
      <p>Specific techniques used</p>
    </sec>
    <sec id="sec-4">
      <title>1. Threshold</title>
      <p>Threshold is necessary for many matchers (especially syntactic ones) to determine
whether the similarity is regarded as equivalence. For example, the edit distance of
“book” and “booklet” is 3/7 (i.e., the similarity confidence measure is 1-3/7=0.57). If
the threshold is 0.55, then these two entities are equivalent (with confidence measure
0.57); else if threshold is 0.6, they are not. So, we have to tune our tool via threshold.
2. Combination of confidence measure</p>
      <p>Each individual matcher can produce correspondences with confidence measures.
All these confidence measures will be normalized before combination. OMReasoner
includes following flexible strategies to combine the multiple match results:
(a) weighted summarizing algorithm (WeightSum)
The confidence can be summarized by weighted similarity algorithm (see formula
1), where wk is the weight for a specific matcher k, and simk(e1,e2) is the
confidence measure of similarity (mainly equivalence) by this method.</p>
      <p>n
sim(e1,e2)  k 1 wk  simk(e1,e2),
n
where k1 wk  1.0
(b) maximum method (Max)
The maximum confidence measure is chosen among n matchers (see formula 2) .
sim(e1,e 2)  max(sim1(e1,e 2),...,simn(e1,e 2))</p>
    </sec>
    <sec id="sec-5">
      <title>3. semantic matching</title>
      <p>OMReasoner uses semantic matching methods like WordNet matcher and
description logic (DL) reasoning.</p>
      <p>WordNet1 is an electronic lexical database for English, where various senses
(possible meanings of a word or expression) of words are put together into sets of
synonyms. Relations between ontology entities can be computed in terms of bindings
between WordNet senses. This individual matcher uses an external dictionary:
WordNet to achieve semantic correspondences.</p>
    </sec>
    <sec id="sec-6">
      <title>1 http://wordnet.princeton.edu/</title>
      <p>
        (
        <xref ref-type="bibr" rid="ref1">1</xref>
        )
(
        <xref ref-type="bibr" rid="ref2">2</xref>
        )
      </p>
      <p>OMReasoner employs DL reasoner provided by Jena. OMReasoner includes
external rules to reason about the ontology matching. However, reasoning is time
consuming and only contributes a little to results. In this version, reasoning is skipped.
2</p>
      <p>Results：a comment for each dataset performed
In this section, we present the results obtained by OMReasoner in the OAEI 2014
competition. It participated in three tracks: benchmark, conference, and MultiFarm.
Tests were carried out on a PC running Windows Server 2008 R2 Standard with Intel
Core i5 processor running at 2.8 Ghz and 16 GB RAM.
2.1</p>
      <p>Benchmark
In this track, the ontologies can be divided into 3 categories(see Table 1) . In group 1,
the lexical information have been altered to change their labels or identifiers. This
alteration includes replacing the labels or identifiers with other names that follow a
particular naming convention, a random name, a misspelled name or a foreign word.
In group 2 have ontologies that have flattened hierarchies, expanded hierarchies or no
hierarchies at all. In group 3 the ontologies are the most challenging ones to align.
This is because labels have been scrambled such that they comprise a permutation of
letters of a particular length. We tune our tool by using threshold T and combination
strategy S, then get the better results (τwd=0.95, τed=0.9; S=Max). The results
obtained by OMReasoner in the benchmarks track are summarized in Table 2.</p>
      <p>Table 1. The categories of the Benchmark 2014
category
tests cases
concept
101-104
201-210
systematic
221-247
248-266
real ontology
301-304
The confidence data set consists of numerous real-world ontologies describing the domain
of organizing scientific conferences. We use Combination strategy to run our system
tool in Conference track. The results obtained by OMReasoner in the
benchmarks track are summarized in Table 3 (τwd=0.9, τed=0.8; S=Max).
MultiFarm track is composed of a subset of the Conference dataset, translated in eight
different languages. In this track, the ontologies can be divided into 2 categories. In
group 1 the alignments ontologies are the same. In group 2 the alignments ontologies
are different.</p>
      <p>Firstly, we take use of dictionary to translate different languages into English. Then,
the translated English is imported in multi-matchers by using Max strategy. Finally
we get the results. We tune our tool by using threshold, and the results can be
seen in Table 4(τwd=0.8, τed=0.6; S=Max), which show that the F-Measures of the
ontologies alignments in group 2 are obviously worse than those in group 1. We think
the reasons are that OMReasoner is not well designed to match different ontologies
which are written in completely different languages yet.</p>
      <p>To choose better threshold, we compare the results (see Table 5) across several
thresholds in Conference track. Still we use Max method to run our tool. From the
results, we find that when thresholdτwd=0.9,τed=0.8, our tool performs best. So
that we take use of thresholdτwd=0.9,τed=0.8 in Conference track. Using the same
method, we get the better thresholds for Benchmark and MultiFarm track.
Threshold</p>
      <sec id="sec-6-1">
        <title>General comments</title>
        <p>Discussions on the way to improve the proposed system
The performance of inference relies on the literal correspondences heavily, so more
accurate results which are exported from multi-matchers will greatly enhance the
results of our tool. Some approaches to improving our tool are listed as follows:
1. Adopt more flexible strategies in multi-matchers combination instead of just
weighed sum.
2. Add some preprocessing (see Fig.2), such as eliminating specific character
(e.g., ‘-’, ‘_’) or separating compound words, before words are imported into
matchers.
3. Take comments and label information of ontology into account, especially
when the name of concept is meaningless.
4. Reexamine the use of an appropriate threshold value to optimize accuracy.</p>
        <p>Another problem in our tool is that we ignore structure information among
ontology at the present stage. And we will improve it in the future.
3.2</p>
        <p>Comments on the OAEI procedure
OAEI procedure arranged everything in good order, furthermore SEALS platform
provides a uniform and convenient way to standardize and evaluate our tool.
3.3</p>
        <p>Comments on the OAEI test cases
The OAEI test cases involve all kinds of fields which include conference, anatomy,
language, etc. The variety of tracks and the improvements introduced along the years
makes the campaign very useful to test the performance of ontology aligners and
analyses their strengths and weaknesses. Nevertheless, we miss blind tests cases in
more tracks, which will allow a fair comparison between systems.
3.4</p>
        <p>Proposed new measures
After serious discussion, we believe that OMReasoner can improve a lot. Some new
ways proposed as follows:
1. Enrich the semantic dictionaries because WordNet is not a professional
dictionary, which cannot obtain more comprehensive semantic concepts.
2. Take into account hierarchical ones instead of only all concepts and properties.
3. Find NCI thesaurus for anatomy track.
4. Find different languages dictionaries for MultiFarm.
5. Improve the algorithm of some matchers.</p>
        <p>6. Include more different matchers.</p>
      </sec>
      <sec id="sec-6-2">
        <title>Conclusions</title>
        <p>In this paper, we presented the results of the OMReasoner system for aligning
ontologies in the OAEI 2014 competition in three tracks: benchmark, conference, and
MultiFarm. The combination strategy of multiple individual matchers and DL
reasoner are included in our approach. This is the third time we participate the OAEI,
the results are still not satisfying and we will improve it in the future.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Rahm</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Bernstein</surname>
            ,
            <given-names>P.:</given-names>
          </string-name>
          <article-title>A survey of approaches to automatic schema matching</article-title>
          .
          <source>The VLDB Journal,</source>
          ,
          <volume>10</volume>
          (
          <issue>4</issue>
          ):
          <fpage>334</fpage>
          --
          <lpage>350</lpage>
          (
          <year>2001</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Shvaiko</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Euzenat</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>A survey of schema-based matching approaches</article-title>
          .
          <source>Journal on Data Semantics (JoDS) IV</source>
          ,
          <fpage>146</fpage>
          --
          <lpage>171</lpage>
          (
          <year>2005</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Kalfoglou</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Schorlemmer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Ontology mapping: the state of the art</article-title>
          .
          <source>The Knowledge Engineering Review Journal</source>
          ,
          <volume>18</volume>
          (
          <issue>1</issue>
          ):
          <fpage>1</fpage>
          --
          <lpage>31</lpage>
          , (
          <year>2003</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Shvaiko</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Iterative Schema-based Semantic Matching</article-title>
          .
          <source>PhD</source>
          , University of Trento, (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Jian</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hu</surname>
          </string-name>
          , W., Cheng, G. et al:
          <article-title>Falcon-AO: Aligning Ontologies with Falcon</article-title>
          .
          <source>In: Proceedings of the K-CAP Workshop on Integrating Ontologies</source>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Do</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <article-title>and</article-title>
          <string-name>
            <surname>Rahm</surname>
          </string-name>
          , E.:
          <article-title>COMA- a system for flexible combination of schema matching approaches</article-title>
          .
          <source>In: Proceedings of the International Conference on Very Large Databases</source>
          ,
          <fpage>610</fpage>
          --
          <lpage>621</lpage>
          .(
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Giunchiglia</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shvaiko</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Yatskevich</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>S-Match: an algorithm and an implementation of semantic matching</article-title>
          .
          <source>In: Proceedings of the European Semantic Web Symposium</source>
          ,
          <fpage>61</fpage>
          --
          <lpage>75</lpage>
          .(
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Kalfoglou</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Schorlemmert</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>If-map: an ontology mapping method based on information flow theory</article-title>
          .
          <source>In: Proceedings of ISWC'03</source>
          , Workshop on Semantic Integration, (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Bouquet</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Serafini</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Zanobini</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Semantic coordination: A new approach and an application</article-title>
          .
          <source>In: Proceedings of the International Semantic Web Conference</source>
          ,
          <fpage>130</fpage>
          --
          <lpage>145</lpage>
          .(
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Calvanese</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McGuinness</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , et al.:
          <article-title>The description logic handbook: theory, implementations and applications</article-title>
          . Cambridge University Press, (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Ehrig</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sure</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Ontology mapping - an integrated approach</article-title>
          .
          <source>In Proceedings of the European Semantic Web Symposium (ESWS)</source>
          ,
          <fpage>76</fpage>
          --
          <lpage>91</lpage>
          , (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <article-title>RacerPro User Guide</article-title>
          . http://www.racer
          <article-title>-systems</article-title>
          . com,
          <year>2005</year>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Do</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Melnik</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rahm</surname>
          </string-name>
          , E.:
          <article-title>Comparison of Schema Matching Evaluations</article-title>
          .
          <source>In: Proceedings of the 2nd Intl. Workshop on Web Databases</source>
          , Erfurt, Germany:,
          <fpage>221</fpage>
          --
          <lpage>237</lpage>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Shen</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jin</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhao</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jia</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>OMReasoner: using reasoner for ontology matching: results for OAEI 2011</article-title>
          .
          <source>In Proceedings of the 6th International Workshop on Ontology Matching.</source>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Shen</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tian</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ge</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liao</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          , and Kang D.:
          <article-title>OMReasoner: using multi-matchers and reasoner for ontology matching: results for OAEI 2012</article-title>
          .
          <source>In Proceedings of the 7th International Workshop on Ontology Matching.</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>