<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Alignment Results of SOBOM for OAEI 2010</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Peigang Xu</string-name>
          <email>peigang.xu@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yadong Wang</string-name>
          <email>ydwang@hit.edu.cn</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Liang Cheng</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tianyi Zang</string-name>
          <email>tianyi.zang@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>School of Computer Science and Technology Harbin Institute of Technology</institution>
          ,
          <addr-line>Harbin</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper we give a brief explanation of how Sub-Ontology based Ontology Matching (SOBOM) method gets the alignment results at OAEI2010. SOBOM deal with an ontology from two different views: an ontology with is-a hierarchical structure O' and an ontology with other relationships O''. Firstly, from the O' view, SOBOM starts with a set of anchors provided by a linguistic matcher. And then it extracts sub-ontologies based on the anchors and ranks these sub-ontologies according to their depths. Secondly, SOBOM utilizes Semantic Inductive Similarity Flooding algorithm to compute the similarity of concepts between different sub-ontologies derived from the two ontologies according the depth of sub-ontologies to get concept alignments. Finally, from the O'' view, SOBOM gets relationship alignments by using the concept alignment results in O''. The experiment results show SOBOM can find more alignment results than other compared relevant methods.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>System presentation</title>
      <p>
        Currently more and more ontologies are distributedly built and used by different
organizations. And these ontologies are usually light-weighted [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] containing lots of
concepts especially in biomedicine, such as anatomy taxonomy NCI Thesaurus. The
Sub-ontology based Ontology Matching (SOBOM) is designed for matching
lightweight ontologies that has is-a hierarchy as their backbones. It matches an ontology
from two views: O’ and O’’ that are depicted in Fig. 1. The unique feature of our
method is combining sub-ontology extraction with ontology matching.
1.1
      </p>
      <sec id="sec-1-1">
        <title>State, purpose, general statement</title>
        <p>
          SOBOM is developed to match ontology automatically for general purpose. Based on
two different views, we design three elementary matchers in current version. The first
one is a anchor generator which is used to find anchors; the second one is a structure
matcher SISF (Semantic Inductive Similarity Flooding) which is inspired by
AnchorPrompt [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] and SF [4] algorithms and is exploited to flood similarity among concepts.
The last one is a relationship matcher which utilizes the results of SISF to get
relationship alignments. In addition, a Sub-ontology Extractor (SoE) is integrated into
SOBOM to extract sub-ontologies according to the anchors got by linguistic matcher
and rank them by their depths descendingly. Overall SOBOM is a sequential method,
so it does not care how to combine the results of different matchers. The overview of
the method is illustrated in Fig. 2.
        </p>
        <p>O '</p>
        <p>O ''
O
O'</p>
        <p>Sub_O2 Sub_O1</p>
        <p>Sub_O2Sub_O1
that the local context of an entity can express the meaning of it. Consequently,
we get three similarity matrixes respectively, and we choose the maximal of
them as the final results.
z SISF uses the RDF statement to represent the ontology and utilizes the
anchors to inducting the construction of similarity propagation graph for the
sub-ontologies. SISF handles the ontology from the view O’ and only
generate concept-concept alignment.
z R-matcher is a relationship matcher base on the definition of the ontology. It
combines the linguistic and semantic information of a relation. From the O’’
view, it utilizes the is-a hierarchy to extend the domain and range of a
relationship and uses the result of SISF to generate the alignment between
relationships.</p>
        <p>More importantly, SoE is integrated into SOBOM and extracts sub-ontologies
according to the anchors [5,6]. SoE ranks extracted sub-ontologies according to their
depths. As we extract sub-ontologies for ontology matching, the rules of extracting
sub-ontology in SoE are as following: only sub-concepts of anchor are included in the
sub-ontology. In other words, a sub-ontology is a taxonomy which has anchor as root.</p>
        <p>If one of the two concepts in an anchor is a leaf node in the original ontology, we
do not use SISF to deal with it actually. Because this phenomenon just represents a
one-to-many mapping. After extracting sub-ontologies, SOBOM will match these
sub-ontologies according to their depth in original ontology. We first match the
subontologies with larger depth value. By using SoE, SOBOM can reduce the scale of
ontology and make it easy to operate sub-ontologies in SISF.
1.3</p>
      </sec>
      <sec id="sec-1-2">
        <title>Adaptations made for the evaluation</title>
        <p>We don’t make any specific adaptation for the tests in the OAEI 2010 campaign. All
the alignments outputted by SOBOM are based on the same set of parameters.
1.4</p>
      </sec>
      <sec id="sec-1-3">
        <title>Link to the system and parameters file</title>
        <p>The current version of SOBOM is available at:
http://mlg.hit.edu.cn:8080/Ontology/Download.jsp, and the parameters setting is
illustrated in the reading me file.
1.5</p>
      </sec>
      <sec id="sec-1-4">
        <title>Link to the set of provided alignments (in align format)</title>
        <p>We deploy our matcher as a web service, our web service name is:
eu.sealsproject.omt.ws.matcher.AlignmentWSImpl. The endpoint of our web service
can be found at: http://mlg.hit.edu.cn:8080/SOBOMService/SOBOMMatcher?wsdl.
2</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>Results</title>
      <p>In this section, we describe the results of SOBOM algorithm against the benchmark,
directory and anatomy ontologies provided by the OAEI 2010 campaign. We use
Jena-API to parse the RDF and OWL files. The experiments were carried out on a PC
running Windows vista ultimate with Core 2 Duo processors and 4-gigabyte memory.
2.1</p>
      <sec id="sec-2-1">
        <title>Benchmark</title>
        <p>On the basis of the nature, we can divide the benchmark dataset into five groups:
#101-104, #201-210, #221-247, #248-266 and #301-304. SOBOM is a sequential
matcher. If the linguistic matcher gets no results, SOBOM will produce no result. We
described the performance of our SOBOM algorithm over each group and overall
performance on the benchmark test set in Table 1.</p>
        <p>#101-104 SOBOM plays well for these test cases.</p>
        <p>#201-210 In this group, some linguistic features of candidate ontologies are
discarded or modified, their structures are quite similar. SOBOM is a sequential
matcher, our anchor generator matches concepts based on their local context not only</p>
        <p>Compared to our previous results (OAEI2009), the recall of every group is highly
improved. This is enhanced by our redesigned anchor generator.
2.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Anatomy</title>
      </sec>
      <sec id="sec-2-3">
        <title>2.3 Conference</title>
        <p>There are 120 pairs of ontologies in this track. Most of them are blind tests (i.e.
there no reference alignment available). The whole results are available at:
http://seals.inrialpes.fr/platform/;jsessionid=FD1E3A5CE8DA43C1D52DB21079EA
ECF3?wicket:bookmarkablePage=:eu.sealsproject.omt.ui.Results&amp;endpoint=http://21
9.217.238.162:8080/SOBOMService/SOBOMMatcher?wsdl&amp;evaluationID=http://21
9.217.238.162:8080/SOBOMService/SOBOMMatcher?wsdl2010/10/03+02:09:03&amp;tr
ack=Conference+Testsuite.
3</p>
      </sec>
      <sec id="sec-2-4">
        <title>General comments</title>
        <p>In this section, we want to introduce comments on the results of SOBOM algorithm
and the way to improve it.
3.1</p>
      </sec>
      <sec id="sec-2-5">
        <title>Comments on the results</title>
        <p>Strengths SOBOM deals with ontology from two different views and combines
results of every step in a sequential way. If the ontologies have regular literals and
hierarchical structures, SOBOM can achieve satisfactory alignments. And it can avoid
missing alignment in many partitioning matching methods as illustrated in [7].
Weaknesses SOBOM needs anchors to extract sub-ontologies. So it depends on the
precision of anchors. In current version, we use a linguistic matcher to get anchor
concept, if the literals of concept missed, SOBOM will get bad results.
3.2</p>
      </sec>
      <sec id="sec-2-6">
        <title>Discussions on the way to improve the proposed system</title>
        <p>SOBOM can be viewed as a frame of ontology matching. So many independent
matchers can be integrated into it. Now, we have enhanced the anchor generator by
not considering the textual information but also the structure information. Our next
plan is to integrate a more powerful matcher to produce anchor concepts or develop a
new method to get anchor concepts. Meanwhile, we plan to develop a mapping
debugging method to refine the results of SOBOM.
4</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Conclusion</title>
      <p>Ontology matching is very important part of establishing interoperability among
semantic applications. This paper reports our participation in OAEI2010 campaign.
We present the alignment process of SOBOM and describe the specific techniques for
ontology matching. We also show the performance in different alignment tasks. The
strengths and the weaknesses of our proposed approach are summarized and the
possible improvement will be made for the system in the future. We propose a brand
new algorithm to match ontologies. The unique feature of our method is combining
sub-ontology extraction with ontology matching based on two different views of an
ontology.
4. S. Melnik, H.G. Molina and E. Rahm: Similarity Flooding: A Versatile Graph Matching</p>
      <p>Algorithm, In Proc 18th Int’l Conf. Data Eng. (ECDE’02) (2002) 117-128.
5. Julian Seidenberg and Alan Rector: Web Ontology Segmentation: Analysis, Classification
and Use, WWW2006, (2006).
6. H. Stuckenschmidt and M. Klein: Structure-Based Partitioning of Large Class Hierarchies. In</p>
      <p>Proc of the 3rd International Semantic Web Conference (2004).
7. W. Hu, Y. Qu: Block matching for ontologies, In Proc of the 5th International Semantic Web</p>
      <p>Conference, LNCS, vol. 4273, Springer (2006) 300-313.
8. P.G. Xu, H.J. Tao: SOBOM: Sub-ontology based Ontology Matching. To be appear.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Fausto</given-names>
            <surname>Giunchiglia</surname>
          </string-name>
          and Ilya Aihrayeu :
          <article-title>Lightweight Ontologies</article-title>
          .
          <source>Technical Report</source>
          . (
          <year>2007</year>
          ) DIT-07-071.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>G.</given-names>
            <surname>Stoilos</surname>
          </string-name>
          , G. Stamou,
          <string-name>
            <surname>S.</surname>
          </string-name>
          <article-title>Kollias: A string metric for ontology alignment</article-title>
          ,
          <source>In Proc. Of the 4th International Semantic Web Conference(ISWC'05)</source>
          . (
          <year>2005</year>
          )
          <fpage>623</fpage>
          -
          <lpage>637</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>N.F.</given-names>
            <surname>Noy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.A.</given-names>
            <surname>Musen</surname>
          </string-name>
          <article-title>: Anchor-PROMPT: using non-local context for semantic matching</article-title>
          ,
          <source>In Proc. Of IJCAI2001 Workshop on Ontology and Information Sharing</source>
          , (
          <year>2001</year>
          )
          <fpage>63</fpage>
          -
          <lpage>70</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>