<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Anchor-Flood: Results for OAEI 2009</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Md. Hanif Seddiqui</string-name>
          <email>hanif@kde.ics.tut.ac.jp</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Masaki Aono</string-name>
          <email>aono@ics.tut.ac.jp</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Toyohashi University of Technology</institution>
          ,
          <country country="JP">Japan</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Our ontology schema matching algorithm takes the essence of the locality of reference by considering the neighboring concepts and relations to align the entities of ontologies. It starts off a seed point called an anchor (a pair of “look-alike” concepts across ontologies) and collects two blocks of neighboring concepts across ontologies. The concepts of the pair of blocks are aligned and the process is repeated for newly found aligned pairs. This year, we use a semantically reformed dynamic block of concepts starting from an anchor-concept and produce two blocks from one anchor to get alignment. We improve our memory management. The experimental results show its effectiveness against the benchmark, anatomy track and other datasets. We also extend our algorithm to match instances of IIMB benchmarks and we obtained effective results.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Presentation of the system</title>
      <p>During OAEI-2008, our ontology alignment system used the locality of reference for
collecting neighboring concepts with strong semantic arbitrary depth for aligning
concepts across pair of ontologies. This year, we incorporate a process of collecting
concepts with strong intrinsic semantic similarity within ontology elements
considering intrinsic Information Content (IC) [6] to form a dynamic block. Hence
our system forms a pair of dynamic blocks staring off an anchor across ontologies.
We improve our memory management to cope large scale ontology alignment
effectively. Our algorithm has shorter run time than that of the previous year. It takes
less memory and even less time as well to align large ontologies. We participate in the
benchmark datasets, all four tasks of anatomy track, conference and directory as well.
We also take limited participation in the instance matching track. We participate only
in the IIMB benchmark track of instance matching track.
1.1</p>
      <sec id="sec-1-1">
        <title>State, purpose, general statement</title>
        <p>The purpose of our Anchor-Flood algorithm [8] is basically ontology matching.
However, we use our algorithm in patent mining system to classify a research abstract
in terms of International Patent Classification (IPC). Containing mostly general
terminologies in an abstract leads classification to a formidable task. Automatic
extracted taxonomy of related terms available in an abstract is aligned with the
taxonomy of IPC ontology with our algorithm successfully.</p>
        <p>Furthermore, we use our algorithm to integrate the multimedia resources represented
by MPEG-7 [5] ontologies [11]. We have achieved good performance with effective
results in the field of multimedia resource integration [7].</p>
        <p>To be specific, we describe our Anchor-Flood algorithm, instance matching algorithm
and their results against OAEI 2009 datasets here.
1.2</p>
      </sec>
      <sec id="sec-1-2">
        <title>Specific techniques used</title>
        <p>We have two parts of our system. One is the ontology schema matching
AnchorFlood algorithm to align concepts and properties of a pair of ontologies. Another is
the instance matching approach which essentially uses our Anchor-Flood algorithm.
We implement our system in Java. We create our own memory model of ontology by
the ARP triple parser of Jena module.</p>
      </sec>
      <sec id="sec-1-3">
        <title>1.2.1 Ontology Schema Matching</title>
        <p>As a part of preprocessing, our system parses ontologies into our own developed
memory model by using ARP triple parser of Jena. We also normalize the lexical
description of ontology entities.
Our schema matching algorithm starts off an anchor. It has a complex process of
collecting small blocks of concepts and related properties dynamically by considering
super-concept, sub-concept, siblings and few other neighbors from the anchor point.
The size of blocks affect the running time adversely. Therefore, we incorporate
semantic similarity considering intrinsic Information Content (IC) for building blocks
of neighboring concepts from anchor-concepts.</p>
        <p>Local alignment process aligns concepts and their related properties based on lexical
information [2, 10, and 12], and structural relations [1, 3, 4]. Retrieved aligned pairs
are considered as anchors for further processing. The process is repeated until there is
no more aligned pair to be processed. Hence, it burst out with a pair of aligned
fragment of the ontologies, giving the taste of segmentation [9]. Multiple anchors
from different part of ontologies confirm a fair collection of aligned pairs as a whole.</p>
      </sec>
      <sec id="sec-1-4">
        <title>1.2.2 Ontology Instance Matching</title>
        <p>In an ontology, neither a concept nor an instance comprises its full specification in its
name or URI alone. Therefore we consider the semantically linked information that
includes linked concepts, properties and their values and other instances as well. They
all together make an information cloud to specify the meaning of that particular
instance. We refer this collective information of association as Semantic Link cloud.
The degree of certainty is proportional to the number of semantic link associated to a
particular instance by means of property values and other instances. First, pair of
TBox is aligned with our Anchor-Flood algorithm. Then, we check the alignment of
the type of an instance to any concept of the neighbors of the type of another instance
across ABox. We measure the structural similarity among the elements available in a
pair of clouds to produce instance alignment. The instance matching algorithm is
depicted in Fig. 2 and in Fig. 3.
The Anchor-Flood algorithm needs an anchor to start off. Therefore, we use a tiny
program module for extracting probable aligned pairs as anchors. It uses lexical
information and some statistical information to extract a small number of aligned
pairs from different part of ontologies. The program is essentially smaller, simpler
and faster. We also removed the subsumption module of our algorithm to keep it
faster.
1.4</p>
      </sec>
      <sec id="sec-1-5">
        <title>Link to the system and parameters file</title>
        <p>The version of Anchor-Flood for OAEI-2009 can be downloaded from our website:
http://www.kde.ics.tut.ac.jp/~hanif/res/2009/anchor_flood.zip. The parameter file is
also included in the anchor_flood.zip file. I recommend readers to read the readme.txt
file first. The file includes the necessary description and parameters as well in brief.
1.5</p>
      </sec>
      <sec id="sec-1-6">
        <title>Link to the set of provided alignments (in align format)</title>
        <p>The results for OAEI-2008 are available at our website:
http://www.kde.ics.tut.ac.jp/~hanif/res/2009/aflood.zip.
2</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>Results</title>
      <p>2.1</p>
      <p>benchmark
In this section, we describe the results of Anchor-Flood algorithm against the
benchmark, anatomy, directory and conferences ontologies and the IIMB instance
matching benchmark provided by the OAEI 2009 campaign.
We also participate directory and conference track this year for the first time.
2.4</p>
      <sec id="sec-2-1">
        <title>Instance Matching: IIMB Benchmarks</title>
        <p>On the basis of transformation, the benchmark dataset is divided into four groups:
001-010, 011-019, 020-029 and 030-037. Table 3 shows the precision and recall for
each of the groups. However, the detailed results are displayed in Annex section of
this paper.</p>
        <p>Table 3. Instance matching results against IIMB benchmarks</p>
      </sec>
      <sec id="sec-2-2">
        <title>Datasets</title>
      </sec>
      <sec id="sec-2-3">
        <title>Trnasformation</title>
        <p>001-010
011-019
020-029
030-037</p>
        <p>Value transformations
Structural transformations
Logical transformations
Several combinations of the
previous transformations</p>
      </sec>
      <sec id="sec-2-4">
        <title>Prec.</title>
        <p>0.99
0.72
1.00
0.75
Rec.
0.99
0.79
0.96
0.82</p>
      </sec>
      <sec id="sec-2-5">
        <title>F-Measure</title>
        <p>0.991
0.751
0.981
0.786
3</p>
      </sec>
      <sec id="sec-2-6">
        <title>General comments</title>
        <p>In this section, we want to comment on the results of our system and the way to
improve it.
3.1</p>
      </sec>
      <sec id="sec-2-7">
        <title>Comments on the results</title>
        <p>The main strength of our schema matching system is the way of minimizing the
comparisons between entities, which leads enhancement in running time. In instance
matching, our system shows its strength over value and logical transformations.
The weak points are: our system ignores some distantly placed aligned pairs in
ontology alignment system. In instance matching, we have still rooms to work in
structural transformation.
3.2</p>
      </sec>
      <sec id="sec-2-8">
        <title>Discussions on the way to improve the proposed system</title>
        <p>It has still rooms of improving alignments strengthening the semantic and structural
analysis and adding background knowledge. We also want to incorporate complex
alignment like subsumption and 1:n alignments. In instance matching, we want to
improve our system against structural transformation.
4</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Conclusion</title>
      <p>Ontology matching is very important for attaining interoperability as the core of every
semantic application is ontology. We implemented faster algorithm to align specific
interrelated parts across ontologies, which gives the flavor of segmentation. The
anatomical ontology matching shows the effectiveness of our Anchor-Flood
algorithm. Our instance matching algorithm also shows its strength in value and
logical transformations. In structural transformation our algorithm is also effective in
spite of challenging transformation. We improved our previous Anchor-Flood
algorithm in several perceptions to retrieve ontology alignment. Furthermore, we
improve the versatility of using it in different applications including instance
matching, patent classification and multimedia resource integration.
6.
7.
8.
9.
10.
11.
12.
018
019
020
021
022
023
024
025
026
027
028
029
030
031
032
033
034
035
036
037
51
26
93
93
93
93
93
93
93
93
93
93
65
26
99
95
76
36
95
30</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Bouquet</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Serafini</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Zanobini</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Semantic Coordination: A New Approach and an Application</article-title>
          .
          <source>Proceedings of the 2nd International Semantic Web Conference (ISWC2003)</source>
          , Sanibel Island, Florida, USA (
          <year>2003</year>
          ) pp.
          <fpage>130</fpage>
          -
          <lpage>145</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <source>Proceedings of the 16th European Conference on Artificial Intelligence (ECAI2004)</source>
          , Valencia, Spain (
          <year>2004</year>
          ) pp.
          <fpage>333</fpage>
          -
          <lpage>337</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Giunchiglia</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Shaiko</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          : Semantic Matching,
          <source>The Knowledge Engineering Review</source>
          , Cambridge Univ Press, Vol.
          <volume>18</volume>
          (
          <issue>3</issue>
          ),
          <year>2004</year>
          , pp.
          <fpage>265</fpage>
          -
          <lpage>280</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Giunchiglia</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shvaiko</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Yatskevich</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>S-Match: an Algorithm and an Implementation of Semantic Matching</article-title>
          .
          <source>Proceedings of the 1st European Semantic Web Symposium (ESWS2004)</source>
          , Heraklion, Greece, (
          <year>2004</year>
          ) pp.
          <fpage>61</fpage>
          -
          <lpage>75</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Nack</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Lindsay</surname>
          </string-name>
          , A.T.:
          <article-title>Everything you wanted to know about MPEG-7 (</article-title>
          <string-name>
            <surname>Part</surname>
            <given-names>I).</given-names>
          </string-name>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <source>IEEE Multimedia</source>
          , Vol.
          <volume>6</volume>
          (
          <issue>3</issue>
          ),
          <year>1999</year>
          , pp.
          <fpage>65</fpage>
          --
          <lpage>77</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <source>Proceedings of the 14th International Joint Conference on Artificial Intelligence.</source>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Montreal</surname>
          </string-name>
          , Canada (
          <year>1995</year>
          ) pp.
          <fpage>448</fpage>
          -
          <lpage>453</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Seddiqui</surname>
            ,
            <given-names>M.H.</given-names>
          </string-name>
          <article-title>and</article-title>
          <string-name>
            <surname>Aono</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>MPEG-7 based Multimedia Information Integration through Instance Matching</article-title>
          . Berkeley, IEEE International Conference on Semantic Computing, CA, USA (
          <year>2009</year>
          ) pp.
          <fpage>618</fpage>
          -
          <lpage>623</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Seddiqui</surname>
            ,
            <given-names>M.H.</given-names>
          </string-name>
          <article-title>and</article-title>
          <string-name>
            <surname>Aono</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>An Efficient and Scalable Algorithm for Segmented Alignment of Ontologies of Arbitrary Size</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          (
          <year>2008</year>
          ), doi:10.1016/j.websem.
          <year>2009</year>
          .
          <volume>09</volume>
          .001.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Seidenberg</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Rector</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Web Ontology Segmentation: Analysis, Classification and Use</article-title>
          .
          <source>Proceedings of the 15th International Conference on World Wide Web (WWW2006)</source>
          , Edinburgh, Scotland (
          <year>2006</year>
          ) pp.
          <fpage>13</fpage>
          -
          <lpage>22</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <source>Proceedings of the 4th International Semantic Web Conference (ISWC2005)</source>
          , Galway, Ireland (
          <year>2005</year>
          ) pp.
          <fpage>623</fpage>
          -
          <lpage>637</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Troncy</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , et al.:
          <article-title>Mpeg-7 based Multimedia Ontologies: Interoperability Support or Interoperability Issue</article-title>
          .
          <source>Proceedings of the 1st International Workshop on Multimedia Annotation</source>
          and
          <article-title>Retrieval enabled by Shared Ontologies (MAReSO), Genova</article-title>
          , Italy (
          <year>2007</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>Technical</given-names>
            <surname>Report</surname>
          </string-name>
          , Statistical Research Division, U.S. Census Bureau, Washington, USA (
          <year>1999</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>