<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Variations on Aligning Linked Open Data Ontologies</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Valerie Cross</string-name>
          <email>crossv@muohio.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Chen Gu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Xi Chen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Weiguo Xia</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Peter Simon</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Computer Science and Software Engineering Department Miami University</institution>
          ,
          <addr-line>Oxford, OH 45056</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Traditional OA systems are not as suitable for aligning LOD ontology schemas; for example, equivalence relations are limited among LOD concepts so that OA systems for LOD ontology alignment also find subclass and superclass relations. Four recent approaches for LOD ontology alignment are BLOOMS (BL) [1] and BLOOMS+ [2], AgreementMaker (AM) [3], WikiMatch (WM) [4], and Holistic Concept Mapping (HCM) [5]. Table 1 briefly compares these systems for aligning LOD ontologies.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>BL/BL+
AM
WM
HCM</p>
    </sec>
    <sec id="sec-2">
      <title>Mapping type</title>
    </sec>
    <sec id="sec-3">
      <title>Equivalence, subclass</title>
    </sec>
    <sec id="sec-4">
      <title>Equivalence, subclass, superclass</title>
    </sec>
    <sec id="sec-5">
      <title>Equivalence</title>
    </sec>
    <sec id="sec-6">
      <title>Equivalence, similar to, disjoint</title>
    </sec>
    <sec id="sec-7">
      <title>Knowledge</title>
    </sec>
    <sec id="sec-8">
      <title>Source</title>
    </sec>
    <sec id="sec-9">
      <title>Wikipedia category hierarchy</title>
    </sec>
    <sec id="sec-10">
      <title>WordNet,</title>
      <p>other LOD
ontologies,
i.e.,DBpedia
or FOAF</p>
    </sec>
    <sec id="sec-11">
      <title>Wikipedia articles</title>
    </sec>
    <sec id="sec-12">
      <title>Wikipedia category hierarchy</title>
    </sec>
    <sec id="sec-13">
      <title>Data</title>
    </sec>
    <sec id="sec-14">
      <title>Structure</title>
    </sec>
    <sec id="sec-15">
      <title>Concept category trees in a forest</title>
    </sec>
    <sec id="sec-16">
      <title>Lexicon</title>
      <p>for a
concept
sets of
articles</p>
    </sec>
    <sec id="sec-17">
      <title>Concept category trees in a forest.</title>
    </sec>
    <sec id="sec-18">
      <title>Algorithms</title>
    </sec>
    <sec id="sec-19">
      <title>Tree overlap</title>
      <p>using node
depth,
contextual
similarity on
superconcepts
Advanced
Similarity
Matcher,
inferencing on
import concept</p>
    </sec>
    <sec id="sec-20">
      <title>Jaccard index</title>
      <p>on article sets
IR tf-idf on
comment, label
keyword, topic
sets ppjoin
with Jaccard</p>
    </sec>
    <sec id="sec-21">
      <title>Experiment</title>
      <p>description</p>
    </sec>
    <sec id="sec-22">
      <title>LOD reference</title>
      <p>alignments,</p>
    </sec>
    <sec id="sec-23">
      <title>Proton mappings</title>
      <p>to DBpedia,</p>
    </sec>
    <sec id="sec-24">
      <title>Geonames,</title>
    </sec>
    <sec id="sec-25">
      <title>Freebase.</title>
      <p>
        LOD reference
alignments
OAEI 2011.5
conference track,
multifarm dataset
Concepts from
triples of Billion
Triple Challenge
dataset, expert
evaluation
Unlike the Ontology Alignment Evaluation Initiative (OAEI), no standard reference
alignment exists for LOD ontologies. Researchers [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] had experts develop a
benchmark, the LOD reference alignments between ontology schema pairs taken from
eight LOD ontologies: AKT Reference (A), BBC program(B), DBpedia (D),
FOAF(F), Geonames(G), Music(M), SIOC (S), and the Semantic Web Conference
(W) because of their substantial LOD coverage domain diversity, and publicly
available schemas. Experts produced both subclass and equivalence mappings
between the pairs listed in Table 2. BLOOMS and AM are compared in the last two
columns [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] since WM or HCM produce only equivalence mappings and few of these
% inferable
# mappings
exist in the LOD reference alignments. Both BLOOMS and AM use inferencing to
produce some subclass mappings, BLOOMS using post-processing with the Jena
reasoner and AM using its own inferencing techniques. To understand this influence,
we performed an analysis on the LOD reference alignment for each pair to see the
percentage of its mappings inferrable from its equivalence mappings, given in column
1. The * for M, B indicates an analysis was not possible since many BBC concepts
could not be found directly in its file or even when opening the file using Protégé.
BLOOMS has better recall except for F,D and G,D. F,D has 87% inferable mappings
from its three equivalence relations. AM’s use of other LOD ontologies and WordNet
contributes to finding more correct mappings. For the G,D pair BLOOMS does not
find the only equivalence relation SpatialThing = Place so that Jena cannot produce
any of the inferable mappings. AM finds this mapping, likely from the comment field
for SpatialThing including the word ‘places.’ AM finds 68% (recall) of the reference
alignment mappings, very close to the 71% inferable mappings. Of the five remaining
pairs, AM has better precision for M,D with the smallest percentage of inferable
mappings. BLOOMS with Wikipedia finds more correct mappings since very few are
from inferable relations. AM’s lower recall corresponds with fewer inferable
relations, but those that it does find are more likely correct with its 0.62 precision
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Jain</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hitzler</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sheth</surname>
            ,
            <given-names>A. P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Verma</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Yeh</surname>
            ,
            <given-names>P. Z.</given-names>
          </string-name>
          :
          <article-title>Ontology Alignment for Linked Open Data</article-title>
          .
          <source>In: Proceedings of the International Semantic Web Conference (ISWC)</source>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Jain</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yeh</surname>
          </string-name>
          , P. Z,
          <string-name>
            <surname>Verma</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vasquez</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Damova</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hitzler</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Sheth</surname>
            ,
            <given-names>A. P.</given-names>
          </string-name>
          :
          <article-title>Contextual Ontology Alignment of LOD with an Upper Ontology:A Case Study with Proton</article-title>
          .
          <source>In Proceedings of the Extended Semantic Web Conference (ESWC)</source>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Cruz</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Palmonari</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Caima</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Stroe</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Towards “On the Go” Matching of Linked Open Data Ontologies</article-title>
          .
          <source>In: Workshop on Discovering Meaning On the Go in Large Heterogeneous Data (LHD)</source>
          ,
          <source>The 22nd International Joint Conference on Artificial Intelligence (IJCAI-11)</source>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Hertling</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Paulheim</surname>
          </string-name>
          , H.:
          <article-title>WikiMatch - Using Wikipedia for Ontology Matching</article-title>
          , in: Seventh International Workshop on Ontology Matching (OM
          <year>2012</year>
          )
          <article-title>(</article-title>
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Grutze</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bohm</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          .
          <string-name>
            <surname>Naumann</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Holistic and Scalable Ontology Alignment for Linked Open Data</article-title>
          .
          <source>In: Workshop on Linked Data on the Web (LDOW) at WWW</source>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>