<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>GMRs: Reconciliation of Generic Merge Requirements in Ontology Integration</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Logic Properties</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Heinz-Nixdorf Chair for Distributed Information Systems Institute for Computer Science, Friedrich Schiller University Jena</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>With growing popularity of ontologies as semantics-aware integration solutions, various ontology merging methods have been proposed. Each of them uses their own set of criteria to determine a desirable merge result. In this work, we rst categorize these criteria into generic merge requirements (GMRs). Not all of these requirements can be met simultaneously. We argue that users should be able to select those requirements that are most important to them, but that system support is required to determine a compatible set of requirements based on user inputs. We propose a graph-theory based approach to determining such sets.</p>
      </abstract>
      <kwd-group>
        <kwd>Ontology merging</kwd>
        <kwd>Merge requirements</kwd>
        <kwd>Graph theory</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>An ontology is a formal explicit description of a domain. It contains a set
of entities including classes, properties, and instances. Given a set of input
ontologies and a set of correspondences between them, an ontology merging
process creates a new merged ontology. One aspect that various approaches to
ontology merging di er in is the set of criteria they aim to ful ll, i.e., what are
the requirements they expect the merged ontology to meet. We have conducted
an analysis of the literature and determined which criteria, here called generic
merge requirements (GMR), are used by di erent approaches. Our ultimate aim
is to provide a exible merging approach, where users can actively choose which
requirements are important to them, instead of allowing only a very indirect
choice by picking the right merging method, something that is not transparent
to the user. Unfortunately, not all GMRs are compatible with each other. For
instance, one may want to preserve all classes contained in the original ontology
in the merged ontology. On the other hand, one could wish to achieve class
acyclicity. Likely, these goals con ict. Thus, once a user has chosen which GMRs
they consider important, a system is needed that is able to check whether these
are compatible and which other requirements can be met simultaneously.</p>
      <p>In this paper, we take the rst step towards this goal: We analyse the
literature to compile a list of GMRs, we provide rst insights into their
compatibility and we describe a graph-theory based method for determining
maximal sets of compatible GMRs.</p>
      <p>Completeness
 . Class preservation
 . Property preservation
 . Instance preservation
 4. Correspondence preservation
 5. Correspondences’ Property</p>
      <p>preservation
 6. Property’s value preservation
 7. Structure preservation</p>
      <p>Integrity</p>
      <p>Minimality
 8. Class redundancy prohibition
 9. Property redundancy</p>
      <p>prohibition
 . Instance redundancy</p>
      <p>prohibition
 . Extraneous entity
prohibition</p>
      <p>Deduction
 12. Entailment deduction
satisfaction</p>
      <p>Constraint
  . One type restriction
 14. Property value’s constraint
 . Property’s domain and</p>
      <p>range oneness
Model Properties</p>
      <p>Acyclicity
 . Acyclicity in the class hierarchy
 . Acyclicity in the property</p>
      <p>hierarchy
 18. Prohibition of properties
being inverses of themselves</p>
      <p>Connectivity
 19. Unconnected class prohibition
  . Unconnected property</p>
      <p>prohibition</p>
      <p>Legend Dimension Aspect GMR</p>
    </sec>
    <sec id="sec-2">
      <title>GMRs Overview and Classi cation</title>
      <p>
        To discover the most commonly used GMRs, we have studied and analyzed three
di erent research areas including (i) ontology merging methods [
        <xref ref-type="bibr" rid="ref1 ref10 ref2 ref3 ref6 ref7">1,2,3,6,7,10</xref>
        ], (ii)
ontology merging benchmarks [
        <xref ref-type="bibr" rid="ref4 ref9">4,9</xref>
        ], and (iii) ontology engineering [
        <xref ref-type="bibr" rid="ref5 ref8">5,8</xref>
        ]. Overall,
we classify the GMRs according to three dimensions subdivided into di erent
aspects (see Fig. 1). The three dimensions we identi ed are:
{ Integrity: This dimension refers to the degree of knowledge coverage in
the merge process via (i) the completeness aspect, and to the amount of
knowledge redundancy by (ii) the minimality aspect.
{ Model Properties: Within this dimension, the principles of creating a new
ontology model are investigated. In this dimension, (i) acyclicity, and (ii)
connectivity satisfaction aspects are taken into account.
{ Logic Properties: The inference of the expected knowledge with involved
constraints is analyzed in this dimension. This includes (i) deduction, and
(ii) constraint satisfaction aspects.
      </p>
      <p>
        We determined six aspects that GMRs can be grouped into:
Completeness refers to knowledge preservation and coverage:
R1. Class preservation: Each class in (all/target) 1 input ontologies should have
a mapped class in the merged ontology [
        <xref ref-type="bibr" rid="ref1 ref10 ref3 ref6 ref7">1,3,6,7,10</xref>
        ].
      </p>
      <p>
        R2. Property preservation: Each property from the (all/target) input ontologies
is explicitly in or implied by the merged ontology [
        <xref ref-type="bibr" rid="ref10 ref7">7,10</xref>
        ].
      </p>
      <p>
        R3. Instance preservation: All instances of (all/target) input ontologies should
be preserved in the merged ontology [
        <xref ref-type="bibr" rid="ref10 ref7">7,10</xref>
        ].
      </p>
      <p>
        R4. Correspondence preservation: If two entities of the input ontologies are
corresponding 2, both should map to the same merged entity in the merged
ontology [
        <xref ref-type="bibr" rid="ref10 ref6 ref7">6,7,10</xref>
        ].
1 Preserving all entities from all input ontologies or a preferred one.
2 This can be equality, similarity or is-a correspondences. In each case, the same type
of corresponding should be preserved.
      </p>
      <p>
        R5. Correspondences' property preservation : If any of the corresponding entities
from the input ontologies has a certain property, the merged entity should
also have this property [
        <xref ref-type="bibr" rid="ref6 ref7">6,7</xref>
        ].
      </p>
      <p>
        R6. Property's value preservation : Properties' values from the (all/target) input
ontologies should be preserved in the merged ontology [
        <xref ref-type="bibr" rid="ref6 ref7">6,7</xref>
        ]. In case of
con icts a resolution strategy is required.
      </p>
      <p>
        R7. Structure preservation : If two entities are connected via a certain property
in an input ontology, their mapped entities in the merged ontology should be
connected via the respective mapped property [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], thus preserving the input
ontologies' structures in the merged ontology.
      </p>
      <p>
        Minimality refers to knowledge redundancy and controlling of semantic overlap:
R8. Class redundancy prohibition : A class from the (all/target) input ontologies
should have at most one mapping in the merged ontology [
        <xref ref-type="bibr" rid="ref1 ref10 ref3 ref4 ref7">1,3,4,7,10</xref>
        ].
R9. Property redundancy prohibition : A property from the (all/target) input
ontologies should have at most one mapping in the merged ontology [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
R10. Instance redundancy prohibition : An instance from the (all/target) input
ontologies should have at most one mapping in the merged ontology.
R11. Extraneous entity prohibition : No additional entities other than input
ontologies' entities should be added in the merged result [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>
        Deduction refers to the deduction satisfaction with R12 :
R12. Entailment deduction satisfaction : The merged ontology is desirable to be
able to entail all entailments of the (all/target) input ontologies [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. As the
semantic consequences of the integration, it can include more entailments
but it should at least not miss knowledge from the input ontologies.
      </p>
      <p>
        Constraint re ects the satisfaction of the ontology constraints:
R13. One type restriction: Two corresponding entities should follow the same data
type [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]; e.g., if the range of author Id in one of the input ontology is
String and in the other one is Integer, then the range of the merged entity
author Id in the merged ontology cannot have both types.
      </p>
      <p>
        R14. Property value's constraint : If the (all/target) input ontologies place some
restriction on a property's values (e.g., in terms of cardinality or by
enumerating possible values) this should be preserved without con ict in
the merged ontology [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>
        R15. Property's domain and range oneness : The merge process should not result
in multiple domains or ranges de ned for a single property. This rule is recast
from the ontology modelling issues in [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>
        Acyclicity refers to controlling the chain problem in the merged ontology:
R16. Acyclicity in the class hierarchy : A cycle of is-a relationships implies equality
of all of the classes in the cycle, since is-a is transitive. Therefore, the merge
process should not produce a cycle in the class hierarchy [
        <xref ref-type="bibr" rid="ref1 ref10 ref3 ref5 ref7 ref8">1,3,5,7,8,10</xref>
        ].
R17. Acyclicity in the property hierarchy : The merge process should not
produce a cycle between properties with respect to the is-subproperty-of
relationship [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>
        R18. Prohibition of properties being inverses of themselves : The merged process
should not cause an inverse recursive de nition on the properties [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>
        Connectivity refers to the hierarchy connectivity satisfaction:
R19. Unconnected class prohibition : The merge process should not make the
classes unconnected [
        <xref ref-type="bibr" rid="ref3 ref7">3,7</xref>
        ]. Every class that had some connections in the input
ontologies before the merge process, should not be unconnected after merge
process in the merged ontology.
      </p>
      <p>
        R20. Unconnected property prohibition : The merge process should not make the
properties unconnected [
        <xref ref-type="bibr" rid="ref6 ref8">6,8</xref>
        ]. Every property that had some connections in
the input ontologies before the merge process, should not be unconnected
after merge process in the merged ontology.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>GMR Compatibility</title>
      <p>Intuitively, not all GMRs are compatible, e.g., completeness and minimality are
di cult to achieve simultaneously. In our ongoing work, we are determining the
precise relationships. Typically, users will, however, not be interested in all of
them anyhow. Depending on the application they want to create an ontology for,
some GMRs might be important to them while others are only nice to have. We
therefore propose an approach that takes as input a set of user-selected GMRs,
namely U and an undirected graph G re ecting the relationship among GMRs.
G = (V; E ), where V is the set of vertices representing the GMRs, and E is the
set of edges. Two GMRs are connected via an edge i they are compatible.</p>
      <p>Given the set U containing the GMRs the user is interested in, we nd the
maximum subset of V containing all vertices out of U and no incompatible nodes.
This may not always be possible, since the user might have chosen incompatible
GMRs already. In this case, we search for a maximum subset of V that preserves
as many nodes out of U as possible and contains compatible nodes only.</p>
      <p>
        Precisely, we recast the problem at hand as maximum clique extraction on G.
A clique is a set of fully connected vertices. We thus extract a compatible clique
KC -Clique, where K indicates the number of vertices in the clique, and C denotes
that the clique is compatible. KC -Clique includes compatible GMRs from U and
some additional compatible GMRs related to U 's elements. KC -max-Clique, is a
clique containing at least K vertices that is not a subset of any other cliques. To
compute the KC -max-Clique, we use the CLIQUES algorithm in [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
      </p>
      <p>To illustrate our approach with an example, Fig. 2 left shows the graph G,
for six GMRs where U = fR2; R8; R11g. Based on this graph, three di erent
cliques have been extracted, in which the user-selected compatible GMRs, the
incompatible U 's elements, and the additional compatible GMRs related to U
are indicated with green, red, and yellow circles, respectively. Two possible
represented 3C -Clique are compatible cliques however they are not a maximal
clique. The 4C -max-Clique is the best match to the U and indicates the maximal
compatible subset on the sketched G.
Fig. 2: From left to right: a graph G for six GMRs; two di erent compatible
3C -Clique; a compatible 4C -max-Clique.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>There is a variety of generic merge requirements (GMR) in the ontology
engineering domain. We gathered the most common GMRs 3, and classi ed
them into three dimensions. Since not all GMRs can be ful lled at the same
time, we propose an approach that allows users to specify the most important
GMRs for their speci c task, then determines the maximal compatible subset
of GMRs. This result can then be used to select a proper merge method or to
parameterize a generic merge method. We are currently working on two aspects
of this problem: First, a systematic determination of GMR compatibility and
second the development of a generic, parameterizable merge method.</p>
      <p>Acknowledgments. S. Babalou is supported by a scholarship from German
Academic Exchange Service (DAAD).
3 see also: http://comerger.uni-jena.de/requirement.jsp</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>F.</given-names>
            <surname>Duchateau</surname>
          </string-name>
          and
          <string-name>
            <given-names>Z.</given-names>
            <surname>Bellahsene</surname>
          </string-name>
          .
          <article-title>Measuring the quality of an integrated schema</article-title>
          .
          <source>In ER</source>
          , pages
          <volume>261</volume>
          {
          <fpage>273</fpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>E.</given-names>
            <surname>Jimenez-Ruiz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. C.</given-names>
            <surname>Grau</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Horrocks</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Berlanga</surname>
          </string-name>
          .
          <article-title>Ontology integration using mappings: Towards getting the right logical consequences</article-title>
          .
          <source>In ESWC</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>S. P.</given-names>
            <surname>Ju</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. E.</given-names>
            <surname>Esquivel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. M.</given-names>
            <surname>Rebollar</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Su</surname>
          </string-name>
          .
          <article-title>Creado{a methodology to create domain ontologies using parameter-based ontology merging techniques</article-title>
          .
          <source>In MICAI</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>M.</given-names>
            <surname>Mahfoudh</surname>
          </string-name>
          , G. Forestier, and
          <string-name>
            <given-names>M.</given-names>
            <surname>Hassenforder</surname>
          </string-name>
          .
          <article-title>A benchmark for ontologies merging assessment</article-title>
          .
          <source>In KSEM</source>
          , pages
          <volume>555</volume>
          {
          <fpage>566</fpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>N. F.</given-names>
            <surname>Noy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. L.</given-names>
            <surname>McGuinness</surname>
          </string-name>
          , et al.
          <article-title>Ontology development 101: A guide to creating your rst ontology</article-title>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>N. F.</given-names>
            <surname>Noy</surname>
          </string-name>
          and
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Musen</surname>
          </string-name>
          .
          <article-title>The prompt suite: interactive tools for ontology merging and mapping</article-title>
          .
          <source>IJHCS</source>
          ,
          <volume>59</volume>
          (
          <issue>6</issue>
          ):
          <volume>983</volume>
          {
          <fpage>1024</fpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>R. A.</given-names>
            <surname>Pottinger</surname>
          </string-name>
          and
          <string-name>
            <given-names>P. A.</given-names>
            <surname>Bernstein</surname>
          </string-name>
          .
          <article-title>Merging models based on given correspondences</article-title>
          .
          <source>In VLDB Endowment</source>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>M.</given-names>
            <surname>Poveda-Villalon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Suarez-Figueroa</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Gomez-Perez</surname>
          </string-name>
          .
          <article-title>Validating ontologies with oops</article-title>
          ! pages
          <fpage>267</fpage>
          {
          <fpage>281</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>S.</given-names>
            <surname>Raunich</surname>
          </string-name>
          and
          <string-name>
            <given-names>E.</given-names>
            <surname>Rahm</surname>
          </string-name>
          .
          <article-title>Towards a benchmark for ontology merging</article-title>
          .
          <source>In OTM</source>
          , volume
          <volume>7567</volume>
          , pages
          <fpage>124</fpage>
          {
          <fpage>133</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <given-names>S.</given-names>
            <surname>Raunich</surname>
          </string-name>
          and
          <string-name>
            <given-names>E.</given-names>
            <surname>Rahm</surname>
          </string-name>
          .
          <article-title>Target-driven merging of taxonomies with atom</article-title>
          .
          <source>Inf. Syst.</source>
          ,
          <volume>42</volume>
          :1{
          <fpage>14</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11. E.
          <string-name>
            <surname>Tomita</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Tanaka</surname>
            , and
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Takahashi</surname>
          </string-name>
          .
          <article-title>The worst-case time complexity for generating all maximal cliques and computational experiments</article-title>
          .
          <source>TCS</source>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>