<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A typology of ontology-based semantic measures</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Emmanuel Blanchard</string-name>
          <email>emmanuel.blanchard@univ-nantes.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mounira Harzallah</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Henri Briand</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pascale Kuntz</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Laboratoire d'Informatique de Nantes Atlantique Site École polytechnique de l'université de Nantes</institution>
          <addr-line>rue Christian Pauc BP 50609 - 44306 Nantes Cedex 3 -</addr-line>
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Ontologies are in the heart of the knowledge management process. Different semantic measures have been proposed in the literature to evaluate the strength of the semantic link between two concepts or two groups of concepts from either two different ontologies (ontology alignment) or the same ontology. This article presents an off-context study of eight semantic measures based on an ontology restricted to subsomption links. We first present some common principles, and then propose a comparative study based on a set of semantic and theoretical criteria.</p>
      </abstract>
      <kwd-group>
        <kwd>Semantic measure classification</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        A consensus is now established about the definition and the role of an ontology in
knowledge engineering: "an ontology is a formal, explicit, specification of a shared
conceptualization" [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], it constitutes knowledge repository that supports
information exchanges between computer systems. Several application fields search to
exploit their wealth: semantic web, system interoperability, competence
management, e-learning, natural language processing, etc. Numerous semantic measures
on ontology have been proposed in literature to evaluate the strength of the
semantic link between two concepts or two groups of concepts from two different
ontologies (ontology alignment) or inside an ontology. The majority of the
semantic measures defined on a unique ontology is developed and validated in a
specific context. This article presents an off-context study of eight semantic
measures whose definitions take only into account subsomption links. The synthesis
of the parameters which appear in at least one of these measures has provided
us a basis of comparison to develop a semantic measure typology.
      </p>
      <p>
        First, we identify ontology-based semantic measure characteristics and the
parameters that influence these measures. Then, we present a comparative study
of eight measures structured according to the previously defined criterion.
We consider here the basic primitives of an ontology which are concepts and
relations. Among the set of possible relations, some of them are not used
systematically. For instance, the taxonomic relations (hyperonymy/hyponymy) which
correspond to the subsomption link (is-a) are the commonly used relations.
Additional relations may also appear e.g. partonomic relations (meronymy/holonymy)
which correspond to the composition link (part-of), lexical relations (synonymy,
antonymy, etc.) [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ][
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. We have here decided to focus ourselves on taxonomic
relations before we generalize this study to other relation types. This choice is
guided by most of previous works which are limited to one or some relation types
but which always consider taxonomic relations.
      </p>
      <p>
        Measure validation is evoked according to three ways [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]: mathematical
analysis, comparison with human judgment and application specific evaluation. In this
paper we favour the mathematical analysis, and we introduce the different
characteristics of semantic measures, and the different parameters which influence
them. When defining a measure, three characteristics are generally specified:
Information sources. Each considered measure is based on a given ontology
(most often WordNet). Some definitions require a corpus of texts to add
information such as the distribution of concept frequencies.
      </p>
      <p>Principles. Most of measures are based on axiomatic principles e.g. they make
functions of the information content or the shortest path length.
Semantic class. Different classes have been introduced in the literature:
semantic distance, semantic similarity and semantic relatedness between two
concepts in the same ontology.</p>
      <p>The semantic similarity evaluates the resemblance between two concepts
from a subset of significant semantic links (e.g is-a and part-of). The semantic
relatedness evaluates the closeness between two concepts from the whole set of
their semantic links. All pairs of concepts with a high semantic similarity value
have a high semantic relatedness value whereas the inverse is not necessarily
true. The semantic distance evaluates the disaffection between two concepts; it
is an inverse notion to the semantic relatedness.</p>
      <p>We have identified four parameters associated with the ontology taxonomic
hierarchy which influence at least one of the measures:
p1 the length of the shortest path: the length of the shortest path between two
concepts ci and cj ;
p2 the depth of the most specific common subsumer : the length of the shortest
path between the root and the most specific common subsumer of ci and cj ;
p3 the density of the concepts of the shortest path: the number of sons of each
concept which belongs to the shortest path between two concepts ci and cj ;
p4 the density of the concepts from the root to the most specific common
subsumer : the number of sons of the concepts which belong to the shortest path
from the root to the most specific common subsumer of two concepts ci and
cj .</p>
      <p>Since our study is restricted to the taxonomic hierarchy, p1 is equal to the sum
of the two shortest path from the two concepts to their most specific common
subsumer. The parameter p2 is the difference between the length of the shortest
path between the root and one of the two concepts and the length of the shortest
path between this concept and the most specific common subsumer.</p>
      <p>The measures which also use a corpus consider the information content of
some concepts. Let us consider a concept c. The information content is defined
by CI(c) = − log(P (c)) where P (c) corresponds to the occurrence probability,
in a consequent corpus of texts, of c or one of its directly or indirectly subsumed
concepts. Let us notice that this definition contains the information on the
shortest path from the root to the concept c (depth of c). As P (c) is an exponential
decreasing function of the depth of c, CI(c) is proportional to this latter. In
addition, the information extracted from a corpus by this approach contains also
the information of the density of the concepts on the same path. Indeed, let us
consider the set S of the concepts which have the same father (direct subsumer)
as c. When the cardinality of S increases, the average occurrence probability of
each element of S decreases. Consequently, CI(c) increases.
3</p>
    </sec>
    <sec id="sec-2">
      <title>Semantic measure presentation</title>
      <p>
        In the following, we present eight measures using information sources and
principles defined in the previous section. Then, we analyse the theoretic definition
of each measure. Some of the following ontology or corpus characteristics are
considered in the definitions:
rt: ontology root
pths(x, y): set of paths between the concepts x and y
lene(x): length in number of edges of the path x
lenn(x): length in number of nodes of the path x
P (x): occurrence probability of a concept x in a corpus
mscs(x, y): the most specific common subsumer of x and y
trn(x): number of direction changes of the path x
minr: minimum weight assigned to the relation r
maxr: maximum weight assigned to the relation r
nr(x): number of relations of type of r which leave x
Rada et al.’s distance[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. It is based on the shortest path between two
concepts ci and cj (p1) in an ontology restricted to taxonomic links:
distrmbb(ci, cj) =
      </p>
      <p>When considering the shortest path, all the taxonomic links between two
adjacent concepts are supposed to have a same value.</p>
      <p>
        Resnik’s similarity[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. It is established on the following hypothesis: the
more the information two concepts share in common, the more similar they are.
Like the previous measure (1), this one only considers taxonomic links. On the
basis of the information theory, Resnik proposes to add the information content.
The information shared by two concepts is indicated by the information content
of their most specific common subsumer:
simr(ci, cj) = − log P (mscs(ci, cj))
(1)
(2)
      </p>
      <p>The use of the information content of the most specific common subsumer
implies that this measure depends on two parameters: the length of the shortest
path from the root to the most specific common subsumer of ci and cj (p2) and
the density of concepts on this path (p4).</p>
      <p>
        Leacock and Chodorow’s similarity[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. It corresponds to a
transformation of the Rada distance into a similarity. The shortest path between two
concepts of the ontology restricted to taxonomic links is normalized by introducing
a division by the double of the maximum hierarchy depth:
simlc(ci, cj ) = − log
      </p>
      <p>Like the Rada’s measure, only the shortest path length (p1) influences this
measure.</p>
      <p>
        Wu and Palmer’s similarity[
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. It is a measure between concepts in an
ontology restricted to taxonomic links. The two parameters which are the length
of the two paths from ci to mscs(ci, cj ) and from cj to mscs(ci, cj ) in the Wu and
Palmer’s definition have been added. Their addition corresponds to the shortest
path between ci and cj in the formula below:
simwp(ci, cj ) =
2 ∗
      </p>
      <p>min
p∈pths(mscs(ci,cj),rt)
lene(p) + 2 ∗</p>
      <p>lene(p)</p>
      <p>Here, the depth of the most specific common subsumer (p2) has a non linear
influence. We can observe this evolution if we set the shortest path length to a
constant k : (inf luence(x) = x/(x + k)). Furthermore, this measure is sensitive
to the shortest path (p1).</p>
      <p>
        Jiang and Conrath’s distance[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The authors have used, like Resnik,
a corpus in addition to the ontology restricted to taxonomic links. Jiang and
Conrath formulate the distance between two concepts as the difference between
the sum of the information content of the two concepts and the information
content of their most specific common subsumer:
distjc(ci, cj ) = 2 ∗ log P
mscs(ci, cj ) −
      </p>
      <p>This definition is composed of two interesting components which are the
information content of the two concepts and the information content of their most
specific common subsumer. We can suppose that it varies according to all the
proposed parameters. But the combination revokes the effect of two parameters.
Finally, this measure is sensitive to the shortest path length between ci and cj
(p1) and the density of concepts along this same path (p3).</p>
      <p>
        Lin’s similarity[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Lin deduces from an axiomatic approach a measure
based on an ontology restricted to taxonomic links and a corpus. This similarity
takes into account the information shared by two concepts like Resnik, but also
(3)
(4)
the difference between them. The definition contains the same components as in
the previous measure but the combination is not a difference but a ratio:
siml(ci, cj ) =
2 ∗ log P mscs(ci, cj )
      </p>
      <p>In this case, the combination allows this measure to be sensitive to the whole
parameter set (p1, p2, p3, p4). Lin notices that the Wu and Palmer measure is a
particular case of his measure. Indeed if, for c0 the father of c, we consider that
P (c|c0) is constant, then we obtain the Wu and Palmer’s measure.</p>
      <p>
        Sussna’s distance[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. It is based on all the possible links. For each relation
r, we define a weight w(ci →r cj ) from a given interval [minr; maxr]. This weight
is calculated with the local density which corresponds to the number of relations
of the type r which go from ci:
w(ci →r cj ) = maxr −
maxr − minr
nr(ci)
      </p>
      <p>Sussna defines the distance between two adjacent concepts. This link
corresponds to the relations r and its inverse r0.</p>
      <p>dists(ci, cj ) =</p>
      <p>h
2 ∗ max
w(ci →r cj ) + w(cj →r0 ci)</p>
      <p>min
p∈pths(ci,rt)
lene(p);</p>
      <p>min
p∈pths(cj,rt)
lene(p)
i</p>
      <p>This formula is defined for adjacent nodes only. To calculate the distance
between two concepts, we have to sum the distances of all the links which compose
the shortest path between these two concepts. The distance obtained is sensitive
to three parameters: the shortest path length between ci and cj (p1), the density
of the concepts along this same path (p2) and the shortest path length from the
root to the most specific common subsumer of ci and cj (p3).</p>
      <p>
        Hirst and St Onge’s relatedness[
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. It is based on an ontology. Hirst
and St Onge distinguish four relation types between two concepts qualified of
extra-strong, strong, medium and week. Hirst and St Onge propose a different
way to calculate the relatedness functions of the relation type. In the following
formula, C and K are two constants:
relhs(ci, cj ) =


      </p>
      <p>C − minp∈pths(ci,cj) lene(p)</p>
      <p>−K ∗ minp∈pths(ci,cj) trn(p)
 3 ∗ C(extra-strong); 2 ∗ C(strong); 0(week);


(medium)</p>
      <p>Synthesis and conclusion. The table 1 summarizes the studied
characteristics and the influential parameters of the measures. The four parameters are
independent and issued from the ontology. However, no proposed measure takes
into account the whole parameter set without the use of a corpus. Concerning
the introduction of a corpus in addition to an ontology when building a
measure, we believe that this latter brings few information in comparison with its
algorithmic complexity.
(6)
(7)
(8)
(9)</p>
      <p>characteristics Parameters
sources semantics p1 p2 p3 p4
ontology distance √
ontology+corpus similarity √ √</p>
      <p>ontology similarity √
ontology+corpus distance √ √</p>
      <p>ontology similarity √ √
ontology+corpus similarity √ √ √ √
ontology distance √ √ √
ontology relatedness √</p>
      <p>In the next future, we plan to define a semantic similarity measure based
on an ontology which will be sensitive to all parameters: the length and density
of concepts of the shortest path between two concepts and of the shortest path
from the root to their most specific common subsumer.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Gruber</surname>
            ,
            <given-names>T.R.:</given-names>
          </string-name>
          <article-title>A translation approach to portable ontology specifications</article-title>
          .
          <source>Knowledge Acquisition</source>
          <volume>5</volume>
          (
          <year>1993</year>
          )
          <fpage>199</fpage>
          -
          <lpage>220</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Sussna</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Word sense disambiguation for free-text indexing using a massive semantic network</article-title>
          .
          <source>In: Proceedings of the Second International Conference on Information and Knowledge Management</source>
          . (
          <year>1993</year>
          )
          <fpage>67</fpage>
          -
          <lpage>74</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Jiang</surname>
            ,
            <given-names>J.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Conrath</surname>
            ,
            <given-names>D.W.</given-names>
          </string-name>
          :
          <article-title>Semantic similarity based on corpus statistics and lexical taxonomy</article-title>
          .
          <source>In: Proceedings of International Conference on Research in Computational Linguistics</source>
          . (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Budanitsky</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Lexical semantic relatedness and its application in natural language processing</article-title>
          .
          <source>Technical report</source>
          , Computer Systems Research Group - University of Toronto (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Rada</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mili</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bicknell</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blettner</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Development and application of a metric on semantic nets</article-title>
          .
          <source>IEEE Transactions on Systems, Man, and Cybernetics</source>
          <volume>19</volume>
          (
          <year>1989</year>
          )
          <fpage>17</fpage>
          -
          <lpage>30</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Resnik</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Using information content to evaluate semantic similarity in a taxonomy</article-title>
          .
          <source>In: Proceedings of the 14th International Joint Conference on Artificial Intelligence</source>
          . Volume
          <volume>1</volume>
          . (
          <year>1995</year>
          )
          <fpage>448</fpage>
          -
          <lpage>453</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Leacock</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chodorow</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>Combining local context and wordnet similarity for word sense identification</article-title>
          . In Fellbaum, C., ed.:
          <article-title>WordNet: An electronic lexical database</article-title>
          . MIT Press (
          <year>1998</year>
          )
          <fpage>265</fpage>
          -
          <lpage>283</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Palmer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Verb semantics and lexical selection</article-title>
          .
          <source>In: Proceedings of the 32nd Annual Meeting of the Associations for Computational Linguistics</source>
          . (
          <year>1994</year>
          )
          <fpage>133</fpage>
          -
          <lpage>138</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>An information-theoretic definition of similarity</article-title>
          .
          <source>In: Proceedings of the 15th International Conference on Machine Learning</source>
          , Morgan Kaufmann (
          <year>1998</year>
          )
          <fpage>296</fpage>
          -
          <lpage>304</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Hirst</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>StOnge</surname>
          </string-name>
          , D.:
          <article-title>Lexical chains as representation of context for the detection and correction of malaproprisms</article-title>
          . In Fellbaum, C., ed.:
          <article-title>WordNet: An electronic lexical database</article-title>
          . MIT Press (
          <year>1998</year>
          )
          <fpage>305</fpage>
          -
          <lpage>332</lpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>