<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>LooPings: a Look at Semantic Similarities</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Adama Sow</string-name>
          <email>adama.sow.4@ulaval.ca</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jean-Re´mi Bourguet</string-name>
          <email>jrbourguet@inf.ufes.br</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>De ́partement Ge ́nie Informatique et Te ́le ́communications Ecole Polytechnique de Thie`s (EPT) -</institution>
          <country country="SN">Senegal</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Dipartimento di Scienze Politiche ed Ingegneria dell'Informazione Universita` degli Studi di Sassari (UNISS) -</institution>
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Nu ́ cleo de Estudos em Modelagem Conceitual e Ontologias Federal University of Esprito Santo (UFES) -</institution>
          <country country="BR">Brazil</country>
        </aff>
      </contrib-group>
      <fpage>23</fpage>
      <lpage>32</lpage>
      <abstract>
        <p>Semantic similarities is a cross-field research in Natural Language Processing and Ontologies with some possible fallouts in Artificial Intelligence. Formerly, similarities were computed following a syntactical treatment to support case-based reasoning. Textual similarities are now guided by semantic machineries, offering various ways to compute relatedness measures. In this paper, we present both a logical and a visual framework aiming to reason with them. For that reason, we introduced F LH , a fragment of description logic underpinning the well-known lexical database Wordnet. We illustrated this framework with the path length relatedness, one of the historical similarity measures occurring in a taxonomy. The core of our framework orchestrates the computation of similarity scores supported by REVERB, STANFORD CORENLP and WORDNET:SIMILARITY APIs and interfaces global similarities in graphical way by positioning them on segments. We also depicted some experimental results to confront our computational framework with some empirical data.</p>
      </abstract>
      <kwd-group>
        <kwd>Semantic Similarity</kwd>
        <kwd>Ontology</kwd>
        <kwd>Description logic</kwd>
        <kwd>Natural language processing</kwd>
        <kwd>Interface</kwd>
        <kwd>Empirical data</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Reasoning with similarities is seen as one of the crucial steps in Artificial
Intelligence. Turing, in his paper Computing Machinery and Intelligence [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], suggested that
a machine having passed the so-called test, should appear as a human 70% of the time
after five minutes of conversation. From Joseph Weizenbaum, and his seminal proposal
Eliza [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] in 1966, until the last generation of programs Jabberwacky and Cleverbot
developed by Rollo Carpenter [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] during the last decade, the treatment of similarities
attracted a lot of interest to tackle this issue. Formerly, one classical way was to deal
with Natural Language Processing (NLP), focusing on syntactic similarities through a
case-based reasoning machinery.
      </p>
      <p>
        These last years, the Semantic Web [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] has given complementary resources in terms
of knowledge bases, offering new standards to deal with lexical databases. The semantic
dimension of similarities is now guided by well-established and moderated repositories
of knowledge including ontological layers (see [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]), but few attempts was performed to
combine the classical NLP-based technologies together with semantic web
infrastructures in order to analyze the limits of such interactions (see for example [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]). Moreover,
if some methods were also proposed to compute global scores of similarities between
two phrases based on local semantic similarities of their components (see [
        <xref ref-type="bibr" rid="ref7 ref8">7,8</xref>
        ]), we
remark that the usual way to output a set of similarities between a phrase and a set
of phrases is an ordered list of items (e.g. the popular search engines), and very few
approaches were proposed in the literature in order to screen the similarity scores in
different ways (see [
        <xref ref-type="bibr" rid="ref10 ref9">9,10</xref>
        ]).
      </p>
      <p>
        In this paper, we present a logical and visual framework to represent and reason with
textual relatedness measures guided by ontologies. In order to bridge the gap between
natural language and ontologies, we introduced a fragment of Description Logic (DL),
underpinning the well-known lexical database WORDNET [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. With this fragment, we
can properly redefine some relatedness measures historically used in taxonomies. To
gain place, we only introduced here the path length relatedness measure denoted plr.
Nevertheless, other historical similarity measures considering the maximum depth in
taxonomy (see lch in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]), the depth of the least common subsumer (see wup in [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ])
or the supported information content (see jcn in [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] and lin in [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]) have also been
integrated in this framework.
      </p>
      <p>After some preliminaries to introduce our logical framework in section 2, we
describe in section 3 both our computation and an interface to screen the similarities
between a phrase and a set of phrases. Finally, in section 4 we present an experimental
validation to confront the theoretical similarities with some experimental ones.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Preliminaries</title>
      <p>
        DL is a well-known family of formal knowledge representation models. Semantic
languages used on the web to share knowledge (e.g. RDFS [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], OWL [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] and OWL2 [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ])
have all some direct underpinning logics that are fragments of DL. The core
interpretation of a DL in first order logic was given by Baader and Nutt in [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] as follows:
      </p>
      <sec id="sec-2-1">
        <title>Definition 1. Let C the set all the atomic concepts and R the set all the atomic roles,</title>
        <p>an interpretation I is an ordered pair ( I ; I ) such that:
- I is the domain, i.e. a non-empty set of individuals,
- I is the interpretation function which maps:
- each atomic concept A 2 C to a set AI I ,
- each atomic role r 2 R to a binary relation rI I I .</p>
        <p>
          In this paper, we chose to introduce F LH defined in Definition 2 as an extension
of AL [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ] using transitivity and hierarchy for roles but removing negation,
intersection, limited existential and value restrictions.
        </p>
        <sec id="sec-2-1-1">
          <title>Definition 2. Let an interpretation I, fA; Bg C and fr; sg</title>
          <p>R
(dom(r)
(ran(r)
&gt;I = I
?I = ;
(A v B)I = AI BI
(r v s)I = rI sI
(r+)I = (a; b) 2 rI ^ (b; c) 2 rI ! (a; c) 2 rI</p>
          <p>A)I = 8a; b 2 I :a 2 AI [ (a; b) 2= rI
B)I = 8a; b 2 I :b 2 BI [ (a; b) 2= rI
top concept
bottom concept
inclusion axiom
role hierarchy
transitivity
domain
range</p>
          <p>
            As redefined in [
            <xref ref-type="bibr" rid="ref19">19</xref>
            ], a Terminology Box or a TBox in DL is a finite set of axioms,
with no symbolic name (equality whose left-hand side is an atomic concept), which is
defined more than once. Princeton’s WORDNET is a lexical database for the English
language [
            <xref ref-type="bibr" rid="ref21">21</xref>
            ]. A decade after the creation of this database, van Assem [
            <xref ref-type="bibr" rid="ref22">22</xref>
            ] proposed a
conversion of WORDNET in OWL. Example 1 presents a sample of TBox underpinning
WORDNET in OWL introducing particularly the concept SYNSET4 and the role h5.
          </p>
        </sec>
      </sec>
      <sec id="sec-2-2">
        <title>Example 1 (Sample of WORDNET TBox).</title>
        <p>SYNSET v &gt;, dom(h) SYNSET, ran(h)</p>
        <p>SYNSET, h+.</p>
        <p>
          In the terms of Baader and Nuts [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ], an Assertional Box or an ABox is a specific
state of affairs of an application domain in terms of concepts and roles. Example 2
depicts a graph representing a sample of ABox underpinning WORDNET in OWL where
&gt;n represents the individual root of all the individual nouns present in WORDNET,
the edge represents synsets (e.g. SYNSET(Astract)) and the directed arcs represent the
hyponym relation assertions between two individuals (e.g. h(Attribute; Abstract)).
Note that the WORDNET Abox is partitioned in different set, dealing with different
parts of speech (nouns, verbs, adjectives and adverbs).
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>Example 2 (Sample of WORDNET ABox).</title>
        <p>Relation</p>
        <p>Astract</p>
        <p>P hysical
&gt;n</p>
        <p>Attribute</p>
        <p>
          The idea of computing relatedness measures between concepts occurring in a
taxonomy is an old issue (see [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]). Few approaches (see [
          <xref ref-type="bibr" rid="ref24 ref25">24,25</xref>
          ]) have described the common
relatedness measures through a DL formalism. According to Rada [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ], a path length
is founded on a node-counting scheme concerning the smallest specified role counting
between two individuals. Given an interpretation and two individuals, we redefined the
shortest path length function denoted L as follows:
4 Short for “Synonym set” to call a non-empty set of synonyms.
5 Short for “hyponym” to call the well-known “hyponymy” relation.
        </p>
        <sec id="sec-2-3-1">
          <title>Definition 3. Let an interpretation I, fuk; vlg 2 I and h 2 R</title>
          <p>L(uk; vl) = min(2n-(k+l)) s.t. 8(i; j) 2 [[k; n] [[l; n]: h(ui; ui+1)^h(vj ; vj+1)^un=vn
Note that if a practitioner deals with the transitive closure of the WORDNET ABox,
a min max operator may be used due to the transitivity of the relation h. Example 3 lists
the shortest path length between individual nouns introduced in Example 2. We denote
&gt;v the individual root of all the verbs. The ABox being partitioned, note that no verb
is involved in a hyponym relation with a noun (and vice-versa).</p>
        </sec>
      </sec>
      <sec id="sec-2-4">
        <title>Example 3 (Shortest Path Length).</title>
        <p>L(Relation; Abstract) = 1
L(P hysical; P hysical) = 0</p>
        <p>L(Relation; Attribute) = 2
L(P hysical; &gt;v) 7! 1</p>
        <p>
          As reintroduced by Pedersen [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ], the path length based relatedness score is equal
to the inverse of the shortest path length between two concepts. Naturally, it is inversely
proportional to the number of nodes along the shortest path between the synsets. The
path length based relatedness denoted plr is defined as follows:
        </p>
        <sec id="sec-2-4-1">
          <title>Definition 4. Let an interpretation I, fu; vg 2</title>
          <p>I
plr(u; v) =</p>
          <p>1
1+L(u;v)</p>
          <p>If no path exists (e.g. between a verb and a noun) we consider that L tends to 1, and
then the path length based relatedness score approaches 0. The highest possible score
occurs when the two synsets are the same, in which case the score is 1.</p>
        </sec>
      </sec>
      <sec id="sec-2-5">
        <title>Example 4 (Path Length based Relatedness).</title>
        <p>plr(Relation; Abstract) = 1/2
plr(P hysical; P hysical) = 1
plr(Relation; Attribute) = 1/3
plr(P hysical; &gt;v) 7! 0</p>
        <p>In the next section, we present our method to compute similarity scores between
two phrases based on the local semantic similarities of their components (e.g. plr),
moreover we depict a way to screen them through a graphical user interface.
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Computational Framework</title>
      <p>
        The finality of our framework is to screen similarity scores between one phrase and a
set of phrases. We chose to investigate the way transforming a phrase in a set of triples
before performing the computation of similarities. We selected the API REVERB, an
information extractor for massive corpora (working without pre-specified vocabulary).
In [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ], REVERB was judged with a extraction precision of 0,8 or higher in at least 30%
of the time, substantially outperforming others extractors like TEXTRUNNER (see [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ])
and WOE (see [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ]). Formally and for the following, we can see a phrase p as a set
of triples ft1; : : : ; tng where tl = htl1; tl2; tl3i with tlk 2 I . Thereafter, we define
the global score between two phrases as the maximum of the averages of similarities
between the components from the triples (average of similarities between subjects,
objects, and complements). Moreover, we arbitrary avoid to take into account two kinds of
scores: scores of 0, it is the case when at least one entry is not present in the lexical base
of WORDNET (see section 2), and the score of 1 (similar strings) only if the individuals
in comparison are present in a stop words set denoted .
      </p>
      <sec id="sec-3-1">
        <title>Definition 5. Let and two phrases p, q, the Score of similarity is defined as follows:</title>
        <p>S(p; q)=max(avg(fs(tik; tjk)j s(tik; tjk) 6= 0 ^ (s(tik; tjk) 6= 1 _ tik; tjk 2= )g))
ti;tj
with ti 2 p, tj 2 q, k 2 [[1; 3]], s 2 fplr, lch, wup, jcn, ling and S the upper case of s.</p>
        <p>
          To support this computation, we used the STANFORD CORENLP API [
          <xref ref-type="bibr" rid="ref31">31</xref>
          ] to
perform the lemmatization of the components from the triples. The similarities calculus of
the lemme were performed by WORDNET:SIMILARITY (developed by Pedersen et al.
in [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ] and redesigned in Java by Shima [
          <xref ref-type="bibr" rid="ref33">33</xref>
          ]).
        </p>
        <p>All this treatment is included in the framework LooPings6 (Lexical and ontological
observations to Plot ingathering similarities). Looking at the system description of
Fig. 1, LooPings is decomposed in several modules:
6 The framework LooPings is available at http://nemo.inf.ufes.br/?p=1185.
INPUT takes a phrase and some parameters (e.g. local measure, stop words list, etc.).
LIBRARIES comprise REVERB, STANFORD CORENLP and WS4J API (bundled
with WORDNET).</p>
        <p>CORE manages the computation.</p>
        <p>OUTPUT is devoted to screen and to plot the similarity scores.</p>
        <p>One of the most challenging steps is the connection between the NLP APIs
REVERB and WS4J. The main aspect that made this compatibility a little immature is the
fact that REVERB can output expressions (several raw words) while WS4J accepts only
a single element as input (a lemme or an expression of lemmes linked themselves in one
string by underscores). Then, we had to perform a treatment from our raw phrases,
following some basic heuristics, as described through the example below:
. What can we learn from the neural networks of C.elegans to understand human brains?
The outputted triple is hwe,learn from,the neural networks of C.elegansi.
1. Lemmatization:</p>
        <p>hwe,learn from,the neural network of C.elegansi
2. Chunking:</p>
        <p>hwe,learn from,the neural network C.elegansi
3. Stop words removing:</p>
        <p>hwe, learn, neural network C.elegansi
4. Cutting:
hwe, learn, neural networki;hwe, learn, C.elegansi</p>
        <p>
          Note that during the step of the removal, stop words were never removed from the
triples when it involved an empty set for one component. Fig. 2 presents the interface of
LooPings, where practitioners can visualize and confront the semantic similarities.
Note that this framework is oriented to integrate queries and SPARQL [
          <xref ref-type="bibr" rid="ref34">34</xref>
          ]
interpretations allowing a possible integration of other semantic web technologies. The similarity
scores are represented on a segment [
          <xref ref-type="bibr" rid="ref1">0,1</xref>
          ].
        </p>
        <p>The last section is dedicated to the confrontation between the theoretical scores with
some experimental ones.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Experimental Validation</title>
      <p>In order to deal with real queries, we used STACK EXCHANGE API allowing us to
extract from a website some structured data (in JSON format) about different
questionand-answer themes. The total number of available queries for the Cognitive Science
section of STACK EXCHANGE website was 1245, we decided to extract the 1200 first
queries (w.r.t. the extraction order), from each of them we stored ids (from 1 to 4697)
and queries. Formally, a query referenced by an id is denoted qid, moreover we call
series Sj a set of queries: Sj fqidj1 id 4697g. A tricky aspect was the fact that
REVERB outputted nothing in 70.75 percent of queries. So, we decided to divide, the
1200 queries in 12 series of 100 queries, finally giving a median series of 28 queries.
Due to space limitation, we will depict in this article only two series (A and B).</p>
      <p>Once the extraction was performed, we created an on-line semantic recall based
experience. For each series, we designed a witness query. We attached a questionnaire to
each series and sent them by e-mail to mailing lists of PhD students. The requirement for
participating was to be proficient in English. Questionnaires were designed as follows:
1. Instruction to read carefully a witness query.
2. Instruction to read the series of queries and to check 1, 2 or 3 queries among them,
(at least 1, at most 3), the most similar, for the user, to the witness queries.
3. Instruction to partially preorder the selected query(ies).</p>
      <p>We succeeded in obtaining 50 volunteers. We gave accumulated marks for queries
in each series w.r.t the preorders given by the volunteers. For example if a volunteer
selected only one query qi in the series, the accumulated mark for qi was increased by
6, the remaining possible situations are listed below:
qi
qi
qi
qj
qj
qj
qk
qk
qi(+4); qj (+2)
qi(+3); qj (+2); qk(+1)
qi(+3); qj ; qk(+1:5)
qi
qi
qi
qj
qj
qj
qk
qk
qi; qj (+3)
qi; qj ; qk(+2)
qi; qj (+2:5); qk(+1)</p>
      <p>The maximum mark is translated to an experimental score of 1, after what all the
other marks have been transposed in experimental scores by cross-multiplications. As
depicted in Fig. 3, we took the PLR score as a reference in order to observe how the
other scores behaved following it. Thereafter, we describe what is globally remarkable
in the behavior of the theoretical similarities.
– The higher the PLR is, the lower is the difference between scores founded on path
lengths (LCH, WUP) and scores founded on information contents (JCN, LIN).
– WUP and LIN react in the same way in the case of a sudden increase or drop.
– LIN and JCN scores have a behavior of exponential shape in all the series.</p>
      <p>Nevertheless, there are some limitations for this framework. We relate for example
the case of a saturation. Series B is remarkable by the fact that 5 queries are outputted
with the maximal score.
. Where could I find psychological experiments tools?
The outputted triples were hI,find,psychologicali; hI,find,experimenti; hI,find,tooli</p>
      <p>Here, the systematic presence of two stop words “I” and “find” make that all the
similarity scores are 1 iff one of the words “psychological”, “experiment” or “tool” is
present in one of the extracted triples.</p>
      <p>This experience showed some natural limits concerning our approach. The main
issue seems to be the automatic semantic annotation step (REVERB) and the matching
step with WORDNET individuals (through WORDNET:SIMILARITY).
5</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>In this paper, we presented both a computational and a visual framework to support
semantic similarities guided by ontologies. The first contribution was to redefine some
taxonomy-based relatedness measures (e.g. path length relatedness) in a DL fragment
(F LH ) we introduced.</p>
      <p>
        The core of our framework orchestrates the computation of similarity scores supported
by REVERB, STANFORD CORENLP and WORDNET:SIMILARITY APIs and interfaces
results in graphical way through segments. Moreover, we plotted the behavior of our
computations by designing a semantic recall-based experience to confront empirical
similarities with the theoretical ones. Some research perspectives for this work would
be to extend LooPings in two ways. The first concerns the core of our framework
by interpreting scores founded on other relations like for instance the mereology in
WORDNET. The second is to integrate other repositories to support the similarities
or other frameworks to compute similarities directly on semantic web languages (e.g.
using the technology of QAKiS [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ]).
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Alan</given-names>
            <surname>Mathison Turing</surname>
          </string-name>
          .
          <article-title>Computing machinery and intelligence</article-title>
          .
          <source>Mind</source>
          ,
          <volume>59</volume>
          (
          <issue>236</issue>
          ):
          <fpage>433</fpage>
          -
          <lpage>460</lpage>
          ,
          <year>1950</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Joseph</given-names>
            <surname>Weizenbaum</surname>
          </string-name>
          .
          <article-title>Eliza;a computer program for the study of natural language communication between man and machine</article-title>
          .
          <source>Commun. ACM</source>
          ,
          <volume>9</volume>
          (
          <issue>1</issue>
          ):
          <fpage>36</fpage>
          -
          <lpage>45</lpage>
          ,
          <year>1966</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Rollo</given-names>
            <surname>Carpenter</surname>
          </string-name>
          and
          <string-name>
            <given-names>Jonathan</given-names>
            <surname>Freeman</surname>
          </string-name>
          .
          <article-title>Computing machinery and the individual: The Personal Turing Test</article-title>
          .
          <source>Technical report, Jabberwacky</source>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Tim</given-names>
            <surname>Berners-Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>James</given-names>
            <surname>Hendler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Ora</given-names>
            <surname>Lassila</surname>
          </string-name>
          , et al.
          <article-title>The semantic web</article-title>
          .
          <source>Scientific american</source>
          ,
          <volume>284</volume>
          (
          <issue>5</issue>
          ):
          <fpage>28</fpage>
          -
          <lpage>37</lpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Andreas</surname>
            <given-names>Hotho</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Steffen Staab</surname>
            , and
            <given-names>Gerd</given-names>
          </string-name>
          <string-name>
            <surname>Stumme</surname>
          </string-name>
          .
          <article-title>Ontologies improve text document clustering</article-title>
          .
          <source>In Proceedings of the 2003 IEEE International Conference on Data Mining</source>
          , pages
          <fpage>541</fpage>
          -
          <lpage>544</lpage>
          . IEEE Computer Society, November 19-
          <issue>22</issue>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Viviana</given-names>
            <surname>Mascardi</surname>
          </string-name>
          , Angela Locoro, and
          <string-name>
            <given-names>Fabrizio</given-names>
            <surname>Larosa</surname>
          </string-name>
          .
          <article-title>Exploiting prolog and nlp techniques for matching ontologies and for repairing correspondences</article-title>
          .
          <source>In Proceedings of the 24th Convegno Italiano di Logica Computazionale</source>
          , pages
          <fpage>10</fpage>
          -
          <lpage>24</lpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>Yuhua</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <surname>David McLean</surname>
            ,
            <given-names>Zuhair</given-names>
          </string-name>
          <string-name>
            <surname>Bandar</surname>
          </string-name>
          ,
          <string-name>
            <surname>James O'Shea</surname>
          </string-name>
          , and
          <string-name>
            <surname>Keeley</surname>
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Crockett</surname>
          </string-name>
          .
          <article-title>Sentence similarity based on semantic nets and corpus statistics</article-title>
          .
          <source>IEEE Trans. Knowl</source>
          . Data Eng.,
          <volume>18</volume>
          (
          <issue>8</issue>
          ):
          <fpage>1138</fpage>
          -
          <lpage>1150</lpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>Palakorn</given-names>
            <surname>Achananuparp</surname>
          </string-name>
          , Xiaohua Hu, and
          <string-name>
            <given-names>Xiajiong</given-names>
            <surname>Shen</surname>
          </string-name>
          .
          <article-title>The evaluation of sentence similarity measures</article-title>
          .
          <source>In Proceedings of the 10th International Conference on Data Warehousing and Knowledge Discovery</source>
          ,
          <source>DaWaK '08</source>
          , pages
          <fpage>305</fpage>
          -
          <lpage>316</lpage>
          . Springer-Verlag,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>Oren</given-names>
            <surname>Zamir</surname>
          </string-name>
          and
          <string-name>
            <given-names>Oren</given-names>
            <surname>Etzioni</surname>
          </string-name>
          .
          <article-title>Grouper: A dynamic clustering interface to web search results</article-title>
          . Comput. Netw.,
          <volume>31</volume>
          (
          <fpage>11</fpage>
          -16):
          <fpage>1361</fpage>
          -
          <lpage>1374</lpage>
          , May
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Tomas</surname>
            <given-names>Mikolov</given-names>
          </string-name>
          , Ilya Sutskever, Kai Chen, Gregory S. Corrado, and
          <string-name>
            <given-names>Jeffrey</given-names>
            <surname>Dean</surname>
          </string-name>
          .
          <article-title>Distributed representations of words and phrases and their compositionality</article-title>
          .
          <source>In Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8</source>
          ,
          <year>2013</year>
          ,
          <string-name>
            <given-names>Lake</given-names>
            <surname>Tahoe</surname>
          </string-name>
          , Nevada, United States., pages
          <fpage>3111</fpage>
          -
          <lpage>3119</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>George</surname>
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Miller</surname>
          </string-name>
          .
          <article-title>Wordnet: A lexical database for english</article-title>
          .
          <source>Communications of the ACM</source>
          ,
          <volume>38</volume>
          :
          <fpage>39</fpage>
          -
          <lpage>41</lpage>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>Claudia</given-names>
            <surname>Leacock</surname>
          </string-name>
          and
          <string-name>
            <given-names>Martin</given-names>
            <surname>Chodorow</surname>
          </string-name>
          .
          <article-title>Combining local context and wordnet similarity for word sense identification</article-title>
          .
          <source>WordNet: An Electronic Lexical Database</source>
          , pages
          <fpage>265</fpage>
          -
          <lpage>283</lpage>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <article-title>Zhibiao Wu and Martha Stone Palmer</article-title>
          .
          <article-title>Verb semantics and lexical selection</article-title>
          .
          <source>In James Pustejovsky, editor, 32nd Annual Meeting of the Association for Computational Linguistics</source>
          ,
          <fpage>27</fpage>
          -
          <lpage>30</lpage>
          June 1994, New Mexico State University, Las Cruces, New Mexico, USA, Proceedings, pages
          <fpage>133</fpage>
          -
          <lpage>138</lpage>
          . Morgan Kaufmann Publishers / ACL,
          <year>1994</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <given-names>Jay</given-names>
            <surname>Jiang</surname>
          </string-name>
          and
          <string-name>
            <given-names>David</given-names>
            <surname>Conrath</surname>
          </string-name>
          .
          <article-title>Semantic similarity based on corpus statistics and lexical taxonomy</article-title>
          .
          <source>In Proc. of the Int'l. Conf. on Research in Computational Linguistics</source>
          , pages
          <fpage>19</fpage>
          -
          <lpage>33</lpage>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <given-names>Dekang</given-names>
            <surname>Lin</surname>
          </string-name>
          .
          <article-title>An information-theoretic definition of similarity</article-title>
          .
          <source>In Proceedings of the Fifteenth International Conference on Machine Learning, ICML '98</source>
          , pages
          <fpage>296</fpage>
          -
          <lpage>304</lpage>
          , San Francisco, CA, USA,
          <year>1998</year>
          . Morgan Kaufmann Publishers Inc.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16. RDFS Working Group.
          <article-title>Resource description framework schema (rdfs). www</article-title>
          .w3.org/ TR/rdf-schema/.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17. OWL Working Group.
          <article-title>Web ontology language (owl). www</article-title>
          .w3.org/TR/ owl-features/.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18. OWL2 Working Group.
          <article-title>Web ontology language 2(owl2). www</article-title>
          .w3.org/TR/ owl2-overview/.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <given-names>Franz</given-names>
            <surname>Baader</surname>
          </string-name>
          and
          <string-name>
            <given-names>Werner</given-names>
            <surname>Nutt</surname>
          </string-name>
          .
          <article-title>Basic description logics</article-title>
          .
          <source>In Description Logic Handbook</source>
          , pages
          <fpage>43</fpage>
          -
          <lpage>95</lpage>
          . Cambridge University Press,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <given-names>Klaus</given-names>
            <surname>Schild</surname>
          </string-name>
          .
          <article-title>A correspondence theory for terminological logics: Preliminary report</article-title>
          .
          <source>In In Proc. of IJCAI-91</source>
          , pages
          <fpage>466</fpage>
          -
          <lpage>471</lpage>
          ,
          <year>1991</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21. Christiane Fellbaum.
          <article-title>WordNet: an electronic lexical database</article-title>
          . MIT Press,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22. Mark van Assem,
          <string-name>
            <surname>Aldo Gangemi</surname>
            , and
            <given-names>Guus</given-names>
          </string-name>
          <string-name>
            <surname>Schreiber</surname>
          </string-name>
          .
          <article-title>Rdf/owl representation of wordnet</article-title>
          .
          <source>Technical report, W3C</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <given-names>Amos</given-names>
            <surname>Tversky</surname>
          </string-name>
          .
          <article-title>Features of similarity</article-title>
          .
          <source>Psychological Review</source>
          ,
          <volume>84</volume>
          (
          <issue>4</issue>
          ):
          <fpage>327</fpage>
          -
          <lpage>352</lpage>
          ,
          <year>1977</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Peter C. Weinstein</surname>
            and
            <given-names>William P.</given-names>
          </string-name>
          <string-name>
            <surname>Birmingham</surname>
          </string-name>
          .
          <article-title>Comparing concepts in differentiated ontologies</article-title>
          .
          <source>In Proceedings of the Twelfth Workshop on Knowledge Acquisition, Modeling and Management (KAW'99)</source>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Alexander</surname>
            <given-names>Borgida</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Thomas J.</given-names>
            <surname>Walsh</surname>
          </string-name>
          , and Haym Hirsh.
          <article-title>Towards measuring similarity in description logics</article-title>
          .
          <source>In Proceedings of the 2005 International Workshop on Description Logics (DL2005)</source>
          , Edinburgh, Scotland,
          <string-name>
            <surname>UK</surname>
          </string-name>
          ,
          <source>July 26-28</source>
          ,
          <year>2005</year>
          , volume
          <volume>147</volume>
          .
          <article-title>CEUR-WS</article-title>
          .org,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Roy</surname>
            <given-names>Rada</given-names>
          </string-name>
          , Hafedh Mili, Ellen Bicknell, and
          <string-name>
            <given-names>Maria</given-names>
            <surname>Blettner</surname>
          </string-name>
          .
          <article-title>Development and application of a metric on semantic nets</article-title>
          .
          <source>IEEE Transactions on Systems, Man, and Cybernetics</source>
          ,
          <volume>19</volume>
          (
          <issue>1</issue>
          ):
          <fpage>17</fpage>
          -
          <lpage>30</lpage>
          ,
          <year>1989</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Ted</surname>
            <given-names>Pedersen</given-names>
          </string-name>
          , Siddharth Patwardhan, and
          <string-name>
            <given-names>Jason</given-names>
            <surname>Michelizzi</surname>
          </string-name>
          . Wordnet:
          <article-title>:similarity: Measuring the relatedness of concepts</article-title>
          .
          <source>In Demonstration Papers at HLT-NAACL</source>
          <year>2004</year>
          , HLT-NAACLDemonstrations '
          <volume>04</volume>
          , pages
          <fpage>38</fpage>
          -
          <lpage>41</lpage>
          , Stroudsburg, PA, USA,
          <year>2004</year>
          .
          <article-title>Association for Computational Linguistics</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Anthony</surname>
            <given-names>Fader</given-names>
          </string-name>
          , Stephen Soderland, and
          <string-name>
            <given-names>Oren</given-names>
            <surname>Etzioni</surname>
          </string-name>
          .
          <article-title>Identifying relations for open information extraction</article-title>
          .
          <source>In Proceedings of the Conference of Empirical Methods in Natural Language Processing (EMNLP '11)</source>
          , Edinburgh, Scotland,
          <string-name>
            <surname>UK</surname>
          </string-name>
          , July
          <volume>27</volume>
          -31
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Michele</surname>
            <given-names>Banko</given-names>
          </string-name>
          , Michael J Cafarella, Stephen Soderland, Matt Broadhead, and
          <string-name>
            <given-names>Oren</given-names>
            <surname>Etzioni</surname>
          </string-name>
          .
          <article-title>Open information extraction from the web</article-title>
          .
          <source>In IN IJCAI</source>
          , pages
          <fpage>2670</fpage>
          -
          <lpage>2676</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>Fei</surname>
          </string-name>
          Wu and Daniel S. Weld.
          <article-title>Open information extraction using wikipedia</article-title>
          .
          <source>In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL '10</source>
          , pages
          <fpage>118</fpage>
          -
          <lpage>127</lpage>
          . Association for Computational Linguistics,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31.
          <string-name>
            <surname>Christopher D. Manning</surname>
            , Mihai Surdeanu, John Bauer, Jenny Finkel,
            <given-names>Steven J.</given-names>
          </string-name>
          <string-name>
            <surname>Bethard</surname>
          </string-name>
          , and
          <string-name>
            <surname>David McClosky</surname>
          </string-name>
          .
          <article-title>The Stanford CoreNLP natural language processing toolkit. In Association for Computational Linguistics (ACL) System Demonstrations</article-title>
          , pages
          <fpage>55</fpage>
          -
          <lpage>60</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          32.
          <string-name>
            <surname>Ted</surname>
            <given-names>Pedersen</given-names>
          </string-name>
          , Siddharth Patwardhan, and
          <string-name>
            <given-names>Jason</given-names>
            <surname>Michelizzi</surname>
          </string-name>
          . Wordnet:
          <article-title>Similarity - measuring the relatedness of concepts</article-title>
          .
          <source>In Proceedings of the 19th National Conference on Artifical Intelligence</source>
          ,
          <source>AAAI'04</source>
          , pages
          <fpage>1024</fpage>
          -
          <lpage>1025</lpage>
          . AAAI Press,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          33.
          <string-name>
            <given-names>Hideki</given-names>
            <surname>Shima</surname>
          </string-name>
          .
          <article-title>Wordnet similarity for java (ws4j)</article-title>
          . https://code.google.com/ archive/p/ws4j/.
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          34. SPARQL Working Group.
          <article-title>Sparql 1.1 overview</article-title>
          . www.w3.org/TR/ sparql11-overview/.
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          35.
          <string-name>
            <surname>Elena</surname>
            <given-names>Cabrio</given-names>
          </string-name>
          , Julien Cojan, Alessio Palmero Aprosio, Bernardo Magnini, Alberto Lavelli, and
          <string-name>
            <given-names>Fabien</given-names>
            <surname>Gandon</surname>
          </string-name>
          .
          <article-title>Qakis: an open domain QA system based on relational patterns</article-title>
          .
          <source>In Birte Glimm and David Huynh</source>
          , editors,
          <source>Proceedings of the ISWC 2012 Posters &amp; Demonstrations Track</source>
          , Boston, USA, November
          <volume>11</volume>
          -
          <issue>15</issue>
          ,
          <year>2012</year>
          , volume
          <volume>914</volume>
          <source>of CEUR Workshop Proceedings. CEUR-WS.org</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>