<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">LooPings: a Look at Semantic Similarities</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Adama</forename><surname>Sow</surname></persName>
							<email>adama.sow.4@ulaval.ca</email>
							<affiliation key="aff0">
								<orgName type="department">Département Génie Informatique et Télécommunications</orgName>
								<orgName type="institution">Ecole Polytechnique de Thiès (EPT)</orgName>
								<address>
									<country key="SN">Senegal</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Jean-Rémi</forename><surname>Bourguet</surname></persName>
							<email>jrbourguet@inf.ufes.br</email>
							<affiliation key="aff1">
								<orgName type="department">Dipartimento di Scienze Politiche ed Ingegneria dell&apos;Informazione</orgName>
								<orgName type="institution">Università degli Studi di Sassari (UNISS)</orgName>
								<address>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
							<affiliation key="aff2">
								<orgName type="department">Núcleo de Estudos em Modelagem Conceitual e Ontologias</orgName>
								<orgName type="institution">Federal University of Esprito Santo (UFES)</orgName>
								<address>
									<country key="BR">Brazil</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">LooPings: a Look at Semantic Similarities</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">90F5309076A78B4D87ECDF357897EDE9</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T22:47+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Semantic Similarity</term>
					<term>Ontology</term>
					<term>Description logic</term>
					<term>Natural language processing</term>
					<term>Interface</term>
					<term>Empirical data</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Semantic similarities is a cross-field research in Natural Language Processing and Ontologies with some possible fallouts in Artificial Intelligence. Formerly, similarities were computed following a syntactical treatment to support case-based reasoning. Textual similarities are now guided by semantic machineries, offering various ways to compute relatedness measures. In this paper, we present both a logical and a visual framework aiming to reason with them. For that reason, we introduced FLH ± , a fragment of description logic underpinning the well-known lexical database Wordnet. We illustrated this framework with the path length relatedness, one of the historical similarity measures occurring in a taxonomy. The core of our framework orchestrates the computation of similarity scores supported by REVERB, STANFORD CORENLP and WORD-NET:SIMILARITY APIs and interfaces global similarities in graphical way by positioning them on segments. We also depicted some experimental results to confront our computational framework with some empirical data.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Reasoning with similarities is seen as one of the crucial steps in Artificial Intelligence. Turing, in his paper Computing Machinery and Intelligence <ref type="bibr" target="#b0">[1]</ref>, suggested that a machine having passed the so-called test, should appear as a human 70% of the time after five minutes of conversation. From Joseph Weizenbaum, and his seminal proposal Eliza <ref type="bibr" target="#b1">[2]</ref> in 1966, until the last generation of programs Jabberwacky and Cleverbot developed by Rollo Carpenter <ref type="bibr" target="#b2">[3]</ref> during the last decade, the treatment of similarities attracted a lot of interest to tackle this issue. Formerly, one classical way was to deal with Natural Language Processing (NLP), focusing on syntactic similarities through a case-based reasoning machinery.</p><p>These last years, the Semantic Web <ref type="bibr" target="#b3">[4]</ref> has given complementary resources in terms of knowledge bases, offering new standards to deal with lexical databases. The semantic dimension of similarities is now guided by well-established and moderated repositories of knowledge including ontological layers (see <ref type="bibr" target="#b4">[5]</ref>), but few attempts was performed to combine the classical NLP-based technologies together with semantic web infrastructures in order to analyze the limits of such interactions (see for example <ref type="bibr" target="#b5">[6]</ref>). Moreover, if some methods were also proposed to compute global scores of similarities between two phrases based on local semantic similarities of their components (see <ref type="bibr" target="#b6">[7,</ref><ref type="bibr" target="#b7">8]</ref>), we remark that the usual way to output a set of similarities between a phrase and a set of phrases is an ordered list of items (e.g. the popular search engines), and very few approaches were proposed in the literature in order to screen the similarity scores in different ways (see <ref type="bibr" target="#b8">[9,</ref><ref type="bibr" target="#b9">10]</ref>).</p><p>In this paper, we present a logical and visual framework to represent and reason with textual relatedness measures guided by ontologies. In order to bridge the gap between natural language and ontologies, we introduced a fragment of Description Logic (DL), underpinning the well-known lexical database WORDNET <ref type="bibr" target="#b10">[11]</ref>. With this fragment, we can properly redefine some relatedness measures historically used in taxonomies. To gain place, we only introduced here the path length relatedness measure denoted plr. Nevertheless, other historical similarity measures considering the maximum depth in taxonomy (see lch in <ref type="bibr" target="#b11">[12]</ref>), the depth of the least common subsumer (see wup in <ref type="bibr" target="#b12">[13]</ref>) or the supported information content (see jcn in <ref type="bibr" target="#b13">[14]</ref> and lin in <ref type="bibr" target="#b14">[15]</ref>) have also been integrated in this framework.</p><p>After some preliminaries to introduce our logical framework in section 2, we describe in section 3 both our computation and an interface to screen the similarities between a phrase and a set of phrases. Finally, in section 4 we present an experimental validation to confront the theoretical similarities with some experimental ones.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Preliminaries</head><p>DL is a well-known family of formal knowledge representation models. Semantic languages used on the web to share knowledge (e.g. RDFS <ref type="bibr" target="#b15">[16]</ref>, OWL <ref type="bibr" target="#b16">[17]</ref> and OWL2 <ref type="bibr" target="#b17">[18]</ref>) have all some direct underpinning logics that are fragments of DL. The core interpretation of a DL in first order logic was given by Baader and Nutt in <ref type="bibr" target="#b18">[19]</ref> as follows: Definition 1. Let C the set all the atomic concepts and R the set all the atomic roles, an interpretation I is an ordered pair (∆ I , • I ) such that: -∆ I is the domain, i.e. a non-empty set of individuals, -• I is the interpretation function which maps:</p><p>-each atomic concept A ∈ C to a set A I ⊆ ∆ I , -each atomic role r ∈ R to a binary relation r I ⊆ ∆ I × ∆ I .</p><p>In this paper, we chose to introduce FLH ± defined in Definition 2 as an extension of AL <ref type="bibr" target="#b19">[20]</ref> using transitivity and hierarchy for roles but removing negation, intersection, limited existential and value restrictions. Definition 2. Let an interpretation I, {A, B} ⊆ C and {r, s} ⊆ R</p><formula xml:id="formula_0">I = ∆ I top concept ⊥ I = ∅ bottom concept (A B) I = A I ⊆ B I inclusion axiom (r s) I = r I ⊆ s I role hierarchy (r + ) I = (a, b) ∈ r I ∧ (b, c) ∈ r I → (a, c) ∈ r I transitivity (dom(r) ≡ A) I = ∀a, b ∈ ∆ I .a ∈ A I ∪ (a, b) / ∈ r I domain (ran(r) ≡ B) I = ∀a, b ∈ ∆ I .b ∈ B I ∪ (a, b) / ∈ r I range</formula><p>As redefined in <ref type="bibr" target="#b18">[19]</ref>, a Terminology Box or a TBox in DL is a finite set of axioms, with no symbolic name (equality whose left-hand side is an atomic concept), which is defined more than once. Princeton's WORDNET is a lexical database for the English language <ref type="bibr" target="#b20">[21]</ref>. A decade after the creation of this database, van Assem <ref type="bibr" target="#b21">[22]</ref> proposed a conversion of WORDNET in OWL. Example 1 presents a sample of TBox underpinning WORDNET in OWL introducing particularly the concept SYNSET <ref type="foot" target="#foot_0">4</ref> and the role h<ref type="foot" target="#foot_1">5</ref> .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Example 1 (Sample of WORDNET</head><formula xml:id="formula_1">TBox). SYNSET , dom(h) ≡ SYNSET, ran(h) ≡ SYNSET, h + .</formula><p>In the terms of Baader and Nuts <ref type="bibr" target="#b18">[19]</ref>, an Assertional Box or an ABox is a specific state of affairs of an application domain in terms of concepts and roles. Example 2 depicts a graph representing a sample of ABox underpinning WORDNET in OWL where n represents the individual root of all the individual nouns present in WORDNET, the edge represents synsets (e.g. SYNSET(Astract)) and the directed arcs represent the hyponym relation assertions between two individuals (e.g. h(Attribute, Abstract)). Note that the WORDNET Abox is partitioned in different set, dealing with different parts of speech (nouns, verbs, adjectives and adverbs).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Example 2 (Sample of WORDNET ABox).</head><p>n Astract Relation Attribute</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>P hysical</head><p>The idea of computing relatedness measures between concepts occurring in a taxonomy is an old issue (see <ref type="bibr" target="#b22">[23]</ref>). Few approaches (see <ref type="bibr" target="#b23">[24,</ref><ref type="bibr" target="#b24">25]</ref>) have described the common relatedness measures through a DL formalism. According to Rada <ref type="bibr" target="#b25">[26]</ref>, a path length is founded on a node-counting scheme concerning the smallest specified role counting between two individuals. Given an interpretation and two individuals, we redefined the shortest path length function denoted L as follows:</p><formula xml:id="formula_2">Definition 3. Let an interpretation I, {u k , v l } ∈ ∆ I and h ∈ R L(u k , v l ) = min(2n-(k+l)) s.t. ∀(i, j) ∈ [[k, n]×[[l, n]. h(ui, ui+1)∧h(vj, vj+1)∧un=vn</formula><p>Note that if a practitioner deals with the transitive closure of the WORDNET ABox, a min max operator may be used due to the transitivity of the relation h. Example 3 lists the shortest path length between individual nouns introduced in Example 2. We denote v the individual root of all the verbs. The ABox being partitioned, note that no verb is involved in a hyponym relation with a noun (and vice-versa).</p><p>Example 3 (Shortest Path Length).</p><formula xml:id="formula_3">L(Relation, Abstract) = 1 L(Relation, Attribute) = 2 L(P hysical, P hysical) = 0 L(P hysical, v ) → ∞</formula><p>As reintroduced by Pedersen <ref type="bibr" target="#b26">[27]</ref>, the path length based relatedness score is equal to the inverse of the shortest path length between two concepts. Naturally, it is inversely proportional to the number of nodes along the shortest path between the synsets. The path length based relatedness denoted plr is defined as follows:</p><formula xml:id="formula_4">Definition 4. Let an interpretation I, {u, v} ∈ ∆ I plr(u, v) = 1 1+L(u,v)</formula><p>If no path exists (e.g. between a verb and a noun) we consider that L tends to ∞, and then the path length based relatedness score approaches 0. The highest possible score occurs when the two synsets are the same, in which case the score is 1.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Example 4 (Path Length based Relatedness).</head><p>plr(Relation, Abstract) = 1/2 plr(Relation, Attribute) = 1/3 plr(P hysical, P hysical) = 1 plr(P hysical, v ) → 0</p><p>In the next section, we present our method to compute similarity scores between two phrases based on the local semantic similarities of their components (e.g. plr), moreover we depict a way to screen them through a graphical user interface.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Computational Framework</head><p>The finality of our framework is to screen similarity scores between one phrase and a set of phrases. We chose to investigate the way transforming a phrase in a set of triples before performing the computation of similarities. We selected the API REVERB, an information extractor for massive corpora (working without pre-specified vocabulary). In <ref type="bibr" target="#b27">[28]</ref>, REVERB was judged with a extraction precision of 0,8 or higher in at least 30% of the time, substantially outperforming others extractors like TEXTRUNNER (see <ref type="bibr" target="#b28">[29]</ref>) and WOE (see <ref type="bibr" target="#b29">[30]</ref>). Formally and for the following, we can see a phrase p as a set of triples {t 1 , . . . , t n } where t l = t 1 l , t 2 l , t 3 l with t k l ∈ ∆ I . Thereafter, we define the global score between two phrases as the maximum of the averages of similarities between the components from the triples (average of similarities between subjects, objects, and complements). Moreover, we arbitrary avoid to take into account two kinds of scores: scores of 0, it is the case when at least one entry is not present in the lexical base of WORDNET (see section 2), and the score of 1 (similar strings) only if the individuals in comparison are present in a stop words set denoted θ.</p><p>Definition 5. Let θ and two phrases p, q, the Score of similarity is defined as follows:</p><formula xml:id="formula_5">S(p, q)=max ti,tj (avg({s(t k i , t k j )| s(t k i , t k j ) = 0 ∧ (s(t k i , t k j ) = 1 ∨ t k i , t k j / ∈ θ)}))</formula><p>with ti ∈ p, tj ∈ q, k ∈ [ <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b2">3]</ref>], s ∈ {plr, lch, wup, jcn, lin} and S the upper case of s.</p><p>To support this computation, we used the STANFORD CORENLP API <ref type="bibr" target="#b30">[31]</ref> to perform the lemmatization of the components from the triples. The similarities calculus of the lemme were performed by WORDNET:SIMILARITY (developed by Pedersen et al. in <ref type="bibr" target="#b31">[32]</ref> and redesigned in Java by Shima <ref type="bibr" target="#b32">[33]</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Fig. 1: The system description of LooPings</head><p>All this treatment is included in the framework LooPings <ref type="foot" target="#foot_2">6</ref> (Lexical and ontological observations to Plot ingathering similarities). Looking at the system description of Fig. <ref type="figure">1</ref>, LooPings is decomposed in several modules:</p><p>INPUT takes a phrase and some parameters (e.g. local measure, stop words list, etc.). LIBRARIES comprise REVERB, STANFORD CORENLP and WS4J API (bundled with WORDNET). CORE manages the computation. OUTPUT is devoted to screen and to plot the similarity scores.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Fig. 2: An interface of LooPings</head><p>One of the most challenging steps is the connection between the NLP APIs RE-VERB and WS4J. The main aspect that made this compatibility a little immature is the fact that REVERB can output expressions (several raw words) while WS4J accepts only a single element as input (a lemme or an expression of lemmes linked themselves in one string by underscores). Then, we had to perform a treatment from our raw phrases, following some basic heuristics, as described through the example below:</p><p>What can we learn from the neural networks of C.elegans to understand human brains?</p><p>The outputted triple is we,learn from,the neural networks of C.elegans .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Lemmatization:</head><p>we,learn from,the neural network of C.elegans 2. Chunking:</p><p>we,learn from,the neural network C.elegans 3. Stop words removing:</p><p>we, learn, neural network C.elegans 4. Cutting:</p><p>we, learn, neural network ; we, learn, C.elegans</p><p>Note that during the step of the removal, stop words were never removed from the triples when it involved an empty set for one component. Fig. <ref type="figure">2</ref> presents the interface of LooPings, where practitioners can visualize and confront the semantic similarities. Note that this framework is oriented to integrate queries and SPARQL <ref type="bibr" target="#b33">[34]</ref> interpretations allowing a possible integration of other semantic web technologies. The similarity scores are represented on a segment [0,1].</p><p>The last section is dedicated to the confrontation between the theoretical scores with some experimental ones.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Experimental Validation</head><p>In order to deal with real queries, we used STACK EXCHANGE API allowing us to extract from a website some structured data (in JSON format) about different questionand-answer themes. The total number of available queries for the Cognitive Science section of STACK EXCHANGE website was 1245, we decided to extract the 1200 first queries (w.r.t. the extraction order), from each of them we stored ids (from 1 to 4697) and queries. Formally, a query referenced by an id is denoted q id , moreover we call series S j a set of queries: S j ⊆ {q id |1 ≤ id ≤ 4697}. A tricky aspect was the fact that REVERB outputted nothing in 70.75 percent of queries. So, we decided to divide, the 1200 queries in 12 series of 100 queries, finally giving a median series of 28 queries. Due to space limitation, we will depict in this article only two series (A and B).</p><p>Once the extraction was performed, we created an on-line semantic recall based experience. For each series, we designed a witness query. We attached a questionnaire to each series and sent them by e-mail to mailing lists of PhD students. The requirement for participating was to be proficient in English. Questionnaires were designed as follows:</p><p>1. Instruction to read carefully a witness query. 2. Instruction to read the series of queries and to check 1, 2 or 3 queries among them, (at least 1, at most 3), the most similar, for the user, to the witness queries. 3. Instruction to partially preorder the selected query(ies).</p><p>We succeeded in obtaining 50 volunteers. We gave accumulated marks for queries in each series w.r.t the preorders given by the volunteers. For example if a volunteer selected only one query q i in the series, the accumulated mark for q i was increased by 6, the remaining possible situations are listed below: q i q j q i (+4); q j (+2) q i ∼ q j q i , q j (+3) q i q j q k q i (+3); q j (+2); q k (+1) q i ∼ q j ∼ q k q i , q j , q k (+2) q i q j ∼ q k q i (+3); q j , q k (+1.5) q i ∼ q j q k q i , q j (+2.5); q k (+1)</p><p>The maximum mark is translated to an experimental score of 1, after what all the other marks have been transposed in experimental scores by cross-multiplications. As depicted in Fig. <ref type="figure" target="#fig_0">3</ref>, we took the PLR score as a reference in order to observe how the other scores behaved following it. Thereafter, we describe what is globally remarkable in the behavior of the theoretical similarities.</p><p>-The higher the PLR is, the lower is the difference between scores founded on path lengths (LCH, WUP) and scores founded on information contents (JCN, LIN). -WUP and LIN react in the same way in the case of a sudden increase or drop.</p><p>-LIN and JCN scores have a behavior of exponential shape in all the series. Nevertheless, there are some limitations for this framework. We relate for example the case of a saturation. Series B is remarkable by the fact that 5 queries are outputted with the maximal score.</p><p>Where could I find psychological experiments tools? The outputted triples were I,find,psychological ; I,find,experiment ; I,find,tool</p><p>Here, the systematic presence of two stop words "I" and "find" make that all the similarity scores are 1 iff one of the words "psychological", "experiment" or "tool" is present in one of the extracted triples. This experience showed some natural limits concerning our approach. The main issue seems to be the automatic semantic annotation step (REVERB) and the matching step with WORDNET individuals (through WORDNET:SIMILARITY).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Conclusion</head><p>In this paper, we presented both a computational and a visual framework to support semantic similarities guided by ontologies. The first contribution was to redefine some taxonomy-based relatedness measures (e.g. path length relatedness) in a DL fragment (FLH ± ) we introduced. The core of our framework orchestrates the computation of similarity scores supported by REVERB, STANFORD CORENLP and WORDNET:SIMILARITY APIs and interfaces results in graphical way through segments. Moreover, we plotted the behavior of our computations by designing a semantic recall-based experience to confront empirical similarities with the theoretical ones. Some research perspectives for this work would be to extend LooPings in two ways. The first concerns the core of our framework by interpreting scores founded on other relations like for instance the mereology in WORDNET. The second is to integrate other repositories to support the similarities or other frameworks to compute similarities directly on semantic web languages (e.g. using the technology of QAKiS <ref type="bibr" target="#b34">[35]</ref>).</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 3 :</head><label>3</label><figDesc>Fig. 3: Some plots of LooPings</figDesc><graphic coords="8,134.77,311.53,345.83,124.77" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0"><head></head><label></label><figDesc></figDesc><graphic coords="6,134.77,200.14,345.83,238.70" type="bitmap" /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_0">Short for "Synonym set" to call a non-empty set of synonyms.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_1">Short for "hyponym" to call the well-known "hyponymy" relation.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_2">The framework LooPings is available at http://nemo.inf.ufes.br/?p=1185.</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Computing machinery and intelligence</title>
		<author>
			<persName><forename type="first">Alan</forename><surname>Mathison</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Turing</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Mind</title>
		<imprint>
			<biblScope unit="volume">59</biblScope>
			<biblScope unit="issue">236</biblScope>
			<biblScope unit="page" from="433" to="460" />
			<date type="published" when="1950">1950</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Eliza;a computer program for the study of natural language communication between man and machine</title>
		<author>
			<persName><forename type="first">Joseph</forename><surname>Weizenbaum</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Commun. ACM</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="36" to="45" />
			<date type="published" when="1966">1966</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<title level="m" type="main">Computing machinery and the individual: The Personal Turing Test</title>
		<author>
			<persName><forename type="first">Rollo</forename><surname>Carpenter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jonathan</forename><surname>Freeman</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2005">2005</date>
		</imprint>
		<respStmt>
			<orgName>Jabberwacky</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Technical report</note>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">The semantic web</title>
		<author>
			<persName><forename type="first">Tim</forename><surname>Berners-Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">James</forename><surname>Hendler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ora</forename><surname>Lassila</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Scientific american</title>
		<imprint>
			<biblScope unit="volume">284</biblScope>
			<biblScope unit="issue">5</biblScope>
			<biblScope unit="page" from="28" to="37" />
			<date type="published" when="2001">2001</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Ontologies improve text document clustering</title>
		<author>
			<persName><forename type="first">Andreas</forename><surname>Hotho</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Steffen</forename><surname>Staab</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Gerd</forename><surname>Stumme</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2003 IEEE International Conference on Data Mining</title>
				<meeting>the 2003 IEEE International Conference on Data Mining</meeting>
		<imprint>
			<publisher>IEEE Computer Society</publisher>
			<date type="published" when="2003">November 19-22, 2003</date>
			<biblScope unit="page" from="541" to="544" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Exploiting prolog and nlp techniques for matching ontologies and for repairing correspondences</title>
		<author>
			<persName><forename type="first">Viviana</forename><surname>Mascardi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Angela</forename><surname>Locoro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Fabrizio</forename><surname>Larosa</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 24th Convegno Italiano di Logica Computazionale</title>
				<meeting>the 24th Convegno Italiano di Logica Computazionale</meeting>
		<imprint>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="10" to="24" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Sentence similarity based on semantic nets and corpus statistics</title>
		<author>
			<persName><forename type="first">Yuhua</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">David</forename><surname>Mclean</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Zuhair</forename><surname>Bandar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O'</forename><surname>James</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Keeley</forename><forename type="middle">A</forename><surname>Shea</surname></persName>
		</author>
		<author>
			<persName><surname>Crockett</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Trans. Knowl. Data Eng</title>
		<imprint>
			<biblScope unit="volume">18</biblScope>
			<biblScope unit="issue">8</biblScope>
			<biblScope unit="page" from="1138" to="1150" />
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">The evaluation of sentence similarity measures</title>
		<author>
			<persName><forename type="first">Palakorn</forename><surname>Achananuparp</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Xiaohua</forename><surname>Hu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Xiajiong</forename><surname>Shen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 10th International Conference on Data Warehousing and Knowledge Discovery, DaWaK &apos;08</title>
				<meeting>the 10th International Conference on Data Warehousing and Knowledge Discovery, DaWaK &apos;08</meeting>
		<imprint>
			<publisher>Springer-Verlag</publisher>
			<date type="published" when="2008">2008</date>
			<biblScope unit="page" from="305" to="316" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Grouper: A dynamic clustering interface to web search results</title>
		<author>
			<persName><forename type="first">Oren</forename><surname>Zamir</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Oren</forename><surname>Etzioni</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Comput. Netw</title>
		<imprint>
			<biblScope unit="volume">31</biblScope>
			<biblScope unit="issue">11-16</biblScope>
			<biblScope unit="page" from="1361" to="1374" />
			<date type="published" when="1999-05">May 1999</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Distributed representations of words and phrases and their compositionality</title>
		<author>
			<persName><forename type="first">Tomas</forename><surname>Mikolov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ilya</forename><surname>Sutskever</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kai</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Gregory</forename><forename type="middle">S</forename><surname>Corrado</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jeffrey</forename><surname>Dean</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013</title>
				<meeting><address><addrLine>Lake Tahoe, Nevada, United States</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2013">December 5-8, 2013. 2013</date>
			<biblScope unit="page" from="3111" to="3119" />
		</imprint>
	</monogr>
	<note>Proceedings of a meeting held</note>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Wordnet: A lexical database for english</title>
		<author>
			<persName><forename type="first">George</forename><forename type="middle">A</forename><surname>Miller</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Communications of the ACM</title>
		<imprint>
			<biblScope unit="volume">38</biblScope>
			<biblScope unit="page" from="39" to="41" />
			<date type="published" when="1995">1995</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Combining local context and wordnet similarity for word sense identification</title>
		<author>
			<persName><forename type="first">Claudia</forename><surname>Leacock</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Martin</forename><surname>Chodorow</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">WordNet: An Electronic Lexical Database</title>
				<imprint>
			<date type="published" when="1998">1998</date>
			<biblScope unit="page" from="265" to="283" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Verb semantics and lexical selection</title>
		<author>
			<persName><forename type="first">Zhibiao</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Martha</forename><forename type="middle">Stone</forename><surname>Palmer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Annual Meeting of the Association for Computational Linguistics</title>
				<editor>
			<persName><forename type="first">James</forename><surname>Pustejovsky</surname></persName>
		</editor>
		<meeting><address><addrLine>New Mexico State University, Las Cruces, New Mexico, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Morgan Kaufmann Publishers / ACL</publisher>
			<date type="published" when="1994-06-30">27-30 June 1994. 1994</date>
			<biblScope unit="page" from="133" to="138" />
		</imprint>
	</monogr>
	<note>Proceedings</note>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Semantic similarity based on corpus statistics and lexical taxonomy</title>
		<author>
			<persName><forename type="first">Jay</forename><surname>Jiang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">David</forename><surname>Conrath</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of the Int&apos;l. Conf. on Research in Computational Linguistics</title>
				<meeting>of the Int&apos;l. Conf. on Research in Computational Linguistics</meeting>
		<imprint>
			<date type="published" when="1997">1997</date>
			<biblScope unit="page" from="19" to="33" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">An information-theoretic definition of similarity</title>
		<author>
			<persName><forename type="first">Dekang</forename><surname>Lin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Fifteenth International Conference on Machine Learning, ICML &apos;98</title>
				<meeting>the Fifteenth International Conference on Machine Learning, ICML &apos;98<address><addrLine>San Francisco, CA, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Morgan Kaufmann Publishers Inc</publisher>
			<date type="published" when="1998">1998</date>
			<biblScope unit="page" from="296" to="304" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<ptr target="www.w3.org/TR/rdf-schema/" />
		<title level="m">Resource description framework schema</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<ptr target="www.w3.org/TR/owl-features/" />
		<title level="m">Web ontology language</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<monogr>
		<ptr target="owl2).www.w3.org/TR/owl2-overview/" />
		<title level="m">Web ontology language 2</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Basic description logics</title>
		<author>
			<persName><forename type="first">Franz</forename><surname>Baader</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Werner</forename><surname>Nutt</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Description Logic Handbook</title>
				<imprint>
			<publisher>Cambridge University Press</publisher>
			<date type="published" when="2003">2003</date>
			<biblScope unit="page" from="43" to="95" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">A correspondence theory for terminological logics: Preliminary report</title>
		<author>
			<persName><forename type="first">Klaus</forename><surname>Schild</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of IJCAI-91</title>
				<meeting>of IJCAI-91</meeting>
		<imprint>
			<date type="published" when="1991">1991</date>
			<biblScope unit="page" from="466" to="471" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<title level="m" type="main">WordNet: an electronic lexical database</title>
		<author>
			<persName><forename type="first">Christiane</forename><surname>Fellbaum</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1998">1998</date>
			<publisher>MIT Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<monogr>
		<title level="m" type="main">Rdf/owl representation of wordnet</title>
		<author>
			<persName><forename type="first">Aldo</forename><surname>Mark Van Assem</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Guus</forename><surname>Gangemi</surname></persName>
		</author>
		<author>
			<persName><surname>Schreiber</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2008">2008</date>
			<pubPlace>W3C</pubPlace>
		</imprint>
	</monogr>
	<note type="report_type">Technical report</note>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Features of similarity</title>
		<author>
			<persName><forename type="first">Amos</forename><surname>Tversky</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Psychological Review</title>
		<imprint>
			<biblScope unit="volume">84</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="327" to="352" />
			<date type="published" when="1977">1977</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Comparing concepts in differentiated ontologies</title>
		<author>
			<persName><forename type="first">C</forename><surname>Peter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">William</forename><forename type="middle">P</forename><surname>Weinstein</surname></persName>
		</author>
		<author>
			<persName><surname>Birmingham</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Twelfth Workshop on Knowledge Acquisition, Modeling and Management (KAW&apos;99</title>
				<meeting>the Twelfth Workshop on Knowledge Acquisition, Modeling and Management (KAW&apos;99</meeting>
		<imprint>
			<date type="published" when="1999">1999</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Towards measuring similarity in description logics</title>
		<author>
			<persName><forename type="first">Alexander</forename><surname>Borgida</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Thomas</forename><forename type="middle">J</forename><surname>Walsh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Haym</forename><surname>Hirsh</surname></persName>
		</author>
		<ptr target="CEUR-WS.org" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2005 International Workshop on Description Logics (DL2005)</title>
				<meeting>the 2005 International Workshop on Description Logics (DL2005)<address><addrLine>Edinburgh, Scotland, UK</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2005">July 26-28, 2005. 2005</date>
			<biblScope unit="page">147</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Development and application of a metric on semantic nets</title>
		<author>
			<persName><forename type="first">Roy</forename><surname>Rada</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Hafedh</forename><surname>Mili</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ellen</forename><surname>Bicknell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Maria</forename><surname>Blettner</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Transactions on Systems, Man, and Cybernetics</title>
		<imprint>
			<biblScope unit="volume">19</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="17" to="30" />
			<date type="published" when="1989">1989</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">Wordnet::similarity: Measuring the relatedness of concepts</title>
		<author>
			<persName><forename type="first">Ted</forename><surname>Pedersen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Siddharth</forename><surname>Patwardhan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jason</forename><surname>Michelizzi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Demonstration Papers at HLT-NAACL 2004</title>
				<meeting><address><addrLine>Stroudsburg, PA, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2004">2004</date>
			<biblScope unit="page" from="38" to="41" />
		</imprint>
	</monogr>
	<note>HLT-NAACL-Demonstrations &apos;04</note>
</biblStruct>

<biblStruct xml:id="b27">
	<analytic>
		<title level="a" type="main">Identifying relations for open information extraction</title>
		<author>
			<persName><forename type="first">Anthony</forename><surname>Fader</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Stephen</forename><surname>Soderland</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Oren</forename><surname>Etzioni</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Conference of Empirical Methods in Natural Language Processing (EMNLP &apos;11)</title>
				<meeting>the Conference of Empirical Methods in Natural Language Processing (EMNLP &apos;11)<address><addrLine>Edinburgh, Scotland, UK</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2011">July 27-31 2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<analytic>
		<title level="a" type="main">Open information extraction from the web</title>
		<author>
			<persName><forename type="first">Michele</forename><surname>Banko</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Michael</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Stephen</forename><surname>Cafarella</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Matt</forename><surname>Soderland</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Oren</forename><surname>Broadhead</surname></persName>
		</author>
		<author>
			<persName><surname>Etzioni</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IN IJCAI</title>
				<imprint>
			<date type="published" when="2007">2007</date>
			<biblScope unit="page" from="2670" to="2676" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<analytic>
		<title level="a" type="main">Open information extraction using wikipedia</title>
		<author>
			<persName><forename type="first">Fei</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Daniel</forename><forename type="middle">S</forename><surname>Weld</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL &apos;10</title>
				<meeting>the 48th Annual Meeting of the Association for Computational Linguistics, ACL &apos;10</meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2010">2010</date>
			<biblScope unit="page" from="118" to="127" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b30">
	<analytic>
		<title level="a" type="main">The Stanford CoreNLP natural language processing toolkit</title>
		<author>
			<persName><forename type="first">D</forename><surname>Christopher</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mihai</forename><surname>Manning</surname></persName>
		</author>
		<author>
			<persName><forename type="first">John</forename><surname>Surdeanu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jenny</forename><surname>Bauer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Steven</forename><forename type="middle">J</forename><surname>Finkel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">David</forename><surname>Bethard</surname></persName>
		</author>
		<author>
			<persName><surname>Mcclosky</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Association for Computational Linguistics (ACL) System Demonstrations</title>
				<imprint>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page" from="55" to="60" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<analytic>
		<title level="a" type="main">Wordnet: Similarity -measuring the relatedness of concepts</title>
		<author>
			<persName><forename type="first">Ted</forename><surname>Pedersen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Siddharth</forename><surname>Patwardhan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jason</forename><surname>Michelizzi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 19th National Conference on Artifical Intelligence, AAAI&apos;04</title>
				<meeting>the 19th National Conference on Artifical Intelligence, AAAI&apos;04</meeting>
		<imprint>
			<publisher>AAAI Press</publisher>
			<date type="published" when="2004">2004</date>
			<biblScope unit="page" from="1024" to="1025" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b32">
	<monogr>
		<author>
			<persName><forename type="first">Hideki</forename><surname>Shima</surname></persName>
		</author>
		<ptr target="https://code.google.com/archive/p/ws4j/" />
		<title level="m">Wordnet similarity for java</title>
				<imprint/>
	</monogr>
	<note>ws4j</note>
</biblStruct>

<biblStruct xml:id="b33">
	<monogr>
		<ptr target=".1overview.www.w3.org/TR/sparql11-overview/" />
		<title level="m">Sparql 1</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b34">
	<analytic>
		<title level="a" type="main">Qakis: an open domain QA system based on relational patterns</title>
		<author>
			<persName><forename type="first">Elena</forename><surname>Cabrio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Julien</forename><surname>Cojan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Alessio</forename><surname>Palmero Aprosio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bernardo</forename><surname>Magnini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Alberto</forename><surname>Lavelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Fabien</forename><surname>Gandon</surname></persName>
		</author>
		<ptr target="WS.org" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the ISWC 2012 Posters &amp; Demonstrations Track</title>
		<title level="s">CEUR Workshop Proceedings</title>
		<editor>
			<persName><forename type="first">Birte</forename><surname>Glimm</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">David</forename><surname>Huynh</surname></persName>
		</editor>
		<meeting>the ISWC 2012 Posters &amp; Demonstrations Track<address><addrLine>Boston, USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2012">November 11-15, 2012. 2012</date>
			<biblScope unit="volume">914</biblScope>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
