<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Named Entity Disambiguation using Freebase and Syntactic Parsing</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>LATL, Department of linguistics University of Geneva Switzerland</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>Named Entity Disambiguation (NED) is a fundamental task of semantic annotation for the Semantic Web. The task of Word Sense Disambiguation (WSD) in Ontology-Based Information Extraction (OBIE) aims to establish a link between the textual entity mention and the corresponding class in the ontology. In this paper, we propose a NED process integrated in a rule-based OBIE system for French. We show that our SVM approach can improve disambiguation e ciency using syntactic features provided by the Fips parser and popularity score features extracted from the Freebase knowledge base.</p>
      </abstract>
      <kwd-group>
        <kwd>Kamel Nebhi</kwd>
        <kwd>Named Entity Disambiguation</kwd>
        <kwd>Syntactic Parsing</kwd>
        <kwd>Linked Open Data</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        The realization of the Web of data on a large scale implies the widespread
annotation of Web documents with ontology base knowledge markup [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. To
encourage the emergence of solutions, OBIE seems to be a mature NLP technology to
introduce supplementary information and knowledge into a document.
      </p>
      <p>OBIE has been conceived only a few years ago and has recently emerged as
a sub eld of Information Extraction (IE). OBIE is di erent from traditional IE
because it nds type of extracted entity by linking it to its semantic description
in a formal ontology. The main di culty of OBIE is the disambiguation process,
which identi es the meaning of a word when it has multiple senses. For
example, the string \Washington" is used to refer to more than 90 di erent NE in
DBpedia database.</p>
      <p>In this paper, we present a Named Entity Disambiguation (NED) process
integrated in a rule-based OBIE system for French. We show that our SVM approach
can improve disambiguation e ciency using syntactic features provided by the
Fips parser and popularity score features extracted from the Freebase knowledge
base.</p>
      <p>
        This paper is divided as follows. In Section 2 we describe related work to
Word Sense Disambiguation. Then, we present our approach in Section 3. Next,
we show our experimental setup for testing in Section 4. Finally, we summarize
the paper.
In 1949, WSD was introduced as a fundamental task of machine translation [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ].
WSD consists in determining which sense of a word is used when it appears in a
particular context. In recent years, WSD is also an intermediate task in several
NLP applications as information retrieval [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] or information extraction [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>For OBIE, the task of WSD aims to establish a link between the textual
entity mention and the corresponding class in the ontology. The researches on
WSD for IE task are divided on two main approaches: unsupervised approach
and supervised approach. Recent works generally use additional resources as
knowledge bases to improve traditional methods.</p>
      <p>
        WSD unsupervised approach for IE uses clustering word occurrences to
induce word senses and do not exploit any manually sense-tagged corpus. One of
the rst works in the domain of disambiguation of named entity was in 1998
by [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. They create context vectors for each occurrence of a name. Next, they
compute the similarity among names using the cosine measure. The results were
encouraging with an F-Measure of 84 percent. In 2003, the KIM semantic
annotation platform [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] uses a basic rule-based approach without knowledge
resources to disambiguate named entity. The IE system obtains an F-Measure of
84 percent. Recently, in 2011, [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] proposes an approach based on a resource of
contextual words called LinkedData Inferface (LDI). The disambiguation task
uses the Semantic Disambiguation Algorithm (SDA), which identify the item
in the LDI that is most similar to the context of the named entity. The
system shows very promising results with 90 percent in French and 86 percent in
English.
      </p>
      <p>
        WSD supervised approach for IE uses machine-learning techniques to learn a
classi er from manually sense-annotated data sets. In 2006, [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] resolves entities
using an SVM kernel trained on Wikipedia. This approach generates a ranked
list of plausible entities. Experimental results show that the method improves
accuracy. In 2007, [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] suggests a disambiguation process based on information
extracted from Wikipedia and Web search results. They use a similarity
measure and richer features for the similarity comparison. The IE system obtains a
precision of 88 percent. A few years later, in 2009, [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] uses a kernel classi er
to determine the right matching entity in Wikipedia, which is considered as an
unambiguous reference. The model is trained on 100000 Wikipedia articles for
German entity disambiguation. The F-Measure of the system was reported to be
about 80 percent. In 2011, [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] presents a robust disambiguation approach using
knowledge bases like YAGO and several measures, such as popularity prior and
keyphrase based similarity, to build a weighted graph. To improve this method,
they also consider the syntactic context of the mention using a large corpus for
training. The approach obtains good results with a precision of 81 percent.
Nevertheless, the more advanced con gurations of the system did not use syntactic
context.
      </p>
    </sec>
    <sec id="sec-2">
      <title>Approach</title>
      <p>The task is divided into three steps in our OBIE system. In this section, we
describe the di erent steps of the proposed system.
3.1</p>
      <sec id="sec-2-1">
        <title>Ontology-based named entity recognition</title>
        <p>
          In the rst step, we use an OBIE system for French using a rule-based approach
to recognize entities in text. Our application [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] is built on GATE [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] to annotate
entities and relate them to the DBpedia ontology1. The GATE application
consists of a set of processing resources, executed in a pipeline over a corpus of news
articles. The pipeline consists of 5 parts: linguistic pre-processing, gazetteers,
rule-based semantic annotation2 and nal RDF output.
3.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Entity Candidate</title>
        <p>
          LOD refers to data published with a number of best practices based on W3C
standards for publishing and connecting structured data on the Web [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. In the
past few years, we have assisted to a growth in LOD publishing on the Web,
leading to 31 billion RDF triples published online. In this context, using these
resources as complementary information can enhance IE [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ].
        </p>
        <p>
          In the second step, we use Freebase3 [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] to obtain entity candidate. For each
entity detected by our OBIE system, we request the Freebase Suggest API to
have all possible entities for a surface form.
3.3
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>Named Entity Disambiguation</title>
        <p>The NED task can be modeled as a classi cation problem. In the third step,
a named entity disambiguation SVM classi er is trained using syntactic and
popularity score features.</p>
        <p>Popularity score features. This score is provided by Freebase Search API.
Popularity score of entities can be seen as a probabilistic estimation based on
Wikipedia frequencies in link anchor. The Freebase Search API allows access to
Freebase data given a text query. A large number of lter constraints are
supported to better aim the search at the entities being looked for. For example, the
highest ranked result for the query \Washington" is the city \Washington,D.C.".</p>
        <p>
          Syntactic features. Syntactic Features (SF) are commonly used in WSD
classi cation [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] because they are much richer in information than bag-of-words
features4. The syntactic parser Fips [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ] was used to annotate the training data.
1 We used DBpedia interlinking to resolve ontological di erences between the schema
provided by Freebase and the DBpedia ontology.
2 The grammar rules are written in a language called JAPE which is a nite state
transducer.
3 Freebase is a collaborative knowledge base that contains structured data of almost
23 million entities.
4 Bag-of-words features only specify words in symmetric window around target
ignoring position.
        </p>
        <p>Fips was developed over the last decade in our laboratory, LATL. It is a
deep symbolic parser based on generative grammar concepts for its linguistic
component and object-oriented design for its implementation. The parser uses a
bottom up parsing algorithm with parallel treatment of alternatives, as well as
heuristics to rank alternatives.</p>
        <p>
          As remarked by [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ], incorporating syntactic information is a strong clue
about the interpretation of a word. Our algorithm uses the context around the
surface forms. For this, we construct a context from all target entities in news
articles using syntactic information based on dependency structures provided
by Fips. From the output of the parser, we extract syntactic (binary) relations
between constituents (subject-verb, verb-object, verb-complement, etc.). For
example, from \Hollande souhaite diminuer le nombre de deputes" a subject-verb
relation is extracted between \Hollande" and \souhaite". The SF also includes
information on each of its words such as part-of-speech and lemma.
        </p>
        <p>Algorithm. The input of the algorithm is a set of ambiguous entities. For
each ambiguous entity we nd a set of candidate entities. Then, we use popularity
scores and syntactic information as features to train an SVM classi er. Table 1
shows examples of NED for a sentence of LeMonde.fr.</p>
        <p>News
Extracted
Mentions
Candidate</p>
        <p>Entities</p>
        <p>John Kerry a declare que Washington prevoyait de fournir une</p>
        <p>aide a l'opposition syrienne.5</p>
        <p>John Kerry (senator), Washington (city)
fJohn Kerry (senator), John Kerry (author)g, fGeorges
Washington (president), Washington (city), Denzel Washington</p>
        <p>(actor))g</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Experiments</title>
      <sec id="sec-3-1">
        <title>Data set</title>
        <p>The data set we use for our experiments is a collection of 1100 news articles6
extracted from LeMonde.fr. We designate 2/3 of the data as the training set and
the remaining 1/3 as the test set. For the experiments, we trained our model
with an SVM classi er on 735 news articles.
5 John Kerry said Washington planned to provide assistance to the Syrian opposition.
6 This set contains approximately 20 named entities/news article on average.</p>
        <p>To build the training set, we used a semi-automatic method. To start, the
NE detection task was automatically done by our ontology-based named entity
recognition system (cf. section 3.1). In addition, we removed or corrected
manually the wrong semantic links to the DBpedia ontology classes. Finally, we used
the parser Fips and the Freebase Search API to produce syntactic and popularity
score features.
4.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Results</title>
        <p>
          We evaluate our system on a set of 365 test articles7. Traditional measures like
Precision, Recall and F-Measure are inadequate when dealing with ontologies,
thus we used the Balanced Distance Metric (BDM) which is useful to measure
performance of OBIE systems taking into account ontological similarity [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ].
        </p>
        <p>F1
BDM F1</p>
        <p>OBIE without
disambiguation
0.65
0.69</p>
        <p>Popularity</p>
        <p>Score
0.70
0.73</p>
        <p>Popularity Score + Syntactic</p>
        <p>Parsing
0.86
0.90</p>
        <p>In table 2, the OBIE system without disambiguation process achieved a
traditional F-Measure of 65% and an augmented F-Measure of 69%. Adding a
disambiguation process based on popularity score does not improve a lot F-Measure
with 70% and 73%. Adding the entire disambiguation layer improves extraction
e ectiveness, traditional F-Measure rises to 86% and augmented F-Measure rises
to 90%.
5</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusions</title>
      <p>In this paper, we presented a NED process integrated in an OBIE system for
French using a rule-based approach. Our named entity disambiguation SVM
classi er is trained using syntactic and popularity score features. As our
evaluation shows, this method of extraction performed signi cantly better using the
entire disambiguation features.</p>
      <p>In our future work, we aim to incorporate more knowledge from LOD and we'll
integrate the application into a complete annotation pipeline for the Semantic
Web.
7 For the evaluation, we only use Person, Organization and Location named entity
categories.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bagga</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baldwin</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Entity-based cross-document coreferencing using the vector space model</article-title>
          .
          <source>In Proceedings of the 36th ACL and 17th ICCL</source>
          , Montreal, Canada,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Bizer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heath</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Berners-Lee</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Linked Data - The Story So Far</article-title>
          .
          <source>International Journal on Semantic Web and Information Systems</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Bollacker</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Evans</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paritosh</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sturge</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taylor</surname>
          </string-name>
          , J.:
          <article-title>Freebase: a collaboratively created graph database for structuring human knowledge</article-title>
          .
          <source>In Proceedings of the 2008 ACM SIGMOD</source>
          , Vancouver, Canada,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Bunescu</surname>
            ,
            <given-names>R. C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pasca</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>Using encyclopedic knowledge for named entity disambiguation</article-title>
          .
          <source>In EACL. ACL</source>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Chai</surname>
            ,
            <given-names>J. Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Biermann</surname>
            ,
            <given-names>A. W.:</given-names>
          </string-name>
          <article-title>The use of word sense disambiguation in an information extraction system</article-title>
          .
          <source>In Proceedings of the 16th National Conference on Arti cial Intelligence</source>
          , Orlando, USA,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Charton</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gagnon</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ozell</surname>
            .,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Automatic semantic web annotation of named entities</article-title>
          .
          <source>In Canadian Conference on AI, St. John's, Canada</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Ciravegna</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gentile</surname>
            ,
            <given-names>A. L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>LODIE: Linked Open Data for Web-scale Information Extraction</article-title>
          . SWAIE:
          <fpage>11</fpage>
          -
          <lpage>22</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Cucerzan</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Large-scale named entity disambiguation based on Wikipedia data</article-title>
          .
          <source>In Proceedings of the 2007 EMNLP and CoNLL</source>
          , Prague, Czech Republic,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Cunningham</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Maynard</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bontcheva</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tablan</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aswani</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roberts</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gorrell</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Funk</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Angus</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Damljanovic</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heitz</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Greenwood</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Saggion</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Petrak</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wim</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Text Processing with GATE (Version 6</article-title>
          ). University of She eld,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Using syntactic dependency as local context to resolve word sense ambiguity</article-title>
          .
          <source>In Proceedings of ACL '98</source>
          , Madrid, Spain,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Ho</surname>
            <given-names>art</given-names>
          </string-name>
          , J.,
          <string-name>
            <surname>Yosef</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bordino</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          , Furstenau, H.,
          <string-name>
            <surname>Pinkal</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Spaniol</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taneva</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thater</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weikum</surname>
          </string-name>
          , G.:
          <article-title>Robust disambiguation of named entities in text</article-title>
          .
          <source>In Proceedings of EMNLP '11</source>
          ,
          <string-name>
            <surname>Edinburgh</surname>
          </string-name>
          , United Kingdom,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Levin</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Lexical Semantics in Review. Lexicon Project Working Papers</article-title>
          . MIT Press,
          <year>1985</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Maynard</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peters</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Evaluating evaluation metrics for ontology-based applications: In nite re ection</article-title>
          .
          <source>In LREC, Marrakech, Morocco</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Nebhi</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Ontology-Based Information Extraction for French Newspaper Articles</article-title>
          .
          <source>In KI</source>
          <year>2012</year>
          , Saarbrucken, Germany,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Pilz</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paa</surname>
          </string-name>
          , G.:
          <article-title>Named entity resolution using automatically extracted semantic information</article-title>
          .
          <source>In KDML 2009 : Workshop on KDML</source>
          , pages
          <volume>84</volume>
          {
          <fpage>91</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Popov</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kiryakov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kirilov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Manov</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goranov</surname>
            ,
            <given-names>O. M.:</given-names>
          </string-name>
          <article-title>Semantic annotation platform</article-title>
          .
          <source>In Proceedings of the 2nd Intl. Semantic Web Conf</source>
          .
          <year>2003</year>
          , pages
          <fpage>834</fpage>
          {
          <fpage>849</fpage>
          . Springer,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17. Schutze, H.,
          <string-name>
            <surname>Pedersen</surname>
            ,
            <given-names>J. O.</given-names>
          </string-name>
          :
          <article-title>Information retrieval based on word senses</article-title>
          .
          <source>In Proceedings of the 4th Annual Symposium on Document Analysis and Information Retrieval</source>
          , pages
          <volume>161</volume>
          {
          <fpage>175</fpage>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Weaver</surname>
            , W.: Translation. In
            <given-names>W. N.</given-names>
          </string-name>
          <string-name>
            <surname>Locke</surname>
          </string-name>
          and A. D. Boothe, editors,
          <source>Machine Translation of Languages</source>
          , pages
          <volume>15</volume>
          {
          <fpage>23</fpage>
          . MIT Press, Cambridge, MA,
          <year>1949</year>
          /
          <year>1955</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Wehrli</surname>
          </string-name>
          , E.:
          <article-title>Fips, a deep linguistic multilingual parser:</article-title>
          <source>In ACL 2007 Workshop on deep Linguistic Processing</source>
          , Prague, Czech Republic,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Wilks</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brewster</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Natural language processing as a foundation of the semantic web</article-title>
          .
          <source>Foundations and Trends in Web Science</source>
          ,
          <volume>1</volume>
          (
          <issue>3</issue>
          -4):
          <volume>199</volume>
          {
          <fpage>327</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>