<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A framework for structured knowledge extraction and representation from natural language via deep sentence analysis</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Stefania Costantini</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Niva Florio</string-name>
          <email>niva.florio@univaq.it</email>
          <email>v@X</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alessio Paolucci</string-name>
          <email>u@X</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Dip. di Informatica, Universita` di L'Aquila</institution>
          ,
          <addr-line>Coppito 67100, L'Aquila</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>We present a framework that we are currently developing, that allows one to extract knowledge from natural language sentences using a deep analysis technique based on linguistic dependencies. The extracted knowledge is represented in OOLOT, an intermediate format that we have introduced, inspired by the Language of Thought (LOT) and based on Answer Set Programming (ASP). OOLOT uses an ontologyoriented lexicon and syntax. Therefore, it is possible to export the extracted knowledge into OWL and native ASP.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        Many intelligent systems have to deal with knowledge expressed in natural
language, either extracted from books, web pages and documents in general, or
expressed by human users. Knowledge acquisition from these sources is a
challenging matter, and many attempts are presently under way towards automatically
translating natural language sentences into an appropriate knowledge
representation formalism [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Although this task is a classic Artificial Intelligence challenge
(mainly related to Natural Language Processing and Knowledge Representation
[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]), with the Semantic Web growth new interesting scenarios are opening. The
Semantic Web aims at complementing the current text-based web with machine
interpretable semantics; however, the manual population of ontologies is very
tedious and time-consuming, and practically unrealistic at the web scale [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ].
Given the enormous amount of textual data that is available on the web, to
overcome the knowledge acquisition bottleneck, the ontology population task
must rely on the use of natural language processing techniques to extract
relevant information from the Web and transforming it into a machine-processable
representation.
      </p>
      <p>In this paper we present a framework that we are currently developing. It
allows one to extract knowledge from natural language sentences using a deep
analysis technique based on linguistic dependencies and phrase syntactic
structure. We also introduce OOLOT (Ontology-Oriented Language of Thought). It is
an intermediate language based on ASP, specifically designed for the
representation of the distinctive features of the knowledge extracted from natural language.
Since OOLOT is based on an ontology-oriented lexicon, our framework can be
easily integrated in the context of the Semantic Web.</p>
      <p>
        It is important to emphasize that the choice of ASP is a key point and
it is of fundamental relevance. In fact according to [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], this formalism is the
most appropriate one to deal with normative statements, default statements,
exceptions and many other characteristic aspects of knowledge encoded through
Natural Language.
      </p>
      <p>Though this is an ongoing work, we believe to be able to argue in favour of
the usefulness and the potential of the proposed approach.</p>
      <p>In particular, in Section 2 we introduce the framework architecture. In
Section 3 we analyze the state of the art for parsing and extraction of dependencies
taking into account our translation needs, taking into particular account three
kinds of parsers. Section 4 describes the context disambiguation and lexical item
resolution methods that we have devised. Section 5 introduces the intermediate
format OOLOT, while Section 6.3 describes the translation methodology with
the help of an example. Finally, Section 7 shows an exporting from OOLOT into
OWL example, and in Section 8 we conclude with a brief resume of achieved
goals and future works.
2</p>
    </sec>
    <sec id="sec-2">
      <title>FRAMEWORK ARCHITECTURE</title>
      <p>The proposed framework aims at allowing automatic knowledge extraction from
plain text, like a web page, producing a structured representation in OWL or
ASP as output. Thus, the framework can be seen as a standalone system, or can
be part of wider workflow, e.g. a component of complex semantic web
applications.</p>
      <p>Starting from plain text written in natural language, as first step we process
the sentence through a statistical parser (see Section 3). If we use a parser with
embedded dependency extractor, we can perform a single step and have as output
both the parse tree (constituents), and in the meantime the dependency graph.
Otherwise, if we use two different components, the workflow is that of Fig.1. In
this case, we use a simple algorithm for context disambiguation (see Section 4).
Then, each token is resolved w.r.t. popular ontologies including DBPedia and
OpenCYC and the context is used in case of multiple choices.</p>
      <p>
        At this point we have enough information to translate the knowledge
extracted from a natural language sentence into our intermediate OOLOT format.
OOLOT stands for “Ontological Oriented Language Of Though”, a language
mainly inspired by [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], that we have introduced as an intermediate
representation language for the extracted knowledge in a way that is totally
independent from the original lexical items, and therefore, from the original language.
OOLOT is itself a language, but its lexicon is ontology-based; it uses Answer
Set Programming as basic host environment that allows us to compose a native,
high expressive knowledge representation and reasoning environment. For the
translation process described in Section 6, we employ a -calculus engine that
drives the translation into OOLOT, using information about the deep structure
of the sentence extracted in the previous steps.
      </p>
      <p>From the ontology-based Language of Thought it is possible to directly
translate the encoded knowledge into OWL. In addition, it is also possible to export
the knowledge base in pure ASP.</p>
    </sec>
    <sec id="sec-3">
      <title>PARSING</title>
      <sec id="sec-3-1">
        <title>Background</title>
        <p>In informatics and linguistics, parsing is the process that can determine the
morpho-syntactic structure of a sentence; parsing associates a sentence expressed
in a natural language with a structure (e.g. a parse tree structure), that analyses
the sentence by a certain point of view; thus there are morphological parsing,
syntactic parsing, semantic parsing, etc.</p>
        <p>
          With regard to the syntactic parsing, analysis consists of a definition of the
phrases building up the sentence in their hierarchical order, likewise the
constituent analysis proposed by Chomsky [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. In the 1950s Noam Chomsky said
natural language sentences can be generated by a formal grammar [
          <xref ref-type="bibr" rid="ref7 ref8">8, 7</xref>
          ]; this is
the so-called generative approach, motivated by the fact that people are able to
generate sentences that are syntactically correct and totally new (i.e., that have
never been heard before). Syntactic parser decomposes a text into a sequence of
tokens (for example, words), and attributes them their grammatical functions
and thematic or logical roles, with respect to a given formal grammar. The task
of syntactic parser is to say if the sentence can be generated by the grammar
and, if so, it gives the appropriate sentence syntactic representation (called parse
tree), showing also the relations between the various elements of the sentence [8,
        </p>
        <p>
          Most of today’s syntactic parsers are mainly statistical [
          <xref ref-type="bibr" rid="ref10 ref11 ref12 ref9">9–12</xref>
          ]. They are based
on a corpus of training data that have been previously annotated (parsed) by
hand. This approach allows the system to gather information on the frequency
with which the various constructs are needed in specific contexts. A statistical
parser can use a search procedure on the space of all candidates, and it would
provide the probability of each candidate and makes it possible to derive the
most probable parse of a sentence.
        </p>
        <p>
          In the ’90, Collins proposes a conditional and generative model which
describes a straightforward decomposition of a lexicalized parse tree [
          <xref ref-type="bibr" rid="ref11 ref12">11, 12</xref>
          ] based
on a Probabilistic Context Free Grammar (PCFG). Charniak and Johnson’s
parser [
          <xref ref-type="bibr" rid="ref10 ref13">10, 13</xref>
          ] is based on a parsing algorithm for PCFG, but it is a
lexicalized N-Best PCFG parser: it is a generative and discriminative reranking parser
which uses the MaxEnt reranker to select the best among the possible parses.
The Berkeley parser uses an automatically induced PCFG, parsing sentences
with a hierarchical state-splitting. Only statistics is not enough to determine
when to split each symbol in sub-symbols [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]; thus [
          <xref ref-type="bibr" rid="ref14 ref15">14, 15</xref>
          ] present an
automatic approach for obtaining annotation trees through a split-merge method.
At a first stage, this parser considers a simple PCFG derived from a treebank,
but then it iteratively refines this grammar, in order to sub-categorize basic
symbols (like NP and VP) into sub-symbols. So non-terminal basic symbols are
split and merged in order to maximize the training treebank and to add a larger
number of annotations to the previous grammar. This parser can learn
automatically the type of linguistic distinction showed in the manually annotated
treebank and then it can create annotation trees thanks to a more complex and
complete grammar.
        </p>
        <p>
          Statistical parsing is useful to solve problems like ambiguity and efficiency,
but with this kind of parsing we lose part of the semantic information; this
aspect is recovered thanks to dependency representation [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ].
        </p>
        <p>
          Dependency grammars (DGs) were proposed by the French linguist Tesni`ere
[
          <xref ref-type="bibr" rid="ref17">17</xref>
          ] and have recently received renewed attention (cfr. [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] and the references
therein). In Dependency Grammars, words in a sentence are connected by means
of binary, asymmetrical governor-dependent relationships. In fact, Tesni`ere
assumes that each syntactic connection corresponds to a semantic relation. In a
sentence, the verb is seen as the highest level word, governing a set of
complements, which govern their own complements themselves. Opposed to the notion
of the sentence division into a subject and predicate, the grammatical subject in
Tesni`ere’s work is also considered subordinate to the verb. The valence of a verb
(its property of requiring certain elements in a sentence) determines the
structure of the sentence it occurs in. Tesni`ere distinguishes between actants, which
are required by the valence of the verb, and circonstants which are optional.
3.2
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>Parser analysis</title>
        <p>
          It is difficult to evaluate parsers; we can compare them in many ways, such as
the speed with which they examine a sentence or their accuracy in the analysis
(e.g. [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]). The task based evaluation seems to be the best one [16, ?]: we must
choose whether to use a parser rather than another simply basing on our needs.
At this stage of our ongoing research, we use the Stanford parser because it is
more suited to our requirements, both for the analysis of the constituents and
for that of the dependencies.
        </p>
        <p>
          Stanford parser performs a dependency and constituent analysis [
          <xref ref-type="bibr" rid="ref19 ref20">19, 20</xref>
          ]. This
parser provides us with different types of parsing. In fact, it can be used as an
unlexicalized PCFG parser [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ] to analyse sentences, or it can be used as a
lexicalized probabilistic parser [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]. Thanks to an A* algorithm, the second version
combines the PCFG analysis with the lexical dependency analysis. At the
moment the Stanford parser provides us a typed dependency and a phrase structure
tree. The Stanford typed dependencies (cfr. [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]) describe the grammatical
relations in a sentence. The relations are binary and are arranged hierarchically; as
Tesni`ere suggested, Stanford dependency relations have a head and its
dependent but, unlike Tesni`ere, the head of a dependency can be any content words,
not only verbs. Thanks to rules [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ] applied on phrase structure trees (also
created by the Stanford parser), typed dependencies are generated.
In particular, for constituent analysis, we choose to analyse the sentence "Many
girls eat apples.". Seeing Fig.2, we can notice that the parser attributes to each
token its syntactic roles, and it provides us also the grammatical function of each
word. In order to better understand the hierarchy of the syntactic structure is
useful to represent it as a tree (Fig.3).
        </p>
        <p>(ROOT
(S
(NP (JJ Many) (NNS girls))
(VP (VBP eat)
(NP (NNS apples)))
(. .)))</p>
        <p>
          With regard to dependency analysis, the Stanford parser gives us two versions
of this analysis: the typed dependency structure (Fig.4) and the collapsed typed
dependency structure (Fig.5). In the first, each node of the sentence is a node
connected with a binary relation to another node; in the second, prepositions
are turned into relations (unfortunately, in this example, you may not notice the
difference). Thus, Fig.4 and Fig.5 show us that girls and many are connected
with an amod relation, that means an adjective phrase modifies the meaning of
the noun phrase; eat is connected to girls with a nsubj relation, where the noun
phrase is the syntactical subject of the verb; eat and apple are connected with
a dobj relation because the direct object of the verb phrase is the object of the
verb [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ].
        </p>
        <p>As reference for the following steps, we use the Stanford syntactic phrase
structure (Fig.2) and the Stanford collapsed typed dependencies structure (Fig.5).</p>
        <p>NNS</p>
        <p>VBP</p>
        <p>NP</p>
        <p>NNS
Many
girls
eat
apples</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>CONTEXT DISAMBIGUATION AND LEXICAL</title>
    </sec>
    <sec id="sec-5">
      <title>ITEM RESOLUTION</title>
      <p>
        The context disambiguation task is a very important step in our work flow, as we
need to assign each lexical unit to the correct meaning, and this is particularly
hard due to the polysemy. For this task, we use a simple algorithm: we have a
finite set of contexts (political, technology, sport, ...), and as first step we built
a corpus of web pages for each context, and then we used each set as a training
set to build a simple lexical model. Basically we build a matrix where for each
row we have a lexical item, and for each column we have a context. The relation
(lexical item, context) is the normalized frequency of each lexical item into the
given context. The model is then used to assign the correct context to a given
sentence. We use a n m matrix, where n is the number of lexical tokens
(or items), and m is the number of contexts. In other words, we give a score
for each lexical token in relation to each context. To obtain the final score we
perform a simple sum of the local values to obtain the global score, and thus
assign the final context to the sentence. Our method for context disambiguation
can certainly be improved, in particular [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ], seems to be a good development
direction.
      </p>
      <p>The context is crucial to choose the correct reference when a lexical item
has multiple meanings, and thus, in an ontology can be resolved in multiple
references. The context becomes the key factor for the resolution of each lexical
item to the relative reference.</p>
      <p>We perform a lookup in the ontology for each token, or a set of them (using a
sliding window of length k). For example, using DBPedia, for each token (or a set
of tokens of length k), we perform a SPARQL query, assuming the existence of a
reference to the lexical item: if this is true, we’ve found the reference, otherwise
we go forward. If we found multiple references, we use the context to choose the
most appropriate one.</p>
      <p>Given the sentence Many girls eat apples and it’s syntactic phrase structure
(Fig.2) and dependencies structure (Fig.5), as first step we tokenize the sentence,
obtaining:</p>
      <p>Many, girls, eat, apples.</p>
      <p>Before the lookup, we use the part of speech tagging from the parse tree to
group the consecutive tokens that belong to the same class. In this case, such
peculiar aspect of natural language is not present and thus the result is simply
the following:
(Many ), (girls), (eat ), (apples).</p>
      <p>Excluding the lexicon for which we have a direct form, for each other lexicon
the reference ontology is resolved through a full text lookup; thus we obtain the
lexical item resolution in Table 1.
girls
eat
apples
5</p>
    </sec>
    <sec id="sec-6">
      <title>FROM</title>
    </sec>
    <sec id="sec-7">
      <title>OOLOT</title>
    </sec>
    <sec id="sec-8">
      <title>THE LANGUAGE OF THOUGHT TO</title>
      <p>
        The Language of Thought (LOT) is an intermediate format mainly inspired by
[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. It has been introduced to represent the extracted knowledge in a way that
is totally independent from original lexical items and, therefore, from original
language.
      </p>
      <p>Our LOT is itself a language, but its lexicon is ontology oriented, so we
adopted the acronym OOLOT (Ontology-Oriented Language Of Thought). This
is a very important aspect: OOLOT is used to represent the knowledge extracted
from natural language sentences, so basically the bricks of OOLOT (lexicons)
are ontological identifier related to concepts (in the ontology), and they are not
a translation at lexical level.</p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ] a translation methodology from natural language sentences into ASP
that takes into accounts all words of the sentence is presented. In this method,
the final representation is itself dependent from the original lexical structure,
and this is sub-optimal if we want to export our knowledge base into a more
general formalism like, e.g., OWL.
      </p>
      <p>In the next section we present the translation process.
6
6.1</p>
    </sec>
    <sec id="sec-9">
      <title>TRANSLATING INTO OOLOT</title>
      <sec id="sec-9-1">
        <title>Background</title>
        <p>
          [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ] describes a technique for extracting knowledge from natural language and
automatically translate it into ASP. To achieve this result we built an extension
of -calculus and we have introduced meta expressions to fully automate the
translation process, originally inspired by [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ].
        </p>
        <p>
          This is a key point in the process of representing knowledge extracted from
natural language, and thus for using it into other contexts, e.g. the Semantic
Web. The selection of a suitable formalism plays an important role: in fact,
though under many aspects first-order logic would represent a natural choice,
it is actually not appropriate for expressing various kinds of knowledge, i.e.
for dealing with default statements, normative statements with exceptions, etc.
Recent work has investigated the usability of non-monotonic logics, like ASP
[
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] with encouraging results in terms of dealing with the kind of knowledge
represented through natural language.
        </p>
        <p>OOLOT allows us to have native reasoning capabilities (using ASP) to
support the syntactic and semantic analysis tasks. Embedded reasoning is of
fundamental importance for the correct analysis of complex sentences, as shown in
[27]. The advantages of adopting an ASP-based host language is not limited to
the previous aspects: in fact, the integration of ASP and the Semantic Web is not
limited to the Natural Language Processing side. Answer Set Programming fits
very well with Semantic Web as demonstrated by the recent research efforts of
integrating rule-based inference methods with current knowledge representation
formalisms in the Semantic Web [28, 29].</p>
        <p>
          Ontology languages such as OWL and RDF Schema are widely accepted and
successfully used for semantically enriching knowledge on the Web. However,
these languages have a restricted expressivity if we have to infer new knowledge
from existing one. Semantic Web needs a powerful rule language complementing
its ontology formalisms in order to facilitate sophisticated reasoning tasks. To
overcome this gap, different approaches have been presented on how to combine
Description Logics with rules, like in [29].
-calculus is a formal system designed to investigate function definition,
function application and recursion. Any computable function can be expressed and
evaluated via this formalism [30]. In [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ] we extended the -calculus
introducing the -ASP -ExpressionT that allows a native support for ASP, and, at the
same time, permits to formally instantiated to -ASP-Expression [
          <xref ref-type="bibr" rid="ref26 ref5">5, 26</xref>
          ]. For the
purpose of our running example, the set of -ASP -ExpressionT is available in
Table 2.
        </p>
        <p>In the following subsection, we illustrate the translation process based on
-calculus. It is important to note that the choice of the lambda calculus was
made because it fully matches the specifications of the formal tool we need to
drive the execution of the steps in the right way.
According to the workflow in Fig.1, the translation from plain text to the
OOLOT intermediate format makes use of the information extracted in several
steps. The information on the deep structure of the sentence is now used to drive
the translation using the -calculus according to the - expression definitions in
Table2.
many
det</p>
        <p>For each lexicon, we use the phrase structure in Fig.2 to determine the
semantic class to which it belongs. In this way, we are able to fetch the correct
lambda-ASP-expression template from the Table 2.</p>
        <p>For the running example, as result we have the -ASP-expressions of Table
3.
many
x:dbpedia : Apple(x)
y w:dbpedia : Eating(y; w)
z:dbpedia : Girl(x)</p>
        <p>
          Now, differently from [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ], we use the dependencies, that is, we use the deep
structure information, to drive the translation.
        </p>
        <p>According to the dependency in Fig.5, the first relation that we use is amod(girls
2; many 1). Thus, for the -calculus definition, we apply the -ASP-expression
for girls to the -ASP-expression for many:
obtaining:
)</p>
        <p>The second relation, nsubj(eat 3; girls 2), drives the application of the
-expression for eat to the expression for girls that we obtained in the previous
step:
that reduces to:
dbpedia : Eating(X; W ) dbpedia : Girl(X);
not :dbpedia : Eating(X; W );
possible(dbpedia : Eating(X; W ); dbpedia : Girl(X));
usual(dbpedia : Eating(X; W ); dbpedia : Girl(X))</p>
        <p>Then, we apply apple to the expression we have seen, obtaining the final
result:
dbpedia : Eating(X; dbpedia : Apple) dbpedia : Girl(X);
not :dbpedia : Eating(X; dbpedia : Apple);
possible(dbpedia : Eating(X; dbpedia : Apple); dbpedia : Girl(X));
usual(dbpedia : Eating(X; dbpedia : Apple); dbpedia : Girl(X))
7</p>
      </sec>
    </sec>
    <sec id="sec-10">
      <title>EXPORTING INTO OWL</title>
      <p>Our framework has been designed to export the knowledge base from OOLOT
into a target formalism. For now, we are working on a pure ASP and OWL
exporter.</p>
      <p>Exporting into OWL is a very important feature, because it allows endless
possibilities due to its native Semantic Web integration. In this way, the
framework as a whole becomes a power tool that starting from plain text produces
the RDF/OWL representation of the sentences; through ASP it takes care of
special reasoning and representation features of natural language.</p>
      <p>To complete the example, the resulting RDF representation is:</p>
      <p>The export into OWL has, at the present stage, some drawbacks, including
loosing of some aspects of natural language that instead are perfectly managed
in OOLOT. The export procedure is ongoing work, so there is room for
improvement. Due to the nature of the problem, that is very complex, these aspects will
be the subject of a future work.
8</p>
    </sec>
    <sec id="sec-11">
      <title>CONCLUSION</title>
      <p>In this paper, we have proposed a comprehensive framework for extracting
knowledge from natural language and representing the extracted knowledge in suitable
formalisms so as to be able to reason about it and to enrich existing knowledge
bases. The proposed framework is being developed and an implementation is
under way and will be fully available in short time. The proposed approach
incorporates the best aspects and results from previous related works and,
although in the early stages, it exhibits a good potential and its prospects for
future development are in our opinion really interesting.</p>
      <p>Future improvements concern many aspects of the framework. First of all, we
have to choose the best parser or establish how to combine the best aspects of
all them. On the OOLOT side, there is the need to better formalize the language
itself, and better investigate the reasoning capabilities that it allows, and how
to take the best advantage from them.</p>
      <p>The ontology-oriented integration is at a very early stage, and there is room
for substantial improvements, including a better usage of the current reference
ontologies, and the evaluation study about using an upper level ontology, in
order to have a more homogeneous translation.
27. Costantini, S., Paolucci, A.: Semantically augmented DCG analysis for
nextgeneration search engine. CILC (July 2008) (2008)
28. Eiter, T.: Answer set programming for the semantic web. Logic Programming
(2010) 23–26
29. Schindlauer, R.: Answer-set programming for the Semantic Web (2006)
30. Church, A.: A set of postulates for the foundation of logic. The Annals of
Mathematics 33(2) (1932) 346–366</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bos</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Markert</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Recognising textual entailment with logical inference</article-title>
          .
          <source>In: HLT '05: Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing</source>
          , Association for Computational Linguistics (
          <year>2005</year>
          )
          <fpage>628</fpage>
          -
          <lpage>635</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Pereira</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shieber</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Prolog and natural-language analysis</article-title>
          .
          <source>Microtome Publishing</source>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Auer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bizer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kobilarov</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cyganiak</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ives</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>Dbpedia: A nucleus for a web of open data</article-title>
          .
          <source>The Semantic Web</source>
          (
          <year>2007</year>
          )
          <fpage>722</fpage>
          -
          <lpage>735</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Kasneci</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ramanath</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suchanek</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weikum</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>The YAGO-NAGA approach to knowledge discovery</article-title>
          .
          <source>SIGMOD Record</source>
          <volume>37</volume>
          (
          <issue>4</issue>
          ) (
          <year>2008</year>
          )
          <fpage>41</fpage>
          -
          <lpage>47</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Baral</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dzifcak</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Son</surname>
          </string-name>
          , T.C.:
          <article-title>Using answer set programming and lambda calculus to characterize natural language sentences with normatives and exceptions</article-title>
          .
          <source>In: Proceedings of the 23rd national conference on Artificial intelligence -</source>
          Volume
          <volume>2</volume>
          , AAAI Press (
          <year>2008</year>
          )
          <fpage>818</fpage>
          -
          <lpage>823</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Kowalski</surname>
          </string-name>
          , R.:
          <article-title>Computational Logic and Human Thinking: How to be Artificially Intelligent</article-title>
          - In Press
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Chomsky</surname>
            ,
            <given-names>N.: Syntactic</given-names>
          </string-name>
          <string-name>
            <surname>Structures</surname>
          </string-name>
          . The MIT Press (
          <year>1957</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Chomsky</surname>
          </string-name>
          , N.:
          <article-title>Three models for the description of language</article-title>
          .
          <source>IEEE Transactions on Information Theory</source>
          <volume>2</volume>
          (
          <issue>3</issue>
          ) (
          <year>1956</year>
          )
          <fpage>113</fpage>
          -
          <lpage>124</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Charniak</surname>
          </string-name>
          , E.:
          <article-title>Tree-bank grammars</article-title>
          .
          <source>In: Proceedings of the National Conference on Artificial Intelligence</source>
          . (
          <year>1996</year>
          )
          <fpage>1031</fpage>
          -
          <lpage>1036</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Charniak</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Johnson</surname>
          </string-name>
          , M.:
          <article-title>Coarse-to-fine n-best parsing and maxent discriminative reranking</article-title>
          .
          <source>In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics</source>
          , Association for Computational Linguistics (
          <year>2005</year>
          )
          <fpage>173</fpage>
          -
          <lpage>180</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11. Collins,
          <string-name>
            <surname>M.:</surname>
          </string-name>
          <article-title>A new statistical parser based on bigram lexical dependencies</article-title>
          .
          <source>In: Proceedings of the 34th annual meeting on Association for Computational Linguistics</source>
          , Association for Computational Linguistics (
          <year>1996</year>
          )
          <fpage>184</fpage>
          -
          <lpage>191</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12. Collins,
          <string-name>
            <surname>M.</surname>
          </string-name>
          :
          <article-title>Three generative, lexicalised models for statistical parsing</article-title>
          . In:
          <article-title>Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics, Association for Computational Linguistics (</article-title>
          <year>1997</year>
          )
          <fpage>16</fpage>
          -
          <lpage>23</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>McClosky</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Charniak</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Johnson</surname>
          </string-name>
          , M.:
          <article-title>Effective self-training for parsing</article-title>
          .
          <source>In: Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics</source>
          , Association for Computational Linguistics (
          <year>2006</year>
          )
          <fpage>152</fpage>
          -
          <lpage>159</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Petrov</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barrett</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thibaux</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Klein</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Learning accurate, compact, and interpretable tree annotation</article-title>
          .
          <source>In: Proceedings of the 21st International Conference on Computational Linguistics</source>
          and
          <article-title>the 44th annual meeting of the Association for Computational Linguistics, Association for Computational Linguistics (</article-title>
          <year>2006</year>
          )
          <fpage>433</fpage>
          -
          <lpage>440</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Petrov</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Klein</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Improved inference for unlexicalized parsing</article-title>
          .
          <source>In: Proceedings of NAACL HLT</source>
          <year>2007</year>
          .
          <article-title>(</article-title>
          <year>2007</year>
          )
          <fpage>404</fpage>
          -
          <lpage>411</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16. de Marneffe,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Manning</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          :
          <article-title>The stanford typed dependencies representation</article-title>
          .
          <source>In: Coling 2008: Proceedings of the workshop on Cross-Framework</source>
          and
          <article-title>Cross-Domain Parser Evaluation, Association for Computational Linguistics (</article-title>
          <year>2008</year>
          )
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17. Tesni`ere, L.:
          <article-title>El`ements de syntaxe structurale</article-title>
          .
          <source>Klincksieck</source>
          , Paris (
          <year>1959</year>
          )
          <article-title>ISBN 2252018615</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Neuhaus</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , Bro¨ker, N.:
          <article-title>The complexity of recognition of linguistically adequate dependency grammars</article-title>
          .
          <source>In: Proc. of ACL-97/EACL-97</source>
          . (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Klein</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Manning</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Accurate unlexicalized parsing</article-title>
          .
          <source>In: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics-Volume</source>
          <volume>1</volume>
          , Association for Computational Linguistics (
          <year>2003</year>
          )
          <fpage>423</fpage>
          -
          <lpage>430</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Klein</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Manning</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Fast exact inference with a factored model for natural language parsing</article-title>
          .
          <source>Advances in neural information processing systems</source>
          (
          <year>2003</year>
          )
          <fpage>3</fpage>
          -
          <lpage>10</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>De Marneffe</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>MacCartney</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Manning</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Generating typed dependency parses from phrase structure parses</article-title>
          .
          <source>In: Proceedings of LREC. Volume</source>
          <volume>6</volume>
          .,
          <string-name>
            <surname>Citeseer</surname>
          </string-name>
          (
          <year>2006</year>
          )
          <fpage>449</fpage>
          -
          <lpage>454</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Sleator</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Temperley</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Parsing english with a link grammar</article-title>
          .
          <source>Arxiv preprint cmp-lg/9508004</source>
          (
          <year>1995</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Cer</surname>
            , D., de Marneffe,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jurafsky</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Manning</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Parsing to stanford dependencies: Trade-offs between speed and accuracy</article-title>
          .
          <source>LREC</source>
          <year>2010</year>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>24. : Index to link grammar documentation http://www.link.cs.cmu.edu/link/ dict/.</mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Banerjee</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pedersen</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>An adapted lesk algorithm for word sense disambiguation using wordnet</article-title>
          .
          <source>Computational Linguistics and Intelligent Text Processing</source>
          (
          <year>2002</year>
          )
          <fpage>117</fpage>
          -
          <lpage>171</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Costantini</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paolucci</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Towards translating natural language sentences into asp</article-title>
          .
          <source>In: Proc. of the Intl. Worksh. on Answer Set Programming and Other Computing Paradigms (ASPOCP)</source>
          ,
          <year>Edimburgh</year>
          . (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>