<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>An Italian Question Answering System Based on Grammars Automatically Generated from Ontology Lexica</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Gennaro Nolano</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mohammad Fazleh Elahi</string-name>
          <email>melahi@techfak.uni-bielefeld.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maria Pia di Buono</string-name>
          <email>mpdibuono@unior.it</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Basil Ell</string-name>
          <email>bell@techfak.uni-bielefeld.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Philipp Cimiano</string-name>
          <email>cimiano@techfak.uni-bielefeld.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>. Cognitive Interaction Technology Center, Bielefeld University</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>. Department of Informatics, University of Oslo</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>. UniOr NLP Research Group, University of Naples ”L'Orientale”</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The paper presents an Italian question answering system over linked data. We use a model-based approach to question answering based on an ontology lexicon in lemon format. The system exploits an automatically generated lexicalized grammar that can then be used to interpret and transform questions into SPARQL queries. We apply the approach for the Italian language and implement a question answering system that can answer more than 1.6 million questions over the DBpedia knowledge graph.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>As the amount of linked data published on the Web
keeps increasing, there is an expanding demand
for multilingual tools and user interfaces that
simplify the access and browsing of data by end-users,
so that information can be explored in an intuitive
way. This need is what motivated the
development of tools such as Question Answering (QA)
systems, whose main aim is to make users be able
to explore complex datasets and an ever growing
amount of data in an intuitive way, through natural
language.</p>
      <p>
        While the default approach for many NLP tasks
has recently been represented by machine learning
systems, the use of such approaches
        <xref ref-type="bibr" rid="ref4">(Chakraborty
et al., 2019)</xref>
        for QA over RDF data suffers from
lack of controllability, making the governance and
incremental improvement of the system
challenging, not to mention the initial effort of collecting
and providing training data for a specific language.
      </p>
      <p>An alternative is the so-called model-based
approach to QA, in which a model is first used to</p>
      <p>
        Copyright © 2021 for this paper by its authors. Use
permitted under Creative Commons License Attribution 4.0
International (CC BY 4.0).
specify how concepts and relations are realized
in natural language, and then this specification is
employed to interpret questions from users. One
such system is the one proposed by
        <xref ref-type="bibr" rid="ref1">(Benz et al.,
2020)</xref>
        , which makes use of a lexicon in lemon
format
        <xref ref-type="bibr" rid="ref14 ref5">(McCrae et al., 2011)</xref>
        to specify how the
vocabulary elements of an ontology or knowledge
graph (e.g., entities and relations from a
Knowledge Graph) are realized in natural language.
      </p>
      <p>
        The previous work on this approach shows how,
leveraging on lemon lexica, question answering
grammars can be automatically generated, and
those can, in turn, be used to interpret questions
and then parse them into SPARQL queries. A
QA web application developed in previous work
        <xref ref-type="bibr" rid="ref7 ref8">(Elahi et al., 2021)</xref>
        has further shown that such QA
systems can scale to large numbers of questions
and that the performance of the system is
practically real-time from an end-user perspective.
      </p>
      <p>
        In this work we describe the extension to the
Italian language of the model-based approach
described in
        <xref ref-type="bibr" rid="ref1">(Benz et al., 2020)</xref>
        and the QA
system described in
        <xref ref-type="bibr" rid="ref7 ref8">(Elahi et al., 2021)</xref>
        . By doing
so, we develop a QA system that can answer more
than 1.6 million Italian questions over the
DBpedia knowledge graph1.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        Besides the goal of creating QA systems that are
robust and have high performance, an important
goal is also to develop systems that can be ported
to languages other than English. The interest in
other languages is, for example, explicitly stated in
the Multiple Language Question Answering Track
at CLEF 2003
        <xref ref-type="bibr" rid="ref11 ref20">(Magnini et al., 2004)</xref>
        , that includes
Italian among others.
      </p>
      <p>
        One of the earlier attempts in this regard has
been the DIOGENE model
        <xref ref-type="bibr" rid="ref10 ref20">(Magnini et al., 2002;
Tanev et al., 2004)</xref>
        , which exploits linguistic
tem1https://www.dbpedia.org/
plates and keyword recognition to answer
questions over document collections. Other efforts
have been made in the QALL-ME project
        <xref ref-type="bibr" rid="ref2 ref21 ref3">(Cabrio
et al., 2007; Cabrio et al., 2008; O´scar Ferra´ndez
et al., 2011)</xref>
        , where a system was created for
the tourism domain through an instance-based
method, that is by clustering together similar
question-answer pairs.
      </p>
      <p>
        More recently, the QuASIt model
        <xref ref-type="bibr" rid="ref16">(Pipitone et
al., 2016)</xref>
        , makes use of the Construction
Grammar and an abstraction of cognitive processes to
account for the inherent fluidity of language, while
exploiting linguistic and domain knowledge (in
the form of an ontology) to answer essay and
multiple choice questions. Similarly, the authors of
        <xref ref-type="bibr" rid="ref9">(Leoni et al., 2020)</xref>
        built a system to answer
questions regarding a specific domain using IBM
Watson services and online articles as source of
information.
      </p>
      <p>
        These kind of systems, built to answer
questions using textual information, have been largely
growing in recent years, especially since the
availability of large QA datasets such as the Stanford
Question Answering Dataset (SQuAD)2, which
allows to train complex deep learning models with
millions of parameters
        <xref ref-type="bibr" rid="ref17 ref18">(Rajpurkar et al., 2016;
Rajpurkar et al., 2018)</xref>
        . While the performance
shown by these models is impressive, they suffer
from major drawbacks: first of all, they need an
extremely large dataset to be trained on, making
the porting of such a system to another language
extremely demanding;3 furthermore, they show a
lack of controllability in the sense that it is
unclear which new examples are to be added to make
a new question answerable. This makes systems
opaque and difficult to maintain.
      </p>
      <p>
        The MULIB system
        <xref ref-type="bibr" rid="ref19">(Siciliani et al., 2019)</xref>
        tackles the problem of answering questions in Italian
over structured data. The system is based on a
modified version of the automaton developed for
CANaLI
        <xref ref-type="bibr" rid="ref13 ref16 ref17">(Mazzeo and Zaniolo, 2016)</xref>
        , but it
employs a Word2Vec model
        <xref ref-type="bibr" rid="ref15">(Mikolov et al., 2013)</xref>
        to allow for more flexibility in language use. In
contrast to these trained approaches, we present a
model that generates (i) a deeper interconnection
of semantic and syntactic information through the
integration of a lemon lexicon with the DBpedia
ontology, and (ii) the focus on Linked Open Data
2https://rajpurkar.github.io/SQuAD-ex
plorer/
      </p>
      <p>3The Italian translation for SQuAD, for example, has been
described in Croce et al. (2018)
as a source of knowledge.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Methodology</title>
      <p>
        The architecture consists of two components: (i)
the grammar generator and (ii) the QA component.
The approach to grammar generation for different
syntactic frames according to LexInfo
        <xref ref-type="bibr" rid="ref5">(Cimiano et
al., 2011)</xref>
        for the English language was described
in a previous work
        <xref ref-type="bibr" rid="ref1">(Benz et al., 2020)</xref>
        . In this
paper we show that, through a simple language
adaptation, we are able to adjust the system so that the
system also accepts questions in Italian language.
      </p>
      <p>In a nutshell, the grammar generation approach
relies on a mapping between syntactic
constructions and classes and properties from a given
ontology and/or knowledge graph. This generation
process makes use of several frames, each
describing the linguistic realizations of specific properties
that might appear in questions. Thus, the frames
employed in this work are: NounPPFrame,
TransitiveFrame, IntransitivePPFrame,
AdjectiveAttributive and AdjectiveGradable.</p>
      <p>For example, the (lexicalized) construction for
the NounPPFrame ‘the capital of X’, can be
regarded as expressing the DBpedia property
dbo:capital, with Country as domain and
City as range. This would lead to the generation
of the following questions:
• What is the capital of X (Country)?
• Which city is the capital of X (Country)?
Similar grammar generation rules exist for
transitive constructions (TransitiveFrame) as well as
constructions involving an intransitive verb with a
prepositional complement (IntransitivePPFrame)
as well as adjective constructions in attributive
(AdjectiveAttributive) and predicate form
(AdjectiveGradable).</p>
      <p>In the context of this work, we adapted the
generation of rules to the Italian language, without
extending or modifying the existing types of
constructions4.</p>
      <p>In adapting the grammar generation to
Italian, we had to accommodate for the following
language-specific properties:
• Sentence order, e.g., in sentence starting with
interrogative pronouns the subject has to be
4The code for our grammar generation for Italian is
available at https://github.com/fazleh2010/ques
tion-grammar-generator
placed at the end of the sentence, e.g., Dove
si trova Vienna? (Where is Vienna?)
• The presence of auxiliary verbs, either avere
(have) or essere (be), in compound tenses;
• Interrogative pronoun rules, e.g., chi (who) is
invariable and refers only to people;
• The use of interrogative adjectives, e.g.,
quale (which);
• The use of different prepositions, either
simple or articulated, on the basis of
range/domain semantics (e.g., toponyms
might require different prepositions);
• The presence of a determiner/articulated
preposition on the basis of range/domain
semantics (e.g., toponyms are preceded by a
determiner when the noun refers to a country).
Consider the lemon lexical entry in Figure 15 for
the relational noun ‘capitale della’. The entry
states that the canonical written form of the
entry is “capitale”. It states that the entry has
a NounPPFrame as syntactic behaviour, that is
it corresponds to a copulative construction X e`
5In this paper we abbreviate URIs with the namespace
prefixes dbo, dbp, lemon, and lexinfo which can be
expanded into http://dbpedia.org/ontology/,
http://dbpedia.org/property/,
https://lemon-model.net/lemon#, and
http://www.lexinfo.net/ontology/2.0/lexinfo#,
respectively.
la capitale della Y with two arguments, where
copulativeArg corresponds to the copula
subject X and the prepositional adjunct corresponds
to the prepositional object Y.</p>
      <p>We give examples for the different syntactic
frames below to illustrate the behaviour of the
Italian grammar generation.</p>
      <p>NounPPFrame Assuming that in the
corresponding lemon lexicon we model the
connection between the NounPP construction capitale
della (capital of) as referring to the property
dbo:capital with domain Country and range
City, we can generate questions automatically
such as:
1. Qual e` la capitale della (What is the capital
of) (X—Country NP)?
2. Quale citta` e` la capitale della (Which city is
the capital of) (X—Country NP)?
where X is a placeholder allowing to fill in a
particular country, e.g. Germania (Germany), or a
noun phrase, e.g., paese dove si parla tedesco (the
country where German is spoken).</p>
      <p>TransitiveFrame Assuming that the lemon
lexicon captures the meaning of the construction X
‘scrive’ (write) Y as referring to the property
dbp:author, with Song as domain and Person
as range, the following questions would then be
covered by an automatically generated grammar:
1. Chi ha scritto (Who wrote) (X—Song NP)?
2. Quale cantante ha scritto (Which singer
wrote) (X—Song NP)?
3. Quale (Which) (X—Song NP) e` stata scritta
da (was written by) (Y—Person NP)?
IntransitivePPFrame Assuming that the lemon
lexicon captures the meaning of the construction
‘X pubblicare nel Y’ (‘X published in Y’) as
representation of the property dbp:published, with
Song as its domain and Date as its range, the
following questions would be generated:
1. Quando e` stata pubblicata (X—Song NP)?
(When was (X—Song NP) published?),
2. Quale (X—Song NP) e` stata pubblicata nel
(Y—date)? (Which (X—Song NP) was
published in (Y—date)?
3. In quale data e` stata pubblicata (In which
date was) (X—Song NP)?
AdjectiveAttributive
AdjectiveGradable
Transitive
IntransitivePP</p>
      <p>Syntactic Pattern
WDT/WP V* DT [noun] IN DT
[domain]
WDT dbo:range V* DT [noun] IN
[domain]?
WDT/WP V* DT [noun] in [domain]
[range] V* DT [noun] IN (DT)
[domain]
WDT V* DT dbo:range [adjective]
[domain] VB (DT) [adjective]
WRB V* [adjective] DT [domain]
WDT V* DT [domain] JJS IN (DT)
[range]
WP V* [domain]
WDT dbo:range V* [domain]
WP V* DT [domain]
WDT dbo:range V* DT [domain]
[domain] V* [range]
WRB VB [domain]
IN WDT dbo:domain VB [range]
WDT dbo:domain VB IN [range]
[domain] V* IN [range]
Question Sample
Qual e` la capitale della Germania?
Quale citta` e` la capitale della Germania?
Chi era la moglie di Abraham Lincoln?
Rita Wilson e` la moglie di Tom Hanks?
Chi era un vescovo cristiano spagnolo?
Barack Obama e` un democratico?
Quanto e` lungo il Barguzin?
Qual e` la montagna pi u` alta della Germania?
Chi ha scritto Ziggy Stardust?
Quale cantante ha scritto Ziggy Stardust?
Chi ha fondato C&amp;A?
Quale persona ha fondato C&amp;A?
Socrate ha influenzato Aristotele?
Quando e` iniziata l’operazione Overlord?
In quale data e` iniziata l’operazione Overlord?
Quale libro e` stato pubblicato nel 1563?</p>
      <p>Il libro dei martiri di Foxe e` stato pubblicato nel 1563?</p>
      <sec id="sec-3-1">
        <title>AdjectiveAttributive and AdjectiveGradable</title>
        <p>Assuming that the lemon lexicon would capture
the meaning of the (gradable) adjective lungo
(long) as referring to the ontological property
dpb:length, the grammar generation approach
would generate the following types of questions:
1. Quanto e` lungo il (How long is the)
(X—River NP)?
2. Qual e` il fiume pi u` lungo (del mondo, del
Kentucky)? (What is the longest river in (the
world, Kentucky)?).</p>
        <p>
          The rules implemented for the generation of
Italian questions are shown in further detail in
Table 1. In particular, we use the tagset6
from the Penn Treebank Project
          <xref ref-type="bibr" rid="ref12">(Marcus et
al., 1993)</xref>
          , with V* defining all possible forms
of a given verb, words in brackets defining
6https://www.sketchengine.eu/englishtreetagger-pipeline-2/
nouns/verbs/adjectives that realize a specific
property, and dbo:range/dbo:domain defining
the possible labels that may represent classes (e.g.,
dbo:Country might be represented by either
paese or stato).
4
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Results</title>
      <p>We apply our system to the DBpedia dataset and
manually created a lemon lexicon comprising of
249 lexical entries7. Table 2 shows the number of
grammar rules and questions generated for each
syntactic type. Altogether, the approach generates
620 grammar rules and about 1.6 million
questions. The web-based demonstration is available
online8.</p>
      <p>We used the training set of multilingual
QALD7https://scdemo.techfak.uni-bielefeld
.de/quegg-resources/</p>
      <p>8https://webtentacle1.techfak.uni-bie
lefeld.de/quegg/
79 to evaluate our approach. QALD-7 contains
a total of 214 questions over linked data,
covering for more relations than the ones we
considered so far. In order to overcome this issue, a
total of 109 entries were added to our system (22
NounPPFrame, 41 TransitiveFrame, 41
IntransitiveFrame, 1 AdjectiveAttributiveFrame and 4
AdjectiveGradable).</p>
      <p>Precision
Recall</p>
      <sec id="sec-4-1">
        <title>F-Measure</title>
        <p>The results of the evaluation process (Table 3)
show a quite satisfying precision, but a low recall.
The main reason behind such results is related
to the presence of different types of questions in
QALD. Indeed, besides single-triple questions,
QALD presents also complex questions referring
to more than one triple, e.g., A quale movimento
artistico apparteneva il pittore de I tre ballerini?
(What was the artistic movement of the author
of The Three Dancers?), which are not covered
yet by our model. Nevertheless, when taking into
account all the questions in QALD-7, our system
recognizes 46.98% (101 questions) of the total set
of questions.
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusion and Future Work</title>
      <p>We presented an approach to developing Italian
QA systems over linked data that relies on the
automatic generation of grammars from
corresponding lemon lexica describing how elements of the
dataset are realized in natural language. The
approach is controllable, since the introduction of
a lexical entry increases the question coverage
in a fully predictable way. Our proof-of-concept
implementation over DBpedia covers 1.6 million
questions generated from 249 lemon entries.</p>
      <p>
        In future work, we intend to further automatize
grammar generation by using LexExMachina
        <xref ref-type="bibr" rid="ref7 ref8">(Ell
et al., 2021)</xref>
        , which induces lexicon entries
bridging the gap between ontology and natural language
from a corpus in an unsupervised manner.
Acknowledgments This work has been funded
by the European Commission under grant 825182
(Preˆt-a`-LLOD) as well as Nexus Linguarum Cost
9https://github.com/ag-sc/QALD
Action. M.P. di Buono has been partially
supported by Programma Operativo Nazionale
Ricerca e Innovazione 2014-2020 - Fondo Sociale
Europeo, Azione I.2 “Attrazione e Mobilita`
Internazionale dei Ricercatori” Avviso D.D. n 407 del
27/02/2018. B. Ell has been partially supported by
the SIRIUS centre: Norwegian Research Council
project No 237898.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Viktoria</given-names>
            <surname>Benz</surname>
          </string-name>
          , Philipp Cimiano, Mohammad Fazleh Elahi, and
          <string-name>
            <given-names>Basil</given-names>
            <surname>Ell</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>Generating Grammars from lemon lexica for Questions Answering over Linked Data: a Preliminary Analysis</article-title>
          .
          <source>In NLIWOD workshop at ISWC</source>
          , volume
          <volume>2722</volume>
          , pages
          <fpage>40</fpage>
          -
          <lpage>55</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Elena</given-names>
            <surname>Cabrio</surname>
          </string-name>
          , Bonaventura Coppola, Roberto Gretter, Milen Kouylekov, Bernardo Magnini, and
          <string-name>
            <given-names>Matteo</given-names>
            <surname>Negri</surname>
          </string-name>
          .
          <year>2007</year>
          .
          <article-title>Question answering based annotation for a corpus of spoken requests</article-title>
          .
          <source>In Proceedings of the workshop on the Semantic Representation of Spoken Language</source>
          , volume
          <volume>31</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Elena</given-names>
            <surname>Cabrio</surname>
          </string-name>
          , Milen Kouylekov, Bernardo Magnini, Matteo Negri, Laura Hasler, Constantin Orasan, David Toma´s, Jose Luis Vicedo, Guenter Neumann, and
          <string-name>
            <given-names>Corinna</given-names>
            <surname>Weber</surname>
          </string-name>
          .
          <year>2008</year>
          .
          <article-title>The QALL-ME benchmark: a multilingual resource of annotated spoken requests for question answering</article-title>
          .
          <source>In LREC'08.</source>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>Nilesh</given-names>
            <surname>Chakraborty</surname>
          </string-name>
          , Denis Lukovnikov, Gaurav Maheshwari, Priyansh Trivedi, Jens Lehmann, and
          <string-name>
            <given-names>Asja</given-names>
            <surname>Fischer</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Introduction to Neural Network based Approaches for Question Answering over Knowledge Graphs</article-title>
          . CoRR, abs/
          <year>1907</year>
          .09361.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>Philipp</given-names>
            <surname>Cimiano</surname>
          </string-name>
          , Paul Buitelaar, John P.
          <string-name>
            <surname>McCrae</surname>
            ,
            <given-names>and Michael</given-names>
          </string-name>
          <string-name>
            <surname>Sintek</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>LexInfo: A declarative model for the lexicon-ontology interface</article-title>
          .
          <source>JWS</source>
          ,
          <volume>9</volume>
          (
          <issue>1</issue>
          ):
          <fpage>29</fpage>
          -
          <lpage>51</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>Danilo</given-names>
            <surname>Croce</surname>
          </string-name>
          , Alexandra Zelenanska, and
          <string-name>
            <given-names>Roberto</given-names>
            <surname>Basili</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Neural learning for question answering in italian</article-title>
          .
          <source>In AI*IA</source>
          <year>2018</year>
          , pages
          <fpage>389</fpage>
          -
          <lpage>402</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Mohammad</given-names>
            <surname>Fazleh</surname>
          </string-name>
          <string-name>
            <surname>Elahi</surname>
          </string-name>
          , Basil Ell, Frank Grimm, and
          <string-name>
            <given-names>Philipp</given-names>
            <surname>Cimiano</surname>
          </string-name>
          .
          <year>2021</year>
          .
          <article-title>Question Answering on RDF Data based on Grammars Automatically Generated from Lemon Models</article-title>
          . In SEMANTiCS Conference, Posters and Demonstrations.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Basil</given-names>
            <surname>Ell</surname>
          </string-name>
          , Mohammad Fazleh Elahi, and
          <string-name>
            <given-names>Philipp</given-names>
            <surname>Cimiano</surname>
          </string-name>
          .
          <year>2021</year>
          .
          <article-title>Bridging the Gap Between Ontology and Lexicon via Class-Specific Association Rules Mined from a Loosely-Parallel Text-Data Corpus</article-title>
          .
          <source>In LDK 2021</source>
          , pages
          <fpage>33</fpage>
          :
          <fpage>1</fpage>
          -
          <lpage>33</lpage>
          :
          <fpage>21</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <given-names>Chiara</given-names>
            <surname>Leoni</surname>
          </string-name>
          , Ilaria Torre, and
          <string-name>
            <given-names>Gianni</given-names>
            <surname>Vercelli</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>ConversIAmo: Improving Italian Question Answering Exploiting IBM Watson Services</article-title>
          . In Text, Speech, and Dialogue, pages
          <fpage>504</fpage>
          -
          <lpage>512</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>Bernardo</given-names>
            <surname>Magnini</surname>
          </string-name>
          , Matteo Negri, Roberto Prevete, and
          <string-name>
            <given-names>Hristo</given-names>
            <surname>Tanev</surname>
          </string-name>
          .
          <year>2002</year>
          .
          <article-title>Mining Knowledge from Repeated Co-Occurrences: DIOGENE at TREC</article-title>
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>Bernardo</given-names>
            <surname>Magnini</surname>
          </string-name>
          , Simone Romagnoli, Alessandro Vallin,
          <article-title>Jesu´s Herrera, Anselmo Pen˜as, V´ıctor Peinado, Felisa Verdejo</article-title>
          , and Maarten de Rijke.
          <year>2004</year>
          .
          <article-title>The Multiple Language Question Answering Track at CLEF 2003</article-title>
          .
          <source>In Comparative Evaluation of Multilingual Information Access Systems</source>
          , pages
          <fpage>471</fpage>
          -
          <lpage>486</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>Mitchell P.</given-names>
            <surname>Marcus</surname>
          </string-name>
          , Mary Ann Marcinkiewicz, and
          <string-name>
            <given-names>Beatrice</given-names>
            <surname>Santorini</surname>
          </string-name>
          .
          <year>1993</year>
          .
          <article-title>Building a Large Annotated Corpus of English: The Penn Treebank</article-title>
          . Comput. Linguist.,
          <volume>19</volume>
          (
          <issue>2</issue>
          ):
          <fpage>313</fpage>
          --
          <lpage>330</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Giuseppe M. Mazzeo</surname>
            and
            <given-names>Carlo</given-names>
          </string-name>
          <string-name>
            <surname>Zaniolo</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Answering controlled natural language questions on RDF knowledge bases</article-title>
          .
          <source>In EDBT</source>
          , pages
          <fpage>608</fpage>
          -
          <lpage>611</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>John P. McCrae</surname>
            ,
            <given-names>Dennis</given-names>
          </string-name>
          <string-name>
            <surname>Spohr</surname>
            ,
            <given-names>and Philipp</given-names>
          </string-name>
          <string-name>
            <surname>Cimiano</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Linking Lexical Resources and Ontologies on the Semantic Web with Lemon</article-title>
          .
          <source>In ESWC Conference</source>
          , pages
          <fpage>245</fpage>
          -
          <lpage>259</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <given-names>Tomas</given-names>
            <surname>Mikolov</surname>
          </string-name>
          , Ilya Sutskever, Kai Chen, Greg S Corrado, and
          <string-name>
            <given-names>Jeff</given-names>
            <surname>Dean</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Distributed representations of words and phrases and their compositionality</article-title>
          .
          <source>In Advances in Neural Information Processing Systems</source>
          , volume
          <volume>26</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <given-names>Arianna</given-names>
            <surname>Pipitone</surname>
          </string-name>
          , Giuseppe Tirone, and
          <string-name>
            <given-names>Roberto</given-names>
            <surname>Pirrone</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>QuASIt: A Cognitive Inspired Approach to Question Answering for the Italian Language</article-title>
          . volume
          <volume>10037</volume>
          , pages
          <fpage>464</fpage>
          -
          <lpage>476</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <given-names>Pranav</given-names>
            <surname>Rajpurkar</surname>
          </string-name>
          , Jian Zhang, Konstantin Lopyrev, and
          <string-name>
            <given-names>Percy</given-names>
            <surname>Liang</surname>
          </string-name>
          .
          <year>2016</year>
          . SQuAD:
          <volume>100</volume>
          , 000+
          <article-title>Questions for Machine Comprehension of Text</article-title>
          . CoRR, abs/1606.05250.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <given-names>Pranav</given-names>
            <surname>Rajpurkar</surname>
          </string-name>
          , Robin Jia, and
          <string-name>
            <given-names>Percy</given-names>
            <surname>Liang</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Know What You Don't Know: Unanswerable Questions for SQuAD</article-title>
          . CoRR, abs/
          <year>1806</year>
          .03822.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <given-names>Lucia</given-names>
            <surname>Siciliani</surname>
          </string-name>
          , Pierpaolo Basile, Giovanni Semeraro, and
          <string-name>
            <given-names>Matteo</given-names>
            <surname>Mennitti</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>An italian question answering system for structured data based on controlled natural languages</article-title>
          . In CLiC-it.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <given-names>Hristo</given-names>
            <surname>Tanev</surname>
          </string-name>
          , Matteo Negri, Bernardo Magnini, and
          <string-name>
            <given-names>Milen</given-names>
            <surname>Kouylekov</surname>
          </string-name>
          .
          <year>2004</year>
          .
          <article-title>The DIOGENE question answering system at CLEF-2004</article-title>
          . volume
          <volume>3491</volume>
          , pages
          <fpage>435</fpage>
          -
          <lpage>445</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <article-title>O´ scar Ferra´ndez, Christian Spurk</article-title>
          , Milen Kouylekov, Iustin Dornescu, Sergio Ferra´ndez, Matteo Negri, Rube´n Izquierdo, David Toma´s, Constantin Orasan, Guenter Neumann, Bernardo Magnini, and Jose Luis Vicedo.
          <year>2011</year>
          .
          <article-title>The QALL-ME Framework: A specifiable-domain multilingual Question Answering architecture</article-title>
          .
          <source>Journal of Web Semantics</source>
          ,
          <volume>9</volume>
          (
          <issue>2</issue>
          ):
          <fpage>137</fpage>
          -
          <lpage>145</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>