<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Building a computational lexicon by using SQL</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Alessandro Mazzei Dipartimento di Informatica</string-name>
          <email>mazzei@di.unito.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Universit degli Studi di Torino Corso Svizzera 185</institution>
          ,
          <addr-line>10149 Torino</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>English. This paper presents some issues about a computational lexicon employed in a generation system for Italian (Mazzei et al., 2016). The paper has three goals: (i) to describe the SQL resources produced during the construction of the lexicon; (ii) to describe the algorithm for building the lexicon; (iii) to present an ongoing work for enhancing the lexicon by using the syntactic information extracted from a treebank.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        A number of free large multilingual resources
covering Italian have been released, e.g.
MultiWordnet, UniversalWordnet, BabelNet
        <xref ref-type="bibr" rid="ref10 ref3 ref5 ref7">(Pianta et al.,
2002; de Melo and Weikum, 2009; Navigli and
Ponzetto, 2012)</xref>
        . Moreover, several lexical
corpora have been built specifically for Italian, as the
detailed map of the Italian NLP resources
produced within the PARLI project shows1.
Unfortunately most resources are designed to represent
lexical semantics rather than morpho-syntactic
relations among the words. As a consequence, these
resources cannot be employed in statistical or
rulebased natural language morho-syntactic analyzer
or generator.
      </p>
      <p>
        A notable exception is the
PAROLE-SIMPLECLIPS lexicon, that is a four-level (i.e.
phonological, morphological, syntactical, semantic) general
purpose lexicon composed by 53; 044 lemmata
        <xref ref-type="bibr" rid="ref12">(Ruimy et al., 1998)</xref>
        . Unfortunately, a strong
limitation for the usage of PAROLE-SIMPLE-CLIPS
is the licence, since it is not freely available for
research or commercial use.
      </p>
      <p>
        Rule-based natural language realization
engines, that are systems performing
linearisation and morphological inflections of a
protosyntactic input tree
        <xref ref-type="bibr" rid="ref3 ref5">(Gatt and Reiter, 2009)</xref>
        , need
wide coverage morpho-syntactic information as
knowledge-base. In other words, to perform
realization, that is the last step of natural language
generation
        <xref ref-type="bibr" rid="ref11">(Reiter and Dale, 2000)</xref>
        , one needs two
main kinds of linguistic knowledge: (i) the
grammatical/syntactical knowledge that specifies the
syntactic rules of the language and which is
usually encoded into formal rules; (ii) the
morphological and lexical knowledge, which is usually
encoded into a computational lexicon. In the
porting of the SimpleNLG system to Italian
(henceforth SimpleNLG-IT)
        <xref ref-type="bibr" rid="ref6">(Mazzei et al., 2016)</xref>
        , we
have used the grammar
        <xref ref-type="bibr" rid="ref9">(Patota, 2006)</xref>
        as the
linguistic reference for the syntax: we have encoded
the Italian syntactic inflections and word ordering
by using IF-THEN-ELSE rules in Java. However,
since Italian has a high number of irregularities
for verb and adjective inflections, we needed for
a specifically designed computational lexicon too.
We needed for a lexicon that has both a good
coverage and a detailed account of the morphological
irregularities.
      </p>
      <p>
        In order to build this specific lexicon, that we
have called SimpleLEX-IT, we have decided to
merge three free resources for Italian, namely
Morph-it!
        <xref ref-type="bibr" rid="ref14">(Zanchetta and Baroni, 2005)</xref>
        , the
Vocabolario di base della lingua italiana
        <xref ref-type="bibr" rid="ref2">(De Mauro,
1985)</xref>
        and, for some specific issues, Wikipedia.
The differences between the three resources can be
referred to both the reasons for which the authors
developed them and the adopted methodology and
approach they applied in their development: the
first is a hand-made list of basic words; the second
one is an extensional corpus based morphological
lexicon; the third one is a collection of
encyclopedic entries about irregular verbs in Italian.
      </p>
      <p>This paper is organized as follows: in Section 2
we describe the conversion of the three lexical
resources used into a relational database; in
Section 3 we provide some details about the algorithm
used to build SimpleLEX-IT; in Section 4 we
describe a work in progress to enrich the lexicon by
using the syntactic information extracted from a
treebank; finally, Section 5 closes the paper with
conclusions.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Using relational database for representing linguistic data</title>
      <p>In order to merge different lexical resources we
needed to convert them in a common
computational representation. We used a relational
database2 (SQL henceforth) since all the three
resources are originally provided as text files,
organized as tables or simple list.</p>
      <p>
        The first resource that we exploited for
populating SimpleLEX-IT is Morph-it!
        <xref ref-type="bibr" rid="ref14">(Zanchetta and
Baroni, 2005)</xref>
        . The dataset released in the
Morphit! project consists of a lexicon organized
according to the inflected word forms, with associated
lemmas and morphological features. The lexicon
is provided by the authors as a text file where the
values of the information about each lexical
entry are separated by a tab key. It is an
alphabetically ordered list of triples form-lemma-features.
An example of the annotation for the form corsi
(ran) is:
      </p>
      <p>corsi correre-VER:ind past+1+s
where the features are the part of speech (PoS,
VERb), the mood of the verb (indicative), the
tense (past), the person (1), and the number
(singular). The last released version of
Morphit! (v.48, 2009-02-23) contains 505; 074
different forms corresponding to 35; 056 lemmas. It
has been realized starting from a large
newspaper corpus, nevertheless it is not balanced and a
2We used the PostgreSQL database .
small number of also very common Italian words
are not included in the lexicon, e.g. sposa (bride),
ovest (west) or aceto (vinegar). Morph-it!
represents extensionally the Italian language by listing
all the morphological inflections, i.e. adjective,
verbs, nouns inflections are represented as a list
rather than by using morphological rules. We
converted Morph-it! in SQL by exploiting its original
feature structure: we used one single attribute to
represent one single feature3. We used one table
to collect all the lemmata and seven tables, with a
different number of attributes, to collect the
various inflected forms:
the table lemmata is formed by 3 attributes: a
lemma, its PoS and its ID (integer). This table
contains 34; 725 records. A number of lemmata
belonging to the original version of Morph-it!
have been excluded in our conversion: proper
nouns, emoticons and cardinals beginning with
a digit (e.g. 15mila).
the tables det demo table, pro demo table,
pronou table are used to collect inflected form
of demonstrative determiners (116 records, 4
attributes: ID word, form, ID lemma, number,
gender), demonstrative pronouns (95 records,
5 attributes: ID word, form, ID lemma,
number, gender), personal pronouns (63 records, 7
attributes: ID word, form, ID lemma, person,
number, gender, clitics).
the tables adv table, adj table, nou table,
ver table are used to collect inflected form of
adverbs (1; 594 records, 3 attributes: ID word,
form, ID lemma), adjectives (72; 367 records,
6 attributes: ID word, form, ID lemma, kind,
number, gender), nouns (35; 618 records, 5
attributes: ID word, form, ID lemma,
number, gender) and verbs (392; 139, 8 attributes:
records: ID word, form, ID lemma, mode, time,
person, number, gender) respectively.</p>
      <p>
        The second resource we exploited for
populating SimpleLEX-IT is the “Vocabolario di base
della lingua italiana” (VdB-IT henceforth), a
collection of around 7; 000 words created by the
linguist Tullio De Mauro and his team4
        <xref ref-type="bibr" rid="ref2">(De Mauro,
1985)</xref>
        . The development of this vocabulary has
been mainly driven by the distinction between the
3Morph-IT! is provided with a script that allows for a
naive conversion into SQL that use one single table and one
single attribute for all the features.
      </p>
      <p>
        4The second edition of the vocabulary has been
announced
        <xref ref-type="bibr" rid="ref1 ref4">(Chiari and De Mauro, 2014)</xref>
        and it is going to be
released (p.c.).
most frequent words (around 5:000) and the most
familiar words (around 2:000). VdB-IT is
therefore organized in the following three sections:
the vocabolario fondamentale (fundamental
vocabulary), which contains 2; 000 words featured
by the highest frequency into a balanced
corpus of Italian texts (composed of novels, movie
and theater scripts, newspapers, basic
scholastic books); amore (love), lavoro (work), pane
(bread) are in this section.
the vocabolario di alto uso (vocabulary of high
usage), which includes other 2; 937 words with
high frequency, but lesser than the vocabolario
fondamentale; ala (wing), seta (silk), toro (bull)
are in this section
the vocabolario di alta disponibilita`
(vocabulary of high availability), is composed of 1; 753
words not often used in written language, but
featured by a high frequency in spoken
language, which are indeed perceived as especially
familiar by native speakers; aglio (garlic),
cascata (waterfall), passeggero (passenger) are in
this section.
      </p>
      <p>The list of lemmata of VdB has been converted in
SQL by using one single table, called lemmadema
(6540 records), which have two attributes, i.e. an
ID (integer) and the lemma.</p>
      <p>
        The third resource that we used for the
lexicon creation is Wikipedia. Our reference grammar
        <xref ref-type="bibr" rid="ref9">(Patota, 2006)</xref>
        reports a partial list of the principal
Italian irregular verbs, but we decided to use the
larger list of verbs reported in Wikipedia5 (VerIrr
henceforth). Another linguistic distinction for
Italian verbs reported in Wikipedia6 (VerInc
henceforth) has been exploited in the lexicon: the
incoativi verbs are a subclass of the third
conjugation that have a special behavior in the present time
(e.g. capire). So, in order to produce the correct
conjugation of these verbs in SimpleNLG-IT, they
needed to be marked in the lexicon. Both these
lists of verbs have been converted in SQL by using
two distinct tables which have two attributes, i.e.
an ID (integer) and the verb in the infinitive form.
The two tables are verbiirregolari (858 records)
and verbiincoativi (726 records).
      </p>
      <p>A notable advantage of the SQL representation
for linguistic resources is the possibility to extract
intrinsic information with simple queries. Indeed,
5https://it.wikipedia.org/wiki/Verbi_
irregolari_italiani</p>
      <p>6https://it.wikipedia.org/wiki/Verbi_
incoativi
we found that Morph-it! and VdB share 4; 086
nouns and 1; 448 verbs, but there are 245 lemmas
belonging to VdD and not belonging to Morph-it!:
most of these words are nouns, for instance
lavapiatti, chimica, incinta, but we found too a
systematic difference for verbs. Indeed, VdB consider as
proper reflexive a number of verbs, for instance
avvantaggiarsi, sdraiarsi. In contrast, these verbs
are are treated as improper reflexive in
Morphit!, which annotates avvantaggiare and sdraiare as
their lemmata.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Building SimpleLEX-IT 1.0</title>
      <p>In this section we describe the algorithm used
to build the computational lexicon
SimpleLEXIT, which is based on the three resources
described in the Section 2, and that has been used
in SimpleNLG-IT.</p>
      <p>
        A computational lexicon can be split in two
major classes: open and closed classes. The closed
class, that are usually composed by function words
(i.e. prepositions, determiners, conjunctions,
pronouns, etc.) is one to which new words are very
rarely added. In contrast, the open classes, that
is usually composed by lexical words (i.e. nouns,
verbs, adjectives, adverbs), accept the addition of
new words. We adopted the same strategy of
        <xref ref-type="bibr" rid="ref13">(Vaudry and Lapalme, 2013)</xref>
        : we built by hand
the closed part of the Italian lexicon and we built
automatically the open part by using the available
resources.
      </p>
      <p>
        In order to build the open class for
SimpleLEXIT we needed both a large coverage and a detailed
account of morphological irregularities, also
considering their high frequency in Italian.
Moreover, in order to have good time execution
performance in the realiser (cf.
        <xref ref-type="bibr" rid="ref1 ref4">(De Oliveira and
Sripada, 2014)</xref>
        ), a trade-off between the size of
the lexicon and its usability for our task must
be achieved, which consists in assuming a form
of word classification where fundamental Italian
words are distinguished from the less-fundamental
ones. In order to balance completeness and
efficiency in SimpleLEX-IT, we put in the lexicon the
open classes words belonging to the intersection
of VdB-IT and Morph-it!.
      </p>
      <p>We reported in Algorithm 1 the process used
for the insertion and the annotation of the words
belonging to the open classes in SimpleLEX-IT.
Note that in order to recognize proper reflexive
verbs, we check if the infinitive form of the verb
else
end
foreach adverb 2 Morph-it! \ VdB-IT do</p>
      <p>Add the adverb in normal form into SimpleLEX-IT
end
foreach adjective 2 Morph-it! \ VdB-IT do</p>
      <p>Add the adjective in normal form (masculine-singular) and in
feminine-singular, masculine-plural, feminine-plural forms, into
SimpleLEX-IT
end
foreach noun 2 Morph-it! \ VdB-IT do</p>
      <p>Add the noun in normal form (singular), the plural form, and the
gender into SimpleLEX-IT
end
foreach verb 2 Morph-it! \ VdB-IT do
if verb 2 VerIrr then</p>
      <p>Add all the inflections for the indicativo presente,
congiuntivo presente, futuro semplice, condizionale,
imperfetto, participio passato, passato remoto into
SimpleLEX-IT
if verb 2 VerInc then</p>
      <p>Set active the incoativo feature in the entry
end
if the verb is properly reflexive (i.e. ”...rsi”) then</p>
      <p>Set active the reflexive feature in the entry
end</p>
      <p>Add the verb in normal form into SimpleLEX-IT
end
Algorithm 1: The algorithm for building
the adverbs, adjectives, nouns and verbs in
SimpleLEX-IT
has the postfix “rsi”, since MorphIT! contains this
inflection as its normal form. In Table 1 we
reported some statistics about SimpleLEX-IT
composition. Most of the lexicon is composed by
nouns (58%), followed by verbs (21%), adjectives
(19%), and adverbs (2%).</p>
      <p>PoS
Adverb
Verb (irr.)
Verb (reg.)
Adjective</p>
      <p>Noun
Total</p>
    </sec>
    <sec id="sec-4">
      <title>Work in Progress: adding information from a treebank</title>
      <p>
        The Universal Dependency Treebank (UDT) is a
recent project that releases freely available
treebanks for 33 languages (in this work, version 1.2)
        <xref ref-type="bibr" rid="ref8">(Nivre et al., 2016)</xref>
        . Each UDT is split in three
sections, train, dev and test, which can be exploited in
the evaluation of NLP/NLG systems.
      </p>
      <p>We are working on the idea of adding more
information in SimpleLEX-IT by using UDT-IT, i.e.
the Italian section of UDT. A specific case that we
are currently considering regards auxiliary verbs.
The current version of SimpleNLG-IT does not
manage auxiliary verbs: in order to produce some
complex verb tense, e.g. passato prossimo, the
user needs to give in input to the realiser the
correct auxiliary, i.e. essere (to be, e.g. Io sono nato
a Napoli) or avere (to have, e.g. Io ho amato la
scuola.). Our reference grammar reports complex
rules based on lexical semantics in order to choose
the correct auxiliary verb and, unfortunately, these
rules have many exceptions. So, we can use
UDTIT to empirically decide the correct auxiliary in
SimpleNLG-IT. By following this idea, we
converted UDT-IT in SQL by exploiting its original
feature structuree. We used one table to collect
information about the sentences, and one table to
collect information about the words:
the table sentence ud is formed by 4 attributes:
an ID (integer), the original treebank (i.e. TUT,
ISST, etc.), the original ID, the section (i.e.
DEV, TRAIN, TEST).
the table words ud is used to collect all the
words of the UDT-IT. It uses 21 attributes: one
attribute id sentence, contains the id of the
sentence in the table sentence ud, and 20 attributes
correspond to the featured used in the UD
annotation.</p>
      <p>In order to find the correct auxiliary for a specific
verb in UDT-IT, we need to exclude passive,
reflexive and modal verb constructions in the query.
We found 512 verbs of SimpleLEX-IT that are
used in UDT-IT with an auxiliary. It is interesting
to note that 60 verbs are used both with the
auxiliary essere and with the auxiliary avere: this is
grammatical for some verbs (e.g. vivere), but more
often we found an annotation error in the UDT-IT.</p>
      <p>Finally, note that another possible use of
UDTIT regards the evaluation of the lexicon. In
future work we plan to quantify the coverage of
SimpleLEX-IT by using the TEST section of the
UDT-IT.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusions</title>
      <p>In this paper we have presented some issues about
the computational lexicon SimpleLEX-IT. We
described the algorithm used to build the lexicon,
three SQL resources produced as side effects of
the lexicon building and a work in progress about
the extraction of syntactic information from
UDIT.</p>
      <p>All the resources described in this paper can
be downloaded at https://github.com/
alexmazzei/SimpleLEX-IT.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Isabella</given-names>
            <surname>Chiari and Tullio De Mauro</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>The New Basic Vocabulary of Italian as a linguistic resource</article-title>
          . In Roberto Basili, Alessandro Lenci, and Bernardo Magnini, editors,
          <source>1th Italian Conference on Computational Linguistics (CLiC-it)</source>
          , volume
          <volume>1</volume>
          , pages
          <fpage>93</fpage>
          -
          <lpage>97</lpage>
          . Pisa University Press, December.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Tullio De Mauro</surname>
          </string-name>
          .
          <year>1985</year>
          .
          <article-title>Guida all'uso delle parole. Libri di base</article-title>
          .
          <source>Editori Riuniti.</source>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Gerard de Melo and Gerhard Weikum</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Towards a universal wordnet by learning from combined evidence</article-title>
          .
          <source>In Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM</source>
          <year>2009</year>
          ), pages
          <fpage>513</fpage>
          -
          <lpage>522</lpage>
          , New York, NY, USA. ACM.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Rodrigo De Oliveira and Somayajulu Sripada</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Adapting simplenlg for brazilian portuguese realisation</article-title>
          .
          <source>In Proc. of INLG</source>
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>Albert</given-names>
            <surname>Gatt</surname>
          </string-name>
          and
          <string-name>
            <given-names>Ehud</given-names>
            <surname>Reiter</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>SimpleNLG: A Realisation Engine for Practical Applications</article-title>
          .
          <source>In Proc. of ENLG</source>
          <year>2009</year>
          , ENLG '
          <fpage>09</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>Alessandro</given-names>
            <surname>Mazzei</surname>
          </string-name>
          , Cristina Battaglino, and
          <string-name>
            <given-names>Cristina</given-names>
            <surname>Bosco</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>SimpleNLG-IT: adapting SimpleNLG to Italian</article-title>
          .
          <source>In Proc. of INLG</source>
          <year>2016</year>
          . TO APPEAR.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Roberto</given-names>
            <surname>Navigli</surname>
          </string-name>
          and Simone Paolo Ponzetto.
          <year>2012</year>
          .
          <article-title>BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network</article-title>
          .
          <source>Artificial Intelligence</source>
          ,
          <volume>193</volume>
          :
          <fpage>217</fpage>
          -
          <lpage>250</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Joakim</given-names>
            <surname>Nivre</surname>
          </string-name>
          ,
          <string-name>
            <surname>Marie-Catherine de Marneffe</surname>
            , Filip Ginter, Yoav Goldberg, Jan Hajic,
            <given-names>Christopher D.</given-names>
          </string-name>
          <string-name>
            <surname>Manning</surname>
          </string-name>
          ,
          <string-name>
            <surname>Ryan</surname>
            <given-names>McDonald</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Slav</given-names>
            <surname>Petrov</surname>
          </string-name>
          , Sampo Pyysalo, Natalia Silveira, Reut Tsarfaty, and
          <string-name>
            <given-names>Daniel</given-names>
            <surname>Zeman</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Universal Dependencies v1:A Multilingual Treebank Collection</article-title>
          .
          <source>In Proc. of LREC'16</source>
          , may. TO APPEAR.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <given-names>Giuseppe</given-names>
            <surname>Patota</surname>
          </string-name>
          .
          <year>2006</year>
          .
          <article-title>Grammatica di riferimento dell'italiano contemporaneo</article-title>
          .
          <source>Guide linguistiche. Garzanti Linguistica.</source>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>Emanuele</given-names>
            <surname>Pianta</surname>
          </string-name>
          , Luisa Bentivogli, and
          <string-name>
            <given-names>Christian</given-names>
            <surname>Girardi</surname>
          </string-name>
          .
          <year>2002</year>
          .
          <article-title>Multiwordnet: developing an aligned multilingual database</article-title>
          .
          <source>In Proceedings of the First International Conference on Global WordNet</source>
          , January.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>Ehud</given-names>
            <surname>Reiter</surname>
          </string-name>
          and
          <string-name>
            <given-names>Robert</given-names>
            <surname>Dale</surname>
          </string-name>
          .
          <year>2000</year>
          .
          <article-title>Building Natural Language Generation Systems</article-title>
          . Cambridge University Press, New York, NY, USA.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>Nilda</given-names>
            <surname>Ruimy</surname>
          </string-name>
          , Ornella Corazzari, Elisabetta Gola, Antonietta Spanu, Nicoletta Calzolari, and Antonio Zampolli.
          <year>1998</year>
          .
          <article-title>The European LE-PAROLE project: the Italian Syntactic Lexicon</article-title>
          .
          <source>In Proceedings of the First International Conference on Language resources and Evaluation</source>
          , pages
          <fpage>241</fpage>
          -
          <lpage>248</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Pierre-Luc Vaudry</surname>
            and
            <given-names>Guy</given-names>
          </string-name>
          <string-name>
            <surname>Lapalme</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Adapting simplenlg for bilingual english-french realisation</article-title>
          .
          <source>In Proc. of ENLG</source>
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>Eros</given-names>
            <surname>Zanchetta</surname>
          </string-name>
          and
          <string-name>
            <given-names>Marco</given-names>
            <surname>Baroni</surname>
          </string-name>
          .
          <year>2005</year>
          .
          <article-title>Morph-it! a free corpus-based morphological resource for the italian language</article-title>
          .
          <source>Corpus Linguistics</source>
          <year>2005</year>
          ,
          <volume>1</volume>
          (
          <issue>1</issue>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>