<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="it">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Building a computational lexicon by using SQL</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Alessandro</forename><surname>Mazzei</surname></persName>
							<email>mazzei@di.unito.it</email>
							<affiliation key="aff0">
								<orgName type="department">Dipartimento di Informatica Universit degli Studi di</orgName>
								<orgName type="institution">Torino Corso Svizzera</orgName>
								<address>
									<addrLine>185</addrLine>
									<postCode>10149</postCode>
									<settlement>Torino</settlement>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Building a computational lexicon by using SQL</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">BCDF4416A1A916F119CF31C9A2F88692</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T23:47+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>English. This paper presents some issues about a computational lexicon employed in a generation system for Italian <ref type="bibr" target="#b5">(Mazzei et al., 2016)</ref>. The paper has three goals: (i) to describe the SQL resources produced during the construction of the lexicon; (ii) to describe the algorithm for building the lexicon; (iii) to present an ongoing work for enhancing the lexicon by using the syntactic information extracted from a treebank.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Italiano.</head><p>Questo lavoro descrive la costruzione di un lessico computazionale per la generazione automatica dell'italiano <ref type="bibr" target="#b5">(Mazzei et al., 2016)</ref>. Il lavoro ha tre obiettivi: (i) descrivere alcune risorse SQL prodotte funzionalmente alla costruzione del lessico; (ii) descrivere l'algoritmo per la costruzione del lessico; (iii) presentare un lavoro in divenire per migliorare il lessico che usa l'informazione sintattica estratta da un treebank.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="it">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>A number of free large multilingual resources covering Italian have been released, e.g. MultiWordnet, UniversalWordnet, BabelNet <ref type="bibr" target="#b9">(Pianta et al., 2002;</ref><ref type="bibr" target="#b2">de Melo and Weikum, 2009;</ref><ref type="bibr" target="#b6">Navigli and Ponzetto, 2012)</ref>. Moreover, several lexical corpora have been built specifically for Italian, as the detailed map of the Italian NLP resources produced within the PARLI project shows<ref type="foot" target="#foot_0">1</ref> . Unfortunately most resources are designed to represent lexical semantics rather than morpho-syntactic relations among the words. As a consequence, these resources cannot be employed in statistical or rulebased natural language morho-syntactic analyzer or generator.</p><p>A notable exception is the PAROLE-SIMPLE-CLIPS lexicon, that is a four-level (i.e. phonological, morphological, syntactical, semantic) general purpose lexicon composed by 53, 044 lemmata <ref type="bibr" target="#b11">(Ruimy et al., 1998)</ref>. Unfortunately, a strong limitation for the usage of PAROLE-SIMPLE-CLIPS is the licence, since it is not freely available for research or commercial use.</p><p>Rule-based natural language realization engines, that are systems performing linearisation and morphological inflections of a protosyntactic input tree <ref type="bibr" target="#b4">(Gatt and Reiter, 2009)</ref>, need wide coverage morpho-syntactic information as knowledge-base. In other words, to perform realization, that is the last step of natural language generation <ref type="bibr" target="#b10">(Reiter and Dale, 2000)</ref>, one needs two main kinds of linguistic knowledge: (i) the grammatical/syntactical knowledge that specifies the syntactic rules of the language and which is usually encoded into formal rules; (ii) the morphological and lexical knowledge, which is usually encoded into a computational lexicon. In the porting of the SimpleNLG system to Italian (henceforth SimpleNLG-IT) <ref type="bibr" target="#b5">(Mazzei et al., 2016)</ref>, we have used the grammar <ref type="bibr" target="#b8">(Patota, 2006)</ref> as the linguistic reference for the syntax: we have encoded the Italian syntactic inflections and word ordering by using IF-THEN-ELSE rules in Java. However, since Italian has a high number of irregularities for verb and adjective inflections, we needed for a specifically designed computational lexicon too. We needed for a lexicon that has both a good coverage and a detailed account of the morphological irregularities.</p><p>In order to build this specific lexicon, that we have called SimpleLEX-IT, we have decided to merge three free resources for Italian, namely Morph-it! <ref type="bibr" target="#b12">(Zanchetta and Baroni, 2005)</ref>, the Vo-cabolario di base della lingua italiana <ref type="bibr" target="#b1">(De Mauro, 1985)</ref> and, for some specific issues, Wikipedia. The differences between the three resources can be referred to both the reasons for which the authors developed them and the adopted methodology and approach they applied in their development: the first is a hand-made list of basic words; the second one is an extensional corpus based morphological lexicon; the third one is a collection of encyclopedic entries about irregular verbs in Italian.</p><p>This paper is organized as follows: in Section 2 we describe the conversion of the three lexical resources used into a relational database; in Section 3 we provide some details about the algorithm used to build SimpleLEX-IT; in Section 4 we describe a work in progress to enrich the lexicon by using the syntactic information extracted from a treebank; finally, Section 5 closes the paper with conclusions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Using relational database for representing linguistic data</head><p>In order to merge different lexical resources we needed to convert them in a common computational representation. We used a relational database<ref type="foot" target="#foot_1">2</ref> (SQL henceforth) since all the three resources are originally provided as text files, organized as tables or simple list.</p><p>The first resource that we exploited for populating SimpleLEX-IT is Morph-it! <ref type="bibr" target="#b12">(Zanchetta and Baroni, 2005)</ref>. The dataset released in the Morphit! project consists of a lexicon organized according to the inflected word forms, with associated lemmas and morphological features. The lexicon is provided by the authors as a text file where the values of the information about each lexical entry are separated by a tab key. It is an alphabetically ordered list of triples form-lemma-features. An example of the annotation for the form corsi (ran) is: corsi correre-VER:ind past+1+s where the features are the part of speech (PoS, VERb), the mood of the verb (indicative), the tense (past), the person (1), and the number (singular). The last released version of <ref type="bibr">Morphit! (v.48, 2009-02-23)</ref> contains 505, 074 different forms corresponding to 35, 056 lemmas. It has been realized starting from a large newspaper corpus, nevertheless it is not balanced and a small number of also very common Italian words are not included in the lexicon, e.g. sposa (bride), ovest (west) or aceto (vinegar). Morph-it! represents extensionally the Italian language by listing all the morphological inflections, i.e. adjective, verbs, nouns inflections are represented as a list rather than by using morphological rules. We converted Morph-it! in SQL by exploiting its original feature structure: we used one single attribute to represent one single feature<ref type="foot" target="#foot_2">3</ref> . We used one table to collect all the lemmata and seven tables, with a different number of attributes, to collect the various inflected forms: The second resource we exploited for populating SimpleLEX-IT is the "Vocabolario di base della lingua italiana" (VdB-IT henceforth), a collection of around 7, 000 words created by the linguist Tullio De Mauro and his team<ref type="foot" target="#foot_3">4</ref>  <ref type="bibr" target="#b1">(De Mauro, 1985)</ref>. The development of this vocabulary has been mainly driven by the distinction between the most frequent words (around 5.000) and the most familiar words (around 2.000). VdB-IT is therefore organized in the following three sections:</p><p>• the vocabolario fondamentale (fundamental vocabulary), which contains 2, 000 words featured by the highest frequency into a balanced corpus of Italian texts (composed of novels, movie and theater scripts, newspapers, basic scholastic books); amore (love), lavoro (work), pane (bread) are in this section. • the vocabolario di alto uso (vocabulary of high usage), which includes other 2, 937 words with high frequency, but lesser than the vocabolario fondamentale; ala (wing), seta (silk), toro (bull) are in this section • the vocabolario di alta disponibilità (vocabulary of high availability), is composed of 1, 753 words not often used in written language, but featured by a high frequency in spoken language, which are indeed perceived as especially familiar by native speakers; aglio (garlic), cascata (waterfall), passeggero (passenger) are in this section. The list of lemmata of VdB has been converted in SQL by using one single table, called lemmadema (6540 records), which have two attributes, i.e. an ID (integer) and the lemma.</p><p>The third resource that we used for the lexicon creation is Wikipedia. Our reference grammar <ref type="bibr" target="#b8">(Patota, 2006)</ref> reports a partial list of the principal Italian irregular verbs, but we decided to use the larger list of verbs reported in Wikipedia<ref type="foot" target="#foot_4">5</ref> (VerIrr henceforth). Another linguistic distinction for Italian verbs reported in Wikipedia<ref type="foot" target="#foot_5">6</ref> (VerInc henceforth) has been exploited in the lexicon: the incoativi verbs are a subclass of the third conjugation that have a special behavior in the present time (e.g. capire). So, in order to produce the correct conjugation of these verbs in SimpleNLG-IT, they needed to be marked in the lexicon. Both these lists of verbs have been converted in SQL by using two distinct tables which have two attributes, i.e. an ID (integer) and the verb in the infinitive form. The two tables are verbiirregolari (858 records) and verbiincoativi (726 records).</p><p>A notable advantage of the SQL representation for linguistic resources is the possibility to extract intrinsic information with simple queries. Indeed, we found that Morph-it! and VdB share 4, 086 nouns and 1, 448 verbs, but there are 245 lemmas belonging to VdD and not belonging to Morph-it!: most of these words are nouns, for instance lavapiatti, chimica, incinta, but we found too a systematic difference for verbs. Indeed, VdB consider as proper reflexive a number of verbs, for instance avvantaggiarsi, sdraiarsi. In contrast, these verbs are are treated as improper reflexive in Morphit!, which annotates avvantaggiare and sdraiare as their lemmata.</p><p>3 Building SimpleLEX-IT 1.0</p><p>In this section we describe the algorithm used to build the computational lexicon SimpleLEX-IT, which is based on the three resources described in the Section 2, and that has been used in SimpleNLG-IT.</p><p>A computational lexicon can be split in two major classes: open and closed classes. The closed class, that are usually composed by function words (i.e. prepositions, determiners, conjunctions, pronouns, etc.) is one to which new words are very rarely added. In contrast, the open classes, that is usually composed by lexical words (i.e. nouns, verbs, adjectives, adverbs), accept the addition of new words. We adopted the same strategy of (Vaudry and Lapalme, 2013): we built by hand the closed part of the Italian lexicon and we built automatically the open part by using the available resources.</p><p>In order to build the open class for SimpleLEX-IT we needed both a large coverage and a detailed account of morphological irregularities, also considering their high frequency in Italian. Moreover, in order to have good time execution performance in the realiser (cf. <ref type="bibr" target="#b3">(De Oliveira and Sripada, 2014)</ref>), a trade-off between the size of the lexicon and its usability for our task must be achieved, which consists in assuming a form of word classification where fundamental Italian words are distinguished from the less-fundamental ones. In order to balance completeness and efficiency in SimpleLEX-IT, we put in the lexicon the open classes words belonging to the intersection of VdB-IT and Morph-it!.</p><p>We reported in Algorithm 1 the process used for the insertion and the annotation of the words belonging to the open classes in SimpleLEX-IT. Note that in order to recognize proper reflexive verbs, we check if the infinitive form of the verb 4 Work in Progress: adding information from a treebank</p><p>The Universal Dependency Treebank (UDT) is a recent project that releases freely available treebanks for 33 languages (in this work, version 1.2) <ref type="bibr" target="#b7">(Nivre et al., 2016)</ref>. Each UDT is split in three sections, train, dev and test, which can be exploited in the evaluation of NLP/NLG systems. We are working on the idea of adding more information in SimpleLEX-IT by using UDT-IT, i.e. the Italian section of UDT. A specific case that we are currently considering regards auxiliary verbs. The current version of SimpleNLG-IT does not manage auxiliary verbs: in order to produce some complex verb tense, e.g. passato prossimo, the user needs to give in input to the realiser the correct auxiliary, i.e. essere (to be, e.g. Io sono nato a Napoli) or avere (to have, e.g. Io ho amato la scuola.). Our reference grammar reports complex rules based on lexical semantics in order to choose the correct auxiliary verb and, unfortunately, these rules have many exceptions. So, we can use UDT-IT to empirically decide the correct auxiliary in SimpleNLG-IT. By following this idea, we converted UDT-IT in SQL by exploiting its original feature structuree. We used one table to collect information about the sentences, and one table to collect information about the words:</p><p>• the table sentence ud is formed by 4 attributes:</p><p>an ID (integer), the original treebank (i.e. TUT, ISST, etc.), the original ID, the section (i.e. DEV, TRAIN, TEST). • the table words ud is used to collect all the words of the UDT-IT. It uses 21 attributes: one attribute id sentence, contains the id of the sentence in the table sentence ud, and 20 attributes correspond to the featured used in the UD annotation. In order to find the correct auxiliary for a specific verb in UDT-IT, we need to exclude passive, reflexive and modal verb constructions in the query. We found 512 verbs of SimpleLEX-IT that are used in UDT-IT with an auxiliary. It is interesting to note that 60 verbs are used both with the auxiliary essere and with the auxiliary avere: this is grammatical for some verbs (e.g. vivere), but more often we found an annotation error in the UDT-IT.</p><p>Finally, note that another possible use of UDT-IT regards the evaluation of the lexicon. In future work we plan to quantify the coverage of SimpleLEX-IT by using the TEST section of the UDT-IT.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Conclusions</head><p>In this paper we have presented some issues about the computational lexicon SimpleLEX-IT. We described the algorithm used to build the lexicon, three SQL resources produced as side effects of the lexicon building and a work in progress about the extraction of syntactic information from UD-IT.</p><p>All the resources described in this paper can be downloaded at https://github.com/ alexmazzei/SimpleLEX-IT.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head></head><label></label><figDesc>• the table lemmata is formed by 3 attributes: a lemma, its PoS and its ID (integer). This table contains 34, 725 records. A number of lemmata belonging to the original version of Morph-it! have been excluded in our conversion: proper nouns, emoticons and cardinals beginning with a digit (e.g. 15mila). • the tables det demo table, pro demo table,</figDesc><table /><note>pronou table are used to collect inflected form of demonstrative determiners (116 records, 4 attributes: ID word, form, ID lemma, number, gender), demonstrative pronouns (95 records, 5 attributes: ID word, form, ID lemma, number, gender), personal pronouns (63 records, 7 attributes: ID word, form, ID lemma, person, number, gender, clitics). • the tables adv table, adj table, nou table, ver table are used to collect inflected form of adverbs (1, 594 records, 3 attributes: ID word, form, ID lemma), adjectives (72, 367 records, 6 attributes: ID word, form, ID lemma, kind, number, gender), nouns (35, 618 records, 5 attributes: ID word, form, ID lemma, number, gender) and verbs (392, 139, 8 attributes: records: ID word, form, ID lemma, mode, time, person, number, gender) respectively.</note></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 1 :</head><label>1</label><figDesc>foreach adverb ∈ Morph-it! ∩ VdB-IT do Add the adverb in normal form into SimpleLEX-IT end foreach adjective ∈ Morph-it! ∩ VdB-IT do Add the adjective in normal form (masculine-singular) and in feminine-singular, masculine-plural, feminine-plural forms, into SimpleLEX-IT end foreach noun ∈ Morph-it! ∩ VdB-IT do Add the noun in normal form (singular), the plural form, and the gender into SimpleLEX-IT end foreach verb ∈ Morph-it! ∩ VdB-IT do if verb ∈ VerIrr then Number of adverbs, adjectives, nouns and verbs in SimpleLEX-IT.</figDesc><table><row><cell cols="3">Add all the inflections for the indicativo presente,</cell></row><row><cell cols="3">congiuntivo presente, futuro semplice, condizionale,</cell></row><row><cell cols="3">imperfetto, participio passato, passato remoto into</cell></row><row><cell>SimpleLEX-IT</cell><cell></cell><cell></cell></row><row><cell>else</cell><cell></cell><cell></cell></row><row><cell>if verb ∈ VerInc then</cell><cell></cell><cell></cell></row><row><cell cols="3">Set active the incoativo feature in the entry</cell></row><row><cell>end</cell><cell></cell><cell></cell></row><row><cell cols="3">if the verb is properly reflexive (i.e. "...rsi") then</cell></row><row><cell cols="3">Set active the reflexive feature in the entry</cell></row><row><cell>end</cell><cell></cell><cell></cell></row><row><cell cols="3">Add the verb in normal form into SimpleLEX-IT</cell></row><row><cell>end</cell><cell></cell><cell></cell></row><row><cell>end</cell><cell></cell><cell></cell></row><row><cell cols="3">Algorithm 1: The algorithm for building</cell></row><row><cell cols="3">the adverbs, adjectives, nouns and verbs in</cell></row><row><cell>SimpleLEX-IT</cell><cell></cell><cell></cell></row><row><cell cols="3">has the postfix "rsi", since MorphIT! contains this</cell></row><row><cell cols="3">inflection as its normal form. In Table 1 we re-</cell></row><row><cell cols="3">ported some statistics about SimpleLEX-IT com-</cell></row><row><cell cols="3">position. Most of the lexicon is composed by</cell></row><row><cell cols="3">nouns (58%), followed by verbs (21%), adjectives</cell></row><row><cell cols="2">(19%), and adverbs (2%).</cell><cell></cell></row><row><cell>PoS</cell><cell>Number</cell><cell>%</cell></row><row><cell>Adverb</cell><cell>146</cell><cell>2</cell></row><row><cell>Verb (irr.)</cell><cell>283</cell><cell>4</cell></row><row><cell>Verb (reg.)</cell><cell>1168</cell><cell>17</cell></row><row><cell>Adjective</cell><cell>1333</cell><cell>19</cell></row><row><cell>Noun</cell><cell>4092</cell><cell>58</cell></row><row><cell>Total</cell><cell>7022</cell><cell>100</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">http://parli.di.unito.it/resources_ en.html</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">We used the PostgreSQL database .</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">Morph-IT! is provided with a script that allows for a naive conversion into SQL that use one single table and one single attribute for all the features.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_3">The second edition of the vocabulary has been announced<ref type="bibr" target="#b0">(Chiari and De Mauro, 2014)</ref> and it is going to be released (p.c.).</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_4">https://it.wikipedia.org/wiki/Verbi_ irregolari_italiani</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_5">https://it.wikipedia.org/wiki/Verbi_ incoativi</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">The New Basic Vocabulary of Italian as a linguistic resource</title>
		<author>
			<persName><forename type="first">Isabella</forename><surname>Chiari</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Tullio</forename><surname>De</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mauro</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">1th Italian Conference on Computational Linguistics (CLiC-it)</title>
				<editor>
			<persName><forename type="first">Roberto</forename><surname>Basili</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Alessandro</forename><surname>Lenci</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Bernardo</forename><surname>Magnini</surname></persName>
		</editor>
		<imprint>
			<publisher>Pisa University Press</publisher>
			<date type="published" when="2014-12">2014. December</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="93" to="97" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title level="m" type="main">Guida all&apos;uso delle parole</title>
		<author>
			<persName><forename type="first">Tullio</forename><surname>De</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mauro</forename></persName>
		</author>
		<imprint>
			<date type="published" when="1985">1985</date>
			<publisher>Editori Riuniti</publisher>
		</imprint>
	</monogr>
	<note>Libri di base</note>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Towards a universal wordnet by learning from combined evidence</title>
		<author>
			<persName><forename type="first">Gerard</forename><surname>De</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Melo</forename></persName>
		</author>
		<author>
			<persName><forename type="first">Gerhard</forename><surname>Weikum</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM 2009)</title>
				<meeting>the 18th ACM Conference on Information and Knowledge Management (CIKM 2009)<address><addrLine>New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="513" to="522" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Adapting simplenlg for brazilian portuguese realisation</title>
		<author>
			<persName><forename type="first">Rodrigo</forename><surname>De</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Oliveira</forename></persName>
		</author>
		<author>
			<persName><forename type="first">Somayajulu</forename><surname>Sripada</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of INLG</title>
				<meeting>of INLG</meeting>
		<imprint>
			<date type="published" when="2014">2014. 2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">SimpleNLG: A Realisation Engine for Practical Applications</title>
		<author>
			<persName><forename type="first">Albert</forename><surname>Gatt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ehud</forename><surname>Reiter</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of ENLG 2009, ENLG &apos;09</title>
				<meeting>of ENLG 2009, ENLG &apos;09</meeting>
		<imprint>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">SimpleNLG-IT: adapting Sim-pleNLG to Italian</title>
		<author>
			<persName><forename type="first">Alessandro</forename><surname>Mazzei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Cristina</forename><surname>Battaglino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Cristina</forename><surname>Bosco</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of INLG 2016</title>
				<meeting>of INLG 2016</meeting>
		<imprint>
			<publisher>TO AP-PEAR</publisher>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network</title>
		<author>
			<persName><forename type="first">Roberto</forename><surname>Navigli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Simone</forename><surname>Paolo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ponzetto</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Artificial Intelligence</title>
		<imprint>
			<biblScope unit="volume">193</biblScope>
			<biblScope unit="page" from="217" to="250" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Universal Dependencies v1:A Multilingual Treebank Collection</title>
		<author>
			<persName><forename type="first">Joakim</forename><surname>Nivre</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Marie-Catherine</forename><surname>De Marneffe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Filip</forename><surname>Ginter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yoav</forename><surname>Goldberg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jan</forename><surname>Hajic</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Christopher</forename><forename type="middle">D</forename><surname>Manning</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ryan</forename><surname>Mcdonald</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Slav</forename><surname>Petrov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sampo</forename><surname>Pyysalo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Natalia</forename><surname>Silveira</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Reut</forename><surname>Tsarfaty</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Daniel</forename><surname>Zeman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of LREC&apos;16</title>
				<meeting>of LREC&apos;16</meeting>
		<imprint>
			<publisher>TO APPEAR</publisher>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Grammatica di riferimento dell&apos;italiano contemporaneo</title>
		<author>
			<persName><forename type="first">Giuseppe</forename><surname>Patota</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Guide linguistiche</title>
				<imprint>
			<publisher>Garzanti Linguistica</publisher>
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Multiwordnet: developing an aligned multilingual database</title>
		<author>
			<persName><forename type="first">Emanuele</forename><surname>Pianta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Luisa</forename><surname>Bentivogli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Christian</forename><surname>Girardi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the First International Conference on Global WordNet</title>
				<meeting>the First International Conference on Global WordNet</meeting>
		<imprint>
			<date type="published" when="2002-01">2002. January</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<title level="m" type="main">Building Natural Language Generation Systems</title>
		<author>
			<persName><forename type="first">Ehud</forename><surname>Reiter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Robert</forename><surname>Dale</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2000">2000</date>
			<publisher>Cambridge University Press</publisher>
			<pubPlace>New York, NY, USA</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">The European LE-PAROLE project: the Italian Syntactic Lexicon</title>
		<author>
			<persName><forename type="first">Nilda</forename><surname>Ruimy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ornella</forename><surname>Corazzari</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Elisabetta</forename><surname>Gola</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Antonietta</forename><surname>Spanu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Nicoletta</forename><surname>Calzolari</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Antonio</forename><surname>Zampolli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">; Pierre-Luc</forename><surname>Vaudry</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Guy</forename><surname>Lapalme</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the First International Conference on Language resources and Evaluation</title>
				<meeting>the First International Conference on Language resources and Evaluation</meeting>
		<imprint>
			<date type="published" when="1998">1998. 2013. 2013</date>
			<biblScope unit="page" from="241" to="248" />
		</imprint>
	</monogr>
	<note>Proc. of ENLG</note>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Morph-it! a free corpus-based morphological resource for the italian language</title>
		<author>
			<persName><forename type="first">Eros</forename><surname>Zanchetta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Marco</forename><surname>Baroni</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Corpus Linguistics</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="issue">1</biblScope>
			<date type="published" when="2005">2005. 2005</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
