<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Is Neural Language Model Perplexity Related to Readability?</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Alessio</forename><surname>Miaschi</surname></persName>
							<email>alessio.miaschi@phd.unipi.it</email>
							<affiliation key="aff0">
								<orgName type="department" key="dep1">Department of Computer Science</orgName>
								<orgName type="department" key="dep2">Istituto di Linguistica Computazionale &quot;Antonio Zampolli&quot;</orgName>
								<orgName type="laboratory">ItaliaNLP Lab</orgName>
								<orgName type="institution">University of Pisa</orgName>
								<address>
									<settlement>Pisa</settlement>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Chiara</forename><surname>Alzetta</surname></persName>
							<email>chiara.alzetta@edu.unige.it</email>
							<affiliation key="aff1">
								<orgName type="department">DIBRIS</orgName>
								<orgName type="institution">University of Genoa</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Dominique</forename><surname>Brunato #</surname></persName>
						</author>
						<author>
							<persName><forename type="first">Felice</forename><surname>Dell'orletta #</surname></persName>
						</author>
						<author>
							<persName><forename type="first">Giulia</forename><surname>Venturi</surname></persName>
						</author>
						<title level="a" type="main">Is Neural Language Model Perplexity Related to Readability?</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">6CEE98D041E669AEC1A5E1DEFDE99C03</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-19T15:40+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This paper explores the relationship between Neural Language Model (NLM) perplexity and sentence readability. Starting from the evidence that NLMs implicitly acquire sophisticated linguistic knowledge from a huge amount of training data, our goal is to investigate whether perplexity is affected by linguistic features used to automatically assess sentence readability and if there is a correlation between the two metrics. Our findings suggest that this correlation is actually quite weak and the two metrics are affected by different linguistic phenomena. 1</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction and Motivation</head><p>Standard Neural Language Models (NLMs) are trained to predict the next token given a context of previous tokens. The metric commonly used for assessing the performance of a language model is perplexity, which corresponds to the inverse geometric mean of the joint probability of words w 1 , ..., w n in a held-out test corpus C. While being primarily an intrinsic metric of NLM quality, perplexity has been used in a variety of scenarios, such as to classify between formal and colloquial tweets <ref type="bibr" target="#b15">(González, 2015)</ref>, to detect the boundaries between varieties belonging to the same language family <ref type="bibr" target="#b14">(Gamallo et al., 2017)</ref> or to identify speech samples produced by subjects with cognitive and/or language diseases e.g. dementia <ref type="bibr" target="#b7">(Cohen and Pakhomov, 2020)</ref> or Specific Language Impairment <ref type="bibr" target="#b13">(Gabani et al., 2009)</ref>. From the perspective of computational studies aimed at modeling human language processing, perplexity scores have also been shown to effectively match various human behavioural measures, such as gaze duration during reading <ref type="bibr" target="#b12">(Demberg and Keller, 2008;</ref><ref type="bibr" target="#b16">Goodkind and Bicknell, 2018)</ref>.</p><p>In this paper we focus on a less investigated perspective addressing the connection between perplexity and readability. Since by definition perplexity gives a good approximation of how well a model recognises an unseen piece of text as a plausible one, our intuition is that lower model perplexity should be assigned to easy-to-read sentences, while difficult-to-read ones should obtain higher perplexity. On the other hand, state-ofthe-art NLMs trained on huge data have shown to implicitly learn a sophisticated knowledge of language phenomena, also with respect to complex syntactic properties of sentences <ref type="bibr" target="#b24">(Tenney et al., 2019;</ref><ref type="bibr" target="#b17">Jawahar et al., 2019;</ref><ref type="bibr" target="#b19">Miaschi et al., 2020)</ref>. This could suggest that variations in terms of linguistic complexity, especially when related to subtle morpho-syntactic and syntactic features of sentence rather than lexical ones, could not impact on model perplexity to a great extent. This assumption seems to be confirmed by the (still unpublished) results by <ref type="bibr" target="#b18">Martinc et al. (2019)</ref> which, to our knowledge, is the only one explicitly leveraging unsupervised neural language model predictions in the context of readability assessment. According to this study, a NLM is even less perplexed by articles addressed at adults than by documents conceived for a younger readership. From a relatively different perspective focused on the ability of automatic comprehension systems to solve cloze tests, <ref type="bibr">Benzahra and Yvon (2019)</ref> showed that NLMs performance is not affected by the level of text complexity.</p><p>In order to test the validity of all these hypotheses, we rely on the perplexity score given by a state-of-the-art NLM for the Italian language to several datasets representative of different textual genres containing both easy-and complexto-read sentences: ideally, such datasets should emphasise the correlation between perplexity and readability (if present) since the corpora are explicitly designed to contain both simple and difficult examples.</p><p>Contributions We inspect whether and to which extent it is possible to find a relationship between a readability score and the perplexity of a NLM. To this aim we investigate (i) if the perplexity of a NLM and the readability score of a set of sentences show a significant correlation and (ii) whether the two metrics are equally affected by the same set of linguistic phenomena that occur in the sentence.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Experimental Design</head><p>According to our research questions, we devised a set of experiments to study whether NLMs perplexity reflects the level of readability of a sentence and which are the linguistic phenomena mostly involved in each metric. For this purpose, we firstly investigated whether sentence-level perplexity scores computed with one of the most prominent NLM model correlate with the scores assigned to the same sentences by a supervised readability assessment tool. Secondly, we investigated which are the linguistic features of the considered sentences that correlate in a statistically significant way with the perplexity and readability score respectively. In order to verify whether correlations hold across different typology of texts, we tested our approach on five Italian datasets.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Models</head><p>READ-IT. Automatic readability (henceforth ARA) was assessed using READ-IT <ref type="bibr" target="#b10">(Dell'Orletta et al., 2011)</ref> the first readability assessment tool for Italian which combines traditional raw text features with lexical, morpho-syntactic and syntactic information extracted from automatically parsed documents. In READ-IT, analysis of readability is modelled as a binary classification task, based on Support Vector Machines using LIBSVM <ref type="bibr" target="#b5">(Chang and Lin, 2001)</ref>. Training corpora are representative of two classes of texts, i.e. difficult-vs. easyto-read ones, both containing newspaper articles. The set of features exploited for predicting readability has been proved to capture different aspects of sentence complexity. Thus, the assigned readability score ranges between 0 (easy-to-read) and 1 (difficult-to-read) referring to the percentage probability for unseen documents or sentences to be-long to the class of difficult-to-read documents. For the purposes of our work, we carried out readability assessment at sentence level, making the analysis reliable for the comparison with sentencebased perplexity of a NLM. GePpeTto. Sentence-level perplexity scores were computed relying on GePpeTto <ref type="bibr" target="#b8">(De Mattei et al., 2020)</ref>. GePpeTto is a generative language model trained on the Italian language and built using the GPT-2 architecture <ref type="bibr" target="#b20">(Radford et al., 2019)</ref>. The model was trained on a dump of Italian Wikipedia (2.8GB) and on the itWac corpus <ref type="bibr">(Baroni et al., 2009)</ref>, which amounts to 11GB of web texts. The perplexity (PPL) of the model was computed as follows:</p><formula xml:id="formula_0">P P L = e ( N LL N )</formula><p>where N N L and N correspond respectively to the negative log-likelihood and to the length of each sentence w 1:n = [w 1 , ..., w n ] in the datasets.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Corpora</head><p>In order to test the reliability of our initial hypothesis, we chose four corpora containing different typologies of texts, i.e. web pages, educational materials, narrative texts, newspaper and scientific articles. Each corpus includes a balanced amount of difficult-and easy-to-read sentence. In addition, we also considered in the analysis the Italian Universal Dependency treebank. This is meant to verify whether the connection between sentencelevel readability and perplexity also holds in a well-acknowledged benchmark corpus. For each of them, we excluded from our analysis short sentences, i.e. having less than 5 tokens. PACCSS-IT<ref type="foot" target="#foot_2">2</ref>  <ref type="bibr" target="#b2">(Brunato et al., 2016)</ref>: we took into account 125,977 sentences belonging to PACCSS-IT, a corpus of complex-simple aligned sentences extracted from the ItWaC corpus. The resource was build using an automatic approach for acquiring large corpora of paired sentences able to intercept structural transformations (such as deletion, reordering, etc.). For example, the two following sentences represent a pair in the corpus, where a reordering operation occurs at phrase level (i.e. the subordinate clause proceeds vs. follows the main clause):</p><p>• Complex: Ringraziandola per la sua cortese attenzione, resto in attesa di risposta. [Lit:</p><p>Thanking you for your kind attention, I look forward to your answer.]</p><p>• Simple: Resto in attesa di una risposta e ringrazio vivamente per l'attenzione. [Lit: I look forward to your answer and I thank you greatly for your attention.]</p><p>Terence and Teacher 3 <ref type="bibr" target="#b1">(Brunato et al., 2015)</ref>: two corpora of original and manually simplified texts aligned at sentence level. Terence contains short Italian novels for children and their manually simplified version carried out by linguists and psycholinguists targeting children with text comprehension difficulties. Teacher is a corpus of pairs of documents belonging to different genres (e.g. literature, handbooks) used in educational settings manually simplified by teachers. We exploited 1,644 sentences belonging to these corpora.</p><p>Multi-Genre Multi-Type Italian corpus: a collection of Italian texts representative of three traditional textual genres: Journalism, Scientific prose and Narrative. Each genre has been internally subdivided into two sub-corpora representative of an easy-vs difficult-to-read variety, which was defined according to the intended target audience for a given genre. The journalistic prose corpus includes articles automatically downloaded from the online versions of two general-purpose newspapers 4 , while the "easy" sub-corpus contains articles from two easy-to-read newspapers 5 addressed to adults with low literacy skills or mild intellectual disabilities. The scientific prose collection consists of scholarly publications on linguistics and computational linguistics and Wikipedia pages downloaded from the portal "Linguistics", representative of the complex and easy variety respectively. For the narrative genre, we included long novels written by novelists of the last century and contemporary writers in the corpora of complex variety, while for the easy variety we collected short novels for children. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Sentence Perplexity and Readability</head><p>Our analysis starts from a comparison between the average perplexity and readability scores obtained for each sentence of the five considered datasets. As shown in Table <ref type="table" target="#tab_1">1</ref>, readability values (column ARA) are quite homogeneous across the datasets, with low standard deviation values. On the contrary, the range of perplexity scores is wider (column PPL), going from an average score of 3,905.83 of PACCSS-IT to 436.75 of the IUDT miscellaneous portion (Italian UD). These differences seem to provide a first evidence that perplexity and readability are not correlate to each other. This intuition has been proved computing the Spearman's rank correlation coefficient between the perplexity and readability scores for each dataset. Results are reported in Table <ref type="table">2</ref>, column PPL-ARA. As it can be seen, all correlation rates are significant, except for the result obtained on the Terence and Teacher corpus, possibly due to the fact that the size of the corpus is too small to allow a significant comparison. Contrary to our expectations, no correlation was detected between the two metrics for all corpora, suggesting that perplexity and and readability are independent from each other.</p><p>To further investigate the reasons behind these scores and to deepen the analysis about the relationship between the two metrics, we investigated whether they capture the same (or similar) linguistic properties of the sentences. To this aim, we tested the presence and strength of the correlation between each of the two metrics and a set of 176 linguistic features, which have been shown to capture properties of sentence complex- ity <ref type="bibr" target="#b3">(Brunato et al., 2018)</ref>. In particular, this analysis is based on the set of features described in <ref type="bibr" target="#b4">Brunato et al. (2020)</ref>, which are acquired from raw, morpho-syntactic and syntactic levels of annotation. They range from basic information on the average sentence and word length, to lexical information about the internal composition of the vocabulary of the text (e.g. the distribution of lemmas belonging to the Basic Italian Vocabulary <ref type="bibr" target="#b9">(De Mauro, 2000)</ref>). They also include morphosyntactic information (e.g. POS distribution and of inflectional properties of verbs) and more complex aspects of sentence structure derived from syntactic annotation and modeling global and local properties of parsed tree structure, e.g. the relative order of subjects and objects with respect to the verb, the use of subordination. In order to extract these features, the considered corpora were morpho-syntactically annotated and dependency parsed by the UDPipe pipeline <ref type="bibr" target="#b23">(Straka et al., 2016)</ref>, with the exception of the IUDT corpus.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Dataset</head><p>Column Feats of Table <ref type="table">2</ref> illustrates the results of this analysis: we report the Spearman's correlation coefficients between the two rankings of linguistic features, each ordered by strength of correlation between feature value and perplexity score and readability score respectively. Once again we observe rather weak correlation values, with the only exception of Italian-UD which is the only one reporting a medium correlation (.332). Overall, these results corroborate our previous findings that the two metrics are not particularly related with each other, and they further suggest that the linguistic phenomena affecting the perplexity of NLM and the readability level of a sentence are very different. Consider for example the two following sentences:</p><p>(1) Il furto è avvenuto giovedì notte.</p><p>The theft has taken place Thursday night.</p><p>(2) Il comitato di bioetica: no all'eutanasia.</p><p>The bioethics committee: no to euthanasia.</p><p>While ( <ref type="formula">1</ref>) is very easy-to-read, with a readability score of 0.25, but it has a quite high perplexity score, i.e. 40,737.81, ( <ref type="formula">2</ref>) is quite difficult-to-read (ARA=1) but is has a very low perplexity score (PPL=11.24).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">In-Depth Linguistic Investigation</head><p>To better explore the motivation behind these results, we performed an in-depth investigation aimed at understating the relationship between our set of linguistic features and the two metrics taken into consideration. Since we noticed that for all datasets a higher number of features correlates with ARA than with PPL, we selected those that are significantly correlated with the two metrics. The number of shared features varies for each dataset, depending on their size. For example, for the two smallest ones, i.e. Terence and Teacher and the UD Twitter Treebank, we could only consider 34.65% (61) and 44.88% (79) of the whole set of features respectively, while for the larger corpora the sub-set is wider: 81.81% (144) in PACCSS-IT, 78.97% (139) for Multi-Genre Multi-Type and 84.65% (149) for the IUD Treebank.</p><p>Table <ref type="table" target="#tab_3">3</ref> shows the top ten features for each dataset, i.e. those that obtained the strongest correlation with both PPL and ARA. As expected, correlations are generally stronger between linguistic features and readability scores, although they are lower than expected. This could be due to the fact that, even if the READ-IT classifier is trained with a similar set of features, the non-linear feature space makes it difficult to identify clear correlations with individual features. Similarly, our set of features seem to play only a marginal role on perplexity. However, this is not the case of the PACCSS-IT corpus, for which the set of considered linguistic features have an higher correlation with PPL. This can be possibly related to the partial overlap between the GePpeTto training data and the PACCSS-IT sentences, since the latter is drawn from the ItWac corpus which is included in the GePpeTto's training.</p><p>Inspecting these results, we can also observe that correlations between features and PPL seem to be more affected by genre-specific characteristics. This is particularly clear if we consider the Italian UD Twitter treebank, for which among the top ten most correlated features we find some of them characterising social media language, e.g. symbols (upos-xpos dist SYM) or the vocative relation, which marks a dialogue participant addressed in a text along with the specification, specifically used for Twitter @-mentions (dep dist vocative:mention).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Conclusion</head><p>The paper presented a study aimed at investigating the relationship between two metrics computed at sentence-level, i.e. perplexity of a state-of-the-art NLM for the Italian language and readability score automatically assigned to a sentence by a supervised classifier. We carried out our analysis considering several datasets differing at the level of textual genre and language variety. Specifically, we observed that comparing the rankings obtained using the two metrics we cannot find any significant correlation, either between the scores of the two metrics or with respect to the set of linguistic features that mostly impact their values. Further investigation within this line of research will explore whether we can draw the same observations when a different NLM is exploited to compute sentence perplexity.  </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>References</head></div><figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 1 :</head><label>1</label><figDesc>Perplexity (PPL)  and Readability (ARA) mean and standard deviation values for the 5 datasets.</figDesc><table><row><cell>PPL</cell><cell>ARA</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 3 :</head><label>3</label><figDesc>Top 10 features along with their correlation scores between perplexity and readability. cessed web-crawled corpora. Language resources and evaluation, 43(3):209-226. Marc Benzahra and Franc ¸ois Yvon. 2019. Measuring text readability with machine comprehension: a pilot study. In Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications, pages 412-422, Florence, Italy, August. Association for Computational Linguistics.</figDesc><table /><note>C. Bosco, S. Montemagni, and M. Simi. 2013. Converting italian treebanks: Towards an italian stanford dependency treebank. In Proceedings of the ACL</note></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">Copyright c</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2020" xml:id="foot_1">for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_2">http://www.italianlp.it/resources/paccss-it-parallelcorpus-of-complex-simple-sentences-for-italian/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_3">https://github.com/UniversalDependencies/UD Italian-ISDT</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="7" xml:id="foot_4">7 https://github.com/UniversalDependencies/UD Italian-</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="8" xml:id="foot_5">PoSTWITA 8 https://universaldependencies.org/treebanks/it twittiro</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">Linguistic Annotation Workshop &amp; Interoperability with Discourse</title>
		<imprint>
			<date>August</date>
			<pubPlace>Sofia, Bulgaria</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Design and annotation of the first italian corpus for text simplification</title>
		<author>
			<persName><forename type="first">Dominique</forename><surname>Brunato</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Felice</forename><surname>Dell'orletta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Giulia</forename><surname>Venturi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Simonetta</forename><surname>Montemagni</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of The 9th Linguistic Annotation Workshop</title>
				<meeting>The 9th Linguistic Annotation Workshop</meeting>
		<imprint>
			<date type="published" when="2015">2015</date>
			<biblScope unit="page" from="31" to="41" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">PaCCSS-IT: A parallel corpus of complex-simple sentences for automatic text simplification</title>
		<author>
			<persName><forename type="first">Dominique</forename><surname>Brunato</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Andrea</forename><surname>Cimino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Felice</forename><surname>Dell'orletta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Giulia</forename><surname>Venturi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing</title>
				<meeting>the 2016 Conference on Empirical Methods in Natural Language Processing<address><addrLine>Austin, Texas</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2016-11">2016. November</date>
			<biblScope unit="page" from="351" to="361" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Is this sentence difficult? do you agree?</title>
		<author>
			<persName><forename type="first">Dominique</forename><surname>Brunato</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Lorenzo</forename><forename type="middle">De</forename><surname>Mattei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Felice</forename><surname>Dell'orletta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Benedetta</forename><surname>Iavarone</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Giulia</forename><surname>Venturi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing</title>
				<meeting>the 2018 Conference on Empirical Methods in Natural Language Processing<address><addrLine>Brussels, Belgium</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2018-10">2018. October-November</date>
			<biblScope unit="page" from="2690" to="2699" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Profiling-UD: a tool for linguistic profiling of texts</title>
		<author>
			<persName><forename type="first">Dominique</forename><surname>Brunato</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Andrea</forename><surname>Cimino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Felice</forename><surname>Dell'orletta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Giulia</forename><surname>Venturi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Simonetta</forename><surname>Montemagni</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of The 12th Language Resources and Evaluation Conference</title>
				<meeting>The 12th Language Resources and Evaluation Conference<address><addrLine>Marseille, France</addrLine></address></meeting>
		<imprint>
			<publisher>European Language Resources Association</publisher>
			<date type="published" when="2020-05">2020. May</date>
			<biblScope unit="page" from="7145" to="7151" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<title level="m" type="main">LIB-SVM: a library for support vector machines</title>
		<author>
			<persName><forename type="first">Chih-Chung</forename><surname>Chang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Chih-Jen</forename><surname>Lin</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2001">2001</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Presenting TWITTIR Ò-UD: An italian twitter treebank in universal dependencies</title>
		<author>
			<persName><forename type="first">Alessandra</forename><surname>Teresa Cignarella</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Cristina</forename><surname>Bosco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Paolo</forename><surname>Rosso</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Fifth International Conference on Dependency Linguistics (Depling</title>
				<meeting>the Fifth International Conference on Dependency Linguistics (Depling<address><addrLine>Syn-taxFest</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019">2019. 2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">A tale of two perplexities: Sensitivity of neural language models to lexical retrieval deficits in dementia of the Alzheimer&apos;s type</title>
		<author>
			<persName><forename type="first">Trevor</forename><surname>Cohen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Serguei</forename><surname>Pakhomov</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics</title>
				<meeting>the 58th Annual Meeting of the Association for Computational Linguistics</meeting>
		<imprint>
			<date type="published" when="2020-07">2020. July</date>
			<biblScope unit="page" from="1946" to="1957" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<title level="m" type="main">Geppetto carves italian into a language model</title>
		<author>
			<persName><forename type="first">Lorenzo</forename><surname>De Mattei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Michele</forename><surname>Cafagna</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Felice</forename><surname>Dell'orletta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Malvina</forename><surname>Nissim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Marco</forename><surname>Guerini</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2004.14253</idno>
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<author>
			<persName><forename type="first">Tullio</forename><surname>De</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mauro</forename></persName>
		</author>
		<title level="m">Il dizionario della lingua italiana</title>
				<imprint>
			<publisher>Paravia</publisher>
			<date type="published" when="2000">2000</date>
			<biblScope unit="volume">1</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">READ-IT: Assessing readability of Italian texts with a view to text simplification</title>
		<author>
			<persName><forename type="first">Felice</forename><surname>Dell'orletta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Simonetta</forename><surname>Montemagni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Giulia</forename><surname>Venturi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Second Workshop on Speech and Language Processing for Assistive Technologies</title>
				<meeting>the Second Workshop on Speech and Language Processing for Assistive Technologies<address><addrLine>Edinburgh, Scotland, UK</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2011-07">2011. July</date>
			<biblScope unit="page" from="73" to="83" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">VIT -Venice Italian Treebank: Syntactic and quantitative features</title>
		<author>
			<persName><forename type="first">Rodolfo</forename><surname>Delmonte</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Antonella</forename><surname>Bristot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sara</forename><surname>Tonelli</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Sixth International Workshop on Treebanks and Linguistic Theories</title>
				<meeting>the Sixth International Workshop on Treebanks and Linguistic Theories</meeting>
		<imprint>
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Data from eyetracking corpora as evidence for theories of syntactic processing complexity</title>
		<author>
			<persName><forename type="first">V</forename><surname>Demberg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Frank</forename><surname>Keller</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Cognition</title>
		<imprint>
			<biblScope unit="volume">109</biblScope>
			<biblScope unit="page" from="193" to="210" />
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">A corpus-based approach for the prediction of language impairment in monolingual English and Spanish-English bilingual children</title>
		<author>
			<persName><forename type="first">Keyur</forename><surname>Gabani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Melissa</forename><surname>Sherman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Thamar</forename><surname>Solorio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yang</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Lisa</forename><surname>Bedore</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Elizabeth</forename><surname>Peña</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics</title>
				<meeting>Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics<address><addrLine>Boulder, Colorado</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2009-06">2009. June</date>
			<biblScope unit="page" from="46" to="55" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">A perplexity-based method for similar languages discrimination</title>
		<author>
			<persName><forename type="first">Pablo</forename><surname>Gamallo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jose</forename><forename type="middle">Ramom</forename><surname>Pichel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Iñaki</forename><surname>Alegria</surname></persName>
		</author>
		<ptr target="http://web.science.mq.edu.au/smal-masi/vardial4/index.html" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects</title>
		<title level="s">Association for Computational Linguistics</title>
		<meeting>the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects<address><addrLine>Valencia, Spain</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2017-04-03">2017. April 3, 2017. c c 2017</date>
			<biblScope unit="page" from="109" to="114" />
		</imprint>
	</monogr>
	<note>VarDial2017 workshop at EACL 2017</note>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">An analysis of twitter corpora and the differences between formal and colloquial tweets</title>
		<author>
			<persName><forename type="first">M</forename><surname>González</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">TweetMT@SEPLN</title>
				<imprint>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Predictive power of word surprisal for reading times is a linear function of language model quality</title>
		<author>
			<persName><forename type="first">Adam</forename><surname>Goodkind</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Klinton</forename><surname>Bicknell</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 8th Workshop on Cognitive Modeling and Computational Linguistics, CMCL 2018</title>
				<editor>
			<persName><forename type="first">B</forename><surname>Asad</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Cassandra</forename><surname>Sayeed</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Tal</forename><surname>Jacobs</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Marten</forename><surname>Linzen</surname></persName>
		</editor>
		<editor>
			<persName><surname>Van Schijndel</surname></persName>
		</editor>
		<meeting>the 8th Workshop on Cognitive Modeling and Computational Linguistics, CMCL 2018<address><addrLine>Salt Lake City, Utah, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2018-01-07">2018. January 7, 2018</date>
			<biblScope unit="page" from="10" to="18" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">What does bert learn about the structure of language?</title>
		<author>
			<persName><forename type="first">Ganesh</forename><surname>Jawahar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Benoît</forename><surname>Sagot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Djamé</forename><surname>Seddah</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Samuel</forename><surname>Unicomb</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Gerardo</forename><surname>Iñiguez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Márton</forename><surname>Karsai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yannick</forename><surname>Léo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Márton</forename><surname>Karsai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Carlos</forename><surname>Sarraute</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Éric</forename><surname>Fleury</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">57th Annual Meeting of the Association for Computational Linguistics (ACL)</title>
				<meeting><address><addrLine>Florence, Italy</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Supervised and unsupervised neural approaches to text readability</title>
		<author>
			<persName><forename type="first">Matej</forename><surname>Martinc</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Senja</forename><surname>Pollak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Marko Rob-Niksikonja</forename></persName>
		</author>
		<idno type="arXiv">arXiv:1503.06733</idno>
	</analytic>
	<monogr>
		<title level="m">Computing Research Repository</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
	<note>Version 2</note>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<title level="m" type="main">Linguistic profiling of a neural language model</title>
		<author>
			<persName><forename type="first">Alessio</forename><surname>Miaschi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dominique</forename><surname>Brunato</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Felice</forename><surname>Dell'orletta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Giulia</forename><surname>Venturi</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2010.01869</idno>
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<title level="m" type="main">Language models are unsupervised multitask learners</title>
		<author>
			<persName><forename type="first">Alec</forename><surname>Radford</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jeffrey</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Rewon</forename><surname>Child</surname></persName>
		</author>
		<author>
			<persName><forename type="first">David</forename><surname>Luan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dario</forename><surname>Amodei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ilya</forename><surname>Sutskever</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Part-TUT: The turin university parallel treebank</title>
		<author>
			<persName><forename type="first">Manuela</forename><surname>Sanguinetti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Cristina</forename><surname>Bosco</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Harmonization and Development of Re-sources and Tools for Italian Natural Language Processing within the PARLI Project</title>
				<editor>
			<persName><forename type="first">Roberto</forename><surname>Basili</surname></persName>
		</editor>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2015">2015</date>
			<biblScope unit="page" from="51" to="69" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">PoSTWITA-UD: an Italian Twitter Treebank in universal dependencies</title>
		<author>
			<persName><forename type="first">Manuela</forename><surname>Sanguinetti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Cristina</forename><surname>Bosco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Alberto</forename><surname>Lavelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Alessandro</forename><surname>Mazzei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Fabio</forename><surname>Tamburini</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Eleventh Language Resources and Evaluation Conference</title>
				<meeting>the Eleventh Language Resources and Evaluation Conference</meeting>
		<imprint>
			<publisher>LREC</publisher>
			<date type="published" when="2018">2018. 2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">UD-Pipe: Trainable pipeline for processing CoNLL-U files performing tokenization, morphological analysis, pos tagging and parsing</title>
		<author>
			<persName><forename type="first">M</forename><surname>Straka</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Hajic</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Strakova</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC)</title>
				<meeting>the Tenth International Conference on Language Resources and Evaluation (LREC)</meeting>
		<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">BERT rediscovers the classical NLP pipeline</title>
		<author>
			<persName><forename type="first">Ian</forename><surname>Tenney</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dipanjan</forename><surname>Das</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ellie</forename><surname>Pavlick</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics</title>
				<meeting>the 57th Annual Meeting of the Association for Computational Linguistics<address><addrLine>Florence, Italy</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2019-07">2019. July</date>
			<biblScope unit="page" from="4593" to="4601" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Universal dependencies 2.5</title>
		<author>
			<persName><forename type="first">Joakim</forename><surname>Daniel Zeman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mitchell</forename><surname>Nivre</surname></persName>
		</author>
		<author>
			<persName><surname>Abrams</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics ( ÚFAL)</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
