<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="it">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">LatInfLexi: an Inflected Lexicon of Latin Verbs</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Matteo</forename><surname>Pellegrini</surname></persName>
							<email>matteo.pellegrini@unibg.it</email>
							<affiliation key="aff0">
								<orgName type="institution">Università di Bergamo/Pavia</orgName>
								<address>
									<addrLine>Piazza Rosate</addrLine>
									<postCode>2 -24129</postCode>
									<settlement>Bergamo</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Marco</forename><surname>Passarotti</surname></persName>
							<email>marco.passarotti@unicatt.it</email>
							<affiliation key="aff1">
								<orgName type="department">CIRCSE Research Centre</orgName>
								<orgName type="institution">Università Cattolica del Sacro Cuore Largo Gemelli</orgName>
								<address>
									<postCode>1 -20123</postCode>
									<settlement>Milan</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">LatInfLexi: an Inflected Lexicon of Latin Verbs</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">A25834205071BE71C6CE8B84BB5257AF</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T04:35+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>English. We present a paradigm-based inflected lexicon of Latin verbs built to provide empirical evidence supporting an entropybased estimation of the degree of uncertainty in inflectional paradigms. The lexicon contains information on the inflected forms that occupy the 254 morphologically possible paradigm cells of 3,348 verbal lexemes extracted from a frequency lexicon of Latin. The resource also includes annotation of vowel length and the frequency of each form in different epochs.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Italiano. Presentiamo un lessico di forme flesse basato sui paradigmi per i verbi latini, costruito per fornire evidenza empirica che permetta di quantificare il grado di incertezza nei paradigmi flessivi tramite l'entropia.</head><p>Il lessico contiene informazioni sulle forme flesse che occupano le 254 celle possibili dal punto di vista morfologico di 3.348 lessemi verbali estratti da un dizionario frequenziale del latino. La risorsa include anche l'annotazione della lunghezza vocalica e la frequenza di ogni forma in diverse epoche.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="it">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>In this paper, we describe the construction of LatInfLexi, an inflected lexicon of Latin verbs organized in lexemes<ref type="foot" target="#foot_0">1</ref> and paradigm cells.</p><p>In morphological theory, there is a recent trend towards a more realistic modelling of complex inflectional systems: for instance, <ref type="bibr" target="#b0">Ackerman et al. (2009)</ref> and <ref type="bibr" target="#b5">Bonami and Boyé (2014)</ref> propose that the analysis should take a full inflected form as a starting point, without assuming any segmentation a priori. In such approaches, what is investigated is not the construction of forms from smaller units like stems and inflectional endings, but rather their predictability given knowledge of other forms. This can be done by using the information theoretic notion of conditional entropy to estimate the uncertainty in guessing the content of the paradigm cell of a lexeme knowing another inflected form of the same lexeme, by weighting the probability of application of each inflectional pattern based on their type frequency in real data.</p><p>To do so, large-scale inflected lexicons listing all forms of a representative selection of lexemes are needed. Such resources are increasingly being developed for modern languages -see among else <ref type="bibr" target="#b21">Zanchetta and Baroni (2005)</ref> and <ref type="bibr" target="#b9">Calderone et al. (2017)</ref> for <ref type="bibr">Italian, Neme (2013)</ref> for Arabic, <ref type="bibr">Bonami et al. (2014)</ref> and <ref type="bibr" target="#b12">Hathout et al. (2014)</ref> for French. However, to the best of our knowledge, there are no resources of this kind for Latin, although their (semi-)automatic building is made possible by the current availability of several morphological analyzers for Latin, including Words (http://archives.nd.edu/words.html), <ref type="bibr">Lemlat (www.lemlat3.eu)</ref>, Morpheus (https://github.com/tmallon/morpheus), the PROIEL Latin morphology system (https://github.com/mlj/proielaim at a resource suitable for theoretical inquiries, we use the first term as a label in our resource.</p><p>webapp/tree/master/lib/morphology) and LatMor (http://cistern.cis.lmu.de). Our resource was created to fill this gap and to enable a quantitative, entropy-based analysis of Latin verb inflection.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Design</head><p>A distinctive feature of our inflected lexicon is that it is based on lexemes and paradigm cells, rather than on forms. This means that for each lexeme, all the morphologically possible paradigm cells are filled with a form, and not only those forms that are indeed attested in Latin texts are stored in paradigm cells. In this respect, our resource is similar to other recently developed inflected lexicons, like for instance Flexique for French <ref type="bibr">(Bonami et al., 2014)</ref>.</p><p>For each paradigm cell, the following information is provided:</p><formula xml:id="formula_0">(i)</formula><p>the inflected form that occupies the paradigm cell; (ii) a univocal identifier of the lexeme to which it belongs; (iii) the set of its morphological features; (iv) information on the frequency of the form in different epochs.</p><p>As for (i), it should be noted that there is never more than one form per paradigm cell. In cases of overabundance (i.e. cells that are filled by more than one form, cf. <ref type="bibr" target="#b19">Thornton, 2012)</ref>, a choice was made to decide which "cell-mate" <ref type="bibr">(Thornton, 2012: 183</ref>) should be kept, and which one discarded.</p><p>On the other hand, in some cases a paradigm cell could be empty, either because it is defective -like for instance the passive cells of intransitive verbs -or because it is not filled by a synthetic form, but rather it is analytically expressed, by means of a phrase -like for instance, in Latin, the perfective cells of deponent verbs, for which the periphrasis PRF.PTCP<ref type="foot" target="#foot_1">2</ref> + AUX esse 'to be' is used (e.g. PRF.IND.1SG hortātus sum 'I incited'). In both cases, the cell is marked as #DEF# in the resource. This convention is adopted also in Flexique <ref type="bibr">(Bonami et al., 2014</ref><ref type="bibr">(Bonami et al., : 2585))</ref>, and it fits the requirements of the Qumin package for entropy calculations on the predictability of implic-ative relations between inflected forms <ref type="bibr" target="#b4">(Bonami and Beniamine, 2016;</ref><ref type="bibr" target="#b2">Beniamine, 2017)</ref>.</p><p>As for (ii), the identifier corresponds to the citation form of the lexeme, almost always the first-person singular of the present indicative, following the Latin lexicographical and didactical tradition. A diacritic is added in those rare cases where different verbs have the same citation form (see infra, §3.2).</p><p>Regarding (iii), we use the PoS-tags of the Universal Part-of-Speech Tagset by <ref type="bibr">Petrov et al. (2012)</ref> and the morphological features used in Universal Dependencies (http://universaldependencies.org/u/feat /index.html).</p><p>Lastly, the frequency data in (iv) are taken from Tombeur's (1998) Thesaurus Formarum Totius Latinitatis (see infra, §3.3).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Building the Lexicon</head><p>This section details the procedure followed to build the lexicon.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Selecting the Lexemes</head><p>Our first objective is to build an inflected lexicon of Latin featuring all the possible inflected forms of verbs only. To this aim, we include all the verbal entries contained in <ref type="bibr">Delatte et al.'s (1981)</ref> Dictionnaire fréquentiel et Index inverse de la langue latine (henceforth DFILL). This yields a total of 3,348 verbs. In rare cases, more than one entry of DFILL corresponds to one and the same lexeme in our resource. This happens because some verbs are lemmatized twice in DFILL. For instance, for the verb verso two different entries appear in DFILL, using as citation form both the first-person singular of the present active indicative verso and the corresponding morphologically passive form versor. This choice is likely to be motivated by the different semantics of the two verbs, with the first one meaning 'to turn' and the second one meaning 'to remain'. However, in such cases our resource gives priority to collecting into one common inflectional paradigm all the forms that can be assigned to the same lexeme based on their morphological relatedness, rather than separating them in paradigms of different lexemes according to semantic criteria. Therefore, our lexicon includes only one lexeme verso, for which both active and passive forms are listed.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Generating the Forms</head><p>In order to fill all of the paradigm cells of the selected lexemes, we exploit the database of Lemlat <ref type="bibr" target="#b16">(Passarotti et al., 2017)</ref>. For each lexeme, the database of Lemlat contains a list of segments called LES -roughly corresponding to the stems that are used in different subparadigmseach with a corresponding CODLES that provides (among else) information on the inflectional endings that can be attached to a LES. We make use of this information to generate the relevant forms.</p><p>To illustrate the details of the procedure, let's consider the verb rumpo 'to break'. For this verb, the database of Lemlat features the LESs and CODLESs shown in Table <ref type="table" target="#tab_0">1</ref>. The two LESs with CODLES "fe" ("forma eccezionale", 'exceptional form') were discarded, since they are full irregular forms that are stored as such. As for the other LESs, the one with CODLES "v3r" is used to fill all the cells of the present system, by adding the inflectional endings of the conjugation represented by the CODLES (i.e. the 3 rd conjugation). Similarly, the LES with CODLES "v7s" is used to fill the cells of the perfect system. From the remaining LESs, some nominal forms built upon the so-called "third stem" <ref type="bibr" target="#b1">(Aronoff, 1994</ref>) can be derived, namely the supine rupt-um and rupt-ū from the LES with CODLES "n41", the perfect participle rupt-us, -a, -um from the LES with CODLES "n6p1" and the future participle ruptūr-us, -a,um from the LES with CODLES "n6p2".</p><p>This given, our first step is to extract information on the LESs and CODLESs of each lexeme. Since Lemlat is a tool built to analyze rather than produce forms, it contains also several LESs occurring only in irregular and/or rare forms. To avoid the risk of overgeneration, we choose and keep only one LES for each CODLES. The choice is based on lexicographical sources, namely <ref type="bibr" target="#b14">Lewis and Short (1879)</ref> and <ref type="bibr" target="#b11">Glare (1982)</ref>. In these dictionaries, at the very beginning of each verbal entry there is a set of four "principal parts" <ref type="bibr">(Bennett, 1908: 55)</ref>, i.e. exemplary inflected forms from which the whole paradigm of the lexeme can be inferred. We keep only those LESs that correspond to such principal parts, excluding the ones that correspond to more marginal forms that do appear in dictionaries but are given less prominence in the entry. For instance, Lemlat includes two LESs with CODLES "v3r" for the verb dico 'to say': "dic" and "deic". However, in both the lexicographical sources we use, the relevant principal parts are dico and dicere, corresponding to the first LES, while the second one is only mentioned later in the entries as an alternative form. Therefore, the LES selected for our resource is "dic".</p><p>We use the same dictionaries also to manually annotate the vowel length for each LES. This is a necessary enhancement, because in Latin verb inflection there are homographic forms that can be distinguished only based on that, like for instance PRS.ACT.IND.3SG fugit '(s)he flees' vs. PRF.ACT.IND.3SG fūgit '(s)he fleed'.</p><p>Following this process, we fill all the 254 paradigm cells of each of the 3,348 lexemes. However, because of Lemlat's design, for some quite frequent verbs with a highly irregular inflectional paradigm, it was not possible to apply the same procedure, at least for the cells of the present system, which is where most irregularity of the inflectional endings of Latin verbs happens. For the verbs shown in Table <ref type="table">2</ref> and for those derived from them by prefixation (e.g. abeo 'to go away' from verb eo 'to go'), although it was technically possible to adopt a similar approach by using more than one LES for a CODLES, it proved to be faster and practical to manually record the correct forms as such. To each of the 850,392 generated paradigms cells, a univocal lexeme identifier is assigned, which corresponds to the lemma used in Lemlat. In those rare cases where two or more verbs have the same lemma in Lemlat (although they inflect differently), a numeric diacritic is added to make the relevant distinction: for instance, we have volo1 'to fly' and volo2 'to want'.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Lemma</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3">Frequency Data</head><p>Many forms included in the paradigm cells of our lexicon are never attested in Latin texts. In order to make it possible to distinguish between plausible but unattested forms and those indeed occurring in texts, we enhance forms with information on their frequency. This information is taken from Tombeur's (1998) Thesaurus Formarum Totius Latinitatis (henceforth TFTL), where each form is assigned the number of its occurrences in four different epochs, respectively called Antiquitas (from the origins to the end of the 2 nd century A.D.), Aetas Patrum (2 nd century-735 A.D.), Medium Aeuum (736-1499) and Recentior <ref type="bibr">Latinitas (1500</ref><ref type="bibr">Latinitas ( -1965))</ref>.</p><p>By including the frequency of each form in the lexicon, we know how many of the 752,537<ref type="foot" target="#foot_2">3</ref> forms recorded in the lexicon are never actually attested. Table <ref type="table">3</ref>   <ref type="table">3</ref>: not attested forms It can be observed that a significant amount of forms recorded in our lexicon are not attested, even in such a large corpus as the one the TFTL is based on. However, this is not surprising: recent large-scale corpus-based investigations (e.g. <ref type="bibr">Bonami and Beniamine, 2016: 158 ff.)</ref> show that 4 Discussion and Future Work</p><p>We described the design and building of a lexeme-based inflected lexicon consisting of 850,392 paradigm cells of 3,348 Latin verbs. Our first objective in the near future is to make the resource complete in terms of lexical coverage, including the lexemes of the other PoS. The lexicon is available for download as a .csv file at https://github.com/matteopellegrini/LatInfLexi.</p><p>We also plan to include phonetic annotation, by giving the IPA transcription of each form, which can be obtained semi-automatically by applying a script provided by the Classical Language Toolkit <ref type="bibr">(Johnson et al., 2014-17)</ref> to stems and endings.</p><p>Another welcome addition would be to account for cases of overabundance, by allowing more than one form to appear in the same paradigm cell. However, to decide which cell-mates to keep and which ones to discard, their frequency in Latin texts should be preliminarily evaluated. In this respect, it has to be noted that the frequencies in the TFTL refer to bare surface forms, with no contextual disambiguation. For instance, the frequency of veniam comprises not only occurrences of both the PRS.ACT.SBJV.1SG and FUT.ACT.IND.1SG of the verb venio 'to come', but also of the ACC.SG of the noun venia 'indulgence'.</p><p>To get an idea of the impact of morphological ambiguity on our lexicon, we analyzed all the generated forms with Lemlat (version 3.0). We found that only for about 23% (170,735) of the 752,537 forms Lemlat outputs only one analysis (i.e. one lemma and one set of morphological features), the remaining 581,802 (about 77%) being ambiguous. This result weakens the reliability of the frequency data provided in the lexicon. Therefore, disambiguation is needed, although this would require a very time-consuming work.</p><p>However, to tackle the problem of ambiguity, a first useful step is distinguishing between cases like veniam above, which can be analyzed as an inflected form of two different lemmas, and cases where the different analyses only refer to different forms of the same lemma, e.g. laudatis, that appears both in the PRS.ACT.IND.2PL and in the PRF.PTCP.DAT/ABL.PL of laudo 'to praise', but cannot be a form of other lemmas. We call these different types 'exolemmatic <ref type="bibr">' and 'endolemmatic' ambiguity, respectively (cf. Passarotti and Ruffolo, 2004)</ref>. Cases of exolemmatic ambiguity are clearly more problematic, but they are also much rarer: only 79,490 (about 10%) of the forms in our resource belong to this type. The great majority of ambiguous forms only give rise to endolemmatic ambiguity, as can be observed in Table <ref type="table">4</ref>  As far as endolemmatic ambiguity is concerned, although its quantitative impact is far greater, it could be considerably reduced in a principled manner. Indeed, it should be noted that in many cases this kind of ambiguity is due to systematic syncretism. For instance, the cells FUT.ACT.IMP.2SG and FUT.ACT.IMP.3SG are never unambiguously analyzed, because they are always identical for a same verb. Given the full systematicity of this syncretism, which holds for all lexemes, these cells could be considered as only one from a purely morphological point of view. Therefore, the problem of endolemmatic ambiguity could be at least reduced by adopting an approach based on "morphomic paradigms" <ref type="bibr">(Boyé and Schalchli, 2016)</ref>, where always syncretic cells are conflated, rather than on morphosyntactic paradigms. This would be helpful especially in nominal forms like participles and gerundives, where such cases of systematic syncretism are widespread.</p><p>When such ambiguity issues will have been resolved, it will also be possible to exploit the frequency data in a more systematic fashion, e.g. to perform diachronic investigations on how the frequency of specific (groups of) forms or paradigm cells change across the four considered epochs, or to model Latin inflectional morphology in an even more realistic way, by considering also the token frequency of inflected forms, as has been recently proposed by <ref type="bibr" target="#b7">Boyé (2016)</ref>.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 :</head><label>1</label><figDesc>the verb rumpo in Lemlat 3.0</figDesc><table><row><cell>LES</cell><cell>CODLES</cell></row><row><cell>rump</cell><cell>v3r</cell></row><row><cell cols="2">rumpisse fe</cell></row><row><cell>rup</cell><cell>v7s</cell></row><row><cell>rupsit</cell><cell>fe</cell></row><row><cell>rupt</cell><cell>n41</cell></row><row><cell>rupt</cell><cell>n6p1</cell></row><row><cell>ruptur</cell><cell>n6p2</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head></head><label></label><figDesc>reports the relevant data 4 .</figDesc><table><row><cell>TFTL epoch</cell><cell>unattested forms (%)</cell></row><row><cell>Antiquitas</cell><cell>544,395 (72.34%)</cell></row><row><cell>Aetas Patrum</cell><cell>482,324 (64.1%)</cell></row><row><cell>Medium Aeuum</cell><cell>484,421 (64.37%)</cell></row><row><cell>Recentior Latinitas</cell><cell>640,552 (85.12%)</cell></row><row><cell>all epochs</cell><cell>401,690 (53.38%)</cell></row><row><cell>Table</cell><cell></cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">The term "lexeme" is used for the abstract theoretical concept normally adopted in morphology and lexicology, while "lemma" refers to the concrete citation form representing an entry in dictionaries. Since we</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">Throughout the paper, we will refer to grammatical features by using the standard abbreviations of the Leipzig Glossing Rules.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">The 97,855 paradigm cells marked as #DEF# are excluded from this count.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_3">In total, the TFTL includes</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="554" xml:id="foot_4">,828 different forms, corresponding to 62,922,781 occurrences in the reference corpus used by the Thesaurus. Our lexicon contains 165,898 of these unique forms (forms appearing in more than one paradigm cell are counted only once), for a total of 18,261,179 occurrences. This means that our resource covers around 30% of the forms of the TFTL, in terms of both type and token frequency. In addition, it also contains several other forms that are not attested in the TFTL (245,623 unique forms).in languages with large inflectional paradigmslike the ones of Latin verbs -it is perfectly normal that many plausible forms do not appear, even in very large datasets, and the lexemes for which the full paradigm is attested are very few.</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Parts and wholes: Implicative patterns in inflectional paradigms</title>
		<author>
			<persName><forename type="first">James</forename><forename type="middle">P</forename><surname>Farrell Ackerman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Robert</forename><surname>Blevins</surname></persName>
		</author>
		<author>
			<persName><surname>Malouf</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Analogy in Grammar: Form and Acquisition</title>
				<editor>
			<persName><forename type="first">James</forename><forename type="middle">P</forename><surname>Blevins</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Juliette</forename><surname>Blevins</surname></persName>
		</editor>
		<meeting><address><addrLine>Oxford</addrLine></address></meeting>
		<imprint>
			<publisher>Oxford University Press</publisher>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="54" to="82" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title level="m" type="main">Morphology by itself: Stems and inflectional classes</title>
		<author>
			<persName><forename type="first">Mark</forename><surname>Aronoff</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1994">1994</date>
			<publisher>MIT Press</publisher>
			<pubPlace>Cambridge/London</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Un algorithme universel pour l&apos;abstraction automatique d&apos;alternances morphophonologiques</title>
		<author>
			<persName><forename type="first">Sacha</forename><surname>Beniamine</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="s">Conférence sur le Traitement Automatique des Langues Naturelles</title>
		<imprint>
			<date type="published" when="2017">2017</date>
			<publisher>TALN</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">New Latin Grammar</title>
		<author>
			<persName><forename type="first">Charles</forename><surname>Edwin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bennett</forename></persName>
		</author>
		<imprint>
			<date type="published" when="1908">1908</date>
			<publisher>Bolchazy-Carducci Publishers</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Joint predictiveness in inflectional paradigms</title>
		<author>
			<persName><forename type="first">Olivier</forename><surname>Bonami</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sarah</forename><surname>Beniamine</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Word Structure</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="156" to="182" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<author>
			<persName><forename type="first">Olivier</forename><surname>Bonami</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Gilles</forename><surname>Boyé</surname></persName>
		</author>
		<title level="m">Foisonnements morphologiques: Études en hommage à Françoise Kerleroux</title>
				<editor>
			<persName><forename type="first">Florence</forename><surname>Villoing</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Sophie</forename><surname>David</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Sarah</forename><surname>Leroy</surname></persName>
		</editor>
		<meeting><address><addrLine>Paris Ouest, Paris</addrLine></address></meeting>
		<imprint>
			<publisher>Presses universitaires de</publisher>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page" from="17" to="45" />
		</imprint>
	</monogr>
	<note>De formes en thèmes</note>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Construction d&apos;un lexique flexionnel phonétisé libre du français</title>
		<author>
			<persName><forename type="first">Olivier</forename><surname>Bonami</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Gauthier</forename><surname>Caron</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Clément</forename><surname>Plancq</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Actes du quatrième congrès mondial de linguistique française</title>
				<editor>
			<persName><forename type="first">Franck</forename><surname>Neveu</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Peter</forename><surname>Blumenthal</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Linda</forename><surname>Hriba</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Annette</forename><surname>Gerstenberg</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Judith</forename><surname>Meinschaefer</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Sophie</forename><surname>Prévost</surname></persName>
		</editor>
		<meeting>s du quatrième congrès mondial de linguistique française</meeting>
		<imprint>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page" from="2583" to="2596" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Pour une modélisation surfaciste de la flexion. Le cas de la conjugaison du français</title>
		<author>
			<persName><forename type="first">Gilles</forename><surname>Boyé</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="s">SHS Web of Conferences.</title>
		<imprint>
			<biblScope unit="volume">27</biblScope>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
	<note>EDP Sciences</note>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">The status of paradigms</title>
		<author>
			<persName><forename type="first">Gilles</forename><surname>Boyé</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Gauvain</forename><surname>Shalchli</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The Cambridge Handbook of Morphology</title>
				<editor>
			<persName><forename type="first">Andrew</forename><surname>Hippisley</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Gregory</forename><surname>Stump</surname></persName>
		</editor>
		<meeting><address><addrLine>Cambridge</addrLine></address></meeting>
		<imprint>
			<publisher>Cambridge University Press</publisher>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="206" to="234" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Hybrid method for stress prediction applied to GLAFF-IT, a large-scale Italian lexicon</title>
		<author>
			<persName><forename type="first">Basilio</forename><surname>Calderone</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Matteo</forename><surname>Pascoli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Nabil</forename><surname>Hathout</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Franck</forename><surname>Sajous</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Language, Data and Knowledge</title>
				<meeting><address><addrLine>Cham</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="26" to="41" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<author>
			<persName><forename type="first">Louis</forename><surname>Delatte</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Étienne</forename><surname>Evrard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Suzanne</forename><surname>Govaerts</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Joseph</forename><surname>Denooz</surname></persName>
		</author>
		<title level="m">Dictionnaire fréquentiel et index inverse de la langue latine</title>
				<editor>
			<persName><forename type="first">L</forename><forename type="middle">A S L</forename></persName>
		</editor>
		<imprint>
			<date type="published" when="1981">1981</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">W</forename><surname>Peter</surname></persName>
		</author>
		<author>
			<persName><surname>Glare</surname></persName>
		</author>
		<title level="m">Oxford Latin Dictionary</title>
				<meeting><address><addrLine>Oxford</addrLine></address></meeting>
		<imprint>
			<publisher>Oxford University Press</publisher>
			<date type="published" when="1982">1982</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">GLÀFF, a large versatile French lexicon</title>
		<author>
			<persName><forename type="first">Nabil</forename><surname>Hathout</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Franck</forename><surname>Sajous</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Basilio</forename><surname>Calderone</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC&apos;14)</title>
				<meeting>the Ninth International Conference on Language Resources and Evaluation (LREC&apos;14)</meeting>
		<imprint>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page" from="1007" to="1012" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<title level="m" type="main">CLTK: The Classical Language Toolkit</title>
		<author>
			<persName><forename type="first">Kyle</forename><forename type="middle">P</forename><surname>Johnson</surname></persName>
		</author>
		<idno type="DOI">10.5281/zenodo593336</idno>
		<imprint>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page">2017</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<title level="m" type="main">A Latin Dictionary</title>
		<author>
			<persName><forename type="first">Charlton</forename><surname>Lewis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Charles</forename><surname>Short</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1879">1879</date>
			<pubPlace>Clarendon; Oxford</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">A fully inflected Arabic verb resource constructed from a lexicon of lemmas by using finite-state transducers</title>
		<author>
			<persName><forename type="first">Alexis</forename><surname>Amid</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Neme</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Revue RIST: revue de l&apos;information scientifique et technique</title>
		<imprint>
			<biblScope unit="volume">20</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="7" to="19" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">The Lemlat 3.0 Package for Morphological Analysis of Latin</title>
		<author>
			<persName><forename type="first">Marco</forename><surname>Passarotti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Marco</forename><surname>Budassi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Eleonora</forename><surname>Litta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Paolo</forename><surname>Ruffolo</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the NoDaLiDa 2017 Workshop on Processing Historical Language</title>
				<meeting>the NoDaLiDa 2017 Workshop on Processing Historical Language</meeting>
		<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="24" to="31" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">L&apos;utilizzo del lemmatizzatore LEMLAT per una sistematizzazione dell&apos;omografia in latino</title>
		<author>
			<persName><forename type="first">Marco</forename><surname>Passarotti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Paolo</forename><surname>Ruffolo</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">EUPHROSYNE</title>
		<imprint>
			<biblScope unit="volume">32</biblScope>
			<biblScope unit="page" from="99" to="110" />
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<title level="m" type="main">A universal part-of-speech tagset</title>
		<author>
			<persName><forename type="first">Slav</forename><surname>Petrov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dipanjan</forename><surname>Das</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ryan</forename><surname>Mcdonald</surname></persName>
		</author>
		<idno>ArXiv:1104-2086</idno>
		<imprint>
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Reduction and maintenance of overabundance. A case study on Italian verb paradigms</title>
		<author>
			<persName><forename type="first">Anna</forename><forename type="middle">M</forename><surname>Thornton</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Word Structure</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="183" to="207" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<title level="m" type="main">Thesaurus formarum totius latinitatis a Plauto usque ad saeculum XXum</title>
		<author>
			<persName><forename type="first">Paul</forename><surname>Tombeur</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1998">1998</date>
			<publisher>Brepols</publisher>
			<pubPlace>Turnhout</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<monogr>
		<title level="m" type="main">Morph-it!: a free corpus-based morphological resource for the italian language</title>
		<author>
			<persName><forename type="first">Eros</forename><surname>Zanchetta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Marco</forename><surname>Baroni</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
