<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">DWUGs-IT: Extending and Standardizing Lexical Semantic Change Detection for Italian</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Pierluigi</forename><surname>Cassotti</surname></persName>
							<email>pierluigi.cassotti@gu.se</email>
							<affiliation key="aff0">
								<orgName type="department" key="dep1">Department of Philosophy</orgName>
								<orgName type="department" key="dep2">Linguistics and Theory of Science</orgName>
								<orgName type="institution">University of Gothenburg</orgName>
								<address>
									<settlement>Gothenburg</settlement>
									<country key="SE">Sweden</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Pierpaolo</forename><surname>Basile</surname></persName>
							<email>pierpaolo.basile@uniba.it</email>
							<affiliation key="aff1">
								<orgName type="department">Department of Computer Science</orgName>
								<orgName type="institution">University of Bari Aldo Moro</orgName>
								<address>
									<addrLine>via E. Orabona</addrLine>
									<postCode>70125</postCode>
									<settlement>Bari</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Nina</forename><surname>Tahmasebi</surname></persName>
							<email>nina.tahmasebi@gu.se</email>
							<affiliation key="aff0">
								<orgName type="department" key="dep1">Department of Philosophy</orgName>
								<orgName type="department" key="dep2">Linguistics and Theory of Science</orgName>
								<orgName type="institution">University of Gothenburg</orgName>
								<address>
									<settlement>Gothenburg</settlement>
									<country key="SE">Sweden</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff2">
								<orgName type="department">Tenth Italian Conference on Computational Linguistics</orgName>
								<address>
									<addrLine>Dec 04 -06</addrLine>
									<postCode>2024</postCode>
									<settlement>Pisa</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">DWUGs-IT: Extending and Standardizing Lexical Semantic Change Detection for Italian</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">F9FA3FD8D05B014DDF7A19F2B4C869B1</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T17:35+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Lexical Semantic Change</term>
					<term>Sense-annotated corpora</term>
					<term>Italian</term>
					<term>Historical Linguistics</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Lexical Semantic Change Detection (LSCD) is the task of determining whether a word has undergone a change in meaning over time. There has been a marked increase in interest in this task, accompanied by a corresponding growth in the scientific community involved in developing computational approaches to semantic change. In recent years, a number of resources have been made available for the evaluation of LSC models in a number of languages, including English, Swedish, German, Latin, Russian and Chinese. DIACR-ITA is the only existing resource for LSCD in Italian. However, DIACR-ITA has a different format from that used for other languages. In this paper, we present DWUGs-IT, which extends the DIACR-ITA dataset with additional target words and usage-sense pair annotations and adapts it to the DURel format, including the first implementation of a LSCD graded task for Italian.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>As is the case with both society and culture, language is subject to change over time. Two key factors cause such linguistic change. Firstly, there are purely evolutionary and linguistic considerations driven by the need for more efficient communication <ref type="bibr" target="#b0">[1]</ref>. One example of this is the use of abbreviations and acronyms, such as LOL (Laughing Out Loud), which have become commonplace on social media platforms. Secondly, changes in society and culture lead to changes in language. This can be seen, for example, in the adoption of a more inclusive language, as evidenced by grammatically gendered languages, including Italian and the introduction of @ to replace masculine and feminine endings <ref type="bibr" target="#b1">[2]</ref>.</p><p>Language may undergo alteration at various levels, including morphological, syntactic, and semantic. Semantic change concerns the alteration of the meaning of words over time. The study of semantic change is a prominent area of research in Historical Linguistics, with the aim of investigating the linguistic mechanisms that characterize the change and the causes that trigger it. For instance, Blank <ref type="bibr" target="#b2">[3]</ref> provides a broad study on the characterization of semantic change, identifying a number of different types of change, including metaphor, metonymy, generalization, specialization, co-hyponym transfer and auto-antonym. The English word bad, for example, has acquired an auto-antonym meaning, i.e. a meaning that is the opposite of its original meaning. In addition to its original connotation of poor quality or negative, it has also acquired the opposite connotation of good or cool. The term meat has undergone a process of specialization in its meaning, whereby it has shifted from referring to any kind of food in general to exclusively denoting the meat of animals consumed as food.</p><p>While traditional linguistic methods are informative, they are often based on small, carefully curated samples. In contrast, linguistic analyses using computational models not only accelerate our understanding of language change but also provide broader and more detailed insights, thereby facilitating the study of vast corpora across a wider range of genres and time <ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b4">5]</ref>.</p><p>From a computational perspective, two key challenges emerge in the study of semantic change: the modelling of word meanings over time and the detection of change <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b6">7]</ref>. At the synchronic level, ignoring the temporal dimension with a focus on modern corpora, the Natural Language Processing community has made significant strides in modelling word meanings, with approaches such as Word Sense Disambiguation (WSD) <ref type="bibr" target="#b7">[8]</ref> playing a pivotal role. Computational modelling of semantic change introduces a significant level of complexity, as it necessitates the handling of meanings that are either extinct or novel in comparison to existing lexicographic resources, such as WordNet, as well as dynamically changing meaning representations.</p><p>In recent years, great efforts have been made to advance the field of computational methods for Lexical Semantic Change Detection. With initiatives such as the Workshop on Computational Approaches to Historical Language Change <ref type="bibr" target="#b8">[9]</ref> promoting research in this field or shared tasks such as SemEval 2020 Task 1 <ref type="bibr" target="#b9">[10]</ref>, RuShiftEval <ref type="bibr" target="#b10">[11]</ref>, DIACR-ITA <ref type="bibr" target="#b11">[12]</ref>, or LSCD Discovery <ref type="bibr" target="#b12">[13]</ref> leading to the development of the first evaluation resources. DIACR-Ita, hosted in EVALITA 2020 <ref type="bibr" target="#b14">[14]</ref>, is the first shared task specifically created for the evaluation of models for Lexical Semantic Change in Italian. The majority of the evaluation resources follow a two-task approach: (1) a binary task, which requires the assignment of a word to either the changed or stable label, based on whether the word has undergone a change in meaning or not; and (2) a graded (ranking) task, which requires the sorting of words based on the extent of their change (over time). These labels are assigned on the basis of human-annotated data, typically in the form of a graded word-in-context task.</p><p>DIACR-Ita, however, diverges from the evaluation process employed in SemEval 2020 Task 1, RushiftEval and several other datasets that emerged subsequently. This results in a distinct configuration of the task and the released data. For example, DIACR-Ita only has a binary task but does not include a graded task. Moreover, only the target words with their gold truth labels were made available for the shared task, while the remaining data produced during the annotation process were not. In this paper, 1. we release DWUGs-IT <ref type="foot" target="#foot_0">1</ref>  </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Related Work</head><p>DURel <ref type="bibr" target="#b15">[15]</ref> is a framework for the annotation of Lexical Semantic Change across a pair of time periods or corpora.</p><p>The annotation involves human labelling of pairs of sentences containing the target word. The sentences can be contemporary, i.e. originating from the same time period, or diachronic, denoting a divergence in time between the two periods under consideration. An annotator has to decide whether the meaning expressed by the word in the two sentences is Unrelated (1), Distantly Related</p><p>(2), Closely Related (3) or Identical <ref type="bibr" target="#b3">(4)</ref>. The scale of semantic relatedness is derived from the cognitive model proposed by <ref type="bibr">Blank [20]</ref> and corresponds to the values of Homonymy (1), Polysemy (2), Context Variance (3) and Identity (4).</p><p>The annotations are then presented in the form of a graph, specifically a Word Usage Graph (WUGs) or a Diachronic Word Usage Graph (DWUGs) <ref type="bibr" target="#b21">[21]</ref> in cases where the usages originate from different time periods. In these graphs, the nodes correspond to the word uses and the edges correspond to the median of the annotations. The diachronic graph is then subjected to clustering in order to identify the senses. Before clustering, a new graph is created by binarizing the edges, where an edge between two uses is established if the score of the original edge weight is less than 2.5, or in other words if the average annotation for this pair of uses is less than 2.5. Since the graph typically exhibits considerable sparsity, which limits the applicability of conventional clustering algorithms, a variation of the correlation clustering algorithm <ref type="bibr" target="#b22">[22]</ref> is typically used, as it is able to model this type of sparsely connected graph.</p><p>Once the (diachronic) clusters have been obtained, they can be considered to represent the senses. The distribution of the usages from different time periods in each cluster (sense) is then analyzed to obtain a change score. For instance, one can determine a graded change score by computing the Jensen-Shannon Distance (JSD) on the probability distributions of senses across various time periods. This is expressed as</p><formula xml:id="formula_0">√︂ 𝐷(𝑃 || 𝑀 ) + 𝐷(𝑄 || 𝑀 ) 2</formula><p>where 𝑃 and 𝑄 represent the probability distributions of clusters from different historical periods, 𝐷 denotes the Kullback-Leibler divergence, and 𝑀 = (𝑃 +𝑄) 2 <ref type="bibr" target="#b23">[23,</ref><ref type="bibr" target="#b24">24]</ref>. Furthermore, a binary label can be obtained, whereby words that have undergone a change in meaning over time are assigned a changed label (words that have gained/lost a sense), while words that have retained their meaning are labelled stable. The label is typically assigned by evaluating the frequency of senses in different time periods and establishing thresholds to distinguish stable and changed words.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Datasets based on DURel</head><p>SemEval 2020 Task 1 <ref type="bibr" target="#b9">[10]</ref> is the first initiative to standardize the evaluation of computational approaches to semantic change. SemEval 2020 Task 1 focuses on English, German, Swedish and Latin and proposes a common evaluation framework with two tasks: classifying target words as those whose meaning has changed or remained stable, and ranking words according to their degree of change. Special attention is given to Latin due to the lack of native speakers. Therefore, in the annotation of the Latin dataset, usage-sense pairs are considered rather than usage-usage pairs, and the annotator is asked to decide how related the considered usage is to a particular sense, using the DURel scale from Unrelated to Identical. RuShiftEval <ref type="bibr" target="#b10">[11]</ref> aimed to detect semantic shifts in Russian across pre-Soviet, Soviet, and post-Soviet periods. The dataset included 111 Russian nouns, with participants ranking them by their degree of change (using the COMPARE measure <ref type="bibr" target="#b15">[15]</ref>, an approximation of the JSD). The task focused on ranking changes, with evaluations based on Spearman rank correlations. LSC Discovery <ref type="bibr" target="#b12">[13]</ref> focused on detecting and discovering semantic changes in Spanish. It is divided into Graded Change Discovery and Binary Change Detection. The task required evaluations for all vocabulary words in the corpus, covering periods from 1810-1906 and 1994-2020. NorDiaChange <ref type="bibr" target="#b25">[25]</ref> studied diachronic semantic change in Norwegian. The dataset included 80 nouns reflecting significant historical periods, such as pre-and post-war events and technological advances. ZhShiftEval <ref type="bibr" target="#b26">[26,</ref><ref type="bibr" target="#b27">27]</ref> assessed semantic change in Chinese over 50 years, focusing on the period around Reform and Opening Up. The dataset used texts from the People's Daily and included 20 words chosen for their frequency and noted changes.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">DIACR-ITA</head><p>The DIACR-ITA annotation was conducted on word usages extracted from L'Unità corpus <ref type="bibr" target="#b28">[28]</ref>. L'Unità corpus comprises a collection of Italian texts extracted from the newspaper L'Unità. In order to evaluate semantic change, the corpus has been divided into two sub-corpora, covering the period from 1948 to 1970 and the period from 1990 to 2014, respectively. A time window of 20 years between the sub-corpora ensures sufficient distance between the two periods, allowing for the tracking of potentially more pronounced semantic changes. The sub-corpora statistics are presented in Table <ref type="table" target="#tab_1">1</ref>.</p><p>The selection of target words was based on the information provided in the Sabatini-Coletti dictionary of the Italian language, which records the year of the first occurrence of a word's sense. The initial step involved the extraction of a list of words from Sabatini-Coletti for which the dictionary reported a semantic change, i.e. the introduction of at least one new sense after 1970. Moreover, an examination of the set of words was conducted to ensure that the sampled words appeared at least 10 times in each of the two periods and that the occurrences of these words were not significantly affected by OCR errors. Consequently, 26 target words were identified. For each target word, up to 50 occurrences from each of the two sub-corpora were extracted.</p><p>The senses of each word were classified into two groups: the senses recorded in the Sabatini-Coletti dic-tionary for the period 1948-1970 (Group 1) and the new senses introduced after 1970 (Group 2). The annotators were required to determine whether the sense of each word usage belonged to Group 1, Group 2, or to another category if the word sense did not align with either group (Other). Additionally, the annotator may indicate a preference of Cannot decide for the uses in which they were uncertain. Five annotators fluent in Italian annotated DIACR-ITA. Each sentence was annotated by two annotators. The disagreement cases were resolved by the two annotators involved, analyzing the disagreement and deciding on an unambiguous label.</p><p>Each target word was labelled as stable or changed. A word was considered changed if there was at least one instance of Group 2 among the extracted usages from the period between 1990 and 2014 and no instances of Group 2 among the extracted usages from the period between 1948 and 1970. The final dataset consisted of 18 words, of which 6 were changed and 12 were stable. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Corpus</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">DWUGs-IT</head><p>DWUGs-IT builds on the DIACR-ITA dataset, adapting it to the DURel format and adding eight new words. It also provides the usage-sense annotated pairs that were not initially released, as summarized in Table <ref type="table">2</ref>. For each target word, we format the annotated usages following the WUG style, including the time period of the usage and the word's position in the sentence. Similarly, we format and release the annotated sense labels in a way similar to DWUG LA <ref type="bibr" target="#b29">[29]</ref>. Unlike the traditional WUG approach, where sense preference is not explicitly marked, in DIACR-ITA, annotators clearly indicate their preference for one sense over others. For example, in the usage of the word api (Italian for bees), in the sentence "Dalle api un dolce dono" ("From bees, a sweet gift"), the annotators choose the sense insect while discarding the alternative sense means of transport. For each use-sense pair not selected by annotators, a rating of 1 (Unrelated) is assigned, while matched pairs receive a rating of 4 (Identical), in line with the DURel scale.</p><p>Since human annotators already provide the sense labels, we do not cluster usages automatically (as is typically done in the WUG approach), but directly assign the annotated meanings. All subsequent calculations, such as Piccolo computer facilmente trasportabile, funzionante anche a batteria e quindi utilizzabile in viaggio -telefono portatile impegnativa agg. che richiede impegno Dichiarazione con cui si assume un impegno; in partic. nel l. burocr., documento con cui un ente mutualistico si impegna a coprire, nella misura prevista dalla legge, le spese sanitarie di un suo iscritto: fare l'i. per le analisi</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 2</head><p>Newly introduced words together with the senses of <ref type="bibr">Group 1 (1948</ref><ref type="bibr">Group 1 ( -1970))</ref>, Group 2 which involves senses introduced after 1970, and an indication of the presence of other senses not listed in Group 1 and Group 2.</p><p>change scores and related statistics, follow the standard WUG methodology.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Evaluation</head><p>XL-LEXEME has been tested on different languages before but has never been evaluated on Italian. In this section, we evaluate XL-LEXEME on the new DWUGs-IT dataset using the traditional evaluation pipeline for the DWUGs <ref type="bibr" target="#b19">[19,</ref><ref type="bibr" target="#b30">30]</ref>. We assess the ability to derive a reliable change score (Graded Change Detection) and evaluate the possibility of clustering the XL-LEXEME vectors to automatically induce target word senses, which are then compared to the DWUGs-IT annotations via the Adjusted Rand Index and the Purity measure.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.">Model</head><p>XL-LEXEME, built on XLM-RoBERTa large <ref type="bibr" target="#b31">[31]</ref>, is trained for the Word-in-Context (WiC) task <ref type="bibr" target="#b32">[32]</ref>, which determines if a word has the same meaning in two sentences. Using a Siamese architecture <ref type="bibr" target="#b33">[33]</ref>, it creates word vectors. The loss function adjusts weights via cosine distance, aligning vectors for the same meanings and separating them for different meanings. To calculate the change score, a classic approach is to use the Average Pairwise Distance between the vectors computed over the two different periods:</p><formula xml:id="formula_1">LSC(𝑠 𝑡 0 𝑤 , 𝑠 𝑡 1 𝑤 ) = 1 𝑁 • 𝑀 𝑁 ∑︁ 𝑖=0 𝑀 ∑︁ 𝑗=0 𝛿(𝑠 𝑡 0 𝑤,𝑖 , 𝑠 𝑡 1 𝑤,𝑗 )<label>(1)</label></formula><p>where 𝛿 is the cosine distance and 𝑠 𝑡 𝑤 is the set of sentences containing the word 𝑤 at time 𝑡. For the Word Sense Induction step, we cluster the vectors into senses using Agglomerative Clustering<ref type="foot" target="#foot_1">2</ref> with a cosine threshold of 0.5 and Average Linkage, which merges clusters with a similarity greater than 0.5.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2.">Metrics</head><p>We test the ability of XL-LEXEME in ranking words according to their change scores (Graded Change Detection) using Spearman Correlation. Cluster quality is assessed using the Adjusted Rand Index (ARI) <ref type="bibr" target="#b34">[34]</ref>, which is defined as follows: 𝐴𝑅𝐼 = 𝑅𝐼 − 𝐸𝑥𝑝𝑒𝑐𝑡𝑒𝑑𝑅𝐼 𝑚𝑎𝑥(𝑅𝐼) − 𝐸𝑥𝑝𝑒𝑡𝑒𝑑𝑅𝐼 𝑅𝐼 stands for the Rand Index, which measures the number of pair agreements within the data -that is, pairs of instances that are correctly placed in the same cluster. The 𝐸𝑥𝑝𝑒𝑡𝑐𝑡𝑒𝑑𝑅𝐼 is the expected number of such agreements by chance, calculated based on the distribution of the clusters, while the 𝑚𝑎𝑥(𝑅𝐼) is the maximum possible value of 𝑅𝐼, which occurs when all pairs are classified perfectly. We use Purity in addition to ARI to capture cluster homogeneity and provide clearer insight about how mixed the clusters are in terms of class labels, i.e. Purity = 1 𝑁</p><formula xml:id="formula_2">∑︁ 𝑘 max 𝑗 |𝑐 𝑘 ∩ 𝑡𝑗|</formula><p>where 𝑁 is the total number of instances, 𝑐 𝑘 denotes cluster 𝑘, and 𝑡𝑗 represents class 𝑗. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.3.">Results</head><p>We begin to discuss qualitative results. Figure <ref type="figure" target="#fig_0">1</ref> illustrates the t-SNE visualization of XL-LEXEME embeddings for the usages of the words palmare, rampante, and pilotato.</p><p>For palmare (Figure <ref type="figure" target="#fig_0">1a</ref>), the senses are well separated except for some instances of the sense relating to the palm, clear, evident that are placed closer to the PDA device meaning, for example: sono state n levate di le impronte palmari che saranno inviate al1' archivio generale segnaletico di Roma. (en. The palm prints have been removed and will be sent to the general sign archive of Rome.)</p><p>For the word rampante (Figure <ref type="figure" target="#fig_0">1b</ref>), the annotators identi-fied an additional meaning (Other) that refers to a named entity, i.e., Il barone rampante written by Italo Calvino. The instances of Il barone rampante fall in the middle of the cluster of the rearing and ambitious meanings. Interestingly, the only instance annotated as Cannot decide falls in the rearing cluster:</p><p>Uno rampante » non ci aia ancora nulla da fare, comunque i tecnici....supremazia di le Ferrari. (en. A rampant » there is still nothing to be done, in any case the technicians.... supremacy of the Ferrari.)</p><p>This instance is ambiguous since the subject of rampante is missing in the sentence. However, interestingly, XL-LEXEME assumes it to have the rearing meaning, probably due to the presence of the word Ferrari, referring to the Ferrari logo. Figure <ref type="figure" target="#fig_0">1c</ref> shows how the embeddings of the usages of pilotato are perfectly split according to the sense labels. However, one instance of the meaning driven falls in the cluster of the manipulated instances, which can be considered ambiguous and open to interpretation:</p><p>Twingo Easy offre la grande comodità di un cambio con frizione pilotata, ovvero: non c' è più il pedale della frizione. (en. Twingo Easy offers the great convenience of a gearbox with a piloted clutch, that is: there is no longer a clutch pedal.)</p><p>The quantitative results of XL-LEXEME are reported in Table <ref type="table">3</ref>. Compared to LSCD benchmarks in other languages, XL-LEXEME shows similar results for the GCD score (ranging from 0.567 in NO to 0.851 in RU) and the ARI score (ranging from 0.249 in SV to 0.400 in ES). It also performs slightly better using the purity measure (ranging from 0.766 in SV to 0.836 in ZH). These results likely stem from the properties of the dataset that includes several monosemous words, but also from the process that has been used for DWUGs-IT where senses are modeled explicitly. Purity measures the extent to which clusters contain a single class. With many monosemous words, achieving high purity is easier since these words inherently belong to one sense group. ARI, on the other hand, evaluates the similarity between the clustering results and the ground truth, accounting for both the clustering quality and the number of clusters. In DWUGs-IT, most groups of word senses have just one meaning. But sometimes, a group of words can have several meanings, and how often each meaning is used can change over time. For example, the word palmare has three meanings in its Group 1: i) related to the palm of the hand, ii) something that fits in your hand, and iii) something that is obvious or clear. Over time, some of these meanings might be used more or less often. However, because all three meanings are grouped together, DWUGs-IT does not take into account how the use of each of those meanings changes over time. This broad categorization of senses Graded Change Detection (Spearman Correlation) 0.51 Adjusted Rand Index (ARI) 0.28 Purity 0.89</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 3 XL-LEXEME Results</head><p>can impact the performance of XL-LEXEME, which analyzes meanings at a more detailed level. Additionally, XL-LEXEME has been tested on different languages before but has never been evaluated on Italian. DWUGs-IT models senses explicitly, whereas previous datasets inferred senses automatically by comparing pairs of usages. This automatic inference process is similar to the approach XL-LEXEME uses, potentially making it better suited for datasets without explicit sense modelling.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Conclusion</head><p>This paper presents DWUGs-IT, an extension and standardization of the Lexical Semantic Change Detection (LSCD) task for Italian, based on the existing DIACR-ITA dataset. The dataset is expanded with additional target words and its format is aligned with that of the resources used for other languages. This involves the introduction of the first graded task for Italian. The standardized dataset and the evaluation framework we provide can serve as a foundation for future research in LSCD for Italian. By aligning the Italian dataset with those of other languages, we facilitate cross-linguistic comparisons and contribute to the broader understanding of semantic change mechanisms. In addition, we provide a first evaluation of the state-of-the-art LSCD model, XL-LEXEME, for Italian and both show its effectiveness as well as set a baseline for future work.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: t-SNE visualization of XL-LEXEME embeddings with respect to the annotated clusters for changed words palmare, rampante, and pilotato.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 1</head><label>1</label><figDesc>Sub-corpora statistics.</figDesc><table><row><cell></cell><cell>Period</cell><cell>#Tokens</cell></row><row><cell>L'Unità</cell><cell>1948-1970</cell><cell>52,287,734</cell></row><row><cell>L'Unità</cell><cell>1990-2014</cell><cell>196,539,403</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">DWUGs-IT is available on Zenodo https://zenodo.org/records/ 13941618.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">https://scikit-learn.org/stable/modules/generated/sklearn. cluster.AgglomerativeClustering.html#sklearn.cluster. AgglomerativeClustering</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>This work has in part been funded by the research program Change is Key! supported by Riksbankens Jubileumsfond (under reference number M21-0021). The computational resources were provided by the National Academic Infrastructure for Supercomputing in Sweden (NAISS), partially funded by the Swedish Research Council through grant agreement no. 2022-06725. We acknowledge the support of the PNRR project FAIR -Future AI Research (PE00000013), Spoke 6 -Symbiotic AI (CUP H97G22000210007) under the NRRP MUR program funded by the NextGenerationEU. We would also like to thank Tommaso Caselli, Annalina Caputo and Rossella Varvara, who contributed to the development of the DIACR-ITA resource, and Dominik Schlechtweg for valuable feedback on a preliminary draft of this work.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">A synopsis of linguistic theory 1930-55</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">R</forename><surname>Firth</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Studies in linguistic analysis</title>
		<imprint>
			<biblScope unit="volume">59</biblScope>
			<biblScope unit="page" from="1" to="32" />
			<date type="published" when="1952">1952. 1957</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Emerging trends in gender-specific occupational titles in italian newspapers</title>
		<author>
			<persName><forename type="first">P</forename><surname>Cassotti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Iovine</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Basile</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>De Gemmis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Semeraro</surname></persName>
		</author>
		<ptr target="https://ceur-ws.org/Vol-3033/paper52.pdf" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Eighth Italian Conference on Computational Linguistics, CLiC-it 2021</title>
		<title level="s">CEUR Workshop Proceedings</title>
		<editor>
			<persName><forename type="first">E</forename><surname>Fersini</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Passarotti</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">V</forename><surname>Patti</surname></persName>
		</editor>
		<meeting>the Eighth Italian Conference on Computational Linguistics, CLiC-it 2021<address><addrLine>Milan, Italy</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2021">January 26-28, 2022. 2021</date>
			<biblScope unit="volume">3033</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><surname>Blank</surname></persName>
		</author>
		<title level="m">Prinzipien des lexikalischen Bedeutungswandels am Beispiel der romanischen Sprachen</title>
				<imprint>
			<publisher>Walter de Gruyter</publisher>
			<date type="published" when="2012">2012</date>
			<biblScope unit="volume">285</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Using synchronic definitions and semantic relations to classify semantic change types</title>
		<author>
			<persName><forename type="first">P</forename><surname>Cassotti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">D</forename><surname>Pascale</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Tahmasebi</surname></persName>
		</author>
		<idno type="DOI">10.18653/V1/2024.ACL-LONG.249</idno>
		<ptr target="https://doi.org/10.18653/v1/2024.acl-long.249.doi:10.18653/V1/2024.ACL-LONG.249" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics</title>
				<editor>
			<persName><forename type="first">L</forename><surname>Ku</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Martins</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">V</forename><surname>Srikumar</surname></persName>
		</editor>
		<meeting>the 62nd Annual Meeting of the Association for Computational Linguistics<address><addrLine>Bangkok, Thailand</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2024">August 11-16, 2024. 2024</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="4539" to="4553" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Analyzing semantic change through lexical replacements</title>
		<author>
			<persName><forename type="first">F</forename><surname>Periti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Cassotti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Dubossarsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Tahmasebi</surname></persName>
		</author>
		<idno type="DOI">10.18653/V1/2024.ACL-LONG.246</idno>
		<ptr target="https://doi.org/10.18653/v1/2024.acl-long.246.doi:10.18653/V1/2024.ACL-LONG.246" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2024</title>
				<editor>
			<persName><forename type="first">L</forename><surname>Ku</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Martins</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">V</forename><surname>Srikumar</surname></persName>
		</editor>
		<meeting>the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2024<address><addrLine>Bangkok, Thailand, Au</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2024">gust 11-16, 2024. 2024</date>
			<biblScope unit="page" from="4495" to="4510" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Survey of computational approaches to lexical semantic change detection</title>
		<author>
			<persName><forename type="first">N</forename><surname>Tahmasebi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Borin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Jatowt</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Computational approaches to semantic change</title>
				<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="volume">6</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<author>
			<persName><forename type="first">S</forename><surname>Montanelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Periti</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2304.01666</idno>
		<title level="m">A Survey on Contextualised Semantic Shift Detection</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Word Sense Disambiguation: A Survey</title>
		<author>
			<persName><forename type="first">R</forename><surname>Navigli</surname></persName>
		</author>
		<idno type="DOI">10.1145/1459352.1459355</idno>
		<idno>doi:10.1145/1459352. 1459355</idno>
		<ptr target="https://doi.org/10.1145/1459352.1459355" />
	</analytic>
	<monogr>
		<title level="j">ACM Comput. Surv</title>
		<imprint>
			<biblScope unit="volume">41</biblScope>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<ptr target="https://aclanthology.org/2023.lchange-1.0" />
		<title level="m">Proceedings of the 4th Workshop on Computational Approaches to Historical Language Change, Association for Computational Linguistics</title>
				<editor>
			<persName><forename type="first">N</forename><surname>Tahmasebi</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Montariol</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">H</forename><surname>Dubossarsky</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Kutuzov</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Hengchen</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">D</forename><surname>Alfter</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">F</forename><surname>Periti</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">P</forename><surname>Cassotti</surname></persName>
		</editor>
		<meeting>the 4th Workshop on Computational Approaches to Historical Language Change, Association for Computational Linguistics<address><addrLine>Singapore</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">SemEval-2020 Task 1: Unsupervised Lexical Semantic Change Detection</title>
		<author>
			<persName><forename type="first">D</forename><surname>Schlechtweg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Mcgillivray</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Hengchen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Dubossarsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Tahmasebi</surname></persName>
		</author>
		<ptr target="https://www.aclweb.org/anthology/2020.semeval-1.1/" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Fourteenth Workshop on Semantic Evaluation, SemEval@COLING2020, International Committee for Computational Linguistics</title>
				<editor>
			<persName><forename type="first">A</forename><surname>Herbelot</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">X</forename><surname>Zhu</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Palmer</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">N</forename><surname>Schneider</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>May</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">E</forename><surname>Shutova</surname></persName>
		</editor>
		<meeting>the Fourteenth Workshop on Semantic Evaluation, SemEval@COLING2020, International Committee for Computational Linguistics<address><addrLine>Barcelona (online</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="1" to="23" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">RuShiftEval: A Shared Task on Semantic Shift Detection for Russian</title>
		<author>
			<persName><forename type="first">A</forename><surname>Kutuzov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Pivovarova</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of the International Conference on Computational Linguistics and Intellectual Technologies (Dialogue)</title>
				<meeting>of the International Conference on Computational Linguistics and Intellectual Technologies (Dialogue)</meeting>
		<imprint>
			<publisher>Redkollegija sbornika</publisher>
			<date type="published" when="2021">2021</date>
			<biblScope unit="volume">20</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Diacr-ita @ EVALITA2020: overview of the EVALITA2020 diachronic lexical semantics (diacrita) task</title>
		<author>
			<persName><forename type="first">P</forename><surname>Basile</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Caputo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Caselli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Cassotti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Varvara</surname></persName>
		</author>
		<ptr target="http://ceur-ws.org/Vol-2765/paper158.pdf" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Seventh Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2020), Online event</title>
		<title level="s">CEUR Workshop Proceedings</title>
		<editor>
			<persName><forename type="first">V</forename><surname>Basile</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">D</forename><surname>Croce</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><forename type="middle">D</forename><surname>Maro</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><forename type="middle">C</forename><surname>Passaro</surname></persName>
		</editor>
		<meeting>the Seventh Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2020), Online event</meeting>
		<imprint>
			<date type="published" when="2020-12-17">December 17th, 2020. 2020</date>
			<biblScope unit="volume">2765</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<title/>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">D</forename><surname>Zamora-Reina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Bravo-Marquez</surname></persName>
		</author>
		<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">LSCDiscovery: A Shared Task on Semantic Change Discovery and Detection in Spanish</title>
		<author>
			<persName><forename type="first">D</forename><surname>Schlechtweg</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of the Workshop on Computational Approaches to Historical Language Change (LChange), Association for Computational Linguistics (ACL)</title>
				<meeting>of the Workshop on Computational Approaches to Historical Language Change (LChange), Association for Computational Linguistics (ACL)<address><addrLine>Dublin, Ireland</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="149" to="164" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Overview of the 7th evaluation campaign of natural language processing and speech tools for italian</title>
		<author>
			<persName><forename type="first">V</forename><surname>Basile</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Croce</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Di Maro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">C</forename><surname>Passaro</surname></persName>
		</author>
		<author>
			<persName><surname>Evalita</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of Seventh Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop</title>
				<editor>
			<persName><forename type="first">V</forename><surname>Basile</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">D</forename><surname>Croce</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Di Maro</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><forename type="middle">C</forename><surname>Passaro</surname></persName>
		</editor>
		<meeting>Seventh Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop<address><addrLine>EVALITA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2020">2020. 2020. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Diachronic Usage Relatedness (DURel): A Framework for the Annotation of Lexical Semantic Change</title>
		<author>
			<persName><forename type="first">D</forename><surname>Schlechtweg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">S</forename><surname>Im Walde</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Eckmann</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/n18-2027</idno>
		<ptr target="https://doi.org/10.18653/v1/n18-2027.doi:10.18653/v1/n18-2027" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT</title>
				<editor>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Walker</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">H</forename><surname>Ji</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Stent</surname></persName>
		</editor>
		<meeting>the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT<address><addrLine>New Orleans, Louisiana, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2018">2018</date>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="169" to="174" />
		</imprint>
	</monogr>
	<note>Short Papers</note>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">The durel annotation tool: Human and computational measurement of semantic proximity, sense clusters and semantic change</title>
		<author>
			<persName><forename type="first">D</forename><surname>Schlechtweg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">M</forename><surname>Virk</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Sander</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Sköldberg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">T</forename><surname>Linke</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Tahmasebi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Kuhn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">S</forename><surname>Im Walde</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/2024.eacl-demo.15" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2024 -System Demonstrations</title>
				<editor>
			<persName><forename type="first">N</forename><surname>Aletras</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">O</forename><forename type="middle">D</forename><surname>Clercq</surname></persName>
		</editor>
		<meeting>the 18th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2024 -System Demonstrations<address><addrLine>St. Julians, Malta</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2024">March 17-22, 2024. 2024</date>
			<biblScope unit="page" from="137" to="149" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">The durel annotation tool</title>
		<author>
			<persName><forename type="first">P</forename><surname>Sander</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Hengchen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Ma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Sköldberg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Virk</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Schlechtweg</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Book of Abstracts of the Workshop Large Language Models and Lexicography</title>
				<imprint>
			<date type="published" when="2024-10-08">8 October 2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">XL-LEXEME: WiC pretrained model for cross-lingual LEXical sEMantic changE</title>
		<author>
			<persName><forename type="first">P</forename><surname>Cassotti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Siciliani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Degemmis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Semeraro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Basile</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2023.acl-short.135</idno>
		<ptr target="https://aclanthology.org/2023.acl-short.135.doi:10.18653/v1/2023.acl-short.135" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics</title>
				<meeting>the 61st Annual Meeting of the Association for Computational Linguistics<address><addrLine>Toronto, Canada</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="1577" to="1585" />
		</imprint>
	</monogr>
	<note>Short Papers), Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">A systematic comparison of contextualized word embeddings for lexical semantic change</title>
		<author>
			<persName><forename type="first">F</forename><surname>Periti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Tahmasebi</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/2024.naacl-long.240" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</title>
		<title level="s">Long Papers</title>
		<editor>
			<persName><forename type="first">K</forename><surname>Duh</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">H</forename><surname>Gomez</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Bethard</surname></persName>
		</editor>
		<meeting>the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies<address><addrLine>Mexico City, Mexico</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2024">2024</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="4262" to="4282" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<title level="m" type="main">Why do new meanings occur? A cognitive typology of the motivations for lexical semantic change, Historical semantics and cognition</title>
		<author>
			<persName><forename type="first">A</forename><surname>Blank</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1999">1999</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">DWUG: A large Resource of Diachronic Word Usage Graphs in Four Languages</title>
		<author>
			<persName><forename type="first">D</forename><surname>Schlechtweg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Tahmasebi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Hengchen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Dubossarsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Mcgillivray</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Annual Conference of the North American Chapter of the Association for Computational Linguistics</title>
				<meeting><address><addrLine>Mexico City, Mexico</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
	<note>NAACL-HLT 2021), Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Correlation clustering</title>
		<author>
			<persName><forename type="first">N</forename><surname>Bansal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Blum</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Chawla</surname></persName>
		</author>
		<idno type="DOI">10.1023/B:MACH.0000033116.57574.95</idno>
	</analytic>
	<monogr>
		<title level="j">Machine Learning</title>
		<imprint>
			<biblScope unit="volume">56</biblScope>
			<biblScope unit="page" from="89" to="113" />
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Divergence measures based on the shannon entropy</title>
		<author>
			<persName><forename type="first">J</forename><surname>Lin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Transactions on Information Theory</title>
		<imprint>
			<biblScope unit="volume">37</biblScope>
			<biblScope unit="page" from="145" to="151" />
			<date type="published" when="1991">1991</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Dialectometric analysis of language variation in twitter</title>
		<author>
			<persName><forename type="first">G</forename><surname>Donoso</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Sanchez</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects</title>
				<meeting>the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects<address><addrLine>Valencia, Spain</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="16" to="25" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Nordiachange: Diachronic semantic change dataset for norwegian</title>
		<author>
			<persName><forename type="first">A</forename><surname>Kutuzov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Touileb</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Maehlum</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">R</forename><surname>Enstad</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Wittemann</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/2022.lrec-1.274" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Thirteenth Language Resources and Evaluation Conference, LREC 2022</title>
				<editor>
			<persName><forename type="first">N</forename><surname>Calzolari</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">F</forename><surname>Béchet</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">P</forename><surname>Blache</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">K</forename><surname>Choukri</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">C</forename><surname>Cieri</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">T</forename><surname>Declerck</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Goggi</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">H</forename><surname>Isahara</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">B</forename><surname>Maegaard</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Mariani</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">H</forename><surname>Mazo</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Odijk</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Piperidis</surname></persName>
		</editor>
		<meeting>the Thirteenth Language Resources and Evaluation Conference, LREC 2022<address><addrLine>Marseille, France</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2022-06-25">20-25 June 2022. 2022</date>
			<biblScope unit="page" from="2563" to="2572" />
		</imprint>
	</monogr>
	<note>European Language Resources Association</note>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">Lexicon of changes: Towards the evaluation of diachronic semantic shift in Chinese</title>
		<author>
			<persName><forename type="first">J</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Chersoni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C.-R</forename><surname>Huang</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2022.lchange-1.11</idno>
		<ptr target="https://aclanthology.org/2022.lchange-1.11.doi:10.18653/v1/2022.lchange-1.11" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 3rd Workshop on Computational Approaches to Historical Language Change, Association for Computational Linguistics</title>
				<editor>
			<persName><forename type="first">N</forename><surname>Tahmasebi</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Montariol</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Kutuzov</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Hengchen</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">H</forename><surname>Dubossarsky</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><surname>Borin</surname></persName>
		</editor>
		<meeting>the 3rd Workshop on Computational Approaches to Historical Language Change, Association for Computational Linguistics<address><addrLine>Dublin, Ireland</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="113" to="118" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<monogr>
		<author>
			<persName><forename type="first">J</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Chersoni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Schlechtweg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Prokic</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C.-R</forename><surname>Huang</surname></persName>
		</author>
		<idno type="DOI">10.5281/zenodo.10023263</idno>
		<ptr target="https://doi.org/10.5281/zenodo.10023263.doi:10.5281/zenodo.10023263" />
		<title level="m">Chiwug: Diachronic word usage graphs for chinese</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<analytic>
		<title level="a" type="main">Varvara, A diachronic italian corpus based on &quot;l&apos;unità</title>
		<author>
			<persName><forename type="first">P</forename><surname>Basile</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Caputo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Caselli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Cassotti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename></persName>
		</author>
		<ptr target="http://ceur-ws.org/Vol-2769/paper_44.pdf" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Seventh Italian Conference on Computational Linguistics, CLiC-it 2020</title>
		<title level="s">CEUR Workshop Proceedings</title>
		<editor>
			<persName><forename type="first">J</forename><surname>Monti</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">F</forename><surname>Dell'orletta</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">F</forename><surname>Tamburini</surname></persName>
		</editor>
		<meeting>the Seventh Italian Conference on Computational Linguistics, CLiC-it 2020<address><addrLine>Bologna, Italy</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2020">March 1-3, 2021. 2020</date>
			<biblScope unit="volume">2769</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<monogr>
		<title level="m" type="main">Dwug la: Diachronic word usage graphs for latin</title>
		<author>
			<persName><forename type="first">B</forename><surname>Mcgillivray</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Schlechtweg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Dubossarsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Tahmasebi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Hengchen</surname></persName>
		</author>
		<idno type="DOI">10.5281/zenodo.5255228</idno>
		<ptr target="https://doi.org/10.5281/zenodo.5255228.doi:10.5281/zenodo.5255228" />
		<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b30">
	<analytic>
		<title level="a" type="main">Sense through time: diachronic word sense annotations for word sense induction and lexical semantic change detection</title>
		<author>
			<persName><forename type="first">D</forename><surname>Schlechtweg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">D</forename><surname>Zamora-Reina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Bravo-Marquez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Arefyev</surname></persName>
		</author>
		<idno type="DOI">10.1007/s10579-024-09771-7</idno>
		<ptr target="http://dx.doi.org/10.1007/s10579-024-09771-7.doi:10.1007/s10579-024-09771-7" />
	</analytic>
	<monogr>
		<title level="m">Language Resources and Evaluation</title>
				<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<analytic>
		<title level="a" type="main">Unsupervised Crosslingual Representation Learning at Scale</title>
		<author>
			<persName><forename type="first">A</forename><surname>Conneau</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Khandelwal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Goyal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Chaudhary</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Wenzek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Guzmán</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Grave</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Ott</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Zettlemoyer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Stoyanov</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2020.acl-main.747</idno>
		<ptr target="https://doi.org/10.18653/v1/2020.acl-main.747.doi:10.18653/v1/2020.acl-main.747" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020</title>
				<editor>
			<persName><forename type="first">D</forename><surname>Jurafsky</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Chai</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">N</forename><surname>Schluter</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><forename type="middle">R</forename><surname>Tetreault</surname></persName>
		</editor>
		<meeting>the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020</meeting>
		<imprint>
			<date type="published" when="2020">July 5-10, 2020. 2020</date>
			<biblScope unit="page" from="8440" to="8451" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b32">
	<analytic>
		<title level="a" type="main">WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">T</forename><surname>Pilehvar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Camacho-Collados</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/n19-1128</idno>
		<ptr target="https://doi.org/10.18653/v1/n19-1128.doi:10.18653/v1/n19-1128" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019</title>
				<editor>
			<persName><forename type="first">J</forename><surname>Burstein</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">C</forename><surname>Doran</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">T</forename><surname>Solorio</surname></persName>
		</editor>
		<meeting>the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019<address><addrLine>Minneapolis, MN, USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019">June 2-7, 2019. 2019</date>
			<biblScope unit="page" from="1267" to="1273" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b33">
	<analytic>
		<title level="a" type="main">Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks</title>
		<author>
			<persName><forename type="first">N</forename><surname>Reimers</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Gurevych</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/D19-1410</idno>
		<ptr target="https://aclanthology.org/D19-1410.doi:10.18653/v1/D19-1410" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), Association for Computational Linguistics</title>
				<meeting>the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), Association for Computational Linguistics<address><addrLine>Hong Kong, China</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="3982" to="3992" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b34">
	<analytic>
		<title level="a" type="main">A survey of clustering algorithms for big data: Taxonomy and empirical analysis</title>
		<author>
			<persName><forename type="first">A</forename><surname>Fahad</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Alshatri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Tari</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Alamri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Khalil</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">Y</forename><surname>Zomaya</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Foufou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Bouras</surname></persName>
		</author>
		<ptr target="https://ieeexplore.ieee.org/iel7/6245516/6939750/06832486.pdf" />
	</analytic>
	<monogr>
		<title level="j">IEEE transactions on emerging topics in computing</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="267" to="279" />
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
