<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Analysis of Lexical Ambiguity in Vector Space Models</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Marta</forename><forename type="middle">Vázquez</forename><surname>Abuín</surname></persName>
							<email>martavazquez.abuin@usc.gal</email>
							<affiliation key="aff0">
								<orgName type="laboratory">Centro Singular de Investigación en Tecnoloxías Intelixentes (CITIUS)</orgName>
								<orgName type="institution">Universidade de Santiago de Compostela</orgName>
								<address>
									<postCode>15782</postCode>
									<settlement>Santiago de Compostela</settlement>
									<region>Galicia</region>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Analysis of Lexical Ambiguity in Vector Space Models</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">103D5D7C98A848416B771D8E24FD69DF</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T19:02+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>lexical semantics</term>
					<term>distributional semantics</term>
					<term>Word Sense Disambiguation</term>
					<term>Galician</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The aim of this PhD is to analyze the capabilities of current vector models to deal with lexical ambiguity, particularly in the context of polysemy and homonymy, with a special focus on how models based on Transformer architectures represent these semantic phenomena. To achieve the main objective we have set the following specific objectives: (i) to perform a comprehensive analysis of the state-of-the-art, (ii) to compile a training and evaluation dataset according to the Word in Context format, and (iii) to extend it with data from other languages, with the potential to create a cross-lingual dataset including false friends and equivalent forms. In addition, (iv) to evaluate the computational models and (v, vi) to conduct experiments to compare the results obtained above with human judgments. To do so, we will compile and create datasets to perform an evaluation in terms of lexical ambiguity. We will assess state-of-the-art models to see if they are able to identify the different meanings of ambiguous words in context. Finally, the human judgments on the same dataset will be evaluated and compared with the computational models that have been analyzed. This analysis will be carried out mainly for the Galician language, but also for Portuguese and Spanish according to the evolution of the research.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction and Motivation</head><p>The development of deep learning and transformer-based models in recent years has been a huge improvement for the field of Natural Language Processing (NLP) <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b1">2]</ref>. Nevertheless, the identification and processing of lexical ambiguity remains a significant challenge for linguistic technologies, particularly in the context of automatic and unsupervised approaches <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b3">4,</ref><ref type="bibr" target="#b4">5]</ref>. One strategy for evaluating the ability of computational models to address lexical ambiguity is through Word Sense Disambiguation (WSD) tasks <ref type="bibr" target="#b5">[6]</ref>. The objective of these tasks is to discern the precise meaning of a word within diverse contextual settings, analyze its behavior, and identify potential solutions <ref type="bibr" target="#b6">[7,</ref><ref type="bibr" target="#b7">8,</ref><ref type="bibr" target="#b8">9]</ref> .</p><p>In the computational modelling of lexical semantics, there are two main approaches, both inspired by theoretical proposals <ref type="bibr" target="#b9">[10]</ref>: (a) symbolic modelling, where each element has a specific meaning in a network of semantic relations, and (b) continuous approaches, where lexical forms are represented in a vector space. In the first case, we can highlight the case of Princeton WordNet (PWN) <ref type="bibr" target="#b10">[11]</ref>, which is one of the most frequently used lexical resources in the field of natural language processing. WordNet is a database of semantic relations that groups words into synonym sets (or synsets) and links them to other words according to their shared meaning <ref type="bibr" target="#b10">[11]</ref>. These synsets act as nodes within the semantic networks, with the relations between them represented as edges <ref type="bibr" target="#b10">[11,</ref><ref type="bibr" target="#b11">12]</ref>. Furthermore, it provides glosses and examples that help users to illustrate word usages and clarify their meanings in context <ref type="bibr" target="#b11">[12]</ref>. In continuous approaches, we can include both approaches based on the distributional hypothesis <ref type="bibr" target="#b12">[13,</ref><ref type="bibr" target="#b13">14]</ref> and new models trained with deep learning architectures <ref type="bibr" target="#b14">[15]</ref>. In this context, we can differentiate between two categories: static models such as Word2Vec <ref type="bibr" target="#b15">[16]</ref>, fastText <ref type="bibr" target="#b16">[17]</ref> and GloVe <ref type="bibr" target="#b17">[18]</ref>, where each word is represented by a single vector with its meaning in a specific context; and models based on language models, which present each occurrence of a word in a given context with a different vector that can potentially disambiguate the meaning of the word in context (BERT <ref type="bibr" target="#b0">[1]</ref>, ELMo <ref type="bibr" target="#b14">[15]</ref> or GPT-3 <ref type="bibr" target="#b18">[19]</ref>).</p><p>Lexical ambiguity phenomena, such as polysemy or homonymy, are pervasive across natural languages and present a challenge to computational models as a single word from may have different meanings depending on the context <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b3">4,</ref><ref type="bibr" target="#b4">5]</ref>. In the context of this thesis, it is appropriate to describe these two phenomena. Polysemy involves a single lexeme with several related meanings depending on the context (e.g., the Galician coche 'car' referring to a four-wheeled vehicle or a train carriage). Homonymy, on the other hand, involves a single lexical form consisting of different lexemes with multiple independent meanings (e.g. the Galician canto as one of the divisions of an epic poem 'canto' or as the gap between two walls 'edge') <ref type="bibr" target="#b19">[20]</ref>.</p><p>For English, and other languages like Italian and German, there exists a comprehensive array of resources for the training and evaluation of the resolution of lexical ambiguity in linguistic models. Some examples include the different WordNets and Word in Context datasets, such as WiC <ref type="bibr" target="#b20">[21]</ref>, XL-WiC <ref type="bibr" target="#b21">[22]</ref> or ConSec <ref type="bibr" target="#b22">[23]</ref>). In contrast, the availability of these resources is very limited for under-resourced languages such as Galician. Furthermore, the representation of Galician in comparison with the other languages analyzed is often very small, as evidenced by the XL-WSD dataset <ref type="bibr" target="#b23">[24]</ref>.</p><p>The objective of this PhD is to examine the strategies followed by languages models in the context of lexical ambiguity in Galician <ref type="foot" target="#foot_0">1</ref> .</p><p>The remainder of this work is organized as follows: Section 2 is dedicated to an examination of the diverse tasks and challenges associated with lexical ambiguity and word sense disambiguation. Sections 3 and 4 present the objectives, hypotheses, and proposed research methodology. Finally, section 5 presents the preliminary results of the research.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Related work</head><p>Lexical ambiguity is very important in human language understanding <ref type="bibr" target="#b4">[5]</ref> and one of the most common problems in natural language technologies because it causes misinterpretation of natural language. As a result, identifying and resolving ambiguity is essential for improving the efficiency and reliability of these technologies <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b24">25]</ref>.</p><p>We talk about disambiguation to describe the process of resolving ambiguity errors where WSD is one of the most studied aspects by researchers <ref type="bibr" target="#b5">[6]</ref>. The main objective of the study of this aspect is to determine the word sense in a particular context, such as associating words in context with the most appropriate entry in a predefined sense inventory <ref type="bibr" target="#b6">[7]</ref>.</p><p>One of the most used linguistic resources for lexical disambiguation is WordNet <ref type="bibr" target="#b10">[11]</ref>. This lexical database has become the primary repository for senses in NLP and it is the source of the majority of the datasets and evaluation frameworks for WSD <ref type="bibr" target="#b6">[7]</ref>. WordNet was initially developed specifically for the English language. However, over time, other languages began to develop their own lexical resources based on it, for example, Galnet <ref type="bibr" target="#b25">[26]</ref> (The Galician WordNet <ref type="bibr" target="#b26">[27]</ref>) that is part of the Multilingual Central Repository <ref type="bibr" target="#b27">[28]</ref>. The creation of such resources represents a significant challenge in the majority of languages, especially for languages with limited linguistic resources, where the investment of time and human effort is often a significant obstacle. While new automatic and unsupervised methods have been developed for this purpose, it is also necessary to ensure that they are trained with the quality and quantity of data to obtain adequate results. For this reason, the most popular option has been to extend them with PWN using different methods adapted to their developments and needs, looking for automatic and unsupervised approaches that minimize human and economic effort.</p><p>However, there is still a notable disparity between English and other languages in terms of NLP tools and improvements, as well as the identification and treatment of lexical ambiguity. The gap is particularly significant when attempting to conduct research and work with under-resourced languages, such as Galician.</p><p>To assess the efficacy of automatic lexical disambiguation, WordNet has been employed to construct datasets such as WiC <ref type="bibr" target="#b20">[21]</ref> to evaluate vector models in a particular context and its extended version XL-WiC <ref type="bibr" target="#b21">[22]</ref>. Each instance in the dataset will comprise a target word and two contexts, each with a specific meaning of the target word. The objective is to identify whether the two contexts of the word have the same meaning or not. However, Galician is not included in this analysis due to the limited availability of examples <ref type="bibr" target="#b21">[22]</ref>.</p><p>Regarding the models that will be evaluated, there are the static models, inspired by the distributional hypothesis, where each vector represents all the meanings of a word <ref type="bibr" target="#b13">[14,</ref><ref type="bibr" target="#b15">16]</ref> and the new models based on the deep neural networks <ref type="bibr" target="#b8">[9,</ref><ref type="bibr" target="#b28">29]</ref> that have been a revolution in the field <ref type="bibr" target="#b1">[2]</ref>. These language models can model the meaning of words in context since different vectors are generated for each word in each sentence. Among the most used ones for these tasks, we can mention BERT <ref type="bibr" target="#b0">[1]</ref>, XLM-RoBERTa <ref type="bibr" target="#b29">[30]</ref>, DeBERTa <ref type="bibr" target="#b30">[31]</ref>, ELMo <ref type="bibr" target="#b14">[15]</ref> or GPT <ref type="bibr" target="#b18">[19]</ref>.</p><p>Concerning the behavior of the different models regarding lexical ambiguity, we can observe that contextualized models <ref type="bibr" target="#b8">[9]</ref> are effective in identifying homonyms and distinguishing between meanings <ref type="bibr" target="#b28">[29,</ref><ref type="bibr" target="#b31">32,</ref><ref type="bibr" target="#b32">33]</ref>. However, the accuracy of the results declines when the contexts are similar due to the fine-grained distinctions <ref type="bibr" target="#b7">[8]</ref>. In Loureiro et al <ref type="bibr" target="#b7">[8]</ref> we can see that Transformer models, specifically BERT, have high results and capture sense distinctions, even with few examples, grouping polysemous words according to sense.</p><p>Concerning the human interpretation of lexical ambiguity, some studies have demonstrated that large models are capable of performing in a manner that is similar to human preference <ref type="bibr" target="#b33">[34,</ref><ref type="bibr" target="#b34">35]</ref>, but in Trott et al <ref type="bibr" target="#b35">[36]</ref> they found that the human behavior cannot be explained purely by exposure to language statistics as in the models assessed.</p><p>For Galician, the number of resources and datasets created for lexical disambiguation tasks and models evaluation is very low <ref type="bibr" target="#b32">[33]</ref>. This situation places Galician at a position of disadvantage in relation to other languages in terms of WSD tasks and evaluation. In this context, the present research proposes the design and creation of datasets to evaluate computational models for cases of lexical ambiguity, as well as the behavior in cases of inter-linguistic disambiguation, with special emphasis on false friends. Furthermore, we intend to examine and contrast the results with the human interpretation.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Objectives</head><p>The objective of our research proposal is to conduct an analysis of the behaviour of current vector models with regard to lexical ambiguity in Galician. In order to achieve the primary objective, the following secondary objectives have been proposed. 1. To review the current state-of-the-art in lexical ambiguity for single words and natural language processing 2. To define and compile training and evaluation datasets in Galician, and eventually in other languages (Portuguese and Spanish) following the Word in Context (WiC) format 3. To compile a cross-lingual dataset in Galician, Portuguese and Spanish with a focus on the analysis of false friends 4. To evaluate the performance of computational models in handling lexical ambiguity using the developed datasets 5. To evaluate human judgments on the same dataset as the computational models analyzed 6. To examine and compare the identification and recognition of the phenomena by humans and models</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Research Questions</head><p>In the first stage of the PhD, the following questions were formulated to achieve the established objectives:</p><p>• RQ1: Is written language sufficient for the assessed models to learn how to resolve lexical ambiguity or, on the contrary, would other types of information (visual, speech, etc.) be necessary? H1: Our hypothesis is that it can be sufficient, but under optimal conditions concerning training and computing resources as in Loureiro et al <ref type="bibr" target="#b7">[8]</ref>.</p><p>• RQ2: Can distributional models resolve lexical ambiguity? H2: We assume that it would differ depending on the semantic relationship and the assessed model.</p><p>• RQ3: Which models are better to resolve the ambiguity? H3: Our hypothesis is that the new models based on deep neural networks perform better than static models <ref type="bibr" target="#b36">[37,</ref><ref type="bibr" target="#b4">5,</ref><ref type="bibr" target="#b37">38]</ref>.</p><p>• RQ4: Can the analyzed methods demonstrate an enhanced capacity to comprehend and process lexical ambiguity in a manner that transcends the limitations of human understanding? H4: We assume that humans can perform better regarding lexical ambiguity than actual vector space models <ref type="bibr" target="#b36">[37,</ref><ref type="bibr" target="#b31">32]</ref>, but we can have systematic errors <ref type="bibr" target="#b33">[34]</ref>. However, in certain models, the representations utilized to achieve this are largely aligned with human intuitions <ref type="bibr" target="#b37">[38]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Methodology</head><p>The research will be conducted in the following stages, each one with a specific methodology based on the actual state-of-the-art in computational linguistics and natural language process. The initial stage of the research will entail the creation of a training and evaluation dataset based on WiC <ref type="bibr" target="#b20">[21]</ref> for Galician. The dataset will be created, in accordance with the methodology proposed in the original <ref type="bibr" target="#b20">[21]</ref>, and the other languages of the XL-WiC <ref type="bibr" target="#b21">[22]</ref>, using Galnet<ref type="foot" target="#foot_1">2</ref> to obtain the context sentences. Despite the valuable insights it offers, the size of this resource is insufficient for training and evaluating WSD tools, particularly with regard to the number of example sentences <ref type="bibr" target="#b21">[22]</ref>, so it was necessary to work on expanding the examples and words. In relation to the computational models that will be assessed with this dataset, we will utilize models based on Transformer architectures, in addition to other proposed architectures throughout the study.</p><p>In regard to human behavior and understanding in relation to lexical ambiguity, the objective is to analyze how people identify some semantic phenomena and assess the limit at which one lemma changes its meaning to a new one, as well as the extent to which the senses of the same lexeme diverge from one another. These experiments will be conducted following the standard practices in the fields and will be guided in their performance of the tasks by the previously established methodology <ref type="bibr" target="#b38">[39,</ref><ref type="bibr" target="#b31">32]</ref>.</p><p>Moreover, the intention is to create a cross-lingual dataset that will include other languages, such as Portuguese and Spanish. The details of the dataset have yet to be determined, but it will be structured in accordance with the format previously outlined. Each instance will contain a word (which should have the same or a similar lexical form, taking into account orthographic conventions in both languages) with two contexts: one for Galician and one for the other language (Portuguese or Spanish). The target word has a specific meaning in each context, which may or may not be the same. The selection of comparable lexical forms allows for an examination of the performance of the analyzed models in the context of false friends in closely related languages.</p><p>Finally, a quantitative analysis of the results will be conducted through an evaluation employing accuracy and correlation metrics.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Preliminary results</head><p>In the initial phase of the investigation, it was determined that the number of Galician examples was insufficient for the requisite WDS disambiguation tasks <ref type="bibr" target="#b21">[22]</ref>. We decided to design a method <ref type="bibr" target="#b39">[40]</ref> to enhance the quantity of Galician examples by translating the English examples associated with a synset containing a Galician word in Galnet relying on the state-of-the-art of neural machine translation systems, specifically NOS-MT-OpenNMT-gl-en <ref type="bibr" target="#b40">[41]</ref>. In some cases, however, the synset does not have an associated Galician word. Therefore, bilingual word embeddings were employed as probabilistic dictionaries to search for new words. The following procedure was utilized: the Wikipedia versions of each language were processed with FreeLing <ref type="bibr" target="#b41">[42]</ref> for Galician and UDPipe <ref type="bibr" target="#b42">[43]</ref> for English. Two monolingual fastText models <ref type="bibr" target="#b16">[17]</ref> where trained for each language: with the two versions of the corpus: one lemmatized, and another representing each word as a lemma_POS-tag pair. Finally, we have mapped the monolingual models to a shared vector space. Then, we have designed and evaluated straightforward heuristics to expand Galnet to check if the new sentence can be added as a new one or not. Following the preliminary experiments, we have increased more than 4,5k synsets and 13k Galnet examples. These are being employed to construct WiC-type develope in Galician.</p></div>			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">During the course of the thesis, we will also be able to carry out comparative analyses between other languages such as Portuguese and Spanish.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">https://ilg.usc.gal/galnet/</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Acknowledgments</head><p>This project is carried out within the Research Group in Computational Linguistics (LComp, GI-2201), which is part of the Department of Spanish Language and Literature, Theory of Literature and General Linguistics of the University of Santiago de Compostela. Within the activity 2021-PG012 'Consolidación 2021 Modalidade C. Proxectos de excelencia -Exploración do coñecemento semántico en modelos vectoriais: homonimia, sinonimia, polisemia e idiomaticidade', following the research line 'Comprensión das linguas naturais: sintaxe e semántica computacionais' (PIESP0027) in the Centro Singular de Investigación en Tecnoloxías Intelixentes (CiTIUS) with funding from a pre-doctoral grant from the Xunta de Galicia (ED481A-2024-070).</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">BERT: Pre-training of deep bidirectional transformers for language understanding</title>
		<author>
			<persName><forename type="first">J</forename><surname>Devlin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M.-W</forename><surname>Chang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Toutanova</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/N19-1423</idno>
		<ptr target="https://aclanthology.org/N19-1423.doi:10.18653/v1/N19-1423" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</title>
		<title level="s">Long and Short Papers</title>
		<editor>
			<persName><forename type="first">J</forename><surname>Burstein</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">C</forename><surname>Doran</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">T</forename><surname>Solorio</surname></persName>
		</editor>
		<meeting>the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies<address><addrLine>Minneapolis, Minnesota</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="4171" to="4186" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Survey of transformers and towards ensemble learning using transformers for natural language processing</title>
		<author>
			<persName><forename type="first">H</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">O</forename><surname>Shafiq</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of big Data</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="page">25</biblScope>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Word sense disambiguation: A survey</title>
		<author>
			<persName><forename type="first">R</forename><surname>Navigli</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM computing surveys (CSUR)</title>
		<imprint>
			<biblScope unit="volume">41</biblScope>
			<biblScope unit="page" from="1" to="69" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Analysis and Evaluation of Language Models for Word Sense Disambiguation</title>
		<author>
			<persName><forename type="first">D</forename><surname>Loureiro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Rezaee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">T</forename><surname>Pilehvar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Camacho-Collados</surname></persName>
		</author>
		<idno type="DOI">10.1162/coli_a_00405</idno>
		<ptr target="https://doi.org/10.1162/coli_a_00405.doi:10.1162/coli_a_00405" />
	</analytic>
	<monogr>
		<title level="j">Computational Linguistics</title>
		<imprint>
			<biblScope unit="volume">47</biblScope>
			<biblScope unit="page" from="387" to="443" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">We&apos;re afraid language models aren&apos;t modeling ambiguity</title>
		<author>
			<persName><forename type="first">A</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Michael</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Suhr</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>West</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Koller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Swayamdipta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">A</forename><surname>Smith</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Choi</surname></persName>
		</author>
		<ptr target="https://arxiv.org/abs/2304.14399" />
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">A comprehensive review on resolving ambiguities in natural language processing</title>
		<author>
			<persName><forename type="first">A</forename><surname>Yadav</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Patel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Shah</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.aiopen.2021.05.001</idno>
		<ptr target="http://dx.doi.org/10.1016/j.aiopen.2021.05.001.doi:10.1016/j.aiopen.2021.05.001" />
	</analytic>
	<monogr>
		<title level="j">AI Open</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="85" to="92" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Word sense disambiguation: A unified evaluation framework and empirical comparison</title>
		<author>
			<persName><forename type="first">A</forename><surname>Raganato</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Camacho-Collados</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Navigli</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/E17-1010" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics</title>
		<title level="s">Long Papers, Association for Computational Linguistics</title>
		<editor>
			<persName><forename type="first">M</forename><surname>Lapata</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">P</forename><surname>Blunsom</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Koller</surname></persName>
		</editor>
		<meeting>the 15th Conference of the European Chapter of the Association for Computational Linguistics<address><addrLine>Valencia, Spain</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="99" to="110" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<author>
			<persName><forename type="first">D</forename><surname>Loureiro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Rezaee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">T</forename><surname>Pilehvar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Camacho-Collados</surname></persName>
		</author>
		<idno>CoRR abs/2008.11608</idno>
		<ptr target="https://arxiv.org/abs/2008.11608.arXiv:2008.11608" />
		<title level="m">Language models and word sense disambiguation: An overview and analysis</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">From Word Types to Tokens and Back: A Survey of Approaches to Word Meaning Representation and Interpretation</title>
		<author>
			<persName><forename type="first">M</forename><surname>Apidianaki</surname></persName>
		</author>
		<idno type="DOI">10.1162/coli_a_00474</idno>
		<ptr target="https://doi.org/10.1162/coli_a_00474.doi:10.1162/coli_a_00474" />
	</analytic>
	<monogr>
		<title level="j">Computational Linguistics</title>
		<imprint>
			<biblScope unit="volume">49</biblScope>
			<biblScope unit="page" from="465" to="523" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Word meaning is both categorical and continuous</title>
		<author>
			<persName><forename type="first">S</forename><surname>Trott</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Bergen</surname></persName>
		</author>
		<idno type="DOI">10.1037/rev0000420</idno>
		<ptr target="http://dx.doi.org/10.1037/rev0000420.doi:10.1037/rev0000420" />
	</analytic>
	<monogr>
		<title level="j">Psychological Review</title>
		<imprint>
			<biblScope unit="volume">130</biblScope>
			<biblScope unit="page" from="1239" to="1261" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<title level="m" type="main">WordNet: An Electronic Lexical Database</title>
		<author>
			<persName><forename type="first">C</forename><surname>Fellbaum</surname></persName>
		</author>
		<idno type="DOI">10.7551/mitpress/7287.001.0001</idno>
		<ptr target="http://dx.doi.org/10.7551/mitpress/7287.001.0001.doi:10.7551/mitpress/7287.001.0001" />
		<imprint>
			<date type="published" when="1998">1998</date>
			<publisher>APA</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Wordnet then and now</title>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">A</forename><surname>Miller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Fellbaum</surname></persName>
		</author>
		<idno type="DOI">10.1007/s10579-007-9044-6</idno>
		<ptr target="http://dx.doi.org/10.1007/s10579-007-9044-6.doi:10.1007/s10579-007-9044-6" />
	</analytic>
	<monogr>
		<title level="j">Language Resources and Evaluation</title>
		<imprint>
			<biblScope unit="volume">41</biblScope>
			<biblScope unit="page" from="209" to="214" />
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Distributional memory: A general framework for corpus-based semantics</title>
		<author>
			<persName><forename type="first">M</forename><surname>Baroni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Lenci</surname></persName>
		</author>
		<idno type="DOI">10.1162/coli_a_00016</idno>
		<ptr target="https://aclanthology.org/J10-4006.doi:10.1162/coli_a_00016" />
	</analytic>
	<monogr>
		<title level="j">Computational Linguistics</title>
		<imprint>
			<biblScope unit="volume">36</biblScope>
			<biblScope unit="page" from="673" to="721" />
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Vector space models of word meaning and phrase meaning: A survey</title>
		<author>
			<persName><forename type="first">K</forename><surname>Erk</surname></persName>
		</author>
		<idno type="DOI">10.1002/lnco.362</idno>
		<ptr target="http://dx.doi.org/10.1002/lnco.362.doi:10.1002/lnco.362" />
	</analytic>
	<monogr>
		<title level="j">Language and Linguistics Compass</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="page" from="635" to="653" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Deep contextualized word representations</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">E</forename><surname>Peters</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Neumann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Iyyer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Gardner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Clark</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Zettlemoyer</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/N18-1202</idno>
		<ptr target="https://aclanthology.org/N18-1202.doi:10.18653/v1/N18-1202" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</title>
				<editor>
			<persName><forename type="first">M</forename><surname>Walker</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">H</forename><surname>Ji</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Stent</surname></persName>
		</editor>
		<meeting>the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies<address><addrLine>New Orleans, Louisiana</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="2227" to="2237" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Efficient estimation of word representations in vector space</title>
		<author>
			<persName><forename type="first">T</forename><surname>Mikolov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Corrado</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Dean</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of Workshop at ICLR</title>
				<meeting>Workshop at ICLR</meeting>
		<imprint>
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Enriching word vectors with subword information</title>
		<author>
			<persName><forename type="first">P</forename><surname>Bojanowski</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Grave</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Joulin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Mikolov</surname></persName>
		</author>
		<idno type="DOI">10.1162/tacl_a_00051</idno>
		<ptr target="https://aclanthology.org/Q17-1010.doi:10.1162/tacl_a_00051" />
	</analytic>
	<monogr>
		<title level="j">Transactions of the Association for Computational Linguistics</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="page" from="135" to="146" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Glove: Global vectors for word representation</title>
		<author>
			<persName><forename type="first">J</forename><surname>Pennington</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Socher</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">D</forename><surname>Manning</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP)</title>
				<meeting>the 2014 conference on empirical methods in natural language processing (EMNLP)</meeting>
		<imprint>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page" from="1532" to="1543" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<title level="m" type="main">Language models are few-shot learners</title>
		<author>
			<persName><forename type="first">T</forename><surname>Brown</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Mann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Ryder</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Subbiah</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">D</forename><surname>Kaplan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Dhariwal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Neelakantan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Shyam</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Sastry</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Askell</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">A</forename><surname>Cruse</surname></persName>
		</author>
		<title level="m">Cambridge textbooks in linguistics: Lexical semantics</title>
				<meeting><address><addrLine>Cambridge, England</addrLine></address></meeting>
		<imprint>
			<publisher>Cambridge University Press</publisher>
			<date type="published" when="1986">1986</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">WiC: the word-in-context dataset for evaluating contextsensitive meaning representations</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">T</forename><surname>Pilehvar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Camacho-Collados</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/N19-1128</idno>
		<ptr target="https://aclanthology.org/N19-1128.doi:10.18653/v1/N19-1128" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</title>
		<title level="s">Long and Short Papers</title>
		<editor>
			<persName><forename type="first">J</forename><surname>Burstein</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">C</forename><surname>Doran</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">T</forename><surname>Solorio</surname></persName>
		</editor>
		<meeting>the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies<address><addrLine>Minneapolis, Minnesota</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="1267" to="1273" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">XL-WiC: A multilingual benchmark for evaluating semantic contextualization</title>
		<author>
			<persName><forename type="first">A</forename><surname>Raganato</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Pasini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Camacho-Collados</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">T</forename><surname>Pilehvar</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2020.emnlp-main.584</idno>
		<ptr target="https://aclanthology.org/2020.emnlp-main.584.doi:10.18653/v1/2020.emnlp-main.584" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), Association for Computational Linguistics</title>
				<editor>
			<persName><forename type="first">B</forename><surname>Webber</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">T</forename><surname>Cohn</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Y</forename><surname>He</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Y</forename><surname>Liu</surname></persName>
		</editor>
		<meeting>the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), Association for Computational Linguistics</meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="7193" to="7206" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">ConSeC: Word sense disambiguation as continuous sense comprehension</title>
		<author>
			<persName><forename type="first">E</forename><surname>Barba</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Procopio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Navigli</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2021.emnlp-main.112</idno>
		<ptr target="https://aclanthology.org/2021.emnlp-main.112.doi:10.18653/v1/2021.emnlp-main.112" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Online and</title>
				<editor>
			<persName><forename type="first">M.-F</forename><surname>Moens</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">X</forename><surname>Huang</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><surname>Specia</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><forename type="middle">W</forename><surname>.-T</surname></persName>
		</editor>
		<editor>
			<persName><surname>Yih</surname></persName>
		</editor>
		<meeting>the 2021 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Online and<address><addrLine>Punta Cana, Dominican Republic</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="1492" to="1503" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">XL-WSD: An extra-large and cross-lingual evaluation framework for word sense disambiguation</title>
		<author>
			<persName><forename type="first">T</forename><surname>Pasini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Raganato</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Navigli</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of AAAI</title>
				<meeting>of AAAI</meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="13648" to="13656" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<monogr>
		<title level="m" type="main">A survey on lexical ambiguity detection and word sense disambiguation</title>
		<author>
			<persName><forename type="first">M</forename><surname>Abeysiriwardana</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Sumanathilaka</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2403.16129</idno>
		<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Galnet: o wordnet do galego. aplicacións lexicolóxicas e terminolóxicas</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Solla Portela</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Guinovart</surname></persName>
		</author>
		<idno type="DOI">10.17979/rgf.2015.16.0.1383</idno>
		<ptr target="http://dx.doi.org/10.17979/rgf.2015.16.0.1383.doi:10.17979/rgf.2015.16.0.1383" />
	</analytic>
	<monogr>
		<title level="j">Revista Galega de Filoloxía</title>
		<imprint>
			<biblScope unit="volume">16</biblScope>
			<biblScope unit="page" from="169" to="201" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">Galnet: Wordnet 3.0 do galego</title>
		<author>
			<persName><forename type="first">X</forename><forename type="middle">Gómez</forename><surname>Guinovart</surname></persName>
		</author>
		<ptr target="https://www.linguamatica.com/index.php/linguamatica/article/view/91" />
	</analytic>
	<monogr>
		<title level="j">Linguamática</title>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="page" from="61" to="67" />
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<analytic>
		<title level="a" type="main">Multilingual central repository version 3.0</title>
		<author>
			<persName><forename type="first">A</forename><surname>Gonzalez-Agirre</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Laparra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Rigau</surname></persName>
		</author>
		<ptr target="http://www.lrec-conf.org/proceedings/lrec2012/pdf/293_Paper.pdf" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC&apos;12), European Language Resources Association (ELRA)</title>
				<editor>
			<persName><forename type="first">N</forename><surname>Calzolari</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">K</forename><surname>Choukri</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">T</forename><surname>Declerck</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><forename type="middle">U</forename><surname>Doğan</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">B</forename><surname>Maegaard</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Mariani</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Moreno</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Odijk</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Piperidis</surname></persName>
		</editor>
		<meeting>the Eighth International Conference on Language Resources and Evaluation (LREC&apos;12), European Language Resources Association (ELRA)<address><addrLine>Istanbul, Turkey</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2012">2012</date>
			<biblScope unit="page" from="2525" to="2529" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<analytic>
		<title level="a" type="main">Attention is all you need</title>
		<author>
			<persName><forename type="first">A</forename><surname>Vaswani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Shazeer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Parmar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Uszkoreit</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Jones</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">N</forename><surname>Gomez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">U</forename><surname>Kaiser</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Polosukhin</surname></persName>
		</author>
		<ptr target="https://proceedings.neurips.cc/paper_files/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf" />
	</analytic>
	<monogr>
		<title level="m">Advances in Neural Information Processing Systems</title>
				<editor>
			<persName><forename type="first">I</forename><surname>Guyon</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">U</forename><forename type="middle">V</forename><surname>Luxburg</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Bengio</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">H</forename><surname>Wallach</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">R</forename><surname>Fergus</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Vishwanathan</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">R</forename><surname>Garnett</surname></persName>
		</editor>
		<imprint>
			<publisher>Curran Associates, Inc</publisher>
			<date type="published" when="2017">2017</date>
			<biblScope unit="volume">30</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<analytic>
		<title level="a" type="main">Unsupervised cross-lingual representation learning at scale</title>
		<author>
			<persName><forename type="first">A</forename><surname>Conneau</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Khandelwal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Goyal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Chaudhary</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Wenzek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Guzmán</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Grave</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Ott</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Zettlemoyer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Stoyanov</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2020.acl-main.747</idno>
		<ptr target="https://aclanthology.org/2020.acl-main.747.doi:10.18653/v1/2020.acl-main.747" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics</title>
				<editor>
			<persName><forename type="first">D</forename><surname>Jurafsky</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Chai</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">N</forename><surname>Schluter</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Tetreault</surname></persName>
		</editor>
		<meeting>the 58th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics</meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="8440" to="8451" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b30">
	<analytic>
		<title level="a" type="main">Deberta: Decoding-enhanced bert with disentangled attention</title>
		<author>
			<persName><forename type="first">P</forename><surname>He</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Gao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Chen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Learning Representations</title>
				<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<analytic>
		<title level="a" type="main">Contextualized word embeddings encode aspects of human-like word sense knowledge</title>
		<author>
			<persName><forename type="first">S</forename><surname>Nair</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Srinivasan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Meylan</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/2020.cogalex-1.16" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Workshop on the Cognitive Aspects of the Lexicon, Association for Computational Linguistics</title>
				<editor>
			<persName><forename type="first">M</forename><surname>Zock</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">E</forename><surname>Chersoni</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Lenci</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">E</forename><surname>Santus</surname></persName>
		</editor>
		<meeting>the Workshop on the Cognitive Aspects of the Lexicon, Association for Computational Linguistics</meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="129" to="141" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b32">
	<analytic>
		<title level="a" type="main">Exploring the representation of word meanings in context: A case study on homonymy and synonymy</title>
		<author>
			<persName><forename type="first">M</forename><surname>Garcia</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2021.acl-long.281</idno>
		<ptr target="https://aclanthology.org/2021.acl-long.281.doi:10.18653/v1/2021.acl-long.281" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing</title>
		<title level="s">Long Papers</title>
		<editor>
			<persName><forename type="first">C</forename><surname>Zong</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">F</forename><surname>Xia</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">W</forename><surname>Li</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">R</forename><surname>Navigli</surname></persName>
		</editor>
		<meeting>the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing</meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="3625" to="3640" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b33">
	<monogr>
		<title level="m" type="main">Bidirectional transformer representations of (spanish) ambiguous words in context: A new lexical resource and empirical analysis</title>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">D</forename><surname>Rivière</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">L</forename><surname>Beatty-Martínez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Trott</surname></persName>
		</author>
		<ptr target="https://arxiv.org/abs/2406.14678.arXiv:2406.14678" />
		<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b34">
	<analytic>
		<title level="a" type="main">Scope ambiguities in large language models</title>
		<author>
			<persName><forename type="first">G</forename><surname>Kamath</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Schuster</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Vajjala</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Reddy</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Transactions of the Association for Computational Linguistics</title>
		<imprint>
			<biblScope unit="volume">12</biblScope>
			<biblScope unit="page" from="738" to="754" />
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b35">
	<analytic>
		<title level="a" type="main">Do large language models know what humans know?</title>
		<author>
			<persName><forename type="first">S</forename><surname>Trott</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Jones</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Chang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Michaelov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Bergen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Cognitive Science</title>
		<imprint>
			<biblScope unit="volume">47</biblScope>
			<biblScope unit="page">e13309</biblScope>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b36">
	<analytic>
		<title level="a" type="main">Does bert make any sense? interpretable word sense disambiguation with contextualized embeddings</title>
		<author>
			<persName><forename type="first">G</forename><surname>Wiedemann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Remus</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Chawla</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Biemann</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 15th Conference on Natural Language Processing (KONVENS 2019): Long Papers, German Society for Computational Linguistics &amp; Language Technology</title>
				<meeting>the 15th Conference on Natural Language Processing (KONVENS 2019): Long Papers, German Society for Computational Linguistics &amp; Language Technology<address><addrLine>Erlangen, Germany</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="161" to="170" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b37">
	<analytic>
		<title level="a" type="main">Do large language models resolve semantic ambiguities in the same way as humans? the case of word segmentation in chinese sentence reading</title>
		<author>
			<persName><forename type="first">W</forename><surname>Liao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Shum</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">B</forename><surname>Chan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Hsiao</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Annual Meeting of the Cognitive Science Society</title>
				<meeting>the Annual Meeting of the Cognitive Science Society</meeting>
		<imprint>
			<date type="published" when="2024">2024</date>
			<biblScope unit="volume">46</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b38">
	<analytic>
		<title level="a" type="main">Word sense distance in human similarity judgements and contextualised word embeddings</title>
		<author>
			<persName><forename type="first">J</forename><surname>Haber</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Poesio</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/2020.pam-1.17" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Probability and Meaning Conference (PaM 2020), Association for Computational Linguistics</title>
				<editor>
			<persName><forename type="first">C</forename><surname>Howes</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Chatzikyriakidis</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Ek</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">V</forename><surname>Somashekarappa</surname></persName>
		</editor>
		<meeting>the Probability and Meaning Conference (PaM 2020), Association for Computational Linguistics<address><addrLine>Gothenburg</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="128" to="145" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b39">
	<analytic>
		<title level="a" type="main">Wordnet expansion with bilingual word embeddings and neural machine translation</title>
		<author>
			<persName><forename type="first">M</forename><surname>Vázquez Abuín</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Garcia</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">EPIA Conference on Artificial Intelligence</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b40">
	<monogr>
		<author>
			<persName><forename type="first">P</forename><surname>Gamallo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Bardanca</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">R</forename><surname>Pichel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Garcia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Rodríguez-Rey</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>De Dios-Flores</surname></persName>
		</author>
		<ptr target="https://huggingface.co/proxectonos/NOS-MT-OpenNMT-en-gl" />
		<title level="m">Nos_mt-opennmt-en-gl</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b41">
	<analytic>
		<title level="a" type="main">Analizadores multilingües en freeling</title>
		<author>
			<persName><forename type="first">L</forename><surname>Padró</surname></persName>
		</author>
		<ptr target="https://linguamatica.com/index.php/linguamatica/article/view/115" />
	</analytic>
	<monogr>
		<title level="j">Linguamática</title>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="page" from="13" to="20" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b42">
	<analytic>
		<title level="a" type="main">UDPipe 2.0 prototype at CoNLL 2018 UD shared task</title>
		<author>
			<persName><forename type="first">M</forename><surname>Straka</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/K18-2020</idno>
		<ptr target="https://aclanthology.org/K18-2020.doi:10.18653/v1/K18-2020" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, Association for Computational Linguistics</title>
				<editor>
			<persName><forename type="first">D</forename><surname>Zeman</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Hajič</surname></persName>
		</editor>
		<meeting>the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, Association for Computational Linguistics<address><addrLine>Brussels, Belgium</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="197" to="207" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
