<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Building Foundations for Inclusiveness through Expert-Annotated Data</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Moreno</forename><surname>La Quatra</surname></persName>
							<email>moreno.laquatra@unikore.it</email>
							<affiliation key="aff0">
								<orgName type="institution">Kore University of Enna</orgName>
								<address>
									<settlement>Enna</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Salvatore</forename><surname>Greco</surname></persName>
							<email>salvatore_greco@polito.it</email>
							<affiliation key="aff1">
								<orgName type="institution">Politecnico di Torino</orgName>
								<address>
									<settlement>Turin</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Luca</forename><surname>Cagliero</surname></persName>
							<email>luca.cagliero@polito.it</email>
							<affiliation key="aff1">
								<orgName type="institution">Politecnico di Torino</orgName>
								<address>
									<settlement>Turin</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Michela</forename><surname>Tonti</surname></persName>
							<email>michela.tonti@unibg.it</email>
							<affiliation key="aff2">
								<orgName type="institution">Università degli studi di Bergamo</orgName>
								<address>
									<settlement>Bergamo</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Francesca</forename><surname>Dragotto</surname></persName>
							<email>francesca.dragotto@gmail.com</email>
							<affiliation key="aff3">
								<orgName type="institution">Università degli Studi di Roma Tor Vergata</orgName>
								<address>
									<settlement>Rome</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Rachele</forename><surname>Raus</surname></persName>
							<email>rachele.raus@unibo.it</email>
							<affiliation key="aff4">
								<orgName type="institution">Università di Bologna</orgName>
								<address>
									<settlement>Bologna</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Stefania</forename><surname>Cavagnoli</surname></persName>
							<email>stefania.cavagnoli@uniroma2.it</email>
							<affiliation key="aff3">
								<orgName type="institution">Università degli Studi di Roma Tor Vergata</orgName>
								<address>
									<settlement>Rome</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Tania</forename><surname>Cerquitelli</surname></persName>
							<email>tania.cerquitelli@polito.it</email>
							<affiliation key="aff1">
								<orgName type="institution">Politecnico di Torino</orgName>
								<address>
									<settlement>Turin</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Building Foundations for Inclusiveness through Expert-Annotated Data</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">7A56D47F8EB7FB68981B6B0C57BB6F1A</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T17:17+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>inclusive language</term>
					<term>natural language processing</term>
					<term>text generation</term>
					<term>deep learning</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Natural Language Understanding and Generation models suffer from a limited capability of understanding the nuances of inclusive communication as they are trained on massive data, often including significant portions of non-inclusive content. Even when the models are specifically designed to address non-inclusive language detection or reformulation, they disregard, to a large extent, inclusivenessrelated features that are likely correlated with the inclusive language nuances, such as the discourse type, level of inclusiveness, and intended context of use. To assess the importance of additional inclusiveness-related features, we collect a new corpus of Italian administrative documents humanly annotated by linguistic experts. Linguistic experts not only highlight non-inclusive text snippets and propose possible reformulations, but also annotate multi-aspect labels related to different inclusive language nuances. We empirically show that a multi-task learning approach that leverages the multi-aspect annotations can improve the non-inclusive text reformulation performance, thereby confirming the potential of expert-annotated data in inclusive language processing.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Non-inclusive expressions are widespread in humanly written documents <ref type="bibr" target="#b0">[1]</ref>. Training Natural Language Understanding and Generation models on massive data exposes them to bias issues related to language inclusiveness. Addressing this issue is particularly relevant because Artificial Intelligence (AI)-based solutions must be used responsibly to correctly model inclusive language practices and not unintentionally marginalize or disadvantage certain groups.</p><p>To mitigate the presence of bias in data, applications based on AI rely on human supervision for model training and post-processing evaluation. This is quite common in the areas of Natural Language Understanding and Generative AI, in which applications like Large Language Models (LLMs) provide end-users with conversational and language editing services <ref type="bibr" target="#b1">[2]</ref>.</p><p>The computational linguistic community has agreed on the need to leverage human expert annotations in experience-based learning for bias detection and mitigation <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b3">4,</ref><ref type="bibr" target="#b4">5,</ref><ref type="bibr" target="#b5">6]</ref>. However, the linguistics literature often underestimates the importance of linguistic annotators because of the widespread tendency to value the figures of preand post-editors <ref type="bibr" target="#b6">[7,</ref><ref type="bibr" target="#b7">8]</ref>. Editing and annotation are substan-tially different: while language editing tools rewrite parts of the source text based on predefined expert-provided rules, Natural Language Understanding and Generation models can leverage annotations to capture the nuances of annotated text in a self-supervised manner. The use of textual annotations also relieves annotators of the task of explicitly formulating or adhering to ad hoc linguistic rules.</p><p>In the context of inclusive language understanding and generation, most of the previous work exploits rule-based or round-trip translations to annotate texts for inclusivity issues <ref type="bibr" target="#b8">[9,</ref><ref type="bibr" target="#b9">10,</ref><ref type="bibr" target="#b10">11,</ref><ref type="bibr" target="#b11">12]</ref>. However, these works often overlook the significance of human expert annotations, opting instead for rule-based approaches or artificially created datasets generated through round-trip translations. The role of linguistic annotators in providing specific understanding and annotations of language data is crucial for developing more inclusive AI models <ref type="bibr" target="#b12">[13,</ref><ref type="bibr" target="#b13">14]</ref>.</p><p>A limited body of work has been devoted to generating and exploiting multi-faceted expert human annotations to drive AI models for inclusive language, e.g., <ref type="bibr" target="#b14">[15,</ref><ref type="bibr" target="#b15">16,</ref><ref type="bibr" target="#b16">17]</ref>. However, existing benchmarks of annotated text for inclusive language processing neglect potentially relevant aspects such as the level of inclusiveness, the intended context of use, and the text genre. These aspects have the potential to improve the inclusive language understanding and generation capabilities of AI models.</p><p>This paper proposes an expert-annotated dataset covering these new aspects and investigates their usefulness in enhancing the performance of the task of non-inclusive text reformulation in the absence of rule-based editing models.</p><p>To this end, we enrich a corpus of Italian administrative documents with multi-aspect annotations, providing more insights into the inclusive language nuances. The purpose is to enable the study of new features describing inclusiveness aspects neglected by existing approaches, such as the level of inclusiveness, register, and genre. By enriching the language descriptions with new inclusiveness-related features, we provide the research community with new resources to enhance the understanding and writing capabilities of AIbased solutions.</p><p>We also collect preliminary results on the use of multiaspect annotations in a multi-task learning approach to enhance non-inclusive language reformulations. The results confirm the potential of the inclusiveness-related expert annotations.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">The annotation process</head><p>The term annotation is often used to indicate the process by which textual data are subjected to a tightly interrelated two-phase activity <ref type="bibr" target="#b5">[6]</ref>: a) Identification, selection, and localisation of specific documents, and b) Interpretation and labeling of those documents. The first phase entails identifying and detailing the text segments that exhibit the linguistic phenomenon under investigation. Subsequently, in the interpretation phase, the selected occurrences are humanly labeled. These annotations may encompass various forms ranging from a selection of pre-established alternatives to free-text comments or possible reformulations.</p><p>Unlike human annotators, AI models often lack cognitive abilities such as common sense reasoning and generalization capabilities due to the relatively limited numbers of linguistic examples used for model training compared to the impressive variety of natural language forms.</p><p>Human annotators need sufficient expertise to interpret nuanced phenomena and assign appropriate labels adequately. Their annotations are at the base of a supervised learning process. The trained models can progressively learn from annotated data as automatized humans do, but at a scale not possible through manual work alone.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Annotation of Italian administrative documents.</head><p>We have designed and utilized a novel benchmark dataset for inclusive language writing in Italian. This dataset comprises administrative communications sourced from the Italian public administration, spanning across both national and regional levels. We annotate the corpus at the sentence level. To this end, we set up a heterogeneous team of 13 linguistic experts with diverse experiences and expertise in inclusive language. The team consists of predominantly female individuals, all native Italian speakers. All the annotators are educated: 57% have at least 10 years of experience in linguistics, and 50% have at least 3 years of experience in inclusive language. In addition, the annotators received, on average, about 30 hours of training specific to inclusive language annotations.</p><p>Each human annotator independently assigns inclusiveness-related metadata to the document sentences. Each sentence can be enriched with multiple annotations. The annotations consist of (a) The reformulation of any non-inclusive piece of text, i.e., an alternative inclusive form; (b) The level of inclusiveness of the input sentence indicating whether a sentence is non-inclusive, inclusive, or not pertinent; (c) The register or intended context of use, i.e., Standard, Specialized, or Informative/Educational; (d) the discourse type or genre, i.e., Legal, Administrative, Technical, or Informative/Educational.</p><p>Additional contextual aspects could be included in future annotations to enhance models' understanding of inclusive language usage further. By jointly providing those annotations, the experts aimed to capture inclusive language's nuanced, multi-faceted nature.</p><p>By learning language inclusiveness patterns from a diversified, context-dependent set of expert annotations, AI models gain exposure to subtle interpretive differences. The consistency across annotations is ensured through detailed guidelines and instructions provided to experts. Before full annotation, a collaborative analysis of a sample set identifies any divergent interpretations to refine guidance. Example of annotations. Table <ref type="table" target="#tab_2">2</ref> shows an example of an Italian annotated sentence (as well as the corresponding English translation for non-Italian readers). Linguistics experts assign different annotations to each sentence. In this example, they have assigned three labels to the sentence. Regarding inclusiveness, the sentence has been categorized as non-inclusive because it contains "Il Presidente" (i.e., Chair/President) and "Rettore" (i.e., Rector), which are masculine declensions of professional roles. In addition, the sentence also contains "suo decreto", which refers to a decree that comes from a male person, so the sentence is not inclusive. The discourse sequence is of the administrative type, as the content refers to an administrative topic, and the used language is specialized, as the content describes specific and technical aspects.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Statistics on annotated data.</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Case study: Leveraging Aspects for Italian Inclusive Language Reformulation</head><p>We conduct an empirical analysis to examine the impact of utilizing expert annotations in inclusive language generation. Specifically, we investigate the advantages of simultaneously addressing two key objectives: reformulating non-inclusive language and predicting various aspects of inclusiveness.</p><p>Tasks. Given a non-inclusive piece of text 𝑇 , the Non-Inclusive Language Reformulation (NILR) task aims at generating an equivalent inclusive natural language form. The NILR task is a sequence-to-sequence problem, where the input is a non-inclusive sentence and the output is the corresponding inclusive sentence.</p><p>Given 𝑇 and an aspect 𝐴, the goal is to predict the 𝐴's value for 𝑇 . 𝐴 can be the level of inclusiveness, register or intended context of use, and discourse type or genre. According to the aspect under analysis, the corresponding sub-tasks are denoted by Inclusiveness Level Classification (ILC), Register Classification (RC), and Genre Classification  Performance comparison between Single-and Multi-task Learning approaches in inclusive language generation, evaluated based on ROUGE scores (R-1, R-2, R-L) and human evaluation.</p><p>(GC). The ILC, RC, and GC tasks are treated as separate classification problems, where the input is a sentence and the output is the corresponding aspect value.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Single-vs. Multi-Task Learning</head><p>To compare the performance of models trained using different learning approaches, we conducted experiments in both single-task and multi-task learning settings.</p><p>In Single-Task Learning, we exclusively focus on the task of Non-Inclusive Language Reformulation (NILR), disregarding all aspect-related annotations. We leverage an encoderdecoder architecture, specifically BART-IT <ref type="bibr" target="#b17">[18]</ref>, which is a BART architecture <ref type="bibr" target="#b18">[19]</ref> pre-trained on a clean Italian corpus <ref type="bibr" target="#b19">[20]</ref>. The model is fine-tuned on the NILR task with the twofold objective of modifying the input sentence to make it inclusive while maintaining the original meaning.</p><p>Conversely, in Multi-Task Learning, we integrate the NILR task with Aspect Classification tasks during training (i.e., ILC, RC, and GC). For the additional tasks, we specifically leverage the encoder component of the model, which extracts representations of the input text. The encoder component is additionally trained with a classification objective. Each task is associated with a separate classification head, trained to predict the corresponding aspect value for the input sentence. By interleaving these tasks during training, the model learns to simultaneously address NILR and create encoder representations that capture various aspects related to inclusiveness. Evaluation Metrics. We evaluate the quality of the text reformulation using a standard train-validation-test split on our expert-annotated data. To compare the automatically generated and expected reformulations, we use the established ROUGE F1-scores <ref type="bibr" target="#b20">[21]</ref>. They measure the unit overlap, in terms of the number of n-grams in common, between the two pieces of text. The larger the score, the higher the syntactic similarity. R-1, R-2, and R-L count the unit overlap in terms of unigrams, bigrams, and longest common subsequences, respectively.</p><p>To complement the quantitative evaluation, we also perform a qualitative evaluation of the achieved results. We involved six human evaluators who were asked to label each model-generated sentence as: correct if it accurately maintained the original meaning while using inclusive language appropriately for the context; partially correct if some aspects were reformed correctly, but others were missed or inaccurate; or not correct if the rewriting fundamentally failed to capture the original meaning or usage intention. This multi-level feedback aims at capturing the models' ability to perform the rewriting task sensitively across different scenarios beyond just string-matching metrics.</p><p>To each reformulation, we assign a score to each annotation as follows: 1 for correct, 0.5 for partially correct, and 0 for incorrect. The final score for each reformulation is computed as the average over all the expert annotations (𝑚 = 6). Finally, we average the scores for all the reformulations (𝑛 = 30) to obtain a single score for each model.</p><p>Results' overview. Columns 2, 3, and 4 in Table <ref type="table" target="#tab_3">3</ref> show the ROUGE scores for both models. The multi-task learning achieves the best performance on all the quantitative metrics. Regarding the human evaluation, we obtained 6 annotations for 30 reformulations for each model. For the model trained with the single task configuration, 93 were correct, 55 were partially correct, and 32 were incorrect. Instead, for the multi-task model, 101 were correct, 49 were partially correct, and 30 were incorrect. Column 5 reports the average human evaluation scores for both models. The human scores are coherent with the quantitative ones, showing that the model trained under multi-task settings benefits from the additional labels. Based on these preliminary results, we can conclude that the nuanced and multidimensional annotations of inclusive language have the potential to develop a more comprehensive approach to modeling inclusive language.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Conclusions</head><p>This paper discussed and experimentally demonstrated that the role and contribution of human annotators are of paramount importance in improving the quality of NLP results and the writing capability of generative approaches in inclusive communication. Starting from a new Italian administrative corpus, we enriched it with a variety of annotations with the help of a team of language experts. This included (i) reformulating gendered language and acronyms, (ii) rewriting to enhance readability for the visually impaired, and (iii) defining the intended context of use (register) and text genre. The preliminary experimental results on the annotated corpus are promising and highlight the potential of the newly proposed annotations to develop a more comprehensive and richer approach that improves the ability of the generative algorithm to propose comprehensive and integrative reformulations.</p><p>Limitations. i) The annotation is language-specific, limited to the Italian language, thereby constraining its utility in multilingual scenarios; and ii) It is formal communicationspecific. Tailored to tackle the challenge of inclusive language in administrative and academic settings, the natural language tasks are exclusively trained on administrative documents, potentially lacking suitability for diverse contexts like legal and web communications.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Future work.</head><p>As part of the E-MIMIC 1 (Empowering Multilingual Inclusive Communication) project, we are currently working on a multilingual annotation process to overcome these issues and foster inclusive communication across different domains and languages. A team of experts is annotating a large corpus of documents according to linguistic criteria to label linguistic resources in a multilingual setting.</p><p>Finally, we want to exploit text-based explainability techniques <ref type="bibr" target="#b21">[22,</ref><ref type="bibr" target="#b22">23]</ref> to perform further human validation of the models produced.</p><p>Ethical Considerations. All the gathered documents are public and therefore freely accessible on the internet. All references to proper names of people and institutions have been anonymized and replaced with random names for privacy reasons.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head></head><label></label><figDesc>Table 1 reports the number of annotated sentences for each aspect, separately for the training, validation, and test sets.</figDesc><table><row><cell cols="4">Task ID Train Validation Test</cell></row><row><cell>NILR</cell><cell>6491</cell><cell>956</cell><cell>579</cell></row><row><cell>ILC</cell><cell>9207</cell><cell>1421</cell><cell>866</cell></row><row><cell>RC</cell><cell>2167</cell><cell>338</cell><cell>247</cell></row><row><cell>GC</cell><cell>2166</cell><cell>338</cell><cell>248</cell></row><row><cell>Table 1</cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="4">Statistics on data. NILR=Non-Inclusive Language Reformulation,</cell></row><row><cell cols="4">ILC=Inclusiveness Level Classification, RC=Register Classifica-</cell></row><row><cell cols="2">tion, GC=Genre Classification.</cell><cell></cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 2</head><label>2</label><figDesc>Example of sentence annotations illustrating non-inclusive language reformulation in Italian (IT) and English (EN), along with corresponding inclusiveness classification, discursive sequence, and clear language class.</figDesc><table><row><cell>Setting</cell><cell>R-1</cell><cell>R-2</cell><cell>R-L</cell><cell>Human Eval</cell></row><row><cell>Single-Task</cell><cell>74.95</cell><cell>64.09</cell><cell>74.79</cell><cell>0.67</cell></row><row><cell>Multi-Task</cell><cell cols="3">75.58 64.37 75.36</cell><cell>0.70</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 3</head><label>3</label><figDesc></figDesc><table /></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>This study was carried out within the project "E-MIMIC: Empowering Multilingual Inclusive Communication", funded by the Ministero dell'Universitá e della Ricerca -with the PRIN 2022 (D.D. 104 -02/02/2022) program.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Three recommended inclusive language guidelines for scholarly publishing: Words matter</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">J</forename><surname>Ashwell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">K</forename><surname>Baskin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">L</forename><surname>Christiansen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">A</forename><surname>Dibari</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Flanagin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Frey</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Jemison</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Ricci</surname></persName>
		</author>
		<idno type="DOI">10.1002/LEAP.1527</idno>
		<ptr target="https://doi.org/10.1002/leap.1527.doi:10.1002/LEAP.1527" />
	</analytic>
	<monogr>
		<title level="j">Learn. Publ</title>
		<imprint>
			<biblScope unit="volume">36</biblScope>
			<biblScope unit="page" from="94" to="99" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Automatic identification of harmful, aggressive, abusive, and offensive language on the web: A survey of technical biases informed by psychology literature</title>
		<author>
			<persName><forename type="first">A</forename><surname>Balayn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Szlávik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Bozzon</surname></persName>
		</author>
		<idno type="DOI">10.1145/3479158</idno>
		<ptr target="https://doi.org/10.1145/3479158.doi:10.1145/3479158" />
	</analytic>
	<monogr>
		<title level="j">ACM Trans. Soc. Comput</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="page">56</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Bias decreases in proportion to the number of annotators</title>
		<author>
			<persName><forename type="first">R</forename><surname>Artstein</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Poesio</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of FG-MoL 2005: The 10th conference on Formal Grammar and The 9th Meeting on</title>
				<meeting>FG-MoL 2005: The 10th conference on Formal Grammar and The 9th Meeting on</meeting>
		<imprint>
			<date type="published" when="2009">2009</date>
			<biblScope unit="volume">139</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Inter-coder agreement for computational linguistics</title>
		<author>
			<persName><forename type="first">R</forename><surname>Artstein</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Poesio</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Computational linguistics</title>
		<imprint>
			<biblScope unit="volume">34</biblScope>
			<biblScope unit="page" from="555" to="596" />
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Assessing agreement on classification tasks: The kappa statistic</title>
		<author>
			<persName><forename type="first">J</forename><surname>Carletta</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/J96-2004" />
	</analytic>
	<monogr>
		<title level="j">Computational Linguistics</title>
		<imprint>
			<biblScope unit="volume">22</biblScope>
			<biblScope unit="page" from="249" to="254" />
			<date type="published" when="1996">1996</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">What determines inter-coder agreement in manual annotations? a meta-analytic investigation</title>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">S</forename><surname>Bayerl</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">I</forename><surname>Paul</surname></persName>
		</author>
		<idno type="DOI">10.1162/COLI_a_00074</idno>
		<ptr target="https://aclanthology.org/J11-4004.doi:10.1162/COLI_a_00074" />
	</analytic>
	<monogr>
		<title level="j">Computational Linguistics</title>
		<imprint>
			<biblScope unit="volume">37</biblScope>
			<biblScope unit="page" from="699" to="725" />
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">Dalla zairja alla traduzione automatica: riflessioni sulla traduzione nell&apos;era digitale</title>
		<author>
			<persName><forename type="first">J</forename><surname>Monti</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2019">2019</date>
			<publisher>Loffredo</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Selecting and preparing texts for machine translation: Pre-editing and writing for a global audience, Machine translation for everyone: Empowering users in the age of</title>
		<author>
			<persName><forename type="first">P</forename><surname>Sánchez-Gijón</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Kenny</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">artificial intelligence</title>
		<imprint>
			<biblScope unit="volume">18</biblScope>
			<biblScope unit="page">81</biblScope>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">User-centric gender rewriting</title>
		<author>
			<persName><forename type="first">B</forename><surname>Alhafni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Habash</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Bouamor</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2022.naacl-main.46</idno>
		<ptr target="https://aclanthology.org/2022.naacl-main.46.doi:10.18653/v1/2022.naacl-main.46" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Association for Computational Linguistics</title>
				<meeting>the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Association for Computational Linguistics<address><addrLine>Seattle, United States</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="618" to="631" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Exploiting biased models to de-bias text: A gender-fair rewriting model</title>
		<author>
			<persName><forename type="first">C</forename><surname>Amrhein</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Schottmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Sennrich</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Läubli</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2023.acl-long.246</idno>
		<ptr target="https://aclanthology.org/2023.acl-long.246.doi:10.18653/v1/2023.acl-long.246" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)</title>
				<meeting>the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)<address><addrLine>Toronto, Canada</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="4486" to="4506" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<title level="m" type="main">theirs: Rewriting with gender-neutral english</title>
		<author>
			<persName><forename type="first">T</forename><surname>Sun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Webster</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Shah</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">Y</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Johnson</surname></persName>
		</author>
		<author>
			<persName><surname>They</surname></persName>
		</author>
		<author>
			<persName><surname>Them</surname></persName>
		</author>
		<idno>CoRR abs/2102.06788</idno>
		<ptr target="https://arxiv.org/abs/2102.06788.arXiv:2102.06788" />
		<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Neu-Tral Rewriter: A rule-based and neural approach to automatic rewriting into gender neutral alternatives</title>
		<author>
			<persName><forename type="first">E</forename><surname>Vanmassenhove</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Emmery</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Shterionov</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2021.emnlp-main.704</idno>
		<ptr target="https://aclanthology.org/2021.emnlp-main.704.doi:10.18653/v1/2021.emnlp-main.704" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Online and</title>
				<meeting>the 2021 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Online and<address><addrLine>Punta Cana, Dominican Republic</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="8940" to="8948" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Gender neutralization for an inclusive machine translation: from theoretical foundations to open challenges</title>
		<author>
			<persName><forename type="first">A</forename><surname>Piergentili</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Fucci</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Savoldi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Bentivogli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Negri</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/2023.gitt-1.7" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the First Workshop on Gender-Inclusive Translation Technologies, European Association for Machine Translation</title>
				<meeting>the First Workshop on Gender-Inclusive Translation Technologies, European Association for Machine Translation<address><addrLine>Tampere, Finland</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="71" to="83" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Beyond obscuration and visibility: Thoughts on the different strategies of genderfair language in italian</title>
		<author>
			<persName><forename type="first">M</forename><surname>Rosola</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Frenda</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">T</forename><surname>Cignarella</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Pellegrini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Marra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Floris</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 9th Italian Conference on Computational Linguistics</title>
				<meeting>the 9th Italian Conference on Computational Linguistics<address><addrLine>Venice, Italy</addrLine></address></meeting>
		<imprint>
			<publisher>CEUR-WS</publisher>
			<date type="published" when="2023-11-30">2023. November 30-December 2, 2023. 2023</date>
			<biblScope unit="volume">3596</biblScope>
			<biblScope unit="page" from="1" to="10" />
		</imprint>
	</monogr>
	<note>CLiC-it</note>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">E-mimic: Empowering multilingual inclusive communication</title>
		<author>
			<persName><forename type="first">G</forename><surname>Attanasio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Greco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>La Quatra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Cagliero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Tonti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Cerquitelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Raus</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">2021 IEEE International Conference on Big Data (Big Data)</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="4227" to="4234" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Inclusively: An ai-based assistant for inclusive writing</title>
		<author>
			<persName><forename type="first">M</forename><surname>La Quatra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Greco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Cagliero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Cerquitelli</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Joint European Conference on Machine Learning and Knowledge Discovery in Databases</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="361" to="365" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">L´analyse du discours et l´intelligence artificielle pour réaliser une écriture inclusive : le projet emimic</title>
		<author>
			<persName><forename type="first">Rachele</forename><surname>Raus</surname></persName>
		</author>
		<author>
			<persName><surname>Tonti</surname></persName>
		</author>
		<author>
			<persName><surname>Michela</surname></persName>
		</author>
		<author>
			<persName><surname>Cerquitelli</surname></persName>
		</author>
		<author>
			<persName><surname>Tania</surname></persName>
		</author>
		<author>
			<persName><surname>Cagliero</surname></persName>
		</author>
		<author>
			<persName><surname>Luca</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Giuseppe</forename><surname>Attanasio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">La</forename><surname>Quatra</surname></persName>
		</author>
		<author>
			<persName><surname>Moreno</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Salvatore</forename><surname>Greco</surname></persName>
		</author>
		<idno type="DOI">10.1051/shsconf/202213801007</idno>
		<ptr target="https://doi.org/10.1051/shsconf/202213801007.doi:10.1051/shsconf/202213801007" />
	</analytic>
	<monogr>
		<title level="j">SHS Web Conf</title>
		<imprint>
			<biblScope unit="volume">138</biblScope>
			<biblScope unit="page">1007</biblScope>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Bart-it: An efficient sequence-tosequence model for italian text summarization</title>
		<author>
			<persName><forename type="first">La</forename><surname>Quatra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Cagliero</forename></persName>
		</author>
		<idno type="DOI">10.3390/fi15010015</idno>
		<ptr target="http://dx.doi.org/10.3390/fi15010015.doi:10.3390/fi15010015" />
	</analytic>
	<monogr>
		<title level="j">Future Internet</title>
		<imprint>
			<biblScope unit="volume">15</biblScope>
			<biblScope unit="page">15</biblScope>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">BART: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension</title>
		<author>
			<persName><forename type="first">M</forename><surname>Lewis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Goyal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Ghazvininejad</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Mohamed</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Levy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Stoyanov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Zettlemoyer</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2020.acl-main.703</idno>
		<ptr target="https://aclanthology.org/2020.acl-main.703.doi:10.18653/v1/2020.acl-main.703" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics</title>
				<meeting>the 58th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics</meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="7871" to="7880" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<author>
			<persName><forename type="first">G</forename><surname>Sarti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Nissim</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2203.03759</idno>
		<title level="m">It5: Large-scale text-to-text pretraining for italian language understanding and generation</title>
				<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">ROUGE: A package for automatic evaluation of summaries</title>
		<author>
			<persName><forename type="first">C.-Y</forename><surname>Lin</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/W04-1013" />
	</analytic>
	<monogr>
		<title level="m">Text Summarization Branches Out, Association for Computational Linguistics</title>
				<meeting><address><addrLine>Barcelona, Spain</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2004">2004</date>
			<biblScope unit="page" from="74" to="81" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Inseq: An interpretability toolkit for sequence generation models</title>
		<author>
			<persName><forename type="first">G</forename><surname>Sarti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Feldhus</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Sickert</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Van Der Wal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Nissim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Bisazza</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2023.acl-demo.40</idno>
		<ptr target="https://aclanthology.org/2023.acl-demo.40.doi:10.18653/v1/2023.acl-demo.40" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), Association for Computational Linguistics</title>
				<meeting>the 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), Association for Computational Linguistics<address><addrLine>Toronto, Canada</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="421" to="435" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Trusting deep learning natural-language models via local and global explanations</title>
		<author>
			<persName><forename type="first">F</forename><surname>Ventura</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Greco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Apiletti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Cerquitelli</surname></persName>
		</author>
		<idno type="DOI">10.1007/s10115-022-01690-9</idno>
		<ptr target="https://doi.org/10.1007/s10115-022-01690-9.doi:10.1007/s10115-022-01690-9" />
	</analytic>
	<monogr>
		<title level="j">Knowl. Inf. Syst</title>
		<imprint>
			<biblScope unit="volume">64</biblScope>
			<biblScope unit="page" from="1863" to="1907" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
