<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Comparative Evaluation of Humour Translation from English to Spanish: A Study with BLOOM and Googletrans Notebook for the JOKER Lab at CLEF 2024</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Olga</forename><surname>Popova</surname></persName>
							<email>olga.popova@uca.es</email>
							<affiliation key="aff0">
								<orgName type="institution">University of Cadiz</orgName>
								<address>
									<addrLine>9 Paseo Carlos III St</addrLine>
									<postCode>11003</postCode>
									<settlement>Cadiz</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Comparative Evaluation of Humour Translation from English to Spanish: A Study with BLOOM and Googletrans Notebook for the JOKER Lab at CLEF 2024</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">EFAD624DCB28D267ED5DFC9636C4DFE9</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T17:56+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Translation</term>
					<term>pun translation</term>
					<term>automatic translation</term>
					<term>CLEF2024</term>
					<term>JOKER</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The problem of accurately translating wordplay in automatic humour analysis remains challenging, as highlighted by the CLEF 2024 JOKER Track. This problem is particularly interesting because humour is deeply cultural and context-dependent, making it difficult for language models to handle. Our study compares last year's and this year's results to determine if there have been improvements in the automatic translation of wordplay using BLOOM and Googletrans. The findings indicate that while numerical metrics show high precision, recall, and F1 scores, the actual translation and conveyance of jokes' meanings still rely heavily on coincidence, underscoring the need for further training and enhancement of language models.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>According to the Overview of the CLEF 2024 JOKER Track (Automatic Humour Analysis), this Lab was established in 2022 at the Conference and Labs of the Evaluation Forum (CLEF) <ref type="bibr" target="#b0">[1]</ref>. The author of this article is participating in the tasks of this laboratory for the second consecutive year <ref type="bibr" target="#b1">[2]</ref>. In this way, this article will attempt to compare last year's results with this year's results to see if the automatic translation of wordplay using language models and automatic translators has improved.</p><p>Before sharing the methodology and the results of this year's runs in comparison with last year's, we will describe the JOKER tasks for this year, which include three different tasks:</p><p>• Task 1: Humour-aware information retrieval • Task 2: Humour classification according to genre and technique • Task 3: Translation of puns from English to French This year, task 3 includes translation only from English to French, whereas in CLEF 2023, it also included Spanish. However, despite the exclusion of Spanish this year, the organizers of JOKER Track at CLEF 2024 allowed the author of this article to carry out the task using the Spanish language. Therefore, these working notes will focus solely on describing task 3 of JOKER Track.</p><p>The overall objective of this paper is to translate puns from English to Spanish using BLOOM and Google Translate. BLOOM is the only language model used, as the previous study found that it produced better results than all other models, including GPT, Simple T5, EasyNMT-Opus, and EasyNMT-mbart <ref type="bibr" target="#b1">[2]</ref>.</p><p>These working notes are organized as follows: after the introduction is the Experimental Setup section dedicated to the approach (data and method description); the third section is Experimental Results that includes some tables with metrics and scores, comparing the results of the task 3 of CLEF 2023 and CLEF 2024; and the fourth section is the discussion of the results and conclusions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Experimental Setup</head><p>In this section we will give a brief description of the data provided to perform task 3, the methods used to solve it and the evaluation methods.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Data description</head><p>The task will provide an updated test set of English punning jokes, for French language the training data is also available. However, considering that the training data for Spanish is not needed for this experiment, we only use the test data. The test data is provided in JSON format. As we can see, we were provided with 5,727 jokes in English to translate. The data itself looked as follows: She dumped him because of all their lousy dates. After all, whining and dining does get tiresome after a while.</p><p>Each entry in the table contains an identifying number for each joke, and a one-or two-sentence joke that includes a pun.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Method description</head><p>As mentioned in the introduction, to carry out task 3, we used BLOOM with two different prompts and Google Translate.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.1.">BLOOM</head><p>BigScience Large Open-science Open-access Multilingual Language Model (BLOOM) is a transformerbased language model <ref type="bibr" target="#b2">[3]</ref>. Because the number of tokens is limited in BLOOM, each run was performed with only 100 puns.</p><p>The prompts used are provided below.</p><p>• Prompt 1:</p><p>"Original: Diabetics should not be allowed to have sweet dreams.\n\ Translation: Los diabéticos no deberían tener dulces sueños.\n\ \n\ Original: I'm going to the guillotine at dawn and my wife has already collected my severance pay.\n\ Translation: Al amanecer me van a pasar por la guillotina y mi mujer ya ha firmado la separación.\n\ \n\ Original: After 5 years with the same chiropractor, I moved and had to change doctors. It was quite an adjustment.\n\ Translation: Me mudé y tuve que buscar otros médicos después de estar cinco años con el mismo quiropráctico. Fue un mero ajuste.\n\ \n\ Original: A scientist doing a large experiment with liquid chemicals was trying to solve a problem when he fell in and became part of the solution.\n\ Translation: Un científico que hacía un gran experimento con productos químicos líquidos estaba intentando solucionar un problema cuando cayó en que él se convertiría en parte de la solución.\n\ \n\ Original: Old electricians never die, they just keep plugging away.\n\ Translation:"</p><p>• Prompt 2:</p><p>"Request: Translate from English to Spanish: Diabetics should not be allowed to have sweet dreams.\n\ Answer: Los diabéticos no deberían permitirse soñar dulce (expresión figurativa para decir que no deberían permitirse deseos excesivos o indulgencias).\n\ \n\ Request: I'm going to the guillotine at dawn and my wife has already collected my severance pay.\n\ Answer: Me voy a la guillotina al amanecer y mi esposa ya ha recogido mi indemnización ( expresión figurativa para decir que me estoy preparando para un evento desafortunado y mi esposa ya ha preparado los asuntos financieros para el futuro sin mí).\n\ \n\ Request: After 5 years with the same chiropractor, I moved and had to change doctors. It was quite an adjustment.\n\ Answer:"</p><p>It is curious that when we created the second prompt, BLOOM itself, along with the Spanish translation in brackets, provided an explanation of the wordplay:</p><p>• 'expresión figurativa para decir que no deberían permitirse deseos excesivos o indulgencias' = 'figurative expression to say that excessive desires or indulgences should not be allowed' • 'expresión figurativa para decir que me estoy preparando para un evento desafortunado y mi esposa ya ha preparado los asuntos financieros para el futuro sin mí' = 'figurative expression to say that I am preparing for an unfortunate event and my wife has already prepared the financial affairs for the future without me'</p><p>Thus, it appears that BLOOM has attempted to apply a sort of translator's explanatory note.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.2.">Googletrans</head><p>Googletrans is a free and unlimited python library that implements Google Translate API <ref type="bibr" target="#b3">[4]</ref>. Since we use Google Colab for code execution, it was absolutely impossible to process all of them at once with over 5000 examples. Therefore, we were compelled to divide the test data into four parts.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">Evaluation methods description</head><p>The task organizers used two methods for evaluating translations: BLEU and BERT Score <ref type="bibr" target="#b0">[1]</ref>. BLEU (BiLingual Evaluation Understudy) measures the lexical similarity between a candidate translation and a reference translation <ref type="bibr" target="#b4">[5]</ref>. The organizers utilized the sacreBLEU implementation9 with the default tokenizer 13a, which emulates the mteval-v13a script from Moses <ref type="bibr" target="#b5">[6]</ref>. Their report includes the BLEU score (harmonic mean) and BLEU precisions for n-grams.</p><p>BERT Score, obtained from the python bert-score package10 <ref type="bibr" target="#b6">[7]</ref>, presents mean values of precision, recall, and F1 scores.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Experimental Results</head><p>We divided the results section into two parts: numerical results and linguistic results. In the first subsection, we examine the evaluation metrics provided by the organizers, while in the second, we discuss some examples of translations produced using each method.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Numerical results</head><p>Above all, we retrieved the table of results from CLEF 2023 for task 3 with BLOOM and Googletrans. The results for BLOOM using prompt 1 remain the same for both BLEU and BERT score in CLEF 2023 and CLEF 2024. Googletrans shows superior metrics, but this advantage should be viewed with caution due to the larger number of evaluations. BLOOM_2 performs slightly better than BLOOM_1 in the few evaluations conducted, but both methods need more evaluations to provide a more reliable analysis. For a fairer and more accurate comparison, the number of evaluated examples should be equalized for all methods. With a larger amount of data, we might see a reduction in the performance gap between Googletrans and the BLOOM methods. Similarly, we can observe that there has been a slight change in the metrics performed by BLEU for Googletrans when comparing CLEF 2023 and CLEF 2024, this change should be analyzed qualitatively by comparing the translations.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Linguistic results</head><p>Next, we will analyze some examples of the translations produced by BLOOM and Googletrans in 2023 and 2024. Los pianistas saben en qué bar se encuentran.</p><p>Los pianistas saben en qué bar se encuentran.</p><p>If we compare the translations produced by Googletrans in 2023 and 2024, we can observe that the vast majority of jokes are translated identically. In Example 1 of Table <ref type="table" target="#tab_3">7</ref>, the only difference is in the use of capitalization. We can infer that the differences in BLEU metrics are due to the fact that this year's test data does not include all the same jokes as last year's. OLD POLICEMEN never die they just cop out.</p><p>Los viejos policías nunca mueren, sólo se retiran.</p><p>Los viejos policías nunca mueren, sólo se retiran.</p><p>Los viejos policías nunca mueren, solo se jubilan. The boy swallowed a pillow, the hospital described his condition as comfortable.</p><p>El niño se tragó un cojín, el hospital describió su estado como cómodo.</p><p>El niño se tragó un cojín, el hospital describió su estado como cómodo.</p><p>El niño se tragó un cojín, el hospital describió su condición como cómoda.</p><p>I've never taken an elevator to the basement floor, that's just beneath me.</p><p>Nunca he cogido un ascensor hasta el sótano, eso está por debajo de mí.</p><p>Nunca he cogido un ascensor hasta el sótano, eso está por debajo de mí.</p><p>Nunca he tomado un ascensor al sótano, eso está justo debajo de mí.</p><p>Regarding the translations done by BLOOM, the outputs from 2023 and this year using prompt 1 are identical. In some cases, the joke is translated understandably, while in others, it is not. Concerning the translations with prompt 2, we observe that the target texts are more literal than those from prompt 1. While BLOOM_1 attempts to adapt the vocabulary to make it more natural, BLOOM_2 performs a highly literal translation.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Discussion and Conclusions</head><p>The numerical results, which are quite high in terms of precision, recall, and F1, suggest that the translations of the jokes are well-executed. However, when we analyze the results from a translational and linguistic perspective, we observe that the successful translation and conveyance of a joke's meaning are more coincidental. Although jokes are a cultural matter and usually require adaptation in translation, some jokes can be understood when translated literally, which explains some of the good translations in this task. By comparing the numerical and linguistic results, we can conclude that the more literal the translation, the higher the metrics. Therefore, we can conclude that language models still require significant training and improvement to accurately translate jokes in a "conscious" manner.</p><p>Future work could involve analyzing other language models, such as GPT, as it continues to improve and might offer different results. Additionally, exploring the translation of other types of jokes and wordplay, and attempting to train models with high-quality translations of jokes could be beneficial.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Data size for task 3</figDesc><table><row><cell></cell><cell>entries</cell></row><row><cell>test data (jokes) in English</cell><cell>5727</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2</head><label>2</label><figDesc>Test data for pun translation</figDesc><table><row><cell cols="2">index id_en</cell><cell>text_en</cell></row><row><cell>0</cell><cell>en_8</cell><cell>"I find you guilty, " said the judge with conviction.</cell></row><row><cell>1</cell><cell>en_9</cell><cell>The student had such a big assignment, he had to burn his kindle at</cell></row><row><cell></cell><cell></cell><cell>both ends.</cell></row><row><cell>2</cell><cell>en_10</cell><cell>Herbivores come in browns and graze.</cell></row><row><cell>3</cell><cell>en_11</cell><cell>I'll corroborate that again, Tom reproved.</cell></row><row><cell>4</cell><cell>en_13</cell><cell>I used to do rock climbing as a youth, but I was much boulder back</cell></row><row><cell></cell><cell></cell><cell>then.</cell></row><row><cell>5</cell><cell>en_14</cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3</head><label>3</label><figDesc>Evaluation of task 3 CLEF 2023 with BLEU</figDesc><table><row><cell>run_id</cell><cell cols="6">count BLEU BLEU_1 BLEU_2 BLEU_3 BLEU_4</cell></row><row><cell>BLOOM</cell><cell>5.0</cell><cell>24.49</cell><cell>39.36</cell><cell>28.09</cell><cell>21.43</cell><cell>15.19</cell></row><row><cell cols="2">Googletrans 215.0</cell><cell>51.38</cell><cell>70.58</cell><cell>55.09</cell><cell>46.097</cell><cell>38.94</cell></row><row><cell>Table 4</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="4">Evaluation of task 3 CLEF 2023 with BERT score</cell><cell></cell><cell></cell><cell></cell></row><row><cell>run_id</cell><cell cols="6">count BERT_score_P BERT_score_R BERT_score_F1</cell></row><row><cell>BLOOM</cell><cell>8.0</cell><cell>0.74</cell><cell></cell><cell>0.82</cell><cell></cell><cell>0.779</cell></row><row><cell cols="2">Googletrans 644.0</cell><cell>0.86</cell><cell></cell><cell>0.86</cell><cell></cell><cell>0.86</cell></row><row><cell>Table 5</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="3">Evaluation of task 3 CLEF 2024 with BLEU</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>run_id</cell><cell cols="6">count BLEU BLEU_1 BLEU_2 BLEU_3 BLEU_4</cell></row><row><cell>BLOOM_1</cell><cell>5.0</cell><cell>24.49</cell><cell>39.36</cell><cell>28.09</cell><cell>21.43</cell><cell>15.19</cell></row><row><cell>BLOOM_2</cell><cell>5.0</cell><cell>28.25</cell><cell>41.98</cell><cell>32.89</cell><cell>25.35</cell><cell>18.18</cell></row><row><cell cols="3">Googletrans 215.0 51.199</cell><cell>70.62</cell><cell>55.04</cell><cell>45.96</cell><cell>38.72</cell></row><row><cell>Table 6</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="4">Evaluation of task 3 CLEF 2023 with BERT score</cell><cell></cell><cell></cell><cell></cell></row><row><cell>run_id</cell><cell cols="6">count BERT_score_P BERT_score_R BERT_score_F1</cell></row><row><cell>BLOOM_1</cell><cell>8.0</cell><cell>0.74</cell><cell></cell><cell>0.82</cell><cell></cell><cell>0.779</cell></row><row><cell>BLOOM_2</cell><cell>8.0</cell><cell>0.76</cell><cell></cell><cell>0.83</cell><cell></cell><cell>0.79</cell></row><row><cell cols="2">Googletrans 644.0</cell><cell>0.86</cell><cell></cell><cell>0.86</cell><cell></cell><cell>0.86</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 7</head><label>7</label><figDesc>Googletrans 2023 and 2024 translations</figDesc><table><row><cell>Original text</cell><cell>Googletrans 2023</cell><cell>Googletrans 2024</cell></row><row><cell>Some rappers are good but</cell><cell>Algunos raperos son buenos,</cell><cell>Algunos raperos son buenos,</cell></row><row><cell>others are Ludacris.</cell><cell>pero otros son ludacris.</cell><cell>pero otros son Ludacris.</cell></row><row><cell>I've got to fix the automobile,</cell><cell>Tengo que arreglar el automóvil,</cell><cell>Tengo que arreglar el automóvil,</cell></row><row><cell>said Tom mechanically.</cell><cell>dijo Tom mecánicamente.</cell><cell>dijo Tom mecánicamente.</cell></row><row><cell>Piano players know what bar</cell><cell></cell><cell></cell></row><row><cell>they are in.</cell><cell></cell><cell></cell></row></table></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Acknowledgments</head><p>I would like to thank the organizers of CLEF 2024 in general and the organizers of JOKER in particular for providing us with the opportunity to continually improve our research, learn, and develop. Above all, I would like to express my gratitude to Professor Liana Ermakova for her invaluable advice and great support throughout the execution of this task.</p><p>This project has received a government grant managed by the National Research Agency under the program "Investissements d'avenir" integrated into France 2030, with the Reference ANR-19-GURE-0001.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Overview of clef 2024 joker track on automatic humor analysis</title>
		<author>
			<persName><forename type="first">L</forename><surname>Ermakova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Miller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A.-G</forename><surname>Bosser</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">M P</forename><surname>Preciado</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Sidorov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Jatowt</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Experimental IR Meets Multilinguality, Multimodality, and Interaction. Proceedings of the Fifteenth International Conference of the CLEF Association (CLEF 2024)</title>
				<editor>
			<persName><forename type="first">L</forename><surname>Goeuriot</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">P</forename><surname>Mulhem</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">G</forename><surname>Quénot</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">D</forename><surname>Schwab</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><surname>Soulier</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">G</forename><forename type="middle">M D</forename><surname>Nunzio</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">P</forename><surname>Galuščáková</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><forename type="middle">G S</forename><surname>De Herrera</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">G</forename><surname>Faggioli</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">N</forename><surname>Ferro</surname></persName>
		</editor>
		<imprint>
			<publisher>Springer-Verlag</publisher>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Does ai have a sense of humor? clef 2023 joker tasks 1, 2 and 3: Using bloom, gpt, simplet5, and more for pun detection, location, interpretation and translation</title>
		<author>
			<persName><forename type="first">O</forename><surname>Popova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Dadic</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">CLEF (Working Notes)</title>
				<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="1888" to="1908" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<author>
			<persName><forename type="first">B</forename><surname>Workshop</surname></persName>
		</author>
		<idno type="DOI">10.57967/hf/0003</idno>
		<ptr target="https://huggingface.co/bigscience/bloom.doi:10.57967/hf/0003" />
		<title level="m">Bloom</title>
				<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
	<note>revision 4ab0472</note>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<ptr target="https://pypi.org/project/googletrans/" />
		<title level="m">Googletrans 3</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Bleu: A method for automatic evaluation of machine translation</title>
		<author>
			<persName><forename type="first">K</forename><surname>Papineni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Roukos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Ward</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Zhu</surname></persName>
		</author>
		<idno type="DOI">10.3115/1073083.1073135</idno>
		<ptr target="https://www.aclweb.org/anthology/P02-1040.doi:10.3115/1073083.1073135" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics</title>
				<meeting>the 40th Annual Meeting of the Association for Computational Linguistics</meeting>
		<imprint>
			<date type="published" when="2002">2002</date>
			<biblScope unit="page" from="311" to="318" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">A call for clarity in reporting bleu scores</title>
		<author>
			<persName><forename type="first">M</forename><surname>Post</surname></persName>
		</author>
		<ptr target="https://www.aclweb.org/anthology/W18-6319" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Third Conference on Machine Translation: Research Papers, Association for Computational Linguistics</title>
				<meeting>the Third Conference on Machine Translation: Research Papers, Association for Computational Linguistics<address><addrLine>Belgium, Brussels</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="186" to="191" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Bertscore: Evaluating text generation with bert</title>
		<author>
			<persName><forename type="first">T</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Kishore</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Weinberger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Artzi</surname></persName>
		</author>
		<ptr target="https://openreview.net/forum?id=SkeHuCVFDr" />
	</analytic>
	<monogr>
		<title level="m">International Conference on Learning Representations</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
