<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Automatic Translation Alignment Pipeline for Multilingual Digital Editions of Literary Works</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Maria</forename><surname>Levchenko</surname></persName>
							<email>maria.levchenko@studio.unibo.it</email>
							<affiliation key="aff0">
								<orgName type="department">Dipartimento di Filologia Classica e Italianistica</orgName>
								<orgName type="institution">University of Bologna</orgName>
								<address>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Automatic Translation Alignment Pipeline for Multilingual Digital Editions of Literary Works</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">2F016C8966BAFBCB25848099E3EE8B16</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T19:48+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>multilingual digital edition</term>
					<term>Alessandro Manzoni</term>
					<term>translation alignment</term>
					<term>literary translation</term>
					<term>embeddings</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This paper investigates the application of translation alignment algorithms in the creation of a Multilingual Digital Edition (MDE) of Alessandro Manzoni's Italian novel I promessi sposi ("The Betrothed"), with translations in eight languages (English, Spanish, French, German, Dutch, Polish, Russian and Chinese) from the 19th and 20th centuries. We identify key requirements for the MDE to improve both the reader experience and support for translation studies. Our research highlights the limitations of current state-of-the-art algorithms when applied to the translation of literary texts and outlines an automated pipeline for MDE creation. This pipeline transforms raw texts into web-based, side-by-side representations of original and translated texts with different rendering options. In addition, we propose new metrics for evaluating the alignment of literary translations and suggest visualization techniques for future analysis.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>From the very beginning of digital edition creation, there has been a tendency, supported by the power of web technologies, to represent not only the original text but also its translation(s), following the tradition of bilingual printed editions. In this paper, we propose to define multilingual digital editions (MDE) as editions in which translations are not supplementary but essential, intended to enrich both computational analysis and reader experience.</p><p>Beyond annotated file accessibility, the MDE should meet additional criteria to be effective. Primarily, the platform must display the original text alongside translations. It is anticipated that there will be a visual correlation between aligned pairs, which will facilitate straightforward comparison and analysis. The accuracy of alignment is, by default, ensuring that the corresponding parts of the texts are properly aligned. Furthermore, the platform should support the visual highlighting of omitted or inserted parts in the translations <ref type="bibr" target="#b5">[6]</ref>, which will enable users to discern differences and interpret the nuances of each translation.</p><p>These requirements are generally feasible for short, structured texts like poetry or historical documents (examples of MDE publishing strategies are described in Appendix A). The challenge is to develop a flexible, automated system that accurately aligns complex literary texts across multiple languages for computational analysis and user-friendly exploration. The technology should be able to handle the complexities of literary texts, including the splitting, merging, and reordering of sentences, and align text fragments of manageable length, ensuring that they are easy for users to read and understand at a glance to obtain insight into the linguistic and cultural nuances of each version. The automated alignment process should save researchers both time and resources.</p><p>For the MDE of Alessandro Manzoni's novel "I promessi sposi" (The Betrothed), we propose an automatic translation alignment pipeline that adapts state-of-the-art alignment techniques to the objectives of the multilingual digital edition of literary works for educational and research purposes.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">The Betrothed by Alessandro Manzoni and Its Translations</head><p>A comparative analysis of translations of the same literary work over time can provide valuable insights into the evolution of interpretation and understanding. "I promessi sposi" is particularly compelling in this context. Not only does it reflect the author's exploration of the Italian language during a period of significant linguistic evolution, but it has also been translated into many European languages over the past two centuries. This makes it an ideal case study for investigating the influence of temporal factors, linguistic shifts, and the reception of the original novel in different cultural contexts.</p><p>Two main original editions (1827 and 1840) were translated into European languages and published in parallel in the XIX century. For the development of an automated translation alignment pipeline, we selected and prepared the texts of a wide range of translations of the classic edition of the novel, also known as Quarantana (1840), including English translations from 1845, 1876, 1983, and 2022; Russian translations from 1854 and 1999; a Dutch (1849); a German (1884), a French (1874), a Spanish (1858), a Polish (1882) and a Chinese (1998) (see Appendix 1 for a list of translations).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Related Work</head><p>The core of the MDE creation process is translation alignment, which involves mapping corresponding units (typically words or sentences) between a source and target text. State-of-the-art alignment algorithms have evolved significantly in recent years and now perform optimally in many applications, including machine translation, bilingual dictionary creation, and parallel corpus development.</p><p>Modern methods have moved from statistical approaches <ref type="bibr" target="#b4">[5,</ref><ref type="bibr" target="#b19">20,</ref><ref type="bibr" target="#b17">18,</ref><ref type="bibr" target="#b13">14,</ref><ref type="bibr" target="#b14">15,</ref><ref type="bibr" target="#b9">10]</ref> and lexical associations (Hunalign in <ref type="bibr" target="#b21">[22]</ref>), first to the use of machine translation (MT) systems and then to the alignment systems adopted multilingual sentence embeddings, which significantly improves the accuracy (LASER in <ref type="bibr" target="#b1">[2]</ref> and LaBSE in <ref type="bibr" target="#b7">[8]</ref>). Thomson and Koehn's Vecalign <ref type="bibr" target="#b20">[21]</ref> uses LASER embeddings and a recursive dynamic programming approach to achieve state-of-the-art results by reducing complexity from quadratic to linear <ref type="bibr" target="#b20">[21]</ref>. These methods use multilingual models to generate embeddings for each sentence, which are then compared using cosine similarity to find the best matches between the original and translated sentences. Liu and Zhu <ref type="bibr" target="#b16">[17]</ref> E n g li s h 2 0 2 2 E n g li s h 1 <ref type="bibr" target="#b8">9</ref>  </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Challenges of the Sentence-Level Alignment</head><p>While the alignment of text and translation at the line level is sufÏcient for poetic, historical, and even verse dramatic texts (see <ref type="bibr" target="#b0">[1]</ref>), where we cannot expect significant variation in the splitting, merging, or reordering of lines, this alignment approach is inadequate for prose due to the extent of restructuring that inevitably occurs in literary prose translation. In such cases, the standard approach is sentence-level alignment. However, it can be challenging, particularly in the case of literary translations, due to the irregularity of the syntactic structure of the original text in another language. Literary translators are not limited to translating a single sentence into another single sentence (this can be described as a one-to-one type of alignment) but are free to manage sentence boundaries and reconfigure sentence structures to better convey the meaning and style of the original text. In this case, in working to achieve the highest similarity score for the aligned pairs, the alignment algorithms are forced to combine several sentences into one, using one-to-many, many-to-one, and many-to-many alignment types.</p><p>The ideal alignment type for the MDE is a one-to-one alignment type to maintain the granularity and consistency of the alignment. In our analysis of the sentence-level alignment of I promessi sposi (see Figure <ref type="figure" target="#fig_0">1</ref> for the different translations), while one-to-one alignments are the most common, a significant proportion are more complex types. While this does not inherently complicate the alignment process, as advanced tools such as Bertalign and Vecalign can handle this complexity, the results may be less optimal in terms of meaningfulness. The length of the aligned pairs becomes longer, including several sentences from both the source text and the translation (examples can be seen in the Appendix B). The edge case for this expanded alignment result would be the pairing of the paragraph or even the chapter of the original text with the same of the translation. That's why traditional metrics may not be sufÏcient for evaluating the alignment results. The performance of alignment algorithms is typically evaluated using established metrics such as precision, recall, F1 score, and Alignment Error Rate (AER) <ref type="bibr" target="#b22">[23]</ref>. The first limitation of this approach is that it is based on a "gold dataset, " which does not provide insight into the performance of the algorithm with respect to other types of text <ref type="bibr" target="#b8">[9]</ref>. A second consequence is that the scores may be high, but the results are not suitable for MDE because the aligned pairs are too large to be analyzed or identified at a glance by a human observer. We, therefore, suggest that, in addition to the increasing importance of the distribution of alignment types (one-to-one, one-to-many, many-to-one, many-to-many) as a metric of the acceptability of the results, the number and length of alignment pairs derived from the original sentences should also be considered. A number of aligned pairs close to the number of original sentences would indicate an effective alignment process. Conversely, a significant reduction in the number of aligned pairs would indicate limitations of the sentence-level alignment approach, as it implies that the alignment algorithm is forced to combine more sentences to obtain the appropriate similarity score.</p><p>The length of alignment pairs will indicate if they are suitable for human readers. In the context of creating a digital edition for educational or research purposes with multiple languages, it is not advisable to present long-aligned texts, given the limited attention span and working memory of the readers (for further insight, see studies of working memory and comprehension with multiple text reading <ref type="bibr" target="#b10">[11,</ref><ref type="bibr" target="#b11">12]</ref>).</p><p>To illustrate, if a sentence in the source language (Italian) is 130 tokens long and its corresponding sentence in the target language (English) is 140 tokens long, readers may encounter difÏculties in comparing and understanding such lengthy segments. Even the use of color differentiation to highlight aligned pairs does not overcome this challenge (see Figure <ref type="figure" target="#fig_1">2</ref>).</p><p>In summary, in the context of MDE of literary works, sentence-level alignment still faces a significant challenge. 1) Sentence boundaries are not stable in different languages, which leads to a variety of alignment types and doesn't allow a consistent alignment across the MDE.</p><p>2) Strict sentence-level alignment does not fully reflect the variability of the translated texts, such as inserted or omitted parts, and 3) strains readers' attention spans and working memory and fails to achieve the alignment granularity that is comfortable for the overall reading experience. The alignment process needs to be modified to address these challenges to automated processing and readability.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Sentence Segmentation as an Alternative Solution</head><p>The alternative methods can provide more accurate and meaningful segmentation of literary texts. In an attempt to move from sentence-level alignment to phrase-or segment-level alignment, here are two promising approaches:</p><p>• Punctuation splitting: Applying punctuation marks (such as commas, periods, and semicolons) to create initial segments. This method provides natural breaks in the text, preserving the contextual meaning. However, by using this approach and aligning the resulting segments with Bertalign, we achieved a more granular alignment but increased the number of reordering problems that didn't occur with sentence-level alignment. • Zero/Few-Shot Prompting with LLM Models: The sentences of the original text are segmented using zero-shot prompting OpenAI CPT-4o model <ref type="bibr" target="#b18">[19]</ref>. The approved segments are then used as patterns for few-shot prompting to segment the sentences of the translations. This approach provides a robust foundation for universal alignment.</p><p>The similarity score can be visualized to evaluate segment-level alignment and compare its results with traditional sentence-level alignment. In addition, the visual representation of the similarity score of the aligned segments or sentences allows us to find the semantic outliers.</p><p>After extracting high-dimensional embeddings for each aligned line from the original and translated text using the multilingual model (LaBSE), we applied t-Distributed Stochastic Neighbour Embedding (t-SNE) to reduce them to two dimensions. By visually examining the cosine similarity, we can detect anomalies and curious translated fragments (see Figure <ref type="figure">3</ref>), even if the alignment algorithm establishes the correlation between sentences.</p><p>There are several quantitative metrics that can be used to assess the quality of alignment:</p><p>• Comparing sentence-level and segment-level alignment, we can assume that even sentencelevel alignment provides valuable insights into the differences between the translations and the original text (see Appendix D for examples in Spanish translation of I Promessi sposi); segmentlevel alignment allows us to go deeper and capture more nuanced variations between the original and the translated text. For example, we can identify the omission of the end of chapter 8 in the German translation (see Table <ref type="table" target="#tab_0">1</ref>) and two omissions in the Russian translation of chapter 1 (see Tables <ref type="table" target="#tab_3">3-2</ref>), which can be interpreted through the lens of cultural differences and/or censorship and could not be captured by sentence-level alignment. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Original</head><p>German 1880</p><p>Voi,» continuò volgendosi alle due donne, «potrete fermarvi a ***.</p><p>Und ihr», fuhr er, zu den beiden Frauen gewandt, fort, «ihr könnt euch in *** so lange aufhalten. By providing a more granular and accurate alignment, the segment-level approach also allows the length of aligned pairs to be reduced (increasing their number) and makes the MDE more suitable for reader reception compared to sentence-level alignment (see Figure <ref type="figure" target="#fig_3">4</ref>). </p><formula xml:id="formula_0">Intro Cap1 Cap2 Cap3 Cap4 Cap5 Cap6 Cap7 Cap8 Cap9 Cap10 Cap11 Cap12 Cap13 Cap14 Cap15 Cap16 Cap17 Cap18 Cap19 Cap20 Cap21 Cap22 Cap23 Cap24 Cap25 Cap26 Cap27 Cap28 Cap29 Cap30 Cap31 Cap32 Cap33 Cap34 Cap35 Cap36 Cap37 Cap38</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Multilingual Digital Edition Pipeline</head><p>The automated pipeline for the MDE is proposed as a means of enabling creators to prepare annotated TEI files that are accessible, adaptable, correct, easily parsed by computational tools, and rendered for readers (see Figure <ref type="figure" target="#fig_4">5</ref>). We start with the raw texts of the translations in TXT format, obtained after OCR and error checking. For Manzoni's text, we used TEI files with identifiers assigned to each token. This preparation allows us to take into account the irregular segmentation to be expected due to inconsistencies across the languages.</p><p>Step 1. The choice of the segmentation method. Based on the above analysis and the specifics of the texts to be published, the MDE developers can select the segmentation methodology in accordance with the projected audience and the project's objectives, enabling alignment at the sentence, phrase, or word level, or a combination, giving readers the flexibility to choose their preferred option.</p><p>Step 2-3. Segmentation of the original text and the translations. Depending on the decision made in the first step, the text can be split into sentences, segments, or even tokens.</p><p>Step 4. Applying alignment algorithms: By default, we applied Bertalign with the LaBSE model, trained on 109 languages <ref type="bibr" target="#b6">[7]</ref> to the segments obtained at the previous steps. Other multilingual sentence-transformer models, such as BGE M3-Embedding, can also be used <ref type="bibr" target="#b3">[4]</ref>.</p><p>Step 5. Choosing the encoding approach. The encoding approach determines the flexibility of the alignment description for future rendering and for establishing a link between the original and translated texts. For structured texts, where each segment in the original closely corresponds to an equivalent segment in the translations, it may be appropriate to mark each segment with the same identifier. Given the complexity of multilingual alignment, we have taken a different approach. The TEI encoding of the original text includes identifiers for each token, providing granular reference points. The TEI-encoded translation text is divided into segments, each referencing the start and end identifiers from the original text, allowing for flexible and accurate alignment.</p><p>Step 6. Encoding. By iterating over the alignment results, we assign the referencing start and end identifiers from the original text to each aligned segment from the translation and generate a new TEI file for the translation, ensuring that all segments are accurately linked to the corresponding elements.</p><p>Step 7. Rendering Aligned Texts on the Web. Render the original and translated texts from the TEI files as two columns on a web page with separate XSLT templates for the original and translated text. This interactive interface allows users to click on the original text and see the corresponding translation fragment highlighted, enhancing the user experience by providing an intuitive way to explore and compare the translations side by side.</p><p>Step 8. Visualization and evaluation. While the highly unstable text versions <ref type="bibr" target="#b12">[13]</ref> or linelevel aligned translations <ref type="bibr" target="#b15">[16]</ref> can be effectively visualized with the Sankey diagram or bipartite graph, the alignment results for the modern translations can be visualized with the approach described above, based on the embedding vectors with t-SNE and clustering with DBSCAN. As for the presentation in the user interface, ideally, all multilingual translations should be comparable and aligned with each other, allowing the user to see and interpret the differences.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.">Future Development and Challenges</head><p>• Current alignment algorithms face challenges in accurately aligning segments with reordered content. Future work will focus on improving the alignment performance in such cases, ensuring more precise matches even when the original and translated texts differ significantly in structure. • Previous studies on user behavior in digital editions have analyzed log files to understand interaction patterns <ref type="bibr" target="#b2">[3]</ref>. To gain deeper insights, we are using advanced tools such as ReactFlow to study more comprehensively how users interact with different elements of MDEs. For example, when readers view two lines in different languages side by side, the optimal reading span may differ from traditional reading practices. By analysing user interactions, we aim to determine the most effective segment length for the MDEs.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="8.">Conclusion</head><p>The proposed pipeline aims to improve the development of Multilingual Digital Editions (MDE) by ensuring that MDE is both methodologically robust and user-centered. By prioritizing user experience and usability, the pipeline adapts existing computational methods and algorithms to the specific needs of educational and research applications.</p><p>We have also proposed new metrics for MDEs that focus on the consistency, meaningfulness and granularity of the alignment. These metrics assess the suitability of an alignment for educational and research purposes. By ensuring that the alignment is accessible to human readers while supporting translation studies, the pipeline balances conciseness and reader engagement.  </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>C. Examples of Many-to-Many Alignment Type</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>D. Examples of Omission Captured through Sentence-Level Alignment</head><p>Table <ref type="table" target="#tab_9">9</ref>: Italian / Spanish segment level alignment for the Chapter 7</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Italian Spanish 1858</head><p>Gertrude domandò sommessamente e tremando, che cosa dovesse fare.</p><p>Gertrudis con mucha timidez pidió la explicación de aquellas palabras y lo que debía hacer en consecuencia. Il principe (non ci regge il cuore di dargli in questo momento il titolo di padre) non rispose direttamente, ma cominciò a parlare a lungo del fallo di Gertrude: e quelle parole frizzavano sull'animo della poveretta, come lo scorrere d'una mano ruvida sur una ferita. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Italian</head><p>Spanish 1858</p><p>Continuò dicendo che, quand'anche... caso mai... che avesse avuto prima qualche intenzione di collocarla nel secolo, lei stessa ci aveva messo ora un ostacolo insuperabile; giacché a un cavalier d'onore, com'era lui, non sarebbe mai bastato l'animo di regalare a un galantuomo una signorina che aveva dato un tal saggio di sé.</p><p>Él continuó diciendo que... "á pesar de lo ocurrido... en el caso en que... hubiera sido con la intención de establecerse en el mundo, ella había contraído un lazo indisoluble y había creado un obstáculo invencible. Hombre de honor como era, jamás se habría atrevido á presentarla á ningún caballero después de tales antecedentes".</p><p>«Ebbene, non si parli più del passato: tutto è cancellato.</p><p>En hora buena; no hablemos más de lo pasado: todo está olvidado ya. Avete preso il solo partito onorevole, conveniente, che vi rimanesse; ma perché l'avete preso di buona voglia, e con buona maniera, tocca a me a farvelo riuscir gradito in tutto e per tutto: tocca a me a farne tornare tutto il vantaggio e tutto il merito sopra di voi. Ne prendo io la cura.»</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>E. Examples of Omission Captured through Segment-Level Alignment</head><p>Table <ref type="table" target="#tab_10">10</ref>: Italian / Spanish segment level alignment for the Chapter 7</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Italian Spanish 1858</head><p>«Brava! bene!» esclamarono, a una voce, la madre e il figlio, -Muy bien, muy bien, exclamaron á la par madre é hijo. e l'uno dopo l'altra abbracciaron Gertrude; la quale ricevette queste accoglienze con lacrime, che furono interpretate per lacrime di consolazione. Allora il principe si diffuse a spiegar ciò che farebbe per render lieta e splendida la sorte della figlia. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Italian</head><p>Spanish 1858</p><p>Parlò delle distinzioni di cui goderebbe nel monastero e nel paese;</p><p>Entonces el príncipe habló de las distinciones que Gertrudis habría de tener en el convento y en el país. che, là sarebbe come una principessa, come la rappresentante della famiglia; che, appena l'età l'avrebbe permesso, sarebbe innalzata alla prima dignità; e, intanto, non sarebbe soggetta che di nome.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Alignment Types in The Betrothed</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: The visualization of the alignment of the long sentence.</figDesc><graphic coords="4,151.79,84.17,291.70,161.29" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>of sentence embeddings in the Chapter 23 Figure 3 :</head><label>233</label><figDesc>Figure 3: Similarity visualisation for sentence-level alignment in the German translation of chapter 23. This translation omits Don Abbondio's inner monologue, which is not captured in the sentencelevel alignment, but is evident in the visualisation, where several Italian sentences appear without corresponding German pairs.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 4 :</head><label>4</label><figDesc>Figure 4: Reducing the Lengths of Aligned Pairs for the Spanish 1858</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 5 :</head><label>5</label><figDesc>Figure 5: The Translation Alignment Pipeline</figDesc><graphic coords="8,193.46,394.27,208.36,206.68" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 :</head><label>1</label><figDesc>Italian / German segment-level alignment</figDesc><table><row><cell>Original</cell><cell>German 1880</cell></row><row><cell>Presto, io spero, potrete ritornar sicuri a</cell><cell>Ich hoffe, ihr werdet bald ohne Gefahr in</cell></row><row><cell>casa vostra;</cell><cell>euer Haus zurückkehren können;</cell></row><row><cell>a ogni modo, Dio vi provvederà, per il</cell><cell>in jedem Falle wird Gott Alles zu euerm</cell></row><row><cell>vostro meglio;</cell><cell>Besten lenken.</cell></row><row><cell>e io certo mi studierò di non mancare alla</cell><cell></cell></row><row><cell>grazia che mi fa, scegliendomi per suo min-</cell><cell></cell></row><row><cell>istro, nel servizio di voi suoi poveri cari tri-</cell><cell></cell></row><row><cell>bolati.</cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 1 :</head><label>1</label><figDesc>(continued)    </figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 2 :</head><label>2</label><figDesc>Italian / Russian segment-level alignment</figDesc><table><row><cell>Original</cell><cell cols="2">Russian 1854</cell><cell></cell></row><row><cell>Ai tempi in cui accaddero i fatti che pren-</cell><cell cols="3">Во время тех событий, которые мы</cell></row><row><cell>diamo a raccontare, quel borgo, già consid-</cell><cell cols="3">намерены описать, Лекко было уже</cell></row><row><cell>erabile, era anche un castello,</cell><cell cols="3">значительным местечком и маленькой</cell></row><row><cell></cell><cell cols="2">крепостцей;</cell><cell></cell></row><row><cell>e aveva perciò l'onore d'alloggiare un co-</cell><cell cols="3">вследствие чего в нем жили комендант</cell></row><row><cell>mandante, e il vantaggio di possedere una</cell><cell cols="3">и постоянный гарнизон испанских</cell></row><row><cell>stabile guarnigione di soldati spagnoli,</cell><cell>солдат,</cell><cell></cell><cell></cell></row><row><cell>che insegnavan la modestia alle fanciulle</cell><cell></cell><cell></cell><cell></cell></row><row><cell>e alle donne del paese, accarezzavan di</cell><cell></cell><cell></cell><cell></cell></row><row><cell>tempo in tempo le spalle a qualche marito,</cell><cell></cell><cell></cell><cell></cell></row><row><cell>a qualche padre; e, sul finir dell'estate,</cell><cell></cell><cell></cell><cell></cell></row><row><cell>non mancavan mai di spandersi nelle vi-</cell><cell>которые</cell><cell>занимались</cell><cell>собиранием</cell></row><row><cell>gne, per diradar l'uve, e alleggerire a' con-</cell><cell>винограда.</cell><cell></cell><cell></cell></row><row><cell>tadini le fatiche della vendemmia.</cell><cell></cell><cell></cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 3 :</head><label>3</label><figDesc>Italian / Russian segment-level alignment</figDesc><table><row><cell>Original</cell><cell>Russian 1854</cell></row><row><cell>Con tutto ciò,</cell><cell>Несмотря на все,</cell></row><row><cell>anzi in gran parte a cagion di ciò,</cell><cell>и может быть, потому именно,</cell></row><row><cell>quelle gride, ripubblicate e rinforzate di</cell><cell></cell></row><row><cell>governo in governo, non servivano ad</cell><cell></cell></row><row><cell>altro che ad attestare ampollosamente</cell><cell></cell></row><row><cell>l'impotenza de' loro autori,</cell><cell></cell></row><row><cell>o, se producevan qualche effetto immedi-</cell><cell>если декреты имели минутную</cell></row><row><cell>ato...</cell><cell>действительность...</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_4"><head>Table 4 :</head><label>4</label><figDesc>Strategies of Text/Translation Representation in Multilingual Digital Editions (continued) The Betrothed, by Alessandro Manzoni. London, G. Bell and Sons, 1876.</figDesc><table><row><cell>Project Name</cell><cell>Alignment</cell><cell>Comparison</cell><cell>Notes</cell></row><row><cell></cell><cell>Type</cell><cell></cell><cell></cell></row><row><cell>The Community of the</cell><cell>sentences</cell><cell>+</cell><cell>The side-by-side viewer of</cell></row><row><cell>Realm in Scotland</cell><cell></cell><cell></cell><cell>the Latin text with its En-</cell></row><row><cell></cell><cell></cell><cell></cell><cell>glish translation aligns the</cell></row><row><cell></cell><cell></cell><cell></cell><cell>sentences, allowing users</cell></row><row><cell></cell><cell></cell><cell></cell><cell>to click on the sentence</cell></row><row><cell></cell><cell></cell><cell></cell><cell>number in the original text,</cell></row><row><cell></cell><cell></cell><cell></cell><cell>which automatically scrolls</cell></row><row><cell></cell><cell></cell><cell></cell><cell>the other side of the page to</cell></row><row><cell></cell><cell></cell><cell></cell><cell>the corresponding sentence.</cell></row><row><cell>Tabula Salomonis</cell><cell>lines</cell><cell>+</cell><cell>The TEI Publisher tool al-</cell></row><row><cell></cell><cell></cell><cell></cell><cell>lows the user to highlight</cell></row><row><cell></cell><cell></cell><cell></cell><cell>corresponding parts and au-</cell></row><row><cell></cell><cell></cell><cell></cell><cell>tomatically scroll when hov-</cell></row><row><cell></cell><cell></cell><cell></cell><cell>ering over the lines.</cell></row><row><cell cols="4">• (English 1983). Alessandro Manzoni, The Betrothed, Bruce Penman (tr.), Penguin Ran-dom House UK. London, 1983.</cell></row><row><cell cols="4">• (English 2022). The Betrothed. A novel, translated and with Introduction of Michael Moore, Preface by Pulitzer Prize-Winning Author Jhumpa Lahiri, Modern Library, 2022.</cell></row><row><cell cols="4">• (Russian 1854). Обрученные : Медиолан. быль XVIII [!XVII] столетия, найден. и передел. Александром Манзони / Пер. с итал. В.С. Межевича. Ч. 1-4. Москва,</cell></row><row><cell cols="4">1854. 4 т.; 20. (Библиотека романов, повестей, путешествий и записок, изд. Н.Н.</cell></row><row><cell cols="2">Улитиным; Вып. 7, т. 1-2, 6-7).</cell><cell></cell><cell></cell></row><row><cell cols="4">• (Russian 1999). Обрученные [Повесть из истории Милана XVII в.] / А. Мандзони; [Пер. с итал. под ред. Н. Георгиевской, А. Эфроса]. Москва: Терра-Книжный клуб,</cell></row><row><cell>1999.</cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="4">• (Dutch 1849). De verloofden: eene Milanesche geschiedenis uit de zeventiende eeuw. Vol. 1. Translated by Petrus Van Limburg Brouwer. Groningen, Van Boekeren, 1849. • (German 1884). Die Verlobten: eine Mailändischer Geschichte aus dem 17. Jahrhundert, Volume 1. 3rd ed. Regensburg, G.J. Manz, 1884. • (French 1874). Les fiancés: histoire milanaise du XVIIe siècle / Alexandre Manzoni; traduite de l'italien par Rey Dussueil. Paris: Charpentier, 1874.</cell></row></table><note>B. Translations• (English 1845). The Betrothed Lovers: A Milanese Story of the Seventeenth Century. With the Column of Infamy. By Alessandro Manzoni. In Three Volumes. Henry Francis C. Logan. London: Longman, Brown, Green, and Longmans, Paternoster-Row. • (English 1876).</note></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_5"><head>Table 5 :</head><label>5</label><figDesc>Italian / French: 3-1 alignment type</figDesc><table><row><cell>Italian</cell><cell>French</cell></row><row><cell>1 Sì; ma com'è dozzinale! com'è sguaiato!</cell><cell>1 Oui; mais comme il est commun! comme</cell></row><row><cell>com'è scorretto! 2 Idiotismi lombardi a iosa, frasi della lin-</cell><cell>il est inégal! comme il est incorrect! id-iotismes lombards à foison, phrases de</cell></row><row><cell>gua adoperate a sproposito, grammatica ar-</cell><cell>la langue employées à rebours, construc-</cell></row><row><cell>bitraria, periodi sgangherati. 3 E poi, qualche eleganza spagnola semi-</cell><cell>tions arbitraires, périodes boiteuses; et puis quelques petites élégances espagnoles</cell></row><row><cell>nata qua e là; e poi, ch'è peggio, ne' lu-</cell><cell>semées ça et là; et puis, ce qui est bien pis,</cell></row><row><cell>oghi più terribili o più pietosi della storia,</cell><cell>dans les endroits les plus terribles ou les</cell></row><row><cell>a ogni occasione d'eccitar maraviglia, o di</cell><cell>plus touchants de son histoire, à chaque</cell></row><row><cell>far pensare, a tutti que' passi insomma che</cell><cell>occasion d'exciter la surprise ou de faire</cell></row><row><cell>richiedono bensì un po' di rettorica, ma ret-</cell><cell>penser, à tous les passages enfin qui de-</cell></row><row><cell>torica discreta, fine, di buon gusto, costui</cell><cell>mandent, il est vrai, quelques fleurs de rhé-</cell></row><row><cell>non manca mai di metterci di quella sua</cell><cell>torique, mais d'une rhétorique sobre, fine,</cell></row><row><cell>così fatta del proemio.</cell><cell>de bon goût, ce digne homme ne manque</cell></row><row><cell></cell><cell>jamais d'y mettre quelque chose dans le</cell></row><row><cell></cell><cell>genre de son début.</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_6"><head>Table 6 :</head><label>6</label><figDesc>Italian / German: 2-2 alignment type with the overlapping sentence boundaries</figDesc><table><row><cell>Italian</cell><cell>German</cell></row><row><cell>1 Né alcuno dirà questa sij imperfettione del</cell><cell>1 Und es wird gewiß niemand sagen, dies</cell></row><row><cell>Racconto, e defformità di questo mio rozzo</cell><cell>sei ein Geschichtsfälscher und eine Entstel-</cell></row><row><cell>Parto, a meno questo tale Critico non sij per-</cell><cell>lung dieser meiner einfältigen Erzählung,</cell></row><row><cell>sona affatto diggiuna della Filosofia: che</cell><cell>es sei denn der Tadel ein Mann, der aller</cell></row><row><cell>quanto agl'huomini in essa versati, ben vederanno nulla mancare alla sostanza di</cell><cell>Weltweisheit vollständig bar wäre. 2 Denn man wird bald sehen, daß in</cell></row><row><cell>detta Narratione. 2 Imperciocché, essendo cosa evidente, e da verun negata non essere i nomi se non puri purissimi accidenti...</cell><cell>Beziehung der darin vorkommenden Per-sonen am Wesentlichsten der besagten Erzählung nichts fehle; zumal es eine augenfällige, von niemand gelenkte Sache ist, daß Namen bloß reine Nebensachen seien…</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_7"><head>Table 7 :</head><label>7</label><figDesc>Italian / Dutch: 2-2 alignment type with the overlapping sentence boundaries</figDesc><table><row><cell>Italian</cell><cell>Dutch</cell></row><row><cell>1 Però alla mia debolezza non è lecito</cell><cell>1 Doch mijn' geringeren krachten is het</cell></row><row><cell>solleuarsi a tal'argomenti, e sublimità peri-</cell><cell>niet gegeven zich tot zoo hooge vlugt, tot</cell></row><row><cell>colose, con aggirarsi tra Labirinti de' Politici</cell><cell>zulk eene gevaarvolle verhevenheid te ver-</cell></row><row><cell>maneggj, et il rimbombo de' bellici Ori-</cell><cell>heffen, en zich te wagen in den doolhof der</cell></row><row><cell>calchi: solo che hauendo hauuto notitia di</cell><cell>staatkundige spitsvondigheden of te midden</cell></row><row><cell>fatti memorabili, se ben capitorno a gente meccaniche, e di piccol affare, mi accingo</cell><cell>van het geschal der schorre krijgsklaroenen. 2 Naardemaal 'er dus eenige merk-</cell></row><row><cell>di lasciarne memoria a Posteri, con far di</cell><cell>waardige gebeurtenissen ter mijner</cell></row><row><cell>tutto schietta e genuinamente il Racconto,</cell><cell>kennis gekomen zijn, welke, wel is waar,</cell></row><row><cell>ouuero sia Relatione. 2 Nella quale si vedrà in angusto Teatro</cell><cell>slechts menschen van gering bedrijve en lage geboorte betreffen, maar des alniet-</cell></row><row><cell>luttuose Traggedie d'horrori, e Scene</cell><cell>temin eene rijke vertooning opleveren van</cell></row><row><cell>di malvaggità grandiosa, con intermezi</cell><cell>droevige en vreesselijke ongevallen, voor-</cell></row><row><cell>d'Imprese virtuose e buontà angeliche, op-</cell><cell>beelden van drieste boosheid, doormengd</cell></row><row><cell>poste alle operationi diaboliche.</cell><cell>met vrome ondernemingen en verheerljkt</cell></row><row><cell></cell><cell>door het zielesterkend schouwspel van</cell></row><row><cell></cell><cell>hemelsche deugd, in onophoudelijken</cell></row><row><cell></cell><cell>strijd met de gruwelijke aanslagen der</cell></row><row><cell></cell><cell>helle, zoo heb ik besloten mij aan te</cell></row><row><cell></cell><cell>gorden om daarvan der nakomelingschap</cell></row><row><cell></cell><cell>een getrouw en nauwkeurig Verhaal ofte</cell></row><row><cell></cell><cell>Relaas achterlaten.</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_8"><head>Table 8 :</head><label>8</label><figDesc>Italian / Spanish: 2-3 alignment type with the overlapping sentence boundaries</figDesc><table><row><cell>Italian</cell><cell>Spanish</cell></row><row><cell>1 Ma che? quando siamo stati al punto</cell><cell>1 Pero, ¡oh cielos! llegado el momento de re-</cell></row><row><cell>di raccapezzar tutte le dette obiezioni e</cell><cell>capitular las objeciones y sus respuestas y el</cell></row><row><cell>risposte, per disporle con qualche ordine,</cell><cell>de ordenarlas, hallamos, que habíamos he-</cell></row><row><cell>misericordia! venivano a fare un libro. 2 Veduta la qual cosa, abbiam messo da</cell><cell>cho un libro: visto lo cual, abandonamos nuestro intento por dos razones, que sin</cell></row><row><cell>parte il pensiero, per due ragioni che il let-</cell><cell>duda alguna el lector considerará oportu-</cell></row><row><cell>tore troverà certamente buone: la prima, che un libro impiegato a giustificarne un al-</cell><cell>nas. -2 La primera, porque temimos que el hacer</cell></row><row><cell>tro, anzi lo stile d'un altro, potrebbe parer</cell><cell>un libro para justificar otro, ó solo su estilo,</cell></row><row><cell>cosa ridicola: la seconda, che di libri basta uno per volta, quando non è d'avanzo.</cell><cell>parecería cosa ridícula. 3 La segunda, porque creemos que es sufi-</cell></row><row><cell></cell><cell>ciente, cuando no excesivo, el publicar un</cell></row><row><cell></cell><cell>solo libro á la vez.</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_9"><head>Table 9 :</head><label>9</label><figDesc>(continued)    </figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_10"><head>Table 10 :</head><label>10</label><figDesc>(continued)    </figDesc><table /></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>This research was supported by the Dipartimento di Filologia Classica e Italianistica, University of Bologna, as part of the project "Manzoni online2: manoscritti e documenti inediti, tradizione e traduzioni" (CUP J34I19003370001, project code 2017CFZFAY_003). For more information on the Leggo Manzoni project, visit https://projects.dharc.unibo.it/leggomanzoni.</p></div>
			</div>

			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Appendixes</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Strategies of Text/Translation Representation in MDE</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Project Name</head><p>Alignment Comparison Notes</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Separate pages for the text and the translation</head><p>Decameron web --La entretenida by Miguel de Cervantes --</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Same page for the text and the translation with JS switcher</head><p>Furnace and Fugue -- The original text in Old Church Slavonic is directly followed by its corresponding parallel Greek text.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Dynamic alignment display</head><p>Electronic Beowulf lines + When the special view type and option are selected, and the user hovers the mouse over a line, the translation appears in a special area. Kassák Lajos:</p><p>The Horse Dies the Birds Fly Away lines + The page displays side-byside views of the original text and its translations into two other languages, highlighting the corresponding translated line when the mouse hovers over the original line.</p></div>			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">AlignVis: Semi-automatic Alignment and Visualization of Parallel Translations</title>
		<author>
			<persName><forename type="first">M</forename><surname>Alharbi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Cheesman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">S</forename><surname>Laramee</surname></persName>
		</author>
		<idno type="DOI">10.1109/iv51561.2020.00026</idno>
	</analytic>
	<monogr>
		<title level="m">2020 24th International Conference Information Visualisation (IV)</title>
				<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="98" to="108" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Margin-based Parallel Corpus Mining with Multilingual Sentence Embeddings</title>
		<author>
			<persName><forename type="first">M</forename><surname>Artetxe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Schwenk</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/P19-1309</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics</title>
				<editor>
			<persName><forename type="first">A</forename><surname>Korhonen</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">D</forename><surname>Traum</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><surname>Màrquez</surname></persName>
		</editor>
		<meeting>the 57th Annual Meeting of the Association for Computational Linguistics<address><addrLine>Florence, Italy</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="3197" to="3203" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Editing for Man and Machine. Digital Scholarly Editions and their Users</title>
		<author>
			<persName><forename type="first">A</forename><surname>Baillot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Busch</surname></persName>
		</author>
		<idno type="DOI">10.4000/variants.1220</idno>
	</analytic>
	<monogr>
		<title level="j">Variants</title>
		<imprint>
			<biblScope unit="page" from="175" to="187" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation</title>
		<author>
			<persName><forename type="first">J</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Xiao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Luo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Lian</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Liu</surname></persName>
		</author>
		<idno type="DOI">10.48550/arXiv.2402.03216</idno>
		<idno>eprint: 2402.03216</idno>
		<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
	<note>cs.CL)</note>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Char_align: A Program for Aligning Parallel Texts at the Character Level</title>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">W</forename><surname>Church</surname></persName>
		</author>
		<idno type="DOI">10.3115/981574.981575</idno>
	</analytic>
	<monogr>
		<title level="m">31st Annual Meeting of the Association for Computational Linguistics</title>
				<meeting><address><addrLine>Columbus, Ohio, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="1993">1993</date>
			<biblScope unit="page" from="1" to="8" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Beyond translation: engaging with foreign languages in a digital library</title>
		<author>
			<persName><forename type="first">G</forename><surname>Crane</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Babeu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Cerrato</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Parrish</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Penagos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Shamsian</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Tauber</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Wegner</surname></persName>
		</author>
		<idno type="DOI">10.1007/s00799-023-00349-2</idno>
	</analytic>
	<monogr>
		<title level="j">International Journal on Digital Libraries</title>
		<imprint>
			<biblScope unit="volume">24</biblScope>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">Language-agnostic BERT Sentence Embedding</title>
		<author>
			<persName><forename type="first">F</forename><surname>Feng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Cer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Arivazhagan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Wang</surname></persName>
		</author>
		<idno type="DOI">10.48550/arXiv.2007.01852</idno>
		<idno>eprint: 2007.01852</idno>
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Language-agnostic BERT Sentence Embedding</title>
		<author>
			<persName><forename type="first">F</forename><surname>Feng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Cer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Arivazhagan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Wang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics</title>
				<editor>
			<persName><forename type="first">S</forename><surname>Muresan</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">P</forename><surname>Nakov</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Villavicencio</surname></persName>
		</editor>
		<meeting>the 60th Annual Meeting of the Association for Computational Linguistics<address><addrLine>Dublin, Ireland</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2022">2022</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="878" to="891" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Squibs and Discussions: Measuring Word Alignment Quality for Statistical Machine Translation</title>
		<author>
			<persName><forename type="first">A</forename><surname>Fraser</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Marcu</surname></persName>
		</author>
		<idno type="DOI">10.1162/coli.2007.33.3.293</idno>
	</analytic>
	<monogr>
		<title level="j">Computational Linguistics</title>
		<imprint>
			<biblScope unit="volume">33</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="293" to="303" />
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">A Program for Aligning Sentences in Bilingual Corpora</title>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">A</forename><surname>Gale</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">W</forename><surname>Church</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/J93-1004" />
	</analytic>
	<monogr>
		<title level="j">Computational Linguistics</title>
		<editor>J. Hirschberg</editor>
		<imprint>
			<biblScope unit="volume">19</biblScope>
			<biblScope unit="page" from="75" to="102" />
			<date type="published" when="1993">1993</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Automated and controlled processes in comprehending multiple documents</title>
		<author>
			<persName><forename type="first">C</forename><surname>Hahnel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Goldhammer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">U</forename><surname>Kroehne</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Mahlow</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Artelt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Schoor</surname></persName>
		</author>
		<ptr target="https://api.semanticscholar.org/CorpusID:237538010" />
	</analytic>
	<monogr>
		<title level="j">Studies in Higher Education</title>
		<imprint>
			<biblScope unit="volume">46</biblScope>
			<biblScope unit="page" from="2074" to="2086" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Working memory capacity as a predictor of multiple text comprehension</title>
		<author>
			<persName><forename type="first">L</forename><surname>Hildenbrand</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Wiley</surname></persName>
		</author>
		<idno type="DOI">10.1080/0163853x.2023.2197690</idno>
	</analytic>
	<monogr>
		<title level="j">Discourse Processes</title>
		<imprint>
			<biblScope unit="volume">60</biblScope>
			<biblScope unit="issue">4-5</biblScope>
			<biblScope unit="page" from="378" to="396" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Interactive Visual Alignment of Medieval Text Versions</title>
		<author>
			<persName><forename type="first">S</forename><surname>Jänicke</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">J</forename><surname>Wrisley</surname></persName>
		</author>
		<ptr target="https://api.semanticscholar.org/CorpusID:39979643" />
	</analytic>
	<monogr>
		<title level="m">2017 IEEE Conference on Visual Analytics Science and Technology (VAST)</title>
				<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="127" to="138" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Exploitation des cognats dans les systèmes d&apos;alignement bi-textuel : architecture et évaluation</title>
		<author>
			<persName><forename type="first">O</forename><surname>Kraif</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Revue TAL : traitement automatique des langues</title>
		<imprint>
			<biblScope unit="volume">42</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="833" to="867" />
			<date type="published" when="2001">2001</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Yet Another Fast, Robust and Open Source Sentence Aligner. Time to Reconsider Sentence Alignment?</title>
		<author>
			<persName><forename type="first">F</forename><surname>Lamraoui</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Langlais</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">XIV Machine Translation Summit</title>
				<meeting><address><addrLine>Nice, France</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="77" to="84" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<title level="m" type="main">Interactive Visualisation of Shakespeare&apos;s Othello</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">S</forename><surname>Laramee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">J</forename><surname>Walton</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Liu</surname></persName>
		</author>
		<ptr target="https://api.semanticscholar.org/CorpusID:70244581" />
		<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
		<respStmt>
			<orgName>Swansea University</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">MA thesis</note>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Bertalign: Improved word embedding-based sentence alignment for Chinese-English parallel corpora of literary texts</title>
		<author>
			<persName><forename type="first">L</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Zhu</surname></persName>
		</author>
		<idno type="DOI">10.1093/llc/fqac089</idno>
	</analytic>
	<monogr>
		<title level="j">Digital Scholarship in the Humanities</title>
		<imprint>
			<biblScope unit="volume">38</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="621" to="634" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Sentence and word alignment on the CRATER project: methods and assessment</title>
		<author>
			<persName><forename type="first">T</forename><surname>Mcenery</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Oakes</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Association for Computational Linguistics Workshop SIG-DAT Workshop</title>
				<meeting>the Association for Computational Linguistics Workshop SIG-DAT Workshop</meeting>
		<imprint>
			<date type="published" when="1995">1995</date>
			<biblScope unit="page" from="104" to="116" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<title/>
		<author>
			<persName><surname>Openai</surname></persName>
		</author>
		<author>
			<persName><surname>Chatgpt</surname></persName>
		</author>
		<ptr target="https://chat.openai.com.2024" />
		<imprint>
			<date>May 13</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Using cognates to align sentences in bilingual corpora</title>
		<author>
			<persName><forename type="first">M</forename><surname>Simard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">F</forename><surname>Foster</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Isabelle</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/1992.tmi-1.7" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Fourth Conference on Theoretical and Methodological Issues in Machine Translation of Natural Languages</title>
				<meeting>the Fourth Conference on Theoretical and Methodological Issues in Machine Translation of Natural Languages<address><addrLine>Montréal, Canada</addrLine></address></meeting>
		<imprint>
			<date type="published" when="1992">1992</date>
			<biblScope unit="page" from="67" to="81" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Vecalign: Improved Sentence Alignment in Linear Time and Space</title>
		<author>
			<persName><forename type="first">B</forename><surname>Thompson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Koehn</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/D19-1136</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)</title>
				<editor>
			<persName><forename type="first">K</forename><surname>Inui</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Jiang</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">V</forename><surname>Ng</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">X</forename><surname>Wan</surname></persName>
		</editor>
		<meeting>the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)<address><addrLine>Hong Kong, China</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="1342" to="1348" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Parallel corpora for medium density languages</title>
		<author>
			<persName><forename type="first">D</forename><surname>Varga</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Halácsy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Kornai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Viktor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Laszlo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>László</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Viktor</surname></persName>
		</author>
		<ptr target="https://api.semanticscholar.org/CorpusID:13133927" />
	</analytic>
	<monogr>
		<title level="m">Recent Advances in Natural Language Processing IV: Selected papers from RANLP</title>
				<imprint>
			<date type="published" when="2005">2005. 2007</date>
			<biblScope unit="page" from="292" to="247" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">EVALIGN: Visual Evaluation of Translation Alignment Models</title>
		<author>
			<persName><forename type="first">T</forename><surname>Yousef</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Heyer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Jänicke</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2023.eacl-demo.31</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations</title>
				<editor>
			<persName><forename type="first">D</forename><surname>Croce</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><surname>Soldaini</surname></persName>
		</editor>
		<meeting>the 17th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations<address><addrLine>Dubrovnik, Croatia</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="277" to="297" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
