<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Fine-Tuning Pre-Trained Language Models for Authorship Attribution of the Pseudo-Dionysian Ars Rhetorica</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Gleb</forename><surname>Schmidt</surname></persName>
							<email>gleb.schmidt@ru.nl</email>
							<affiliation key="aff0">
								<orgName type="institution">Radboud University Nijmegen</orgName>
								<address>
									<addrLine>Erasmusplein 1</addrLine>
									<postCode>6525 HT</postCode>
									<settlement>Nijmegen</settlement>
									<country key="NL">The Netherlands</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Veronica</forename><surname>Vybornaya</surname></persName>
							<affiliation key="aff1">
								<orgName type="institution">Independent scholar</orgName>
								<address>
									<settlement>St. Petersburg</settlement>
									<country key="RU">Russia</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Ivan</forename><forename type="middle">P</forename><surname>Yamshchikov</surname></persName>
							<email>ivan.yamshchikov@thws.de</email>
							<affiliation key="aff2">
								<orgName type="department" key="dep1">CAIRO</orgName>
								<orgName type="department" key="dep2">THWS</orgName>
								<orgName type="institution">Technische Hochschule Würzburg-Schweinfurt</orgName>
								<address>
									<addrLine>Franz-Horn Str. 2</addrLine>
									<postCode>97082</postCode>
									<settlement>Würzburg</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Fine-Tuning Pre-Trained Language Models for Authorship Attribution of the Pseudo-Dionysian Ars Rhetorica</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">BC93B922837BE9EB3A95187A06DF6862</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T19:49+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>pre-trained language models</term>
					<term>authorship attribution</term>
					<term>authorship analysis</term>
					<term>historical languages</term>
					<term>transfer learning</term>
					<term>ancient greek (roman period)</term>
					<term>Ps.-Dionysius&apos;s Ars Rhetorica</term>
					<term>BERT</term>
					<term>RoBERTa</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This paper explores the use of pre-trained language models for Ancient Greek in the context of authorship attribution. The study adopts a two-step approach: first, the models are fine-tuned on a domainspecific corpus using a masked language modeling (MLM) objective; second, based on the fine-tuned model, a classifier is trained to address the authorship attribution task. The analysis centers on a corpus of texts on rhetorical theory from the Second Sophistic period, with particular focus on the Pseudo-Dionysian Ars Rhetorica. The results of the experiment suggest that this approach offers valuable insights into the authorship of ancient texts. Notably, the findings align with some traditional scholarly views on the Ars Rhetorica while also opening the door to reconsidering long-discarded hypotheses about the treatise's internal structure. This study highlights how the integration of natural language processing and classical philology can significantly advance discussions in ancient literary scholarship.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Over the past several years, the application of transformer-based neural networks <ref type="bibr" target="#b50">[51]</ref> has led to significant advancements in many NLP tasks related to historical languages <ref type="bibr" target="#b43">[44,</ref><ref type="bibr" target="#b31">32,</ref><ref type="bibr" target="#b42">43]</ref>. However, unlike in the case of modern languages, where fine-tuning pre-trained transformers for linguistic forensics is very common <ref type="bibr" target="#b13">[14,</ref><ref type="bibr" target="#b47">48,</ref><ref type="bibr" target="#b18">19,</ref><ref type="bibr" target="#b0">1]</ref>, the application of such models for authorship attribution tasks in historical languages remains relatively underexplored, although some excellent seminal studies and surveys have been recently published <ref type="bibr" target="#b44">[45,</ref><ref type="bibr" target="#b14">15,</ref><ref type="bibr" target="#b39">40,</ref><ref type="bibr" target="#b40">41]</ref>. The availability of state-of-the-art pre-trained language models <ref type="bibr" target="#b1">[2,</ref><ref type="bibr" target="#b31">32,</ref><ref type="bibr" target="#b38">39,</ref><ref type="bibr" target="#b53">54]</ref> excelling in multiple downstream tasks suggests that the situation with authorship analysis can be different as well.</p><p>Yamshchikov, Tikhonov, Pantis, Schubert, and Jost <ref type="bibr" target="#b53">[54]</ref> obtained a pre-trained model for Ancient Greek by fine-tuning a Modern Greek BERT model <ref type="bibr" target="#b22">[23]</ref>. The resulting model subsequently served as the backbone for a classifier and proved effective for authorship attribution of the so-called Pseudo-Plutarch corpus. Interestingly, despite being fine-tuned on a limited amount of Ancient Greek data, the model obtained through transfer learning showed results comparable to those of the models trained from scratch on significantly larger corpora, as reported by Singh, Rutten, and Lefever <ref type="bibr" target="#b38">[39]</ref> and Riemenschneider and Frank <ref type="bibr" target="#b31">[32]</ref>. Drawing inspiration from Yamshchikov, Tikhonov, Pantis, Schubert, and Jost <ref type="bibr" target="#b53">[54]</ref>, this study experiments with a similar approach focusing on the works of late Greek rhetoricians.</p><p>Greek prose on rhetorical theory from the period known as the Second Sophistic serves as a crucial source, documenting the cultural and intellectual framework of Greek thought and literature in the first centuries AD <ref type="bibr" target="#b8">[9,</ref><ref type="bibr" target="#b5">6,</ref><ref type="bibr" target="#b19">20,</ref><ref type="bibr" target="#b4">5]</ref>. However, the study of this extensive corpus of texts, collectively referred to under the broad concept of Rhetores Graeci <ref type="bibr" target="#b51">[52,</ref><ref type="bibr" target="#b41">42]</ref>, is significantly complicated by endless controversies surrounding authorship, dating, and contextual factors <ref type="bibr" target="#b20">[21]</ref>.</p><p>In this paper, we explore the potential of a transformer-based models, fine-tuned for sequence classification task, to provide further insights into the debate.</p><p>The focal point of our study is the text conventionally referred to as the Ars Rhetorica (Art of Rhetoric, hereafter ars). This work has long been attributed to, and frequently published under the name of, the rhetorician and historian Dionysius of Halicarnassus (ca. 60-7 BC). However, Sadée <ref type="bibr" target="#b35">[36]</ref>, followed by Usener <ref type="bibr" target="#b48">[49]</ref> and Usener and Radermacher <ref type="bibr" target="#b49">[50]</ref>, demonstrated that the ars most likely circulated anonymously, with its association to Dionysius emerging from a much later conjecture. This conjecture appears to have been based on an overinterpretation of a scholion (a marginal commentary) on chapter 10 of the text.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Ars Rhetorica</head><p>Several aspects of the ars must be discussed in the context of statistical modelling of its writing style.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Not one, but multiple works</head><p>The text has a complex structure. In Parisinus Graecus 1741 <ref type="bibr" target="#b29">[30]</ref>, the only manuscript that preserves all the material associated with the ars (ff. , the text is divided into 11 chapters. However, these chapters do not constitute a homogeneous work, as the text is generally understood to consist of two <ref type="bibr" target="#b17">[18]</ref>, three <ref type="bibr" target="#b48">[49,</ref><ref type="bibr" target="#b49">50,</ref><ref type="bibr" target="#b34">35,</ref><ref type="bibr" target="#b32">33]</ref>, or even four <ref type="bibr" target="#b37">[38]</ref> distinct parts.</p><p>The first part, covering ch. 1-7, provides concise instruction on ceremonial (epideictic) oratory, addressing seven epideictic genres. These chapters are connected by cross-references and recurring addresses to the author's former pupil, Echecrates, to whom the text is presented as a wedding gift. The remainder of the text, ch. 8-11, may be interpreted as a combination of two or three distinct works on separate topics. Ch. 8-9 explore the so-called "figured speeches", i.e., speeches intended to convey a hidden meaning that may conflict with the literal content and stated purpose of the speech, while ch. 10-11 focus on the criticism of declamation. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Authorship</head><p>Ch. 1-7 exhibit a consistent compositional pattern and a recognizable writing style, suggesting they were authored by the same rhetorician. However, whether these chapters form a coherent and complete treatise is a matter of debate. This portion of the ars has been interpreted as a collection of distinct letters or essays <ref type="bibr" target="#b37">[38,</ref><ref type="bibr" target="#b17">18]</ref>, as remnants or excerpts from a much longer work <ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b48">49,</ref><ref type="bibr" target="#b49">50]</ref>, or as a unified treatise <ref type="bibr" target="#b52">[53,</ref><ref type="bibr" target="#b46">47,</ref><ref type="bibr" target="#b21">22]</ref>. For ch. 8-11, the situation is even more ambiguous. Usener <ref type="bibr" target="#b48">[49]</ref> speculated that ch. 8-9 were written by two different disciples attending separate lectures of the same teacher. Penndorf <ref type="bibr" target="#b27">[28]</ref> and Schöpsdau <ref type="bibr" target="#b36">[37]</ref> rejected the idea that ch. 8-9 had a single author, suggesting instead that these texts drew from various sources. Similarly, ch. 10-11 have been attributed either to the same author as ch. 8-9 (with Heath <ref type="bibr" target="#b17">[18]</ref> tentatively identifying him as Sarapion Aelius, a 2nd-century Alexandrian rhetorician whose entire corpus is lost) or to two different authors unrelated to the rest of the ars. Table <ref type="table" target="#tab_0">1</ref> summarizes the content and authorship hypotheses for the various sections of the ars.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">Ars Rhetorica, Menanderian Corpus, and Pseudo-Hermogenes' On Method</head><p>Since the early days of scholarship on the Ars Rhetorica, it has been noted that the rhetorical instruction provided in ch. 1-7 and ch. 8-11 shows a clear methodological afÏnity with, respectively, the treatises ascribed to Menander Rhetor (particularly the second one) and Pseudo-Hermogenes' On Method. The parallels with the second treatise attributed to Menander are especially noteworthy. In both works:</p><p>• the occasion -rather than the subject, as in the first treatise attributed to Menanderdetermines the genre; • a very similar selection of genres is discussed (of the seven genres mentioned in the ars, only two are absent from Menander's purported work; see Table <ref type="table">7</ref>); • the author addresses a former disciple throughout the text. This afÏnity led Heath <ref type="bibr" target="#b17">[18]</ref> to describe the ars as "comparable to, though less sophisticated than" Menander's work.</p><p>The numerous parallels between ch. 8-11 of the ars and Pseudo-Hermogenes' On Method <ref type="bibr" target="#b28">[29,</ref><ref type="bibr" target="#b17">18]</ref> have led scholars to hypothesize either a shared source <ref type="bibr" target="#b28">[29]</ref> or a closer, albeit indirect, connection <ref type="bibr" target="#b34">[35,</ref><ref type="bibr" target="#b17">18]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4.">Dates</head><p>The dates of the texts constituting the ars have been assessed differently. For ch. 1-7, a mention of the 2nd-century sophist Nicostratus (ch. 2, par. 9, p. 266, l. 14), along with the considerable focus on speeches addressing Roman magistrates, suggests a composition date no earlier than the High Empire <ref type="bibr" target="#b34">[35,</ref><ref type="bibr" target="#b21">22]</ref>. Race <ref type="bibr" target="#b30">[31]</ref> posits that the first part of the ars is roughly contemporary with the corpus attributed to Menander Rhetor, which is datable to the late 3rd century AD. In contrast, ch. 8-11 may be a century earlier <ref type="bibr" target="#b17">[18]</ref>, i.e., 2nd century AD.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Aims</head><p>The hypotheses concerning the authorship of different parts of the ars have multiplied, as have suggestions regarding its potential relationship with other texts. However, the evidence presented in the scholarship so far is drawn almost exclusively from close reading and remains inconclusive. Additionally, unlike the case with ch. 8-11, no efforts have been made to identify the author responsible for ch. 1-7.</p><p>The aim of our investigation, therefore, is to apply modern natural language processing techniques to this rich textual material in order to gather new evidence about the structure of the ars and gain further insights into its authorship. The arguments formulated through language modeling could provide a novel and valuable contribution to the debate, particularly when considered alongside the accumulated philological evidence and existing codicological indications.</p><p>The main contributions of this work can be summarized as follows:</p><p>• We further fine-tune two pre-trained models for Ancient Greek and one model for Modern Greek on a corpus of Greek rhetoricians. We subsequently use the resulting models to train "open set" 10-class classifiers capable of attributing short fragments of text to different authors of the Second Sophistic period; • Analyzing in more details the results provided by two best-performing models, we shed light on the history of the Pseudo-Dionysian ars, suggesting that:</p><p>-Ch. 1-7 of the ars could have been authored by an individual from the same school as the author(s) of the Menandrian treatises; -Ch. 8-11 not only differ in authorship from ch. 1-7, but may have been written by two distinct individuals, one responsible for ch. 8-9 and another for ch. 10-11.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Corpus</head><p>The primary focus of our study is a corpus comprising at least 18 rhetores Graeci from the 1st-4th centuries AD and Dionysius of Halicarnassus. We only retained the authors whose teachings are relatively well-preserved, excluding those known only through fragmentary or indirect evidence. Importantly, we focus exclusively on rhetorical theory, i.e., works with a theoretical or pedagogical intent. A significant limitation of this dataset is that many of the rhetorical corpora within it have notorious attribution problems of their own. In particular, there is a compelling case for the heterogeneity of the Hermogenean corpus (see <ref type="bibr">Section 6)</ref>. Similarly, the question of whether both treatises attributed to Menander were authored by the same person remains unresolved <ref type="bibr" target="#b33">[34,</ref><ref type="bibr" target="#b16">17,</ref><ref type="bibr" target="#b30">31,</ref><ref type="bibr" target="#b7">8]</ref>. Other corpora raise similar questions, too <ref type="bibr" target="#b20">[21]</ref>. Being aware of this and currently working on a follow-up authorship verification study of these corpora (the importance of which was also insightfully emphasized by the reviewers of this work), for simplicity, we continue to group the studied texts by authorship as categorized in the Thesaurus Linguae Graecae (TLG), where our dataset stems from. <ref type="foot" target="#foot_0">1</ref> We deem this simplification legitimate. In most cases, these questionable attributions are rooted in long-standing traditions that date back to the early stages of textual transmission. For example, the Hermogenean corpus has been consistently attributed to Hermogenes of Tarsus since as early as the 5th century (for more details see Section 6). Therefore, with all necessary reservations, these conventional groupings can be considered to represent at least some kind of connection. Even if they do not link texts written by the same individual, they may still group works originating from the same school. After all, this is why such simplification is commonly used in scholarship.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">&lt;UNK&gt; category</head><p>The literature on oratory theory was undoubtedly much richer than what has been preserved. To account for this, we created an "open set" scenario. For this purpose, we set aside 9 smaller authorial corpora -those with a number of sentences below the dataset's median value of 517 (marked with * in Table <ref type="table" target="#tab_1">2</ref>). These texts were excluded from the dataset before our conventional 80/10/10 split and later added to the the test set. The idea is straightforward. If at the test stage the model encounters a text that does not belong to any of the authorial classes learned during the training, it is likely that the calibrated probability associated with the top prediction will be relatively low. If it falls below a certain threshold, the model is programmed to abstain from making a decision and assign an &lt;UNK&gt; label to the text in question. The samples with &lt;UNK&gt; label in the test split are necessary to monitor the model's capability to do.</p><p>An overview of the classification dataset is presented in the Table <ref type="table" target="#tab_1">2</ref>. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Methodology</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.">Base Transformers</head><p>To train our classifiers, we used three different pre-trained transformers as starting points:</p><p>(1) RoBERTa-sized GreBerta presented by Riemenschneider and Frank <ref type="bibr" target="#b31">[32]</ref>,</p><p>(2) Ancient Greek BERT trained by Singh, Rutten, and Lefever <ref type="bibr" target="#b38">[39]</ref> (3) Modern Greek BERT published by Koutsikakis, Chalkidis, Malakasiotis, and Androutsopoulos <ref type="bibr" target="#b22">[23]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2.">Masked Language Modeling Fine-Tuning</head><p>Fine-tuning pre-trained models on domain-specific corpora prior to further tuning them for a downstream task at hand is a common practice in NLP. It allows the model to adapt better to the unique linguistic features of the target domain. This intermediate step may enhance the model's ability to capture specific syntactical patterns and vocabulary, which in turn improves the performance on the final downstream task, such as classification. For this reason, before training classifiers for authorship attribution, we ran training with a masked language modeling objective. BERT-sized models were trained for 3 epochs with a learning rate 1 × 10 −5 and warmup during the first 10% of training steps. RoBERTa-based model was trained for 1 epoch only with a learning rate 1 × 10 −4 and without warmup steps. In both scenarios, the learning rate was decreasing linearly.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.3.">Sequence Classification</head><p>Authorship classifiers were trained on both out-of-the-box models and their MLM-fine-tuned versions. We employed a sliding window technique to segment the texts into chunks. The process was as follows:</p><p>1. Tokenization: We used the bowphs/GreBerta tokenizer to convert the entire corpus into tokens.<ref type="foot" target="#foot_1">2</ref> 2. Chunking: The tokenized corpus was then divided into chunks of 64 tokens, respecting the boundaries of works (and even chapters -in the case of the ars); 3. Overlap: To ensure continuity and capture context that might span chunk boundaries, we implemented an overlap between chunks. Each chunk overlapped with its adjacent chunks by 32 tokens (half of the chunk length). 4. Decoding: Finally, we decoded these token chunks back into text, resulting in our training data segments. By using a single tokenizer to chunk the entire corpus beforehand instead of splitting the texts with a tokenizer of the corresponding model, we ensured that all models were trained on the same segments of text.</p><p>The training was carried out for 700 steps by sampling batches containing 4 chunks per authorial class. Validation set was checked each 350 steps, i.e., twice during the training. Test set including &lt;UNK&gt;-labelled samples was checked upon the end of training. We report the results obtained on the test set.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Results and Discussion</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.1.">General Performance</head><p>Table <ref type="table">3</ref> summarizes the overall performance of the classifiers. Notably, additional MLM training proved beneficial only for the RoBERTa-sized bowphs/GreBerta model. For BERT-sized models, however, the inclusion of new data was detrimental. bowphs/GreBerta appears to be more stable, behaving more like general-purpose language models trained for well-resourced modern languages. This stands to reason: out of the three models with which we experimented, bowphs/GreBerta <ref type="bibr" target="#b31">[32]</ref> is the largest and was trained on the riches and highest-quality Ancient Greek corpus.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.2.">Authorship Attribution of the ars</head><p>The aim of this study was to get some fresh evidence about the authorship of the pseudo-Dionysian ars, a precious witness to the development of rhetorical theory during the High to Late Roman empire. Based on the status quaestionis surveyed in the section 2, we set up 3 research questions:</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 3</head><p>Performance metrics on the test split with the &lt;UNK&gt; category (not represented in the training data). The models were configured to assign &lt;UNK&gt; to samples with a calibrated top probability below 80%. (R) denotes models fine-tuned with an MLM objective on the same data that was used to train the classifiers.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Model</head><p>F1 Score Accuracy 1. Can we further comfort or challenge the existing consensus opinion, according to which the attribution to Dionysius of Halicarnassus is incorrect? 2. How many works are discernible in the ars in the form we know it? 3. Can the model convincingly suggest an alternative attribution for the ars or any of its parts?</p><p>To address these questions, we applied the trained classifier to individual chapters of the ars, split into chunks following the described procedure. Table <ref type="table" target="#tab_3">4</ref> summarizes the predictions made by the best-performing BERT-sized and RoBERTa-sized models. <ref type="foot" target="#foot_2">3</ref> For each chapter, we report the "majority vote" (i.e., the number of chunks in the chapter attributed to a given author), the author's "share" (i.e., the proportion of chunks assigned to that author in the total number of chapter chunks), and the mean probability of the author across the chunks of the chapter. In the "majority vote", the attribution is defined by the top probability even if it falls below 80%.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.3.">No trace of Dionysius of Halicarnassus</head><p>In line with previous scholarship, although the name of Dionysius of Halicarnassus appears among the attributions, its weight is insignificant. Therefore, with regard to the first of the research questions, the evidence is overwhelming: stylistic afÏnity with Dionysius of Halicarnassus's writings is scarce, and the attribution to him cannot be supported by any of the two models.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.4.">ars's association to the Menandrean corpus further strengthened</head><p>Apart from this rather predictable conclusion, our classifiers yield new insights into more complicated questions concerning the inner structure of the ars and the authorship of the texts, which constitute it. As clear from the Table <ref type="table" target="#tab_3">4</ref>, the attribution profiles for ch. 1-7 and 8-11 are drastically different. Even when the probability is not high enough, Menander Rhetor is the top-ranked candidate in ch. 1-7. The signal is less clear in ch. 8-11. This difference goes in line with the communis opinio that the work is composite: a nearly identical attribution profile of ch. 1-7 being yet another argument in favour of its unity.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.5.">What does the model learn?</head><p>For the sake of explainability, DH specialists still widely use the bag-of-words model and corpus-specific manual feature engineering for various tasks involving writing style analysis, such as authorship attribution, authorship and self-authorship verification, clustering, etc. <ref type="bibr" target="#b12">[13,</ref><ref type="bibr" target="#b11">12,</ref><ref type="bibr" target="#b23">24,</ref><ref type="bibr" target="#b2">3]</ref>. Since deep learning methods lack this level of transparency, understanding exactly what our classifier learned is crucial. A thorough investigation of this matter will be the subject of a separate study, using explainable AI techniques such as integrated gradients and token attribution. Here, we limit our discussion to one insightful example, which seems to illustrate how the model works.</p><p>As previously mentioned in Section 2, all the genres addressed in ch. 2-5 are also discussed in the second treatise attributed to Menander Rhetor. Only the most prestigious of the epideictic genres, the panegyric -focused on in ch. 1 and 7 -does not correspond to any section in Menander's works. However, ch. 1, which provides introductory notes on panegyrics, often echoes the examples and some wording of the first treatise by Menander. Ch. 1 offers guidelines on how to appropriately praise gods ("leaders and name-givers of any festival"), cities where the festivals take place, and emperors who organize and preside over the festivals. All these topics are covered in Menander's first treatise.</p><p>Considering only ch. 2-5 or the fragments of ch. 1 that have clear parallels in Menander's work, one might argue that the classifier's decision was biased due to the significant content and semantic overlap, especially since such a tendency has been reported about the BERT-based classifiers <ref type="bibr" target="#b6">[7]</ref>. However, the consistency of the attribution profile across the chapters by both models is reassuring, as it suggests that they capture more than just semantics.</p><p>Menandrean association appears all the stronger when the values for the logical subdivisions of the ars, ch. 1-7, are calculated. As Korenjak <ref type="bibr" target="#b21">[22]</ref> has shown, in its current form, the order of the chapters is disorganized, and it is possible that the author intended to arrange them as follows: chapters 1 and 7 (panegyrics or appraisal speeches), chapters 2-4 (speeches related to</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 7</head><p>Content overlap between the ars and the second treatise attributed to Menander. Chapter division for Menander's treatise follows Race <ref type="bibr" target="#b30">[31]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Menander Rhetor</head><p>Treatise II ars 5, 6 2, 4 7 3 9 5 8, 10, <ref type="bibr">15 6</ref> family life occasions), and chapters 5-6 (speeches addressed to ofÏcials and epitaphs). In each of these sections, Menander maintains a stable leadership (Tables <ref type="table" target="#tab_5">5 and 6</ref>).</p><p>ars, ch. 8-11: multiple authors? The discrepancy between the attribution profiles of ch. 8-9 and ch. 10-11 might suggest a division, albeit a less distinct one, than ch. 1-7 versus ch. 8-11. This result aligns with the assessment made by Usener <ref type="bibr" target="#b48">[49]</ref>, although it does not provide any further hint at the identity of the possible author. However, the opposite hypothesis should still be considered seriously. In ch. 1-7, top two single attributions (i.e., Menander Rhetor and another author) in terms of "share" would cover at least 0.58-0.68 of the attributed chunks (ch. 7). In contrast, the top two attributions in ch. 8-11 provide, at best, 0.58-0.62 of the attributions (ch. 11 and 10), the attributions are more evenly distributed. Apparently, among the author classes present in our dataset, none is stylistically similar enough to the text of ch.8-11. This can be explained in two ways. Texts written in a comparable style are either completely absent from the dataset or are not appropriately distributed among author groups, making it challenging for the model to learn the features of this particular writing style. Keeping in mind the existing hypothesis about the relationship between the so-called Hermogenean canon and the works ascribed to Apsines, with extreme caution, we incline to the latter explanation.</p><p>Two works, which are part of the Hermogenean canon, On Invention and On Method, were already in Late Antiquity associated with the name of Hermogenes. In our dataset, therefore, following the TLG, we reproduce this conventional attribution. Yet, both are most likely inauthentic <ref type="bibr" target="#b9">[10,</ref><ref type="bibr" target="#b10">11,</ref><ref type="bibr" target="#b24">25,</ref><ref type="bibr" target="#b25">26,</ref><ref type="bibr" target="#b26">27]</ref>. If the argumentation presented by Heath <ref type="bibr" target="#b15">[16,</ref><ref type="bibr" target="#b16">17]</ref> proves correct and these two texts can securely be ascribed to Apsines, the "new" writing style they would represent might possibly demonstrate a more pronounced afÏnity with the style of ch. 8-11. This and similar possibilities should be thoroughly checked in further experiments.</p><p>The scope of the much-needed detailed follow-up study becomes evident. A systematic and critical reassessment of attribution problems within the corpus of the Rhetores Graeci is necessary. Beyond merely reflecting on the attributions of individual works, it is important to establish the homogeneity of different rhetorical corpora within the framework of a pairwise authorship verification study.</p><p>But if we set aside the obscure case of ch. 8-11, should we conclude that ch. 1-7 were written by Menander Rhetor? Given the aforementioned limitations of our dataset, we would not go that far. However, our results suggest that the connection between the first part of the Pseudo-Dionysian ars and the Menandrean corpus likely extends beyond a theoretical afÏnity. Despite the obvious terminological discrepancies between the texts and their different levels of elaboration, the possibility of multiple authorship within the same school, or even common authorship, should be considered with all seriousness. The divergence between the ars, ch. 1-7, and the Menandrean corpus can also be explained, apart from the natural evolution of personal style and preferences, by the likelihood that those presenting complex rhetorical theory would probably follow the advice formulated by the author of ch. 11. The art of rhetoric involves presenting material in a way that convinces the audience. Thus, orators are similar to doctors who must not only select the right medication but also administer it in a manner acceptable to the patient [50, ch. 11, par. 9, p. 385, ll. <ref type="bibr" target="#b6">[7]</ref><ref type="bibr" target="#b7">[8]</ref><ref type="bibr" target="#b8">[9]</ref><ref type="bibr" target="#b9">[10]</ref><ref type="bibr" target="#b10">[11]</ref><ref type="bibr" target="#b11">[12]</ref>. In other words, multiple contextual factors influenced the style of the presentation, and, in the cases when the stylistic afÏnity is clear, one should not probably overinterpret isolated differences.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.">Conclusion</head><p>This study uses transformer-based models to analyze ancient rhetorical texts for authorship attribution in classical philology. First, we adapted these models to handle the linguistic nuances of Ancient Greek texts from the 1st to the 4th century AD using masked language modeling. We then apply the fine-tuned models to identify authorship markers in Ars Rhetorica, a text possibly written by multiple ancient writers. This application not only reminds of benefits of modern AI techniques to classical studies but also deepens our understanding of ancient literary compositions through modern computational methods.</p><p>The results of BERT and RoBERTa classifiers do not support connection of the ars to Dionysius of Halicarnassus, going in line with the previous studies that question his authorship. They also strengthen the link of ars to the Menandrean corpus, particularly evident in the distinct attribution profiles between chapters 1-7 and 8-11, which suggests a composite nature of the work.</p><p>Despite the lack of transparency of MLM techniques compared to conventional methods, which prioritize human-interpretable features, the effectiveness and relevance of machine learning methods is noteworthy.</p><p>While neural networks are often criticized in digital humanities for their black-box nature <ref type="bibr" target="#b11">[12]</ref>, their ability to detect writing styles make them a valuable tool in the field of digital humanities. The use of these models promises significant advancements in authorship attribution and our understanding of ancient literary works.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="8.">Limitations</head><p>This study has several limitations that should be considered when interpreting the results.</p><p>Firstly, the issue of disputed authorship within the dataset is a significant challenge. For instance, the Hermogenean corpus and Menandrean treatises, both central to our analysis, have long-standing debates regarding their true authorship, see Section 4. These uncertainties could affect the attribution accuracy. We are currently working on a study intended to solve this issue, adopting an authorship verification approach.</p><p>Secondly, the use of transformer-based models like BERT and RoBERTa, come with limitations related to their opaque nature. The lack of interpretability in these models means that understanding the specific features and patterns the models use to make attributions is challenging. This limits our ability to provide a transparent rationale for the models' decisions, which is often critical in digital humanities research. Yet, the attempts were made to find way to make the results of pre-trained language models more interpretable, e.g., by means of the so-called integrated gradients <ref type="bibr" target="#b45">[46]</ref>. These methods can perhaps be adapted for cases similar to ours.</p><p>Despite achieving notable accuracies with relatively short chunks (64 tokens), the models' performance still leaves room for improvement, particularly in terms of handling unbalanced corpora and downplaying the influence of the thematic clues. Nevertheless, their performance, comparable to state-of-the-art results for modern languages, demonstrates an ability to capture writing style. There clearly are instances where the models are overly confident, leading to incorrect authorship attribution. These errors could arise from factors such as the models' sensitivity to stylistic nuances and the complexity of the texts. Embracing more sophisticated methodologies for uncertainty-aware training would be an interesting avenue for further exploration.</p><p>Another potential avenue for future research is the development of chronological and regional classifiers. Texts from different regions and periods may exhibit unique linguistic and stylistic features that are not captured by a generalized model. Developing classifiers specific to historical periods or geographical (and cultural) areas could enhance attribution accuracy and offer more detailed insights into the ars and many other texts.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Themes addressed in the Ars Rhetorica and alleged authorship of its different parts. Each Roman number stands for one author. II-III means that the section might have been written by two different persons.</figDesc><table><row><cell>ars</cell><cell>Theme</cell><cell>Author</cell></row><row><cell>1</cell><cell>Panegyrics</cell><cell></cell></row><row><cell>2</cell><cell>Marriage speeches</cell><cell></cell></row><row><cell>3</cell><cell>Birthday speeches</cell><cell></cell></row><row><cell>4</cell><cell>Epithalamium</cell><cell>I</cell></row><row><cell>5</cell><cell>Addresses</cell><cell></cell></row><row><cell>6</cell><cell>Funeral speeches</cell><cell></cell></row><row><cell>7</cell><cell>Exhortations to athletes</cell><cell></cell></row><row><cell>8 9</cell><cell>"Figured speeches"</cell><cell>I or II or II-III</cell></row><row><cell cols="3">10 Criticism of declamations I or II or IV or IV-V 11</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2</head><label>2</label><figDesc>Classification dataset. Texts by authors marked with * were grouped under the &lt;UNK&gt; label. This label is present only in the test data to evaluate the model's ability to deal with uncertainty in an "open set" scenario.</figDesc><table><row><cell>Name</cell><cell>TLG</cell><cell>Date</cell><cell>Location</cell></row><row><cell>Aelius Aristides</cell><cell>284</cell><cell>II AD</cell><cell>Mysia</cell></row><row><cell>Aelius Herodianus &amp; Pseudo-Herodianus</cell><cell>87</cell><cell>II AD</cell><cell>Alexandria</cell></row><row><cell>Aelius Theon*</cell><cell>607</cell><cell>I-II AD</cell><cell>Alexandria</cell></row><row><cell>Alciphron</cell><cell>640</cell><cell>II-III AD</cell><cell>Unknown</cell></row><row><cell>Alexander*</cell><cell>594</cell><cell>1st half II AD</cell><cell>Unknown</cell></row><row><cell>Anonymus Seguerianus*</cell><cell cols="2">2002 1st half III AD</cell><cell>Unknown</cell></row><row><cell>Cassius Longinus*</cell><cell>2178</cell><cell>mid III AD</cell><cell>Athens</cell></row><row><cell>Demetrius</cell><cell>613</cell><cell>I AD</cell><cell>Unknown</cell></row><row><cell>Dionysius Halicarnasseus</cell><cell>81</cell><cell>I BC</cell><cell>Halicarnassus</cell></row><row><cell>Eudemus</cell><cell>1376</cell><cell>II AD</cell><cell>Argos</cell></row><row><cell>Hermogenes</cell><cell>592</cell><cell>II-III AD</cell><cell>Tarsus</cell></row><row><cell>Lesbonax*</cell><cell>649</cell><cell>II AD</cell><cell>Miletus</cell></row><row><cell>Longinus*</cell><cell>560</cell><cell>I AD</cell><cell>Unknown</cell></row><row><cell>Marcus Cornelius Fronto*</cell><cell>186</cell><cell>II AD</cell><cell>Numidia</cell></row><row><cell>Menander</cell><cell>2586</cell><cell>III-IV AD</cell><cell>Laodicea</cell></row><row><cell>Minucianus Junior*</cell><cell>2903</cell><cell>III AD</cell><cell>Athens</cell></row><row><cell>Polyaenus</cell><cell>616</cell><cell>II AD</cell><cell>Macedonia</cell></row><row><cell>Polybius*</cell><cell>605</cell><cell>II AD</cell><cell>Sardis</cell></row><row><cell>Valerius Apsines</cell><cell>2027</cell><cell>III AD</cell><cell>Athens</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 4 "</head><label>4</label><figDesc>Majority vote", share, and mean prediction probability for each chapter of the ars: Ancient Greek BERT vs GreBerta (R). "Rest" stands for the sum of all minor attributions. Sorted by the mean prediction probability.</figDesc><table><row><cell></cell><cell cols="4">pranaydeeps/Ancient-Greek-BERT</cell><cell cols="3">bowphs/GreBerta (R)</cell><cell></cell></row><row><cell>Ch.</cell><cell>Author</cell><cell>Vote</cell><cell>Share</cell><cell>Prob.</cell><cell>Author</cell><cell>Count</cell><cell>Share</cell><cell>Prob.</cell></row><row><cell></cell><cell>Menander</cell><cell>32</cell><cell>0.70</cell><cell>66.59</cell><cell>Menander</cell><cell>39</cell><cell>0.85</cell><cell>79.21</cell></row><row><cell>1</cell><cell>Aelius Aristides Dionysius H.</cell><cell>5 3</cell><cell>0.11 0.07</cell><cell>12.51 6.80</cell><cell>Aelius Aristides Dionysius H.</cell><cell>3 3</cell><cell>0.07 0.07</cell><cell>7.39 6.57</cell></row><row><cell></cell><cell>Rest</cell><cell>6</cell><cell>0.13</cell><cell>14.10</cell><cell>Rest</cell><cell>1</cell><cell>0.02</cell><cell>6.83</cell></row><row><cell></cell><cell>Menander</cell><cell>30</cell><cell>0.59</cell><cell>57.46</cell><cell>Menander</cell><cell>36</cell><cell>0.71</cell><cell>68.80</cell></row><row><cell>2</cell><cell>Dionysius H. Aelius Aristides</cell><cell>9 9</cell><cell>0.18 0.18</cell><cell>15.20 14.48</cell><cell>Aelius Aristides Dionysius H.</cell><cell>5 6</cell><cell>0.10 0.12</cell><cell>9.32 9.23</cell></row><row><cell></cell><cell>Rest</cell><cell>3</cell><cell>0.06</cell><cell>12.87</cell><cell>Rest</cell><cell>4</cell><cell>0.08</cell><cell>12.66</cell></row><row><cell></cell><cell>Menander</cell><cell>25</cell><cell>0.96</cell><cell>94.13</cell><cell>Menander</cell><cell>25</cell><cell>0.96</cell><cell>89.75</cell></row><row><cell>3</cell><cell>Dionysius H. Hermogenes</cell><cell>1 0</cell><cell>0.04 0.00</cell><cell>2.05 1.43</cell><cell>Hermogenes Dionysius H.</cell><cell>1 0</cell><cell>0.04 0.00</cell><cell>4.68 2.10</cell></row><row><cell></cell><cell>Rest</cell><cell>0</cell><cell>0.00</cell><cell>2.40</cell><cell>Rest</cell><cell>0</cell><cell>0.00</cell><cell>3.47</cell></row><row><cell></cell><cell>Menander</cell><cell>15</cell><cell>0.79</cell><cell>69.45</cell><cell>Menander</cell><cell>16</cell><cell>0.84</cell><cell>79.77</cell></row><row><cell>4</cell><cell>Hermogenes Aelius Aristides</cell><cell>3 1</cell><cell>0.16 0.05</cell><cell>11.68 6.42</cell><cell>Aelius Aristides Demetrius</cell><cell>2 1</cell><cell>0.11 0.05</cell><cell>8.99 5.43</cell></row><row><cell></cell><cell>Rest</cell><cell>0</cell><cell>0.00</cell><cell>12.45</cell><cell>Rest</cell><cell>0</cell><cell>0.00</cell><cell>5.81</cell></row><row><cell></cell><cell>Menander</cell><cell>21</cell><cell>0.49</cell><cell>48.00</cell><cell>Menander</cell><cell>28</cell><cell>0.65</cell><cell>60.73</cell></row><row><cell>5</cell><cell>Aelius Aristides Dionysius H.</cell><cell>12 3</cell><cell>0.28 0.07</cell><cell>25.16 7.34</cell><cell>Aelius Aristides Hermogenes</cell><cell>11 2</cell><cell>0.26 0.05</cell><cell>21.61 6.29</cell></row><row><cell></cell><cell>Rest</cell><cell>7</cell><cell>0.16</cell><cell>19.50</cell><cell>Rest</cell><cell>2</cell><cell>0.05</cell><cell>11.37</cell></row><row><cell></cell><cell>Menander</cell><cell>34</cell><cell>0.61</cell><cell>52.98</cell><cell>Menander</cell><cell>34</cell><cell>0.61</cell><cell>56.71</cell></row><row><cell>6</cell><cell>Hermogenes Dionysius H.</cell><cell>9 5</cell><cell>0.16 0.09</cell><cell>15.58 8.78</cell><cell>Valerius Apsines Dionysius H.</cell><cell>6 7</cell><cell>0.11 0.12</cell><cell>12.06 11.11</cell></row><row><cell></cell><cell>Rest</cell><cell>8</cell><cell>0.14</cell><cell>22.66</cell><cell>Rest</cell><cell>9</cell><cell>0.16</cell><cell>20.12</cell></row><row><cell></cell><cell>Menander</cell><cell>31</cell><cell>0.39</cell><cell>37.87</cell><cell>Menander</cell><cell>42</cell><cell>0.53</cell><cell>48.34</cell></row><row><cell>7</cell><cell>Aelius Aristides Dionysius H.</cell><cell>15 12</cell><cell>0.19 0.15</cell><cell>18.39 15.14</cell><cell>Aelius Aristides Valerius Apsines</cell><cell>12 8</cell><cell>0.15 0.10</cell><cell>14.65 10.85</cell></row><row><cell></cell><cell>Rest</cell><cell>21</cell><cell>0.27</cell><cell>28.60</cell><cell>Rest</cell><cell>17</cell><cell>0.22</cell><cell>26.15</cell></row><row><cell></cell><cell>Hermogenes</cell><cell>49</cell><cell>0.21</cell><cell>21.63</cell><cell>Hermogenes</cell><cell>59</cell><cell>0.26</cell><cell>25.45</cell></row><row><cell>8</cell><cell>Valerius Apsines Aelius Aristides</cell><cell>41 45</cell><cell>0.18 0.19</cell><cell>17.19 16.92</cell><cell>Aelius Aristides Dionysius H.</cell><cell>58 49</cell><cell>0.25 0.21</cell><cell>21.86 19.96</cell></row><row><cell></cell><cell>Rest</cell><cell>96</cell><cell>0.42</cell><cell>44.26</cell><cell>Rest</cell><cell>65</cell><cell>0.28</cell><cell>32.73</cell></row><row><cell></cell><cell>Hermogenes</cell><cell>60</cell><cell>0.20</cell><cell>17.95</cell><cell>Aelius Aristides</cell><cell>78</cell><cell>0.26</cell><cell>23.26</cell></row><row><cell>9</cell><cell>Demetrius Aelius Aristides</cell><cell>45 49</cell><cell>0.15 0.16</cell><cell>15.37 14.55</cell><cell>Hermogenes Dionysius H.</cell><cell>51 45</cell><cell>0.17 0.15</cell><cell>17.99 15.03</cell></row><row><cell></cell><cell>Rest</cell><cell>145</cell><cell>0.48</cell><cell>52.14</cell><cell>Rest</cell><cell>125</cell><cell>0.42</cell><cell>43.72</cell></row><row><cell></cell><cell>Hermogenes</cell><cell>43</cell><cell>0.34</cell><cell>30.00</cell><cell>Hermogenes</cell><cell>52</cell><cell>0.42</cell><cell>38.21</cell></row><row><cell>10</cell><cell>Dionysius H. Valerius Apsines</cell><cell>31 17</cell><cell>0.25 0.14</cell><cell>21.65 14.49</cell><cell>Dionysius H. Valerius Apsines</cell><cell>25 16</cell><cell>0.20 0.12</cell><cell>20.63 11.48</cell></row><row><cell></cell><cell>Rest</cell><cell>34</cell><cell>0.27</cell><cell>33.86</cell><cell>Rest</cell><cell>32</cell><cell>0.26</cell><cell>29.68</cell></row><row><cell></cell><cell>Hermogenes</cell><cell>41</cell><cell>0.37</cell><cell>31.65</cell><cell>Hermogenes</cell><cell>34</cell><cell>0.30</cell><cell>28.39</cell></row><row><cell>11</cell><cell>Menander Dionysius H.</cell><cell>23 14</cell><cell>0.21 0.12</cell><cell>20.22 13.48</cell><cell>Menander Dionysius H.</cell><cell>23 21</cell><cell>0.21 0.19</cell><cell>19.30 18.81</cell></row><row><cell></cell><cell>Rest</cell><cell>34</cell><cell>0.30</cell><cell>34.64</cell><cell>Rest</cell><cell>34</cell><cell>0.30</cell><cell>33.51</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_4"><head>Table 5 "</head><label>5</label><figDesc>Majority vote", share, and mean prediction probability for logical subdivisions within the ars, ch. 1-7:</figDesc><table><row><cell>GreBerta (R).</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell></cell><cell>Vote</cell><cell></cell><cell>Share (%)</cell><cell></cell><cell cols="2">Probability</cell><cell></cell></row><row><cell></cell><cell cols="6">1 &amp; 7 2-4 5 &amp; 6 1 &amp; 7 2-4 5 &amp; 6 1 &amp; 7</cell><cell cols="2">2-4 5 &amp; 6</cell></row><row><cell>Menander</cell><cell>81</cell><cell>77</cell><cell>62</cell><cell>0.68 0.83</cell><cell cols="4">0.65 59.70 76.64 58.45</cell></row><row><cell>Aelius Aristides</cell><cell>15</cell><cell>7</cell><cell>13</cell><cell>0.12 0.08</cell><cell cols="2">0.14 11.97</cell><cell cols="2">7.09 11.33</cell></row><row><cell>Hermogenes</cell><cell>7</cell><cell>2</cell><cell>7</cell><cell>0.06 0.02</cell><cell>0.07</cell><cell>7.87</cell><cell>3.77</cell><cell>9.01</cell></row><row><cell>Dionysius Halicarnassensis</cell><cell>9</cell><cell>6</cell><cell>7</cell><cell>0.08 0.06</cell><cell>0.07</cell><cell>7.6</cell><cell>5.7</cell><cell>7.06</cell></row><row><cell>Valerius Apsines</cell><cell>8</cell><cell>1</cell><cell>7</cell><cell>0.07 0.01</cell><cell>0.07</cell><cell>6.40</cell><cell>1.8</cell><cell>8.85</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_5"><head>Table 6 "</head><label>6</label><figDesc>Majority vote", share, and mean prediction probability for logical subdivisions within the ars, ch. 1-7: Ancient GreekBERT.   </figDesc><table><row><cell></cell><cell></cell><cell>Vote</cell><cell></cell><cell>Share (%)</cell><cell></cell><cell cols="2">Probability</cell><cell></cell></row><row><cell></cell><cell cols="6">1 &amp; 7 2-4 5 &amp; 6 1 &amp; 7 2-4 5 &amp; 6 1 &amp; 7</cell><cell cols="2">2-4 5 &amp; 6</cell></row><row><cell>Menander</cell><cell>63</cell><cell>70</cell><cell>55</cell><cell>0.52 0.74</cell><cell cols="4">0.59 48.44 69.76 50.82</cell></row><row><cell>Aelius Aristides</cell><cell>20</cell><cell>10</cell><cell>15</cell><cell>0.17 0.11</cell><cell cols="2">0.16 16.23</cell><cell cols="2">9.22 15.89</cell></row><row><cell>Dionysius Halicarnassensis</cell><cell>15</cell><cell>10</cell><cell>8</cell><cell>0.12 0.11</cell><cell cols="2">0.09 12.07</cell><cell>9.47</cell><cell>7.87</cell></row><row><cell>Hermogenes</cell><cell>12</cell><cell>3</cell><cell>11</cell><cell>0.10 0.03</cell><cell>0.12</cell><cell>9.46</cell><cell cols="2">4.18 11.36</cell></row><row><cell>Valerius Apsines</cell><cell>10</cell><cell>2</cell><cell>4</cell><cell>0.08 0.02</cell><cell>0.04</cell><cell>7.81</cell><cell>3.13</cell><cell>6.49</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">We cannot publish the full texts with all the corresponding metadata. However, the shufÒed chunks used in MLM fine-tuning and subsequent classifier training are made available on GitHub: https://github.com/glsch/rhetores_ graeci.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">We did not repeat the experiment producing chunks with other available tokenizers.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">pranaydeeps/Ancient-Greek-BERT and bowphs/GreBerta (R)</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>We extend their gratitude to Jürgen Jost, Charlotte Schubert, Friedrich Meissner, Caroline Macé, and Mark de Kreij for welcoming this study and future collaboration between machine learning, history, and philology.</p><p>We would also like to thank Ben Nagy and two anonymous reviewers for the careful reading and insightful feedback.</p><p>We thank Shari Boodts and Sven Meeder, Principal Investigators of the ERC Proof of Concept project "ManuscriptAI" and the ERC Consolidator project "SOLEMNE". Without their support, this research would not have been possible.</p></div>
			</div>

			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Online Resources</head><p>The code and both models considered in detail in this study are accessible at:</p><p>• https://huggingface.co/glsch • https://github.com/glsch/rhetores_graeci</p></div>			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">Whodunit? Learning to Contrast for Authorship Attribution</title>
		<author>
			<persName><forename type="first">B</forename><surname>Ai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Tan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Tan</surname></persName>
		</author>
		<idno type="DOI">10.48550/ARXIV.2209.11887</idno>
		<imprint>
			<date type="published" when="2022">2022. Visited on 01/11/2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title level="m" type="main">Latin BERT: A Contextual Language Model for Classical Philology</title>
		<author>
			<persName><forename type="first">D</forename><surname>Bamman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">J</forename><surname>Burns</surname></persName>
		</author>
		<ptr target="https://arxiv.org/abs/2009.10053" />
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">The Elementary Particles: A Computational Stylometric Inquiry into the Mediaeval Greek-Latin Aristotle</title>
		<author>
			<persName><forename type="first">P</forename><surname>Beullens</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Haverals</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Nagy</surname></persName>
		</author>
		<idno type="DOI">10.21071/mijtk.v9i.16723</idno>
	</analytic>
	<monogr>
		<title level="j">Mediterranea. International Journal on the Transfer of Knowledge</title>
		<idno type="ISSN">2445-2378</idno>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="page" from="385" to="408" />
			<date type="published" when="2024-04-05">Apr. 2024. Visited on 05/06/2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<author>
			<persName><forename type="first">F</forename><surname>Blass</surname></persName>
		</author>
		<ptr target="https://books.google.nl/books?id=k3g-AAAAcAAJ" />
		<title level="m">De Dionysii Halicarnassensis scriptis rhetoricis</title>
				<meeting><address><addrLine>Bonn</addrLine></address></meeting>
		<imprint>
			<publisher>Max Cohen et fil</publisher>
			<date type="published" when="1863">1863</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">Paideia: the World of the Second Sophistic</title>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">E</forename><surname>Borg</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2008">2008</date>
			<publisher>de Gruyter</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<title level="m" type="main">Greek sophists in the Roman Empire</title>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">W</forename><surname>Bowersock</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1969">1969</date>
			<publisher>Clarendon Press</publisher>
			<pubPlace>Oxford</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">Rethinking the Authorship Verification Experimental Setups</title>
		<author>
			<persName><forename type="first">F</forename><surname>Brad</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Manolache</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Burceanu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Barbalau</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Ionescu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Popescu</surname></persName>
		</author>
		<ptr target="https://arxiv.org/abs/2112.05125" />
		<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<author>
			<persName><forename type="first">K</forename><surname>Brodersen</surname></persName>
		</author>
		<title level="m">Menandros. Abhandlungen zur Rhetorik. ger grc</title>
				<meeting><address><addrLine>Stuttgart</addrLine></address></meeting>
		<imprint>
			<publisher>Anton Hiersemann</publisher>
			<date type="published" when="2019">2019</date>
			<biblScope unit="volume">88</biblScope>
		</imprint>
	</monogr>
	<note>Bibliothek der griechischen Literatur</note>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">C</forename><surname>Burgess</surname></persName>
		</author>
		<title level="m">Epideictic literature</title>
				<imprint>
			<date type="published" when="1902">1902</date>
			<biblScope unit="volume">3</biblScope>
		</imprint>
		<respStmt>
			<orgName>University of Michigan Library</orgName>
		</respStmt>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Ist die dem Hermogenes zugeschriebene Schrift Περὶ μεθόδου δεινότητος echt? I</title>
		<author>
			<persName><forename type="first">E</forename><surname>Bürgi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Wiener Studien</title>
		<imprint>
			<biblScope unit="volume">48</biblScope>
			<biblScope unit="page" from="187" to="197" />
			<date type="published" when="1930">1930</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Ist die dem Hermogenes zugeschriebene Schrift Περὶ μεθόδου δεινότητος echt? II</title>
		<author>
			<persName><forename type="first">E</forename><surname>Bürgi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Wiener Studien</title>
		<imprint>
			<biblScope unit="volume">49</biblScope>
			<biblScope unit="page" from="40" to="69" />
			<date type="published" when="1931">1931</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Twenty-One* Pseudo-Chrysostoms and more: Authorship Verification in the Patristic World</title>
		<author>
			<persName><forename type="first">T</forename><surname>Clérice</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Glaise</surname></persName>
		</author>
		<ptr target="https://inria.hal.science/hal-04211176" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Computational Humanities Research Conference</title>
				<meeting>the Computational Humanities Research Conference</meeting>
		<imprint>
			<date type="published" when="2022-12">2023. 2022. Dec. 2023</date>
		</imprint>
	</monogr>
	<note>Computational Humanities Research Conference</note>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<title level="m" type="main">MedLatinEpi and MedLatinLit: Two Datasets for the Computational Authorship Analysis of Medieval Latin Texts</title>
		<author>
			<persName><forename type="first">S</forename><surname>Corbara</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Moreo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Sebastiani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Tavoni</surname></persName>
		</author>
		<ptr target="http://arxiv.org/abs/2006.12289" />
		<imprint>
			<date type="published" when="2021-02">Sept. 2021. 02/05/2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">BertAA : BERT Fine-Tuning for Authorship Attribution</title>
		<author>
			<persName><forename type="first">M</forename><surname>Fabien</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Villatoro-Tello</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Motlicek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Parida</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/2020.icon-main.16" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 17th International Conference on Natural Language Processing (ICON)</title>
				<editor>
			<persName><forename type="first">P</forename><surname>Bhattacharyya</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Sharma</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">R</forename><surname>Sangal</surname></persName>
		</editor>
		<meeting>the 17th International Conference on Natural Language Processing (ICON)<address><addrLine>Patna, India</addrLine></address></meeting>
		<imprint>
			<publisher>NLP Association of India (NLPAI</publisher>
			<date type="published" when="2020-12">Dec. 2020</date>
			<biblScope unit="page" from="127" to="137" />
		</imprint>
		<respStmt>
			<orgName>Indian Institute of Technology Patna</orgName>
		</respStmt>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Machine Learning and the Future of Philology: A Case Study</title>
		<author>
			<persName><forename type="first">B</forename><surname>Graziosi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Haubold</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Cowen-Breen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Brooks</surname></persName>
		</author>
		<idno type="DOI">10.1353/apa.2023.a901022</idno>
	</analytic>
	<monogr>
		<title level="j">TAPA</title>
		<idno type="ISSN">2575-7199</idno>
		<imprint>
			<biblScope unit="volume">153</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="253" to="284" />
			<date type="published" when="2023-03-10">Mar. 2023. Visited on 10/20/2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Hermogenes&apos; Biographers</title>
		<author>
			<persName><forename type="first">M</forename><surname>Heath</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Eranos</title>
		<imprint>
			<biblScope unit="volume">96</biblScope>
			<biblScope unit="page" from="44" to="54" />
			<date type="published" when="1998">1998</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<title level="m" type="main">Menander: a Rhetor in Context</title>
		<author>
			<persName><forename type="first">M</forename><surname>Heath</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2004">2004</date>
			<publisher>Oxford University Press</publisher>
			<pubPlace>USA</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Pseudo-Dionysius Art of Rhetoric 8-11: Figured speech, Declamation, and Criticism</title>
		<author>
			<persName><forename type="first">M</forename><surname>Heath</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">American Journal of Philology</title>
		<imprint>
			<biblScope unit="volume">124</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="81" to="105" />
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<title level="m" type="main">PART: Pre-Trained Authorship Representation Transformer</title>
		<author>
			<persName><forename type="first">J</forename><surname>Huertas-Tato</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Huertas-Garcia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Martin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Camacho</surname></persName>
		</author>
		<ptr target="http://arxiv.org/abs/2209.15373" />
		<imprint>
			<date type="published" when="2022">Sept. 2022. on 01/11/2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<title level="m" type="main">Greek Rhetoric under Christian Emperors</title>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">A</forename><surname>Kennedy</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2008">2008</date>
			<publisher>Wipf and Stock Publishers</publisher>
			<biblScope unit="volume">3</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Some Recent Controversies in the Study of Later Greek Rhetoric</title>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">A</forename><surname>Kennedy</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">American Journal of Philology</title>
		<imprint>
			<biblScope unit="volume">124</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="295" to="301" />
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Ps.-Dionysius Ars RhetoricaI-VII: One Complete Treatise</title>
		<author>
			<persName><forename type="first">M</forename><surname>Korenjak</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Harvard Studies in Classical Philology</title>
		<imprint>
			<biblScope unit="volume">105</biblScope>
			<biblScope unit="page" from="239" to="254" />
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Greek-BERT: The Greeks Visiting Sesame Street</title>
		<author>
			<persName><forename type="first">J</forename><surname>Koutsikakis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Chalkidis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Malakasiotis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Androutsopoulos</surname></persName>
		</author>
		<idno type="DOI">10.1145/3411408.3411440</idno>
	</analytic>
	<monogr>
		<title level="m">11th Hellenic Conference on Artificial Intelligence. SETN 2020</title>
				<meeting><address><addrLine>Athens, Greece</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computing Machinery</publisher>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="110" to="117" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Authorship Analysis and the Ending of Seven Against Thebes: Aeschylus&apos; Antigone or Updating Adaptation?</title>
		<author>
			<persName><forename type="first">N</forename><surname>Manousakis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Stamatatos</surname></persName>
		</author>
		<idno type="DOI">10.1353/clw.2023.0007</idno>
	</analytic>
	<monogr>
		<title level="j">Classical World</title>
		<idno type="ISSN">1558-9234</idno>
		<imprint>
			<biblScope unit="volume">116</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="247" to="274" />
			<date type="published" when="2023-03-02">Mar. 2023. Visited on 02/01/2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Le De Inventione du Pseudo-Hermogène</title>
		<author>
			<persName><forename type="first">M</forename><surname>Patillon</surname></persName>
		</author>
		<idno type="DOI">10.1515/9783110815146-003</idno>
	</analytic>
	<monogr>
		<title level="m">Aufstieg und Niedergang der römischen Welt</title>
		<title level="s">Einzelne Autoren seit der hadrianischen Zeit und Allgemeines zur Literatur</title>
		<meeting><address><addrLine>Berlin; Boston</addrLine></address></meeting>
		<imprint>
			<publisher>De Gruyter</publisher>
			<date type="published" when="1997">1997</date>
			<biblScope unit="volume">34</biblScope>
			<biblScope unit="page" from="2064" to="2172" />
		</imprint>
	</monogr>
	<note>Teilband Sprache und Literatur</note>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Pseudo-Hermogène, L&apos;Invention</title>
		<author>
			<persName><forename type="first">M</forename><surname>Patillon</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Corpus rhetoricum</title>
		<title level="s">Synopse des exordes. grc fre</title>
		<meeting><address><addrLine>Paris</addrLine></address></meeting>
		<imprint>
			<publisher>Les Belles lettres</publisher>
			<date type="published" when="2012">2012</date>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="page">1</biblScope>
		</imprint>
	</monogr>
	<note>Anonyme</note>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">Pseudo-Hermogène, La méthode de l&apos;habilité</title>
		<author>
			<persName><forename type="first">M</forename><surname>Patillon</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Anonyme, Méthodes des discours d&apos;adresse</title>
		<title level="s">grc fre</title>
		<meeting><address><addrLine>Paris</addrLine></address></meeting>
		<imprint>
			<publisher>Les Belles lettres</publisher>
			<date type="published" when="2014">2014</date>
			<biblScope unit="volume">5</biblScope>
		</imprint>
	</monogr>
	<note>Corpus rhetoricum</note>
</biblStruct>

<biblStruct xml:id="b27">
	<analytic>
		<title level="a" type="main">De sermone figurato quaestio rhetorica</title>
		<author>
			<persName><forename type="first">J</forename><surname>Penndorf</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Leipziger Studien zur classischen Philologie</title>
		<imprint>
			<biblScope unit="volume">20</biblScope>
			<biblScope unit="page" from="169" to="194" />
			<date type="published" when="1902">1902</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<monogr>
		<title level="m" type="main">Hermogenis Opera</title>
		<author>
			<persName><forename type="first">H</forename><surname>Rabe</surname></persName>
		</author>
		<ptr target="https://books.google.nl/books?id=WreAtwEACAAJ" />
		<imprint>
			<date type="published" when="1985">1985</date>
			<publisher>Teubner</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<analytic>
		<title level="a" type="main">Rhetoren-Corpora</title>
		<author>
			<persName><forename type="first">H</forename><surname>Rabe</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Rheinisches Museum</title>
		<imprint>
			<biblScope unit="volume">67</biblScope>
			<biblScope unit="page" from="321" to="357" />
			<date type="published" when="1912">1912</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b30">
	<analytic>
		<title level="a" type="main">Rhetor. Dionysius of Halicarnassus</title>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">H</forename><surname>Race</surname></persName>
		</author>
		<author>
			<persName><surname>Menander</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Loeb classical library</title>
		<title level="s">Ars Rhetorica. grc eng</title>
		<meeting><address><addrLine>Cambridge (Mass; London</addrLine></address></meeting>
		<imprint>
			<publisher>Harvard University Press</publisher>
			<date type="published" when="2019">2019</date>
			<biblScope unit="volume">539</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<analytic>
		<title level="a" type="main">Exploring Large Language Models for Classical Philology</title>
		<author>
			<persName><forename type="first">F</forename><surname>Riemenschneider</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Frank</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2023.acl-long.846</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics</title>
				<meeting>the 61st Annual Meeting of the Association for Computational Linguistics<address><addrLine>Toronto, Canada</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2023">2023. Visited on 02/10/2024</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="15181" to="15199" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b32">
	<analytic>
		<title level="a" type="main">Rhetors at the Wedding</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">A</forename><surname>Russell</surname></persName>
		</author>
		<idno type="DOI">10.1017/S0068673500004156</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Cambridge Philological Society</title>
				<meeting>the Cambridge Philological Society</meeting>
		<imprint>
			<date type="published" when="1979-07-05">1979. Visited on 05/07/2024</date>
			<biblScope unit="volume">25</biblScope>
			<biblScope unit="page" from="104" to="117" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b33">
	<monogr>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">A</forename><surname>Russell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">G</forename><surname>Wilson</surname></persName>
		</author>
		<title level="m">Menander Rhetor. grc eng</title>
				<meeting><address><addrLine>Oxford</addrLine></address></meeting>
		<imprint>
			<publisher>Clarendon Press</publisher>
			<date type="published" when="1981">1981</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b34">
	<analytic>
		<title level="a" type="main">Classicizing Rhetoric and Criticism: the Pseudo-Dionysian Exetasis and Mistakes in Declamation</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">A</forename><surname>Russell</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Le Classicisme à Rome aux 1 ers siècles avant et après</title>
				<imprint>
			<date type="published" when="1979">1979</date>
			<biblScope unit="volume">25</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b35">
	<monogr>
		<author>
			<persName><forename type="first">L</forename><surname>Sadée</surname></persName>
		</author>
		<title level="m">De Dionysii Halicarnassensis scriptis rhetoricis quaestiones criticae</title>
				<imprint>
			<publisher>Teubner</publisher>
			<date type="published" when="1878">1878</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b36">
	<analytic>
		<title level="a" type="main">Untersuchungen zur Anlage und Entstehung der beiden pseudodionysianischen Traktate περὶ ἐσχηματισμένων</title>
		<author>
			<persName><forename type="first">K</forename><surname>Schöpsdau</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Rheinisches Museum für Philologie</title>
		<imprint>
			<biblScope unit="volume">118</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="83" to="123" />
			<date type="published" when="1975">1975</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b37">
	<monogr>
		<author>
			<persName><forename type="first">H</forename><surname>Schott</surname></persName>
		</author>
		<ptr target="https://books.google.nl/books?id=SiYUAAAAYAAJ" />
		<title level="m">ΤΕΧΝΗ ΡΗΤΟΡΙΚΗ: quae vulgo integra Dionysio Halicarnassensi tribuitur, emendata, nova versione Latina et commentario illustrata</title>
				<imprint>
			<publisher>Sumtibus E.B. Suicquerti</publisher>
			<date type="published" when="1804">1804</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b38">
	<analytic>
		<title level="a" type="main">A Pilot Study for BERT Language Modelling and Morphological Analysis for Ancient and Medieval Greek</title>
		<author>
			<persName><forename type="first">P</forename><surname>Singh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Rutten</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Lefever</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2021.latechclfl-1.15</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 5th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature</title>
				<editor>
			<persName><forename type="first">S</forename><surname>Degaetano-Ortlieb</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Kazantseva</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">N</forename><surname>Reiter</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Szpakowicz</surname></persName>
		</editor>
		<meeting>the 5th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature<address><addrLine>Punta Cana, Dominican Republic</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2021-11">Nov. 2021</date>
			<biblScope unit="page" from="128" to="137" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b39">
	<analytic>
		<title level="a" type="main">Machine Learning for Ancient Languages: A Survey</title>
		<author>
			<persName><forename type="first">T</forename><surname>Sommerschield</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Assael</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Pavlopoulos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Stefanak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Senior</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Dyer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Bodel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Prag</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Androutsopoulos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">De</forename><surname>Freitas</surname></persName>
		</author>
		<idno type="DOI">10.1162/coli_a_00481</idno>
	</analytic>
	<monogr>
		<title level="j">Computational Linguistics</title>
		<idno type="ISSN">0891- 2017</idno>
		<imprint>
			<biblScope unit="volume">49</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="703" to="747" />
			<date type="published" when="2023">Sept. 2023. Visited on 10/20/2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b40">
	<analytic>
		<title level="a" type="main">Machine Learning for Ancient Languages: A Survey</title>
		<author>
			<persName><forename type="first">T</forename><surname>Sommerschield</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Assael</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Pavlopoulos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Stefanak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Senior</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Dyer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Bodel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Prag</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Androutsopoulos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>De Freitas</surname></persName>
		</author>
		<idno type="DOI">10.1162/coli_a_00481</idno>
	</analytic>
	<monogr>
		<title level="m">Computational Linguistics</title>
				<imprint>
			<date type="published" when="2023-09">Sept. 2023</date>
			<biblScope unit="page" from="703" to="747" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b41">
	<monogr>
		<author>
			<persName><forename type="first">L</forename><surname>Spengel</surname></persName>
		</author>
		<title level="m">Rhetores Graeci</title>
				<imprint>
			<publisher>Teubner</publisher>
			<date type="published" when="1885">1885</date>
			<biblScope unit="volume">1</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b42">
	<monogr>
		<title level="m">Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING-2024</title>
				<editor>
			<persName><forename type="first">R</forename><surname>Sprugnoli</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Passarotti</surname></persName>
		</editor>
		<meeting>the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING-2024<address><addrLine>Torino, Italia</addrLine></address></meeting>
		<imprint>
			<publisher>ELRA and ICCL</publisher>
			<date type="published" when="2024-05">May 2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b43">
	<analytic>
		<title level="a" type="main">Overview of the EvaLatin 2022 Evaluation Campaign</title>
		<author>
			<persName><forename type="first">R</forename><surname>Sprugnoli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Passarotti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">M</forename><surname>Cecchini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Fantoli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Moretti</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/2022.lt4hala-1.29" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Second Workshop on Language Technologies for Historical and Ancient Languages</title>
				<editor>
			<persName><forename type="first">R</forename><surname>Sprugnoli</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Passarotti</surname></persName>
		</editor>
		<meeting>the Second Workshop on Language Technologies for Historical and Ancient Languages<address><addrLine>Marseille, France</addrLine></address></meeting>
		<imprint>
			<publisher>European Language Resources Association</publisher>
			<date type="published" when="2022-06">June 2022</date>
			<biblScope unit="page" from="183" to="188" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b44">
	<analytic>
		<title level="a" type="main">Like Two Pis in a Pod: Author Similarity Across Time in the Ancient Greek Corpus</title>
		<author>
			<persName><forename type="first">G</forename><surname>Storey</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Mimno</surname></persName>
		</author>
		<idno type="DOI">10.22148/001c.13680</idno>
	</analytic>
	<monogr>
		<title level="j">Journal of Cultural Analytics</title>
		<idno type="ISSN">2371- 4549</idno>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="issue">2</biblScope>
			<date type="published" when="2020-07-10">July 2020. Visited on 10/20/2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b45">
	<monogr>
		<title level="m" type="main">Axiomatic Attribution for Deep Networks</title>
		<author>
			<persName><forename type="first">M</forename><surname>Sundararajan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Taly</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Yan</surname></persName>
		</author>
		<idno>eprint: 1703.01365</idno>
		<ptr target="https://arxiv.org/abs/1703.01365" />
		<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b46">
	<analytic>
		<title level="a" type="main">Dionysii Halicarnasei quae fertur ars rhetorica rec. Hermannus Usener</title>
		<author>
			<persName><forename type="first">G</forename><surname>Thiele</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Göttingische Gelehrte Anzeigen</title>
		<imprint>
			<biblScope unit="volume">159</biblScope>
			<biblScope unit="page" from="237" to="243" />
			<date type="published" when="1897">1897</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b47">
	<monogr>
		<title level="m" type="main">On the State of the Art in Authorship Attribution and Authorship Verification</title>
		<author>
			<persName><forename type="first">J</forename><surname>Tyo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Dhingra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><forename type="middle">C</forename><surname>Lipton</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2209.06869</idno>
		<ptr target="https://arxiv.org/abs/2209.06869" />
		<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b48">
	<monogr>
		<title level="m" type="main">Dionysii Halicarnasei quae fertur Ars Rhetorica</title>
		<author>
			<persName><forename type="first">H</forename><surname>Usener</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1895">1895</date>
			<publisher>Teubner</publisher>
			<pubPlace>Leipzig</pubPlace>
		</imprint>
	</monogr>
	<note>Latin</note>
</biblStruct>

<biblStruct xml:id="b49">
	<analytic>
	</analytic>
	<monogr>
		<title level="m">Dionysii Halicarnasei quae exstant</title>
		<title level="s">Opuscula, volumen secundum. grc</title>
		<editor>
			<persName><forename type="first">H</forename><surname>Usener</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><surname>Radermacher</surname></persName>
		</editor>
		<meeting><address><addrLine>Stuttgart-Leipzig</addrLine></address></meeting>
		<imprint>
			<publisher>Teubner</publisher>
			<date type="published" when="1929">1929</date>
			<biblScope unit="volume">6</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b50">
	<monogr>
		<title level="m" type="main">Attention Is All You Need</title>
		<author>
			<persName><forename type="first">A</forename><surname>Vaswani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Shazeer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Parmar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Uszkoreit</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Jones</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">N</forename><surname>Gomez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Kaiser</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Polosukhin</surname></persName>
		</author>
		<ptr target="https://arxiv.org/abs/1706.03762" />
		<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b51">
	<monogr>
		<title level="m" type="main">Rhetores Graeci</title>
		<author>
			<persName><forename type="first">C</forename><surname>Walz</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1834">1834</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b52">
	<monogr>
		<author>
			<persName><forename type="first">K</forename><surname>Weismann</surname></persName>
		</author>
		<ptr target="https://books.google.nl/books?id=5XJSAAAAcAAJ" />
		<title level="m">De Dionysii Halicarnassei vita et scriptis: Diss. inaug. Steuber</title>
				<imprint>
			<date type="published" when="1837">1837</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b53">
	<monogr>
		<title level="m" type="main">BERT in Plutarch&apos;s Shadows</title>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">P</forename><surname>Yamshchikov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Tikhonov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Pantis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Schubert</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Jost</surname></persName>
		</author>
		<ptr target="http://arxiv.org/abs/2211.05673" />
		<imprint>
			<date type="published" when="2022-11-12">Nov. 2022. on 12/29/2022</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
