Fine-Tuning Pre-Trained Language Models for Authorship Attribution of the Pseudo-Dionysian Ars Rhetorica

Fine-Tuning Pre-Trained Language Models for Authorship Attribution of the Pseudo-Dionysian Ars Rhetorica GlebSchmidt gleb.schmidt@ru.nl Radboud University Nijmegen

Erasmusplein 1 6525 HT Nijmegen The Netherlands

VeronicaVybornaya Independent scholar

St. Petersburg Russia

IvanPYamshchikov ivan.yamshchikov@thws.de CAIRO THWS Technische Hochschule Würzburg-Schweinfurt

Franz-Horn Str. 2 97082 Würzburg Germany

Fine-Tuning Pre-Trained Language Models for Authorship Attribution of the Pseudo-Dionysian Ars Rhetorica 1613-0073 BC93B922837BE9EB3A95187A06DF6862 GROBID - A machine learning software for extracting information from scholarly documents pre-trained language models authorship attribution authorship analysis historical languages transfer learning ancient greek (roman period) Ps.-Dionysius's Ars Rhetorica BERT RoBERTa

This paper explores the use of pre-trained language models for Ancient Greek in the context of authorship attribution. The study adopts a two-step approach: first, the models are fine-tuned on a domainspecific corpus using a masked language modeling (MLM) objective; second, based on the fine-tuned model, a classifier is trained to address the authorship attribution task. The analysis centers on a corpus of texts on rhetorical theory from the Second Sophistic period, with particular focus on the Pseudo-Dionysian Ars Rhetorica. The results of the experiment suggest that this approach offers valuable insights into the authorship of ancient texts. Notably, the findings align with some traditional scholarly views on the Ars Rhetorica while also opening the door to reconsidering long-discarded hypotheses about the treatise's internal structure. This study highlights how the integration of natural language processing and classical philology can significantly advance discussions in ancient literary scholarship.

Introduction

Over the past several years, the application of transformer-based neural networks [51] has led to significant advancements in many NLP tasks related to historical languages [44,32,43]. However, unlike in the case of modern languages, where fine-tuning pre-trained transformers for linguistic forensics is very common [14,48,19,1], the application of such models for authorship attribution tasks in historical languages remains relatively underexplored, although some excellent seminal studies and surveys have been recently published [45,15,40,41]. The availability of state-of-the-art pre-trained language models [2,32,39,54] excelling in multiple downstream tasks suggests that the situation with authorship analysis can be different as well.

Yamshchikov, Tikhonov, Pantis, Schubert, and Jost [54] obtained a pre-trained model for Ancient Greek by fine-tuning a Modern Greek BERT model [23]. The resulting model subsequently served as the backbone for a classifier and proved effective for authorship attribution of the so-called Pseudo-Plutarch corpus. Interestingly, despite being fine-tuned on a limited amount of Ancient Greek data, the model obtained through transfer learning showed results comparable to those of the models trained from scratch on significantly larger corpora, as reported by Singh, Rutten, and Lefever [39] and Riemenschneider and Frank [32]. Drawing inspiration from Yamshchikov, Tikhonov, Pantis, Schubert, and Jost [54], this study experiments with a similar approach focusing on the works of late Greek rhetoricians.

Greek prose on rhetorical theory from the period known as the Second Sophistic serves as a crucial source, documenting the cultural and intellectual framework of Greek thought and literature in the first centuries AD [9,6,20,5]. However, the study of this extensive corpus of texts, collectively referred to under the broad concept of Rhetores Graeci [52,42], is significantly complicated by endless controversies surrounding authorship, dating, and contextual factors [21].

In this paper, we explore the potential of a transformer-based models, fine-tuned for sequence classification task, to provide further insights into the debate.

The focal point of our study is the text conventionally referred to as the Ars Rhetorica (Art of Rhetoric, hereafter ars). This work has long been attributed to, and frequently published under the name of, the rhetorician and historian Dionysius of Halicarnassus (ca. 60-7 BC). However, Sadée [36], followed by Usener [49] and Usener and Radermacher [50], demonstrated that the ars most likely circulated anonymously, with its association to Dionysius emerging from a much later conjecture. This conjecture appears to have been based on an overinterpretation of a scholion (a marginal commentary) on chapter 10 of the text.

Ars Rhetorica

Several aspects of the ars must be discussed in the context of statistical modelling of its writing style.

Not one, but multiple works

The text has a complex structure. In Parisinus Graecus 1741 [30], the only manuscript that preserves all the material associated with the ars (ff. , the text is divided into 11 chapters. However, these chapters do not constitute a homogeneous work, as the text is generally understood to consist of two [18], three [49,50,35,33], or even four [38] distinct parts.

The first part, covering ch. 1-7, provides concise instruction on ceremonial (epideictic) oratory, addressing seven epideictic genres. These chapters are connected by cross-references and recurring addresses to the author's former pupil, Echecrates, to whom the text is presented as a wedding gift. The remainder of the text, ch. 8-11, may be interpreted as a combination of two or three distinct works on separate topics. Ch. 8-9 explore the so-called "figured speeches", i.e., speeches intended to convey a hidden meaning that may conflict with the literal content and stated purpose of the speech, while ch. 10-11 focus on the criticism of declamation.

Authorship

Ch. 1-7 exhibit a consistent compositional pattern and a recognizable writing style, suggesting they were authored by the same rhetorician. However, whether these chapters form a coherent and complete treatise is a matter of debate. This portion of the ars has been interpreted as a collection of distinct letters or essays [38,18], as remnants or excerpts from a much longer work [4,49,50], or as a unified treatise [53,47,22]. For ch. 8-11, the situation is even more ambiguous. Usener [49] speculated that ch. 8-9 were written by two different disciples attending separate lectures of the same teacher. Penndorf [28] and Schöpsdau [37] rejected the idea that ch. 8-9 had a single author, suggesting instead that these texts drew from various sources. Similarly, ch. 10-11 have been attributed either to the same author as ch. 8-9 (with Heath [18] tentatively identifying him as Sarapion Aelius, a 2nd-century Alexandrian rhetorician whose entire corpus is lost) or to two different authors unrelated to the rest of the ars. Table 1 summarizes the content and authorship hypotheses for the various sections of the ars.

Ars Rhetorica, Menanderian Corpus, and Pseudo-Hermogenes' On Method

Since the early days of scholarship on the Ars Rhetorica, it has been noted that the rhetorical instruction provided in ch. 1-7 and ch. 8-11 shows a clear methodological afÏnity with, respectively, the treatises ascribed to Menander Rhetor (particularly the second one) and Pseudo-Hermogenes' On Method. The parallels with the second treatise attributed to Menander are especially noteworthy. In both works:

• the occasion -rather than the subject, as in the first treatise attributed to Menanderdetermines the genre; • a very similar selection of genres is discussed (of the seven genres mentioned in the ars, only two are absent from Menander's purported work; see Table 7); • the author addresses a former disciple throughout the text. This afÏnity led Heath [18] to describe the ars as "comparable to, though less sophisticated than" Menander's work.

The numerous parallels between ch. 8-11 of the ars and Pseudo-Hermogenes' On Method [29,18] have led scholars to hypothesize either a shared source [29] or a closer, albeit indirect, connection [35,18].

Dates

The dates of the texts constituting the ars have been assessed differently. For ch. 1-7, a mention of the 2nd-century sophist Nicostratus (ch. 2, par. 9, p. 266, l. 14), along with the considerable focus on speeches addressing Roman magistrates, suggests a composition date no earlier than the High Empire [35,22]. Race [31] posits that the first part of the ars is roughly contemporary with the corpus attributed to Menander Rhetor, which is datable to the late 3rd century AD. In contrast, ch. 8-11 may be a century earlier [18], i.e., 2nd century AD.

Aims

The hypotheses concerning the authorship of different parts of the ars have multiplied, as have suggestions regarding its potential relationship with other texts. However, the evidence presented in the scholarship so far is drawn almost exclusively from close reading and remains inconclusive. Additionally, unlike the case with ch. 8-11, no efforts have been made to identify the author responsible for ch. 1-7.

The aim of our investigation, therefore, is to apply modern natural language processing techniques to this rich textual material in order to gather new evidence about the structure of the ars and gain further insights into its authorship. The arguments formulated through language modeling could provide a novel and valuable contribution to the debate, particularly when considered alongside the accumulated philological evidence and existing codicological indications.

The main contributions of this work can be summarized as follows:

• We further fine-tune two pre-trained models for Ancient Greek and one model for Modern Greek on a corpus of Greek rhetoricians. We subsequently use the resulting models to train "open set" 10-class classifiers capable of attributing short fragments of text to different authors of the Second Sophistic period; • Analyzing in more details the results provided by two best-performing models, we shed light on the history of the Pseudo-Dionysian ars, suggesting that:

-Ch. 1-7 of the ars could have been authored by an individual from the same school as the author(s) of the Menandrian treatises; -Ch. 8-11 not only differ in authorship from ch. 1-7, but may have been written by two distinct individuals, one responsible for ch. 8-9 and another for ch. 10-11.

Corpus

The primary focus of our study is a corpus comprising at least 18 rhetores Graeci from the 1st-4th centuries AD and Dionysius of Halicarnassus. We only retained the authors whose teachings are relatively well-preserved, excluding those known only through fragmentary or indirect evidence. Importantly, we focus exclusively on rhetorical theory, i.e., works with a theoretical or pedagogical intent. A significant limitation of this dataset is that many of the rhetorical corpora within it have notorious attribution problems of their own. In particular, there is a compelling case for the heterogeneity of the Hermogenean corpus (see Section 6). Similarly, the question of whether both treatises attributed to Menander were authored by the same person remains unresolved [34,17,31,8]. Other corpora raise similar questions, too [21]. Being aware of this and currently working on a follow-up authorship verification study of these corpora (the importance of which was also insightfully emphasized by the reviewers of this work), for simplicity, we continue to group the studied texts by authorship as categorized in the Thesaurus Linguae Graecae (TLG), where our dataset stems from. 1 We deem this simplification legitimate. In most cases, these questionable attributions are rooted in long-standing traditions that date back to the early stages of textual transmission. For example, the Hermogenean corpus has been consistently attributed to Hermogenes of Tarsus since as early as the 5th century (for more details see Section 6). Therefore, with all necessary reservations, these conventional groupings can be considered to represent at least some kind of connection. Even if they do not link texts written by the same individual, they may still group works originating from the same school. After all, this is why such simplification is commonly used in scholarship.

<UNK> category

The literature on oratory theory was undoubtedly much richer than what has been preserved. To account for this, we created an "open set" scenario. For this purpose, we set aside 9 smaller authorial corpora -those with a number of sentences below the dataset's median value of 517 (marked with * in Table 2). These texts were excluded from the dataset before our conventional 80/10/10 split and later added to the the test set. The idea is straightforward. If at the test stage the model encounters a text that does not belong to any of the authorial classes learned during the training, it is likely that the calibrated probability associated with the top prediction will be relatively low. If it falls below a certain threshold, the model is programmed to abstain from making a decision and assign an <UNK> label to the text in question. The samples with <UNK> label in the test split are necessary to monitor the model's capability to do.

An overview of the classification dataset is presented in the Table 2.

Methodology

Base Transformers

To train our classifiers, we used three different pre-trained transformers as starting points:

(1) RoBERTa-sized GreBerta presented by Riemenschneider and Frank [32],

(2) Ancient Greek BERT trained by Singh, Rutten, and Lefever [39] (3) Modern Greek BERT published by Koutsikakis, Chalkidis, Malakasiotis, and Androutsopoulos [23].

Masked Language Modeling Fine-Tuning

Fine-tuning pre-trained models on domain-specific corpora prior to further tuning them for a downstream task at hand is a common practice in NLP. It allows the model to adapt better to the unique linguistic features of the target domain. This intermediate step may enhance the model's ability to capture specific syntactical patterns and vocabulary, which in turn improves the performance on the final downstream task, such as classification. For this reason, before training classifiers for authorship attribution, we ran training with a masked language modeling objective. BERT-sized models were trained for 3 epochs with a learning rate 1 × 10 −5 and warmup during the first 10% of training steps. RoBERTa-based model was trained for 1 epoch only with a learning rate 1 × 10 −4 and without warmup steps. In both scenarios, the learning rate was decreasing linearly.

Sequence Classification

Authorship classifiers were trained on both out-of-the-box models and their MLM-fine-tuned versions. We employed a sliding window technique to segment the texts into chunks. The process was as follows:

1. Tokenization: We used the bowphs/GreBerta tokenizer to convert the entire corpus into tokens.2 2. Chunking: The tokenized corpus was then divided into chunks of 64 tokens, respecting the boundaries of works (and even chapters -in the case of the ars); 3. Overlap: To ensure continuity and capture context that might span chunk boundaries, we implemented an overlap between chunks. Each chunk overlapped with its adjacent chunks by 32 tokens (half of the chunk length). 4. Decoding: Finally, we decoded these token chunks back into text, resulting in our training data segments. By using a single tokenizer to chunk the entire corpus beforehand instead of splitting the texts with a tokenizer of the corresponding model, we ensured that all models were trained on the same segments of text.

The training was carried out for 700 steps by sampling batches containing 4 chunks per authorial class. Validation set was checked each 350 steps, i.e., twice during the training. Test set including <UNK>-labelled samples was checked upon the end of training. We report the results obtained on the test set.

Results and Discussion

General Performance

Table 3 summarizes the overall performance of the classifiers. Notably, additional MLM training proved beneficial only for the RoBERTa-sized bowphs/GreBerta model. For BERT-sized models, however, the inclusion of new data was detrimental. bowphs/GreBerta appears to be more stable, behaving more like general-purpose language models trained for well-resourced modern languages. This stands to reason: out of the three models with which we experimented, bowphs/GreBerta [32] is the largest and was trained on the riches and highest-quality Ancient Greek corpus.

Authorship Attribution of the ars

The aim of this study was to get some fresh evidence about the authorship of the pseudo-Dionysian ars, a precious witness to the development of rhetorical theory during the High to Late Roman empire. Based on the status quaestionis surveyed in the section 2, we set up 3 research questions:

Table 3

Performance metrics on the test split with the <UNK> category (not represented in the training data). The models were configured to assign <UNK> to samples with a calibrated top probability below 80%. (R) denotes models fine-tuned with an MLM objective on the same data that was used to train the classifiers.

Model

F1 Score Accuracy 1. Can we further comfort or challenge the existing consensus opinion, according to which the attribution to Dionysius of Halicarnassus is incorrect? 2. How many works are discernible in the ars in the form we know it? 3. Can the model convincingly suggest an alternative attribution for the ars or any of its parts?

To address these questions, we applied the trained classifier to individual chapters of the ars, split into chunks following the described procedure. Table 4 summarizes the predictions made by the best-performing BERT-sized and RoBERTa-sized models. 3 For each chapter, we report the "majority vote" (i.e., the number of chunks in the chapter attributed to a given author), the author's "share" (i.e., the proportion of chunks assigned to that author in the total number of chapter chunks), and the mean probability of the author across the chunks of the chapter. In the "majority vote", the attribution is defined by the top probability even if it falls below 80%.

No trace of Dionysius of Halicarnassus

In line with previous scholarship, although the name of Dionysius of Halicarnassus appears among the attributions, its weight is insignificant. Therefore, with regard to the first of the research questions, the evidence is overwhelming: stylistic afÏnity with Dionysius of Halicarnassus's writings is scarce, and the attribution to him cannot be supported by any of the two models.

ars's association to the Menandrean corpus further strengthened

Apart from this rather predictable conclusion, our classifiers yield new insights into more complicated questions concerning the inner structure of the ars and the authorship of the texts, which constitute it. As clear from the Table 4, the attribution profiles for ch. 1-7 and 8-11 are drastically different. Even when the probability is not high enough, Menander Rhetor is the top-ranked candidate in ch. 1-7. The signal is less clear in ch. 8-11. This difference goes in line with the communis opinio that the work is composite: a nearly identical attribution profile of ch. 1-7 being yet another argument in favour of its unity.

What does the model learn?

For the sake of explainability, DH specialists still widely use the bag-of-words model and corpus-specific manual feature engineering for various tasks involving writing style analysis, such as authorship attribution, authorship and self-authorship verification, clustering, etc. [13,12,24,3]. Since deep learning methods lack this level of transparency, understanding exactly what our classifier learned is crucial. A thorough investigation of this matter will be the subject of a separate study, using explainable AI techniques such as integrated gradients and token attribution. Here, we limit our discussion to one insightful example, which seems to illustrate how the model works.

As previously mentioned in Section 2, all the genres addressed in ch. 2-5 are also discussed in the second treatise attributed to Menander Rhetor. Only the most prestigious of the epideictic genres, the panegyric -focused on in ch. 1 and 7 -does not correspond to any section in Menander's works. However, ch. 1, which provides introductory notes on panegyrics, often echoes the examples and some wording of the first treatise by Menander. Ch. 1 offers guidelines on how to appropriately praise gods ("leaders and name-givers of any festival"), cities where the festivals take place, and emperors who organize and preside over the festivals. All these topics are covered in Menander's first treatise.

Considering only ch. 2-5 or the fragments of ch. 1 that have clear parallels in Menander's work, one might argue that the classifier's decision was biased due to the significant content and semantic overlap, especially since such a tendency has been reported about the BERT-based classifiers [7]. However, the consistency of the attribution profile across the chapters by both models is reassuring, as it suggests that they capture more than just semantics.

Menandrean association appears all the stronger when the values for the logical subdivisions of the ars, ch. 1-7, are calculated. As Korenjak [22] has shown, in its current form, the order of the chapters is disorganized, and it is possible that the author intended to arrange them as follows: chapters 1 and 7 (panegyrics or appraisal speeches), chapters 2-4 (speeches related to

Table 7

Content overlap between the ars and the second treatise attributed to Menander. Chapter division for Menander's treatise follows Race [31].

Menander Rhetor

Treatise II ars 5, 6 2, 4 7 3 9 5 8, 10, 15 6 family life occasions), and chapters 5-6 (speeches addressed to ofÏcials and epitaphs). In each of these sections, Menander maintains a stable leadership (Tables 5 and 6).

ars, ch. 8-11: multiple authors? The discrepancy between the attribution profiles of ch. 8-9 and ch. 10-11 might suggest a division, albeit a less distinct one, than ch. 1-7 versus ch. 8-11. This result aligns with the assessment made by Usener [49], although it does not provide any further hint at the identity of the possible author. However, the opposite hypothesis should still be considered seriously. In ch. 1-7, top two single attributions (i.e., Menander Rhetor and another author) in terms of "share" would cover at least 0.58-0.68 of the attributed chunks (ch. 7). In contrast, the top two attributions in ch. 8-11 provide, at best, 0.58-0.62 of the attributions (ch. 11 and 10), the attributions are more evenly distributed. Apparently, among the author classes present in our dataset, none is stylistically similar enough to the text of ch.8-11. This can be explained in two ways. Texts written in a comparable style are either completely absent from the dataset or are not appropriately distributed among author groups, making it challenging for the model to learn the features of this particular writing style. Keeping in mind the existing hypothesis about the relationship between the so-called Hermogenean canon and the works ascribed to Apsines, with extreme caution, we incline to the latter explanation.

Two works, which are part of the Hermogenean canon, On Invention and On Method, were already in Late Antiquity associated with the name of Hermogenes. In our dataset, therefore, following the TLG, we reproduce this conventional attribution. Yet, both are most likely inauthentic [10,11,25,26,27]. If the argumentation presented by Heath [16,17] proves correct and these two texts can securely be ascribed to Apsines, the "new" writing style they would represent might possibly demonstrate a more pronounced afÏnity with the style of ch. 8-11. This and similar possibilities should be thoroughly checked in further experiments.

The scope of the much-needed detailed follow-up study becomes evident. A systematic and critical reassessment of attribution problems within the corpus of the Rhetores Graeci is necessary. Beyond merely reflecting on the attributions of individual works, it is important to establish the homogeneity of different rhetorical corpora within the framework of a pairwise authorship verification study.

But if we set aside the obscure case of ch. 8-11, should we conclude that ch. 1-7 were written by Menander Rhetor? Given the aforementioned limitations of our dataset, we would not go that far. However, our results suggest that the connection between the first part of the Pseudo-Dionysian ars and the Menandrean corpus likely extends beyond a theoretical afÏnity. Despite the obvious terminological discrepancies between the texts and their different levels of elaboration, the possibility of multiple authorship within the same school, or even common authorship, should be considered with all seriousness. The divergence between the ars, ch. 1-7, and the Menandrean corpus can also be explained, apart from the natural evolution of personal style and preferences, by the likelihood that those presenting complex rhetorical theory would probably follow the advice formulated by the author of ch. 11. The art of rhetoric involves presenting material in a way that convinces the audience. Thus, orators are similar to doctors who must not only select the right medication but also administer it in a manner acceptable to the patient [50, ch. 11, par. 9, p. 385, ll. [7][8][9][10][11][12]. In other words, multiple contextual factors influenced the style of the presentation, and, in the cases when the stylistic afÏnity is clear, one should not probably overinterpret isolated differences.

Conclusion

This study uses transformer-based models to analyze ancient rhetorical texts for authorship attribution in classical philology. First, we adapted these models to handle the linguistic nuances of Ancient Greek texts from the 1st to the 4th century AD using masked language modeling. We then apply the fine-tuned models to identify authorship markers in Ars Rhetorica, a text possibly written by multiple ancient writers. This application not only reminds of benefits of modern AI techniques to classical studies but also deepens our understanding of ancient literary compositions through modern computational methods.

The results of BERT and RoBERTa classifiers do not support connection of the ars to Dionysius of Halicarnassus, going in line with the previous studies that question his authorship. They also strengthen the link of ars to the Menandrean corpus, particularly evident in the distinct attribution profiles between chapters 1-7 and 8-11, which suggests a composite nature of the work.

Despite the lack of transparency of MLM techniques compared to conventional methods, which prioritize human-interpretable features, the effectiveness and relevance of machine learning methods is noteworthy.

While neural networks are often criticized in digital humanities for their black-box nature [12], their ability to detect writing styles make them a valuable tool in the field of digital humanities. The use of these models promises significant advancements in authorship attribution and our understanding of ancient literary works.

Limitations

This study has several limitations that should be considered when interpreting the results.

Firstly, the issue of disputed authorship within the dataset is a significant challenge. For instance, the Hermogenean corpus and Menandrean treatises, both central to our analysis, have long-standing debates regarding their true authorship, see Section 4. These uncertainties could affect the attribution accuracy. We are currently working on a study intended to solve this issue, adopting an authorship verification approach.

Secondly, the use of transformer-based models like BERT and RoBERTa, come with limitations related to their opaque nature. The lack of interpretability in these models means that understanding the specific features and patterns the models use to make attributions is challenging. This limits our ability to provide a transparent rationale for the models' decisions, which is often critical in digital humanities research. Yet, the attempts were made to find way to make the results of pre-trained language models more interpretable, e.g., by means of the so-called integrated gradients [46]. These methods can perhaps be adapted for cases similar to ours.

Despite achieving notable accuracies with relatively short chunks (64 tokens), the models' performance still leaves room for improvement, particularly in terms of handling unbalanced corpora and downplaying the influence of the thematic clues. Nevertheless, their performance, comparable to state-of-the-art results for modern languages, demonstrates an ability to capture writing style. There clearly are instances where the models are overly confident, leading to incorrect authorship attribution. These errors could arise from factors such as the models' sensitivity to stylistic nuances and the complexity of the texts. Embracing more sophisticated methodologies for uncertainty-aware training would be an interesting avenue for further exploration.

Another potential avenue for future research is the development of chronological and regional classifiers. Texts from different regions and periods may exhibit unique linguistic and stylistic features that are not captured by a generalized model. Developing classifiers specific to historical periods or geographical (and cultural) areas could enhance attribution accuracy and offer more detailed insights into the ars and many other texts.

Table 11Themes addressed in the Ars Rhetorica and alleged authorship of its different parts. Each Roman number stands for one author. II-III means that the section might have been written by two different persons.arsThemeAuthor1Panegyrics2Marriage speeches3Birthday speeches4EpithalamiumI5Addresses6Funeral speeches7Exhortations to athletes8 9"Figured speeches"I or II or II-III10 Criticism of declamations I or II or IV or IV-V 11

Table 22Classification dataset. Texts by authors marked with * were grouped under the <UNK> label. This label is present only in the test data to evaluate the model's ability to deal with uncertainty in an "open set" scenario.NameTLGDateLocationAelius Aristides284II ADMysiaAelius Herodianus & Pseudo-Herodianus87II ADAlexandriaAelius Theon*607I-II ADAlexandriaAlciphron640II-III ADUnknownAlexander*5941st half II ADUnknownAnonymus Seguerianus*2002 1st half III ADUnknownCassius Longinus*2178mid III ADAthensDemetrius613I ADUnknownDionysius Halicarnasseus81I BCHalicarnassusEudemus1376II ADArgosHermogenes592II-III ADTarsusLesbonax*649II ADMiletusLonginus*560I ADUnknownMarcus Cornelius Fronto*186II ADNumidiaMenander2586III-IV ADLaodiceaMinucianus Junior*2903III ADAthensPolyaenus616II ADMacedoniaPolybius*605II ADSardisValerius Apsines2027III ADAthens

Table 4 "4Majority vote", share, and mean prediction probability for each chapter of the ars: Ancient Greek BERT vs GreBerta (R). "Rest" stands for the sum of all minor attributions. Sorted by the mean prediction probability.pranaydeeps/Ancient-Greek-BERTbowphs/GreBerta (R)Ch.AuthorVoteShareProb.AuthorCountShareProb.Menander320.7066.59Menander390.8579.211Aelius Aristides Dionysius H.5 30.11 0.0712.51 6.80Aelius Aristides Dionysius H.3 30.07 0.077.39 6.57Rest60.1314.10Rest10.026.83Menander300.5957.46Menander360.7168.802Dionysius H. Aelius Aristides9 90.18 0.1815.20 14.48Aelius Aristides Dionysius H.5 60.10 0.129.32 9.23Rest30.0612.87Rest40.0812.66Menander250.9694.13Menander250.9689.753Dionysius H. Hermogenes1 00.04 0.002.05 1.43Hermogenes Dionysius H.1 00.04 0.004.68 2.10Rest00.002.40Rest00.003.47Menander150.7969.45Menander160.8479.774Hermogenes Aelius Aristides3 10.16 0.0511.68 6.42Aelius Aristides Demetrius2 10.11 0.058.99 5.43Rest00.0012.45Rest00.005.81Menander210.4948.00Menander280.6560.735Aelius Aristides Dionysius H.12 30.28 0.0725.16 7.34Aelius Aristides Hermogenes11 20.26 0.0521.61 6.29Rest70.1619.50Rest20.0511.37Menander340.6152.98Menander340.6156.716Hermogenes Dionysius H.9 50.16 0.0915.58 8.78Valerius Apsines Dionysius H.6 70.11 0.1212.06 11.11Rest80.1422.66Rest90.1620.12Menander310.3937.87Menander420.5348.347Aelius Aristides Dionysius H.15 120.19 0.1518.39 15.14Aelius Aristides Valerius Apsines12 80.15 0.1014.65 10.85Rest210.2728.60Rest170.2226.15Hermogenes490.2121.63Hermogenes590.2625.458Valerius Apsines Aelius Aristides41 450.18 0.1917.19 16.92Aelius Aristides Dionysius H.58 490.25 0.2121.86 19.96Rest960.4244.26Rest650.2832.73Hermogenes600.2017.95Aelius Aristides780.2623.269Demetrius Aelius Aristides45 490.15 0.1615.37 14.55Hermogenes Dionysius H.51 450.17 0.1517.99 15.03Rest1450.4852.14Rest1250.4243.72Hermogenes430.3430.00Hermogenes520.4238.2110Dionysius H. Valerius Apsines31 170.25 0.1421.65 14.49Dionysius H. Valerius Apsines25 160.20 0.1220.63 11.48Rest340.2733.86Rest320.2629.68Hermogenes410.3731.65Hermogenes340.3028.3911Menander Dionysius H.23 140.21 0.1220.22 13.48Menander Dionysius H.23 210.21 0.1919.30 18.81Rest340.3034.64Rest340.3033.51

Table 5 "5Majority vote", share, and mean prediction probability for logical subdivisions within the ars, ch. 1-7:GreBerta (R).VoteShare (%)Probability1 & 7 2-4 5 & 6 1 & 7 2-4 5 & 6 1 & 72-4 5 & 6Menander8177620.68 0.830.65 59.70 76.64 58.45Aelius Aristides157130.12 0.080.14 11.977.09 11.33Hermogenes7270.06 0.020.077.873.779.01Dionysius Halicarnassensis9670.08 0.060.077.65.77.06Valerius Apsines8170.07 0.010.076.401.88.85

Table 6 "6Majority vote", share, and mean prediction probability for logical subdivisions within the ars, ch. 1-7: Ancient GreekBERT. VoteShare (%)Probability1 & 7 2-4 5 & 6 1 & 7 2-4 5 & 6 1 & 72-4 5 & 6Menander6370550.52 0.740.59 48.44 69.76 50.82Aelius Aristides2010150.17 0.110.16 16.239.22 15.89Dionysius Halicarnassensis151080.12 0.110.09 12.079.477.87Hermogenes123110.10 0.030.129.464.18 11.36Valerius Apsines10240.08 0.020.047.813.136.49

We cannot publish the full texts with all the corresponding metadata. However, the shufÒed chunks used in MLM fine-tuning and subsequent classifier training are made available on GitHub: https://github.com/glsch/rhetores_ graeci. We did not repeat the experiment producing chunks with other available tokenizers. pranaydeeps/Ancient-Greek-BERT and bowphs/GreBerta (R)

Acknowledgments

We extend their gratitude to Jürgen Jost, Charlotte Schubert, Friedrich Meissner, Caroline Macé, and Mark de Kreij for welcoming this study and future collaboration between machine learning, history, and philology.

We would also like to thank Ben Nagy and two anonymous reviewers for the careful reading and insightful feedback.

We thank Shari Boodts and Sven Meeder, Principal Investigators of the ERC Proof of Concept project "ManuscriptAI" and the ERC Consolidator project "SOLEMNE". Without their support, this research would not have been possible.

A. Online Resources

The code and both models considered in detail in this study are accessible at:

• https://huggingface.co/glsch • https://github.com/glsch/rhetores_graeci

Whodunit? Learning to Contrast for Authorship Attribution BAi YWang YTan STan 10.48550/ARXIV.2209.11887 2022. Visited on 01/11/2024 Latin BERT: A Contextual Language Model for Classical Philology DBamman PJBurns 2020 The Elementary Particles: A Computational Stylometric Inquiry into the Mediaeval Greek-Latin Aristotle PBeullens WHaverals BNagy 10.21071/mijtk.v9i.16723 Mediterranea. International Journal on the Transfer of Knowledge 2445-2378 9 Apr. 2024. Visited on 05/06/2024 FBlass De Dionysii Halicarnassensis scriptis rhetoricis

Bonn

Max Cohen et fil 1863 Paideia: the World of the Second Sophistic BEBorg 2008 de Gruyter Greek sophists in the Roman Empire GWBowersock 1969 Clarendon Press Oxford Rethinking the Authorship Verification Experimental Setups FBrad AManolache EBurceanu ABarbalau RIonescu MPopescu 2022 KBrodersen Menandros. Abhandlungen zur Rhetorik. ger grc

Stuttgart

Anton Hiersemann 2019 88 Bibliothek der griechischen Literatur TCBurgess Epideictic literature 1902 3 University of Michigan Library Ist die dem Hermogenes zugeschriebene Schrift Περὶ μεθόδου δεινότητος echt? I EBürgi Wiener Studien 48 1930 Ist die dem Hermogenes zugeschriebene Schrift Περὶ μεθόδου δεινότητος echt? II EBürgi Wiener Studien 49 1931 Twenty-One* Pseudo-Chrysostoms and more: Authorship Verification in the Patristic World TClérice AGlaise Proceedings of the Computational Humanities Research Conference the Computational Humanities Research Conference 2023. 2022. Dec. 2023 Computational Humanities Research Conference MedLatinEpi and MedLatinLit: Two Datasets for the Computational Authorship Analysis of Medieval Latin Texts SCorbara AMoreo FSebastiani MTavoni Sept. 2021. 02/05/2024 BertAA : BERT Fine-Tuning for Authorship Attribution MFabien EVillatoro-Tello PMotlicek SParida Proceedings of the 17th International Conference on Natural Language Processing (ICON) PBhattacharyya DMSharma RSangal the 17th International Conference on Natural Language Processing (ICON)

Patna, India

NLP Association of India (NLPAI Dec. 2020 Indian Institute of Technology Patna Machine Learning and the Future of Philology: A Case Study BGraziosi JHaubold CCowen-Breen CBrooks 10.1353/apa.2023.a901022 TAPA 2575-7199 153 1 Mar. 2023. Visited on 10/20/2024 Hermogenes' Biographers MHeath Eranos 96 1998 Menander: a Rhetor in Context MHeath 2004 Oxford University Press USA Pseudo-Dionysius Art of Rhetoric 8-11: Figured speech, Declamation, and Criticism MHeath American Journal of Philology 124 1 2003 PART: Pre-Trained Authorship Representation Transformer JHuertas-Tato AHuertas-Garcia AMartin DCamacho Sept. 2022. on 01/11/2024 Greek Rhetoric under Christian Emperors GAKennedy 2008 Wipf and Stock Publishers 3 Some Recent Controversies in the Study of Later Greek Rhetoric GAKennedy American Journal of Philology 124 2 2003 Ps.-Dionysius Ars RhetoricaI-VII: One Complete Treatise MKorenjak Harvard Studies in Classical Philology 105 2010 Greek-BERT: The Greeks Visiting Sesame Street JKoutsikakis IChalkidis PMalakasiotis IAndroutsopoulos 10.1145/3411408.3411440 11th Hellenic Conference on Artificial Intelligence. SETN 2020

Athens, Greece

Association for Computing Machinery 2020 Authorship Analysis and the Ending of Seven Against Thebes: Aeschylus' Antigone or Updating Adaptation? NManousakis EStamatatos 10.1353/clw.2023.0007 Classical World 1558-9234 116 3 Mar. 2023. Visited on 02/01/2024 Le De Inventione du Pseudo-Hermogène MPatillon 10.1515/9783110815146-003 Aufstieg und Niedergang der römischen Welt Einzelne Autoren seit der hadrianischen Zeit und Allgemeines zur Literatur

Berlin; Boston

De Gruyter 1997 34 Teilband Sprache und Literatur Pseudo-Hermogène, L'Invention MPatillon Corpus rhetoricum Synopse des exordes. grc fre

Paris

Les Belles lettres 2012 3 1 Anonyme Pseudo-Hermogène, La méthode de l'habilité MPatillon Anonyme, Méthodes des discours d'adresse grc fre

Paris

Les Belles lettres 2014 5 Corpus rhetoricum De sermone figurato quaestio rhetorica JPenndorf Leipziger Studien zur classischen Philologie 20 1902 Hermogenis Opera HRabe 1985 Teubner Rhetoren-Corpora HRabe Rheinisches Museum 67 1912 Rhetor. Dionysius of Halicarnassus WHRace Menander Loeb classical library Ars Rhetorica. grc eng

Cambridge (Mass; London

Harvard University Press 2019 539 Exploring Large Language Models for Classical Philology FRiemenschneider AFrank 10.18653/v1/2023.acl-long.846 Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics the 61st Annual Meeting of the Association for Computational Linguistics

Toronto, Canada

2023. Visited on 02/10/2024 1 Association for Computational Linguistics Rhetors at the Wedding DARussell 10.1017/S0068673500004156 Proceedings of the Cambridge Philological Society the Cambridge Philological Society 1979. Visited on 05/07/2024 25 DARussell NGWilson Menander Rhetor. grc eng

Oxford

Clarendon Press 1981 Classicizing Rhetoric and Criticism: the Pseudo-Dionysian Exetasis and Mistakes in Declamation DARussell Le Classicisme à Rome aux 1 ers siècles avant et après 1979 25 LSadée De Dionysii Halicarnassensis scriptis rhetoricis quaestiones criticae Teubner 1878 Untersuchungen zur Anlage und Entstehung der beiden pseudodionysianischen Traktate περὶ ἐσχηματισμένων KSchöpsdau Rheinisches Museum für Philologie 118 1 1975 HSchott ΤΕΧΝΗ ΡΗΤΟΡΙΚΗ: quae vulgo integra Dionysio Halicarnassensi tribuitur, emendata, nova versione Latina et commentario illustrata Sumtibus E.B. Suicquerti 1804 A Pilot Study for BERT Language Modelling and Morphological Analysis for Ancient and Medieval Greek PSingh GRutten ELefever 10.18653/v1/2021.latechclfl-1.15 Proceedings of the 5th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature SDegaetano-Ortlieb AKazantseva NReiter SSzpakowicz the 5th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature

Punta Cana, Dominican Republic

Nov. 2021 Association for Computational Linguistics Machine Learning for Ancient Languages: A Survey TSommerschield YAssael JPavlopoulos VStefanak ASenior CDyer JBodel JPrag IAndroutsopoulos NDeFreitas 10.1162/coli_a_00481 Computational Linguistics 0891- 2017 49 3 Sept. 2023. Visited on 10/20/2024 Machine Learning for Ancient Languages: A Survey TSommerschield YAssael JPavlopoulos VStefanak ASenior CDyer JBodel JPrag IAndroutsopoulos NDe Freitas 10.1162/coli_a_00481 Computational Linguistics Sept. 2023 LSpengel Rhetores Graeci Teubner 1885 1 Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING-2024 RSprugnoli MPassarotti the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING-2024

Torino, Italia

ELRA and ICCL May 2024 Overview of the EvaLatin 2022 Evaluation Campaign RSprugnoli MPassarotti FMCecchini MFantoli GMoretti Proceedings of the Second Workshop on Language Technologies for Historical and Ancient Languages RSprugnoli MPassarotti the Second Workshop on Language Technologies for Historical and Ancient Languages

Marseille, France

European Language Resources Association June 2022 Like Two Pis in a Pod: Author Similarity Across Time in the Ancient Greek Corpus GStorey DMimno 10.22148/001c.13680 Journal of Cultural Analytics 2371- 4549 5 2 July 2020. Visited on 10/20/2024 Axiomatic Attribution for Deep Networks MSundararajan ATaly QYan eprint: 1703.01365 2017 Dionysii Halicarnasei quae fertur ars rhetorica rec. Hermannus Usener GThiele Göttingische Gelehrte Anzeigen 159 1897 On the State of the Art in Authorship Attribution and Authorship Verification JTyo BDhingra ZCLipton arXiv:2209.06869 2022 Dionysii Halicarnasei quae fertur Ars Rhetorica HUsener 1895 Teubner Leipzig Latin Dionysii Halicarnasei quae exstant Opuscula, volumen secundum. grc HUsener LRadermacher

Stuttgart-Leipzig

Teubner 1929 6 Attention Is All You Need AVaswani NShazeer NParmar JUszkoreit LJones ANGomez LKaiser IPolosukhin 2017 Rhetores Graeci CWalz 1834 KWeismann De Dionysii Halicarnassei vita et scriptis: Diss. inaug. Steuber 1837 BERT in Plutarch's Shadows IPYamshchikov ATikhonov YPantis CSchubert JJost Nov. 2022. on 12/29/2022