<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Fine-Tuning Pre-Trained Language Models for Authorship Attribution of the Pseudo-Dionysian Ars Rhetorica</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>GlebSchmidt</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>VeronicaVybornaya</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ivan P. Yamshchikov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>CAIRO, THWS, Technische Hochschule Würzburg-Schweinfurt</institution>
          ,
          <addr-line>Franz-Horn Str. 2, 97082 Würzburg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>CHR 2024: Computational Humanities Research Conference</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Independent scholar</institution>
          ,
          <addr-line>St. Petersburg</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Radboud University Nijmegen</institution>
          ,
          <addr-line>Erasmusplein 1, 6525 HT, Nijmegen</addr-line>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
      </contrib-group>
      <fpage>369</fpage>
      <lpage>385</lpage>
      <abstract>
        <p>This paper explores the use of pre-trained language models for Ancient Greek in the context of authorship attribution. The study adopts a two-step approach: first, the models are fine-tuned on a domainspecific corpus using a masked language modeling (MLM) objective; second, based on the fine-tuned model, a classifier is trained to address the authorship attribution task. The analysis centers on a corpus of texts on rhetorical theory from the Second Sophistic period, with particular focus on the PseudoDionysianArs Rhetorica. The results of the experiment suggest that this approach ofers valuable insights into the authorship of ancient texts. Notably, the findings align with some traditional scholarly views on the Ars Rhetorica while also opening the door to reconsidering long-discarded hypotheses about the treatise's internal structure. This study highlights how the integration of natural language processing and classical philology can significantly advance discussions in ancient literary scholarship.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;pre-trained language models</kwd>
        <kwd>authorship attribution</kwd>
        <kwd>authorship analysis</kwd>
        <kwd>historical languages</kwd>
        <kwd>transfer learning</kwd>
        <kwd>ancient greek (roman period)</kwd>
        <kwd>Ps</kwd>
        <kwd>-Dionysius's Ars Rhetorica</kwd>
        <kwd>BERT</kwd>
        <kwd>RoBERTa</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Over the past several years, the application of transformer-based neural netwo5r1k]sh[as
led to significant advancements in many NLP tasks related to historical languag4e4s,[
        <xref ref-type="bibr" rid="ref32 ref42">32, 43</xref>
        ].
However, unlike in the case of modern languages, where fine-tuning pre-trained transformers
for linguistic forensics is very commo1n4,[
        <xref ref-type="bibr" rid="ref1 ref19 ref47">48, 19, 1</xref>
        ], the application of such models for
authorship attribution tasks in historical languages remains relatively underexplored, although
some excellent seminal studies and surveys have been recently publish4e5d,1[
        <xref ref-type="bibr" rid="ref39 ref40 ref5">5, 40, 41</xref>
        ]. The
availability of state-of-the-art pre-trained language mode2l,s32[
        <xref ref-type="bibr" rid="ref38 ref53">, 39, 54</xref>
        ] excelling in multiple
downstream tasks suggests that the situation with authorship analysis can be diferent as well.
      </p>
      <p>
        Yamshchikov, Tikhonov, Pantis, Schubert, and Jost54[] obtained a pre-trained model for
Ancient Greek by fine-tuning a Modern GreekBERT model [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. The resulting model subsequently
served as the backbone for a classifier and proved efective for authorship attribution of the
so-called Pseudo-Plutarch corpus. Interestingly, despite being fine-tuned on a limited amount
of Ancient Greek data, the model obtained through transfer learning showed results
comparable to those of the models trained from scratch on significantly larger corpora, as reported
by Singh, Rutten, and Lefever 3[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] and Riemenschneider and Frank32[]. Drawing inspiration
from Yamshchikov, Tikhonov, Pantis, Schubert, and Jos5t4[], this study experiments with a
similar approach focusing on the works of late Greek rhetoricians.
      </p>
      <p>
        Greek prose on rhetorical theory from the period known as the Second Sophistic serves as
a crucial source, documenting the cultural and intellectual framework of Greek thought and
literature in the first centuries AD9[
        <xref ref-type="bibr" rid="ref20 ref5 ref6">, 6, 20, 5</xref>
        ]. However, the study of this extensive corpus of
texts, collectively referred to under the broad concepRthoeftores Graeci [
        <xref ref-type="bibr" rid="ref41 ref51">52, 42</xref>
        ], is significantly
complicated by endless controversies surrounding authorship, dating, and contextual factors
[
        <xref ref-type="bibr" rid="ref21">21</xref>
        ].
      </p>
      <p>In this paper, we explore the potential of a transformer-based models, fine-tuned for
sequence classification task, to provide further insights into the debate.</p>
      <p>
        The focal point of our study is the text conventionally referred to asAtrhseRhetorica (Art of
Rhetoric, hereafter ars). This work has long been attributed to, and frequently published under
the name of, the rhetorician and historian Dionysius of Halicarnascsau.s6(0–7 BC). However,
Sadée [
        <xref ref-type="bibr" rid="ref35">36</xref>
        ], followed by Usener4[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] and Usener and Radermacher5[0], demonstrated that the
ars most likely circulated anonymously, with its association to Dionysius emerging from a
much later conjecture. This conjecture appears to have been based on an overinterpretation of
a scholion (a marginal commentary) on chapt1e0r of the text.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Ars Rhetorica</title>
      <p>Several aspects of thears must be discussed in the context of statistical modelling of its writing
style.</p>
      <sec id="sec-2-1">
        <title>2.1. Not one, but multiple works</title>
        <p>
          The text has a complex structure. InParisinus Graecus 1741 [
          <xref ref-type="bibr" rid="ref30">30</xref>
          ], the only manuscript that
preserves all the material associated with thaers (f. 1–37), the text is divided into 11
chapters. However, these chapters do not constitute a homogeneous work, as the text is generally
understood to consist of two18[], three [
          <xref ref-type="bibr" rid="ref34 ref48 ref49">49, 50, 35, 33</xref>
          ], or even four 3[
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] distinct parts.
        </p>
        <p>The first part, covering ch. 1–7, provides concise instruction on ceremonial (epideictic)
oratory, addressing seven epideictic genres. These chapters are connected by cross-references and
recurring addresses to the author’s former pupil, Echecrates, to whom the text is presented as
a wedding gift. The remainder of the text, ch.8–11, may be interpreted as a combination of two
or three distinct works on separate topics. C8h–.9 explore the so-called “figured speeches”, i.e.,
speeches intended to convey a hidden meaning that may conflict with the literal content and
stated purpose of the speech, while ch1.0–11 focus on the criticism of declamation.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Authorship</title>
        <p>
          Ch. 1–7 exhibit a consistent compositional pattern and a recognizable writing style, suggesting
they were authored by the same rhetorician. However, whether these chapters form a coherent
and complete treatise is a matter of debate. This portion of tahres has been interpreted as a
collection of distinct letters or essay38s,[
          <xref ref-type="bibr" rid="ref18">18</xref>
          ], as remnants or excerpts from a much longer work
[
          <xref ref-type="bibr" rid="ref4 ref48 ref49">4, 49, 50</xref>
          ], or as a unified treatise [
          <xref ref-type="bibr" rid="ref22 ref46 ref52">53, 47, 22</xref>
          ]. For ch.8–11, the situation is even more ambiguous.
Usener [
          <xref ref-type="bibr" rid="ref48">49</xref>
          ] speculated that ch.8–9 were written by two diferent disciples attending separate
lectures of the same teacher. Penndor2f8[] and Schöpsdau [
          <xref ref-type="bibr" rid="ref36">37</xref>
          ] rejected the idea that ch.8–9
had a single author, suggesting instead that these texts drew from various sources. Similarly,
ch. 10–11 have been attributed either to the same author as ch8–.9 (with Heath [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] tentatively
identifying him as Sarapion Aelius, a 2nd-century Alexandrian rhetorician whose entire corpus
is lost) or to two diferent authors unrelated to the rest of tahres. Table 1 summarizes the
content and authorship hypotheses for the various sections of tahres.
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. Ars Rhetorica, Menanderian Corpus, and Pseudo-Hermogenes’ On Method</title>
        <p>
          Since the early days of scholarship on thAers Rhetorica, it has been noted that the rhetorical
instruction provided in ch1–.7 and ch.8–11 shows a clear methodological afÏnity with,
respectively, the treatises ascribed to Menander Rhetor (particularly the second one) and
PseudoHermogenes’On Method. The parallels with the second treatise attributed to Menander are
especially noteworthy. In both works:
• the occasion — rather than the subject, as in the first treatise attributed to Menander —
determines the genre;
• a very similar selection of genres is discussed (of the seven genres mentioned inartsh, e
only two are absent from Menander’s purported work; see Ta7b);le
• the author addresses a former disciple throughout the text.
This afÏnity led Heath [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] to describe thears as “comparable to, though less sophisticated
than” Menander’s work.
        </p>
        <p>
          The numerous parallels between c8h–.11 of the ars and Pseudo-Hermogenes’On Method
[
          <xref ref-type="bibr" rid="ref18 ref29">29, 18</xref>
          ] have led scholars to hypothesize either a shared sour2ce9][or a closer, albeit indirect,
connection [
          <xref ref-type="bibr" rid="ref18 ref34">35, 18</xref>
          ].
2.4. Dates
The dates of the texts constituting thears have been assessed diferently. For ch.1–7, a mention
of the 2nd-century sophist Nicostratus (ch2., par. 9, p. 266, l. 14), along with the considerable
focus on speeches addressing Roman magistrates, suggests a composition date no earlier than
the High Empire [
          <xref ref-type="bibr" rid="ref22 ref34">35, 22</xref>
          ]. Race [
          <xref ref-type="bibr" rid="ref31">31</xref>
          ] posits that the first part of the ars is roughly contemporary
with the corpus attributed to Menander Rhetor, which is datable to the late 3rd century AD. In
contrast, ch.8–11 may be a century earlier1[
          <xref ref-type="bibr" rid="ref8">8</xref>
          ], i.e., 2nd century AD.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Aims</title>
      <p>The hypotheses concerning the authorship of diferent parts of thaers have multiplied, as
have suggestions regarding its potential relationship with other texts. However, the evidence
presented in the scholarship so far is drawn almost exclusively from close reading and remains
inconclusive. Additionally, unlike the case with8c–h1.1, no eforts have been made to identify
the author responsible for ch1–.7.</p>
      <p>The aim of our investigation, therefore, is to apply modern natural language processing
techniques to this rich textual material in order to gather new evidence about the structure
of the ars and gain further insights into its authorship. The arguments formulated through
language modeling could provide a novel and valuable contribution to the debate, particularly
when considered alongside the accumulated philological evidence and existing codicological
indications.</p>
      <p>The main contributions of this work can be summarized as follows:
• We further fine-tune two pre-trained models for Ancient Greek and one model for
Modern Greek on a corpus of Greek rhetoricians. We subsequently use the resulting models
to train “open set”10-class classifiers capable of attributing short fragments of text to
diferent authors of the Second Sophistic period;
• Analyzing in more details the results provided by two best-performing models, we shed
light on the history of the Pseudo-Dionysiaanrs, suggesting that:
– Ch. 1–7 of the ars could have been authored by an individual from the same school
as the author(s) of the Menandrian treatises;
– Ch. 8–11 not only difer in authorship from ch1.–7, but may have been written by
two distinct individuals, one responsible for8c–h9. and another for ch1.0–11.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Corpus</title>
      <p>The primary focus of our study is a corpus comprising at le1a8strhetores Graeci from the
1st–4th centuries AD and Dionysius of Halicarnassus. We only retained the authors whose
teachings are relatively well-preserved, excluding those known only through fragmentary or
indirect evidence. Importantly, we focus exclusively on rhetorical theory, i.e., works with a
theoretical or pedagogical intent.</p>
      <p>
        A significant limitation of this dataset is that many of the rhetorical corpora within it have
notorious attribution problems of their own. In particular, there is a compelling case for the
heterogeneity of the Hermogenean corpus (see Sectio6n). Similarly, the question of whether
both treatises attributed to Menander were authored by the same person remains unresolved
[
        <xref ref-type="bibr" rid="ref17 ref31 ref33 ref8">34, 17, 31, 8</xref>
        ]. Other corpora raise similar questions, to2o1][. Being aware of this and
currently working on a follow-up authorship verification study of these corpora (the importance
of which was also insightfully emphasized by the reviewers of this work), for simplicity, we
continue to group the studied texts by authorship as categorized in Tthheesaurus Linguae
Graecae (TLG), where our dataset stems from1 . We deem this simplification legitimate. In most
cases, these questionable attributions are rooted in long-standing traditions that date back to
the early stages of textual transmission. For example, the Hermogenean corpus has been
consistently attributed to Hermogenes of Tarsus since as early as the 5th century (for more details
see Section6). Therefore, with all necessary reservations, these conventional groupings can be
considered to represent at least some kind of connection. Even if they do not link texts written
by the same individual, they may still group works originating from the same school. After all,
this is why such simplification is commonly used in scholarship.
      </p>
      <sec id="sec-4-1">
        <title>4.1. &lt;UNK&gt; category</title>
        <p>The literature on oratory theory was undoubtedly much richer than what has been preserved.
To account for this, we created an “open set” scenario. For this purpose, we set as9idsmealler
authorial corpora — those with a number of sentences below the dataset’s median valu5e17of
(marked with * in Table2). These texts were excluded from the dataset before our conventional
80/10/10 split and later added to the the test set. The idea is straightforward. If at the test stage
the model encounters a text that does not belong to any of the authorial classes learned during
the training, it is likely that the calibrated probability associated with the top prediction will
be relatively low. If it falls below a certain threshold, the model is programmed to abstain from
making a decision and assign an&lt;UNK&gt; label to the text in question. The samples wit&lt;hUNK&gt;
label in the test split are necessary to monitor the model’s capability to do.</p>
        <p>An overview of the classification dataset is presented in the Tabl2e.
1We cannot publish the full texts with all the corresponding metadata. However, the shufÒed chunks used in MLM
ifne-tuning and subsequent classifier training are made available on GitHubh:ttps://github.com/glsch/rhetores_
graeci.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Methodology</title>
      <sec id="sec-5-1">
        <title>5.1. Base Transformers</title>
        <p>
          To train our classifiers, we used three diferent pre-trained transformers as starting points:
(1) RoBERTa-sized GreBerta presented by Riemenschneider and Fran3k2][,
(2) Ancient GreekBERT trained by Singh, Rutten, and Lefever3[
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]
(3) Modern GreekBERT published by Koutsikakis, Chalkidis, Malakasiotis, and
Androutsopoulos [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ].
        </p>
      </sec>
      <sec id="sec-5-2">
        <title>5.2. Masked Language Modeling Fine-Tuning</title>
        <p>Fine-tuning pre-trained models on domain-specific corpora prior to further tuning them for a
downstream task at hand is a common practice in NLP. It allows the model to adapt better to
the unique linguistic features of the target domain. This intermediate step may enhance the
model’s ability to capture specific syntactical patterns and vocabulary, which in turn improves
the performance on the final downstream task, such as classification. For this reason, before
training classifiers for authorship attribution, we ran training with a masked language
modeling objective. BERT-sized models were trained fo3repochs with a learning rate1 × 10−5 and
warmup during the first 10% of training steps.RoBERTa-based model was trained fo1r epoch
only with a learning rat1e× 10−4 and without warmup steps. In both scenarios, the learning
rate was decreasing linearly.</p>
      </sec>
      <sec id="sec-5-3">
        <title>5.3. Sequence Classification</title>
        <p>Authorship classifiers were trained on both out-of-the-box models and their MLM-fine-tuned
versions. We employed a sliding window technique to segment the texts into chunks. The
process was as follows:
1. Tokenization: We used thebowphs/GreBerta tokenizer to convert the entire corpus into
tokens.2
2. Chunking: The tokenized corpus was then divided into chunks6o4ftokens, respecting
the boundaries of works (and even chapters — in the case of tahres);
3. Overlap: To ensure continuity and capture context that might span chunk boundaries,
we implemented an overlap between chunks. Each chunk overlapped with its adjacent
chunks by 32 tokens (half of the chunk length).
4. Decoding: Finally, we decoded these token chunks back into text, resulting in our
training data segments. By using a single tokenizer to chunk the entire corpus beforehand
instead of splitting the texts with a tokenizer of the corresponding model, we ensured
that all models were trained on the same segments of text.</p>
        <p>The training was carried out fo7r00 steps by sampling batches containing4 chunks per
authorial class. Validation set was checked eac3h50 steps, i.e., twice during the training. Test
set including&lt;UNK&gt;-labelled samples was checked upon the end of training. We report the
results obtained on the test set.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Results and Discussion</title>
      <sec id="sec-6-1">
        <title>6.1. General Performance</title>
        <p>
          Table3 summarizes the overall performance of the classifiers. Notably, additional MLM
training proved beneficial only for theRoBERTa-sized bowphs/GreBerta model. ForBERT-sized
models, however, the inclusion of new data was detrimentbaolw.phs/GreBerta appears to be
more stable, behaving more like general-purpose language models trained for well-resourced
modern languages. This stands to reason: out of the three models with which we experimented,
bowphs/GreBerta [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ] is the largest and was trained on the riches and highest-quality Ancient
Greek corpus.
        </p>
      </sec>
      <sec id="sec-6-2">
        <title>6.2. Authorship Attribution of the ars</title>
        <p>The aim of this study was to get some fresh evidence about the authorship of the
pseudoDionysianars, a precious witness to the development of rhetorical theory during the High to
Late Roman empire. Based on thestatus quaestionis surveyed in the section2, we set up 3
research questions:
2We did not repeat the experiment producing chunks with other available tokenizers.
1. Can we further comfort or challenge the existing consensus opinion, according to which
the attribution to Dionysius of Halicarnassus is incorrect?
2. How many works are discernible in thaers in the form we know it?
3. Can the model convincingly suggest an alternative attribution foratrhseor any of its
parts?</p>
        <p>To address these questions, we applied the trained classifier to individual chapters of the ars,
split into chunks following the described procedure. Ta4bsluemmarizes the predictions made
by the best-performingBERT-sized andRoBERTa-sized models3. For each chapter, we report
the “majority vote” (i.e., the number of chunks in the chapter attributed to a given author), the
author’s “share” (i.e., the proportion of chunks assigned to that author in the total number of
chapter chunks), and the mean probability of the author across the chunks of the chapter. In
the “majority vote”, the attribution is defined by the top probability even if it falls bel8o0w%.</p>
      </sec>
      <sec id="sec-6-3">
        <title>6.3. No trace of Dionysius of Halicarnassus</title>
        <p>In line with previous scholarship, although the name of Dionysius of Halicarnassus appears
among the attributions, its weight is insignificant. Therefore, with regard to the first of the
research questions, the evidence is overwhelming: stylistic afÏnity with Dionysius of
Halicarnassus’s writings is scarce, and the attribution to him cannot be supported by any of the two
models.</p>
      </sec>
      <sec id="sec-6-4">
        <title>6.4. ars’s association to the Menandrean corpus further strengthened</title>
        <p>Apart from this rather predictable conclusion, our classifiers yield new insights into more
complicated questions concerning the inner structure of athrse and the authorship of the texts,
which constitute it. As clear from the Tabl4e, the attribution profiles for ch.1–7 and 8–11 are
drastically diferent. Even when the probability is not high enough, Menander Rhetor is the
top-ranked candidate in ch1.–7. The signal is less clear in ch8.–11. This diference goes in
3pranaydeeps/Ancient-Greek-BERT and bowphs/GreBerta (R)</p>
        <p>Menander
Aelius Aristides
Dionysius H.</p>
        <p>Rest
Menander
Dionysius H.</p>
        <p>Aelius Aristides
Rest
Menander
Dionysius H.</p>
        <p>Hermogenes
Rest
Menander
Hermogenes
Aelius Aristides
Rest
Menander
Aelius Aristides
Dionysius H.</p>
        <p>Rest
Menander
Hermogenes
Dionysius H.</p>
        <p>Rest
Menander
Aelius Aristides
Dionysius H.</p>
        <p>Rest
Hermogenes
Valerius Apsines
Aelius Aristides
Rest
Hermogenes
Demetrius
Aelius Aristides
Rest
Hermogenes
Dionysius H.</p>
        <p>Valerius Apsines
Rest
Hermogenes
Menander
Dionysius H.</p>
        <p>Rest
“Majority vote”, share, and mean prediction probability for each chapter of the ars: Ancient Greek BERT
(R). “Rest” stands for the sum of all minor attributions. Sorted by the mean prediction
pranaydeeps/Ancient-Greek-BERT</p>
        <p>bowphs/GreBerta (R)
of ch. 1–7 being yet another argument in favour of its unity.
line with thecommunis opinio that the work is composite: a nearly identical attribution profile</p>
      </sec>
      <sec id="sec-6-5">
        <title>6.5. What does the model learn?</title>
        <p>
          For the sake of explainability, DH specialists still widely use the bag-of-words model and
corpus-specific manual feature engineering for various tasks involving writing style analysis,
such as authorship attribution, authorship and self-authorship verification, clustering, e1t3c,. [
          <xref ref-type="bibr" rid="ref12 ref24 ref3">12, 24, 3</xref>
          ]. Since deep learning methods lack this level of transparency, understanding exactly
        </p>
        <p>Menander
Aelius Aristides
Dionysius H.</p>
        <p>Rest
Menander
Aelius Aristides
Dionysius H.</p>
        <p>Rest
Menander
Hermogenes
Dionysius H.</p>
        <p>Rest
Menander
Aelius Aristides
Demetrius
Rest
Menander
Aelius Aristides
Hermogenes
Rest
Menander
Valerius Apsines
Dionysius H.</p>
        <p>Rest
Menander
Aelius Aristides
Valerius Apsines
Rest
Hermogenes
Aelius Aristides
Dionysius H.</p>
        <p>Rest
Aelius Aristides
Hermogenes
Dionysius H.</p>
        <p>Rest
Hermogenes
Dionysius H.</p>
        <p>Valerius Apsines
Rest
Hermogenes
Menander
Dionysius H.</p>
        <p>Rest
what our classifier learned is crucial. A thorough investigation of this matter will be the
subject of a separate study, using explainable AI techniques such as integrated gradients and token
attribution. Here, we limit our discussion to one insightful example, which seems to illustrate
how the model works.</p>
        <p>As previously mentioned in Sectio2n, all the genres addressed in c2h–.5 are also discussed in
the second treatise attributed to Menander Rhetor. Only the most prestigious of the epideictic
genres, the panegyric — focused on in ch1. and 7 — does not correspond to any section in
Menander’s works. However, ch1., which provides introductory notes on panegyrics, often
echoes the examples and some wording of the first treatise by Menander. C1h.ofers guidelines
on how to appropriately praise gods (“leaders and name-givers of any festival”), cities where
the festivals take place, and emperors who organize and preside over the festivals. All these
topics are covered in Menander’s first treatise.</p>
        <p>
          Considering only ch2.–5 or the fragments of ch.1 that have clear parallels in Menander’s
work, one might argue that the classifier’s decision was biased due to the significant content
and semantic overlap, especially since such a tendency has been reported aboutBtEhReT-based
classifiers [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. However, the consistency of the attribution profile across the chapters by both
models is reassuring, as it suggests that they capture more than just semantics.
        </p>
        <p>Menandrean association appears all the stronger when the values for the logical subdivisions
of the ars, ch. 1–7, are calculated. As Korenjak22[] has shown, in its current form, the order
of the chapters is disorganized, and it is possible that the author intended to arrange them as
follows: chapters1 and 7 (panegyrics or appraisal speeches), chapter2s–4 (speeches related to
family life occasions), and chapter5s–6 (speeches addressed to ofÏcials and epitaphs). In each
of these sections, Menander maintains a stable leadership (Tab5leasnd 6).
ars, ch. 8–11: multiple authors? The discrepancy between the attribution profiles of ch8.–
9 and ch.10–11 might suggest a division, albeit a less distinct one, than c1h–.7 versus ch.8–11.
This result aligns with the assessment made by Usene4r9[], although it does not provide any
further hint at the identity of the possible author.</p>
        <p>However, the opposite hypothesis should still be considered seriously. In1–c7h,.top two
single attributions (i.e., Menander Rhetor and another author) in terms of “share” would cover
at least 0.58–0.68 of the attributed chunks (ch7.). In contrast, the top two attributions in ch8–.
11 provide,at best, 0.58–0.62 of the attributions (ch1.1 and10), the attributions are more evenly
distributed. Apparently, among the author classes present in our dataset, none is stylistically
similar enough to the text of ch8–.11. This can be explained in two ways. Texts written in
a comparable style are either completely absent from the dataset or are not appropriately
distributed among author groups, making it challenging for the model to learn the features of
this particular writing style. Keeping in mind the existing hypothesis about the relationship
between the so-called Hermogenean canon and the works ascribed to Apsines, with extreme
caution, we incline to the latter explanation.</p>
        <p>
          Two works, which are part of the Hermogenean canoOn,n Invention and On Method, were
already in Late Antiquity associated with the name of Hermogenes. In our dataset, therefore,
following the TLG, we reproduce this conventional attribution. Yet, both are most likely
inauthentic [
          <xref ref-type="bibr" rid="ref10 ref11 ref25 ref26 ref27">10, 11, 25, 26, 27</xref>
          ]. If the argumentation presented by Heath1[
          <xref ref-type="bibr" rid="ref17 ref6">6, 17</xref>
          ] proves correct
and these two texts can securely be ascribed to Apsines, the “new” writing style they would
represent might possibly demonstrate a more pronounced afÏnity with the style of c8h–.11.
This and similar possibilities should be thoroughly checked in further experiments.
        </p>
        <p>The scope of the much-needed detailed follow-up study becomes evident. A systematic
and critical reassessment of attribution problems within the corpus ofRthheetores Graeci is
necessary. Beyond merely reflecting on the attributions of individual works, it is important to
establish the homogeneity of diferent rhetorical corpora within the framework of a pairwise
authorship verification study.</p>
        <p>But if we set aside the obscure case of ch8.–11, should we conclude that ch1.–7 were written
by Menander Rhetor? Given the aforementioned limitations of our dataset, we would not
go that far. However, our results suggest that the connection between the first part of the
Pseudo-Dionysianars and the Menandrean corpus likely extends beyond a theoretical afÏnity.
Despite the obvious terminological discrepancies between the texts and their diferent levels
of elaboration, the possibility of multiple authorship within the same school, or even common
authorship, should be considered with all seriousness. The divergence betweenartsh,ceh. 1–7,
and the Menandrean corpus can also be explained, apart from the natural evolution of personal
style and preferences, by the likelihood that those presenting complex rhetorical theory would
probably follow the advice formulated by the author of 1c1h.. The art of rhetoric involves
presenting material in a way that convinces the audience. Thus, orators are similar to doctors
who must not only select the right medication but also administer it in a manner acceptable
to the patient [50, ch. 11, par. 9, p. 385, ll.7–12]. In other words, multiple contextual factors
influenced the style of the presentation, and, in the cases when the stylistic afÏnity is clear, one
should not probably overinterpret isolated diferences.</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>7. Conclusion</title>
      <p>This study uses transformer-based models to analyze ancient rhetorical texts for authorship
attribution in classical philology. First, we adapted these models to handle the linguistic nuances
of Ancient Greek texts from the 1st to the 4th century AD using masked language modeling.
We then apply the fine-tuned models to identify authorship markers iAnrs Rhetorica, a text
possibly written by multiple ancient writers. This application not only reminds of benefits
of modern AI techniques to classical studies but also deepens our understanding of ancient
literary compositions through modern computational methods.</p>
      <p>The results of BERT and RoBERTa classifiers do not support connection of tahres to
Dionysius of Halicarnassus, going in line with the previous studies that question his authorship. They
also strengthen the link oafrs to the Menandrean corpus, particularly evident in the distinct
attribution profiles between chapters1–7 and 8–11, which suggests a composite nature of the
work.</p>
      <p>Despite the lack of transparency of MLM techniques compared to conventional methods,
which prioritize human-interpretable features, the efectiveness and relevance of machine
learning methods is noteworthy.</p>
      <p>
        While neural networks are often criticized in digital humanities for their black-box nature
[
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], their ability to detect writing styles make them a valuable tool in the field of digital
humanities. The use of these models promises significant advancements in authorship attribution
and our understanding of ancient literary works.
      </p>
    </sec>
    <sec id="sec-8">
      <title>8. Limitations</title>
      <p>This study has several limitations that should be considered when interpreting the results.</p>
      <p>Firstly, the issue of disputed authorship within the dataset is a significant challenge. For
instance, the Hermogenean corpus and Menandrean treatises, both central to our analysis, have
long-standing debates regarding their true authorship, see Sect4io.nThese uncertainties could
afect the attribution accuracy. We are currently working on a study intended to solve this issue,
adopting an authorship verification approach.</p>
      <p>
        Secondly, the use of transformer-based models lBikEReT andRoBERTa, come with limitations
related to their opaque nature. The lack of interpretability in these models means that
understanding the specific features and patterns the models use to make attributions is challenging.
This limits our ability to provide a transparent rationale for the models’ decisions, which is
often critical in digital humanities research. Yet, the attempts were made to find way to make
the results of pre-trained language models more interpretable, e.g., by means of the so-called
integrated gradients4[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. These methods can perhaps be adapted for cases similar to ours.
      </p>
      <p>Despite achieving notable accuracies with relatively short chun6k4s t(okens), the models’
performance still leaves room for improvement, particularly in terms of handling unbalanced
corpora and downplaying the influence of the thematic clues. Nevertheless, their performance,
comparable to state-of-the-art results for modern languages, demonstrates an ability to
capture writing style. There clearly are instances where the models are overly confident, leading
to incorrect authorship attribution. These errors could arise from factors such as the models’
sensitivity to stylistic nuances and the complexity of the texts. Embracing more sophisticated
methodologies for uncertainty-aware training would be an interesting avenue for further
exploration.</p>
      <p>Another potential avenue for future research is the development of chronological and
regional classifiers. Texts from diferent regions and periods may exhibit unique linguistic and
stylistic features that are not captured by a generalized model. Developing classifiers specific
to historical periods or geographical (and cultural) areas could enhance attribution accuracy
and ofer more detailed insights into thears and many other texts.</p>
    </sec>
    <sec id="sec-9">
      <title>Acknowledgments</title>
      <p>We extend their gratitude to Jürgen Jost, Charlotte Schubert, Friedrich Meissner, Caroline Macé,
and Mark de Kreij for welcoming this study and future collaboration between machine learning,
history, and philology.</p>
      <p>We would also like to thank Ben Nagy and two anonymous reviewers for the careful reading
and insightful feedback.</p>
      <p>We thank Shari Boodts and Sven Meeder, Principal Investigators of the ERC Proof of Concept
project “ManuscriptAI” and the ERC Consolidator project “SOLEMNE”. Without their support,
this research would not have been possible.
[33] D. A. Russell. “Rhetors at the Wedding”. en. InPr:oceedings of the Cambridge Philological
Society 25 (1979), pp. 104–117. issn: 0068-6735, 2053-5899. doi: 10.1017/S0068673500004
156. (Visited on 05/07/2024).
The code and both models considered in detail in this study are accessible at:
• https://huggingface.co/glsch
• https://github.com/glsch/rhetores_graeci</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>B.</given-names>
            <surname>Ai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Tan</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Tan</surname>
          </string-name>
          .
          <article-title>Whodunit? Learning to Contrast for Authorship Attribution</article-title>
          .
          <year>2022</year>
          . doi:
          <volume>10</volume>
          .48550/ARXIV.2209.
          <fpage>11887</fpage>
          . (Visited on 01/11/
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>D.</given-names>
            <surname>Bamman</surname>
          </string-name>
          and
          <string-name>
            <surname>P. J. BurnsL.atin BERT</surname>
          </string-name>
          :
          <article-title>A Contextual Language Model for Classical Philology</article-title>
          .
          <year>2020</year>
          . url: https://arxiv.org/abs/
          <year>2009</year>
          .10053.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>P.</given-names>
            <surname>Beullens</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Haverals</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B.</given-names>
            <surname>Nagy</surname>
          </string-name>
          . “
          <article-title>The Elementary Particles: A Computational Stylometric Inquiry into the Mediaeval Greek-Latin Aristotle”</article-title>
          .MIned:iterranea.
          <source>International Journal on the Transfer of Knowledge</source>
          <volume>9</volume>
          (
          <issue>Apr</issue>
          .
          <year>2024</year>
          ), pp.
          <fpage>385</fpage>
          -
          <lpage>408</lpage>
          . issn:
          <fpage>2445</fpage>
          -
          <lpage>2378</lpage>
          . doi:
          <volume>10</volume>
          .21071/mijtk.v9i.
          <fpage>16723</fpage>
          . (Visited on 05/06/
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>F.</given-names>
            <surname>Blass. De Dionysii</surname>
          </string-name>
          <article-title>Halicarnassensis scriptis rhetoricis</article-title>
          . Bonn: Max Cohen et fil.,
          <year>1863</year>
          . url: https://books.google.nl/books?id=k3g-
          <fpage>AAAAcAA</fpage>
          .J
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>B. E.</given-names>
            <surname>Borg</surname>
          </string-name>
          .
          <article-title>Paideia: the World of the Second Sophistic</article-title>
          . de Gruyter,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>G. W.</given-names>
            <surname>Bowersock</surname>
          </string-name>
          .
          <article-title>Greek sophists in the Roman Empire</article-title>
          . eng. Oxford: Clarendon Press,
          <year>1969</year>
          . isbn:
          <fpage>978</fpage>
          -0-
          <fpage>19</fpage>
          -814279-9.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>F.</given-names>
            <surname>Brad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Manolache</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Burceanu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Barbalau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Ionescu</surname>
          </string-name>
          , and
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>PopesRcuet.hinking the Authorship Verification Experimental Setups</article-title>
          .
          <year>2022</year>
          . url: https://arxiv.org/abs/2112.05 125.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>K.</given-names>
            <surname>Brodersen</surname>
          </string-name>
          .Menandros.
          <article-title>Abhandlungen zur Rhetorik. ger grc</article-title>
          . Vol.
          <volume>88</volume>
          . Bibliothek der griechischen Literatur.
          <source>Stuttgart: Anton Hiersemann</source>
          ,
          <year>2019</year>
          . isbn:
          <fpage>978</fpage>
          -3-
          <fpage>7772</fpage>
          -1934-9.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>T. C.</given-names>
            <surname>Burgess</surname>
          </string-name>
          .
          <article-title>Epideictic literature</article-title>
          . Vol.
          <volume>3</volume>
          . University of Michigan Library,
          <year>1902</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>E. Bürgi.</surname>
          </string-name>
          “
          <article-title>Ist die dem Hermogenes zugeschriebene Schrift Περὶ μεθόδου δεινότητος echt? I.”</article-title>
          <source>In: Wiener Studien</source>
          <volume>48</volume>
          (
          <year>1930</year>
          ), pp.
          <fpage>187</fpage>
          -
          <lpage>197</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>E. Bürgi.</surname>
          </string-name>
          “
          <article-title>Ist die dem Hermogenes zugeschriebene Schrift Περὶ μεθόδου δεινότητος echt? II.”</article-title>
          <source>In: Wiener Studien</source>
          <volume>49</volume>
          (
          <year>1931</year>
          ), pp.
          <fpage>40</fpage>
          -
          <lpage>69</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>T.</given-names>
            <surname>Clérice</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Glaise</surname>
          </string-name>
          . “
          <string-name>
            <surname>Twenty-One</surname>
          </string-name>
          *
          <article-title>Pseudo-Chrysostoms and more: Authorship Verification in the Patristic World”</article-title>
          .
          <source>In:Computational Humanities Research Conference 2023. Proceedings of the Computational Humanities Research Conference 2022. Dec</source>
          .
          <year>2023</year>
          . url: https://inria.hal.
          <source>science/hal-0421117</source>
          .
          <fpage>6</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>S.</given-names>
            <surname>Corbara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Moreo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Sebastiani</surname>
          </string-name>
          , and
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>TavonMi.edLatinEpi and MedLatinLit: Two Datasets for the Computational Authorship Analysis of Medieval Latin Texts</article-title>
          . Sept.
          <year>2021</year>
          . url: http://arxiv.org/abs/
          <year>2006</year>
          .12289(visited on 02/05/
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>M.</given-names>
            <surname>Fabien</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Villatoro-Tello</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Motlicek</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Parida</surname>
          </string-name>
          . “
          <article-title>BertAA : BERT Fine-Tuning for Authorship Attribution”</article-title>
          .
          <source>InP:roceedings of the 17th International Conference on Natural Language Processing (ICON)</source>
          . Ed. by
          <string-name>
            <given-names>P.</given-names>
            <surname>Bhattacharyya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. M.</given-names>
            <surname>Sharma</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Sangal</surname>
          </string-name>
          . Indian Institute of Technology Patna, Patna, India: NLP Association of India (NLPAI),
          <year>Dec</year>
          .
          <year>2020</year>
          , pp.
          <fpage>127</fpage>
          -
          <lpage>137</lpage>
          . url: https://aclanthology.org/
          <year>2020</year>
          .icon-main..
          <source>16</source>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>B.</given-names>
            <surname>Graziosi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Haubold</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Cowen-Breen</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Brooks</surname>
          </string-name>
          . “
          <article-title>Machine Learning and the Future of Philology: A Case Study”</article-title>
          .
          <source>en. InT:APA 153</source>
          .1 (
          <issue>Mar</issue>
          .
          <year>2023</year>
          ), pp.
          <fpage>253</fpage>
          -
          <lpage>284</lpage>
          . issn:
          <fpage>2575</fpage>
          -
          <lpage>7199</lpage>
          . doi:
          <volume>10</volume>
          .1353/apa.
          <year>2023</year>
          .
          <year>a901022</year>
          .
          <source>(Visited on 10/20/</source>
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>M.</given-names>
            <surname>Heath</surname>
          </string-name>
          . “Hermogenes' Biographers”.
          <source>InE:ranos 96</source>
          (
          <year>1998</year>
          ), pp.
          <fpage>44</fpage>
          -
          <lpage>54</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>M.</given-names>
            <surname>Heath</surname>
          </string-name>
          .
          <article-title>Menander: a Rhetor in Context</article-title>
          . Oxford University Press, USA,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>M.</given-names>
            <surname>Heath</surname>
          </string-name>
          . “
          <article-title>Pseudo-Dionysius Art of Rhetoric 8-11: Figured speech, Declamation, and Criticism”</article-title>
          .
          <source>In:American Journal of Philology 124.1</source>
          (
          <issue>2003</issue>
          ), pp.
          <fpage>81</fpage>
          -
          <lpage>105</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>J.</given-names>
            <surname>Huertas-Tato</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Huertas-Garcia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Martin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and D.</given-names>
            <surname>CamachoPA</surname>
          </string-name>
          .RT:
          <string-name>
            <surname>Pre-Trained Authorship Representation Transformer. Sept</surname>
          </string-name>
          .
          <year>2022</year>
          . url: http://arxiv.org/abs/2209.15373 (visited on 01/11/
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>G. A.</given-names>
            <surname>Kennedy</surname>
          </string-name>
          .
          <article-title>Greek Rhetoric under Christian Emperors</article-title>
          . Vol.
          <volume>3</volume>
          . Wipf and Stock Publishers,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>G. A. Kennedy. “</surname>
          </string-name>
          <article-title>Some Recent Controversies in the Study of Later Greek Rhetoric”</article-title>
          .
          <source>In: American Journal of Philology 124.2</source>
          (
          <issue>2003</issue>
          ), pp.
          <fpage>295</fpage>
          -
          <lpage>301</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>M.</given-names>
            <surname>Korenjak</surname>
          </string-name>
          . “
          <string-name>
            <surname>Ps</surname>
          </string-name>
          .-DionysiusArs
          <string-name>
            <surname>RhetoricaI-VII</surname>
          </string-name>
          :
          <article-title>One Complete Treatise”</article-title>
          .
          <source>In:Harvard Studies in Classical Philology</source>
          <volume>105</volume>
          (
          <year>2010</year>
          ), pp.
          <fpage>239</fpage>
          -
          <lpage>254</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>J.</given-names>
            <surname>Koutsikakis</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Chalkidis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Malakasiotis</surname>
          </string-name>
          ,
          <string-name>
            <surname>and I. Androutsopoulos.</surname>
          </string-name>
          “
          <article-title>Greek-BERT: The Greeks Visiting Sesame Street”</article-title>
          .
          <source>In:11th Hellenic Conference on Artificial Intelligence . SETN 2020</source>
          . Athens, Greece: Association for Computing Machinery,
          <year>2020</year>
          , pp.
          <fpage>110</fpage>
          -
          <lpage>117</lpage>
          . isbn:
          <volume>9781450388788</volume>
          . doi:
          <volume>10</volume>
          .1145/3411408.3411440.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>N.</given-names>
            <surname>Manousakis</surname>
          </string-name>
          and
          <string-name>
            <surname>E. Stamatatos.</surname>
          </string-name>
          “
          <article-title>Authorship Analysis and the Ending of Seven Against Thebes: Aeschylus' Antigone or Updating Adaptation?” en</article-title>
          .
          <source>ICnl:assical World 116.3</source>
          (
          <issue>Mar</issue>
          .
          <year>2023</year>
          ), pp.
          <fpage>247</fpage>
          -
          <lpage>274</lpage>
          . issn:
          <fpage>1558</fpage>
          -
          <lpage>9234</lpage>
          . doi:
          <volume>10</volume>
          .1353/clw.
          <year>2023</year>
          .
          <volume>0007</volume>
          . (Visited on 02/01/
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>M.</given-names>
            <surname>Patillon</surname>
          </string-name>
          . “
          <article-title>LeDe Inventione du Pseudo-Hermogène”</article-title>
          .
          <source>InT:eilband Sprache und Literatur</source>
          .
          <article-title>Einzelne Autoren seit der hadrianischen Zeit und Allgemeines zur Literatur des 2. und 3</article-title>
          .
          <string-name>
            <surname>Jahrhunderts</surname>
          </string-name>
          . Vol.
          <volume>34</volume>
          /3. Aufstieg und Niedergang der römischen Welt. Berlin, Boston: De Gruyter,
          <year>1997</year>
          , pp.
          <fpage>2064</fpage>
          -
          <lpage>2172</lpage>
          . isbn:
          <volume>9783110815146</volume>
          . doi:
          <volume>10</volume>
          .1515/
          <fpage>9783110815146</fpage>
          -
          <lpage>003</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>M.</given-names>
            <surname>PatillonP.</surname>
          </string-name>
          seudo-Hermogène,
          <string-name>
            <surname>L</surname>
          </string-name>
          'Invention. Anonyme,
          <article-title>Synopse des exordes</article-title>
          .
          <source>grc fre</source>
          . Vol.
          <volume>3</volume>
          ,
          <issue>1</issue>
          . Corpus rhetoricum. Paris: Les Belles lettres,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>M.</given-names>
            <surname>PatillonP.</surname>
          </string-name>
          seudo-Hermogène, La méthode de l'habilité. Maxime,
          <article-title>Lex objections irréfutables</article-title>
          . Anonyme,
          <article-title>Méthodes des discours d'adresse. grc fre</article-title>
          . Vol.
          <volume>5</volume>
          . Corpus rhetoricum. Paris: Les Belles lettres,
          <year>2014</year>
          . isbn:
          <fpage>978</fpage>
          -2-
          <fpage>251</fpage>
          -00591-1.
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>J.</given-names>
            <surname>Penndorf</surname>
          </string-name>
          . “
          <article-title>De sermone figurato quaestio rhetorica”</article-title>
          .
          <source>InL:eipziger Studien zur classischen Philologie</source>
          <volume>20</volume>
          (
          <year>1902</year>
          ), pp.
          <fpage>169</fpage>
          -
          <lpage>194</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>H.</given-names>
            <surname>Rabe</surname>
          </string-name>
          . Hermogenis Opera. Teubner,
          <year>1985</year>
          . isbn:
          <fpage>978</fpage>
          -3-
          <fpage>519</fpage>
          -01760-8. url:https://books .google.nl/books?id=WreAtwEACA A.J
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>H.</given-names>
            <surname>Rabe</surname>
          </string-name>
          . “
          <article-title>Rhetoren-Corpora”</article-title>
          .
          <source>InR:heinisches Museum</source>
          <volume>67</volume>
          (
          <year>1912</year>
          ), pp.
          <fpage>321</fpage>
          -
          <lpage>357</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>W. H.</given-names>
            <surname>Race</surname>
          </string-name>
          . Menander Rhetor.
          <source>Dionysius of Halicarnassus, Ars Rhetorica. grc eng</source>
          . Vol.
          <volume>539</volume>
          .
          <article-title>Loeb classical library</article-title>
          . Cambridge (Mass.) London: Harvard University Press,
          <year>2019</year>
          . isbn:
          <fpage>978</fpage>
          -0-
          <fpage>674</fpage>
          -99722-6.
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>F.</given-names>
            <surname>Riemenschneider</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Frank</surname>
          </string-name>
          . “
          <article-title>Exploring Large Language Models for Classical Philology”</article-title>
          . en. In:
          <article-title>Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)</article-title>
          . Toronto, Canada: Association for Computational Linguistics,
          <year>2023</year>
          , pp.
          <fpage>15181</fpage>
          -
          <lpage>15199</lpage>
          . doi:
          <volume>10</volume>
          . 18653 / v1 /
          <year>2023</year>
          . acl - long .
          <fpage>846</fpage>
          . (Visited on 02/10/
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>D. A.</given-names>
            <surname>Russell</surname>
          </string-name>
          and
          <string-name>
            <surname>N. G. WilsonM.</surname>
          </string-name>
          enander Rhetor.
          <source>grc eng</source>
          . Oxford: Clarendon Press,
          <year>1981</year>
          . isbn:
          <fpage>978</fpage>
          -0-
          <fpage>19</fpage>
          -814013-9.
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>D. A.</given-names>
            <surname>Russell</surname>
          </string-name>
          . “
          <article-title>Classicizing Rhetoric and Criticism: the Pseudo-Dionysian Exetasis and Mistakes in Declamation”</article-title>
          .
          <source>InL:e Classicisme à Rome aux 1ers siècles avant et après J</source>
          .-C
          <volume>25</volume>
          (
          <year>1979</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>L.</given-names>
            <surname>Sadée. De Dionysii</surname>
          </string-name>
          <article-title>Halicarnassensis scriptis rhetoricis quaestiones criticae</article-title>
          .
          <source>lat. Strasbourg: Teubner</source>
          ,
          <year>1878</year>
          . isbn:
          <fpage>978</fpage>
          -0-
          <fpage>666</fpage>
          -72899-9.
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [37]
          <string-name>
            <given-names>K.</given-names>
            <surname>Schöpsdau</surname>
          </string-name>
          . “
          <article-title>Untersuchungen zur Anlage und Entstehung der beiden pseudodionysianischen Traktate περὶ ἐσχηματισμένων”</article-title>
          .
          <source>In:Rheinisches Museum für Philologie 118.H. 1/2</source>
          (
          <year>1975</year>
          ), pp.
          <fpage>83</fpage>
          -
          <lpage>123</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [38]
          <string-name>
            <given-names>H.</given-names>
            <surname>Schott. ΤΕΧΝΗ ΡΗΤΟΡΙΚΗ</surname>
          </string-name>
          <article-title>: quae vulgo integra Dionysio Halicarnassensi tribuitur, emendata, nova versione Latina et commentario illustrata</article-title>
          . Sumtibus E.B.
          <string-name>
            <surname>Suicquerti</surname>
          </string-name>
          ,
          <year>1804</year>
          . url: https://books.google.nl/books?id=SiYUAAAAYAA.J
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [39]
          <string-name>
            <given-names>P.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Rutten</surname>
          </string-name>
          , and
          <string-name>
            <surname>E. Lefever. “</surname>
          </string-name>
          <article-title>A Pilot Study for BERT Language Modelling and Morphological Analysis for Ancient and Medieval Greek”P. rIonc:eedings of the 5th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage</article-title>
          ,
          <source>Social Sciences, Humanities</source>
          and Literature. Ed. by
          <string-name>
            <given-names>S.</given-names>
            <surname>Degaetano-Ortlieb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kazantseva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Reiter</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Szpakowicz</surname>
          </string-name>
          . Punta Cana,
          <article-title>Dominican Republic (online): Association for Computational Linguistics</article-title>
          , Nov.
          <year>2021</year>
          , pp.
          <fpage>128</fpage>
          -
          <lpage>137</lpage>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <year>2021</year>
          .latechclfl-
          <volume>1</volume>
          .1.
          <fpage>5</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [40]
          <string-name>
            <given-names>T.</given-names>
            <surname>Sommerschield</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Assael</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Pavlopoulos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Stefanak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Senior</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Dyer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bodel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Prag</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Androutsopoulos</surname>
          </string-name>
          , and N. De Freitas. “
          <article-title>Machine Learning for Ancient Languages: A Survey”</article-title>
          . en.
          <source>In:Computational Linguistics</source>
          <volume>49</volume>
          .3 (
          <issue>Sept</issue>
          .
          <year>2023</year>
          ), pp.
          <fpage>703</fpage>
          -
          <lpage>747</lpage>
          . issn:
          <fpage>0891</fpage>
          -
          <lpage>2017</lpage>
          ,
          <fpage>1530</fpage>
          -
          <lpage>9312</lpage>
          . doi:
          <volume>10</volume>
          .1162/coli_a_
          <fpage>00481</fpage>
          . (Visited on 10/20/
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          [41]
          <string-name>
            <given-names>T.</given-names>
            <surname>Sommerschield</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Assael</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Pavlopoulos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Stefanak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Senior</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Dyer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bodel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Prag</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Androutsopoulos</surname>
          </string-name>
          , and N. de Freitas. “
          <article-title>Machine Learning for Ancient Languages: A Survey”</article-title>
          .
          <source>In:Computational Linguistics (Sept</source>
          .
          <year>2023</year>
          ), pp.
          <fpage>703</fpage>
          -
          <lpage>747</lpage>
          . doi:
          <volume>10</volume>
          .1162/coli_a_
          <volume>0</volume>
          <fpage>0481</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          [42]
          <string-name>
            <given-names>L.</given-names>
            <surname>Spengel</surname>
          </string-name>
          .
          <source>Rhetores Graeci</source>
          . Vol.
          <volume>1</volume>
          .
          <string-name>
            <surname>Teubner</surname>
          </string-name>
          ,
          <year>1885</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          [43]
          <string-name>
            <given-names>R.</given-names>
            <surname>Sprugnoli</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Passarotti</surname>
          </string-name>
          , edPsr.
          <source>oceedings of the Third Workshop on Language Technologies for Historical</source>
          and
          <article-title>Ancient Languages (LT4HALA) @ LREC-COLING-2024</article-title>
          . Torino, Italia: ELRA and ICCL, May
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          [44]
          <string-name>
            <given-names>R.</given-names>
            <surname>Sprugnoli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Passarotti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. M.</given-names>
            <surname>Cecchini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Fantoli</surname>
          </string-name>
          , and
          <string-name>
            <surname>G. Moretti.</surname>
          </string-name>
          “
          <article-title>Overview of the EvaLatin 2022 Evaluation Campaign”</article-title>
          .
          <source>InP:roceedings of the Second Workshop on Language Technologies for Historical and Ancient Languages</source>
          . Ed. by
          <string-name>
            <given-names>R.</given-names>
            <surname>Sprugnoli</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Passarotti</surname>
          </string-name>
          . Marseille, France: European Language Resources Association,
          <year>June 2022</year>
          , pp.
          <fpage>183</fpage>
          -
          <lpage>188</lpage>
          . url: https://aclanthology.org/
          <year>2022</year>
          .lt4hala-
          <fpage>1</fpage>
          .
          <fpage>29</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          [45]
          <string-name>
            <given-names>G.</given-names>
            <surname>Storey</surname>
          </string-name>
          and
          <string-name>
            <given-names>D.</given-names>
            <surname>Mimno</surname>
          </string-name>
          . “
          <article-title>Like Two Pis in a Pod: Author Similarity Across Time in the Ancient Greek Corpus”</article-title>
          . en.
          <source>InJ:ournal of Cultural Analytics 5.2 (July</source>
          <year>2020</year>
          ). issn:
          <fpage>2371</fpage>
          -
          <lpage>4549</lpage>
          . doi:
          <volume>10</volume>
          .22148/001c.
          <fpage>13680</fpage>
          . (Visited on 10/20/
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          [46]
          <string-name>
            <given-names>M.</given-names>
            <surname>Sundararajan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Taly</surname>
          </string-name>
          , and
          <string-name>
            <surname>Q.</surname>
          </string-name>
          <article-title>YanA. xiomatic Attribution for Deep Networks</article-title>
          .
          <year>2017</year>
          . eprint:
          <volume>1703</volume>
          .01365. url: https://arxiv.org/abs/1703.01365.
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          [47]
          <string-name>
            <surname>G. Thiele.</surname>
          </string-name>
          “
          <article-title>Dionysii Halicarnasei quae fertur ars rhetorica rec. Hermannus Usener”</article-title>
          .
          <source>In: Göttingische Gelehrte Anzeigen</source>
          <volume>159</volume>
          (
          <year>1897</year>
          ), pp.
          <fpage>237</fpage>
          -
          <lpage>43</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          [48]
          <string-name>
            <given-names>J.</given-names>
            <surname>Tyo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Dhingra</surname>
          </string-name>
          , and
          <string-name>
            <surname>Z. C.</surname>
          </string-name>
          <article-title>LiptonO.n the State of the Art in Authorship Attribution</article-title>
          and
          <string-name>
            <given-names>Authorship</given-names>
            <surname>Verification</surname>
          </string-name>
          .
          <year>2022</year>
          . arXiv:
          <volume>2209</volume>
          .06869 [cs.CL]. url: https://arxiv.org/abs/220 9.06869.
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          [49]
          <string-name>
            <given-names>H.</given-names>
            <surname>Usener</surname>
          </string-name>
          .
          <article-title>Dionysii Halicarnasei quae fertur Ars Rhetorica</article-title>
          . Latin. Leipzig: Teubner,
          <year>1895</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref49">
        <mixed-citation>
          [50]
          <string-name>
            <given-names>H.</given-names>
            <surname>Usener</surname>
          </string-name>
          and
          <string-name>
            <given-names>L.</given-names>
            <surname>Radermacher</surname>
          </string-name>
          , edsD. ionysii
          <article-title>Halicarnasei quae exstant</article-title>
          . Vol.
          <volume>6</volume>
          : Opuscula, volumen secundum.
          <source>grc</source>
          . Vol.
          <volume>6</volume>
          . Stuttgart-Leipzig: Teubner,
          <year>1929</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref50">
        <mixed-citation>
          [51]
          <string-name>
            <given-names>A.</given-names>
            <surname>Vaswani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Shazeer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Parmar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Uszkoreit</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Jones</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. N.</given-names>
            <surname>Gomez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Kaiser</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and I. Polosukhin.Attention</given-names>
            <surname>Is All You Need</surname>
          </string-name>
          .
          <year>2017</year>
          . url: https://arxiv.org/abs/1706.03762.
        </mixed-citation>
      </ref>
      <ref id="ref51">
        <mixed-citation>
          [52]
          <string-name>
            <given-names>C.</given-names>
            <surname>Walz</surname>
          </string-name>
          .Rhetores Graeci.
          <year>1834</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref52">
        <mixed-citation>
          [53]
          <string-name>
            <given-names>K.</given-names>
            <surname>Weismann. De Dionysii Halicarnassei</surname>
          </string-name>
          vita et scriptis: Diss. inaug. Steuber,
          <year>1837</year>
          . url: https://books.google.nl/books?id=5XJSAAAAcAA.J
        </mixed-citation>
      </ref>
      <ref id="ref53">
        <mixed-citation>
          [54]
          <string-name>
            <given-names>I. P.</given-names>
            <surname>Yamshchikov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Tikhonov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Pantis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Schubert</surname>
          </string-name>
          , and
          <string-name>
            <surname>J. JostB.</surname>
          </string-name>
          <article-title>ERT in Plutarch's Shadows</article-title>
          .
          <source>Nov</source>
          .
          <year>2022</year>
          . url: http://arxiv.org/abs/2211.05673(visited on 12/29/
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>