<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Team “better_call_claude”: Style Change Detection using a Sequential Sentence Pair Classifier⋆</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Gleb Schmidt</string-name>
          <email>gleb.schmidt@ru.nl</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Johannes Römisch</string-name>
          <email>johannes.roemisch@study.thws.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mariia Halchynska</string-name>
          <email>mariia.halchynska@study.thws.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Svetlana Gorovaia</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ivan P. Yamshchikov</string-name>
          <email>ivan.yamshchikov@thws.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Center for Artificial Intelligence, Technical University of Applied Sciences Würzburg-Schweinfurt</institution>
          ,
          <addr-line>Münzstraße 12, 97070, Würzburg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Humanities Lab, Faculaty of Arts, Radboud University</institution>
          ,
          <addr-line>Houtlaan 4, 6525 XZ, Nijmegen</addr-line>
          ,
          <country country="NL">Netherlands</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>LEYA Lab, School of Computer Science, Physics and Technology, HSE University</institution>
          ,
          <addr-line>6, 25th Liniya, Vasilievsky Ostrov, 199004, St Petersburg</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <abstract>
        <p>Style change detection-identifying the points in a document where writing style shifts-remains one of the most important and challenging problems in computational authorship analysis. At PAN 2025, the shared task challenges participants to detect style switches at the most fine-grained level: individual sentences. The task spans three datasets, each designed with controlled and increasing thematic variety within documents. We propose to address this problem by modeling the content of each problem instance-that is, a series of sentences-as a whole, using a Sequential Sentence Pair Classifier (SSPC). The architecture leverages a pre-trained language model (PLM) to obtain representations of individual sentences, which are then fed into a bidirectional LSTM (BiLSTM) to contextualize them within the document. The BiLSTM-produced vectors of adjacent sentences are concatenated and passed to a multi-layer perceptron for prediction per adjacency. Building on the work of previous PAN participants classical text segmentation, the approach is relatively conservative and lightweight. Nevertheless, it proves efective in leveraging contextual information and addressing what is arguably the most challenging aspect of this year's shared task: the notorious problem of “stylistically shallow”, short sentences that are prevalent in the proposed benchmark data. Evaluated on the oficial PAN 2025 test datasets, the model achieves strong macro-F1 scores of 0.923, 0.828, and 0.724 on the EASY, MEDIUM, and HARD data, respectively, outperforming not only the oficial random baselines but also a much more challenging one: claude-3.7-sonnet's zero-shot performance.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Style Change Detection</kwd>
        <kwd>Text Segmentation</kwd>
        <kwd>Sequence Labeling</kwd>
        <kwd>BiLSTM</kwd>
        <kwd>Large Language Models</kwd>
        <kwd>Pre-Trained Language Models</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Detecting changes in writing style—in other words, identifying places within a document where stylistic
signal changes—is a form of authorship analysis that, perhaps alongside authorship verification, holds the
greatest potential for applications beyond industrial contexts, particularly in humanities research. Given
that our contemporary concept of individual authorship—let alone formal definitions of intellectual
property and copyright—is a relatively recent development, a substantial part of human written culture
goes back to periods when, broadly speaking, “collaborative writing” (actual co-authorship, extensive
reuse, interpolation to mention but a few of its possible forms), was not only common—as it remains
today—but was often regarded as a way of declaring ones belonging to a tradition, and therefore valued
even more highly than original composition.</p>
      <p>CLEF 2025 Working Notes, 9 – 12 September 2025, Madrid, Spain
⋆You can use this document as the template for preparing your publication. We recommend using the latest version of the
ceurart style.
* Corresponding author.</p>
      <p>Nonetheless, the exploration of such “mixed-authorship texts” is typically hindered precisely by the
uncertainties surrounding their authorial structure, which creates notorious contextualization problems
and subsequently puts strict limitations on the interpretation of such texts.</p>
      <p>
        Recent studies have shown that computational methods can ofer valuable insights into mixed
authorship at the level of entire corpora [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] or major subdivisions of individual works [
        <xref ref-type="bibr" rid="ref2 ref3 ref4">2, 3, 4</xref>
        ]. However,
this level of granularity may often be insuficient for solving research questions that require a more
ifne-grained diarization—at the paragraph or even sentence level [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>
        In this context, the decade-long efort of PAN organizers to stimulate research in this direction
deserves special recognition. In various forms, the style change detection task has consistently been
a part of the competition’s program since 2016, making the participants’ work notes and traditional
overviews published in the aftermath of these events an invaluable source of methodological insight [
        <xref ref-type="bibr" rid="ref6 ref7">6, 7</xref>
        ].
Echoing the field’s growing theoretical sophistication, for almost a decade the PAN workshops have
been ofering increasingly complex benchmark data and task formulations, encouraging participants to
push the boundaries of achievable.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <sec id="sec-2-1">
        <title>2.1. Style Change Detection at PAN</title>
        <p>
          Since its first inclusion in the PAN program in 2016, the style change detection task has appeared in
various formulations. However, most of the factors contributing to its complexity were already present
in the first two editions—namely, the required level of granularity for document analysis, the uncertainty
regarding the number of style changes or contributing authors, and the need to segment the document.
The most recent development of the task introduced only one additional—though important—dimension:
controlled topic diversity in the data.
At PAN 2016, the task was framed as an authorship diarization problem, closely related to the traditional
intrinsic plagiarism detection explored during the early years of the competition’s history [
          <xref ref-type="bibr" rid="ref8 ref9">8, 9</xref>
          ].
Participants faced three subtasks, each highlighting diferent challenges that were to become recurring
focal points of the task in the following years. The first subtask assumed a major author and required
identifying segments written by secondary authors. In the second subtask, the number of authors
was provided, and participants were required to cluster document segments by authorship. The third
and most challenging subtask involved building authorial clusters without any prior knowledge of the
number of authors.
        </p>
        <p>
          Operating at the sentence level, both proposed methods relied on traditional stylometric features,
which were then processed using techniques typical of intrinsic plagiarism detection—namely,
thresholdor Gaussian HMM-based outlier detection [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], and clustering [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]. These approaches, however, failed
to achieve suficient performance at such a fine level of granularity.
        </p>
        <p>
          The poor performance led to a redefinition of the task as a style breach detection problem at PAN
2017. Instead of complete clustering a document’s segments by authorship, participants were asked to
predict whether a document was written by multiple authors and, if so, to identify the points at which
the writing style changes [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ].
        </p>
        <p>
          The submitted approaches again centered on distance measures and outlier detection applied to
sentences as well as actual or artificially constructed paragraphs, represented using either conventional
stylometric features [
          <xref ref-type="bibr" rid="ref13 ref14">13, 14</xref>
          ] or neural sentence embeddings [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. Despite the relative improvement
over PAN 2016, the results of PAN 2017 confirmed that style-based document decomposition remained
marginally beyond the state of the art at that time.
        </p>
        <p>
          Therefore, for PAN 2018, the task was redefined once again, framing the problem as a document-level
binary classification task. This invited participants to explore how stylistic inconsistency could be
detected across an entire text [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. The submissions reflected both conventional feature engineering
combined with rule-based or classical machine learning approaches [
          <xref ref-type="bibr" rid="ref17 ref18">17, 18, 19</xref>
          ] and early applications
of deep learning [20, 21]. Ensembling multiple classifiers operating on diverse feature spaces—each
capturing diferent aspects of language—not only proved reliable [
          <xref ref-type="bibr" rid="ref18">19, 18</xref>
          ], but also yielded the winning
result [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]. The core of the classifier proposed by [ 21] was a CNN designed to capture patterns of
character bigrams in groups of varying length.
        </p>
        <p>The submission by [20] deserves special attention not only because it scored second, but also because
it anticipated developments in recent authorship analysis research and served as a distant source of
inspiration for the approach proposed below. Instead of relying on traditional feature spaces—where
lexical components took center stage at the time—[20] focused exclusively on the dependency trees of
sentences. The expressiveness of this feature space has recently been confirmed in a series of authorship
attribution studies [22, 23, 24, 25]. [20] define and extract what they call a Parse Tree Feature (PTF)—a
path from the root to any given word—and use it to represent each sentence as a sequence of its words’
PTFs, and each document as a sequence of sentence representations. Subsequently, both the original
and reversed versions of the document are fed into an LSTM with an additional sentence-level attention
mechanism, which contextualizes each sentence based on rich syntactic information across the entire
problem. Finally, the similarity between the original and reversed representations is computed and used
as the basis for prediction.</p>
        <p>
          Responding to performance boost observed at PAN 2018 on a simplified version of the problem,
the organizers of PAN 2019 increased the task’s complexity once again by adding the objective of
predicting the number of authors per document [26]. To address this task, [27] employed a combination
of clustering techniques based on representations of balanced-size text chunks obtained using the 50
most frequent words (MFW). Relying on a multi-layer perceptron operating on tf-idf-weighted word
unigrams to detect style changes within a document, Zuo et al. subsequently used an extension of [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]’s
feature extraction procedure to represent document paragraphs. They then applied an ensemble model
comprising two clustering methods and a multi-layer perceptron to predict the number of style changes.
        </p>
        <p>
          The Style Change Detection task at PAN 2020 was marked by two important shifts. First, after a
significant departure from its originally intended scope during PAN 2018—2019, “the task was steered
back into its original direction” [
          <xref ref-type="bibr" rid="ref1">29, 1</xref>
          ]. The segmentation component was reintroduced, and in addition
to the document-level prediction of multi-authorship, participants were required to identify the positions
where paragraph-level style changes occur. Second, for the first time, a solution based on pre-trained
transformers was employed to address the task [30], significantly outperforming clustering-based
approaches—the 0-maximal used by [31] and a modified version of [ 27], which, however, remained
undocumented.
        </p>
        <p>
          PAN 2021 reintroduced yet another element of the task’s original scope that had previously been
set aside as too complex: grouping of text segments by authorship within documents. The shared
task presented arguably the most complete formulation of the problem, comprising three separate
questions: (1) whether the text was written by multiple authors; (2) where between paragraphs the
writing style changes; and (3) which author each paragraph belongs to [32]. Although the competition
saw an increasing reliance on pre-trained transformers, it was marked by a wide diversity of methods.
[33] proposed the highest-scoring approach for Tasks 2 and 3, using a similarity measure extracted from
paragraphs with a pre-trained BERT model. They approached all tasks simultaneously, first solving
Tasks 2 and 3 in an authorship verification fashion. Each paragraph was compared with all preceding
ones, and a new authorial class was assigned whenever a paragraph could not be classified as written
by the same author as any of the previous ones. This information was subsequently used to solve the
remaining tasks. The approach by [34] excelled at Task 1. It combined sentence features extracted using
BERT and aggregated per paragraphs with the set of stylometric features proposed by [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]. The tasks
were solved by stacking two feature spaces and feeding them to an ensemble of four classifiers. [ 35]
decomposed the tasks into a series of authorship verification problems and solved them adapting the
method proposed by [36]. Other works operated over various selections of stylometric features and
used LSTM-powered model [37] and Siamese architecture [38] respectively.
        </p>
        <p>
          Three subtasks of PAN 2022 challenged participants with both segmentation task and granularity
level. Task 1 required finding the only style shift in a document co-authored by two persons. In Tasks 2
and 3, it was necessary to find style changes in a text written by two or more authors with switches
occurring at only paragraph or paragraph and sentence levels. Despite clear prevalence of pre-trained
transformer-based approaches, submissions exclusively working with manually-engineered feature
spaces and traditional classification or clustering approaches [
          <xref ref-type="bibr" rid="ref19 ref20">39, 40</xref>
          ] or combining hand-picked features
with those extracted using pre-trained models were submitted [
          <xref ref-type="bibr" rid="ref21">41</xref>
          ]. One of the submissions downright
“hacked” the task by accessing extrinsic information online and yielding a nearly perfect result [
          <xref ref-type="bibr" rid="ref22">42</xref>
          ].
Most submissions, however, explored diferent PLM-based architectures. The overall best score was
achieved by [
          <xref ref-type="bibr" rid="ref23">43</xref>
          ] who obtained predictions for pairs of sentences or paragraphs by ensembling classifiers
based on BERT, RoBERTa, and ALBERT. [
          <xref ref-type="bibr" rid="ref24">44</xref>
          ] classified pairs of sentence or paragraph representations
obtained applying one-dimensional convolution and max pooling to BERT output. [
          <xref ref-type="bibr" rid="ref25">45</xref>
          ] used a
promptbased approach fine-tuning a BERT model with masked language modeling objective to predict special
tokens such &lt;DIFFERENT&gt; or &lt;SAME&gt; in a dynamically-constructed sequence: “They are the &lt;MASK&gt;
writing style: Para1 and Para2”. [
          <xref ref-type="bibr" rid="ref26">46</xref>
          ] trained three diferent transformer models to address each subtask.
[
          <xref ref-type="bibr" rid="ref27">47</xref>
          ] used LSTM, convolution, and max pooling over BERT-based word representations.
Two past shared tasks are both characterized by explicit intention to put the theoretical problem closer
to real-world scenarios and addressed the problem of possible topic consistency within the documents
introducing controlled levels of thematic homogeneity in benchmark data challenging participants with
development of methods less dependent on thematic signal.
        </p>
        <p>The solutions submitted to PAN 2023 demonstrated relative dificulty of this setting. Whereas most
submission achieved F1 score of more than 90% and 80% on EASY and MEDIUM tasks where writing
style change could coincide with thematic shift, the performance on single-topic HARD dataset was
significantly lower.</p>
        <p>
          The year was marked by further expansion of the PLMs’ use, although one solution focusing on
traditional stylometry was also submitted [
          <xref ref-type="bibr" rid="ref28">48</xref>
          ]. One of the important tendencies that year was a broad
diversity of ways in which PLMs’s linguistic knowledge was integrated into the solutions. [
          <xref ref-type="bibr" rid="ref29 ref30">49, 50</xref>
          ] made
recourse to contrastive learning in former case combining it with a prompt-based approach that excelled
on the HARD dataset. [
          <xref ref-type="bibr" rid="ref31">51</xref>
          ] solved the task as inference problem concatenating paragraphs and predicting
special tokens &lt;ENTAILMENT&gt; or &lt;CONTRADICTION&gt;. [
          <xref ref-type="bibr" rid="ref32">52</xref>
          ] pre-trained a custom model serving as a
basis for classifier, while [
          <xref ref-type="bibr" rid="ref33">53</xref>
          ] achieved highest scores on EASY and MEDIUM data using an ensemble of
several PLMs to predict binary labels for concatenated pairs of adjacent paragraphs.
        </p>
        <p>
          The following year’s shared task on multi-author style analysis retained the same definition and
structure of benchmark data as 2023 [
          <xref ref-type="bibr" rid="ref34">54</xref>
          ], continuing the focus on paragraph-level style change
detection with varying levels of topical homogeneity. Contrastive learning techniques and ensemble
architectures based on large PLMs took an even more prominent role. As a result, overall performance
improved and the gap between multi-topic and single-topic document scenarios narrowed. The best
submission achieved an impressive F1 around 86% on HARD dataset. Notably, purely traditional
stylometry approaches virtually disappeared in 2024, as nearly all top methods relied on fine-tuned transformer
models (often in combination or with specialized training objectives) to detect writing style changes.
        </p>
        <sec id="sec-2-1-1">
          <title>2.1.3. Generated Text Detection in Human-AI Collaborative Hybrid Texts</title>
          <p>The surge of Generative AI has given rise to a new field of application for methods conceptually related
to style change detection—authorship analysis in hybrid documents, i.e., texts co-written by humans
and AI. Zeng et al. investigated the detection of AI-generated passages within human–AI collaborative
texts, highlighting unique challenges of this setting: frequent author switches, obfuscation by
postediting, and the limited availability of stylistic cues in short segments. The suggested approach—a
two-step segmentation and classification method augmented with modern transformers and contrastive
techniques—builds directly on the foundations laid by style change detection research.</p>
          <p>
            This hybrid document segmentation problem was also the focus of the ALTA 2024 shared task
[
            <xref ref-type="bibr" rid="ref36">56</xref>
            ], which required participants to identify AI-generated sentences in news articles. The competition
demonstrated a clear methodological convergence with style change detection research at PAN.
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology</title>
      <p>
        At first glance, this year’s task may appear less challenging. On the one hand, it does not require the
explicit grouping of text segments by authorship. On the other, sentence-level granularity is by no
means new and has been successfully addressed in previous editions. Yet, as the organizers note, the
benchmark data was designed to more accurately replicate real-world scenarios, which is why the level
of thematic coherence within each document was meticulously controlled [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>
        Whereas the documents in the EASY dataset always cover multiple topics, the MEDIUM and HARD
datasets exhibit limited or no thematic diversity, respectively. Therefore, while in the first case solutions
may rely on thematic clues as potential indicators of style change, the latter two—each to a diferent
extent—force participants to rely more heavily on detecting subtle style changes rather than topic
variations [
        <xref ref-type="bibr" rid="ref6">6, 438</xref>
        ].
      </p>
      <p>Further challenges of this year’s shared task become evident in preliminary exploratory analysis
of the benchmark data, which reveals several peculiarities of the data that significantly amplify the
dificulty of the task (see Appendix A):
• Relatively short sentence length.
• A substantial portion—over 10%—of all sentences are exact duplicates, with some occurring more
than 3,000 times1.
• Hundreds of sequences only contain punctuation marks, but are placed on separate lines and
thus formally treated as separate sentences by the compilers of the data.</p>
      <p>Consequently, each individual sentence may not provide suficient information for identifying author’s
ifngerprint and making a reliable decision. A pair of identical or nearly identical one- or two-word
sentences—not only fairly common in Internet communication in general and also abundantly present
in the data—may be entirely style- or even content-neutral, i.e., provide no reliable clues whatsoever.</p>
      <sec id="sec-3-1">
        <title>3.2. Core Intuition</title>
        <p>To address the notorious problem of such “shallow sentences”, we pivoted our approach around the
idea of incorporating into the decision-making process the one thing that even the most minimalist
one-word sentence always has—its context, or, in simpler terms, its position within the problem.
Therefore, we designed a BiLSTM-powered solution intended to model a problem as a whole and capture
positional dependencies between the document’s sentences treated as atomic units that are organized
into stylistically cohesive segments.</p>
        <p>
          Our inspiration comes from late 2010s work on text segmentation and learning cohesion breaks. [
          <xref ref-type="bibr" rid="ref37">57</xref>
          ]
demonstrated eficiency of bidirectional RNN trained on positive and negative examples of cohesive
text in learning breaks in speech transcriptions. Further theoretical step was made by [
          <xref ref-type="bibr" rid="ref38">58</xref>
          ] who
presented text topic segmentation task as a supervised, specifically—sequence labeling, problem and
employed to a BiLSTM powered architecture operating over sentence embeddings to implement this
approach. [
          <xref ref-type="bibr" rid="ref39">59</xref>
          ]’s system, SegBot, achieved reliable performance on segmentation task at sentence and
Elementary Discourse Unit (EDU) level. Improving and expanding the method, [
          <xref ref-type="bibr" rid="ref40">60</xref>
          ] implemented a
system segmenting texts into thematically coherent sections and assigning topic labels. Glavaš and
1The most frequently repeated sentences include moderation messages (e.g., “Debate/discuss/argue the merits of ideas, don’t
attack people.”, “r/politics is currently accepting new moderator applications.”) and automatic notifications (e.g., “Personal
insults, shill or troll accusations, hate speech, any suggestion or support of harm, violence, or death, and other rule violations
can result in a permanent ban.”, “I am a bot, and this action was performed automatically.”)
Somasundaran and Lo et al. implemented similar contextualization approaches employing two-level
transformers.
        </p>
        <p>
          The approach also has predecessors among PAN participants. Hosseinia and Mukherjee treated
representations of problem’s sentences with syntactic features as atomic units and explored their
sequential dependencies using an LSTM. More recently, BiLSTM appeared several times as a steps in
extraction of sentence or paragraph representation from PLMs [
          <xref ref-type="bibr" rid="ref27">37, 47</xref>
          ].
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3.3. Architecture</title>
        <p>To implement the idea, we opted for a lightweight solution. A problem—a sequence of sentences—is
considered a single sample, and its sentences are encoded using a PLM. Applying straightforward mean
pooling to token embeddings, fixed-length representations of the problem’s sentences are obtained.
Then, to capture the inter-sentence contextual clues, the sequence of problem’s sentence vectors is feed
into a BiLSTM. This layer outputs context-aware sentence representations enriching raw mean-pooled
vectors with information from the sentences across the entire problem. Subsequently, by concatenating
each sentence vector with adjacent one across feature dimensions, representations of pairs of adjacent
sentences are constructed. These are then passed through a multi-layer perceptron (MLP) classifier
that outputs logits corresponding to the probability of a style change between each sentence pair (see
Figure 1).</p>
        <p>This design enables the model to leverage both the semantic richness of the fine-tuned backbone PLM
and the sequential structure of the problem, resulting in robust style change detection performance
across documents of varying lengths and complexities. The submitted implementation code is available
on GitHub2.</p>
        <sec id="sec-3-2-1">
          <title>3.3.1. Base Transformer</title>
          <p>
            Diferent pre-trained models were tested (see Table 1), but StyleDistance/styledistance,
presented by [
            <xref ref-type="bibr" rid="ref43">63</xref>
            ], was retained and submitted for evaluation on the test data.
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Training</title>
      <sec id="sec-4-1">
        <title>4.1. Data Augmentation</title>
        <p>To obtain more training data, we used three datasets from PAN 2024. All sentence transitions within
a single paragraph were labeled as not representing a style change. The first sentence of each new
paragraph was labeled as 1, indicating a style change.
2https://github.com/glsch/better-call-claude_pan25-multi-author-style-analysis</p>
        <p>A single model for all subtasks was trained on the most complete training data: the three
sentencelevel datasets from 2025 and all three paragraph-turned-sentence-level datasets from 2024.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Training</title>
        <p>The Sequential Sentence Pair Classifier was implemented in PyTorch Lightning, allowing for seamless
experimentation with both the architecture and hyperparameters.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Baselines</title>
      <p>
        At PAN 2024, three baselines were used: RANDOM, PREDICT 1, and PREDICT 0. While using all
these baselines, we decided on another—and more challenging—one, LLM, zero-shot predictions by
a best-performing LLM claude-3.7-sonnet prompted with the so-called linguistically informed
prompts (LIP) [
        <xref ref-type="bibr" rid="ref47">67</xref>
        ]. For the detailed description of the baseline setup we address the reader to [
        <xref ref-type="bibr" rid="ref48">68</xref>
        ].
      </p>
    </sec>
    <sec id="sec-6">
      <title>6. Results</title>
      <p>The following results were obtained on the oficial PAN 2025 test data (Table 3).</p>
      <p>The gradual decline in performance from the easy to the hard tasks reflects the increasing dificulty of
identifying style changes. Overall, our model outperforms zero-shot large language model predictions
on both the easy and hard tasks and falls short by only a fraction of a percentage point on the medium
task. At the same time, the proposed solution is lightweight and does not require much computational
resources.</p>
    </sec>
    <sec id="sec-7">
      <title>7. Discussion</title>
      <p>Several observations stem from our approach to the PAN 2025 style change detection task.</p>
      <p>First, while our model is efective, its strength lies in exploiting the macrostructure of the problem
and inter-sentence contextual patterns—particularly the sequential order and distribution of sentences—
rather than in isolating purely stylistic signals. This reflects a broader shift from traditional stylometric
analysis, which typically assumes topical uniformity and relies on intrinsic features like syntax and
lexical choice. This raises concerns about generalizability.</p>
      <p>Future iterations of the task could focus more on isolation of stylistic signal by reducing contextual
cues—e.g., further controlling topic coherence or randomizing sentence order—to more rigorously test a
model’s ability to capture intrinsic authorial style.</p>
      <p>Second, the strong zero-shot performance of Claude draws attention to the growing impact of LLMs
in authorship analysis. LLMs, with their vast pretraining and generalization capabilities, can recognize
both contextual and stylistic patterns with little to no task-specific adjustment. Future PAN tasks
might clearly separate evaluation tracks that allow external LLM calls from those that do not even for
otherwise AI-unrelated tasks.</p>
    </sec>
    <sec id="sec-8">
      <title>Conclusion</title>
      <p>
        Despite the promising results of the proposed model, it has several limitations. First, the padding strategy
required for batch processing may hinder scalability and eficiency when applied to much longer texts
or inputs with highly variable lengths. Second, while the BiLSTM used for contextualization has proven
efective, it may not be the optimal architecture for capturing complex dependencies, particularly in
longer input sequences. More sophisticated architectures—such as those proposed by [
        <xref ref-type="bibr" rid="ref41 ref42">61, 62</xref>
        ]—could
potentially yield better results.
      </p>
      <p>Finally, our reliance on a frozen pre-trained encoder may limit the model’s adaptability to
domainspecific nuances. Fine-tuning the encoder or incorporating domain-specific data and training strategies
could further improve performance.</p>
    </sec>
    <sec id="sec-9">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the author(s) used Claude 4 and GPT-4 (gpt-3.5/gpt-4) to perform
grammar/ spelling checks and edit text for clarity. Additionally, the author(s) used Perplexity’s research
tools to double-check relevant literature and contributions related to the topic. All content was reviewed
and edited by the author(s), who take full responsibility for the final publication.
multi-aspect approach for robust style change detection, in: CLEF 2018 Evaluation Labs and
Workshop–Working Notes Papers, CEUR-WS. org, 2018, p. 3.
[19] K. Safin, A. Ogaltsov, Detecting a change of style using text statistics, Working Notes of CLEF
(2018).
[20] M. Hosseinia, A. Mukherjee, A Parallel Hierarchical Attention Network for Style Change Detection,</p>
      <p>CLEF, 2018.
[21] N. Schaetti, Character-based Convolutional Neural Network for Style Change Detection: Notebook
for PAN at CLEF 2018., in: CLEF (Working Notes), 2018.
[22] R. Gorman, Author identification of short texts using dependency treebanks without vocabulary,
Digital Scholarship in the Humanities 35 (2020) 812–825. URL: https://academic.oup.com/dsh/
article/35/4/812/5606771. doi:10.1093/llc/fqz070.
[23] R. Gorman, Universal Dependencies and Author Attribution of Short Texts with Syntax Alone.,</p>
      <p>DHQ: Digital Humanities Quarterly 16 (2022).
[24] R. Gorman, Morphosyntactic Annotation in Literary Stylometry, Information 15 (2024) 211. URL:
https://www.mdpi.com/2078-2489/15/4/211. doi:10.3390/info15040211.
[25] V. B. Gorman, R. J. Gorman, A morphosyntactic authorship attribution study of the speeches
of Demosthenes and Apollodorus, The Journal of Hellenic Studies 144 (2024) 65–92. URL: https:
//www.cambridge.org/core/product/identifier/S0075426924000302/type/journal_article. doi: 10.
1017/S0075426924000302.
[26] E. Zangerle, M. Tschuggnall, G. Specht, B. Stein, M. Potthast, Overview of the Style Change
Detection Task at PAN 2019, in: L. Cappellato, N. Ferro, D. E. Losada, H. Müller (Eds.), Working
Notes Papers of the CLEF 2019 Evaluation Labs, volume 2380 of CEUR Workshop Proceedings, 2019.</p>
      <p>URL: https://ceur-ws.org/Vol-2380/paper_243.pdf, iSSN: 1613-0073.
[27] S. Nath, Style change detection by threshold based and window merge clustering methods., in:</p>
      <p>CLEF (Working Notes), 2019.
[28] C. Zuo, Y. Zhao, R. Banerjee, Style Change Detection with Feed-forward Neural Networks., CLEF
(Working Notes) 93 (2019).
[29] E. Zangerle, M. Mayerl, G. Specht, B. Stein, M. Potthast, Overview of the Style Change Detection
Task at PAN 2020, in: L. Cappellato, C. Eickhof, N. Ferro, A. Névéol (Eds.), Working Notes
Papers of the CLEF 2020 Evaluation Labs, volume 2696 of CEUR Workshop Proceedings, 2020. URL:
https://ceur-ws.org/Vol-2696/paper_256.pdf, iSSN: 1613-0073.
[30] A. Iyer, S. Vosoughi, Style Change Detection Using BERT., CLEF (Working Notes) 93 (2020) 106.
[31] D. Castro-Castro, C. A. Rodríguez-Lozada, R. Muñoz, Mixed Style Feature Representation and</p>
      <p>B-maximal Clustering for Style Change Detection., in: CLEF (Working Notes), 2020.
[32] E. Zangerle, M. Mayerl, M. Potthast, B. Stein, Overview of the Style Change Detection Task at
PAN 2021, in: G. Faggioli, N. Ferro, A. Joly, M. Maistro, F. Piroi (Eds.), Working Notes Papers of
the CLEF 2021 Evaluation Labs, volume 2936 of CEUR Workshop Proceedings, 2021. URL: https:
//ceur-ws.org/Vol-2936/paper-148.pdf, iSSN: 1613-0073.
[33] Z. Zhang, Z. Han, L. Kong, X. Miao, Z. Peng, J. Zeng, H. Cao, J. Zhang, Z. Xiao, X. Peng, Style Change
Detection Based On Writing Style Similarity—Notebook for PAN at CLEF 2021, in: G. Faggioli,
N. Ferro, A. Joly, M. Maistro, F. Piroi (Eds.), CLEF 2021 Labs and Workshops, Notebook Papers,
CEUR-WS.org, 2021. URL: http://ceur-ws.org/Vol-2936/paper-198.pdf.
[34] E. Strøm, Multi-label Style Change Detection by Solving a Binary Classification Problem., in:</p>
      <p>CLEF (working notes), 2021, pp. 2146–2157.
[35] R. Singh, J. Weerasinghe, R. Greenstadt, Writing Style Change Detection on Multi-Author
Documents., in: CLEF (Working Notes), 2021, pp. 2137–2145.
[36] J. Weerasinghe, R. Greenstadt, Feature vector diference based neural network and logistic
regression models for authorship verification, in: CEUR workshop proceedings, volume 2695,
2020.
[37] R. Deibel, D. Löflad, Style Change Detection on Real-World Data using an LSTM-powered</p>
      <p>Attribution Algorithm., in: CLEF (Working Notes), 2021, pp. 1899–1909.
[38] S. Nath, Style change detection using Siamese neural networks., in: CLEF (Working Notes), 2021,</p>
    </sec>
    <sec id="sec-10">
      <title>A. Dataset Statistics</title>
      <p>Tables 4 and 5 represent general data statistics and top-5 duplicated sentences in the data.</p>
      <p>Problems</p>
      <p>Sentences</p>
      <p>Avg words/sent.</p>
      <p>Median words/sent.</p>
      <p>Avg sent./doc</p>
      <p>Median sent./doc
In general, be courteous to others.</p>
      <p>Debate/discuss/argue the merits of ideas, don’t attack people.</p>
      <p>Personal insults, shill or troll accusations, hate speech, any suggestion or support of harm,
violence, or death, and other rule violations can result in a permanent ban.</p>
      <p>For those who have questions regarding any media outlets being posted on this subreddit,
please click to review our details as to our approved domains list and outlet criteria.
r/politics is currently accepting new moderator applications.</p>
    </sec>
    <sec id="sec-11">
      <title>B. Online Resources</title>
      <p>• GitHub</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>T.</given-names>
            <surname>Clérice</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Glaise</surname>
          </string-name>
          , Twenty-One*
          <article-title>Pseudo-Chrysostoms and more: authorship verification in the patristic world</article-title>
          ,
          <source>in: Computational Humanities Research Conference</source>
          <year>2023</year>
          ,
          <source>Proceedings of the Computational Humanities Research Conference</source>
          <year>2022</year>
          , Paris, France,
          <year>2023</year>
          . URL: https: //inria.hal.science/hal-04211176.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>F.</given-names>
            <surname>Cafiero</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.-B. Camps</surname>
          </string-name>
          , '
          <article-title>Psyché'as a Rosetta Stone? Assessing Collaborative Authorship in the French 17th Century Theatre</article-title>
          , Proceedings http://ceur-ws.
          <source>org ISSN 1613</source>
          (
          <year>2021</year>
          )
          <fpage>0073</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>P.</given-names>
            <surname>Plecháč</surname>
          </string-name>
          ,
          <article-title>Relative contributions of Shakespeare and Fletcher in Henry VIII: An analysis based on most frequent words and most frequent rhythmic patterns</article-title>
          ,
          <source>Digital Scholarship in the Humanities</source>
          <volume>36</volume>
          (
          <year>2021</year>
          )
          <fpage>430</fpage>
          -
          <lpage>438</lpage>
          . Publisher: Oxford University Press.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>G.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Vybornaya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. P.</given-names>
            <surname>Yamshchikov</surname>
          </string-name>
          ,
          <article-title>Fine-Tuning Pre-Trained Language Models for Authorship Attribution of the Pseudo-Dionysian Ars Rhetorica</article-title>
          , Aarhus,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.</given-names>
            <surname>Eder</surname>
          </string-name>
          , Rolling stylometry,
          <source>Digital Scholarship in the Humanities</source>
          <volume>31</volume>
          (
          <year>2016</year>
          )
          <fpage>457</fpage>
          -
          <lpage>469</lpage>
          . Publisher: Oxford University Press.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J.</given-names>
            <surname>Bevendorf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Dementieva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Fröbe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Gipp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Greiner-Petter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Karlgren</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mayerl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Nakov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Panchenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Potthast</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Shelmanov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Stamatatos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Wiegmann</surname>
          </string-name>
          , E. Zangerle, Overview of PAN 2025:
          <article-title>Generative AI Detection, Multilingual Text Detoxification, Multi-author Writing Style Analysis, and Generative Plagiarism Detection</article-title>
          , in: C.
          <string-name>
            <surname>Hauf</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Macdonald</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Jannach</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Kazai</surname>
            ,
            <given-names>F. M.</given-names>
          </string-name>
          <string-name>
            <surname>Nardini</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Pinelli</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Silvestri</surname>
          </string-name>
          , N. Tonellotto (Eds.),
          <source>Advances in Information Retrieval</source>
          , Springer Nature Switzerland, Cham,
          <year>2025</year>
          , pp.
          <fpage>434</fpage>
          -
          <lpage>441</lpage>
          . URL: https://doi.org/10.1007/978-3-
          <fpage>031</fpage>
          -88720-8_
          <fpage>64</fpage>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>031</fpage>
          -88720-8_
          <fpage>64</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>E.</given-names>
            <surname>Zangerle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mayerl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Potthast</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          , Multi-Author
          <source>Writing Style Analysis</source>
          <year>2025</year>
          ,
          <year>2025</year>
          . URL: https://pan.webis.de/clef25/pan25-web/style-change-detection.html.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>E.</given-names>
            <surname>Stamatatos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Tschnuggnall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Verhoeven</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Daelemans</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Specht</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Potthast</surname>
          </string-name>
          ,
          <article-title>Clustering by authorship within and across documents</article-title>
          ,
          <source>in: Working Notes Papers of the CLEF</source>
          <year>2016</year>
          <article-title>Evaluation Labs</article-title>
          . CEUR Workshop Proceedings/Balog, Krisztian [edit.]; et al.,
          <year>2016</year>
          , pp.
          <fpage>691</fpage>
          -
          <lpage>715</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>P.</given-names>
            <surname>Rosso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Rangel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Potthast</surname>
          </string-name>
          , E. Stamatatos,
          <string-name>
            <given-names>M.</given-names>
            <surname>Tschuggnall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          , Overview of PAN'16:
          <article-title>new challenges for authorship analysis: cross-genre profiling, clustering, diarization, and obfuscation</article-title>
          ,
          <source>in: Experimental IR Meets Multilinguality, Multimodality, and Interaction: 7th International Conference of the CLEF Association, CLEF</source>
          <year>2016</year>
          , Évora, Portugal, September 5-
          <issue>8</issue>
          ,
          <year>2016</year>
          , Proceedings 7, Springer,
          <year>2016</year>
          , pp.
          <fpage>332</fpage>
          -
          <lpage>350</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>M. P.</given-names>
            <surname>Kuznetsov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Motrenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kuznetsova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. V.</given-names>
            <surname>Strijov</surname>
          </string-name>
          ,
          <article-title>Methods for Intrinsic Plagiarism Detection and Author Diarization</article-title>
          .,
          <source>in: CLEF (Working notes)</source>
          ,
          <year>2016</year>
          , pp.
          <fpage>912</fpage>
          -
          <lpage>919</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>A.</given-names>
            <surname>Sittar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. R.</given-names>
            <surname>Iqbal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. M. A.</given-names>
            <surname>Nawab</surname>
          </string-name>
          ,
          <article-title>Author Diarization Using Cluster-Distance Approach</article-title>
          .,
          <source>in: CLEF (Working Notes)</source>
          ,
          <year>2016</year>
          , pp.
          <fpage>1000</fpage>
          -
          <lpage>1007</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M.</given-names>
            <surname>Tschuggnall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Stamatatos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Verhoeven</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Daelemans</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Specht</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Potthast, Overview of the author identification task at PAN-2017: style breach detection and author clustering</article-title>
          ,
          <source>in: Working Notes Papers of the CLEF 2017 Evaluation Labs/Cappellato</source>
          , Linda [edit.]; et al.,
          <year>2017</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>22</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>D.</given-names>
            <surname>Karas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Spiewak</surname>
          </string-name>
          , P. Sobecki, OPI-JSA at CLEF 2017:
          <article-title>Author Clustering and Style Breach Detection</article-title>
          .,
          <source>in: CLEF (Working Notes)</source>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Khan</surname>
          </string-name>
          , Style Breach Detection:
          <article-title>An Unsupervised Detection Model</article-title>
          .,
          <source>in: CLEF (Working Notes)</source>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>K.</given-names>
            <surname>Safin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kuznetsova</surname>
          </string-name>
          ,
          <article-title>Style Breach Detection with Neural Sentence Embeddings</article-title>
          .,
          <source>in: CLEF (Working Notes)</source>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kestemont</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Tschuggnall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Stamatatos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Daelemans</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Specht</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Potthast, Overview of the author identification task at PAN-2018: cross-domain authorship attribution and style change detection</article-title>
          ,
          <source>in: Working Notes Papers of the CLEF</source>
          <year>2018</year>
          <article-title>Evaluation Labs</article-title>
          . Avignon, France,
          <source>September 10-14</source>
          ,
          <year>2018</year>
          /Cappellato, Linda [edit.]; et al.,
          <year>2018</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>25</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Khan</surname>
          </string-name>
          ,
          <article-title>A model for style change detection at a glance</article-title>
          , volume
          <volume>593</volume>
          ,
          <year>2018</year>
          , p.
          <fpage>113</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>D.</given-names>
            <surname>Zlatkova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kopev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Mitov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Atanasov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hardalov</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Koychev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Nakov</surname>
          </string-name>
          ,
          <string-name>
            <surname>An</surname>
          </string-name>
          ensemble-rich pp.
          <fpage>2073</fpage>
          -
          <lpage>2082</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [39]
          <string-name>
            <given-names>S.</given-names>
            <surname>Alshamasi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Menai</surname>
          </string-name>
          ,
          <article-title>Ensemble-Based Clustering for Writing Style Change Detection in MultiAuthored Textual Documents</article-title>
          , in: G. Faggioli,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanbury</surname>
          </string-name>
          , M. Potthast (Eds.),
          <article-title>CLEF 2022 Labs and Workshops, Notebook Papers, CEUR-WS</article-title>
          .org,
          <year>2022</year>
          . URL: http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3180</volume>
          / paper-187.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [40]
          <string-name>
            <given-names>H. A. F.</given-names>
            <surname>Alvi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Alqahtani</surname>
          </string-name>
          ,
          <article-title>Style Change Detection using Discourse Markers</article-title>
          , in: G. Faggioli,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanbury</surname>
          </string-name>
          , M. Potthast (Eds.),
          <article-title>CLEF 2022 Labs and Workshops, Notebook Papers, CEUR-WS</article-title>
          .org,
          <year>2022</year>
          . URL: http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3180</volume>
          /paper-188.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [41]
          <string-name>
            <given-names>C. A.</given-names>
            <surname>Rodríguez-Losada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Castro-Castro</surname>
          </string-name>
          ,
          <article-title>Three Style Similarity: sentence-embedding, auxiliary words, punctuation</article-title>
          , in: G. Faggioli,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanbury</surname>
          </string-name>
          , M. Potthast (Eds.),
          <article-title>CLEF 2022 Labs and Workshops, Notebook Papers, CEUR-WS</article-title>
          .org,
          <year>2022</year>
          . URL: http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3180</volume>
          /paper-218. pdf.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [42]
          <string-name>
            <given-names>L.</given-names>
            <surname>Graner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Ranly</surname>
          </string-name>
          ,
          <article-title>An Unorthodox Approach for Style Change Detection</article-title>
          .,
          <source>in: CLEF (Working Notes)</source>
          ,
          <year>2022</year>
          , pp.
          <fpage>2455</fpage>
          -
          <lpage>2466</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [43]
          <string-name>
            <surname>T.-M. Lin</surname>
            ,
            <given-names>C.-Y.</given-names>
          </string-name>
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>Y.-W.</given-names>
          </string-name>
          <string-name>
            <surname>Tzeng</surname>
            ,
            <given-names>L.-H.</given-names>
          </string-name>
          <string-name>
            <surname>Lee</surname>
          </string-name>
          ,
          <article-title>Ensemble Pre-trained Transformer Models for Writing Style Change Detection</article-title>
          .,
          <source>in: CLEF (Working Notes)</source>
          ,
          <year>2022</year>
          , pp.
          <fpage>2565</fpage>
          -
          <lpage>2573</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [44]
          <string-name>
            <given-names>Q.</given-names>
            <surname>Lao</surname>
          </string-name>
          , L. Ma,
          <string-name>
            <given-names>W.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Yuan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Tan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Liang</surname>
          </string-name>
          ,
          <source>Style Change Detection Based On Bert And Conv1d</source>
          , in: G. Faggioli,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanbury</surname>
          </string-name>
          , M. Potthast (Eds.),
          <article-title>CLEF 2022 Labs and Workshops, Notebook Papers, CEUR-WS</article-title>
          .org,
          <year>2022</year>
          . URL: http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3180</volume>
          /paper-208. pdf.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [45]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Han</surname>
          </string-name>
          ,
          <string-name>
            <surname>L</surname>
          </string-name>
          . Kong,
          <source>Style Change Detection based on Prompt., in: CLEF (Working Notes)</source>
          ,
          <year>2022</year>
          , pp.
          <fpage>2753</fpage>
          -
          <lpage>2756</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [46]
          <string-name>
            <given-names>Z. Z. X.</given-names>
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Huang</surname>
          </string-name>
          , Style Change Detection:
          <article-title>Method Based On Pre-trained Model And Similarity Recognition</article-title>
          , in: G. Faggioli,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanbury</surname>
          </string-name>
          , M. Potthast (Eds.),
          <article-title>CLEF 2022 Labs and Workshops, Notebook Papers, CEUR-WS</article-title>
          .org,
          <year>2022</year>
          . URL: http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3180</volume>
          / paper-205.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [47]
          <string-name>
            <given-names>L. Z. J.</given-names>
            <surname>Zia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Liua</surname>
          </string-name>
          ,
          <article-title>Style Change Detection Based On Bi-LSTM And Bert</article-title>
          , in: G. Faggioli,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanbury</surname>
          </string-name>
          , M. Potthast (Eds.),
          <article-title>CLEF 2022 Labs and Workshops, Notebook Papers, CEUR-WS</article-title>
          .org,
          <year>2022</year>
          . URL: http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3180</volume>
          /paper-234.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [48]
          <string-name>
            <given-names>G.</given-names>
            <surname>Jacobo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Dehesa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Rojas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Gómez-Adorno</surname>
          </string-name>
          ,
          <article-title>Authorship verification machine learning methods for Style Change Detection in texts</article-title>
          , in: M.
          <string-name>
            <surname>Aliannejadi</surname>
            , G. Faggioli,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Ferro</surname>
          </string-name>
          , M. Vlachos (Eds.), Working Notes of CLEF 2023 -
          <article-title>Conference and Labs of the Evaluation Forum, CEUR-WS</article-title>
          .org,
          <year>2023</year>
          , pp.
          <fpage>2652</fpage>
          -
          <lpage>2658</lpage>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3497</volume>
          /paper-217.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [49]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Ye</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Zhong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Qi</surname>
          </string-name>
          , Y. Han,
          <article-title>Supervised Contrastive Learning for Multi-Author Writing Style Analysis</article-title>
          .,
          <source>in: CLEF (Working Notes)</source>
          ,
          <year>2023</year>
          , pp.
          <fpage>2817</fpage>
          -
          <lpage>2822</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [50]
          <string-name>
            <given-names>W.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <article-title>Contrastive Learning Approaches for Multi-Author Style Analysis</article-title>
          ,
          <source>in: CLEF 2023 Working Notes</source>
          , volume
          <volume>3497</volume>
          ,
          <string-name>
            <surname>CEUR-WS</surname>
          </string-name>
          ,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [51]
          <string-name>
            <given-names>I. E.</given-names>
            <surname>Kucukkaya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>U.</given-names>
            <surname>Sahin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Toraman</surname>
          </string-name>
          , ARC-NLP at PAN 23:
          <article-title>Transition-Focused Natural Language Inference for Writing Style Detection</article-title>
          , in: M.
          <string-name>
            <surname>Aliannejadi</surname>
            , G. Faggioli,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Ferro</surname>
          </string-name>
          , M. Vlachos (Eds.), Working Notes of CLEF 2023 -
          <article-title>Conference and Labs of the Evaluation Forum, CEUR-WS</article-title>
          .org,
          <year>2023</year>
          , pp.
          <fpage>2659</fpage>
          -
          <lpage>2668</lpage>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3497</volume>
          /paper-218.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [52]
          <string-name>
            <given-names>M.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Kong</surname>
          </string-name>
          ,
          <article-title>Encoded Classifier Using Knowledge Distillation for Multi-Author Writing Style Analysis</article-title>
          , in: M.
          <string-name>
            <surname>Aliannejadi</surname>
            , G. Faggioli,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Ferro</surname>
          </string-name>
          , M. Vlachos (Eds.), Working Notes of CLEF 2023 -
          <article-title>Conference and Labs of the Evaluation Forum, CEUR-WS</article-title>
          .org,
          <year>2023</year>
          , pp.
          <fpage>2629</fpage>
          -
          <lpage>2634</lpage>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3497</volume>
          /paper-214.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [53]
          <string-name>
            <given-names>A.</given-names>
            <surname>Hashemi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Shi</surname>
          </string-name>
          ,
          <article-title>Enhancing Writing Style Change Detection using Transformer-based Models and Data Augmentation</article-title>
          .,
          <source>in: CLEF (Working Notes)</source>
          ,
          <year>2023</year>
          , pp.
          <fpage>2613</fpage>
          -
          <lpage>2621</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [54]
          <string-name>
            <given-names>E.</given-names>
            <surname>Zangerle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mayerl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Potthast</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          ,
          <article-title>Overview of the Multi-Author Writing Style Analysis Task at PAN 2024</article-title>
          , in: G. Faggioli,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Galuščáková</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. G. S.</surname>
          </string-name>
          Herrera (Eds.),
          <source>Working Notes Papers of the CLEF</source>
          <year>2024</year>
          <article-title>Evaluation Labs, CEUR-WS</article-title>
          .org,
          <year>2024</year>
          , pp.
          <fpage>2513</fpage>
          -
          <lpage>2522</lpage>
          . URL: http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3740</volume>
          /paper-222.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [55]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zeng</surname>
          </string-name>
          , S. Liu,
          <string-name>
            <given-names>L.</given-names>
            <surname>Sha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Gašević</surname>
          </string-name>
          , G. Chen,
          <article-title>Detecting AI-Generated Sentences in Human-AI Collaborative Hybrid Texts: Challenges, Strategies, and</article-title>
          <string-name>
            <surname>Insights</surname>
          </string-name>
          ,
          <year>2024</year>
          . URL: http://arxiv.org/abs/2403.03506. doi:
          <volume>10</volume>
          .48550/arXiv.2403.03506, arXiv:
          <fpage>2403</fpage>
          .03506 [cs].
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [56]
          <string-name>
            <given-names>D.</given-names>
            <surname>Mollá</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zeng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <article-title>Overview of the 2024 alta shared task: Detect automatic ai-generated sentences for human-ai hybrid articles</article-title>
          ,
          <source>arXiv preprint arXiv:2412.17848</source>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [57]
          <string-name>
            <given-names>I.</given-names>
            <surname>Sehikh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Fohr</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Illina</surname>
          </string-name>
          ,
          <article-title>Topic segmentation in ASR transcripts using bidirectional RNNs for change detection, in: 2017 IEEE automatic speech recognition and understanding workshop</article-title>
          (ASRU), IEEE,
          <year>2017</year>
          , pp.
          <fpage>512</fpage>
          -
          <lpage>518</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [58]
          <string-name>
            <given-names>O.</given-names>
            <surname>Koshorek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Cohen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Mor</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rotman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Berant</surname>
          </string-name>
          ,
          <article-title>Text Segmentation as a Supervised Learning Task, in: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</article-title>
          , Volume
          <volume>2</volume>
          (
          <string-name>
            <surname>Short</surname>
            <given-names>Papers)</given-names>
          </string-name>
          ,
          <source>Association for Computational Linguistics</source>
          , New Orleans, Louisiana,
          <year>2018</year>
          , pp.
          <fpage>469</fpage>
          -
          <lpage>473</lpage>
          . URL: http://aclweb.org/anthology/N18-2075. doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>N18</fpage>
          -2075.
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [59]
          <string-name>
            <given-names>J.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. R.</given-names>
            <surname>Joty</surname>
          </string-name>
          ,
          <article-title>SegBot: A Generic Neural Text Segmentation Model with Pointer Network</article-title>
          .,
          <source>in: IJCAI</source>
          ,
          <year>2018</year>
          , pp.
          <fpage>4166</fpage>
          -
          <lpage>4172</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          [60]
          <string-name>
            <given-names>S.</given-names>
            <surname>Arnold</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Schneider</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Cudré-Mauroux</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. A.</given-names>
            <surname>Gers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Löser</surname>
          </string-name>
          ,
          <article-title>SECTOR: A Neural Model for Coherent Topic Segmentation and Classification, Transactions of the Association for Computational Linguistics 7 (</article-title>
          <year>2019</year>
          )
          <fpage>169</fpage>
          -
          <lpage>184</lpage>
          . URL: https://direct.mit.edu/tacl/article/43514. doi:
          <volume>10</volume>
          .1162/tacl_ a_
          <fpage>00261</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          [61]
          <string-name>
            <given-names>G.</given-names>
            <surname>Glavaš</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Somasundaran</surname>
          </string-name>
          ,
          <article-title>Two-Level Transformer and Auxiliary Coherence Modeling for Improved Text Segmentation</article-title>
          ,
          <source>Proceedings of the AAAI Conference on Artificial Intelligence</source>
          <volume>34</volume>
          (
          <year>2020</year>
          )
          <fpage>7797</fpage>
          -
          <lpage>7804</lpage>
          . URL: https://ojs.aaai.org/index.php/AAAI/article/view/6284. doi:
          <volume>10</volume>
          .1609/ aaai.v34i05.
          <fpage>6284</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          [62]
          <string-name>
            <given-names>K.</given-names>
            <surname>Lo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Jin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Tan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Du</surname>
          </string-name>
          , W. Buntine,
          <article-title>Transformer over Pre-trained Transformer for Neural Text Segmentation with Enhanced Topic Coherence</article-title>
          ,
          <year>2021</year>
          . URL: http://arxiv.org/abs/2110.07160. doi:
          <volume>10</volume>
          .48550/arXiv.2110.07160, arXiv:
          <fpage>2110</fpage>
          .07160 [cs].
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          [63]
          <string-name>
            <given-names>A.</given-names>
            <surname>Patel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Qiu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Horvitz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Apidianaki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>McKeown</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          <article-title>Callison-Burch, StyleDistance: Stronger Content-Independent Style Embeddings with Synthetic Parallel Examples</article-title>
          ,
          <year>2024</year>
          . URL: https://arxiv.org/abs/2410.12757. doi:
          <volume>10</volume>
          .48550/ARXIV.2410.12757, version Number:
          <volume>2</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          [64]
          <string-name>
            <given-names>L.</given-names>
            <surname>Zhuang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Wayne</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Jun</surname>
          </string-name>
          ,
          <article-title>A Robustly Optimized BERT Pre-training Approach with Post-training</article-title>
          , in: S. Li,
          <string-name>
            <given-names>M.</given-names>
            <surname>Sun</surname>
          </string-name>
          , Y. Liu,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Che</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>He</surname>
          </string-name>
          , G. Rao (Eds.),
          <source>Proceedings of the 20th Chinese National Conference on Computational Linguistics</source>
          ,
          <year>2021</year>
          , pp.
          <fpage>1218</fpage>
          -
          <lpage>1227</lpage>
          . URL: https://aclanthology.org/
          <year>2021</year>
          .ccl-
          <volume>1</volume>
          .108/.
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          [65]
          <string-name>
            <given-names>A.</given-names>
            <surname>Conneau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Khandelwal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Goyal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Chaudhary</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Wenzek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Guzmán</surname>
          </string-name>
          , E. Grave,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ott</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zettlemoyer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Stoyanov</surname>
          </string-name>
          ,
          <article-title>Unsupervised Cross-lingual Representation Learning at Scale</article-title>
          , in: D.
          <string-name>
            <surname>Jurafsky</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Chai</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Schluter</surname>
          </string-name>
          , J. Tetreault (Eds.),
          <source>Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>8440</fpage>
          -
          <lpage>8451</lpage>
          . URL: https://aclanthology.org/
          <year>2020</year>
          .acl-main.
          <volume>747</volume>
          /. doi:
          <volume>10</volume>
          .18653/v1/
          <year>2020</year>
          .acl-main.
          <volume>747</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          [66]
          <string-name>
            <given-names>N.</given-names>
            <surname>Reimers</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Gurevych</surname>
          </string-name>
          , Sentence-BERT:
          <article-title>Sentence Embeddings using Siamese BERT-Networks</article-title>
          ,
          <source>in: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics</source>
          ,
          <year>2019</year>
          . URL: https://arxiv.org/abs/
          <year>1908</year>
          .10084.
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          [67]
          <string-name>
            <given-names>B.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Shu</surname>
          </string-name>
          ,
          <source>Can Large Language Models Identify Authorship?</source>
          ,
          <year>2024</year>
          . URL: http: //arxiv.org/abs/2403.08213. doi:
          <volume>10</volume>
          .48550/arXiv.2403.08213, arXiv:
          <fpage>2403</fpage>
          .08213 [cs].
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          [68]
          <string-name>
            <given-names>G.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Römisch</surname>
          </string-name>
          , I. Yamshchikov,
          <string-name>
            <given-names>S.</given-names>
            <surname>Gorovaia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Halchynska</surname>
          </string-name>
          , Better Call Claude:
          <article-title>Can LLMs Detect Changes of Writing Style?, in: Experimental IR Meets Multilinguality, Multimodality, and Interaction</article-title>
          .
          <source>Proceedings of the Sixteenth International Conference of the CLEF Association (CLEF</source>
          <year>2025</year>
          ),
          <year>2025</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>