<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Conference and Labs of the Evaluation Forum, September</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>ARC-NLP at PAN 2023: Transition-Focused Natural Language Inference for Writing Style Detection</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Izzet Emre Kucukkaya</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Umitcan Sahin</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cagri Toraman</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Aselsan Research Center</institution>
          ,
          <addr-line>06378, Ankara</addr-line>
          ,
          <country country="TR">Turkey</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <volume>1</volume>
      <fpage>8</fpage>
      <lpage>21</lpage>
      <abstract>
        <p>The task of multi-author writing style detection aims at finding any positions of writing style change in a given text document. We formulate the task as a natural language inference problem where two consecutive paragraphs are paired. Our approach focuses on transitions between paragraphs while truncating input tokens for the task. As backbone models, we employ diferent Transformer-based encoders with warmup phase during training. We submit the model version that outperforms baselines and other proposed model versions in our experiments. For the easy and medium setups, we submit transition-focused natural language inference based on DeBERTa with warmup training, and the same model without transition for the hard setup.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Multi-author</kwd>
        <kwd>natural language inference</kwd>
        <kwd>transition</kwd>
        <kwd>writing style detection</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>I'm not arguing with you here, I'm simply trying
to contextualize this for you. To the extent that
they are there, it is with your consent. The state
has passed laws making sure that vulnerable
people (not saying he's one) don't get abused
Author 1 (not saying you're abusing him), and in casting a
wide net to save as many vulnerable little birds</p>
      <p>as possible from hitting the floor after being
kicked out of their nest wrongfully, the state has
(as much from a lack of better options as from
any other reason) created a circumstance where
occasionally some not-so-vulnerable little bird
can take advantage of someone else's nest.</p>
      <p>He's at my place half the time and his fiance(e)'s
place the other half of the time. He's been
(homeless) couch surfing for several years and
Author 2 only recently got engaged to his other partner.</p>
      <p>We don't have any current issues that would
lead me to want this arrangement to stop, but I
do want to protect my own legal rights. I don't
think he, in particular, would do that, but I do</p>
      <sec id="sec-1-1">
        <title>Author 2 want to retain my own legal rights wherever</title>
        <p>possible/appropriate.
style change in two consecutive paragraphs.</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        In PAN 2018 [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], the task is basically binary classification. Zlatkova et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] develop an ensemble
approach of the models including SVM, Random Forest, LightGBM etc. Hosseinia and Mukherjee
[
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] use parallel attention networks to focus on the hierarchical structure of the language.
      </p>
      <p>
        In PAN 2019 [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], the task is to detect the number of authors in a given document. Nath [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]
use two clustering algorithms based on the threshold and window merge. In addition, Zuo et al.
[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] use K-means and hierarchical clustering algorithms.
      </p>
      <p>
        In PAN 2020 [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], the task is to detect style changes between two consecutive paragraphs.
Castro-Castro et al. [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] use a paragraph representation based on character, lexical, and syntactic
features in a clustering algorithm. Iyer and Vosoughi [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] use a pre-trained BERT model, and
train a random forest classifier of the embedding representation generated from the BERT
model.
      </p>
      <p>
        In PAN 2021 [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], the task is to determine the number of authors, and locate specific author
changes. Strøm [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] apply a stacking ensemble on text embeddings. Deibel and Löflad
[
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] use
an LSTM-based algorithm.
      </p>
      <p>
        For the task of the last year, in PAN 2022 [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], the winning solution Lin et al. [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] employ an
ensemble of three Transformer-based language models using majority voting to obtain the final
prediction. Furthermore, Jiang et al. [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] use base and large versions of the ELECTRA model,
and report highly challenging scores.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Task</title>
      <p>Participants are asked to solve the the intrinsic style change detection task. For a given text, we
ifnd all positions of writing style change on the paragraph-level. For example, the document
in Figure 1 is written by two authors, and there is a style change between first and second
paragraph. The label of this transition is specified as 1. Furthermore, there is no style change
between second and third paragraph where the label is 0. This example is chosen from the Easy
split of the dataset. There are three dificulty levels:
• Easy: The paragraphs of a document consist of various number of topics.
• Medium: The topical variety is small. The need of the style detection instead of topic
detection increases.</p>
      <p>• Hard: All paragraphs in a document are on the same topic.</p>
      <p>All the documents in this task are in English, and contain diferent numbers of style changes
and authors. Furthermore, writing style change only occurs in paragraph level. There is no
need to investigate sentences separately.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Dataset</title>
      <p>In this task, there are three diferent dificulty levels with their own train-validation-test sets.
The numbers of documents on the train and validation splits are the same in all of the three
dificulty levels, which are provided in Table 1.</p>
      <p>The total number of the 0 and 1 labels in documents are reported in Table 2. These numbers
indicate the number of samples in the natural language inference task derived from the actual
task as mentioned in Section 5.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Proposed Method: Transition-Focused Natural Language</title>
    </sec>
    <sec id="sec-6">
      <title>Inference</title>
      <sec id="sec-6-1">
        <title>Paragraph 1</title>
      </sec>
      <sec id="sec-6-2">
        <title>Paragraph 2 Tok 1 Tok 2 Tok 1</title>
        <p>Tok 2
...
...</p>
        <p>Tok 328
Tok 346</p>
      </sec>
      <sec id="sec-6-3">
        <title>a) Longest First Truncation (default)</title>
      </sec>
      <sec id="sec-6-4">
        <title>First 256</title>
        <p>Tok 2
CLS</p>
        <p>Tok 1
...</p>
        <p>Tok 256
SEP</p>
        <p>Tok 1</p>
        <p>Tok 2
CLS</p>
        <p>Last 256
Tok 73 Tok 74 ...</p>
      </sec>
      <sec id="sec-6-5">
        <title>b) Transition-Focused Truncation Tok 328 SEP Tok 1</title>
        <p>Tok 2</p>
      </sec>
      <sec id="sec-6-6">
        <title>First 256 ... Tok 256</title>
      </sec>
      <sec id="sec-6-7">
        <title>First 256 ... Tok 256 SEP SEP</title>
        <sec id="sec-6-7-1">
          <title>5.1. Main Approach</title>
          <p>In this work, we formulate the task as natural language inference (NLI). To do so, we employ a
Transformer-based language model that is based on the encoder structure. We prepare input by
concatenating consecutive paragraphs and using the SEP token between them. We then place a
binary classification layer of the CLS embedding, based on whether if the style change occurs
(1) or not (0) between these to paragraphs.</p>
          <p>Since the models have a limited length of input sequence (i.e. 512 tokens for BERT, RoBERTa,
and DeBERTa; and 1024 tokens for BigBird), we need to truncate the input paragraphs before
training NLI. For truncation, we focus on transitions between paragraphs (we refer it to as
Transition-Focused Truncation), since transitions provide logical connections between
paragraphs in documents. We also provide results for default truncation that focuses on the
beginning of text (we refer it to as Longest First Truncation). The proposed NLI model and
truncation approaches are illustrated in Figure 2. When input sequence length is 512, the last
256 tokens of the first paragraph and the first 256 tokens of the second paragraph are combined
in transition-focused truncation as in Figure 2a, while first 256 tokens from both paragraphs
are truncated as in Figure 2b.</p>
        </sec>
        <sec id="sec-6-7-2">
          <title>5.2. Backbone Models</title>
          <p>As the text encoder, we employ several Transformer-based language models in our preliminary
experiments. Here, we report the highest performing four models.</p>
          <p>
            BERT [
            <xref ref-type="bibr" rid="ref4">4</xref>
            ] Bidirectional encoder representation for transformers, BERT, is an encoder
architecture that utilizes an attention mechanism. It was pretrained on masked language modelling
and next sentence prediction tasks.
          </p>
          <p>
            RoBERTa [
            <xref ref-type="bibr" rid="ref21">21</xref>
            ] A robust optimized BERT pre-training approach, RoBERTa, has the same
architecture as BERT. However, the pretraining task of the next sentence prediction is removed.
Furthermore, it has dynamically changing masking pattern applied to the training data with
larger training batches.
          </p>
          <p>
            DeBERTa v3 [
            <xref ref-type="bibr" rid="ref22">22</xref>
            ] Decoding-enhanced BERT with Disentangled Attention, DeBERTa, has two
additional techniques compared to the BERT, distangled attention and enhanced mask decoder.
Due to the new adjustments, they state that it outperforms BERT and the other state-of-art
models in many tasks.
          </p>
          <p>
            BigBird-RoBERTa [
            <xref ref-type="bibr" rid="ref23">23</xref>
            ] BigBird has a sparse attention mechanism that reduces this quadratic
dependency to linear which enables it to handle sequences of length up to 8 times of what was
previously possible using similar hardware. Since paragraphs can be too long in this task, we
employ this model to cover more number of tokens in input.
          </p>
        </sec>
        <sec id="sec-6-7-3">
          <title>5.3. Warmup</title>
          <p>In preliminary experiments, we realize that our models converge diferent minima in the training
of this task. In order to overcome this issue, we use the warmup with the warmup ration is 0.1.
In warmup steps, the model trains with a very low learning rate and tries to find the global
minima of the loss function, and hinders the inaccurate convergence.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>6. Experiments</title>
      <sec id="sec-7-1">
        <title>6.1. Experimental Setup</title>
        <p>The input length in BigBird is 1024 tokens (512 for each paragraph), while 512 tokens for other
models (256 for each paragraph). We set the following hyperparameters. Learning rate is 5e-5,
number of epochs is 5, and batch size is 4. We used 3 NVIDIA GeForce RTX 2080 GPUs in
training. The pre-trained models and the trainer framework are obtained from the HuggingFace
library [24].</p>
        <p>For evaluating model performances, we calculate Macro F1 Scores using the oficial evaluation
script1.</p>
      </sec>
      <sec id="sec-7-2">
        <title>6.2. Baselines</title>
        <p>We implement two baseline methods to compare with our approach.</p>
        <p>Random</p>
        <p>The output label array is generated randomly by sampling from 0 and 1, uniformly.
1https://github.com/pan-webis-de/pan-code/tree/master/clef23/multi-author-analysis
TF-IDF We use TF-IDF term weighting [25] to extract features using the English stopwords of
NLTK library [26]. Additional features such as number of question marks, periods, apostrophes,
parenthesis, and words are concatenated to the feature vector. Finally, we concatenate the
TF-IDF feature vectors of consecutive two paragraphs, and train Support Vector Classifier (SVC)
[27] for classification.</p>
      </sec>
      <sec id="sec-7-3">
        <title>6.3. Experimental Results</title>
        <p>We report the model performances on the validation splits (see Section 4) in Table 3. We divide
the table into six parts. At the top, we provide the baseline scores. The second part consists of
our proposed approach with four backbone models, described in Section 5. We use the same
models with warmup during training in the third part. So far, we do not employ
transitionfocused truncation. Next, we provide the results of transition-focused truncation and lastly with
warmup as well. In the last part, we provide the performance scores of the submitted models on
the test set (leaderboard). We submitted the highest performing models on the validation set
(given as bold). We have the following observations.</p>
        <p>• Baseline models perform poor as expected. TF-IDF is based on bag-of-words model, which
can show that writing style can not be detected by syntactical writing features.
• DeBERTa is the highest performing backbone model for NLI in all setups.
• Using warmup during training can increase the performance in some cases, specifically
for the medium and hard setups.
• Transition-focused truncation method improves the results in some cases. More
importantly, we obtain the highest scores on the validation set for the easy and medium setups
when we employ transition-focused DeBERTa with warmup. For the hard setup, the same
model with default truncation performs highest. We submitted the highest performing
methods to the shared task.</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>7. Conclusion</title>
      <p>In this paper, we propose transition-focused natural language inference (NLI) for multi-author
writing style detection. We truncate the input paragraphs by focusing on transitions between
paragraphs, since transitions provide logical connections between paragraphs in documents.
Transition-focused NLI performs highest in easy and medium setups. Moreover, we obtain
the highest performances when backbone model is DeBERTa in all setups. We submitted the
highest performing models on the validation set. Our models are placed in the second place for
all subtasks (easy, medium, and hard) in the leaderboard.</p>
      <p>As a future work, there can be some improvements to overcome the class imbalance problem.
Furthermore, other large language models can be employed to encode the embedding vectors.
ber 6-12, 2020, virtual, 2020. URL: https://proceedings.neurips.cc/paper/2020/hash/
c8512d142a2d849725f31a9a7a361ab9-Abstract.html.
[24] T. Wolf, L. Debut, V. Sanh, J. Chaumond, C. Delangue, A. Moi, P. Cistac, T. Rault, R. Louf,
M. Funtowicz, J. Davison, S. Shleifer, P. von Platen, C. Ma, Y. Jernite, J. Plu, C. Xu, T. Le Scao,
S. Gugger, M. Drame, Q. Lhoest, A. Rush, Transformers: State-of-the-Art Natural
Language Processing, in: Proceedings of the 2020 Conference on Empirical Methods in
Natural Language Processing: System Demonstrations, Association for Computational
Linguistics, Online, 2020, pp. 38–45. URL: https://aclanthology.org/2020.emnlp-demos.6.
doi:10.18653/v1/2020.emnlp-demos.6.
[25] G. Salton, M. McGill, Introduction to Modern Information Retrieval, McGraw-Hill Book</p>
      <p>Company, 1984.
[26] S. Bird, E. Loper, NLTK: The Natural Language Toolkit, in: Proceedings of the ACL
Interactive Poster and Demonstration Sessions, Association for Computational Linguistics,
Barcelona, Spain, 2004, pp. 214–217. URL: https://aclanthology.org/P04-3031.
[27] C. Cortes, V. Vapnik, Support-Vector Networks, Mach. Learn. 20 (1995) 273–297. URL:
https://doi.org/10.1007/BF00994018. doi:10.1007/BF00994018.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J.</given-names>
            <surname>Bevendorf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Borrego-Obrador</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Chinea-Ríos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Franco-Salvador</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Fröbe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Heini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Kredens</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mayerl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Pęzik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Potthast</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Rangel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rosso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Stamatatos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Wiegmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Wolska</surname>
          </string-name>
          , , E. Zangerle, Overview of PAN 2023:
          <article-title>Authorship Verification, Multi-Author Writing Style Analysis, Profiling Cryptocurrency Influencers, and Trigger Detection, in: Experimental IR Meets Multilinguality, Multimodality, and Interaction</article-title>
          .
          <source>Proceedings of the Fourteenth International Conference of the CLEF Association (CLEF</source>
          <year>2023</year>
          ), Lecture Notes in Computer Science, Springer,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>E.</given-names>
            <surname>Zangerle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mayerl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Potthast</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          ,
          <article-title>Overview of the Multi-Author Writing Style Analysis Task at PAN 2023</article-title>
          , in: M.
          <string-name>
            <surname>Aliannejadi</surname>
            , G. Faggioli,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Ferro</surname>
          </string-name>
          , M. Vlachos (Eds.), Working Notes of CLEF 2023 -
          <article-title>Conference and Labs of the Evaluation Forum, CEUR-</article-title>
          <string-name>
            <surname>WS</surname>
          </string-name>
          ,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>C.</given-names>
            <surname>Toraman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Ozcelik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Sahinuç</surname>
          </string-name>
          , U. Sahin, ARC-NLP at checkthat!
          <article-title>-2022: Contradiction for harmful tweet detection</article-title>
          ,
          <source>in: Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum</source>
          , Bologna, Italy, September 5th - to - 8th,
          <year>2022</year>
          , volume
          <volume>3180</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2022</year>
          , pp.
          <fpage>722</fpage>
          -
          <lpage>739</lpage>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3180</volume>
          /paper-59.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>J.</given-names>
            <surname>Devlin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Toutanova</surname>
          </string-name>
          ,
          <article-title>BERT: pre-training of deep bidirectional transformers for language understanding, in: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, Minneapolis</article-title>
          , MN, USA, June 2-7,
          <year>2019</year>
          , Volume
          <volume>1</volume>
          (Long and Short Papers),
          <source>Association for Computational Linguistics</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>4171</fpage>
          -
          <lpage>4186</lpage>
          . URL: https://doi.org/10.18653/v1/n19-
          <fpage>1423</fpage>
          . doi:
          <volume>10</volume>
          .18653/v1/n19-
          <fpage>1423</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.</given-names>
            <surname>Fröbe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Wiegmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Kolyada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Grahm</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Elstner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Loebe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hagen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Potthast</surname>
          </string-name>
          ,
          <article-title>Continuous Integration for Reproducible Shared Tasks with TIRA.io</article-title>
          ,
          <source>in: Advances in Information Retrieval. 45th European Conference on IR Research (ECIR</source>
          <year>2023</year>
          ), Lecture Notes in Computer Science, Springer, Berlin Heidelberg New York,
          <year>2023</year>
          , pp.
          <fpage>236</fpage>
          -
          <lpage>241</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kestemont</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Tschuggnall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Stamatatos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Daelemans</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Specht</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Potthast, Overview of the Author Identification Task at PAN-2018: Cross-domain Authorship Attribution and Style Change Detection</article-title>
          , in: L.
          <string-name>
            <surname>Cappellato</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Ferro</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Nie</surname>
          </string-name>
          , L. Soulier (Eds.), Working Notes of CLEF 2018 -
          <article-title>Conference and Labs of the Evaluation Forum</article-title>
          , Avignon, France,
          <source>September 10-14</source>
          ,
          <year>2018</year>
          , volume
          <volume>2125</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2018</year>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2125</volume>
          /invited_paper_2.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>D.</given-names>
            <surname>Zlatkova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kopev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Mitov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Atanasov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hardalov</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Koychev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Nakov</surname>
          </string-name>
          ,
          <article-title>An Ensemble-Rich Multi-Aspect Approach Towards Robust Style Change Detection: Notebook for PAN at CLEF 2018</article-title>
          , in: Working Notes of CLEF 2018 -
          <article-title>Conference and Labs of the Evaluation Forum</article-title>
          , Avignon, France,
          <source>September 10-14</source>
          ,
          <year>2018</year>
          , volume
          <volume>2125</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2018</year>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2125</volume>
          /paper_142.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Hosseinia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mukherjee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A Parallel</given-names>
            <surname>Hierarchical</surname>
          </string-name>
          <article-title>Attention Network for Style Change Detection: Notebook for PAN at CLEF 2018</article-title>
          , in: Working Notes of CLEF 2018 -
          <article-title>Conference and Labs of the Evaluation Forum</article-title>
          , Avignon, France,
          <source>September 10-14</source>
          ,
          <year>2018</year>
          , volume
          <volume>2125</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2018</year>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2125</volume>
          / paper_91.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>E.</given-names>
            <surname>Zangerle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Tschuggnall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Specht</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Potthast</surname>
          </string-name>
          ,
          <source>Overview of the Style Change Detection Task at PAN</source>
          <year>2019</year>
          , in: L.
          <string-name>
            <surname>Cappellato</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Ferro</surname>
            ,
            <given-names>D. E.</given-names>
          </string-name>
          <string-name>
            <surname>Losada</surname>
          </string-name>
          , H. Müller (Eds.),
          <source>Working Notes of CLEF 2019 - Conference and Labs of the Evaluation Forum, Lugano, Switzerland, September</source>
          <volume>9</volume>
          -
          <issue>12</issue>
          ,
          <year>2019</year>
          , volume
          <volume>2380</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2019</year>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2380</volume>
          /paper_243.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>S.</given-names>
            <surname>Nath</surname>
          </string-name>
          ,
          <article-title>Style Change Detection by Threshold Based and Window Merge Clustering Methods</article-title>
          , in: Working Notes of CLEF 2019 -
          <article-title>Conference and Labs of the Evaluation Forum</article-title>
          , Lugano, Switzerland, September 9-
          <issue>12</issue>
          ,
          <year>2019</year>
          , volume
          <volume>2380</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2019</year>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2380</volume>
          /paper_163.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>C.</given-names>
            <surname>Zuo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Banerjee</surname>
          </string-name>
          ,
          <article-title>Style Change Detection with Feed-forward Neural Networks</article-title>
          , in: Working Notes of CLEF 2019 -
          <article-title>Conference and Labs of the Evaluation Forum</article-title>
          , Lugano, Switzerland, September 9-
          <issue>12</issue>
          ,
          <year>2019</year>
          , volume
          <volume>2380</volume>
          <source>of CEUR Workshop Proceedings</source>
          , CEURWS.org,
          <year>2019</year>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2380</volume>
          /paper_229.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>E.</given-names>
            <surname>Zangerle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mayerl</surname>
          </string-name>
          , G. Specht,
          <string-name>
            <given-names>M.</given-names>
            <surname>Potthast</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          ,
          <source>Overview of the Style Change Detection Task at PAN</source>
          <year>2020</year>
          , in: L.
          <string-name>
            <surname>Cappellato</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Eickhof</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Ferro</surname>
            ,
            <given-names>A</given-names>
          </string-name>
          . Névéol (Eds.), Working Notes of CLEF 2020 -
          <article-title>Conference and Labs of the Evaluation Forum</article-title>
          , Thessaloniki, Greece,
          <source>September 22-25</source>
          ,
          <year>2020</year>
          , volume
          <volume>2696</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2020</year>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2696</volume>
          /paper_256.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>D.</given-names>
            <surname>Castro-Castro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. A.</given-names>
            <surname>Rodríguez-Lozada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Muñoz</surname>
          </string-name>
          ,
          <article-title>Mixed Style Feature Representation and B-maximal Clustering for Style Change Detection</article-title>
          , in: Working Notes of CLEF 2020 -
          <article-title>Conference and Labs of the Evaluation Forum</article-title>
          , Thessaloniki, Greece,
          <source>September 22- 25</source>
          ,
          <year>2020</year>
          , volume
          <volume>2696</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2020</year>
          . URL: https: //ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2696</volume>
          /paper_227.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A.</given-names>
            <surname>Iyer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Vosoughi</surname>
          </string-name>
          ,
          <article-title>Style Change Detection Using BERT</article-title>
          , in: Working Notes of CLEF 2020 -
          <article-title>Conference and Labs of the Evaluation Forum</article-title>
          , Thessaloniki, Greece,
          <source>September 22-25</source>
          ,
          <year>2020</year>
          , volume
          <volume>2696</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2020</year>
          . URL: https: //ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2696</volume>
          /paper_232.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>E.</given-names>
            <surname>Zangerle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mayerl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Potthast</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          ,
          <source>Overview of the Style Change Detection Task at PAN</source>
          <year>2021</year>
          , in: G. Faggioli,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Joly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Maistro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Piroi</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the Working Notes of CLEF 2021 - Conference and Labs of the Evaluation Forum</source>
          , Bucharest, Romania, September 21st - to - 24th,
          <year>2021</year>
          , volume
          <volume>2936</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2021</year>
          , pp.
          <fpage>1760</fpage>
          -
          <lpage>1771</lpage>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2936</volume>
          /paper-148.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>E.</given-names>
            <surname>Strøm</surname>
          </string-name>
          <article-title>, Multi-label Style Change Detection by Solving a Binary Classification Problem</article-title>
          ,
          <source>in: Proceedings of the Working Notes of CLEF 2021 - Conference and Labs of the Evaluation Forum</source>
          , Bucharest, Romania, September 21st - to - 24th,
          <year>2021</year>
          , volume
          <volume>2936</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2021</year>
          , pp.
          <fpage>2146</fpage>
          -
          <lpage>2157</lpage>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2936</volume>
          /paper-191.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>R.</given-names>
            <surname>Deibel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Löflad</surname>
          </string-name>
          ,
          <article-title>Style Change Detection on Real-World Data using an LSTM-powered Attribution Algorithm</article-title>
          ,
          <source>in: Proceedings of the Working Notes of CLEF 2021 - Conference and Labs of the Evaluation Forum</source>
          , Bucharest, Romania, September 21st - to - 24th,
          <year>2021</year>
          , volume
          <volume>2936</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2021</year>
          , pp.
          <fpage>1899</fpage>
          -
          <lpage>1909</lpage>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2936</volume>
          /paper-163.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>E.</given-names>
            <surname>Zangerle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mayerl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Potthast</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          ,
          <source>Overview of the Style Change Detection Task at PAN</source>
          <year>2022</year>
          , in: G. Faggioli,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanbury</surname>
          </string-name>
          , M. Potthast (Eds.),
          <source>Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum</source>
          , Bologna, Italy, September 5th - to - 8th,
          <year>2022</year>
          , volume
          <volume>3180</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2022</year>
          , pp.
          <fpage>2344</fpage>
          -
          <lpage>2356</lpage>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3180</volume>
          /paper-186.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>T.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Tzeng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <article-title>Ensemble Pre-trained Transformer Models for Writing Style Change Detection</article-title>
          ,
          <source>in: Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum</source>
          , Bologna, Italy, September 5th - to - 8th,
          <year>2022</year>
          , volume
          <volume>3180</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2022</year>
          , pp.
          <fpage>2565</fpage>
          -
          <lpage>2573</lpage>
          . URL: https: //ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3180</volume>
          /paper-210.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>X.</given-names>
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Qi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , M. Huang, Style Change Detection:
          <article-title>Method Based On Pre-trained Model And Similarity Recognition</article-title>
          ,
          <source>in: Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum</source>
          , Bologna, Italy, September 5th - to - 8th,
          <year>2022</year>
          , volume
          <volume>3180</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2022</year>
          , pp.
          <fpage>2526</fpage>
          -
          <lpage>2531</lpage>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3180</volume>
          /paper-205.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ott</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Goyal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Du</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Joshi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Levy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zettlemoyer</surname>
          </string-name>
          , V. Stoyanov,
          <article-title>RoBERTa: A Robustly Optimized BERT Pretraining Approach</article-title>
          , CoRR abs/
          <year>1907</year>
          .11692 (
          <year>2019</year>
          ). URL: http://arxiv.org/abs/
          <year>1907</year>
          .11692. arXiv:
          <year>1907</year>
          .11692.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>P.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Gao</surname>
          </string-name>
          , W. Chen,
          <article-title>DeBERTaV3: Improving DeBERTa using ELECTRA-Style PreTraining with Gradient-Disentangled Embedding Sharing</article-title>
          ,
          <source>CoRR abs/2111</source>
          .09543 (
          <year>2021</year>
          ). URL: https://arxiv.org/abs/2111.09543. arXiv:
          <volume>2111</volume>
          .
          <fpage>09543</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>M.</given-names>
            <surname>Zaheer</surname>
          </string-name>
          , G. Guruganesh,
          <string-name>
            <given-names>K. A.</given-names>
            <surname>Dubey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ainslie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Alberti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ontañón</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Pham</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ravula</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ahmed</surname>
          </string-name>
          , Big Bird:
          <article-title>Transformers for Longer Sequences</article-title>
          ,
          <source>in: Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems</source>
          <year>2020</year>
          ,
          <article-title>NeurIPS 2020</article-title>
          , Decem-
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>