<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>X i v .</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>wrote the Abstract? - Explainable Multi-Authorship Attribution with a Data Augmentation Strategy</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Kanishka Silva</string-name>
          <email>kanishka.silva.92@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ingo Frommholz</string-name>
          <email>ifrommholz@acm.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>ChatGPT</institution>
          ,
          <addr-line>Multi-Authorship Attribution, Multimodal Transformers, Data Augmentation, Explainability</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>In: M. Litvak, I.Rabaev, R. Campos, A. Jorge, A. Jatowt (eds.): Proceedings of the IACT'23 Workshop</institution>
          ,
          <addr-line>Taipei</addr-line>
          ,
          <country country="TW">Taiwan</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Wolverhampton</institution>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2022</year>
      </pub-date>
      <volume>2</volume>
      <issue>3</issue>
      <abstract>
        <p>Active discussions have been conducted regarding implications and issues associated with Large Language Models (LLMs) such as ChatGPT across various domains. One particular concern is the efect of machinegenerated texts, which include a new category in authorship attribution models: machine-generated text resembling human text in topic and writing style. Diferentiating human-vs-AI-written text in scientific articles is crucial for several reasons. In this work, we approach this issue from a multi-authorship perspective by investigating automatically generated abstracts. We propose a multimodal transformer which combines handcrafted stylometric features with deep learning-based text features to perform multi-author attribution. We demonstrate the efectiveness of this approach on a curated dataset of 1000 samples and discuss its explainability via the Local Interpretable Model-agnostic Explanations (LIME) Recent advancements in text-generative Large Language Models (LLMs) saw many research directions and applications with machine-generated texts emerging, comprising summarisation, information retrieval and data augmentation. Computer-aided day-to-day tasks have evolved to use chat-based applications, providing the ability to summarise a large amount of content and efective information extraction abilities [ 1, 2]. These developments sparked the discussion of whether computer-assisted writing is accepted for scientific publications. For instance, a user may prompt ChatGPT [3] to generate an abstract for a full text of their scientific paper and then modify it accordingly. Alternatively, an editorial assistant could use ChatGPT to guidelines1 might accept or reject generated texts to a certain degree. While some guidelines allow using LLMs as a writing aid, using wholly generated and unedited text fragments is often htp:/ceur-ws.org CEUR Workshop Proceedings (CEUR-WS.org) ISN1613-073</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>CEUR
Workshop
Proceedings
forbidden. This makes the task of proper identification of generated texts, or text fragments
a crucial one. We can regard this as an multi-authorship problem, where the two classes of
authors involved: human co-authored text and LLM/chatbot aided human co-authored text.
We assume the LLM/chatbot-aided text carries distinguishable stylometric features compared
to the human-co-authored text portion in a considered document. Further, for simplicity, the
LLM/chatbot-aided text is a wholly generated and unedited text fragment in a known position
of the document.</p>
      <p>
        Our main contributions are: using GPT 3.5 as a data augmentation mechanism to generate
abstracts; addressing the multi-authorship problem in scientific papers using a multimodal
transformer, combining stylometric and decoder-based text features; providing a corresponding
dataset and scripts; a novel explainability feature to the multimodal transformer using the
Local Interpretable Model-agnostic Explanations (LIME) framework [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]; a case study to utilise
ChatGPT [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] as a data augmentation tool for scientific articles. The main research question
addressed in this work is how efectively an explainable multimodal transformer-based model
can identify ChatGPT-generated text.
      </p>
      <p>The remainder of the paper is organised as follows. Section 2 demonstrates a brief literature
survey. Section 3 outlines the proposed multimodal transformer model for ChatGPT text
identification. Section 4 describes the dataset. Section 5 elaborates on the experiment design
outline, focusing on the research question. Section 6 summarises the results and discussion
points. Section 7 outlines the limitations and future directions related to the research presented
in this paper. Finally, Section 8 discuss concluding remarks and future directions.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Works</title>
      <p>
        Authorship attribution is identifying the author of an unknown text by comparing a corpus of
known authorship of candidate authors [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The approaches for the authorship attribution are
traditional stylometric [
        <xref ref-type="bibr" rid="ref6 ref7 ref8">6, 7, 8</xref>
        ] and deep learning-based [
        <xref ref-type="bibr" rid="ref10 ref11 ref9">9, 10, 11</xref>
        ]. The stylometric approaches
usually involve handcrafted feature extraction [
        <xref ref-type="bibr" rid="ref12 ref6 ref8">6, 12, 8</xref>
        ]. Ensemble models combining diferent
stylometric features with deep learning have been outperforming other state-of-the-art models
[
        <xref ref-type="bibr" rid="ref13 ref14">13, 14</xref>
        ].
      </p>
      <p>Multi-authorship attribution addresses identifying authors or detecting cases where diferent
parts of a document were written by diferent authors [ 15, 16]. Approaches include co-authorship
graph-based authorship attribution [16]; others simply identify whether a given document is
multi-authored [17, 18, 19].</p>
      <p>
        Several attempts in recent research are towards identifying human-vs-AI-created text [20, 21,
22, 23]. Authorship obfuscation with machine-generated text to impose/hide the original writing
style is discussed in the works of Jones et al. [24], Dehouche [25]. A pilot study on ChatGPT
text authorship has been reported by Landa-Blanco et al. [
        <xref ref-type="bibr" rid="ref15">26</xref>
        ]. With the recent advancements
of the LLMs and prompt-based text generation models, much research has focused on utilising
machine-generated text for human-vs-AI text detection [
        <xref ref-type="bibr" rid="ref16 ref17">27, 28</xref>
        ].
      </p>
      <p>
        Multimodal transformers [
        <xref ref-type="bibr" rid="ref18">29</xref>
        ] combine a pre-trained transformer model output with additional
task-specific categorical or numerical features. Gu and Budhkar [
        <xref ref-type="bibr" rid="ref18">29</xref>
        ] provide diferent feature
combination methods involving feature concatenation, attention methods, gating mechanisms
and weighted feature summation.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Explainable Multi-Authorship Attribution</title>
      <p>
        Limited dataset availability is ubiquitous in many human-vs-AI text attribution applications.
One mitigation approach would automatically synthesise machine-generated text with a natural
language-based generative model. Until the recent applications of such generative models can
generate human-like data, most recently, using chatbot APIs like ChatGPT [
        <xref ref-type="bibr" rid="ref19">30</xref>
        ], much research
has been initiated in formulating data augmentation strategies.
      </p>
      <p>
        Similarly, as in Figure 1, we propose a scalable task-specific encoder-decoder-based
experimental pipeline with multimodal and explainable abilities. The Decoder LLM takes the original
data and uses an LLM-based generator model such as ChatGPT (used in this work) [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], BARD
[
        <xref ref-type="bibr" rid="ref20">31</xref>
        ] or PaLM [
        <xref ref-type="bibr" rid="ref21">32</xref>
        ] to return synthesised data, i.e. machine-aided text segments. Then according
to the desired task, the synthesised and original data are passed through a Data Merger to
generate the train-test-validation splits for training the model and test data form prediction
through an Explainer model such as LIME Explainer [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. The Encoder Language Model (LLM) can
incorporate any appropriate LLM based on the intended task, such as a classifier for document
classification purposes. Then the attention output of the model is passed to the Explainer in
conjunction with the test data to obtain highlighted segments of the document which contributed to
the Encoder LLM result. We propose this model for a case study of a multi-authorship problem,
where a scientific document is multi-authored by a human or machine author if the original
writer prompted ChatGPT to generate the abstract.
      </p>
      <p>
        For the simplicity of the experiments, we considered article  , written by two
(machinegenerated+human) or one author’s category (human). Given a set of documents  =
{ 1,  2,  3, ..} where   consist of two sections  1 and  2 if multi-authored, which represent
sections with potential style changes, authored by either a human or an AI. Using multimodal
transformers [
        <xref ref-type="bibr" rid="ref18">29</xref>
        ], we concatenated stylometric features (n) and deep learning-based text features
(x) to obtain combined multimodal features ( = || ) to identify whether a given document  
is multi-authored.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. Datasets</title>
      <p>
        We used the arXiv dataset described in Cohan et al. [
        <xref ref-type="bibr" rid="ref22">33</xref>
        ] as the basis for our experiments. It
consists of long, structured documents collected from the arXiv and PubMed Open Access
repositories. This dataset contains articles and abstracts separated by the ‘\n’ character and
comprises 215,913 arXiv articles and 133,215 PubMed articles. Using zero-shot prompting, we
utilised the ChatGPT 3.5 API [
        <xref ref-type="bibr" rid="ref19">30</xref>
        ] to generate abstracts for 500 out of 1,000 randomly selected
arXiv articles set from the subset mentioned above. The GPT-3.5 Turbo Language Model was
utilised to generate abstracts from the selected scientific papers, truncating approximately 2500
words and with 0.7 temperature settings for controlled randomness.
      </p>
      <p>The resulting ChatGPT-Aided Papers dataset consists of combined 500 synthesised (ChatGPT
abstract + original text) and 500 original articles (original abstract + original text) under two
categories denoting multi-authored and single-authored, respectively. Each data item in our
dataset comprises nine fields: article, labels and seven stylometric features per each article:
Average Word Length, Average Sentence Length by Characters, Average Sentence Length By
Word, Average Syllable per Word, Special Character Count, Punctuation Count, Functional
Words Count. A further comparison of datasets is available in Table 1, comparing the document
length, sentence length, and vocabulary size of the original papers and the ChatGPT-aided
papers.</p>
      <p>As illustrated in Figure 1, the Data Merger performs the data augmentation by combining the
synthesised and original data to suit the desired task. In our considered case study on the
multiauthor identification in machine-aided scientific paper writeups, we concatenated 500 ChatGPT
abstracts with full paper text to create computer-aided scientific writeups. This synthesised
data reflects a scenario where an author requests ChatGPT to summarise the entire paper for
writing an abstract. To obtain the human-written documents, we combined the remaining 500
human-written abstracts with respective full paper text, resulting in a dataset comprising 1000
articles in a uniform distribution across classes (Original Papers and ChatGPT-Aided Papers),
including the computer-aided and original text.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Experiment Design</title>
      <p>Several experiments were designed to validate the model performance as follows:</p>
      <sec id="sec-5-1">
        <title>1. An ablation study with authorship features was performed;</title>
        <p>2. A multimodal transformer with text and authorship features was applied;
3. LIME explainer (see below) was utilised to interpret results;
4. Results were analysed and compared to state-of-the-art (SOTA) models.</p>
        <p>
          In our study, we calculated several handcrafted stylometric features: Average word
length (AWL), average sentence length by word (ASW), and functional word counts (CFW) per
each document2. These features were then concatenated with the text features and passed to a
multimodal transformer[
          <xref ref-type="bibr" rid="ref18">29</xref>
          ] to perform the authorship attribution task. To evaluate the feature
significance, an ablation study was conducted. The main experiment flow is about efectively
using a multimodal transformer [
          <xref ref-type="bibr" rid="ref18">29</xref>
          ] for multi-authorship problems and explaining the model
output. We conducted the experiments to compare a BERT model specific for sequence
classification [
          <xref ref-type="bibr" rid="ref23">34</xref>
          ] with the proposed multimodal transformer and the efect of each aforementioned
stylometric feature on the proposed model. Then, we present the LIME Explainer [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] result
for a scientific paper text where the abstract was entirely written by prompting ChatGPT. To
compare the proposed model with existing models and baselines, we utilised diferent baseline
models such as word-level TF-IDF [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ], character n-gram [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ], Stylometric features [
          <xref ref-type="bibr" rid="ref24">35</xref>
          ] on the
ChatGPT-Aided Papers dataset.
        </p>
        <p>We performed hyper-parameter tuning to identify the best model parameters with the
converging loss. The proposed model was trained with Adam optimiser, with a 0.5 dropout rate,
a 16 batch size, 0.000001 learning rate, 0.000001 epsilon, 0.2 warm-up proportion, and five train
epochs maximum 512 token length. To ensure the reprehensibility of the presented research,
we release the code-base3 and the dataset4.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Results and Discussion</title>
      <p>The results of the experiments are illustrated in Table 2, showcasing that the proposed
multimodal transformer-based model outperforms the BERT (Sentence Classification) model, which
only utilises text features. The proposed model demonstrates superior performance in terms of
accuracy and F1 of 0.93, which is nearly 50% than the BERT model. This improvement could
be due to stylometric features - Average Word Length (AWL), Average Sentence Length By
Word (ASW), and Count Functional Words (CFW), which were combined with text features.</p>
      <sec id="sec-6-1">
        <title>2The application of all 7 features mentioned before resulted in overfitting.</title>
        <p>
          3Code-base - https://github.com/Kaniz92/Multimodal-ChatGPT-AA. We used Simple Transformers as the boilerplate
for the implementation [
          <xref ref-type="bibr" rid="ref25">36</xref>
          ].
4https://huggingface.co/datasets/Authorship/ChatGPT_Aided_Papers
        </p>
        <sec id="sec-6-1-1">
          <title>Model</title>
          <p>
            BERT [
            <xref ref-type="bibr" rid="ref23">34</xref>
            ]
Stylometric [
            <xref ref-type="bibr" rid="ref24">35</xref>
            ]
Character Ngram [
            <xref ref-type="bibr" rid="ref9">9</xref>
            ]
Word level TF-IDF [
            <xref ref-type="bibr" rid="ref9">9</xref>
            ]
          </p>
        </sec>
        <sec id="sec-6-1-2">
          <title>MMT (BERT) + Stylometric (AWL + ASW + CFW)</title>
          <p>MMT (BERT) + Stylometric (AWL + ASW + CFW) - CFW
MMT (BERT) + Stylometric (AWL + ASW + CFW) - ASW
MMT (BERT) + Stylometric (AWL + ASW + CFW) - AWL</p>
          <p>We conducted an ablation study by systematically removing each feature and evaluating the
performance to gain insights into the contribution of diferent stylometric features. Table 2
presents the results obtained from this study. The similar decrease in performance, resulting in
an accuracy of 0.94 when removing each feature, indicates that all three features contribute
equally to the model’s performance.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>7. Limitations and Future Directions</title>
      <p>
        Our work is limited by the use of ChatGPT 3.5 and its limitations. Research on LLMs is a rapidly
developing field that has seen the introduction of more advanced commercial models such
as GPT 4 [
        <xref ref-type="bibr" rid="ref26">37</xref>
        ] and BARD [
        <xref ref-type="bibr" rid="ref20">31</xref>
        ] and further open-source language models. Hence, future work
should focus on applying our task to other available LLMs or encoders/decoders (such as VAEs
and Difusion models) to provide us with more robust insights. Furthermore, policies on using
LLMs may allow them as writing aid where authors manually proofread and edit a generated
text; such hybrid texts should not necessarily be flagged as “generated”.
      </p>
      <p>Further, Due to the token limitation in the ChatGPT API, we considered the first 2500 words
(a) Annotated Text from LIME Explainer
(b) ChatGPT-Aided Paper Example
(ChatGPT abstract + full paper)
(c) Original Paper Example
(Original abstract + full paper)
of each paper as crucial for an abstract generation. The future directions for this research are
exploring other explainability models, calculating machine-aided text percentage and analysing
other scientific paper sections aided by generator models.</p>
    </sec>
    <sec id="sec-8">
      <title>8. Conclusion</title>
      <p>This paper proposes an encoder-decoder LLMs-based framework for multi-author identification
tasks, where machine-generation text identification is intended. We considered a scenario
of multi-authorship attribution with machine-aided text and original text. According to the
experimental results, the multimodal transformer model combines stylometric features and text
features to outperform the BERT model and other ablation studies. The study demonstrates the
efectiveness of handcrafted stylometric features. Applying the BERT model and the stylometric
features alone provides relatively poor results. Only their combination achieves a significant
performance boost. These exciting observations show that both approaches make classification
errors, but diferent ones that are ironed out when combined.
the Evaluation Forum, Lugano, Switzerland, September 9-12, 2019, volume 2380 of CEUR
Workshop Proceedings, CEUR-WS.org, 2019. URL: http://ceur-ws.org/Vol-2380/paper_220.
pdf.
[15] J. Bevendorf, M. Chinea-Rios, M. Franco-Salvador, A. Heini, E. Körner, K. Kredens, M.
Mayerl, P. Pezik, M. Potthast, F. Rangel, P. Rosso, E. Stamatatos, B. Stein, M. Wiegmann,
M. Wolska, E. Zangerle, Overview of PAN 2023: Authorship verification, multi-author
writing style analysis, profiling cryptocurrency influencers, and trigger detection -
extended abstract, in: Proceedings ECIR 2023, volume 13982 of Lecture Notes in Computer
Science, Springer, 2023, pp. 518–526. URL: https://doi.org/10.1007/978-3-031-28241-6_60.
doi:1 0 . 1 0 0 7 / 9 7 8 - 3 - 0 3 1 - 2 8 2 4 1 - 6 \ _ 6 0 .
[16] R. Sarwar, N. Urailertprasert, N. Vannaboot, C. Yu, T. Rakthanmanon, E. Chuangsuwanich,
S. Nutanong, CAG: stylometric authorship attribution of multi-author documents using
a co-authorship graph, IEEE Access 8 (2020) 18374–18393. URL: https://doi.org/10.1109/
ACCESS.2020.2967449. doi:1 0 . 1 1 0 9 / A C C E S S . 2 0 2 0 . 2 9 6 7 4 4 9 .
[17] M. M. Iqbal, A. Raza, M. M. Aslam, M. Farhan, S. Yaseen, A stylometric fingerprinting
method for author identification using machine learning, Technical Journal 28 (2023)
28–35.
[18] A. M. Omi, M. Hossain, M. N. Islam, T. Mittra, Multiple authors identification from source
code using deep learning model, Proceedings ICECIT 2021 (2021) 1–4.
[19] P. Bora, T. Awalgaonkar, H. Palve, R. Joshi, P. Goel, Icodenet - A hierarchical neural network
approach for source code author identification, in: Proceedings ICMLC 2021, ACM, 2021,
pp. 180–185. URL: https://doi.org/10.1145/3457682.3457709. doi:1 0 . 1 1 4 5 / 3 4 5 7 6 8 2 . 3 4 5 7 7 0 9 .
[20] K. Pillutla, S. Swayamdipta, R. Zellers, J. Thickstun, S. Welleck, Y. Choi, Z. Harchaoui,
MAUVE: measuring the gap between neural text and human text using divergence frontiers,
in: Proceedings NeurIPS 2021, 2021, pp. 4816–4828. URL: https://proceedings.neurips.cc/
paper/2021/hash/260c2432a0eecc28ce03c10dadc078a4-Abstract.html.
[21] J. Pu, Z. Sarwar, S. M. Abdullah, A. Rehman, Y. Kim, P. Bhattacharya, M. Javed,
B. Viswanath, Deepfake text detection: Limitations and opportunities, CoRR
abs/2210.09421 (2022). URL: https://doi.org/10.48550/arXiv.2210.09421. doi:1 0 . 4 8 5 5 0 / a r X i v .
2 2 1 0 . 0 9 4 2 1 . a r X i v : 2 2 1 0 . 0 9 4 2 1 .
[22] R. Gagiano, M. M.-H. Kim, X. Zhang, J. Biggs, Robustness analysis of grover for
machinegenerated news detection, in: Proceedings of the The 19th Annual Workshop of the
Australasian Language Technology Association, Australasian Language Technology
Association, Online, 2021, pp. 119–127. URL: https://aclanthology.org/2021.alta-1.12.
[23] A. Uchendu, T. Le, K. Shu, D. Lee, Authorship attribution for neural text generation,
in: B. Webber, T. Cohn, Y. He, Y. Liu (Eds.), Proceedings EMNLP 2020, Association for
Computational Linguistics, 2020, pp. 8384–8395. URL: https://doi.org/10.18653/v1/2020.
emnlp-main.673. doi:1 0 . 1 8 6 5 3 / v 1 / 2 0 2 0 . e m n l p - m a i n . 6 7 3 .
[24] K. Jones, J. R. C. Nurse, S. Li, Are you robert or roberta? deceiving online authorship
attribution models using neural text generators, in: C. Budak, M. Cha, D. Quercia (Eds.),
Proceedings of the Sixteenth International AAAI Conference on Web and Social Media,
ICWSM 2022, Atlanta, Georgia, USA, June 6-9, 2022, AAAI Press, 2022, pp. 429–440. URL:
https://ojs.aaai.org/index.php/ICWSM/article/view/19304.
[25] N. Dehouche, Plagiarism in the age of massive generative pre-trained transformers (gpt-3),</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Ma</surname>
          </string-name>
          , J. Liu,
          <string-name>
            <given-names>F.</given-names>
            <surname>Yi</surname>
          </string-name>
          , Q. Cheng, Y. Huang,
          <string-name>
            <given-names>W.</given-names>
            <surname>Lu</surname>
          </string-name>
          , X. Liu,
          <article-title>AI vs. human - diferentiation analysis of scientific content generation</article-title>
          ,
          <year>2023</year>
          .
          <article-title>a r X i v : 2 3 0 1 . 1 0 4 1 6</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>O. O.</given-names>
            <surname>Buruk</surname>
          </string-name>
          ,
          <article-title>Academic writing with GPT-3.5: reflections on practices, eficacy and transparency</article-title>
          ,
          <source>CoRR abs/2304</source>
          .11079 (
          <year>2023</year>
          ). URL: https://doi.org/10.48550/arXiv.2304.11079.
          <article-title>doi:1 0 . 4 8 5 5 0 / a r X i v . 2 3 0 4 . 1 1 0 7 9 . a r X i v : 2 3 0 4 . 1 1 0 7 9</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3] OpenAI, ChatGPT - openai blog, https://openai.com/blog/chatgpt/,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M. T.</given-names>
            <surname>Ribeiro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Guestrin</surname>
          </string-name>
          , ”
          <article-title>why should I trust you?”: Explaining the predictions of any classifier</article-title>
          ,
          <source>in: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM</source>
          ,
          <year>2016</year>
          , pp.
          <fpage>1135</fpage>
          -
          <lpage>1144</lpage>
          . URL: https://doi.org/ 10.1145/2939672.2939778.
          <source>doi:1 0 . 1 1</source>
          <volume>4 5 / 2 9 3 9 6 7 2 . 2 9 3 9 7 7 8 .</volume>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>E.</given-names>
            <surname>Stamatatos</surname>
          </string-name>
          ,
          <article-title>A survey of modern authorship attribution methods</article-title>
          ,
          <source>J. Assoc. Inf. Sci. Technol</source>
          .
          <volume>60</volume>
          (
          <year>2009</year>
          )
          <fpage>538</fpage>
          -
          <lpage>556</lpage>
          . URL: https://doi.org/10.1002/asi.21001.
          <source>doi:1 0 . 1 0 0 2 / a s i . 2 1</source>
          <volume>0 0 1 .</volume>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>O.</given-names>
            <surname>Aborisade</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Anwar</surname>
          </string-name>
          ,
          <article-title>Classification for authorship of tweets by comparing logistic regression and naive bayes classifiers</article-title>
          ,
          <source>in: 2018 IEEE International Conference on Information Reuse and Integration</source>
          ,
          <string-name>
            <surname>IRI</surname>
          </string-name>
          <year>2018</year>
          , IEEE,
          <year>2018</year>
          , pp.
          <fpage>269</fpage>
          -
          <lpage>276</lpage>
          . URL: https: //doi.org/10.1109/IRI.
          <year>2018</year>
          .
          <volume>00049</volume>
          .
          <source>doi:1 0 . 1 1 0 9 / I R I . 2 0</source>
          <volume>1 8 . 0 0 0 4 9 .</volume>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>D.</given-names>
            <surname>Madigan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Genkin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. D.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. G. D. D.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Argamon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Fradkin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Ye</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. D. L.</given-names>
            <surname>Consulting</surname>
          </string-name>
          ,
          <article-title>Author identification on the large scale</article-title>
          ,
          <source>in: In Proc. of the Meeting of the Classification Society of North America</source>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>J.</given-names>
            <surname>Soler Company</surname>
          </string-name>
          , L. Wanner,
          <article-title>On the relevance of syntactic and discourse features for author profiling and identification</article-title>
          , in: M.
          <string-name>
            <surname>Lapata</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Blunsom</surname>
            ,
            <given-names>A</given-names>
          </string-name>
          . Koller (Eds.),
          <source>Proceedings EACL</source>
          <year>2017</year>
          ,
          <article-title>Association for Computational Linguistics</article-title>
          ,
          <year>2017</year>
          , pp.
          <fpage>681</fpage>
          -
          <lpage>687</lpage>
          . URL: https: //doi.org/10.18653/v1/e17-
          <fpage>2108</fpage>
          .
          <source>doi:1 0 . 1 8</source>
          <volume>6 5 3</volume>
          / v 1 / e 1 7
          <article-title>- 2 1 0 8</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>M.</given-names>
            <surname>Fabien</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Villatoro-Tello</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Motlícek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Parida</surname>
          </string-name>
          ,
          <article-title>Bertaa : BERT fine-tuning for authorship attribution</article-title>
          , in: P.
          <string-name>
            <surname>Bhattacharyya</surname>
            ,
            <given-names>D. M.</given-names>
          </string-name>
          <string-name>
            <surname>Sharma</surname>
          </string-name>
          , R. Sangal (Eds.),
          <source>Proceedings ICON</source>
          <year>2020</year>
          ,
          <article-title>NLP Association of India (NLPAI</article-title>
          ),
          <year>2020</year>
          , pp.
          <fpage>127</fpage>
          -
          <lpage>137</lpage>
          . URL: https://aclanthology.org/
          <year>2020</year>
          . icon-main.
          <volume>16</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>C.</given-names>
            <surname>Saedi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dras</surname>
          </string-name>
          ,
          <article-title>Siamese networks for large-scale author identification</article-title>
          ,
          <source>Comput. Speech Lang</source>
          .
          <volume>70</volume>
          (
          <year>2021</year>
          )
          <article-title>101241</article-title>
          . URL: https://doi.org/10.1016/j.csl.
          <year>2021</year>
          .
          <volume>101241</volume>
          .
          <source>doi:1 0 . 1 0</source>
          <volume>1 6</volume>
          / j . c
          <source>s l . 2 0</source>
          <volume>2 1 . 1 0 1 2 4 1 .</volume>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>S.</given-names>
            <surname>Ruder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Ghafari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. G.</given-names>
            <surname>Breslin</surname>
          </string-name>
          ,
          <article-title>Character-level and multi-channel convolutional neural networks for large-scale authorship attribution</article-title>
          ,
          <source>CoRR abs/1609</source>
          .06686 (
          <year>2016</year>
          ). URL: http://arxiv.org/abs/1609.06686.
          <article-title>a r X i v : 1 6 0 9 . 0 6 6 8 6</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Sari</surname>
          </string-name>
          ,
          <article-title>Neural and non-neural approaches to authorship attribution</article-title>
          ,
          <source>Ph.D. thesis</source>
          , University of Shefield, UK,
          <year>2018</year>
          . URL: https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.
          <volume>755204</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>E.</given-names>
            <surname>Moreau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. K.</given-names>
            <surname>Jayapal</surname>
          </string-name>
          , G. Lynch,
          <string-name>
            <given-names>C.</given-names>
            <surname>Vogel</surname>
          </string-name>
          ,
          <article-title>Author verification: Basic stacked generalization applied to predictions from a set of heterogeneous learners - notebook for pan at clef 2015</article-title>
          , in: CLEF,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A.</given-names>
            <surname>Bacciu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. L.</given-names>
            <surname>Morgia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. N.</given-names>
            <surname>Nemmi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Neri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Stefa</surname>
          </string-name>
          ,
          <article-title>Cross-domain authorship attribution combining instance based and profile-based features</article-title>
          , in: L.
          <string-name>
            <surname>Cappellato</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Ferro</surname>
            ,
            <given-names>D. E.</given-names>
          </string-name>
          <string-name>
            <surname>Losada</surname>
          </string-name>
          , H. Müller (Eds.),
          <source>Working Notes of CLEF 2019 - Conference and Labs of Ethics in Science and Environmental Politics</source>
          <volume>21</volume>
          (
          <year>2021</year>
          )
          <fpage>17</fpage>
          -
          <lpage>23</lpage>
          .
          <source>doi:1 0 . 3 3</source>
          <volume>5 4</volume>
          / e s e p 0
          <volume>0 1 9 5 .</volume>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>M.</given-names>
            <surname>Landa-Blanco</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Flores</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mercado</surname>
          </string-name>
          ,
          <article-title>Human vs. ai authorship: Does it matter in evaluating creative writing? a pilot study using chatgpt (</article-title>
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>J. J.</given-names>
            <surname>Bird</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ek'art</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. R.</given-names>
            <surname>Faria</surname>
          </string-name>
          ,
          <article-title>Chatbot interaction with artificial intelligence: human data augmentation with t5 and language transformer ensemble for text classification</article-title>
          ,
          <source>Journal of Ambient Intelligence and Humanized Computing</source>
          <volume>14</volume>
          (
          <year>2020</year>
          )
          <fpage>3129</fpage>
          -
          <lpage>3144</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>F.</given-names>
            <surname>Gao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Xia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Qin</surname>
          </string-name>
          , X. Cheng, W. Zhou, T. Liu,
          <article-title>Soft contextual data augmentation for neural machine translation</article-title>
          , in: A.
          <string-name>
            <surname>Korhonen</surname>
            ,
            <given-names>D. R.</given-names>
          </string-name>
          <string-name>
            <surname>Traum</surname>
          </string-name>
          , L. Màrquez (Eds.),
          <source>Proceedings ACL</source>
          <year>2019</year>
          ,
          <article-title>Association for Computational Linguistics</article-title>
          ,
          <year>2019</year>
          , pp.
          <fpage>5539</fpage>
          -
          <lpage>5544</lpage>
          . URL: https://doi.org/10.18653/v1/p19-
          <fpage>1555</fpage>
          .
          <source>doi:1 0 . 1 8</source>
          <volume>6 5 3</volume>
          / v 1 / p 1
          <fpage>9</fpage>
          -
          <lpage>1</lpage>
          5
          <fpage>5</fpage>
          5 .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>K.</given-names>
            <surname>Gu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Budhkar</surname>
          </string-name>
          ,
          <article-title>A package for learning on tabular and text data with transformers</article-title>
          ,
          <source>in: Proceedings of the Third Workshop on Multimodal Artificial Intelligence</source>
          , Association for Computational Linguistics, Mexico City, Mexico,
          <year>2021</year>
          , pp.
          <fpage>69</fpage>
          -
          <lpage>73</lpage>
          . URL: https://www. aclweb.org/anthology/2021.maiworkshop-
          <volume>1</volume>
          .10.
          <article-title>doi:1 0 . 1 8 6 5 3 / v 1 / 2 0 2 1 . m a i w o r k s h o p - 1 . 1 0 .</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [30]
          <string-name>
            <surname>OpenAI</surname>
          </string-name>
          ,
          <string-name>
            <surname>Chatgpt</surname>
            <given-names>API</given-names>
          </string-name>
          , https://openai.com/docs/api,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [31]
          <string-name>
            <surname>Google</surname>
            <given-names>AI Blog</given-names>
          </string-name>
          ,
          <article-title>BARD: Google AI Search Updates</article-title>
          , https://blog.google/technology/ai/ bard-google
          <article-title>-ai-search-updates/</article-title>
          , YYYY. Accessed: Month Day,
          <year>Year</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>A.</given-names>
            <surname>Chowdhery</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Narang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Devlin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Bosma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Mishra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Roberts</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Barham</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. W.</given-names>
            <surname>Chung</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Sutton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Gehrmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Schuh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Shi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Tsvyashchenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Maynez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Barnes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Tay</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Shazeer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Prabhakaran</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Reif</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Du</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Hutchinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Pope</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bradbury</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Austin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Isard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Gur-Ari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Yin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Duke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Levskaya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ghemawat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Dev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Michalewski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Garcia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Misra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Robinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Fedus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Ippolito</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Luan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Zoph</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Spiridonov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Sepassi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Dohan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Agrawal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Omernick</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. M.</given-names>
            <surname>Dai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. S.</given-names>
            <surname>Pillai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Pellat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lewkowycz</surname>
          </string-name>
          , E. Moreira,
          <string-name>
            <given-names>R.</given-names>
            <surname>Child</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Polozov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Saeta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Diaz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Firat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Catasta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Meier-Hellstern</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Eck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Dean</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Petrov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Fiedel</surname>
          </string-name>
          , Palm:
          <article-title>Scaling language modeling with pathways</article-title>
          ,
          <source>CoRR abs/2204</source>
          .02311 (
          <year>2022</year>
          ). URL: https://doi.org/10.48550/arXiv.2204.02311.
          <article-title>doi:1 0 . 4 8 5 5 0 / a r X i v . 2 2 0 4 . 0 2 3 1 1 . a r X i v : 2 2 0 4 . 0 2 3 1 1</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>A.</given-names>
            <surname>Cohan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Dernoncourt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. S.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Bui</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Goharian</surname>
          </string-name>
          ,
          <article-title>A discourseaware attention model for abstractive summarization of long documents</article-title>
          , in: M. A.
          <string-name>
            <surname>Walker</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Ji</surname>
            ,
            <given-names>A</given-names>
          </string-name>
          . Stent (Eds.),
          <source>Proceedings NAACL-HLT</source>
          <year>2018</year>
          ,
          <article-title>Association for Computational Linguistics</article-title>
          ,
          <year>2018</year>
          , pp.
          <fpage>615</fpage>
          -
          <lpage>621</lpage>
          . URL: https://doi.org/10.18653/v1/n18-
          <fpage>2097</fpage>
          .
          <article-title>doi:1 0 . 1 8 6 5 3 / v 1 / n 1 8 - 2 0 9 7</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>H.</given-names>
            <surname>Face</surname>
          </string-name>
          , Hugging face transformers documentation: Bertforsequenceclassification, https://huggingface.co/docs/transformers/v4.30.0/en/model_doc/bert#transformers. BertForSequenceClassification, Accessed:
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Sari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Stevenson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Vlachos</surname>
          </string-name>
          ,
          <article-title>Topic or style? exploring the most useful features for authorship attribution</article-title>
          , in: E. M.
          <string-name>
            <surname>Bender</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Derczynski</surname>
          </string-name>
          , P. Isabelle (Eds.),
          <source>Proceedings of the 27th International Conference on Computational Linguistics</source>
          ,
          <string-name>
            <surname>COLING</surname>
          </string-name>
          <year>2018</year>
          ,
          <string-name>
            <given-names>Santa</given-names>
            <surname>Fe</surname>
          </string-name>
          , New Mexico, USA,
          <year>August</year>
          20-
          <issue>26</issue>
          ,
          <year>2018</year>
          , Association for Computational Linguistics,
          <year>2018</year>
          , pp.
          <fpage>343</fpage>
          -
          <lpage>353</lpage>
          . URL: https://aclanthology.org/C18-1029/.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>T. C.</given-names>
            <surname>Rajapakse</surname>
          </string-name>
          , Simple transformers, https://github.com/ThilinaRajapakse/ simpletransformers,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [37] OpenAI, GPT-4
          <source>technical report, CoRR abs/2303</source>
          .08774 (
          <year>2023</year>
          ). URL: https://doi.org/10.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>