<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Datasets and Models for Authorship Attribution on Italian Personal Writings</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Gaetana Ruggiero</string-name>
          <email>garuggiero@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Albert Gatt</string-name>
          <email>albert.gatt@um.edu.mt</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Malvina Nissim</string-name>
          <email>m.nissim@rug.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute of Linguistics and Language Technology, University of Malta, Malta Center for Language and Cognition, University of Groningen</institution>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Existing research on Authorship Attribution (AA) focuses on texts for which a lot of data is available (e.g novels), mainly in English. We approach AA via Authorship Verification on short Italian texts in two novel datasets, and analyze the interaction between genre, topic, gender and length. Results show that AV is feasible even with little data, but more evidence helps. Gender and topic can be indicative clues, and if not controlled for, they might overtake more specific aspects of personal style.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Authorship Attribution (AA) is the task of
identifying authors by their writing style. In addition
to being a tool for studying individual language
choices, AA is useful for many real-life
applications, such as plagiarism detection
        <xref ref-type="bibr" rid="ref27">(Stamatatos
and Koppel, 2011)</xref>
        , multiple accounts detection
        <xref ref-type="bibr" rid="ref18 ref31 ref34">(Tsikerdekis and Zeadally, 2014)</xref>
        , and online
security
        <xref ref-type="bibr" rid="ref18 ref31 ref34">(Yang and Chow, 2014)</xref>
        .
      </p>
      <p>
        Most work on AA focuses on English, on
relatively long texts such as novels and articles
        <xref ref-type="bibr" rid="ref13">(Juola,
2015)</xref>
        where personal style could be mitigated due
to editorial interventions. Furthermore, in many
real-world applications the texts of disputed
authorship tend to be short
        <xref ref-type="bibr" rid="ref22">(Omar et al., 2019)</xref>
        .
      </p>
      <p>
        The PAN 2020 shared task was originally
meant to investigate multilingual AV in fanfiction,
focusing on Italian, Spanish, Dutch and English
        <xref ref-type="bibr" rid="ref2">(Bevendorff et al., 2020)</xref>
        . However, the datasets
were eventually restricted to English only, to
maximize the amount of available training data
        <xref ref-type="bibr" rid="ref15">(Kestemont et al., 2020)</xref>
        , emphasizing the difficulty in
compiling large enough datasets for less-resourced
languages.
      </p>
      <p>Copyright c 2020 for this paper by its authors. Use
permitted under Creative Commons License Attribution 4.0
International (CC BY 4.0).</p>
      <p>
        AA research in Italian has largely focused on
the single case of Elena Ferrante
        <xref ref-type="bibr" rid="ref32 ref9">(Tuzzi and
Cortelazzo, 2018)</xref>
        1. The present work seeks a more
realistic take, using more diverse, user-generated
data namely web forums comments and diary
fragments, thereby introducing two novel datasets for
this task: ForumFree and Diaries.
      </p>
      <p>
        We cast the AA problem as authorship
verification (AV). Rather than identifying the specific
author of a text (the most common task in AA), AV
aims at determining whether two texts were
written by the same author or not
        <xref ref-type="bibr" rid="ref17 ref19">(Koppel and Schler,
2004; Koppel et al., 2009)</xref>
        .
      </p>
      <p>
        The GLAD system of Hu¨rlimann et al. (2015)
was specifically developed to solve AV problems,
and has been shown to be highly adaptable to new
datasets
        <xref ref-type="bibr" rid="ref9">(Halvani et al., 2018)</xref>
        . GLAD uses an
SVM with a variety of features including
character level ones, which have proved to be most
effective for AA tasks
        <xref ref-type="bibr" rid="ref11 ref21 ref30">(Stamatatos, 2009; Moreau et al.,
2015; Hu¨rlimann et al., 2015)</xref>
        , and is freely
available. Moreover, Kestemont et al. (2019) show that
many of the best models for authorship attribution
are based on Support Vector Machines. Hence we
adopt GLAD in the present study.
      </p>
      <p>
        More specifically, we run GLAD on our
datasets and study the interaction of four
different dimensions: topic, gender, amount of evidence
per author, and genre. In practice, we design
intratopic, cross-topic, and cross-genre experiments,
controlling for gender and amount of evidence per
author. The focus on cross-topic and cross-genre
AV is in line with the PAN 2015 shared task
        <xref ref-type="bibr" rid="ref29">(Stamatatos et al., 2015)</xref>
        ; this setting has been shown
to be more challenging than the task definitions
of previous editions
        <xref ref-type="bibr" rid="ref12 ref28">(Juola and Stamatatos, 2013;
Stamatatos et al., 2014)</xref>
        .
      </p>
    </sec>
    <sec id="sec-2">
      <title>1https://www.newyorker.com/culture/cultural</title>
      <p>comment/the-unmasking-of-elena-ferrante
Contributions We advance AA for Italian
introducing two novel datasets, ForumFree and
Diaries, which contribute to enhance the amount of
available Italian data suitable for AA tasks.2</p>
      <p>Running a battery of experiments on personal
writings, we show that AV is feasible even with
little data, but more evidence helps. Gender and
topic can be indicative clues, and if not controlled
for, they might overtake more specific aspects of
personal style.
2</p>
      <sec id="sec-2-1">
        <title>Data</title>
        <p>
          For the present study, we introduce two novel
datasets, ForumFree and Diaries. Although
already compiled
          <xref ref-type="bibr" rid="ref20">(Maslennikova et al., 2019)</xref>
          , the
original ForumFree dataset was not meant for AA.
Therefore, we reformat it following the PAN
format3. The dataset contains web forum comments
taken from the ForumFree platform4, and the
subset used in this work covers two topics, Medicina
Estetica (“Aesthethic Medicine”) and Programmi
Tv (“Tv Programmes”; Celebrities in the
original dataset). A third subset, Mix, is the union of
the first two. The Diaries dataset is originally
assembled for the present study, and contains a
collection of diary fragments included in the project
Italiani all’estero: i diari raccontano (“Italians
abroad: the diaries narrate”).5 For Diaries, no
topic classification has been taken into account.
Table 1 shows an overview of the datasets.
        </p>
        <p>Subset
Med Est
Prog TV
Mix
Diaries</p>
        <p># Authors
F M Tot
33 44 77
78 71 149
111 115 276
77 188 275
# Docs W/A D/A</p>
        <p>W/D
2Further information about the datasets can be found at
https://github.com/garuggiero/Italian-Datasets-for-AV
3https://pan.webis.de/clef15/pan15-web/authorshipverification.html
4https://www.forumfree.it/
5https://www.idiariraccontano.org
ten in the past, were removed from the dataset,
together with their authors when this was the only
text associated with them.</p>
        <p>
          The stories narrated in the diaries are of a very
personal nature, which means that many proper
nouns and names of locations are used. To avoid
relying on these explicit clues, which are strong
but not indicative of personal writing style, we
perform Named Entity Recognition (NER),
using spaCy
          <xref ref-type="bibr" rid="ref10">(Honnibal, 2015)</xref>
          . Person names,
locations and organizations were replaced by their
corresponding labels, namely PER, LOC, ORG.
The fourth label used by spaCy, MISC
(miscellany), was not considered; dates were also not
normalized. Moreover, a separate set of experiments
was performed by bleaching the diary texts prior
to their input to the GLAD system. The
bleaching method was proposed by van der Goot et al.
(2018) in the context of cross-lingual Gender
Prediction, and consists of transforming tokens into
an abstract representation that masks lexical forms
while maintaining key features. We only use 4 of
the 6 original features. Shape transforms
uppercase letters into ‘U’, lowercase ones into ‘L’,
digits into ‘D’, and the rest into ‘X’. PunctA replaces
emojis with ‘J’, emoticons with ‘E’, punctuation
with ‘P’ and one or more alphanumeric characters
with a single ‘W’. Length represents a word by the
number of its characters. Frequency corresponds
to the log frequency of a token in the dataset. The
features are then concatenated. The word ‘House’
would be rewritten as ‘ULLLL W 05 6’.
2.2
        </p>
        <p>Reformatting
We reformat both datasets in order to make them
suitable for AV. The data is divided into so-called
problems: each problem is made of a known and
an unknown text of equal length.</p>
        <p>
          To account for the shortness of the texts and to
avoid topic biases that would derive by taking
consecutive text as known and unknown fragments,
all the documents written by the same author are
first shuffled and then concatenated into a single
string. The string is split into two spans
containing the same number of words, so that the words
contained in the unknown span come from subsets
of texts which are different from the ones that form
the known one. An example of this process is
displayed in Figure 1. Rather than being represented
by individual productions, each author is therefore
represented by a set of texts, whose original
sequential order has been altered. Each known text
is paired with an unknown text from the same
author. To create negative instances, given a dataset
with multiple problems, one can (i) make use
of external documents (extrinsic approach
          <xref ref-type="bibr" rid="ref18 ref26 ref31 ref34">(Seidman, 2013; Koppel and Winter, 2014)</xref>
          ), or (ii) use
fragments collated from all authors in the
training data, except the target author (intrinsic
approach). We create negative instances with an
intrinsic approach. More specifically, following
Dwyer (2017), the second half of the unknown
array is shifted by one, so that the texts of the second
half of the known array are paired with a
differentauthor text in the unknown array. In this way, the
label distribution is balanced.
3
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Method</title>
        <p>Given a pair of known and unknown fragments
(KU pair), the task is to predict whether they are
written by the same author or not. In designing our
experiments, we control for topic, gender, amount
of evidence, and genre. The latter is fostered by
the diverse nature of our datasets.</p>
        <p>Topic Maintaining the topic roughly constant
should allow stylistic features to gain more
discriminative value. We design intra-topic (IT) and
cross-topic experiments (CT). In IT, we
distinguish same- and different-topic KU pairs. In
same-topic, we train and test the system on KU
pairs from the same topic. In different-topic, we
include the Mix set and the diaries. Since we train
and test on a mixture of topics and there can be
topic overlap, these are not truly cross-topic, and
we do not consider them as such.</p>
        <p>Given that no topic classification is available
for the diaries, the CT experiments are only
performed on the ForumFree dataset. We train the
system on Medicina Estetica and test it on
Programmi Tv, and vice versa.</p>
        <p>
          Gender Previous work has shown that similarity
can be observed in writings of people of the same
gender
          <xref ref-type="bibr" rid="ref1 ref24">(Basile et al., 2017; Rangel et al., 2017)</xref>
          .6
In order to assess the influence of same vs different
gender in AA, we consider three gender settings:
only female authors and only male authors
(singlegender), and mixed-gender, where the known and
unknown document can be either written by two
authors of the same gender, or by a male and a
female author. In dividing the subsets according
to the gender of the authors, we consider gender
implicitly. However, we also perform experiments
adding gender as feature to the instance vectors,
indicating both the gender of the known and
unknown documents’ authors and whether or not the
gender of the authors is the same.
        </p>
        <p>Evidence Following Feiguina and Hirst (2007),
we experiment with KU pairs of different sizes,
i.e. with 400, 1 000, 2 000 and 3 000 words per
author. Each element of the KU pair is thus made up
of 200, 500, 1 000 and 1 500 words respectively.
To observe the effect of the different text sizes on
the classification, we manipulate the number of
instances in training and test, so that the same
authors are included in all the different word settings
of a single topic-gender experiment.</p>
        <p>6Binary gender is a simplification of a much more
nuanced situation in reality. Following previous work, we adopt
it for convenience.</p>
        <p>Genre We perform cross-genre experiments
(CG) by training on ForumFree and testing on the
Diaries, and vice versa.</p>
        <p>
          Splits and Evaluation We train on 70% and test
on 30% of the instances. However, since we are
controlling for gender and topic, the number of
instances contained in the training and test sets
varies in each experiment. We keep the test sets
stable across IT, CT and CG experiments, so that
we can compare results. Following the PAN
evaluation settings
          <xref ref-type="bibr" rid="ref29">(Stamatatos et al., 2015)</xref>
          , we use
three metrics. c@1 takes into account the
number of problems left unanswered and rewards the
system when it classifies a problem as unanswered
rather than misclassifying it.
        </p>
        <p>
          Probability scores are converted to binary
answers: every score greater than 0.5 becomes a
positive answer, every score smaller than 0.5
corresponds to a negative answer and every score
which is exactly 0.5 is considered as an
unanswered problem. The AU C measure corresponds
to the area under the ROC curve
          <xref ref-type="bibr" rid="ref7">(Fawcett, 2006)</xref>
          ,
and tests the ability of the system to rank scores
properly, assigning low values to negative
problems and high values to positive ones
          <xref ref-type="bibr" rid="ref29">(Stamatatos
et al., 2015)</xref>
          . The third measure is the product of
c@1 and AU C.
        </p>
        <p>
          Model We run all experiments using GLAD
          <xref ref-type="bibr" rid="ref11">(Hu¨rlimann et al., 2015)</xref>
          . This is an SVM with rbf
kernel, implemented using Python’s scikit-learn
          <xref ref-type="bibr" rid="ref23">(Pedregosa et al., 2011)</xref>
          library and NLTK
          <xref ref-type="bibr" rid="ref3">(Bird et
al., 2009)</xref>
          . GLAD was designed to work with 24
different features, which take into account
stylometry, entropy and data compression measures. We
compare GLAD to a simple baseline which
randomly assigns a label from the set of possible
labels (i.e. ‘YES’ or ‘NO’) to each test instance.
        </p>
        <p>
          Our choice fell on GLAD for a variety of
reasons. As a general observation, even in later
challenges, SVMs have proven to be the most
effective for AA tasks
          <xref ref-type="bibr" rid="ref14">(Kestemont et al., 2019)</xref>
          . More
specifically, in a survey of freely available AA
systems, GLAD showed best performance and
especially high adaptability to new datasets
          <xref ref-type="bibr" rid="ref9">(Halvani
et al., 2018)</xref>
          . Lastly, de Vries (2020) has
explored fine-tuning a pre-trained model for AV in
Dutch, a less-resourced language compared to
English. He found that fine-tuning BERTje (a Dutch
monolingual BERT-model,
          <xref ref-type="bibr" rid="ref4">(de Vries et al., 2019)</xref>
          )
with PAN 2015 AV data
          <xref ref-type="bibr" rid="ref29">(Stamatatos et al., 2015)</xref>
          ,
failed to outperform a majority baseline
          <xref ref-type="bibr" rid="ref5">(de Vries,
2020)</xref>
          . He concluded that Tranformer-encoder
models might not suitable for AA tasks, since they
will likely overfit if the documents contain no
reliable clues of authorship
          <xref ref-type="bibr" rid="ref5">(de Vries, 2020)</xref>
          .
4
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>Results and Discussion</title>
        <p>The number of experiments is high due to the
interaction of the dimensions we consider.</p>
        <p>Tables 2 and 3 only include the mixed-gender
results of the IT experiments on Mix (which
corresponds to the entire ForumFree dataset used for
this study) and Diaries, respectively. Results
concerning all dimensions considered are anyway
discussed in the text. We refer to the combined score.
Since the baseline results are different for each
setting, we do not include them. However, all
models perform consistently above their corresponding
baseline.</p>
        <p>For the Mix topic, we achieved 0.966 with 96
authors in total and 3 000 words (Table 2). For the
diaries, we achieved 0.821 with 46 authors in total
and 3 000 words each (Table 3).7 Although the
training and test sets are of different sizes for both
datasets, more evidence seems to help the model
to solve the problem.</p>
        <p>In the IT experiments, the highest score for
Medicina Estetica is 0.923, with 41 authors in total
and 1 000 words per author, and for Programmi Tv
0.944, with 59 authors and 3 000 words each. In
the CT setting, the scores stay basically the same
in both directions. In CG, when training on the
diaries and testing on Mix, we obtain the same
score when training on Mix with 3 000 words.
When training on Mix and testing on Diaries, we
achieved 0.737 on the same test set, and 0.748 with
1 000 words per instance.</p>
        <p>Discussion When more variables interact in the
same subset, as in mixed-gender sets of the
ForumFree and Diaries dataset, we found that the
classifier uses the implicit gender information.
Indeed, it achieves slightly better scores in
mixedgender settings than in female- and male-only
ones, suggesting that the classifier might be using
internal clustering of the data rather than writing
style characteristics. This also explains why
results are higher in Mix than in separate topics,
because the classifier can use topic information.</p>
        <p>7Using a bleached representation of the texts, the score
increased by 0.36
# W/A
# Auth
# Problems</p>
        <p>Train Test
0.691
0.796
0.833
0.857</p>
        <p>AUC</p>
        <p>We also observe that by adding gender as an
explicit feature in topic- and gender-controlled
subsets, GLAD uses this information to improve
classification, especially in mixed-gender scenarios.</p>
        <p>
          Although previous research demonstrated that
CT and CG experiments are harder than IT ones
          <xref ref-type="bibr" rid="ref25 ref29">(Sapkota et al., 2014; Stamatatos et al., 2015)</xref>
          ,
in our case the scores for the three settings are
comparable. However, since we only performed
CT and CG experiments on mixed-gender subsets,
the gender-specific information might have also
played a role in this process (see above).
        </p>
        <p>Overall, the experiments show that using a
higher number of words per author is preferable.
Although 3 000 words seems to be optimal for
most settings, in the large number of experiments
that we carried out (not all included in this paper)
we also observed that lower amounts of words also
led to comparable results. This aspect will require
further investigation.
5</p>
      </sec>
      <sec id="sec-2-4">
        <title>Conclusion</title>
        <p>We experimented with AV on Italian forum
comments and diary fragments. We compiled two
datasets and performed experiments which
considered the interaction among topic, gender, length
and genre. Even when the texts are short and
present more individual variation than traditional
texts used in AA, AV is a feasible task, but having
more evidence per author improves classification.
While making the task more challenging,
controlling for gender and topic ensures that the system
prioritizes authorship over different data clusters.
Although the datasets used are intended for AV
problems, they can be easily adapted to other AA
tasks. We believe this to be one of the major
contributions of our work, as it can help to advance
the up-to-now limited AA research in Italian.</p>
      </sec>
      <sec id="sec-2-5">
        <title>Acknowledgments</title>
        <p>The ForumFree dataset was a courtesy of the
Italian Institute of Computational Linguistics
“Antonio Zampolli” (ILC) of Pisa.8</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>8http://www.ilc.cnr.it/</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Angelo</given-names>
            <surname>Basile</surname>
          </string-name>
          , Gareth Dwyer, Maria Medvedeva, Josine Rawee, Hessel Haagsma, and
          <string-name>
            <given-names>Malvina</given-names>
            <surname>Nissim</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>N-GrAM: New Groningen Authorprofiling Model-Notebook for PAN at CLEF 2017</article-title>
          .
          <source>In CEUR Workshop Proceedings</source>
          , volume
          <year>1866</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Janek</given-names>
            <surname>Bevendorff</surname>
          </string-name>
          , Bilal Ghanem, Anastasia Giachanou, Mike Kestemont, Enrique Manjavacas, Martin Potthast, Francisco Rangel, Paolo Rosso, Gu¨nther Specht, Efstathios Stamatatos, Benno Stein, Matti Wiegmann, and
          <string-name>
            <given-names>Eva</given-names>
            <surname>Zangerle</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>Shared Tasks on Authorship Analysis at PAN 2020</article-title>
          . In Joemon M. Jose, Emine Yilmaz, Joa˜o Magalha˜es, Pablo Castells, Nicola Ferro, Ma´rio
          <string-name>
            <given-names>J.</given-names>
            <surname>Silva</surname>
          </string-name>
          , and Fla´vio Martins, editors,
          <source>Advances in Information Retrieval</source>
          , pages
          <fpage>508</fpage>
          -
          <lpage>516</lpage>
          , Cham. Springer International Publishing.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Steven</given-names>
            <surname>Bird</surname>
          </string-name>
          , Ewan Klein, and
          <string-name>
            <given-names>Edward</given-names>
            <surname>Loper</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Natural language processing with Python: analyzing text with the natural language toolkit</article-title>
          . ”
          <string-name>
            <surname>O'Reilly Media</surname>
          </string-name>
          , Inc.”.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Wietse de Vries</surname>
          </string-name>
          , Andreas van Cranenburgh,
          <string-name>
            <surname>Arianna Bisazza</surname>
            , Tommaso Caselli, Gertjan van Noord,
            <given-names>and Malvina</given-names>
          </string-name>
          <string-name>
            <surname>Nissim</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Bertje: A Dutch BERT model</article-title>
          . arXiv preprint arXiv:
          <year>1912</year>
          .09582.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Wietse de Vries</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>Language Models are not just English Anymore: Training and Evaluation of a Dutch BERT-based Language Model Named BERTje</article-title>
          .
          <source>Master Thesis in Information Science</source>
          , University of Groningen, The Netherlands.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>Gareth</given-names>
            <surname>Terence Bryan Dwyer</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Novel approaches to authorship attribution</article-title>
          .
          <source>Master Thesis in Language and Communication Technologies</source>
          , Information Science, University of Groningen, The Netherlands.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Tom</given-names>
            <surname>Fawcett</surname>
          </string-name>
          .
          <year>2006</year>
          .
          <article-title>An introduction to roc analysis</article-title>
          .
          <source>Pattern recognition letters</source>
          ,
          <volume>27</volume>
          (
          <issue>8</issue>
          ):
          <fpage>861</fpage>
          -
          <lpage>874</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Olga</given-names>
            <surname>Feiguina</surname>
          </string-name>
          and
          <string-name>
            <given-names>Graeme</given-names>
            <surname>Hirst</surname>
          </string-name>
          .
          <year>2007</year>
          .
          <article-title>Authorship attribution for small texts: Literary and forensic experiments</article-title>
          .
          <source>In Proceedings of the SIGIR'07 Workshop on Plagiarism Analysis</source>
          ,
          <string-name>
            <given-names>Authorship</given-names>
            <surname>Identification</surname>
          </string-name>
          , and
          <string-name>
            <surname>Near-Duplicate Detection</surname>
          </string-name>
          (PAN
          <year>2007</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <given-names>Oren</given-names>
            <surname>Halvani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Christian</given-names>
            <surname>Winter</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Lukas</given-names>
            <surname>Graner</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Unary and binary classification approaches and their implications for authorship verification</article-title>
          . arXiv preprint arXiv:
          <year>1901</year>
          .00399.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>Matthew</given-names>
            <surname>Honnibal</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>spacy: Industrial-strength natural language processing (nlp) with python and cython</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>Manuela</given-names>
            <surname>Hu</surname>
          </string-name>
          ¨rlimann, Benno Weck, Esther van den Berg, Simon Suster, and
          <string-name>
            <given-names>Malvina</given-names>
            <surname>Nissim</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Glad: Groningen lightweight authorship detection</article-title>
          .
          <source>In CLEF (Working Notes).</source>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>Patrick</given-names>
            <surname>Juola</surname>
          </string-name>
          and
          <string-name>
            <given-names>Efstathios</given-names>
            <surname>Stamatatos</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Overview of the Author Identification Task at PAN 2013</article-title>
          .
          <source>CLEF (Working Notes)</source>
          ,
          <volume>1179</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>Patrick</given-names>
            <surname>Juola</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>The Rowling case: A proposed standard analytic protocol for authorship questions</article-title>
          .
          <source>Digital Scholarship in the Humanities</source>
          ,
          <volume>30</volume>
          (
          <issue>suppl 1</issue>
          ):
          <fpage>i100</fpage>
          -
          <lpage>i113</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>Mike</given-names>
            <surname>Kestemont</surname>
          </string-name>
          , Efstathios Stamatatos, Enrique Manjavacas, Walter Daelemans,
          <string-name>
            <given-names>Martin</given-names>
            <surname>Potthast</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Benno</given-names>
            <surname>Stein</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Overview of the Cross-domain Authorship Attribution Task at PAN 2019</article-title>
          . In CLEF (Working Notes).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <given-names>Mike</given-names>
            <surname>Kestemont</surname>
          </string-name>
          , Enrique Manjavacas, Ilia Markov, Janek Bevendorff, Matti Wiegmann, Efstathios Stamatatos,
          <string-name>
            <given-names>Martin</given-names>
            <surname>Potthast</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Benno</given-names>
            <surname>Stein</surname>
          </string-name>
          .
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <article-title>Overview of the Cross-Domain Authorship Verification Task at PAN 2020</article-title>
          . In Linda Cappellato, Carsten Eickhoff, Nicola Ferro, and Aure´lie Ne´ve´ol, editors,
          <source>CLEF 2020 Labs and Workshops</source>
          , Notebook Papers.
          <article-title>CEUR-WS</article-title>
          .org, September.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <given-names>Moshe</given-names>
            <surname>Koppel</surname>
          </string-name>
          and
          <string-name>
            <given-names>Jonathan</given-names>
            <surname>Schler</surname>
          </string-name>
          .
          <year>2004</year>
          .
          <article-title>Authorship verification as a one-class classification problem</article-title>
          .
          <source>In Proceedings of the twenty-first international conference on Machine learning, page 62.</source>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <given-names>Moshe</given-names>
            <surname>Koppel</surname>
          </string-name>
          and
          <string-name>
            <given-names>Yaron</given-names>
            <surname>Winter</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Determining if two documents are written by the same author</article-title>
          .
          <source>Journal of the Association for Information Science and Technology</source>
          ,
          <volume>65</volume>
          (
          <issue>1</issue>
          ):
          <fpage>178</fpage>
          -
          <lpage>187</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <given-names>Moshe</given-names>
            <surname>Koppel</surname>
          </string-name>
          , Jonathan Schler, and
          <string-name>
            <given-names>Shlomo</given-names>
            <surname>Argamon</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Computational methods in authorship attribution</article-title>
          .
          <source>Journal of the American Society for information Science and Technology</source>
          ,
          <volume>60</volume>
          (
          <issue>1</issue>
          ):
          <fpage>9</fpage>
          -
          <lpage>26</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <given-names>Aleksandra</given-names>
            <surname>Maslennikova</surname>
          </string-name>
          , Paolo Labruna,
          <source>Andrea Cimino, and Felice Dell'Orletta</source>
          .
          <year>2019</year>
          .
          <article-title>Quanti anni hai? Age Identification for Italian</article-title>
          .
          <source>In Proceedings of 6th Italian Conference on Computational Linguistics (CLiC-it)</source>
          ,
          <fpage>13</fpage>
          -
          <lpage>15</lpage>
          November,
          <year>2019</year>
          , Bari, Italy.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <given-names>Erwan</given-names>
            <surname>Moreau</surname>
          </string-name>
          , Arun Jayapal, Gerard Lynch, and
          <string-name>
            <given-names>Carl</given-names>
            <surname>Vogel</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Author verification: basic stacked generalization applied to predictions from a set of heterogeneous learners-notebook for pan at clef 2015</article-title>
          . In Linda Cappellato, Nicola Ferro, Gareth Jones, and Eric San Juan, editors,
          <source>CLEF 2015 Evaluation Labs and Workshop - Working Notes Papers</source>
          ,
          <fpage>8</fpage>
          -
          <lpage>11</lpage>
          September, Toulouse, France.
          <source>CEUR-WS.org.</source>
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <given-names>Abdulfattah</given-names>
            <surname>Omar</surname>
          </string-name>
          ,
          <source>Basheer Ibrahim Elghayesh, and Mohamed Ali Mohamed Kassem</source>
          .
          <year>2019</year>
          .
          <article-title>Authorship attribution revisited: The problem of flash fiction a morphological-based linguistic stylometry approach</article-title>
          .
          <source>Arab World English Journal (AWEJ</source>
          ) Volume,
          <volume>10</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <given-names>Fabian</given-names>
            <surname>Pedregosa</surname>
          </string-name>
          , Gae¨l Varoquaux, Alexandre Gramfort, Vincent Michel, Bertrand Thirion, Olivier Grisel, Mathieu Blondel,
          <string-name>
            <given-names>Peter</given-names>
            <surname>Prettenhofer</surname>
          </string-name>
          , Ron Weiss,
          <string-name>
            <surname>Vincent Dubourg</surname>
          </string-name>
          , et al.
          <year>2011</year>
          .
          <article-title>Scikit-learn: Machine learning in python</article-title>
          .
          <source>the Journal of machine Learning research</source>
          ,
          <volume>12</volume>
          :
          <fpage>2825</fpage>
          -
          <lpage>2830</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <given-names>Francisco</given-names>
            <surname>Rangel</surname>
          </string-name>
          , Paolo Rosso,
          <string-name>
            <given-names>Martin</given-names>
            <surname>Potthast</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Benno</given-names>
            <surname>Stein</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Overview of the 5th author profiling task at pan 2017: Gender and language variety identification in twitter</article-title>
          .
          <source>Working notes papers of the CLEF</source>
          , pages
          <fpage>1613</fpage>
          -
          <lpage>0073</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <given-names>Upendra</given-names>
            <surname>Sapkota</surname>
          </string-name>
          , Thamar Solorio, Manuel Montes, Steven Bethard, and
          <string-name>
            <given-names>Paolo</given-names>
            <surname>Rosso</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Crosstopic authorship attribution: Will out-of-topic data help</article-title>
          ?
          <source>In Proceedings of COLING</source>
          <year>2014</year>
          ,
          <source>the 25th International Conference on Computational Linguistics: Technical Papers</source>
          , pages
          <fpage>1228</fpage>
          -
          <lpage>1237</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          <string-name>
            <given-names>Shachar</given-names>
            <surname>Seidman</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Authorship verification using the impostors method</article-title>
          .
          <source>In CLEF 2013 Evaluation labs and workshop-Working notes papers</source>
          , pages
          <fpage>23</fpage>
          -
          <lpage>26</lpage>
          . Citeseer.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          <string-name>
            <given-names>Efstathios</given-names>
            <surname>Stamatatos</surname>
          </string-name>
          and
          <string-name>
            <given-names>Moshe</given-names>
            <surname>Koppel</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Plagiarism and authorship analysis: introduction to the special issue</article-title>
          .
          <source>Language Resources and Evaluation</source>
          ,
          <volume>45</volume>
          (
          <issue>1</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>4</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          <string-name>
            <given-names>Efstathios</given-names>
            <surname>Stamatatos</surname>
          </string-name>
          , Walter Daelemans, Ben Verhoeven, Martin Potthast, Benno Stein, Patrick Juola,
          <string-name>
            <surname>Miguel A Sanchez-Perez</surname>
          </string-name>
          , and Alberto Barro´nCeden˜o.
          <year>2014</year>
          .
          <article-title>Overview of the author identification task at pan 2014</article-title>
          .
          <article-title>In CLEF 2014 Evaluation Labs</article-title>
          and Workshop Working Notes Papers, Sheffield, UK,
          <year>2014</year>
          , pages
          <fpage>1</fpage>
          -
          <lpage>21</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          <string-name>
            <given-names>Efstathios</given-names>
            <surname>Stamatatos</surname>
          </string-name>
          , Walter Daelemans, Ben Verhoeven, Patrick Juola, Aurelio Lo´
          <fpage>pez</fpage>
          -Lo´pez, Martin Potthast, and
          <string-name>
            <given-names>Benno</given-names>
            <surname>Stein</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Overview of the author identification task at pan 2015</article-title>
          .
          <article-title>clef 2015 evaluation labs</article-title>
          and workshop, online working notes, toulouse, france.
          <source>In CEUR Workshop Proceedings</source>
          , pages
          <fpage>1</fpage>
          -
          <lpage>17</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          <string-name>
            <given-names>Efstathios</given-names>
            <surname>Stamatatos</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>A survey of modern authorship attribution methods</article-title>
          .
          <source>Journal of the American Society for information Science and Technology</source>
          ,
          <volume>60</volume>
          (
          <issue>3</issue>
          ):
          <fpage>538</fpage>
          -
          <lpage>556</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          <string-name>
            <given-names>Michail</given-names>
            <surname>Tsikerdekis</surname>
          </string-name>
          and
          <string-name>
            <given-names>Sherali</given-names>
            <surname>Zeadally</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Multiple account identity deception detection in social media using nonverbal behavior</article-title>
          .
          <source>IEEE Transactions on Information Forensics and Security</source>
          ,
          <volume>9</volume>
          (
          <issue>8</issue>
          ):
          <fpage>1311</fpage>
          -
          <lpage>1321</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          <string-name>
            <given-names>Arjuna</given-names>
            <surname>Tuzzi and Michele A Cortelazzo</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <source>Drawing Elena Ferrante's Profile: Workshop Proceedings</source>
          , Padova, 7
          <article-title>September 2017</article-title>
          . Padova UP.
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          <string-name>
            <surname>Rob van der Goot</surname>
          </string-name>
          , Nikola Ljubesˇic´,
          <string-name>
            <surname>Ian</surname>
            <given-names>Matroos</given-names>
          </string-name>
          , Malvina Nissim, and
          <string-name>
            <given-names>Barbara</given-names>
            <surname>Plank</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Bleaching text: Abstract features for cross-lingual gender prediction</article-title>
          . arXiv preprint arXiv:
          <year>1805</year>
          .03122.
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          <string-name>
            <given-names>Min</given-names>
            <surname>Yang</surname>
          </string-name>
          and
          <string-name>
            <surname>Kam-Pui Chow</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Authorship attribution for forensic investigation with thousands of authors</article-title>
          .
          <source>In IFIP International Information Security Conference</source>
          , pages
          <fpage>339</fpage>
          -
          <lpage>350</lpage>
          . Springer.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>