<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Deep learning Model to predict gender, age and occupation of the celebrities based on tweets followers</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Roobaea Alroobaea</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ahmed H. Almulihi</string-name>
          <email>a.almulihi@tu.edu.sa</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fahd S. Alharithi</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Seifeddine Mechti</string-name>
          <email>mechtiseif@gmail.com</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Moez Krichen</string-name>
          <email>moez.krichen@redcad.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lamia Hadrich Belguith</string-name>
          <email>l.belguith@gmail.com</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>ANLP Group, MIRACL Laboratory, University of Sfax</institution>
          ,
          <addr-line>3018,Sfax</addr-line>
          <country country="TN">Tunisia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>College of Computers and Information Technology, Taif University</institution>
          ,
          <country country="SA">Saudi Arabia</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>The laboratory on Development and Control of Distributed Applications</institution>
          ,
          <addr-line>ReDCAD</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2020</year>
      </pub-date>
      <abstract>
        <p>This paper present the methods are used for detection of celebrity profiles on Twitter when participating in PAN @ CLEF 2020. We have tried to predict the age, gender and occupation of celebrities based on their tweets followers. Our method is based on the use of deep learning techniques to discriminate between authors. The results obtained by the team ''TUKSA20'' are encouraging, indeed we obtained the 1st rank for the prediction of the gender, the second for the prediction of the occupation, and for the age the results obtained are less good.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        Author profiling is the study which consists in recognizing certain dimensions of the
author's profile based on the stylistic features of their writings [
        <xref ref-type="bibr" rid="ref3">Maharjan et al., 2014</xref>
        ].
The dimensions targeted by profiling particularly concern the demographic and
sociocultural aspects of the author, such as his age, sex, personality, mother tongue, native
region, and recently the occupation [
        <xref ref-type="bibr" rid="ref13">Alroobaea, 2020</xref>
        ;
        <xref ref-type="bibr" rid="ref1 ref2">Wiegmann et al.,2019</xref>
        ].
[Koppel, 2002; Argamon et al., 2009; Pennebaker, 2011].
      </p>
      <p>Recently, the prediction of the author’s occupation has been the focus of work on the
author’s profile detection. The question is whether the author is a politician, footballer,
artist, singer, scientist, etc. predicting occupation has an important role in many domains
such as websites bidding and forensic linguistics.</p>
      <p>The Celebrity Profiling task 2020 is to develop a piece of software which predicts three
demographics of a celebrity from the texts of their followers: occupation, age, and
gender.</p>
      <p>In this paper, we try to implement a system for detecting the age, gender and occupation
of celebrities using deep learning techniques. The remainder of this paper is organized
as follows; Section 2 presents a brief state of the art of author profile prediction methods.
Section 3 introduces our method used to discriminate between authors. Section 4
presents the experiments as well as the evaluations. Finally, the summarization with
conclusion will be mentioned.</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <sec id="sec-2-1">
        <title>Stylistic features</title>
        <p>
          The study of [
          <xref ref-type="bibr" rid="ref1 ref2">Wiegmann et al., 2019</xref>
          ] distinguished two types of attributes: stylistic
attributes (style-based features) and attributes based on content (content features).
In determining the age or gender of a writer, it is important to consider functional words,
such as prepositions, pronouns and determiners. These attributes have been shown to be
effective in detecting an author’s profile. [
          <xref ref-type="bibr" rid="ref6">Cruz et al., 2013</xref>
          ]. In other work, the authors
use the frequency of punctuation marks, the frequency of capital letters and citations
[Aleman et al., 2013]. Similarly, HTML attributes, such as image URLs or a web page
link, have been used by [
          <xref ref-type="bibr" rid="ref7">Sapkota et al., 2013</xref>
          ]. In the work of [Patra et al., 2013] the
authors are based on specific terms (foreign words) for the distinction between the
authors. Similarly, other authors use a calculation of the frequency of emoticons as a
discriminating attribute for the prediction of the authors [
          <xref ref-type="bibr" rid="ref8">Irazu et al., 2013</xref>
          ].
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Content features</title>
        <p>In addition to the style used, the occurrence of the themes used can differentiate
several age categories. [Argamon et al., 2009].</p>
        <p>
          The age of groups generally represents adolescents, young people and adults.
Adolescents were grouped in the '10s' category presenting individuals between 13 and
17 years of age, young people in the '20s' category representing those between 23 and
27 years of age and finally adults in the '30s' category representing those aged between
33 and 47 years [
          <xref ref-type="bibr" rid="ref9">Schler et al., 2006</xref>
          ].
        </p>
        <p>
          The work carried out by Koppel et al. have shown that at the gender level. Men prefer
to categorize things using more determinants (le / la, ce / ce, un / une, etc.) and quantifiers
(two, more, few, etc.). On the other hand, women frequently use personal pronouns in
their writings (I, you, me, etc.) [
          <xref ref-type="bibr" rid="ref4">Koppel et al., 2003</xref>
          ].
        </p>
        <p>
          <xref ref-type="bibr" rid="ref5">Argamon et al. (2009)</xref>
          have done a study on the British national corpus using the
elements of speech features. They obtained an 80% prediction for the determination of
the genus [Argamon et al., 2009]. For their part, [Nguyen et al., 2013] have tried to
predict age in the conversations of Dutch Twitter users. They used traits from the
language model combined with parts of speech. 74% of the discussions were well
classified. They obtained an average margin of error between 4.1 and 6.8 years for age
prediction. Similarly, in [Zhang and Zhang, 2010], the authors worked on blog segments
using word-type features, punctuations, average words per sentence, length of sentences,
part of speech and the rate of prediction of the gender which amounts to 72%.
          <xref ref-type="bibr" rid="ref10">Peersman
et al. (2011)</xref>
          used a Netlog corpus by experimenting with the 2-gram, and 3-gram
unigrams. The average obtained for the prediction of age and gender is 88% [Peersman
et al., 2011].
        </p>
        <p>
          In [Gaustad et al., 2007], the authors are interested in the automatic classification of
mails. They obtained an accuracy rate of 81% for gender and 72% for age. We cite
among others, the work of [Kose et al., 2007]; [
          <xref ref-type="bibr" rid="ref12">Hariharan et al, 2011</xref>
          ] which have shown
promising results in detecting the author’s gender in instant discussions. For their part,
Maharjan et al. have developed a system based entirely on the Map Reduce technique
for predicting age and gender. This technique is a parallel programming model allowing
the processing of a very large mass of data (Big data). These real-time programs are
executed on partitions (clusters) and are automatically parallelized [
          <xref ref-type="bibr" rid="ref3">Maharjan et al.,
2014</xref>
          ]. This system which employs a large number of functionalities is capable of
performing the prediction task in a fraction of time with good details. This approach
demonstrates that the use of these systems can be a perfect solution for data quantity
problems and / or a large number of attributes.
        </p>
        <p>The objective of the study by Rangel et al. (2016) was to show how people use language
to express their emotions and how it can help them identifying their age and gender.
They used a graphical approach called "EmoGraph" which allows the text to be
represented by a graph of parts of speech labeled by emotions and themes. The authors
showed results comparable to those of the best systems presented at the PAN @ CLEF
2013 evaluation conference [Rangel et al., 2016].
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Methodology</title>
      <sec id="sec-3-1">
        <title>3.1 Preprocessing</title>
        <p>
          Plain text should be filtered to remove noisy data, such as HTML tags and urls. This is
because the presence of this noisy data could affect and reduce the accuracy of the entire
analysis. [
          <xref ref-type="bibr" rid="ref11">Derczynski et al., 2013</xref>
          ] pointed out that ''twitter text is difficult to
part-ofspeech tag: it is noisy, with linguistic errors and idiosyncratic style''. Thus, preprocessing
stage is important step. After that the cleaned data is then stored in a database.
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2 text analysis</title>
        <p>In this step, it is a matter of carrying out linguistic and statistical processing to find the
useful information allowing to subsequently achieve the best classification of
documents. To achieve our objective, we calculated the number of occurrences of all the
words that occurred in the corpus and thus obtained a list of words sorted by frequency.
Note that the text analysis step is repeated for each age group and for each gender. Then
we tried to gather these terms into thematic classes by adopting a semi-automatic
annotation which follows the following two steps:
-Automatic procedure: During this annotation, we based ourselves on the automatic
search of thematic classes for the attributes based on the content and of syntactic classes
for the attributes based on the style. Furthermore, the definition of thematic classes is
based on the notion of a universe of discourse. Indeed, all the terms presenting a semantic
relation are gathered around a centroid which bears the name of this class. In this
annotation, we use the Wordnet-Similarity tool to extract the semantic distances between
terms. We use synonymy, hypernymy and hypomimy relationships to detect
generalization and specification. At the end of this step all the terms having a semantic
distance with the “synset” belong to the same class. It should be noted that the stylistic
classes are obtained using the morpho-syntactic analyzer in order to be able to
differentiate prepositions, verbs, pronouns, etc.
- Manual procedure: Since the automatic annotation does not allow assigning all the
terms found in a class, we have had recourse to the manual annotation which is inspired
by the thematic classification proposed by [Argamon et al., 2009]. It consists of
manually grouping the most frequent terms in the same class. This operation is
performed by three different linguists.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3 Features extraction</title>
        <p>
          Recent methods differentiate two main categories of features that can be used to
predict the profile of the author: Stylistic features and those based on content
[
          <xref ref-type="bibr" rid="ref1 ref2">Wiegmann, 2019</xref>
          ] .In order to measure the relevance of a term in a given document
in the corpus, we used the TF * IDF measure which is calculated as follows:
[Buckley et al., 1996]
        </p>
        <p>1.1
With:
d: documents,
W: the word,
D: one document є D,
fw,d : number of occurrence of w in d,
|D|: size of the corpora,
fw, D : number of documents.</p>
        <p>After forming the classes, the trait selection is used to elucidate which ones allow
better discrimination. This step is essential since the initial calculation of the frequency
of terms in the corpus does not provide information on the ability of the classes found to
discriminate between the different age / gender categories. In addition, the selection of
attributes is a very important factor which eliminates redundant attributes and which
results in improved classification performance.</p>
        <p>It should be noted that the selection of attributes can be done in two ways. The first is an
"a priori" selection which allows you to filter the attributes before starting the
classification step. In this case, it is a question of employing filtering algorithms making
it possible to find, for example, the best K most discriminating attributes. As for the
second, it is an "a posteriori" selection which exploits the classification process: each
time it is a question of using a set of attributes for learning and of refining the results as
and when. This iterative process stops when the classification result is optimal.
Therefore, the attributes allowing to obtain the best results are retained as the most
discriminating attributes.</p>
      </sec>
      <sec id="sec-3-4">
        <title>3.4 Classification</title>
        <p>Once the attributes have been defined, the classification of the texts is proceed. It is
started with the construction of the two learning matrices relating to the gender and age
dimension. However, we have opted to enrich our base of thematic classes. The idea is
that each test corpus can contain terms semantically linked to existing terms in the
learning corpus. Therefore, expanding these terms is a step towards improving
relevance.</p>
        <p>It should be noted that the enrichment of the base of thematic terms is defined as being
the process of completing the original terms of the thematic classification proposed by
associating them with semantically similar terms. It is one of the solutions envisaged to
enrich the proposed classes. Similar to the text analysis stage, in this enrichment process,
we have based ourselves on the exploitation of semantic links like synonymy, hyponymy
or hypernymy for the enrichment process.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>5 Experimentations and Results</title>
      <sec id="sec-4-1">
        <title>The LSTM deep learning model is presented in figure 1: Fig 1: Architecture of the LSTM deep learning Model In the evaluation process, Submissions are judged by a combined metric cRank, which is the harmonic mean of each label's metric.</title>
        <p>All features are evaluated by their respective F1. Precision and recall of birth-year are
calculated leniently. If a prediction is within an m-window of the truth, it is counted as
correct:</p>
        <p>The window size m is based on the birth-year and increases linearly from about 3
years for 1999 to about 9 years for 1940.</p>
        <p>After the evaluation process our team ''TUKSA20" obtain third rank in term of
CRANK and first rank for the prediction of the gender with 0.69. For the occupation we
obtain second Rank VS Hodge20.</p>
        <p>TEST-DATASET</p>
        <sec id="sec-4-1-1">
          <title>GENDER</title>
        </sec>
        <sec id="sec-4-1-2">
          <title>OCCUPATION CRANK</title>
          <p>0.631
0.577
0.521
0.477
0.469
0.333
AGE
0.500
0.432
0.407
0.315
0.362
0.333
0.753
0.681
0.616
0.696
0.584
0.500
0.700
0.707
0.597
0.598
0.521
0.250
TEAM</p>
        </sec>
      </sec>
      <sec id="sec-4-2">
        <title>Baseline-ngram-celebritytweets hodge20 koloski20</title>
        <p>TUKSA20
Baseline-ngram-followertweets
Random</p>
        <sec id="sec-4-2-1">
          <title>6 Conclusion</title>
          <p>This paper presents our method to predict gender, age and occupation of the
celebrities based on tweets followers. For the prediction of gender and occupation, good
results were obtain based on LSTM (Long Short term memory) architecture. Indeed,
TUKSA20 ranked the first for the gender prediction. However, the age prediction results
were less good because of the unbalanced data. As a perspective, other deep learning
models need to be test such as CNN (Convolutional Neural Network) , GRu ( Gated
recurrent Units) and GAN ( Adversarial Neural network).</p>
        </sec>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Wiegmann</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stein</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Potthast</surname>
            <given-names>M. Celebrity</given-names>
          </string-name>
          <string-name>
            <surname>Profiling</surname>
          </string-name>
          .
          <article-title>Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL</article-title>
          <year>2019</year>
          ),
          <year>July 2019</year>
          . ACL.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Wiegmann</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stein</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Potthast</surname>
            <given-names>M.</given-names>
          </string-name>
          <article-title>Overview of the Celebrity Profiling Task at PAN 2019</article-title>
          .
          <article-title>CLEF 2019 Labs and Workshops</article-title>
          , Notebook Papers,
          <year>September 2019</year>
          . CEURWS.org.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Maharjan S.</given-names>
            ,
            <surname>Shrestha</surname>
          </string-name>
          <string-name>
            <given-names>P.</given-names>
            and
            <surname>Solorio</surname>
          </string-name>
          <string-name>
            <surname>T.</surname>
          </string-name>
          <article-title>A simple approach to author profiling in MapReduce</article-title>
          .
          <source>In proceedings of the 4th Pan at conference and labs of the evaluation forum (CLEF)</source>
          , pp.
          <fpage>1121</fpage>
          -
          <lpage>1128</lpage>
          , England,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Koppel</surname>
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Argamon</surname>
            <given-names>S.</given-names>
          </string-name>
          and
          <article-title>Shimoni A. Automatically categorizing written texts by author, gender</article-title>
          .
          <source>Literary and Linguistic Computing</source>
          , pp.
          <fpage>401</fpage>
          -
          <lpage>412</lpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Argamon</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koppel</surname>
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pennebaker</surname>
            <given-names>J.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Schler J. Automatically</surname>
          </string-name>
          <article-title>Profiling the Author of an Anonymous Text</article-title>
          .
          <source>Communications of the ACM</source>
          ,
          <volume>52</volume>
          (
          <issue>2</issue>
          ), pp.
          <fpage>119</fpage>
          -
          <lpage>123</lpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>Cruz F.</given-names>
            ,
            <surname>Haro</surname>
          </string-name>
          . R.,
          <source>and Ortega J. ITALICA at PAN</source>
          <year>2013</year>
          :
          <article-title>An Ensemble Learning Approach to Author Profiling. In Proceedings of the 4th PAN at conference and labs of the evaluation forum (CLEF)</article-title>
          .
          <source>Spain</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Sapkota U.</given-names>
            ,
            <surname>Solorio</surname>
          </string-name>
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Montes-</surname>
          </string-name>
          y
          <string-name>
            <surname>-Gómez</surname>
            <given-names>M.</given-names>
          </string-name>
          and
          <string-name>
            <surname>De-la-Rosa</surname>
            <given-names>G</given-names>
          </string-name>
          .
          <article-title>Author profiling for English and Spanish text</article-title>
          .
          <source>In Proceedings of the 4th PAN at conference and labs of the evaluation forum (CLEF)</source>
          ,
          <year>Spain</year>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Irazu D.</given-names>
            ,
            <surname>Farias</surname>
          </string-name>
          <string-name>
            <given-names>H.</given-names>
            ,
            <surname>Guzman-Cabrera</surname>
          </string-name>
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Reyes</surname>
          </string-name>
          <string-name>
            <given-names>A.</given-names>
            and
            <surname>Rocha</surname>
          </string-name>
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Semantic-based Features for Author Profiling Identification</article-title>
          .
          <source>In Proceedings of the 4th PAN at conference and labs of the evaluation forum (CLEF)</source>
          ,
          <year>Spain</year>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <given-names>Schler J.</given-names>
            ,
            <surname>Koppel</surname>
          </string-name>
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Argamon</surname>
          </string-name>
          <string-name>
            <given-names>S.</given-names>
            and
            <surname>Pennebaker</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.</surname>
          </string-name>
          <article-title>Effects of age and gender on blogging</article-title>
          .
          <source>AAAI Spring Symposium on Computational Approaches for Analyzing Weblogs</source>
          , pp.
          <fpage>191</fpage>
          -
          <lpage>197</lpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>Peersman C.</given-names>
            ,
            <surname>Daelemans</surname>
          </string-name>
          <string-name>
            <surname>W</surname>
          </string-name>
          and Van Vaerenbergh L.
          <article-title>Predicting age and gender in online social networks</article-title>
          .
          <source>In Proceedings of the 3rd international workshop on Search and mining user-generated contents</source>
          ,
          <source>(SMUC)</source>
          , pp.
          <fpage>37</fpage>
          -
          <lpage>44</lpage>
          , USA, ACM,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Derczynski</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ritter</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Clark</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Bontcheva</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <year>2013</year>
          ,
          <string-name>
            <surname>September.</surname>
          </string-name>
          <article-title>Twitter partof-speech tagging for all: Overcoming sparse and noisy data</article-title>
          .
          <source>In Proceedings of the International Conference Recent Advances in Natural Language Processing RANLP</source>
          <year>2013</year>
          (pp.
          <fpage>198</fpage>
          -
          <lpage>206</lpage>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Hariharan S.</surname>
          </string-name>
          <article-title>Gender prediction in chat based medium's using text mining</article-title>
          . In
          <source>International Journal of Research and Reviews in Information Sciences (IJRRIS)</source>
          ,
          <volume>1</volume>
          (
          <issue>1</issue>
          ), pp.
          <fpage>18</fpage>
          -
          <lpage>22</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Alroobaea</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <article-title>An Empirical combination of Machine Learning models to Enhance author profiling performance</article-title>
          . In
          <source>International Journal of Advanced Trends in Computer Science and Engineering</source>
          ,
          <volume>9</volume>
          (
          <issue>2</issue>
          ), pp.
          <fpage>2130</fpage>
          -
          <lpage>2137</lpage>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>LI J.</given-names>
            ,
            <surname>Zheng</surname>
          </string-name>
          <string-name>
            <given-names>R.</given-names>
            and
            <surname>Chen H</surname>
          </string-name>
          .
          <article-title>From fingerprint to writeprint</article-title>
          .
          <source>Communication of the ACM</source>
          ,
          <volume>49</volume>
          (
          <issue>4</issue>
          ), pp.
          <fpage>76</fpage>
          -
          <lpage>82</lpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Potthast M and Gollub Tand Wiegmann</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stein</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <article-title>Information Retrieval Evaluation in a Changing World,Nicola Ferro</article-title>
          and Carol Peters, sep,Springer,TIRA Integrated Research Architecture,
          <year>2019</year>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Wiegmann M and Stein</surname>
            <given-names>B</given-names>
          </string-name>
          and
          <string-name>
            <surname>Potthast</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <article-title>Working Notes Papers of the CLEF 2020 Evaluation Labs, sep, CLEF and CEUR-WS.org, Overview of the Celebrity Profiling Task at</article-title>
          PAN,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>