<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Bot and Gender Detection using Textual and Stylistic Information</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>PRHLT Research Center, Universitat Politècnica de València</institution>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <abstract>
        <p>In this paper, we report our participation in the Bots and Gender Profiling task at PAN 2019. Our methodology on bots detection takes advantage of the tweets' textual and stylistic information. For the bots detection task, we propose a linear Support Vector Machine (SVM) classifier trained on words and character grams and on additional features that represent the variation in the use of the different stylistic features. For gender identification, we propose a Stochastic Gradient Descent (SGD) classifier trained on words and character grams, sentiment of tweets and the Pointwise Mutual Information (PMI) of terms. The PMI features represent the importance of the terms per gender. We have managed to achieve average accuracy scores of 0.881 and 0.7105 for the bot and gender identification tasks respectively.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The rise of social media has changed the way that people communicate and interact.
Social media have given the opportunity to people to post and share their thoughts
and opinions on any topic. However, they also allow malicious accounts to post and
propagate fake news that have negative effects on the society. One common way to
create fake news are the social bots. Social bots are computer algorithms that exhibit
human-like behavior and can generate content and interact with users. Although there
are some bots that perform useful tasks, there is also a growing number of bots that
perform malicious functions and which aim to emulate and alter users’ behavior [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. For
example, some bots can negatively affect democratic political discussions and influence
public opinion during a presidential campaign or can promote terrorist propaganda and
recruitment [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        Detecting the bot accounts in an automatic way is very important for preventing the
dissemination of fake news. Additionally, knowing the personality traits of social media
users can potentially help to understand the personality traits that make users more
vulnerable to propagate fake news or even to generate fake news or rumours [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. Author
profiling focuses on understanding different characteristics of the users depending on
the linguistic patterns they use.
      </p>
      <p>
        In this paper we present our system for bot and gender detection shared task at
PAN 2019 [
        <xref ref-type="bibr" rid="ref16 ref5">16,5</xref>
        ]. The task focuses on investigating whether the author of a Twitter
feed is a bot or a human, and in case of human, profiling the gender of the author. The
task includes tweets in English and Spanish. For the bot detection task, we hypothesise
that bots tend to post tweets with the same number of stylistic features (e.g., links,
mentions). Therefore, we propose features that represent the standard deviation in the
use of stylistic features. For the gender detection task, we assume that there are words
that are more common in the tweets posted by male and words that are more common in
the tweets posted by female users. Therefore, we propose using the Pointwise Mutual
Information (PMI) of terms to assign a weight score to each term by gender class.
Additionally, we use the emotion and sentiment scores expressed in the tweets assuming
that different emotions are expressed in the tweets posted by male and female users.
Both the bot and gender detection tasks are defined as binary classification tasks.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        Bot and gender detection have attracted a lot of research attention in the last years. Varol
et al. [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] proposed a machine learning system that extracts features from six different
categories: users and friends meta-data, tweet content and sentiment, network patterns,
and activity time series to differentiate between bot and human accounts. Cai et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]
proposed a behavior enhanced deep model for bot detection. They proposed to extract
latent temporal patterns based on user content. Additionally, they proposed to combine
content and behavior information using deep learning method. A great number of
researchers have focused on credibility and fake news detection. For example, Giachanou
et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] proposed EmoCred that incorporates emotions that are expressed in the claims
into an LSTM network to differentiate between fake and real claims.
      </p>
      <p>
        Gender detection has also attracted a lot of research attention. Schler et al. [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]
performed gender classification on a corpus of 71,000 blogs using unigrams with the
highest information gain and stylistic features. The results showed that male bloggers
write more about politics and technology, while female bloggers write more about their
personal lives. Rangel and Rosso [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] proposed the EmoGraph approach to capture how
users convey verbal emotions in the morphosyntactic structure of the discourse.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Bots and Gender Detection Systems</title>
      <p>First, we perform some preprocessing that is the same for both the bot and gender
detection. The preprocessing includes the following steps:
– Concatenate all 100 tweets of each author into one document
– Replace URLs with the tag &lt;URL&gt;
– Replace the mentions with the tag &lt;MENTION&gt;
– Replace the symbol of # with the tag &lt;HASHTAG&gt;
– Lowercase all the characters</p>
      <p>Here, we should mention that we did not remove the stopwords for any of the tasks.
3.1</p>
      <sec id="sec-3-1">
        <title>Bots Detection System</title>
        <p>
          Intuitively, content is the most important for the bots prediction task. To this end, our
systems start with word grams and chargrams. Although, word grams and chargrams
are simple features, they have been shown to be important for various tasks such as
emotional reactions prediction [
          <xref ref-type="bibr" rid="ref8 ref9">8,9</xref>
          ] as well as information retrieval tasks [
          <xref ref-type="bibr" rid="ref1 ref11">1,11</xref>
          ]. In our
bot detection system we use word grams that range from 1 to 3 and chargrams that
range from 2 to 6.
        </p>
        <p>In addition, we define some new features that capture how much the stylistic
features of tweets posted by the same user vary. The intuition is that the bots will post
tweets that are similar in terms of the number of each stylistic feature they contain (e.g.,
hashtags, mentions, exclamation marks), whereas the humans’ tweets will be more
diverse. More formally, let T = ft1; :::; ti; :::; tjT jg be the list of the tweets posted by
a user. Also, let V = fv1; :::; vi; :::; vjV jg be a list of different stylistic variations and
c(t; v) the number of occurrences of stylistic variation v in a tweet t. Then we can
calculate the deviations in the occurrences as:</p>
        <p>SDv =
s
jc(t; v)</p>
        <p>T
v j
2
where v is the mean of the occurrences of the variation v. We calculate all the
deviations in a similar way. We use the deviations of the following stylistic variations:
exclamation marks, question marks, negative emoticons, positive emoticons, terms in
capital, terms with repeated characters, mentions, links, hashtags. In addition, we count
the number of duplicate tweets.</p>
        <p>We train a SVM classifier on the proposed features for the prediction in the bots
detection system.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Gender Detection System</title>
        <p>
          For the gender detection task, we learn the weights of the words for each gender class.
To learn those weights, we use the Pointwise Mutual Information (PMI) method
originally proposed by Church and Hanks [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. According to this approach, every term w is
assigned a PMI score for each of the two gender classes: male and female. The PMI
score for a term w regarding the male class is calculated as follows:
c(w; male) N
        </p>
        <p>P M I (w; male) = log2 c(w) c(male)
where c(w; male) is the frequency of the term w in the tweets posted by a male user,
N is the total number of words in the corpus, c(w) is the frequency of the term in the
corpus and c(male) is the number of terms in the tweets posted by male users. The PMI
of the terms for the female class is calculated in a similar way. Then the total PMI score
for a document d regarding the male class can be calculated as:</p>
        <p>P M I (d; male) = X P M I (w; male)</p>
        <p>Then these scores are used as additional to the word grams and chargrams features
during the training phase. For English gender detection we use word grams that range
from 1 to 3 and chargrams that range from 3 to 4, whereas for Spanish we use word
grams from 1 to 2 and chargrams from 2 to 6. In addition, for every document d we
calculate the emotion and sentiment scores. We follow a lexicon based approach to
calculate these scores. More specifically, we simply count the number of occurrences
of the emotional and sentimental words that occur in a document d. For the tweets that
are written in English, we focus on positive, negative, anger, anticipation, disgust, fear,
joy, surprise, trust and sadness whereas for the Spanish we focus only on positive and
negative. Finally, we count the number of emoticons that appear in the text. We train a
SGD classifier on the proposed features for the gender prediction task.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Experimental Setup</title>
      <p>4.1</p>
      <sec id="sec-4-1">
        <title>Dataset</title>
        <p>In this section we present the dataset and the experimental settings of our methodology.
The dataset consists of tweets in English and Spanish. Table 1 shows the statistics on
the dataset.</p>
        <p>
          We observe that the English collection is bigger than the Spanish. Here we should
mention that the dataset is balanced over the classes for both bot and gender detection.
For each user, a total of 100 tweets are provided.
The submission of our system was made from the TIRA platform [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. We examined
different classifiers including Logistic Regression, Random Forest, Support Vector
Machine and Stochastic Gradient Descent. Regarding the bot detection task we obtained
the best results with a linear SVM and regarding gender detection with SGD. Regarding
SVM, we set the penalty parameter C to 10. For both SVM and SGD we use the square
hinge loss. For the implementation of our system we use scikit-learn library1. To create
our word and char grams we use the hashing vectorizer provided by scikit-learn library.
1 https://scikit-learn.org
        </p>
        <p>
          As already mentioned, on the gender detection task we use the emotions expressed
in the text as additional features. For the English tweets we use the
NRC-EmotionLexicon [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] whereas for the Spanish tweets we use the Spanish lexicon presented by
Perez-Rosas et al. [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. The submitted systems are compared with several deadlines
including majority, random, char grams, word grams, word embeddings and the Low
Dimensionality Statistical Embedding approach [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ].
5
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Results</title>
      <p>words &amp; char grams
words &amp; char grams &amp; deviations
words &amp; char grams &amp; duplicates
words &amp; char grams &amp; deviations &amp; duplicates</p>
      <p>Table 4 presents the results of different combinations of features on the development
set regarding the gender detection task. We observe that for the English gender detection
the best result is achieved when all the features are combined. Regarding the Spanish
gender detection, the best result is achieved with the word and char grams. For our final
system, we decided to use the combination of all the features.</p>
      <p>words &amp; char grams
words &amp; char grams &amp; PMI
words &amp; char grams &amp; PMI &amp; sentiments
In this paper we described our system for bot and gender detection task at PAN 2019.
Regarding the bot detection we proposed a system trained on textual and stylistic
features, whereas for the gender detection we proposed a system based on word and char
grams, PMI weights of terms and sentiment features.</p>
      <p>Our results showed that words and char grams are very important features for the
bot and gender detection. Also, we showed that our system that was trained on the
combination of all the proposed features managed to achieve 0.906 and 0.856 accuracy
scores for the English and Spanish bot detection respectively. In addition, from our
results we observe that our system on the bot detection performs better compared to our
system on the gender detection.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments. References</title>
      <p>The first author is supported by the SNSF Early Postdoc Mobility grant P2TIP2_181441
under the project Early Fake News Detection on Social Media, Switzerland.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Aliannejadi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Crestani</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Venue suggestion using social-centric scores</article-title>
          . CoRR abs/
          <year>1803</year>
          .08354 (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Berger</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morgan</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>The isis twitter census: Defining and describing the population of isis supporters on twitter</article-title>
          .
          <source>The Brookings Project on US Relations with the Islamic World</source>
          <volume>3</volume>
          (
          <issue>20</issue>
          ),
          <fpage>4</fpage>
          -
          <lpage>1</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Cai</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zengi</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Behavior enhanced deep bot detection in social media</article-title>
          .
          <source>In: Proceedings of the 2017 IEEE International Conference on Intelligence and Security Informatics</source>
          . pp.
          <fpage>128</fpage>
          -
          <lpage>130</lpage>
          . ISI '
          <volume>17</volume>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Church</surname>
            ,
            <given-names>K.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hanks</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Word association norms, mutual information, and lexicography</article-title>
          .
          <source>Computational Linguistics</source>
          <volume>16</volume>
          (
          <issue>1</issue>
          ),
          <fpage>22</fpage>
          -
          <lpage>29</lpage>
          (
          <year>1990</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Daelemans</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kestemont</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Manjavancas</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Potthast</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rangel</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosso</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Specht</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stamatatos</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stein</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tschuggnall</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wiegmann</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zangerle</surname>
          </string-name>
          , E.: Overview of PAN 2019:
          <article-title>Author Profiling, Celebrity Profiling, Cross-domain Authorship Attribution and Style Change Detection</article-title>
          .
          <source>In: Proceedings of the Tenth International Conference of the CLEF Association (CLEF</source>
          <year>2019</year>
          ). Springer (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Ferrara</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Varol</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Davis</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Menczer</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Flammini</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>The rise of social bots</article-title>
          .
          <source>Communications of the ACM</source>
          <volume>59</volume>
          (
          <issue>7</issue>
          ),
          <fpage>96</fpage>
          -
          <lpage>104</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Giachanou</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosso</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Crestani</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Leveraging emotional signals for credibility detection</article-title>
          .
          <source>In: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval</source>
          . SIGIR '
          <volume>19</volume>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Giachanou</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosso</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mele</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Crestani</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Early commenting features for emotional reactions prediction</article-title>
          .
          <source>In: Proceedings of the International Symposium on String Processing and Information Retrieval</source>
          . pp.
          <fpage>168</fpage>
          -
          <lpage>182</lpage>
          . SPIRE '
          <volume>18</volume>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Giachanou</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosso</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mele</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Crestani</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Emotional influence prediction of news posts</article-title>
          .
          <source>In: Proceedings of the 12th International AAAI Conference on Web and Social Media</source>
          . pp.
          <fpage>592</fpage>
          -
          <lpage>595</lpage>
          . ICWSM '
          <volume>18</volume>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Mohammad</surname>
            ,
            <given-names>S.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Turney</surname>
          </string-name>
          , P.D.:
          <article-title>Crowdsourcing a word-emotion association lexicon</article-title>
          .
          <source>Computational Intelligence</source>
          <volume>29</volume>
          (
          <issue>3</issue>
          ),
          <fpage>436</fpage>
          -
          <lpage>465</lpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Paltoglou</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Giachanou</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Opinion Retrieval: Searching for Opinions in Social Media</article-title>
          , pp.
          <fpage>193</fpage>
          -
          <lpage>214</lpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Perez-Rosas</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Banea</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mihalcea</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Learning sentiment lexicons in spanish</article-title>
          .
          <source>In: Proceedings of the 8th International Conference on Language Resources and Evaluation</source>
          . pp.
          <fpage>3077</fpage>
          -
          <lpage>3081</lpage>
          . LREC '
          <volume>12</volume>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Potthast</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gollub</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wiegmann</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stein</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>TIRA Integrated Research Architecture</article-title>
          . In: Ferro,
          <string-name>
            <given-names>N.</given-names>
            ,
            <surname>Peters</surname>
          </string-name>
          ,
          <string-name>
            <surname>C</surname>
          </string-name>
          . (eds.)
          <article-title>Information Retrieval Evaluation in a Changing World - Lessons Learned from 20 Years of</article-title>
          CLEF. Springer (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Rangel</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Franco-Salvador</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosso</surname>
            ,
            <given-names>P.:</given-names>
          </string-name>
          <article-title>A low dimensionality representation forÂa˘language variety identification</article-title>
          .
          <source>In: Computational Linguistics and Intelligent Text Processing</source>
          . pp.
          <fpage>156</fpage>
          -
          <lpage>169</lpage>
          .
          <source>CICLing</source>
          <year>2016</year>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Rangel</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosso</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>On the impact of emotions on author profiling</article-title>
          .
          <source>Information processing &amp; management 52(1)</source>
          ,
          <fpage>73</fpage>
          -
          <lpage>92</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Rangel</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosso</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Overview of the 7th Author Profiling Task at PAN 2019: Bots and Gender Profiling</article-title>
          . In:
          <article-title>CLEF 2019 Labs and Workshops, Notebook Papers</article-title>
          .
          <source>CEUR-WS.org (Sep</source>
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Schler</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koppel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Argamon</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pennebaker</surname>
            ,
            <given-names>J.W.:</given-names>
          </string-name>
          <article-title>Effects of age and gender on blogging</article-title>
          . In: AAAI spring symposium:
          <article-title>Computational approaches to analyzing weblogs</article-title>
          .
          <source>vol. 6</source>
          , pp.
          <fpage>199</fpage>
          -
          <lpage>205</lpage>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Shu</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , Liu, H.:
          <article-title>Understanding user profiles on social media for fake news detection</article-title>
          .
          <source>In: Proceedings of the IEEE 1st Conference on Multimedia Information Processing and Retrieval</source>
          . pp.
          <fpage>430</fpage>
          -
          <lpage>435</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Varol</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ferrara</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Davis</surname>
            ,
            <given-names>C.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Menczer</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Flammini</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Online human-bot interactions: Detection, estimation, and characterization</article-title>
          .
          <source>In: Proceedings of the 11th International AAAI Conference on Web and Social Media</source>
          .
          <source>ICWSM</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>