<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Identi cation of Depression Strength for Users of Online Platforms: A Comparison of Text Retrieval Approaches</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ayan Bandyopadhyay</string-name>
          <email>bandyopadhyay.ayan@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Linda Achilles</string-name>
          <email>achilles@uni-hildesheim.de</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Thomas Mandl</string-name>
          <email>mandl@uni-hildesheim.de</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mandar Mitra</string-name>
          <email>mandar@isical.ac.in</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sanjoy Kr. Saha</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Indian Statistical Institute</institution>
          ,
          <country country="IN">India</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Jadavpur University</institution>
          ,
          <country>India sks</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Hildesheim</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Social media became one of the most popular platform to express feelings and thoughts in the world of digital information sharing. Facebook, Snapchat, Instagram, QQ, Weibo, Twitter, Tumblr, Reddit and LinkedIn are among the most popular social networks. They are used to share, spread and create new information, receive and spread news locally, globally or privately. Many citizens share their feelings and thoughts in social media, consequently mining of emotions and psychological states from social media posts has become an active research area. In the CLEF 2019 eRisk task 3, the goal is to detect how strong a user of social media is su ering from depression. The ground truth is obtained by asking persons a set of standardised questions. This paper shows how a variety of ad-hoc retrieval approaches can be adopted to perform this task. The results do not reach a high level of accuracy, but compare to supervised classi cation approaches. In the discussion section, the adequacy of measures for the task is re ected.</p>
      </abstract>
      <kwd-group>
        <kwd>Text Classi cation Information Retrieval</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The classi cation of text documents has seen great progress in recent years.
Meanwhile research is approaching complex problems like gender attribution,
Copyright c 2019 for this paper by its authors. Use permitted under Creative
Commons License Attribution 4.0 International (CC BY 4.0).
content reliability as well as di erent quality attributes of text (e.g. helpfulness
[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] ). The advances in deep learning technologies have contributed to the
expansion of classi cation tasks. Word embeddings as a latent model of the content
of words are representations which are learned by a system during the processing.
The training items are constructed typically as n-grams of words of subsequent
text. Word embeddings as a representation model have often achieved very good
results in recent years. One assumption behind many computation tasks in the
psychological domain is that text tells a lot about the writer. Consequently, the
prediction of psychological traits of people based on text has become an
important research area. The base is often a collection of texts from social media
due to the large amount of text that can be found and the ease of
availability. Researchers have tried to predict the personality of a person based on the
Big-5 model ([
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. More recently, the prediction of mental health issues has been
seen as a task for classi cation systems. First collections have been developed
for analysis (e.g. [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ] ). The eRisk task (Early risk prediction on the Internet)
at the Conference and Labs of the Evaluation Forum (CLEF) became a venue
for comparative analysis of depression detection. In 2019, eRisk moved to
predicting the level of depression of persons based on their social media postings.
This paper reports on heterogeneous experiments for this task and reviews some
technologies for depression detection. Often, there are few data samples available
due to the high level of the required con dentiality. As a consequence, we test
mainly methods based on string similarity and matching techniques instead of
supervised approaches.
2
2.1
      </p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <sec id="sec-2-1">
        <title>Depression and depression detection</title>
        <p>
          Traditionally, depression is diagnosed in a therapy in which a therapist checks
whether depression symptoms appear during a period of time in the behavior of
the patient or not. These symptoms are, for instance, described in the Diagnostic
and Statistical Manual of Mental Disorders (DSM) [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. The current fth edition
replaces the now outdated fourth edition.
        </p>
        <p>
          Another instrument in this eld is the Beck's Depression Inventory (BDI) [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ].
The BDI is a questionnaire consisting of 21 questions assessing the patient's
mental state regarding feelings like sadness, pessimism, loss of energy and
similar. The following example shows the rst question of the BDI:
1. Sadness
0. I do not feel sad.
1. I feel sad much of the time.
2. I am sad all the time.
        </p>
        <p>3. I am so sad or unhappy that I can't stand it.</p>
        <p>
          A di erent questionnaire was developed by Radlo [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ]. It consists of 20
questions, dealing with the frequency of various symptoms of depression. This
questionnaire is called the CES-D Scale (Center for Epidemiologic Studies Depression
Scale). This self-report depression scale has been revised in 2004 (DESD-R) [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ].
Instead of relying on self-report, Eichstaedt et al. [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] used medical codes from
an electronic medical report (EMR) of a patient to establish the depression
diagnosis [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. The researchers then analysed the patients' Facebook posts that
were created before the diagnosis in the EMR. Besides the textual post content,
they also used the post length, the frequency of posting, the temporal posting
patterns, as well as the demographic information to predict the future
diagnosis of depression in the EMR. Overall, language features outperformed all other
features considered. They could also show, that their approach resulted in a
prediction accuracy comparable to validated self-report depression scales.
        </p>
        <p>
          The examples above show that getting meaningful data can be a di cult
and time, labor and cost consuming task, which also relates to the sensitivity
of the topic. This becomes apparent in the study of Eichstaedt and colleagues,
for which they asked 11,224 patients of an emergency department of a hospital
of which only 1,175 agreed to participate fully in the study [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. However, Shen
et al. made the point that the DSM, for instance, took over a decade to evolve
from fourth to fth edition and is so relatively slow in updating depression
criteria, especially those that are conveyed by the behavioral patters in social
media [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ]. Automatically analyzing the online behavior and language on social
media therefore can help in early detection of mental disorders like for instance
depression.
2.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Early risk prediction on the Internet (eRisk)</title>
        <p>
          The eRisk task is an evaluation lab as part of the CLEF initiative. Its main
objective is to examine evaluation methodologies, e ectiveness and performance
metrics, as well as practical applications and the building of test collections
related to early risk detection on the internet. Technologies that can detect
disorders at an early stage can be applied to variety of di erent cases and can be
especially useful in those associated with safety and health. For instance,
notications can be sent when sex o enders start interacting with children. Besides
potential paedophiles other examples encompass stalkers, or persons with
suicidal thoughts or those with tendencies to depression or other mental disorders
[
          <xref ref-type="bibr" rid="ref16">16</xref>
          ].
        </p>
        <p>In 2018, two tasks were organized by the lab: 1) Early Detection of Signs of
Depression and 2) Early Detection of Signs of Anorexia. The lab in 2019
organized three tasks: 1) Early Detection of Signs of Anorexia (continuation of eRisk
2018's T2 task), 2) Early Detection of Signs of Self-harm (this is a new task in
2019) and 3) Measuring the Severity or Strength of the Signs of Depression (this
is a new task in 2019).</p>
        <p>
          The test collections for task one and two of both years have the same format as
described in the overview paper [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. They consist of writings (post and
comments) from social media authors.
        </p>
        <p>
          For evaluating the performance of the systems in the tasks, standard measures
like F1, Precision and Recall have been used. They do not take the decision
making time into account, so that the organizers proposed the ERDE (early risk
detection error) measure [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. Early detection is rewarded, meaning the fewer
posts required to detect e.g. anorexia the better the system is considered to be.
The measure is parameterised to control the place in the X axis where the cost
(the delay in detecting true positives) grows more quickly. ERDE5 therefore is
very demanding with decision delays, because if a system needs more than 5
writings the value for ERDE5 quickly decreases. However, ERDE50 is less strict
with decision delays [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. The ERDE measure is in the range [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ] [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. In 2018,
the best results for ERDE5 were achieved by exible temporal variation of terms
(FTVT) and sequential incremental classi cation (SIC) [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. In case of ERDE50
as well as F1 word embeddings and linguistic metadata led to the best results
[
          <xref ref-type="bibr" rid="ref28">28</xref>
          ]. The highest precision was achieved by using e ective machine learning
algorithms (a bag of words model has been used to perform ada boost, random
forest, logistic regression and support vector machine classi ers) [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]. Fidel and
colleagues obtained the highest recall by applying two independent models (one
trained to predict depression cases, the other one to predict non-depression cases)
with two variants: Duplex Model Chunk Dependent (DMCD) and Duplex Model
Writing Dependent (DMWD) [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ].
3
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Measuring the Severity or Strength of the Signs of</title>
    </sec>
    <sec id="sec-4">
      <title>Depression (eRisk 2019 task 3)</title>
      <p>
        The third task in eRisk 2019 is an exploratory new task in eRisk. Participants of
the challenge have to build an algorithm that estimates the level of depression
of a user based on a history of postings. Depending on these, the participants
of the eRisk lab have to ll in the questionnaire BDI for each user. This means
that the task consists of predicting how a user would ll in the questionnaire
given her or his texts [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
3.1
      </p>
      <sec id="sec-4-1">
        <title>Data Set</title>
        <p>
          The data set consists of BDI questionnaires that were lled in by social media
users along with each user's history of writings. After submitting the BDI, the
user's writings were extracted right after. These original questionnaires are the
ground truth data for task 3 and were used to evaluate the performance of the
lab participants' systems. The participants were given a data set of 20 social
media authors' writing history. They were then asked to develop an algorithm
that produces the following structure:
username1 answer1 answer2 .... answer21
username2 ....
....
Each line identi es the author and the estimated answers to the questions in
the BDI. The ground truth data has the same format [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ].
3.2
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>Evaluation Measures</title>
        <p>
          The task employs a variety of evaluation metrics to measure the success of
algorithms. Losada et al. [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ] de ne them as follows:
Hit Rate (HR) HR determines how often the prediction was correct, compared
to the real questionnaire and gives the ratio. For instance, a prediction where 5
of the 21 questions of the BDI for correct get an HR value of 5/21.
Average Hit Rate (AHR) AHR is HR, but averaged over all users.
Closeness Rate (CR) CR considers the ordinal scale underlying the questions
in the BDI. For each question an absolute di erence (ad) between the actual
answer and the predicted one. A system that is farther away from the answer
than a second system should be penalized for this greater distance. For that the
measure is build like this:
        </p>
        <p>(mad ad)
CR = (1)
mad
Here, mad stands for the maximum absolute di erence (number of possible
answers minus one).</p>
        <p>Average Closeness Rate (ACR) ACR is CR, but averaged over all users.
However, the questions #16 and #18 have seven possibilities to answer, where
for answers 1 to 3 two possible options (a and b) are available. However, those
options were considered equal, since they represent the same level of depression.</p>
      </sec>
      <sec id="sec-4-3">
        <title>Di erence between overall depression levels (DODL) This measure does</title>
        <p>not take into account the system's correct predictions on question-level, but gives
the overall depression level based on the sum of all answers for the real and
system generated BDI. Furthermore, the absolute di erence (ad overall) between
the real and the predicted depression score is calculated.</p>
        <p>A depression level is an integer between 0 and 63. These numbers are derived
from adding the numbers of the answers from the BDI. For example, considering
question #1 (see section 2.1), if the answer was option 1, the depression level
integer is raised by 1. This way, the following four categories are associated with
the respective depression levels:</p>
        <sec id="sec-4-3-1">
          <title>1. Minimal depression (depression levels 0-9) 2. Mild depression (depression levels 10-18)</title>
        </sec>
        <sec id="sec-4-3-2">
          <title>3. Moderate depression (depression levels 19-29) 4. Severe depression (depression levels 30-63) These levels are widely accepted in the psychological literature [8]. The DODL measure is nally normalized into [0; 1] in the following way:</title>
          <p>DODL =
(63
ad overall)
63
(2)
Average DODL (ADODL) ADODL is DODL, but averaged over all users.</p>
        </sec>
      </sec>
      <sec id="sec-4-4">
        <title>Depression Category Hit Rate (DCHR) DCHR computes the fraction of</title>
        <p>cases, in which the system generated BDI led to the same depression category
obtained from the real author's questionnaire.
4</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Processing Approaches</title>
      <p>We experimented with several heterogeneous ad-hoc information retrieval
approaches for depression prediction. That way, a variety of parameter settings
can be explored. An important research question is, whether such processing
without additional resources can compete with deep learning approaches for a
domain with relatively little text volume.
4.1</p>
      <sec id="sec-5-1">
        <title>Ad-hoc Retrieval Approaches</title>
        <p>We considered the posts given for each user and the BDI as a document corpus
and as traditional ad-hoc information retrieval queries. Each answer of a BDI
question is treated as a query. Each set of user posts is treated as a document
collection and indexed. This allows to retrieve (compute a query document
similarity score) documents and produce the result as quickly as possible. The main
concept behind our approach is as follows: The post \pi" (i = 1; 2::k, k is total
number of posts by user \u") of an user \u" which is returned with the
maximum similarity value for a BDI answer with number 1:j (j=0,1,2,3 here. See
example query number 1) from a question set \1" determines the answer. For
the user \u", \j" is the result of query set 1. In the example, question number 1
is concerned with the concept \sadness", so for user \u" j is the \sadness" label
predicted.</p>
        <p>This approach allows the use of information retrieval technology for the task. It
also enables a completely unsupervised approach which does not require
additional resources.</p>
        <p>Due to the nature of text on social media microblogs, it seems unclear whether
stop word removal and stemming as traditional pre-processing methods are
bene cial for the task. Consequently, we conducted experiments with and without
both techniques. documents by
{ stemming and stop word removal, and
{ no stemming and no stop word removal</p>
        <p>
          The following experiments with di erent retrieval models and parameter
settings were carried out with Lucene as the basic search engine:
{ TF-IDF
{ BM25 [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ][
          <xref ref-type="bibr" rid="ref25">25</xref>
          ] [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ] ( 3 ISIKol-bm25-1.2-0.75-5000-Dtac-Qtac ): BM25 model
with parameter settings as follows: k1 = 1:2 and b = 0:75
{ Language Model - Divergence from Randomness with second normalization
model (DFR) [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]
{ LM-dir ( 3 ISIKol-lm-d-1.0-5000-Dtac-Qtac): Language model with Dirichlet
prior smoothing with = 1:0.
{ Multi-Similarity ( 3 ISIKolmultiSimilarity-5000-Dtac-Qtac): This experiment
represents a fusion appraoch with the combined sum of a Language model
with Dirichlet prior smoothing (LM-d) with = 1:0, Language model with
Jelinek-Mercer Smoothing (LM-jm) with = 0:5, DFR with second
normalization model (DFR) [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] and a BM25 model with k1 = 1:2 and b = 0:75.
4.2
        </p>
      </sec>
      <sec id="sec-5-2">
        <title>Deep Representations for Matching</title>
        <p>
          Recently, deep representations based on word embedding have received much
attention, in particular for supervised learning. Based on our approach described
above, further experiments were done with word embedding representations. For
that, we used the word2vec pre-trained model [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ][
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] and represented a
document as a vector using Equation-3. In this case, !d is the document vector of
document d, w!id is the vector for the ith word (or term) from document d.
        </p>
        <p>
          Equation-4, describes how query and document similarities were calculated.
This method was used by Bandyopadhyay et al. [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] in a retrieval approach for
tweet classi cation during natural disasters. In Equation-4 !q is the query vector
of a query q. CosSim( !q; !d ) is cosine similarity of !q, !d .
        </p>
        <p>jWdj
!d = X w!id</p>
        <p>i=1
Sim(q; d) = CosSim( !q; !d ) =
!q !d</p>
        <p>!
jj !qjj jj d jj
(3)
(4)</p>
        <p>
          We used Google's pre-trained word2vec vectors[
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] and the GloVe pre-trained [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ](
Table 1) word vectors to compute our document vectors using Equation-3
formula.
This section shows the results of our experiments and compares them to the
outcomes of the submitted runs for the task at CLEF eRisk.
        </p>
        <p>
          The experiment LM-d = 1:0 returns the best value for the measures ACR.
BM25 k1 = 1:2, b = 0:75 is best for ACR and TF-IDF for DCHR. The language
model was used for experiments with query expansion (QE). In Table-2 query
expansion results are given. In Table-2 \D"= number of top docs used in QE.
\T"= number of top terms used in QE and \RM3" [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] = value of qmix used in
RM3 QE.
The results of our experiments are not far behind the supervised approaches
submitted at CLEF. This shows that straightforward approaches using only IR
technologies currently perform almost as good as advanced algorithms.
The measure DODL and ADODL need to be interpreted with care. They are
a very useful measure as they consider the depression level of one user overall.
However, it can even out bad results from individual questions. An approach to
trick ADODL would give results in the middle of the answer range. In this case,
ADODL would be 50 per cent for an even distribution. Consider that this would
be better then all submitted experiments which have higher (worse) values. For
an uneven or highly skewed distribution, even better (lower) values could be
obtained by appropriate guessing. In a realistic scenario, such a classi cation
would probably need to nd out the few cases with depression from many users.
In such a case, the set of individual with and without depression are likely to be
highly imbalanced. This needs to be taken into consideration when developing
classi ers for realistic scenarios.
7
        </p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>Traditional IR methods including query expansion do not perform best for the
eRisk depression severity detection. However, the performance is not much worse
when compared to the submitted runs.</p>
      <p>In order to improve performance, we need to further analyze why IR methods
are not doing well. One of the reason might be the BDI question length. Average
question length is 8:45 (in words) when no stemming is used or no stop words
are removed. When we remove stop words and stems (porter) BDI query, the
average query length becomes 3.57 (in words).</p>
      <p>There are many directions for future research. It is necessary to obtain on the
one hand a better understanding of the models for professionals in the eld and
reach some sort of transparency for them. The type of transparency and how
it can be reached is a new research area. Maybe the performance of di erent
sub-classes of depression can be a rst step towards that goal.</p>
      <p>On the other hand, experts need to be able to feed their expertise into the
systems and improve their performance. The society overall needs to nd ethical
ways to handle such technology. It seems important that citizens are more aware
of the information they are providing to readers by writing online text which can
be analyzed easily. Basically, they might reveal much about their psychological
traits without being aware of it. One important tool would be a classi er available
to everyone, such that citizens can test the predictions gained from their texts.
This gives users back some of their informational autonomy.
8</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgements</title>
      <p>This work was carried out during a stay of the rst author at the University of
Hildesheim in Germany. The work was partially sponsored by the federal state
Niedersachsen and the Institute of Information Science and Language
Technology (IWIST) at the University of Hildesheim.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Google</surname>
          </string-name>
          pre-trained word vector https://code.google.com/archive/p/word2vec/
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. American Psychiatric Association:
          <article-title>Diagnostic and statistical manual of mental disorders : DSM-5</article-title>
          . American Psychiatric Association Arlington,
          <string-name>
            <surname>VA</surname>
          </string-name>
          , 5th ed. edn. (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Abdul-Jaleel</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Allan</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Croft</surname>
            ,
            <given-names>W.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Diaz</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Larkey</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smucker</surname>
            ,
            <given-names>M.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wade</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          : Umass at trec 2004:
          <article-title>Novelty and hard</article-title>
          .
          <source>In: Proceedings of TREC13</source>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Amati</surname>
          </string-name>
          , G.,
          <string-name>
            <surname>Van Rijsbergen</surname>
            ,
            <given-names>C.J.</given-names>
          </string-name>
          :
          <article-title>Probabilistic models of information retrieval based on measuring the divergence from randomness</article-title>
          .
          <source>ACM Trans. Inf. Syst</source>
          .
          <volume>20</volume>
          ,
          <issue>357</issue>
          {389 (
          <year>October 2002</year>
          ). https://doi.org/http://doi.acm.
          <source>org/10</source>
          .1145/582415.582416, http://doi.acm.
          <source>org/10</source>
          .1145/582415.582416
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Bai</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hao</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yuan</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gao</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Predicting big ve personality traits of microblog users</article-title>
          .
          <source>vol. 1</source>
          , pp.
          <volume>501</volume>
          {
          <issue>508</issue>
          (11
          <year>2013</year>
          ). https://doi.org/10.1109/WI-IAT.
          <year>2013</year>
          .70
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Bandyopadhyay</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ganguly</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mitra</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Saha</surname>
            ,
            <given-names>S.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jones</surname>
            ,
            <given-names>G.J.:</given-names>
          </string-name>
          <article-title>An Embedding Based IR Model for Disaster Situations</article-title>
          .
          <source>Information Systems Frontiers</source>
          <volume>20</volume>
          (
          <issue>5</issue>
          ),
          <volume>925</volume>
          {932 (
          <year>October 2018</year>
          ). https://doi.org/10.1007/s10796-018-9847-6
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Basu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ghosh</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ghosh</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Overview of the FIRE 2018 track: Information retrieval from microblogs during disasters (irmidis)</article-title>
          . pp.
          <volume>1</volume>
          {
          <issue>5</issue>
          (
          <issue>12</issue>
          <year>2018</year>
          ). https://doi.org/10.1145/3293339.3293340
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Beck</surname>
            ,
            <given-names>A.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Steer</surname>
            ,
            <given-names>R.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carbin</surname>
            ,
            <given-names>M.G.</given-names>
          </string-name>
          :
          <article-title>Psychometric properties of the beck depression inventory: Twenty- ve years of evaluation. Clinical psychology review 8(1</article-title>
          ),
          <volume>77</volume>
          {
          <fpage>100</fpage>
          (
          <year>1988</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Beck</surname>
            ,
            <given-names>A.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ward</surname>
            ,
            <given-names>C.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mendelson</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mock</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Erbaugh</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>An inventory for measuring depression</article-title>
          .
          <source>Archives of general psychiatry 4</source>
          (
          <issue>6</issue>
          ),
          <volume>561</volume>
          {
          <fpage>571</fpage>
          (
          <year>1961</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Cacheda</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fernandez</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Novoa</surname>
            ,
            <given-names>F.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carneiro</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Analysis and experiments on early detection of depression</article-title>
          . In: Cappellato et al. [
          <volume>11</volume>
          ], pp.
          <volume>10</volume>
          {
          <issue>21</issue>
          , http://ceurws.org/Vol-
          <volume>2125</volume>
          /paper 69.pdf
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Cappellato</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ferro</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nie</surname>
            ,
            <given-names>J.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Soulier</surname>
            ,
            <given-names>L</given-names>
          </string-name>
          . (eds.): Working Notes of CLEF 2018 -
          <article-title>Conference and Labs of the Evaluation Forum</article-title>
          .
          <source>No. 2125 in CEUR Workshop Proceedings</source>
          , Avignon (
          <year>2018</year>
          ), http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2125</volume>
          /
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Eaton</surname>
            ,
            <given-names>W.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ybarra</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Muntaner</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tien</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Center for Epidemiologic Studies Depression Scale: review and revision (CESD</article-title>
          and
          <string-name>
            <surname>CESD-R).</surname>
          </string-name>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Eichstaedt</surname>
            ,
            <given-names>J.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>R.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Merchant</surname>
            ,
            <given-names>R.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ungar</surname>
            ,
            <given-names>L.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Crutchley</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Preotiuc-Pietro</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Asch</surname>
            ,
            <given-names>D.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schwartz</surname>
            ,
            <given-names>H.A.</given-names>
          </string-name>
          :
          <article-title>Facebook language predicts depression in medical records</article-title>
          .
          <source>Proceedings of the National Academy of Sciences</source>
          <volume>115</volume>
          (
          <issue>44</issue>
          ),
          <volume>11203</volume>
          {
          <fpage>11208</fpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Funez</surname>
            ,
            <given-names>D.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ucelay</surname>
            ,
            <given-names>M.J.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Villegas</surname>
            ,
            <given-names>M.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Burdisso</surname>
            ,
            <given-names>S.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cagnina</surname>
            ,
            <given-names>L.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Montes-y Gomez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Errecalde</surname>
            ,
            <given-names>M.L.</given-names>
          </string-name>
          :
          <article-title>UNSLs participation at eRisk 2018 Lab</article-title>
          . In: Cappellato et al. [
          <volume>11</volume>
          ], http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2125</volume>
          /paper 137.pdf
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Losada</surname>
            ,
            <given-names>D.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Crestani</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>A test collection for research on depression and language use</article-title>
          .
          <source>In: International Conference of the Cross-Language Evaluation Forum for European Languages</source>
          . pp.
          <volume>28</volume>
          {
          <fpage>39</fpage>
          . Springer (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Losada</surname>
            ,
            <given-names>D.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Crestani</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parapar</surname>
          </string-name>
          , J.: Overview of erisk 2018:
          <article-title>Early risk prediction on the internet (extended lab overview)</article-title>
          .
          <source>In: Cappellato</source>
          et al. [
          <volume>11</volume>
          ], http://ceurws.org/Vol-
          <volume>2125</volume>
          /invited paper 1.pdf
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Losada</surname>
            ,
            <given-names>D.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Crestani</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parapar</surname>
          </string-name>
          , J.:
          <source>Overview of eRisk</source>
          <year>2019</year>
          :
          <article-title>Early Risk Prediction on the Internet</article-title>
          .
          <source>In: Experimental IR Meets Multilinguality, Multimodality, and Interaction. 10th International Conference of the CLEF Association</source>
          ,
          <string-name>
            <surname>CLEF</surname>
          </string-name>
          <year>2019</year>
          . Springer International Publishing, Lugano, Switzerland (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yih</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zweig</surname>
          </string-name>
          , G.:
          <article-title>Linguistic Regularities in Continuous Space Word Representations</article-title>
          .
          <source>In: NAACL HLT</source>
          <year>2013</year>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sutskever</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corrado</surname>
            ,
            <given-names>G.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dean</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Distributed representations of words and phrases and their compositionality</article-title>
          .
          <source>In: Proc. NIPS '13</source>
          . pp.
          <volume>3111</volume>
          {
          <issue>3119</issue>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Paul</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kalyani</surname>
            ,
            <given-names>J.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Basu</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Early detection of signs of anorexia and depression over social media using e ective machine learning frameworks</article-title>
          . In: Cappellato et al. [
          <volume>11</volume>
          ], pp.
          <volume>10</volume>
          {
          <issue>21</issue>
          , http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2125</volume>
          /paper 182.pdf
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Pennington</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Socher</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Manning</surname>
          </string-name>
          , C.D.: Glove:
          <article-title>Global vectors for word representation</article-title>
          .
          <source>In: Empirical Methods in Natural Language Processing (EMNLP)</source>
          . pp.
          <volume>1532</volume>
          {
          <issue>1543</issue>
          (
          <year>2014</year>
          ), http://www.aclweb.org/anthology/D14-1162
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Radlo</surname>
            ,
            <given-names>L.S.:</given-names>
          </string-name>
          <article-title>The CES-D scale: A self-report depression scale for research in the general population</article-title>
          .
          <source>Applied psychological measurement 1</source>
          (
          <issue>3</issue>
          ),
          <volume>385</volume>
          {
          <fpage>401</fpage>
          (
          <year>1977</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Robertson</surname>
            ,
            <given-names>S.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Walker</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>Some simple e ective approximations to the 2-poisson model for probabilistic weighted retrieval</article-title>
          .
          <source>In: Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval</source>
          . pp.
          <volume>232</volume>
          {
          <fpage>241</fpage>
          . SIGIR '
          <volume>94</volume>
          , Springer-Verlag New York, Inc., New York, NY, USA (
          <year>1994</year>
          ), http://dl.acm.org/citation.cfm?id=
          <volume>188490</volume>
          .
          <fpage>188561</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Robertson</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zaragoza</surname>
          </string-name>
          , H.:
          <article-title>The probabilistic relevance framework: Bm25 and beyond</article-title>
          .
          <source>Found. Trends Inf. Retr</source>
          .
          <volume>3</volume>
          (
          <issue>4</issue>
          ),
          <volume>333</volume>
          {389 (Apr
          <year>2009</year>
          ). https://doi.org/10.1561/1500000019, http://dx.doi.org/10.1561/1500000019
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Robertson</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zaragoza</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taylor</surname>
          </string-name>
          , M.:
          <article-title>Simple bm25 extension to multiple weighted elds (</article-title>
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Shen</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jia</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nie</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Feng</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chua</surname>
            ,
            <given-names>T.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          :
          <article-title>Depression detection via harvesting social media: A multimodal dictionary learning solution</article-title>
          .
          <source>In: IJCAI</source>
          . pp.
          <volume>3838</volume>
          {
          <issue>3844</issue>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Trienes</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Balog</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Identifying unclear questions in community question answering websites</article-title>
          . CoRR abs/
          <year>1901</year>
          .06168 (
          <year>2019</year>
          ), http://arxiv.org/abs/
          <year>1901</year>
          .06168
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Trotzek</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koitka</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Friedrich</surname>
            ,
            <given-names>C.M.</given-names>
          </string-name>
          :
          <article-title>Word Embeddings and Linguistic Metadata at the CLEF 2018 Tasks for Early Detection of Depression and Anorexia</article-title>
          . In: Cappellato et al. [
          <volume>11</volume>
          ], http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2125</volume>
          /paper 68.pdf
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>