<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Analysis and Experiments on Early Detection of Depression</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Fidel Cacheda</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Diego Fernandez</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Francisco J. Novoa</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>V ctor Carneiro</string-name>
          <email>victor.carneirog@udc.es</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Telematics Research Group Department of Computer Science University of A Corun~a</institution>
          ,
          <addr-line>15071</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper we present the participation of the Telematics Research group from the University of A Corun~a at the eRisk 2018 task on early detection of signs of depression. We formalize the task as a classi cation problem and follow a machine learning approach. We perform an analysis of dataset and we propose three di erent feature types (textual, semantic and writing). We base our solution on using two independent models, one trained to predict depression cases and another one trained to predict non-depression cases, with two variants: Duplex Model Chunk Dependent (DMCD) and Duplex Model Writing Dependent (DMWD). Our results show how the DMWD model outperforms the DMCD on terms of ERDE5, ranking in the top-10 submissions for this task.</p>
      </abstract>
      <kwd-group>
        <kwd>Depression early risk detection positive model negative model</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>In this paper we present the participation of the Telematics Research group
from the University of A Corun~a (UDC) at the Conference and Labs of the
Evaluation Forum (CLEF) in the eRisk 2018 lab. This lab is intended to explore
the evaluation methodology, e ectiveness metrics and practical applications of
early risk detection on the Internet. This is the second year that this lab is run
and it includes two main tasks: early detection of signs of depression and early
detection of signs of anorexia. Our research group participated in the rst task,
presenting two di erent models that generated ve separated runs.</p>
      <p>
        The eRisk 2018 early detection of signs of depression was divided in two
di erent stages: training and test [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. The collection contains a sequence of
writings in chronological order and for each user, her/his collection of writings
has been divided into 10 chunks chronologically, each one containing a 10% of
the user's writings. Initially the training set was provided with a whole history
of writings for a set of training users, indicating explicitly the users that have
been diagnosed with depression. The test stage consisted of 10 sequential releases
of data throughout 10 consecutive weeks. The rst release corresponded to the
rst chunk, containing the oldest writings for all test users. The second release
consisted of the second chunk of data and so on so forth until the tenth chunk.
After each release, for each user the systems provided one of three options: a)
emit a decision of depression, b) emit a decision of non-depression, or c) delay
the decision (that is, see more chunks of data). Once a decision is emitted, this
decision is nal and in the nal chunk all users must have a decision.
      </p>
      <p>
        The evaluation is done considering the ERDE metric [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. In this way, the
evaluation takes into account not only the correctness of the system's output
(that is, the precision) but also the delay taken to emit its decision.
      </p>
      <p>To deal with this problem we use a basic approach to measure the textual
and semantic similarities between depressed and non-depressed users, but we also
focus on other writing features, such as textual spreading (i.e. number of writings
per user, number of words per writing), time elapsed between two consecutive
writings and the moment when the writings were created. In our proposal we
use two machine learning models, one trained to predict depression cases while
the other is trained to predict non-depression cases and we provide two variants
named Duplex Model Chunk Dependent and Duplex Model Writing Dependent.</p>
      <p>The remaining of this article is organized as follows. In Section 2 we comment
on related work. Section 3 provides a data analysis of the dataset used for this
task. In Sections 4 and 5 we describe, respectively, the features selected and the
model proposed. Section 6 presents our results for this task and, nally, Section
7 includes our conclusions and future work.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related work</title>
      <p>
        There are some previous publications that make use of social networks to identify
and characterize the incidence of di erent diseases. For example, Chunara et
al. analyzed cholera-related tweets published during the rst 100 days of the
2010 Haitian cholera outbreak [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] or Prieto et al. in [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] propose a method to
automatically measure the incidence of a set of health conditions in society just
using Twitter. Also, Chew and Eysenbach use sentiment analysis on 2 million
tweets to propose a complementary infoveillance approach [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>
        Speci cally related with depression, a few works intend to characterize
depressed subjects from their social networking behavior. For example, at the
CLPysch 2015 conference a task was organized to detected, among others,
depressed subjects using Twitter posts [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Several groups participated in the task,
with promising results, although none of them were focused on early detection.
      </p>
      <p>
        Finally, it is fundamental to mention the CLEF Lab on Early Risk
Prediction on the Internet 2017 [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. In general, participants based their approaches
on di erent features, such as lexical, linguistic, semantic or statistical. [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]
followed a two-step classi cation, rst post level and next user level, based on a
Naives Bayes classi er. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] proposed several supervised learning approaches and
information retrieval models to estimate the risk of depression. In the work by
Villegas et al. [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ], the authors explicitly considered the partial information that
is available in di erent chunks of data. In [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], they proposed a graph-based
representation to capture some inherent characteristics of the documents and use a
k-nearest neighbor classi cation system. The work in [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] considered a
depression lexicon to extract features and used recurrent neural networks and support
vector machines models for prediction. Trotzek et al. in [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] use linguistic meta
information extracted from the users' texts and applied di erent models. In [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ],
the authors use a combination of lexical and statistics features to predict the risk
of depression. Also, in our work [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] we compare a single machine learning model
with a dual model, improving the results obtained on the lab by 10% using a
dual model and following the eRisk time-aware evaluation methodology.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Data analysis</title>
      <p>
        The dataset used in the eRisk 2018 early detection of signs of depression task
consists of a set of text contents posted by users in Reddit [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Each individual
has been previously classi ed as depressed or non-depressed. Additional
information can be found in Table 1, where small di erences between depressed and
non-depressed users start to arise. For instance, as for the submissions, both
average and standard deviation are signi cantly greater for control users.
      </p>
      <p>In order to detect depression in a user behavior it is essential to study the
characteristics in the writings that determine this diagnosis for the subject. At
this rst stage, we have chosen several features easily measured, such as
textual spreading, time gap and time span. We will assume that writing times are
adapted to the user timezone.</p>
      <p>Textual spreading. The textual spreading refers to the number of words
composing a user writing. So, we are going to consider title length, text length,
and the whole post length.</p>
      <p>The comparison of those lengths according to the type of subject is depicted
in Fig. 1. The rst sub gure shows the number of words by title. Depressed
and non-depressed users show a similar descending trend. As a consequence,
those posts where users are commenting an existing reddit (so the title is not
included) are more abundant. However, the di erence between comments (the
title length is 0) and new reddits (the title length is not 0) is higher for depressed
subjects. Therefore, the latter are more prone to respond to an existing reddit
than creating a new one (the title is mandatory in this case).</p>
      <p>The second sub gure in Fig. 1 represents the number of words in the text,
thus excluding the title. The percentages are higher for depressed users in all the
intervals but the rst one, which corresponds to writings without text. Apart
from that, all users prefer short writings rather than long ones, since the number
of writings with more than 100 words is lower than the others.</p>
      <p>Regarding the third sub gure in Fig. 1, it compares the two types of subjects
taking into account all the words in the writing. In this case, the di erences
between the subjects are smoothed and their results are similar. This is because
di erences in the rst sub gure clearly compensate di erences in the second one.</p>
      <p>Time gap. Users present mainly two di erent behaviors with respect to
the time gaps: having two consecutive writings or two writings separated by
about one day. Non-depressed users follow a clearer routine, posting every day.
Nevertheless, long gaps for depressed users are more sparse.</p>
      <p>Time span. Regarding how users submit their writings during the week,
non-depressed users tend to submit less writings at weekends and more in the
middle of the week. This di erence is more perceptible considering only new
reddits. Moreover, depressed users behavior is more homogeneous, and it does
not vary signi cantly at weekends. In spite of these di erences, the behaviors
for the two types of subjects are very similar when taking into account only
comments.</p>
      <p>Finally, depressed subjects send more posts and comments than non-depressed
ones from midnight to midday, while the latter publish more in the afternoon.
The main di erences considering only comments appear six hours before
midday (when depressed subjects are more active) and six hours after (when
nondepressed subjects are more active). However, taking into account the new
reddits, depressed users publish more from ten in the evening to six in the morning,
but non-depressed subjects increase their di erence in the afternoon.</p>
      <p>In summary, regarding the aspects analyzed, subjects with or without
depression tackle the submission of writings di erently, in terms of number of words,
gaps between writings, day of the week and hour.</p>
    </sec>
    <sec id="sec-4">
      <title>Features</title>
      <p>We formalize this problem as a binary classi cation problem using the presence
or absence of a depression diagnosis as the label. However, if no strong evidence
is found in one direction or another, the decision is delayed.</p>
      <p>To address this machine learning problem, we resort to a features-based
approach and design a collection of features that are expected to capture
correlations between di erent aspects of the individual's writings and depression.</p>
      <p>We propose three types of features: textual similarity, semantic similarity
and writing features.
4.1</p>
      <sec id="sec-4-1">
        <title>Textual similarity features</title>
        <p>The training dataset is divided in two disjunctive sets: positive and negative
subjects. Positive subjects refer to subjects diagnosed with depression, while
negative subjects are those not diagnosed with depression. The main goal of
these features is to estimate the degree of alignment of a subject's writings with
positive or negative subjects measuring only the textual similarity between their
writings. In this way we estimate the likeliness between a given subject versus
positive and negative subjects.</p>
        <p>
          We select a bag-of-words representation, ignoring words ordering and we
employ two di erent measures extensively used in the literature: Cosine similarity
[
          <xref ref-type="bibr" rid="ref17">17</xref>
          ] and Okapi BM25 [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ].
        </p>
        <p>For each subject we consider two elds with textual information: title and
text. In order to capture potential di erences between positive and negative
subjects in the text, we measured the similarity in each eld independently and
concatenating all the textual information available for each writing. Therefore,
in the same way as described in Section 3, for each subject we consider three
textual scopes: title, text and all.</p>
        <p>At the same time, for each active subject and scope we calculate the average,
standard deviation, minimum, maximum and median of the scores obtained
comparing this subject to every other positive subject. Then we repeat the same
process for the scores with negative subjects. In both cases, the active subject
is removed from the corresponding sample, whether positive or negative.</p>
        <p>As a result we obtained 30 features for the Cosine similarity and another 30
features for the BM25 similarity.
4.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Semantic similarity features</title>
        <p>In order to capture semantic relationships among documents we apply Latent
Semantic Analysis (LSA). LSA will explicitly learn the semantic word vectors
by applying Singular Value Decomposition (SVD). This will project the input
word-representation into a lower-dimensional space of dimensionality k V ,
where semantically related words are closer than unrelated ones.</p>
        <p>
          As in the previous case, each subject is represented as a document that
aggregates all her/his writings but, in this case, no distinction is made between
the di erent elds and all the textual information available is used to compute
the singular values. Semantic similarity between two subjects is computed as the
euclidean distance between the respective projections into the embedding space.
This process is repeated for all positive and negative subjects and scores are
aggregated calculating the average, standard deviation, minimum, maximum and
median values. LSA is applied both following a full-text approach and removing
stopwords and using Porter stemming [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. Finally, we apply feature scaling to
normalize the LSA scores computed following min-max normalization [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ].
        </p>
        <p>Overall, we have been able to capture 40 semantic features, considering two
normalization options, two stopwords removal and stemming alternatives, ve
statistical measures, and positive and negative subjects.
4.3</p>
      </sec>
      <sec id="sec-4-3">
        <title>Writing features</title>
        <p>In order to complement the textual and semantic features, we also include a
collection of features used to pro le the characteristics of the subjects' writings.
These features intend to capture part of the di erences detected on Section 3,
as we believe they may have an impact on the depression prediction.</p>
        <p>From the data available on the dataset we extracted three main signals:
textual spreading, time gap and time span.</p>
        <p>Textual spreading. Textual spreading measures the amount of textual
information provided by the subject in her/his writings. This set of features are
intended to measure di erences when users elaborate on their writings. To
address this we introduce the following features:
{ NWritings: the number of writings produced by the subject.
{ AvgWords: the average number of words per writing. For each writing all
the textual information available is considered.
{ DevWords: standard deviation for the number of words per writing.
{ MinWords: minimum number of words in the subject's writings.
{ MaxWords: maximum number of words in the subject's writings.
{ MedWords: median for the number of words in the subject's writing.
Time gap. Time gap intends to measure the frequency for a subject's writings
by means of calculating the time spent between two consecutive writings. In this
way, if a subject only has one writing in the time period considered, the time gap
would be zero. Otherwise, the time gap will measure the number of milliseconds
between two consecutive writings.</p>
        <p>Also, a logarithmic transformation of the raw time gap values is considered.
Therefore, the following two sets of features are considered:
{ TimeGap: the aggregated information for the time lapse between two
consecutive writings. These values are represented as the average, standard
deviation, minimum, maximum and median.
{ LogTimeGap: for the logarithmic transformation of the time gap values. The
same aggregation values are computed for each subject.</p>
        <p>Time span. This group of features is used to pro le the moment when the
writings were created by the subject. The following features are proposed:
{ Day : percentage of writings corresponding to each day of the week.
{ Weekday : accumulative percentage of writings created in a weekday.
{ Weekend : accumulative percentage of writings posted during the weekend.
{ Hour : percentage of writings corresponding to each hour of the day grouped
into four homogeneous classes (0:00-5:59, 6:00-11:59, 12:00-17:59, 18:00-23:59).
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Models</title>
      <p>
        In order to learn a model from the previous features we employ a readily available
machine learning toolkit. For the problem at hand we consider Random Forest
(RF) [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] as the base machine learning model to solve the classi cation problem.
      </p>
      <p>As this is not a traditional binary classi cation problem due to the delay
option available when processing the di erent subjects' writings, our proposal is
based on two Random Forest models, where each one has been trained with an
independent set of features.</p>
      <p>The positive model (m+) is trained to predict depression cases, while the
negative model (m ) is trained to predict non-depression cases. For both models,
and in order to make a rm decision, a threshold is set. If the probability is above
the threshold then a diagnosis can be emitted and otherwise the decision must
be delayed.</p>
      <p>All models have been optimized on the training data using ERDE5 or
ERDE50 scores before submitting the results for the rst chunk and no modi
cations were applied later on. The same holds for the di erent thresholds utilized
in each model.</p>
      <p>Following, we describe the two variants for the prediction model proposed
along with their con gurations in the di erent submissions presented to the
eRisk 2018 lab.
5.1</p>
      <sec id="sec-5-1">
        <title>Duplex Model Chunk Dependent (DMCD)</title>
        <p>This model applies the positive and negative models using a threshold function
that depends on the number of chunks processed. The threshold function is
used on the positive and negative models to determine if a rm decision (in
any sense) can be provided. In this case, the threshold follows a decreasing step
function being the same for both positive and negative models. It starts in 0.9
and decreases 0.1 every 2 chunks. For instance, after chunk 2 the value is 0.8,
and after chunk 8 is 0.5.</p>
        <p>More speci cally, if the negative model probability is above the threshold
a negative decision is emitted. Otherwise, if the positive model probability is
above the threshold a positive decision is emitted and the decision is delayed in
any other case.
In this case, the m+ and m are applied based exclusively on the number of
writings generated for each subject. Therefore, we include a writings threshold,
denoted as thw, that is used to determine when the positive and negative models
should be used. More speci cally, if the number of writings is less than or equal
to thw, m+ is applied, and otherwise m is used.</p>
        <p>In both cases, the model probability has to surpass a certain threshold to
provide a de nitive decision or else the decision is delayed. In case of the positive
model the threshold was set at 0:9, while for the negative model was 0:5.
In the eRisk 2018 early detection of signs of depression a total amount of 11
institutions participated, submitting 45 results. The evaluation is focused not
only on the correctness of the predictions, but also on the delay taken to emit
the decision. In this sense, the organizers use ERDE5 and ERDE50, although
also the traditional precision, recall and F1 are provided.</p>
        <p>Figure 2 shows the o cial results for metric ERDE5. Our best result is
achieved with UDCC ranked in seventh place and followed closely by UDCE in
ninth place. Both con gurations follow the DMWD version and use thw = 6 as
writings threshold. Our best performing model uses cosine and BM25 metrics
limited to the writings text eld for the positive model and apply LSA with
Porter stemming for the negative model, while our second best considers all
textual elds for the cosine and BM25 metrics on the positive model, while a
normalized LSA with stemming is used in the negative model.</p>
        <p>Regarding ERDE50 our results are more modest, with UDCA ranked
twentythird and UDCD ranked twenty-sixth. In this case, UDCA uses the DMCD
alternative, while UDCD uses the DMWD (thw = 53). Again, there are some
di erences in the features employed in each case. UDCA uses all textual elds to
compute cosine and BM25 similarity metrics and normalized LSA with stemming
for the negative model, while UDCD computes the cosine and BM25 metrics
using exclusively the text eld and the negative model is based on non-normalized
LSA with stemming.</p>
        <p>More interesting are the results regarding precision showed on Figure 3. In
this case, our results are the worst from all participants. In fact, F1 metric shows
the same results. From our point of view, this just highlights the fact that this
task is not related with precision or F1 but, as the name states, with an early
detection of the signs of depression.</p>
        <p>The ERDE metric penalizes the late detection of a positive case, being
equivalent to a false negative. In this sense, our models try to focus on the
early detection of positive cases, leaving in a second place precision and recall
measures.</p>
        <p>The task proposed for the eRisk labs is interestingly di cult. As a matter
of fact, using the golden truth provided by the organization we have calculated
the performance obtained by an Oracle model that is able to predict perfectly
all depression cases in all di erent chunks. Therefore, Oracle 1 corresponds to
an Oracle that is able to predict in the rst chunk all depression cases, Oracle 2
in the second chunk and so on.</p>
        <p>Table 3 presents the results obtained for the Oracle at all chunks. As
expected, precision, recall and F1 obtain perfect scores. However, ERDE metrics
are much more demanding. Specially ERDE5, where any true positive
decision that requires more than 5 writings would be penalized and soon become
equivalent to a false negative.</p>
        <p>Analyzing our results with respect to the Oracle models, we observe that our
best performing model on the ERDE5 score is equivalent to Oracle 6, while our
best performing model on ERDE50 is worse than Oracle 10. On the other side,
analyzing the best results obtained by all participants in the depression task,
the best performing model on ERDE5 is slightly better than an Oracle 3 while
on ERDE50 it is located between Oracle 5 and Oracle 6.</p>
        <p>In general, there seems to be more room for improvement on the ERDE50
metric than on the ERDE5 metric. This may be related with the fact that
some users will have more than 5 writings in the rst chunk, making an early
prediction impossible.
7</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusions and future work</title>
      <p>In this paper we have presented the participation of the Telematics Research
group from the University of A Corun~a at the eRisk 2018 task on early detection
of signs of depression.</p>
      <p>We have formalized the task as a classi cation problem and we have used a
machine learning approach, designing three types of features in order to capture
correlations between writings and depression signs: textual similarity, semantic
similarity and writing features. Our proposal is based on two independent
models, positive and negative, with two variants: DMCD and DMWD. Our results
show how the DMWD model performs much better than the DMCD for ERDE5
and it is among the top-10 submissions for the task. On the other side, our results
on ERDE50 are mediocre, with both variants performing below the average.</p>
      <p>In the future, we expect to extend this work by studying other model
combinations, with a focus on new machine learning algorithms and feature sets.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments References</title>
      <p>This work was supported by the Ministry of Economy and Competitiveness of
Spain and FEDER funds of the European Union (Project TIN2015-70648-P).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Aksoy</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haralick</surname>
            ,
            <given-names>R.M.</given-names>
          </string-name>
          :
          <article-title>Feature normalization and likelihood-based similarity measures for image retrieval</article-title>
          .
          <source>Pattern Recognition Letters</source>
          <volume>22</volume>
          (
          <issue>5</issue>
          ),
          <volume>563</volume>
          {
          <fpage>582</fpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Almeida</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Briand</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meurs</surname>
            ,
            <given-names>M.J.:</given-names>
          </string-name>
          <article-title>Detecting early risk of depression from social media user-generated content</article-title>
          .
          <source>In: Proceedings Conference and Labs of the Evaluation Forum CLEF</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Breiman</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Random forests</article-title>
          .
          <source>Machine Learning 45(1)</source>
          ,
          <volume>5</volume>
          {
          <fpage>32</fpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Cacheda</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fernandez</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Novoa</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carneiro</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Arti cial intelligence and social networks for early detection of depression. Submitted for publication (</article-title>
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Chew</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eysenbach</surname>
          </string-name>
          , G.:
          <article-title>Pandemics in the age of twitter: content analysis of tweets during the 2009 h1n1 outbreak</article-title>
          .
          <source>PloS one</source>
          <volume>5</volume>
          (
          <issue>11</issue>
          ),
          <year>e14118</year>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Chunara</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Andrews</surname>
            ,
            <given-names>J.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brownstein</surname>
            ,
            <given-names>J.S.</given-names>
          </string-name>
          :
          <article-title>Social and news media enable estimation of epidemiological patterns early in the 2010 haitian cholera outbreak</article-title>
          .
          <source>The American journal of tropical medicine and hygiene 86(1)</source>
          ,
          <volume>39</volume>
          {
          <fpage>45</fpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Coppersmith</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dredze</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Harman</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hollingshead</surname>
          </string-name>
          , K., Mitchell, M.:
          <article-title>Clpsych 2015 shared task: Depression and ptsd on twitter</article-title>
          .
          <source>In: Proceedings of the 2nd Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality</source>
          . pp.
          <volume>31</volume>
          {
          <issue>39</issue>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8. Far as Anzaldua,
          <string-name>
            <given-names>A.A.</given-names>
            ,
            <surname>Montesy</surname>
          </string-name>
          <string-name>
            <surname>Gomez</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Lopez</given-names>
            <surname>Monroy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.P.</given-names>
            ,
            <surname>GonzalezGurrola</surname>
          </string-name>
          , L.C.:
          <article-title>Uach-inaoe participation at erisk2017</article-title>
          .
          <source>In: Proceedings Conference and Labs of the Evaluation Forum CLEF</source>
          . vol.
          <year>1866</year>
          .
          <article-title>NIH Public Access (</article-title>
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Losada</surname>
            ,
            <given-names>D.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Crestani</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>A test collection for research on depression and language use</article-title>
          .
          <source>In: Conference Labs of the Evaluation Forum</source>
          . pp.
          <volume>28</volume>
          {
          <fpage>39</fpage>
          . Springer (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Losada</surname>
            ,
            <given-names>D.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Crestani</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parapar</surname>
          </string-name>
          , J.: erisk 2017:
          <article-title>Clef lab on early risk prediction on the internet: Experimental foundations</article-title>
          .
          <source>In: International Conference of the Cross-Language Evaluation Forum for European Languages</source>
          . pp.
          <volume>346</volume>
          {
          <fpage>360</fpage>
          . Springer (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Losada</surname>
            ,
            <given-names>D.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Crestani</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parapar</surname>
          </string-name>
          , J.:
          <article-title>Overview of eRisk { Early Risk Prediction on the Internet</article-title>
          .
          <article-title>In: Experimental IR Meets Multilinguality, Multimodality, and Interaction</article-title>
          .
          <source>Proceedings of the Ninth International Conference of the CLEF Association (CLEF</source>
          <year>2018</year>
          ). Avignon, France (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Malam</surname>
            ,
            <given-names>I.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Arziki</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bellazrak</surname>
            ,
            <given-names>M.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Benamara</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>El Kaidi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Es-Saghir</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Housni</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moriceau</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mothe</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , et al.:
          <article-title>Irit at e-risk</article-title>
          .
          <source>In: Proceedings Conference and Labs of the Evaluation Forum CLEF</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Porter</surname>
            ,
            <given-names>M.F.</given-names>
          </string-name>
          :
          <article-title>Readings in information retrieval</article-title>
          .
          <source>chap. An Algorithm for Su x Stripping</source>
          , pp.
          <volume>313</volume>
          {
          <fpage>316</fpage>
          . Morgan Kaufmann Publishers Inc., San Francisco, CA, USA (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Prieto</surname>
            ,
            <given-names>V.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Matos</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alvarez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cacheda</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oliveira</surname>
            ,
            <given-names>J.L.</given-names>
          </string-name>
          :
          <article-title>Twitter: a good place to detect health conditions</article-title>
          .
          <source>PloS one 9</source>
          (
          <issue>1</issue>
          ),
          <year>e86191</year>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Robertson</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zaragoza</surname>
          </string-name>
          , H.:
          <article-title>The probabilistic relevance framework: Bm25 and beyond</article-title>
          .
          <source>Found. Trends Inf. Retr</source>
          .
          <volume>3</volume>
          (
          <issue>4</issue>
          ),
          <volume>333</volume>
          {389 (Apr
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Sadeque</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bethard</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>Uarizona at the clef erisk 2017 pilot task: Linear and recurrent models for early depression detection</article-title>
          .
          <source>In: Proceedings Conference and Labs of the Evaluation Forum CLEF</source>
          . vol.
          <year>1866</year>
          .
          <article-title>NIH Public Access (</article-title>
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Singhal</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Modern information retrieval: A brief overview</article-title>
          .
          <source>Bulletin of the IEEE Computer Society Technical Committee on Data Engineering</source>
          <volume>24</volume>
          (
          <issue>4</issue>
          ),
          <volume>35</volume>
          {
          <fpage>43</fpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Trotzek</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koitka</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Friedrich</surname>
            ,
            <given-names>C.M.:</given-names>
          </string-name>
          <article-title>Linguistic metadata augmented classi ers at the clef 2017 task for early detection of depression</article-title>
          .
          <source>In: Proceedings Conference and Labs of the Evaluation Forum CLEF</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Villatoro-Tello</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ram</surname>
          </string-name>
          rez-de-la
          <string-name>
            <surname>Rosa</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jimenez-Salazar</surname>
          </string-name>
          , H.:
          <article-title>Uams participation at clef erisk 2017 task: Towards modelling depressed bloggers</article-title>
          .
          <source>In: Proceedings Conference and Labs of the Evaluation Forum CLEF</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Villegas</surname>
            ,
            <given-names>M.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Funez</surname>
            ,
            <given-names>D.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ucelay</surname>
            ,
            <given-names>M.J.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cagnina</surname>
            ,
            <given-names>L.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Errecalde</surname>
            ,
            <given-names>M.L.</given-names>
          </string-name>
          :
          <article-title>Lidic - unsl's participation at erisk 2017: Pilot task on early detection of depression</article-title>
          .
          <source>In: Proceedings Conference and Labs of the Evaluation Forum CLEF</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>