<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Temporal Mood Variation: at the CLEF eRisk-2018 Tasks for Early Risk Detection on The Internet</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Waleed Ragheb</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bilel Moulahi</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>J´erˆome Az´e</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sandra Bringay</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maximilien Servajean</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>AMIS, Paule Valery University - Montpellier 3</institution>
          ,
          <addr-line>Montpellier</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>IUT de B ́eziers, University of Montpellier</institution>
          ,
          <addr-line>B ́eziers</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>LIRMM UMR 5506, CNRS, University of Montpellier</institution>
          ,
          <addr-line>Montpellier</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Two tasks are proposed at CLEF eRisk-2018 on predicting mental disorder using Users posts on Reddit. Depression and anorexia disorders are considered to be detected as early as possible. In this paper we present the participation of LIRMM (Laboratoire d'Informatique, de Robotique et de Micro´electronique de Montpellier) in both tasks. The proposed architectures and models use only text information without any hand-crafted features or dictionaries to model the temporal mood variation detected from users posts. The proposed models use two learning phases through exploration of state-of-the-art text vectorization. The proposed models perform comparably to other contributions while experiments shows that document-level outperformed word-level vectorizations.</p>
      </abstract>
      <kwd-group>
        <kwd>Classification</kwd>
        <kwd>Word2vec</kwd>
        <kwd>Doc2vec</kwd>
        <kwd>Temporal Variation</kwd>
        <kwd>MLP</kwd>
        <kwd>Depression</kwd>
        <kwd>Anorexia</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Depression is a common mental disorder. Globally, more than 300 million people
of all age stages suffer from depression [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. It has a direct and indirect effect on
the economic growth because of its major impact on the productivity. Depression
also has dramatic consequences not only for those affected but also for their
families and their social and work related environments [27]. It may be the
psycho-physiological basis for panic and anxiety symptoms. Panic disorder has
been increasingly focused on health services and the media, where it affects young
people aged 20-40. The incidence of these disorders affects 22% of the adult world
population. At its worst consequences, depression is one of the major causes of
suicide [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Another common mental disorder is Anorexia which is described as
an eating disorder. It is characterized by low weight, worry of gaining weight,
and a powerful need to be skinny, leading to food restriction. Many who suffer
from eating disorder see themselves as overweight although they could be thin [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
Individuals with eating disorders have also been shown to have lower employment
rates, in addition to an overall loss of earnings. Eating disorder sufferers who
are experiencing an overall loss in earnings associated with their illness are also
magnified by the excess of health-care costs. According to the National Eating
Disorder Association (NEDA), up to 70 million people worldwide suffer from
eating disorders [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Eating disorder symptoms are beginning earlier in both
males and females. As estimated, 1.1 to 4.2 percent of women suffer from anorexia
at some point in their lifetime [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Young people between the ages of 15 and 24
with anorexia have 10 times the risk of dying compared to their same-aged peers.
      </p>
      <p>
        Social media is becoming increasingly used not only by adults but also at
different age stages. Mental disordered patients also turn to online social media
and web forums for information on specific conditions and emotional support [
        <xref ref-type="bibr" rid="ref7 ref8">8,
7</xref>
        ]. Even though social media can be used as a very helpful tool in changing a
person’s life, it may cause such conflicts that can have a negative impact. This
puts responsibilities for content and community management for monitoring
and moderation. With the increasing number of users and their contents these
operations turn out to be extremely difficult. Many social media try to deal with
this problem by reactive moderation. In reactive moderation, users report any
inappropriate, negative or risky user generated contents. However it may reduce
the workload or the cost of moderating, it is not enough especially for handling
mental disordered user’s threads or posts.
      </p>
      <p>
        Previous researches on social media have established the relationship between
an individual’s psychological state and his\her linguistic and conversational
patterns [
        <xref ref-type="bibr" rid="ref11 ref14">25, 11, 23, 14</xref>
        ]. This motivate the task organizers to initiate the pilot task
for detecting depression from user posts on Reddit1 in eRisk-2017 [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. In
eRisk2018 the extension of the study was planned to include detection of anorexia.
The main idea is to detect such psychological problems from users posts as early
as possible.
      </p>
      <p>In this paper, we present the participation of LIRMM (Laboratoire
d’Informatique, de Robotique et de Micro´electronique de Montpellier) in both
tasks for early detection of depression and anorexia in eRisk-2018. The
originality of our approach is to perform the detection through two main learning
phases using text vectorizations. The first phase uses Bayesian rule inversion
to construct a time series representing temporal mood variation through users
posts. The second phase is to build variable length time series classification
model to obtain the proper decision. The main idea is to give a decision once
the time series prove clear signs of mental disorder from current and previous
mood extracted from the content.</p>
      <p>The rest of the paper is organized as follows. In section 2, a short description
of these tasks is introduced. Then in Section 3, the related work is introduced.
Section 4 describes the training and testing datasets for both tasks. Section 5
presents the experimental setup of the proposed models. In Section 6, the
evalu1 Reddit is an open-source platform where community members (red-ditors) can
submit content (posts, comments, or direct links), vote submissions, and the content
entries are organized by areas of interests (subreddits).
ation results are presented. The conclusions of the experiments and participation
are stated in Section 7.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Tasks Description</title>
      <p>
        In CLEF eRisk 2018, two tasks are presented [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. Both tasks are considered as a
binary classification problem. The first task is to discriminate between depressed
and non-depressed users while the second one is between users diagnosed with
anorexia and non-anorexia. The datasets are a dated textual data of user posts
and comments -posts without titles- on Reddit. The training data is divided into
10 chunks in chronological order. Each chunk contains 10% of the user’s posts.
A description of the datasets for both tasks is presented in Section 4. The goal
is not only to perform classification but also to do it as soon as possible using
minimum amount of data or chunks for each user.
      </p>
      <p>The test datasets also comes with chunks exactly the same way the training
data is divided. The tasks organizers give one week for processing each test
chunk for both tasks before firing a decision. The decision could be one of the
classes or could be postponed for future chunks. At the end of the 10th chunk,
all classification propositions must have been submitted. Each team could only
participate in one or two tasks and submit at most five runs per task.</p>
      <p>
        For evaluation, the classical classification performance measures (Precision,
recall and F1) are computed for each run. In addition error measures called
Early Risk Detection Error (ERDE5,50) are computed. It takes into account the
correctness of the (binary) decision and the delay taken by the system to make
the decision [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ].
      </p>
      <p>The training data was released on November 30th 2017. After more than two
months and on February 6th the first chunk of test data was released. On a
weekly basis a new chunk was being released until the last 10th chunk on April
10th. The evaluation results were announced on 24thApril.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Related Work</title>
      <p>
        Recent psychological studies showed the correlation between person’s mental
status and mood variation over time [
        <xref ref-type="bibr" rid="ref15 ref3 ref4">15, 3, 24, 4</xref>
        ]. It is also evident that some
mental disordered may have chronic week-to-week mood instability. It is a
common presenting symptom for people with a wide variety of mental disorders,
with as many as 8 of 10 patients reporting some degree of mood instability
during assessment. These studies suggest that clinicians should screen for
temporal mood variation across all common mental health disorders [24].
      </p>
      <p>
        Concerning text representation, traditional Natural Language Processing
(NLP) modules starts with extracting some important features from text.
These features could be for example the count or frequency of specific words,
predefined patterns, Part-of-Speech tagging, etc. These hand-crafted features
should be selected carefully and sometimes with an expert view. However these
features are interesting [
        <xref ref-type="bibr" rid="ref5">28, 5</xref>
        ], sometimes they loose the sense of generalization.
Another recent trend is the use of word and documents vectorization methods.
These strategies that convert either words, sentences or even overall documents
into vectors take into account all the text not just parts of it. There are many
ways to transform a text to high-dimensional space such as term frequency
and inverse document frequency (TF-IDF), Latent Semantic Analysis (LSA),
Latent Dirichlet Allocation (LDA), etc [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. This direction was revolutionized
by Mikolov et al. [
        <xref ref-type="bibr" rid="ref20 ref21">20–22</xref>
        ] who proposed the Continuous Bag Of Words (CBOW)
and skip-gram models known as Word2vec. It is a probabilistic based model
that makes use of a two layered neural network architecture to compute the
conditional probability of a word given its context. Based on this work Le et
al. [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] propose Paragraph Vector model. The algorithm which is also known
as Doc2vec learns fixed-length feature representations from variable-length
pieces of texts, such as sentences, paragraphs, and documents. Both word
vectors and documents vectors are trained using stochastic gradient descent
and back-propagation neural network language models. In this paper, we will
use both techniques for text representations.
      </p>
      <p>Concerning mood evaluation, one of the interseting work on text distributed
representation is the bayesian inversion proposed by Taddy in [26]. It uses Bayes
formula to compute the probabilities of a document belonging to a class topic.
Given a document d and label y, Bayes formula is:
p(y|d) =
p(d|y)p(y)</p>
      <p>p(d)</p>
      <p>For classification problems, p(d ) can be ignored since d is fixed. p(d |y) is
estimated by first training the text vectorization model on a subset of the corpus
with label y, then using the skipgram objective composite likelihood as an
approximation. As discussed in [26], bayesian inversion will not always outperform
other classification methods. It rather provides simple, scalable, interpretable
and effective option for classification whenever distributed representations are
used. In this paper, we will use bayesian inversion to construct a time series
representing temporal mood variation.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Datasets</title>
      <p>
        The collection was created as a sequence of XML files, one file per redditor.
Each XML file stores the sequence of the redditor’s submissions (one entry per
submission). Each submission is represented by the submission’s title, the
submission’s text and the submission’s date. No other metadata is available [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. As
mentioned before, the redditor’s submissions come in ten chunks. Each chunk
contains 10% from the overall user submissions. The classification ground truth
(golden truth) is given or expected to be predicted for each user. For eRisk-2018
two groups of datasets are provided for each task. A brief description of these
datasets is presented in this section.
4.1
      </p>
      <p>
        Task-1: Depression Dataset
The depression datasets contain submissions from either depressed or
nondepressed (controlled) users. The classifications was done manually by the e-Risk
organizers. Since the eRisk-2017 pilot task was about the depression
classification, in this task two pre-annotated datasets are provided (Training Dataset
(2017) and testing DataSets (2017)). Some interesting Statistics and summary
for these datasets are provided in Table 1. For eRisk-2018, they provide a test
dataset which approximately doubles the number of users from the previous year
datasets. However it preserves similar characteristics of the two previous datasets
in terms of the number of submissions per users and the length of submissions
in sentences or in words. These numbers are just averages but we clearly note
the existence of very long/short sentences and submissions. Users’ submissions
are either posts or comments. Both are described with the same attributes
(title, text and date) with empty titles for comments. All datasets collections are
unbalanced (more non-depressed users than depressed ones).
The Anorexia datasets are similar to the depression datasets in terms of
structures and attributes. The classification is done for users diagnosed with anorexia
and non-anorexia. eRisk organizers initiated this task for the first time in
eRisk2018. So, there is only one training dataset provided prior to the actual test
set. Table 2 describes some statistics on the anorexia datasets. Because of the
nature of the task and like depression datasets, the anorexia datasets are also
unbalanced.
The temporal aspects of the eRisk-2018 tasks inspired us to model the temporal
mood variation trough user’s text content. The average number of days ranging
from the first submission to the last submission is approximately 600 days [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
So, determining the way in which user’s posts and comments vary from positive
to negative and vise versa through time is worth inspecting. In the proposed
models, time aspects are given as chunks. The main idea is to process user
submissions for each chunk and determine the probability of how positive or
negative the chunk is. The proposed architecture of our models are shown in
Figure. 1.
      </p>
      <p>
        Step 1 - Text Vectorization Module: The input of this module is the list
of textual information divided into ten chunks. The chunks are chronologically
ordered as discussed in Section 4. The first step is to build a text vectorization
model using all the text chunks. Two state-of-the-art text vectorization models
are used. These models are the Word2vec and its evolution, the Doc2vec [
        <xref ref-type="bibr" rid="ref12 ref20">20, 12</xref>
        ].
The two alternatives of Doc2vec specifications, distributed memory (DM) and
distributed bag-of-words (DBOW) are used [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. We also keep track of the text
for each user in every chunk and its label embedded in the model. Also, we built
a vectorization model for positive and for negative and did not use any
external resources. This module can be considered as an unsupervised learning phase.
      </p>
      <p>
        Step 2 - Mood Evaluation Module: Our models are based on the
work of Matt Taddy in [26] about Bayesian inversion. One of the interesting
conclusions from this work is that any distributed representation can be turned
into a classifier through inversion via Bayes rule. In our proposed model, we
segmented the text of each chunk into sentences and scored each sentence
through each vectorization model. The mood of the overall chunk is evaluated
simply by normalizing the count of positive and negative sentences using the
inversion technique. Each chunk will have a number between [
        <xref ref-type="bibr" rid="ref1">0,1</xref>
        ]. This can be
considered as the probability of how positive (risky) the chunk is. Processing all
chunks leads to a ten-points time series for the ten chunks for each user in the
training datasets. Mood evaluation using the inversion technique is considered
as the first learning phase in our proposed architecture.
      </p>
      <p>
        Step 3 - Temporal Modeling Module: Another learning phase is to
build machine learning models to learn some patterns from these time series
to come up with the final classification model. In the ideal case and for the
complete time series, we would have only one model. But since we should not
wait for the complete time series we built multiple models for different sizes
of time series to be able to give a decision without having to wait for the ten
chunks. Figure 2 shows an example of two dimensional representation of the
complete time series for the depression task using t-SNE [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. These time series
will be the training set of the second learning phase. The separation between
positive and negative users is obvious. It is expected that this separation would
not be as ideal as this in testing but it will exists.
      </p>
      <p>Evaluation: For the first task, we used the training set of eRisk-2017 to
train the first and second learning models. The testing set of eRisk-2017 was
the validation set for our experiments before merging together with eRisk-2017
training set to form the final training set. The evaluation was done for the
overall chain of learning phases using the complete ten-points time series. For
the second task, we used the Leave-One-Out (LOO) approach in the training
data set.</p>
      <p>In all the submitted runs for both tasks, only discovered positive users are
reported until the end of the last test chunk in which labeling of negative ones
was done. The rest of this section will be dedicated to describe the submitted
runs for each task. Most of the runs use the same architecture as discussed
earlier.</p>
      <p>Positive
Negative
40
20
0
−20
−40
−20
−10
0
10
20
30
Fig. 2: t-SNE reduced time series information for ten chunks per user in the training
datasets in Task-1
5.1</p>
      <p>Task 1: Early Detection of Signs of Depression</p>
      <p>For document vectorization (Doc2vec), the resultant vectors had 200
dimensions. The model used a context window of 10 words and a minimum of two for
word counts in the text. It used a negative sampling loss with DBOW version
and trained for 20 training epochs. In the word level vectorization, the vector
size of a word had a dimension of 200 with context window size of five words.
Hierarchical softmax was used and a minimum count of two words was
considered.</p>
      <p>In the second learning phase and for temporal modeling, the architecture of
the Multi-layered perceptron (MLP) used had two hidden layers with ten neurons
each. Concerning the Random Forest (RF) classifier, ten estimators were used.</p>
      <p>LIRMMA started to give decisions from the eighth week. Positive users were
reported for those with classification probability higher than 0.6. LIRMMB
started to discover positive users from week five. The classification
probability threshold varied from 1.0 in week five to 0.6 in the tenth week. LIRMMC
started to give predictions from the third week. Since the small number of points
in the time series from week three to five, we fixed the classification threshold
to 1.0 and like LIRMMB, it started to reach 0.6 in the last week.</p>
      <p>Hence we proposed LIRMMD and LIRMME to give a decision from the
first week, we substitute the second learning phase with a window moving
average from the output of the inversion technique. For LIRMMD, we assumed
the positive users will have risky mood in the first chunks than the lasts. Two
varying thresholds were used; one for the number of sentences and the other
for the positive probability threshold. The size of averaging window is three
and the probability changing from 0.6 with number of sentences higher than
100 to 0.8 and zero for sentences count threshold. For LIRMME, the difference
comes from the assumptions that higher probability threshold was given to last
weeks chunks than the first weeks with the same sentences thresholds. The risk
probability starts with 0.8 in the first week to 0.6 in the last week.
In this section, we highlight the difference between the proposed runs for anorexia
second task. The steps are summarized in Table 4.</p>
      <p>For text vectorization in document or word level, we used the same
parameters as for the first task. The classification probability threshold for positive
user was set to 0.8 in LIRMMA while in LIRMMB it was 0.6. For LIRMMC,
LIRMMD and LIRMME, the same thresholds were used exactly as the same
corresponding runs in the first task.
6
6.1</p>
    </sec>
    <sec id="sec-5">
      <title>Results</title>
      <p>Evaluation Results
Upon the submission of the last chunk, the evaluation process started for all runs
results. As mentioned in Section 2, the tasks organizers use the two versions of
ERDE in addition to the classical classification measures: Precision(P), Recall
(R) and F1-Measure (F1). Tables 5 and 6 show the formal evaluation of all
proposed runs for both tasks.</p>
      <p>P</p>
      <p>R
From the first look of the results, it is clear that document level vectorization
behaves better than word level vectorization. But all runs using Doc2vec
(LIRMMA for task-1 and LIRMMA &amp; LIRMMB for task-2) started giving their
decisions later than others. Also, runs that gave decisions from the first week
give better ERDE with cut-off of 50 for both tasks. The runs with higher
recall use word level vectorization with either MLP or RF as the second learning
phase. In mode evaluation step, fake stories were misleading and made a lot of
false positive predictions. In addition, our models does not discriminate between
user posts and comments (posts without titles) which could be beneficial for
evaluating user mood.</p>
      <p>For some at-risk users, first chunks posts don’t have any proof of depression
or anorexia and suddenly users started to express their status late. For the second
learning phase, the model classify the overall mood time series and late signs
of disorders could not be predicted by our models. So, in some runs (for both
tasks) some moderation on the proposed assumptions (classification probability
thresholds) are needed.</p>
      <p>Tables 7 and 8 show some statistics of all submitted runs compared to the
proposed models. The ranking of the best run for each evaluation metric is
also included. The statistics of the depression task are for 45 runs of 11 teams.
The anorexia task statistics on results are for 34 runs of 9 teams. Most of the
teams have participated in both tasks with at least one run for both. However
the proposed architecture does not include any hand-crafted features or any
attention signals, it seems to be comparable with other contributions for both
tasks. The proposed models and runs act better for depression task than for
anorexia task. The main reasons are that our models heavily depend on the
data size and the leakage of anorexia data especially for positive users which
was clear.
In this paper we present the participation of LIRMM in the CLEF eRisk-2018
tasks. Both tasks are for early detection of signs of depression and anorexia
from users posts on Reddit. We proposed five runs for each task and the results
are, to some extent, interesting and comparable to other contributions. The
proposed framework architecture used the text without any handcrafted features.
It performs the classification through two phases of supervised learning using text
vectorization methods. The first learning phase builds a time series representing
the mood variation using the bayesian inversion technique. The second learning
phase is a classification model that learns patterns from these time series to
detect early signs of such mental disorders.</p>
      <p>
        Predicting at-risk users once at the end for all runs was a good idea as
some users expressed their disorders in the last chunks. But, some unreal stories
in the firsts posts pushed the proposed runs to make a lot of false positive
decisions. However the proposed architecture only uses the text blindly, adding
some dictionary features or attention signals might have helped [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. Another
way is to build the second learner model much deeper using Convolution Neural
Networks (CNN) [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] or Recurrent Neural Networks (RNN) architectures [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
      </p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments References</title>
      <p>We would like to acknowledge La R´egion Occitanie and l’Agglom´eration B´eziers
M´editerran´ee which finance the thesis of Waleed Ragheb.
22. Mikolov, T., Yih, S.W.t., Zweig, G.: Linguistic regularities in continuous space word
representations. In: Proceedings of the 2013 Conference of the North American
Chapter of the Association for Computational Linguistics: Human Language
Technologies (NAACL-HLT-2013). Association for Computational Linguistics (2013)
23. Moulahi, B., Az´e, J., Bringay, S.: Dare to care: A context-aware framework to track
suicidal ideation on social media. In: Bouguettaya A. et al. (eds) Web Information
Systems Engineering - WISE 2017.,Lecture Notes in Computer Science,. Springer,
Cham. vol. 10570 (2017)
24. Patel, R., Lloyd, T., Jackson, R., Ball, M., Shetty, H., Broadbent, M., Geddes,
J.R., Stewart, R., McGuire, P., Taylor, M.: Mood instability is a common feature
of mental health disorders and is associated with poor clinical outcomes. In: BMJ
Open. vol. 5. British Medical Journal Publishing Group (2015)
25. Paul, M.J., Dredze, M.: You are what you tweet: Analyzing twitter for public
health. In: Adamic, L.A., Baeza-Yates, R.A., Counts, S. (eds.) ICWSM. The AAAI
Press (2011)
26. Taddy, M.: Document classification by inversion of distributed language
representations. In: CoRR. vol. abs/1504.07295 (2015)
27. Trautmann, S., Rehm, J., Wittchen, H.: The economic costs of mental disorders:
Do our societies react appropriately to the burden of mental disorders? In: EMBO
reports. vol. 17, pp. 1245–1249 (2016)
28. Trotzek, M., Koitka, S., Friedrich, C.: Linguistic metadata augmented classifiers
at the clef 2017 task for early detection of depression. In: Working Notes of CLEF
2017 - Conference and Labs of the Evaluation Forum. vol. CEUR-WS 1866 (2017)</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <article-title>1. The national eating disorders association (NEDA).: Envisioning a world without eating disorders</article-title>
          .
          <source>In: The newsletter of the National Eating Disorders Association. Issue</source>
          <volume>22</volume>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. World Health Organization.
          <article-title>: Depression and other common mental disorders: global health estimates</article-title>
          . In: World Health Organization. http://www.who.int/iris/handle/10665/254610. License:
          <article-title>CC BY-NC-SA 3.0 IGO (</article-title>
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Bonsall</surname>
            ,
            <given-names>M.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wallace-Hadrill</surname>
            ,
            <given-names>S.M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Geddes</surname>
            ,
            <given-names>J.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goodwin</surname>
            ,
            <given-names>G.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Holmes</surname>
            ,
            <given-names>E.A.</given-names>
          </string-name>
          :
          <article-title>Nonlinear time-series approaches in characterizing mood stability and mood instability in bipolar disorder</article-title>
          .
          <source>In: Proceedings of the Royal Society of London B: Biological Sciences</source>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Glenn</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Monteith</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>New measures of mental state and behavior based on data collected from sensors, smartphones, and the internet</article-title>
          .
          <source>In: Current Psychiatry Reports</source>
          . vol.
          <volume>16</volume>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Hayda</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Antoine</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marie-Jean</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Detecting early risk of depression from social media user-generated content</article-title>
          .
          <source>In: Working Notes of CLEF 2017 - Conference and Labs of the Evaluation Forum</source>
          . vol.
          <source>CEUR-WS</source>
          <year>1866</year>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Hoek</surname>
          </string-name>
          , H.:
          <article-title>Review of the worldwide epidemiology of eating disorders</article-title>
          .
          <source>In: Current Opinion in Psychiatry</source>
          . vol.
          <volume>29</volume>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Jadayel</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Medlej</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jadayel</surname>
            ,
            <given-names>J.J.:</given-names>
          </string-name>
          <article-title>Mental disorders: A glamorous attraction on social media? In: Journal of Teaching and Education</article-title>
          . vol.
          <volume>07</volume>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8. John, N.,
          <string-name>
            <surname>Kelly</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gregory</surname>
            .,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jrgen</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lisa</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stephen</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Exploring opportunities to support mental health care using social media: A survey of social media users with mental illness</article-title>
          . In: Early Intervention in Psychiatry (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Joyce</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Sulkowski</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.:</surname>
          </string-name>
          <article-title>The diagnostic and statistical manual of mental disorders: Fifth edition (dsm-5) model of impairment</article-title>
          .
          <source>In: Assessing Impairment: From Theory to Practice</source>
          . pp.
          <fpage>167</fpage>
          -
          <lpage>189</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Kalchbrenner</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grefenstette</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blunsom</surname>
            ,
            <given-names>P.:</given-names>
          </string-name>
          <article-title>A convolutional neural network for modelling sentences</article-title>
          .
          <source>In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics</source>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Kishaloy</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lahari</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kan</surname>
          </string-name>
          , M.Y.:
          <article-title>Modeling temporal progression of emotional status in mental health forum: A recurrent neural net approach</article-title>
          .
          <source>In: Proceedings of the 8th Workshop on Computational Approaches</source>
          to Subjectivity,
          <article-title>Sentiment and Social Media Analysis</article-title>
          . pp.
          <fpage>127</fpage>
          -
          <lpage>135</lpage>
          . Association for Computational Linguistics (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Le</surname>
            ,
            <given-names>Q.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Distributed representations of sentences and documents</article-title>
          .
          <source>In: ICML. JMLR Workshop and Conference Proceedings</source>
          , vol.
          <volume>32</volume>
          , pp.
          <fpage>1188</fpage>
          -
          <lpage>1196</lpage>
          . JMLR.org (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <given-names>Leite</given-names>
            <surname>Barroso</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          , Lucena Grangeiro Maranho,
          <string-name>
            <surname>T.</surname>
          </string-name>
          ,
          <article-title>Melo teixeira batista</article-title>
          , H.,
          <string-name>
            <surname>Pereira de Brito Neves</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          , Farias de Oliveira, G.:
          <article-title>Social panic disorder and its impacts</article-title>
          .
          <source>In: Amadeus International Multidisciplinary Journal</source>
          . vol.
          <volume>2</volume>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>17</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Losada</surname>
            ,
            <given-names>D.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Crestani</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>A test collection for research on depression and language use</article-title>
          .
          <source>In: Conference Labs of the Evaluation Forum</source>
          . pp.
          <fpage>28</fpage>
          -
          <lpage>39</lpage>
          . Springer (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Losada</surname>
            ,
            <given-names>D.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Crestani</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parapar</surname>
          </string-name>
          , J.: erisk 2017:
          <article-title>Clef lab on early risk prediction on the internet: Experimental foundations</article-title>
          .
          <source>In: 8th International Conference of the CLEF Association</source>
          . pp.
          <fpage>346</fpage>
          -
          <lpage>360</lpage>
          . Springer Verlag (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Losada</surname>
            ,
            <given-names>D.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Crestani</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parapar</surname>
          </string-name>
          , J.:
          <article-title>Overview of eRisk - Early Risk Prediction on the Internet</article-title>
          .
          <article-title>In: Experimental IR Meets Multilinguality, Multimodality, and Interaction</article-title>
          .
          <source>Proceedings of the Ninth International Conference of the CLEF Association (CLEF</source>
          <year>2018</year>
          ). Avignon, France (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Ma</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peng</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cambria</surname>
          </string-name>
          , E.:
          <article-title>Targeted aspect-based sentiment analysis via embedding commonsense knowledge into an attentive LSTM</article-title>
          .
          <source>In: Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence</source>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Maas</surname>
            ,
            <given-names>A.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Daly</surname>
            ,
            <given-names>R.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pham</surname>
            ,
            <given-names>P.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ng</surname>
            ,
            <given-names>A.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Potts</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Learning word vectors for sentiment analysis</article-title>
          .
          <source>In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1</source>
          . pp.
          <fpage>142</fpage>
          -
          <lpage>150</lpage>
          . HLT '
          <volume>11</volume>
          ,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computational Linguistics, Stroudsburg, PA, USA (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Van der Maaten</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hinton</surname>
          </string-name>
          , G.:
          <article-title>Visualizing data using t-SNE</article-title>
          .
          <source>In: Journal of Machine Learning Research</source>
          . vol.
          <volume>9</volume>
          , pp.
          <fpage>2579</fpage>
          -
          <lpage>2605</lpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corrado</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dean</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Efficient estimation of word representations in vector space</article-title>
          .
          <source>In: CoRR</source>
          . vol.
          <source>abs/1301</source>
          .3781 (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sutskever</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corrado</surname>
            ,
            <given-names>G.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dean</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Distributed representations of words and phrases and their compositionality</article-title>
          .
          <source>In: Advances in Neural Information Processing Systems</source>
          <volume>26</volume>
          . pp.
          <fpage>3111</fpage>
          -
          <lpage>3119</lpage>
          . Curran Associates, Inc. (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>