<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>First Insights on a Passive Ma jor Depressive Disorder Prediction System with Incorporated Conversational Chatbot</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Computer Science</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Engineering</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Chalmers University of Technology</string-name>
          <email>Fionnd@student.chalmers.se</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Hong Kong University of Science and Technology</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Insight Centre for Data Analytics, Data Science Institute, National University of Ireland</institution>
          ,
          <addr-line>Galway</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Gothenburg</institution>
          ,
          <country country="SE">Sweden</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Almost 50% of cases of major depressive disorder go undiagnosed. In this paper, we propose a passive diagnostic system that combines the areas of clinical psychology, machine learning and conversational dialogue systems. We have trained a dialogue system, powered by sequence-to-sequence neural networks that can have a real-time conversation with individuals. In tandem, we have developed speci c machine learning classi ers that monitor the conversation and predict the presence or absence of certain crucial depression symptoms. This would facilitate real-time instant crisis support for those su ering from depression. Our evaluation metrics have suggested this could be a positive future direction of research in both developing more human like chatbots and identifying depression in written text. We hope this work may additionally have practical implications in the area of crisis support services for mental health organisations.</p>
      </abstract>
      <kwd-group>
        <kwd>Depression</kwd>
        <kwd>Social Media</kwd>
        <kwd>Conversational Chatbot</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Each year, 300 million individuals worldwide will su er a major depressive
episode lasting a minimum of two weeks [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Furthermore, less than 50% will be
correctly diagnosed and o ered appropriate treatment. Of those left untreated,
MDD can lead to suicide, an estimated 800,000 people a year lose their life to
suicide [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. One particular issue which contributes to the low diagnostic and
treatment rates is how the etiology of depression can directly interfere with an
individual seeking treatment.
      </p>
      <p>
        Since many mental disorders are not characterised by clear changes in
external or physical appearance, detection and diagnosis become more challenging.
In the case of depressive disorders, individuals are often not aware that their
symptoms are due to a medical disorder and often attribute them to poor mood
or external factors [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. To make this increasingly more complex, MDD often also
negatively a ects an individual's social interactions. This can hamper an
individual to actually seek professional support or talk about their experiences. This
presents a unique challenge in the medical community, in how to identify and
support individuals to come forward for diagnosis.
      </p>
      <p>In this work, we propose the concept of passive diagnosis, a term for a
new eld of research seen over the last two or three years. This work is not
exclusive to the research of mental health disorders, but concerns itself with using
machine learning techniques for predicting a potential future medical disorder. In
comparison to the traditional concept of active diagnosis, where an individual
su ering certain symptoms would actively seek out a medical diagnosis, this
process can now be facilitated by adding a passive element. Unlike a medical
professional who has limited time and resources, it is feasible to have machine
learning algorithms constantly passively observe an individual's health. Once
these algorithms detect certain changes in an individual's health that might
be indicative of a disorder, the algorithm can inform the individual and an
appropriate human professional for further investigation.</p>
      <p>
        An example of this application is DeepCare, developed by a research team
in Google [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. This end to end application is designed to diagnose a wide range
of disorders. This eld of work allows medical professionals to actively provide
interventions to those at high risk before the disorder even sets in. Our work can
be considered to follow a similar trend, where we propose a passive diagnostic
approach to MDD.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <sec id="sec-2-1">
        <title>2.1 Depression Detection</title>
        <p>
          As far back as 1901, Sigmund Freud proposed that language could give us an
insight into certain mental illnesses [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. A combination of NLP and psychology
led to a number of publications investigating how di erent mental illnesses such
as bipolar disorder [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], MDD [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] and anorexia [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] can manifest and be predicted
through an individual's speci c use of certain language characteristics. Examples
of this work include higher counts of the word "I", and lower counts of future
temporal words in depressed student's written notes [6{8]. Penenbaker explains
how many of these characteristics are consistent with the etiology of MDD [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ].
        </p>
        <p>
          This work led to di erent researchers in the machine learning community
investigating if there was su cient basis for classi ers to distinguish between
individuals su ering certain mental disorders and those not [
          <xref ref-type="bibr" rid="ref5 ref7">5, 7</xref>
          ]. Although
this body of work has a solid machine learning background and high
evaluation scores, from our perspective its practical medical application is limited. All
the approaches look at MDD as a binary outcome variable, predicting at time
x, if an individual positively diagnoses for MDD or not.
        </p>
        <p>
          We understand how this approach makes sense given that many elementary
machine learning classi ers perform best when predicting a simple binary
outcome. From the perspective of a medical professional, however, we can rarely
place individuals into binary classes. MDD is de ned by a speci c combination
of nine symptoms, the presence or absence of certain symptoms can have
dramatically di erent e ects on the diagnosis [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. For many professionals, MDD will be
viewed as a spectrum, with individuals falling from low risk to high risk [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. This
limitation has also been noted by [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] whose proposed solution was recording
the speci c mention of certain symptoms in online text.
        </p>
        <p>Following consultation with medical professionals, we decided to overcome
this limitation by building a host of separate classi ers that work on a
symptomatic level. The DSM-V lists nine di erent symptoms that can be present
during the occurrence of MDD. We propose that ve of these symptoms can be
reasonably detected to some degree through an online human-computer
interaction. These targeted symptoms are depressed mood most of the day, weight
change not attributed to dieting, sleep change characterised by insomnia or
hypersomnia, inappropriate guilt, and suicidal ideation. We developed ve separate
classi ers and allow medical professionals to make an overall diagnosis based on
the results of the classi ers and their own domain expertise.
2.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Conversation Chatbots</title>
        <p>
          Recent approaches to building conversational chatbot systems are dominated
by the usage of neural networks. The authors of [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] present an approach for
conversational modelling, which uses a sequence-to-sequence neural model. Their
model predicts the next sentence given the previous sentences for an IT
helpdesk domain, as well as for an open-domain trained on a subtitles dataset. For an
open-domain dialogue generation, [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] propose an adversarial training approach
utilising reinforcement learning to produce sequences that are indistinguishable
from human-generated dialogue utterances.
        </p>
        <p>
          A heuristic that guides the development of neural baseline systems for the
extractive conversational chatbot task is described in [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. Their system, called
FastQA, demonstrates good performance, due to the awareness of question words
while processing the context. [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] demonstrate an approach to non-factoid answer
generation with a separate component, which is based on bidirectional LSTMs
to determine the importance of segments in the input.
        </p>
        <p>
          The increased use of social media as a communication tool between customers
and brands has allowed for the development of these systems to handle
realtime inquiries [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. In addition, increasingly, mental health support services are
incorporating real-time communication tools such as texting and social media
messaging as methods for individuals to talk to a counselor [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. Some work
has already explored the possibility of building conversational chatbots that
emulate a counselor, this work makes use of both audio, visual and text-based
interactions [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ].
3
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Experimental Settings</title>
      <p>In this section, we give an overview of data collection techniques employed and
the feature extraction methods used for each of our ve classi ers. Additionally,
we outline the process employed for developing our conversational system.
3.1</p>
      <sec id="sec-3-1">
        <title>Sequence-to-Sequence Neural Network Toolkit</title>
        <p>
          To train the conversational system, we use the OpenNMT toolkit [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ], which
is a generic deep learning framework mainly specialising in sequence-to-sequence
(seq2seq) models and covers a variety of tasks. We used the default neural
network training parameters, i.e. 2 hidden layers, 500 hidden bidirectional LSTM4
units, input feeding enabled, batch size of 64, 0.3 dropout probability and a
dynamic learning rate decay.
        </p>
        <p>
          Our data was composed of 13,053,384 million question-answer pairs, 6,237,118
of which were obtained from the subreddit /r/AskReddit (see examples in Table
1). Subreddit submissions were considered as questions and the rst reply as an
answer. The remaining pairs were extracted from the OpenSubtitles dataset [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ].
The conversational model was trained for 13 epochs, which was further tuned
on a selected extraction from the eRisk corpus (cf. 3.2).
Data: We consider the existing work published in the eRisk proceedings to have
been inadvertently focused on the symptom of depressed mood. We were
provided with the eRisk task training set created by [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ] consisting of comments and
submissions from 486 Reddit users, of which 83 users were labelled as su ering
depression. We considered each submission or comment as a single data point
labelled as depressed or non-depressed. This led to a collection of 307,065 data
points, of which 10.74% had the depressed label.
        </p>
        <p>
          Linguistic based features: Five di erent groups of linguistic features were
included, the rst of which is the Linguistic Inquiry and Word Count lexicon
(LIWC), which is commonly used in the eRisk task [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ]. This lexicon scores
78 di erent linguistic features related to social, clinical, health and cognitive
psychology. Scores are percentages of words in a text that re ect a given
emotion, scaled between 1 and 0, where 1 indicates all words in a sentence re ect
a given emotion [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ]. The authors of [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ] found that the use of activation and
dominance sentiment characteristics to be a strong predictor of MDD in their
Twitter dataset [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ].
        </p>
        <sec id="sec-3-1-1">
          <title>4 LSTM - long short-term memory</title>
          <p>
            The Warriner lexicon [
            <xref ref-type="bibr" rid="ref23">23</xref>
            ] contains a more detailed analysis of valence,
activation and dominance scores for words grouped by male, female and overall,
providing a total of nine features. The NRC A ect Intensity Lexicon contains
four di erent emotion scores: anger, fear, joy, sadness [
            <xref ref-type="bibr" rid="ref24">24</xref>
            ], while the SenticNet 5
lexicon provides polarity and intensity in combination with one of aptitude,
attention, pleasantness or sensitivity [
            <xref ref-type="bibr" rid="ref25">25</xref>
            ]. All features above were included, by
calculating the mean word score of a post. Drawing on the work of
psycholinguistics, we included the additional two features, counts of the personal pronoun
"I" and the Flesch Kincaid readability scores [
            <xref ref-type="bibr" rid="ref6">6</xref>
            ].
          </p>
          <p>
            Text embedding: All of our classi ers employ the same text embedding
approach, which draws on the work of [
            <xref ref-type="bibr" rid="ref7">7</xref>
            ] and utilized Doc2Vec [
            <xref ref-type="bibr" rid="ref26">26</xref>
            ]. The
authors of [
            <xref ref-type="bibr" rid="ref6">6</xref>
            ] found that in the context of Reddit data, using pre-compiled word
embeddings had signi cantly lower performance compared with training the
embeddings on their own Reddit data directly. A comparison on our part using
fastText [
            <xref ref-type="bibr" rid="ref27">27</xref>
            ] found a similar situation.
          </p>
          <p>
            In summary, the approach consists of mapping each word to a unique
multidimensional vector and trying to predict the next word in the sentence. The
Doc2Vec approach also maps each paragraph to a unique vector. Both vectors
are then concatenated to predict the next word in a context. A number of
different variations of this approach have been proposed [
            <xref ref-type="bibr" rid="ref26">26</xref>
            ]. We compared a set
of these approaches on error rate based on a logistic regression model trained on
the paragraph embeddings.
          </p>
          <p>
            In our training, the approach with the lowest error rate was a combination
of the Distributed Bag of Words (DBOW) and Distributed Memory (DM). This
combination has been proposed as an optimal method by [
            <xref ref-type="bibr" rid="ref26">26</xref>
            ]. It encompasses
the DM method, which is explained above and a DBOW version of Doc2Vec,
where the word vector is dropped and instead forces the model to predict words
randomly sampled from the paragraph vector by using a sliding window.
Parameters for both algorithms includes discounting all words which occurred less than
twice in the corpus, 20 epochs and a nal vector output of size 100. The two
100-dimensional vector outputs were concatenated to a joint 200-dimensional
vector.
3.3
          </p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>Suicidal Ideation Classi er</title>
        <p>
          Data: A limited number of publications have gained access to small collections of
suicide notes and performed some basic linguistic analysis on them [
          <xref ref-type="bibr" rid="ref28 ref29">28, 29</xref>
          ]. Our
approach is based around a public subreddit titled /r/SuicideNotes (SN). This
subreddit describes itself as "A location to immortalize your nal words, or read
the last words that others have written down." and contains 1210 submissions as
of the end of August 2018.
        </p>
        <p>
          To enhance the validity of this data, we drew upon the work of [
          <xref ref-type="bibr" rid="ref30">30</xref>
          ] who
identi ed users with suicidal tendencies by applying a time series approach to
users posting on Reddit. Our approach consisted of selected each user (738)
who had posted a note on SN, and selected all their historical posts using the
complete Reddit dataset [
          <xref ref-type="bibr" rid="ref31">31</xref>
          ]. We began by removing users who only had a single
post ever (throwaway accounts), we then only selected users whose last post ever
was on SN. This gave us a list of 112 users who met the following two conditions,
i.e., (i) history of posting to Reddit and (ii) had posted to SN and had never
interacted with Reddit again.
        </p>
        <p>
          A total of 1,502 Reddit posts labelled as having come from a suicidal user
where extracted. We randomly extracted 1,500 more posts from Reddit to use
as a control group. To further validate the nature of the posts, we refer to the
work of [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ] on syntactic features associated with suicide notes. Notably, we
would expect to see a higher account of the personal pronoun "I" in suicide
notes compared to control posts, which we do see in Figure 1.
        </p>
        <p>Count of i
e
trua Present
e
ift
c
c
tna Achievement
y
S</p>
        <p>Negative</p>
        <p>Group</p>
        <p>Suicide
Normal
0.08
0.00
0.02 0.04 0.06</p>
        <p>Percentage occurrence within the text</p>
      </sec>
      <sec id="sec-3-3">
        <title>Feature creation &amp; text embedding: For each post, we extracted the</title>
        <p>following features: 78 linguistic features related to psychology extracted using
the LIWC lexicon. We found the optimal Doc2Vec text embedding approach to
be a single DBOW method with a setup of 100 dimensions, a minimum word
occurrence count of two and 20 epochs.
3.4</p>
      </sec>
      <sec id="sec-3-4">
        <title>Insomnia / Hypersomnia Classi er</title>
        <p>Data: Our dataset for this classi er is all posts on the /r/Insomnia subreddit,
which describes itself as "Posts and discussion about insomnia and sleep
disorders.". We collected 40,000 posts from /r/Insomnia and an additional 40,000
random posts from /r/AskDocs a subreddit focusing on general medical related
questions. In the results stage, we nd an abnormally high degree of accuracy
suggesting over tting of the data. We compensated this by adding 20,000
randomly selected Reddit posts to each class as noise.</p>
      </sec>
      <sec id="sec-3-5">
        <title>Feature creation &amp; text embedding: We followed a similar approach to</title>
        <p>the suicidal ideation classi er, for each post we extracted 78 LIWC linguistic
features. Text embedding was Doc2Vec with a DBOW approach providing the
lowest error rate. Our total dataset was thus 120,000 posts, each of which had
178 features.
3.5</p>
      </sec>
      <sec id="sec-3-6">
        <title>Weight Change Classi er</title>
        <p>Data: Since our conversational system is only designed as an initial classi cation
approach, we are expecting this classi er will give an indication (positive class)
that the individual is talking about weight change or in the case of a negative
class prediction, the individual has made no mention of aspects related to weight
change. Data was collected from the r/Loseit subreddit, a community dedicated
to weight change. 80,000 posts were collected and labelled as belonging to the
positive class and another 80,000 from the /r/AskDocs subreddit and labelled as
belonging to the negative class. Additional 40,000 posts were randomly allocated
between classes to prevent over tting and simulate noise.</p>
      </sec>
      <sec id="sec-3-7">
        <title>Feature creation &amp; text embedding: The exact same process that we</title>
        <p>applied to our sleep classi er was taken here. A 178 feature space was created
with 100 Doc2Vec DBOW approach and 78 LIWC features.
3.6</p>
      </sec>
      <sec id="sec-3-8">
        <title>Excessive or Inappropriate Guilt</title>
        <p>
          Approach: We found no existing research published on the syntactic features
associated with inappropriate guilt in the English language, additionally we found
no speci c way to isolate guilt related data on Reddit to use. The developers of
Linguistic Inquiry with Word Count (LIWC) suggested that guilt can be
recognised in certain cases from a combination of negative emotions and anxiety [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ].
Both of these are features extractable with LIWC and their sum can serve as a
noisy proxy for guilt in a post. We presented this count directly as an indicator
of guilt and performed no further modelling.
4
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Methodology</title>
      <p>In this section, we give an overview of the algorithms employed in the
development of our classi er.
4.1</p>
      <sec id="sec-4-1">
        <title>Depressed Mood Classi er</title>
        <p>
          The primary issue a ecting the developed of this classi er was uneven class
balance (majority class = 89%). We overcame this issue by applying a synthetic
minority oversampling technique (SMOTE) [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ]. This approach consists of
oversampling the minority class to create an arti cial dataset with an even class
balance. We then applied a Random Forest classi er tuned on a grid search
method (class weight= 'balanced subsample', bootstrap= 'false', criterion=
'entropy',n estimators= 9), all features underwent standard scaling.
4.2
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>Suicidal Ideation Classi er</title>
        <p>
          This model incorporated a logistical regression classi er. Our choice of this
approach was due to its percentage outcome. As [
          <xref ref-type="bibr" rid="ref33">33</xref>
          ] suggests, rather than
considering binary outcome as two independent events, we can consider the outcome as
an unobserved continuous variable. In this case, the propensity of an individual
to attempt suicide. We applied an L2 penalty, with balanced class weights and
all features underwent standard scaling.
        </p>
        <p>Threshold values Threshold values allow for distinction of binary classes
when working on a continuous scale. The allocation of a threshold value is often
considered important in medical literature where there might be a consideration
to knowingly over or under predict certain classes. The most naive approach is
often maximizing the area under the curve when sensitivity is plotted against
1-speci city. Although this can give the most balanced class allocation, we would
consider reducing false negative predictions to be of reasonable importance. To
do this, we chose to set a speci city value of 0.95 which allocates us a threshold
value of 0.55 sensitivity value of 0.61 and a Youden's index of 0.50.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3 Sleep Change &amp; Weight Change Classi ers</title>
        <p>Both classi ers employed a logistic regression algorithm with L2 penalization.
Independent grid searches were applied but resulted in the same set of optimized
parameters. These were intercept scaled to 1, balanced class weight and the Scikit
Learn C parameter which is the inverse of the regularization strength was set to
1 as well.</p>
        <p>Threshold values &amp; scoring Although false negative predictions for sleep
or weight change is not as serious as missing attempts at suicide, we still
optimized the threshold value of the logistic regression by setting the speci city
value to 0.95. Cuto values for the sleep change and weight change classi ers
are 0.72, and 0.74 respectfully.</p>
      </sec>
      <sec id="sec-4-4">
        <title>4.4 Conversational System Interface Design</title>
        <p>A demo interface5 (Figure 2) was developed which combined the conversational
model and classi ers. The interface consisted of a text eld where users can write
a comment which is sent to the conversational model. The reply to this input is
shown on the screen to the user.
In this section, we begin by reporting on the metric evaluation scores acquired
during the training of the models. The second section reports on the overall
evaluation process we employed in the project.</p>
        <sec id="sec-4-4-1">
          <title>5 http://server1.nlp.insight-centre.org/marvin/demo.html</title>
        </sec>
      </sec>
      <sec id="sec-4-5">
        <title>Classi er Evaluation</title>
        <p>Metric scores during training for each of the four classi ers we developed are
presented in table 2. In all cases, the scores presented are mean scores following
10 fold cross validation performance on a withheld test set composed of a
random 20% sample of the original dataset. The depressed mood classi er however
employed the SMOTE balanced subsample approach.
We recruited seven participants as a convenience sample. All participation was
anonymous and voluntary, no demographic details were collected. Initially,
participants were instructed to have a short interaction with the conversational
system. Beginning with answering the question "How are you?". Participants
could end the conversation at any stage by exiting out of the conversation, but
were asked to try and hold the conversation for at least 20 messages.</p>
        <p>
          The following step of the evaluation was to establish ground truths.
Participants were asked to complete the Beck's Depression Inventory-II [
          <xref ref-type="bibr" rid="ref34">34</xref>
          ], which is
a 21 item multiple response questionnaire that ranks participants on a scale of
no indication of MDD to an strong indication of MDD. The advantage of the
Beck's Inventory is that it is a short and highly standardised instrument that
has seen applications across a wide range of research studies [
          <xref ref-type="bibr" rid="ref35">35</xref>
          ].
5.3
        </p>
      </sec>
      <sec id="sec-4-6">
        <title>Overall Project Evaluation Results</title>
        <p>We propose two metric evaluation approaches for our project, (i) evaluates
classi ers individually, (ii) overall evaluation. For the individual classi er evaluation,
we established ground truths by dividing each question on the Beck's Inventory
to a speci c symptom it was investigating as per table 3. A score above two on a
question was considered a positive score (presence of a symptom), and the same
if a classi er returned a mean prediction score above it's respective threshold
value. Row two, three and four in table 3 provides the metric scores for each
classi er6.</p>
        <p>The second evaluation process, which investigated the overall accuracy was
computed by considering a score above 19 on the Beck's Inventory as ground
truth presence of an MDD. If two out of the four classi er provided a
positive prediction, this was considered an overall positive prediction of depression.
Results are presented in the rst row of table 3.
6</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusions</title>
      <p>Over the course of the above text, we initially began by demonstrating why
MDD is worthy of study, and how passive diagnoses is an important future
issue. Our proposed approach builds on that of the existing machine learning
communities contributions to MDD, whereby our methodology is the rst to view
MDD prediction on a symptomatic level. In addition to a theoretical proposal,
we hope our work may lead to a future practical application.</p>
      <p>
        Within the scope of the work, we note two key limitations that must be
addressed in our future works. Initially, the ever-present problem of suitable data
presents itself. The authors of [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] explores the advantages and disadvantages
with regards to di erent depression related data collection approaches. In all
cases, none of our labelled data actually employs medical diagnoses. Therefore,
we can not be completely con dent our labelled data is representative of the
actual disorder. Our second limitation concerns that of the evaluation stage of
the project. We accept that a sample size of seven individuals is quite limited in
its evaluation scope. Nevertheless, we feel this is su cient as a proof of concept,
and despite these two limitations we have demonstrated a future direction for
research.
      </p>
      <p>With regards to interpreting the results from our evaluation stage, for the
sleep change classi er, we see low precision and high recall scores indicating the
possibility that the threshold value has been set to high. No one in our sample
indemni ed as having the presence of suicidal ideation or negative weight change,
and respectfully our classi ers did not predict any false positives in these cases.
In nal conclusion, within the scope of our limited sample size, we are positive
regarding the results of four of our classi ers, and suggest a revaluation of the
threshold value assigned to one. Ultimately, we hope to see that individuals
6 Neither Suicide ideation (Question 9) nor weight change (Question 18) are included
as all scores are equal to one.
su ering an MDD episode will no longer su er alone but rather will have more
rapid and easy access to diagnostic services and thus receive support in a timely
manner.</p>
      <p>Acknowledgement. This publication has emanated from research conducted
with the nancial support of Science Foundation Ireland (SFI) under Grant
Number SFI/12/RC/2289 (Insight).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1. WHO:
          <article-title>Depression fact sheet (</article-title>
          <year>2018</year>
          ), http://www.who.int/news-room/factsheets/detail/depression
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Belmaker</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Agam</surname>
          </string-name>
          , G.:
          <article-title>Major depressive disorder</article-title>
          .
          <source>New England Journal of Medicine</source>
          <volume>358</volume>
          (
          <issue>1</issue>
          ),
          <volume>55</volume>
          {
          <fpage>68</fpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Pham</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tran</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Phung</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Venkatesh</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Deepcare: A deep dynamic memory model for predictive medicine</article-title>
          .
          <source>In: Paci c-Asia Conference on Knowledge Discovery and Data Mining</source>
          . pp.
          <volume>30</volume>
          {
          <fpage>41</fpage>
          . Springer (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Pennebaker</surname>
            ,
            <given-names>J.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mehl</surname>
            ,
            <given-names>M.R.</given-names>
          </string-name>
          , Niederho er, K.G.:
          <article-title>Psychological aspects of natural language use: Our words, our selves</article-title>
          .
          <source>Annual review of psychology 54(1)</source>
          ,
          <volume>547</volume>
          {
          <fpage>577</fpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>Y.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wei</surname>
            ,
            <given-names>L.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>Y.S.:</given-names>
          </string-name>
          <article-title>Detection of the Prodromal Phase of Bipolar Disorder from Psychological and Phonological Aspects in Social Media</article-title>
          . arXiv preprint (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Trotzek</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koitka</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Friedrich</surname>
            ,
            <given-names>C.M.</given-names>
          </string-name>
          :
          <article-title>Utilizing Neural Networks and Linguistic Metadata for Early Detection of Depression Indications in Text Sequences</article-title>
          . arXiv preprint (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Ramiandrisoa</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mothe</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Benamara</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moriceau</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          : Irit at e-risk
          <year>2018</year>
          . In: E-Risk workshop. pp.
          <volume>367</volume>
          {
          <fpage>377</fpage>
          .
          <string-name>
            <surname>Almquist</surname>
          </string-name>
          &amp; Wiksell
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Rude</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gortner</surname>
            ,
            <given-names>E.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pennebaker</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Language use of depressed and depressionvulnerable college students</article-title>
          .
          <source>Cognition &amp; Emotion</source>
          <volume>18</volume>
          (
          <issue>8</issue>
          ),
          <volume>1121</volume>
          {
          <fpage>1133</fpage>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Association</surname>
            ,
            <given-names>D..A.P.</given-names>
          </string-name>
          , et al.:
          <article-title>Diagnostic and statistical manual of mental disorders</article-title>
          . Arlington: American Psychiatric Publishing (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Karmen</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hsiung</surname>
            ,
            <given-names>R.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wetter</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Screening internet forum participants for depression symptoms by assembling and enhancing multiple nlp methods</article-title>
          .
          <source>Computer methods and programs in biomedicine 120(1)</source>
          ,
          <volume>27</volume>
          {
          <fpage>36</fpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Vinyals</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Le</surname>
            ,
            <given-names>Q.V.</given-names>
          </string-name>
          :
          <article-title>A neural conversational model</article-title>
          .
          <source>CoRR 1506</source>
          .05869 (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Monroe</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shi</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jean</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ritter</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jurafsky</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Adversarial learning for neural dialogue generation</article-title>
          .
          <source>In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP</source>
          <year>2017</year>
          , Copenhagen, Denmark, September 9-
          <issue>11</issue>
          ,
          <year>2017</year>
          . pp.
          <volume>2157</volume>
          {
          <issue>2169</issue>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Weissenborn</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wiese</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , Sei e, L.:
          <article-title>Making neural qa as simple as possible but not simpler</article-title>
          . In: CoNLL (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14. Ruckle,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Gurevych</surname>
          </string-name>
          ,
          <string-name>
            <surname>I.</surname>
          </string-name>
          :
          <article-title>Representation learning for answer selection with lstmbased importance weighting</article-title>
          .
          <source>In: IWCS</source>
          <year>2017</year>
          | 12th International Conference on Computational Semantics |
          <article-title>Short papers (</article-title>
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guo</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sinha</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Akkiraju</surname>
            ,
            <given-names>R.:</given-names>
          </string-name>
          <article-title>A new chatbot for customer service on social media</article-title>
          .
          <source>In: Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems</source>
          . pp.
          <volume>3506</volume>
          {
          <fpage>3510</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Evans</surname>
            ,
            <given-names>W.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Davidson</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sicafuse</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Someone to listen: Increasing youth helpseeking behavior through a text-based crisis line for youth</article-title>
          .
          <source>Journal of Community Psychology</source>
          <volume>41</volume>
          (
          <issue>4</issue>
          ),
          <volume>471</volume>
          {
          <fpage>487</fpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Winata</surname>
            ,
            <given-names>G.I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kampman</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dey</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fung</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Nora the empathetic psychologist</article-title>
          .
          <source>In: Proc. Interspeech</source>
          . pp.
          <volume>3437</volume>
          {
          <issue>3438</issue>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Klein</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Deng</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Senellart</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rush</surname>
            ,
            <given-names>A.M.</given-names>
          </string-name>
          : Opennmt:
          <article-title>Open-source toolkit for neural machine translation</article-title>
          .
          <source>CoRR abs/1701</source>
          .02810 (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Lison</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tiedemann</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>Opensubtitles2016: Extracting large parallel corpora from movie and tv subtitles</article-title>
          . In: Chair),
          <string-name>
            <given-names>N.C.C.</given-names>
            ,
            <surname>Choukri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Declerck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Grobelnik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Maegaard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Mariani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Moreno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Odijk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Piperidis</surname>
          </string-name>
          , S. (eds.)
          <source>Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC</source>
          <year>2016</year>
          ).
          <source>European Language Resources Association (ELRA)</source>
          , Paris, France (may
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Losada</surname>
            ,
            <given-names>D.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Crestani</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>A Test Collection for Research on Depression and Language Use CLEF 2016, Evora (Portugal)</article-title>
          .
          <source>Experimental IR Meets Multilinguality</source>
          , Multimodality, and Interaction pp.
          <volume>28</volume>
          {
          <issue>29</issue>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Pennebaker</surname>
            ,
            <given-names>J.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Booth</surname>
            ,
            <given-names>R.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Francis</surname>
            ,
            <given-names>M.E.</given-names>
          </string-name>
          :
          <article-title>Linguistic inquiry and word count: LIWC [Computer software]</article-title>
          . Erlbaum Publishers, Mahwah,NJ (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>De Choudhury</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Counts</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horvitz</surname>
          </string-name>
          , E.:
          <article-title>Social media as a measurement tool of depression in populations</article-title>
          .
          <source>Proceedings of the 5th Annual ACM Web Science Conference on - WebSci '</source>
          13 pp.
          <volume>47</volume>
          {
          <issue>56</issue>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Warriner</surname>
            ,
            <given-names>A.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kuperman</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brysbaert</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Norms of valence, arousal, and dominance for 13,915 English lemmas</article-title>
          .
          <source>Behavior Research Methods</source>
          <volume>45</volume>
          (
          <issue>4</issue>
          ),
          <volume>1191</volume>
          {
          <fpage>1207</fpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Mohammad</surname>
            ,
            <given-names>S.M.</given-names>
          </string-name>
          :
          <article-title>Word A ect Intensities</article-title>
          .
          <source>arXiv preprint</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Cambria</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Poria</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hazarika</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kwok</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>SenticNet 5: Discovering Conceptual Primitives for Sentiment Analysis by Means of Context Embeddings</article-title>
          . Aaai pp.
          <volume>1795</volume>
          {
          <year>1802</year>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Le</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Distributed representations of sentences and documents</article-title>
          .
          <source>In: International Conference on Machine Learning</source>
          . pp.
          <volume>1188</volume>
          {
          <fpage>1196</fpage>
          . Beijing China (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Joulin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grave</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bojanowski</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Bag of tricks for e cient text classi cation</article-title>
          .
          <source>arXiv preprint</source>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Pestian</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nasrallah</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Matykiewicz</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bennett</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leenaars</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Suicide Note Classi cation Using Natural Language Processing: A Content Analysis</article-title>
          .
          <source>Biomedical informatics insights 2010(3)</source>
          ,
          <volume>19</volume>
          {
          <fpage>28</fpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Pestian</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pestian</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pawel</surname>
            <given-names>Matykiewicz</given-names>
          </string-name>
          , Brett South, Ozlem Uzuner, John Hurdle:
          <article-title>Sentiment Analysis of Suicide Notes: A Shared Task</article-title>
          .
          <source>Biomedical Informatics Insights</source>
          <volume>5</volume>
          ,
          <issue>3</issue>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>De Choudhury</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kiciman</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dredze</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Coppersmith</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kumar</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Discovering shifts to suicidal ideation from mental health content in social media</article-title>
          .
          <source>In: Proceedings of the 2016 CHI conference on human factors in computing systems</source>
          . pp.
          <year>2098</year>
          {
          <article-title>2110</article-title>
          . ACM, San Jose, CA, USA (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31.
          <string-name>
            <surname>Michael</surname>
          </string-name>
          , J.: Pushshift.io, https://pushshift.io/
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          32.
          <string-name>
            <surname>Chawla</surname>
            ,
            <given-names>N.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bowyer</surname>
            ,
            <given-names>K.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hall</surname>
            ,
            <given-names>L.O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kegelmeyer</surname>
            ,
            <given-names>W.P.</given-names>
          </string-name>
          : SMOTE:
          <article-title>Synthetic minority over-sampling technique</article-title>
          .
          <source>Journal of Arti cial Intelligence Research</source>
          <volume>16</volume>
          ,
          <volume>321</volume>
          {
          <fpage>357</fpage>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          33.
          <string-name>
            <surname>King</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zeng</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Logistic Regression in Rare Events Data</article-title>
          .
          <source>Political Analysis</source>
          <volume>9</volume>
          (
          <issue>02</issue>
          ),
          <volume>137</volume>
          {
          <fpage>163</fpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          34.
          <string-name>
            <surname>Beck</surname>
            ,
            <given-names>A.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Steer</surname>
            ,
            <given-names>R.A.</given-names>
          </string-name>
          :
          <article-title>Internal consistencies of the original and revised beck depression inventory</article-title>
          .
          <source>Journal of clinical psychology 40(6)</source>
          ,
          <volume>1365</volume>
          {
          <fpage>1367</fpage>
          (
          <year>1984</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          35.
          <string-name>
            <surname>Dozois</surname>
            ,
            <given-names>D.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dobson</surname>
            ,
            <given-names>K.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ahnberg</surname>
            ,
            <given-names>J.L.:</given-names>
          </string-name>
          <article-title>A psychometric evaluation of the beck depression inventory{ii</article-title>
          .
          <source>Psychological assessment 10(2)</source>
          ,
          <volume>83</volume>
          (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>