<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Identification of Depression Strength for Users of Online Platforms: A Comparison of Text Retrieval Approaches</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Ayan</forename><surname>Bandyopadhyay</surname></persName>
							<email>bandyopadhyay.ayan@gmail.com</email>
							<affiliation key="aff0">
								<orgName type="institution">Indian Statistical Institute</orgName>
								<address>
									<country key="IN">India</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Linda</forename><surname>Achilles</surname></persName>
							<email>achilles@uni-hildesheim.de</email>
							<affiliation key="aff1">
								<orgName type="institution">University of Hildesheim</orgName>
								<address>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Thomas</forename><surname>Mandl</surname></persName>
							<email>mandar@isical.ac.in</email>
							<affiliation key="aff1">
								<orgName type="institution">University of Hildesheim</orgName>
								<address>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Mandar</forename><surname>Mitra</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Indian Statistical Institute</orgName>
								<address>
									<country key="IN">India</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Sanjoy</forename><surname>Kr</surname></persName>
						</author>
						<author>
							<persName><surname>Saha</surname></persName>
							<affiliation key="aff2">
								<orgName type="institution">Jadavpur University</orgName>
								<address>
									<country key="IN">India</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Identification of Depression Strength for Users of Online Platforms: A Comparison of Text Retrieval Approaches</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">349086FD8976A5D1F11E6A107667B13A</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T18:29+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Text Classification</term>
					<term>Depression Detection</term>
					<term>Social Media</term>
					<term>Information Retrieval</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Social media became one of the most popular platform to express feelings and thoughts in the world of digital information sharing. Facebook, Snapchat, Instagram, QQ, Weibo, Twitter, Tumblr, Reddit and LinkedIn are among the most popular social networks. They are used to share, spread and create new information, receive and spread news locally, globally or privately. Many citizens share their feelings and thoughts in social media, consequently mining of emotions and psychological states from social media posts has become an active research area. In the CLEF 2019 eRisk task 3, the goal is to detect how strong a user of social media is suffering from depression. The ground truth is obtained by asking persons a set of standardised questions. This paper shows how a variety of ad-hoc retrieval approaches can be adopted to perform this task. The results do not reach a high level of accuracy, but compare to supervised classification approaches. In the discussion section, the adequacy of measures for the task is reflected.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>The classification of text documents has seen great progress in recent years. Meanwhile research is approaching complex problems like gender attribution, content reliability as well as different quality attributes of text (e.g. helpfulness <ref type="bibr" target="#b6">[7]</ref>  <ref type="bibr" target="#b26">[27]</ref> ). The advances in deep learning technologies have contributed to the expansion of classification tasks. Word embeddings as a latent model of the content of words are representations which are learned by a system during the processing. The training items are constructed typically as n-grams of words of subsequent text. Word embeddings as a representation model have often achieved very good results in recent years. One assumption behind many computation tasks in the psychological domain is that text tells a lot about the writer. Consequently, the prediction of psychological traits of people based on text has become an important research area. The base is often a collection of texts from social media due to the large amount of text that can be found and the ease of availability. Researchers have tried to predict the personality of a person based on the Big-5 model ( <ref type="bibr" target="#b4">[5]</ref>. More recently, the prediction of mental health issues has been seen as a task for classification systems. First collections have been developed for analysis (e.g. <ref type="bibr" target="#b25">[26]</ref> ). The eRisk task (Early risk prediction on the Internet) at the Conference and Labs of the Evaluation Forum (CLEF) became a venue for comparative analysis of depression detection. In 2019, eRisk moved to predicting the level of depression of persons based on their social media postings. This paper reports on heterogeneous experiments for this task and reviews some technologies for depression detection. Often, there are few data samples available due to the high level of the required confidentiality. As a consequence, we test mainly methods based on string similarity and matching techniques instead of supervised approaches.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Related Work</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Depression and depression detection</head><p>Traditionally, depression is diagnosed in a therapy in which a therapist checks whether depression symptoms appear during a period of time in the behavior of the patient or not. These symptoms are, for instance, described in the Diagnostic and Statistical Manual of Mental Disorders (DSM) <ref type="bibr" target="#b1">[2]</ref>. The current fifth edition replaces the now outdated fourth edition. Another instrument in this field is the Beck's Depression Inventory (BDI) <ref type="bibr" target="#b8">[9]</ref>. The BDI is a questionnaire consisting of 21 questions assessing the patient's mental state regarding feelings like sadness, pessimism, loss of energy and similar. The following example shows the first question of the BDI:</p><p>1. Sadness 0. I do not feel sad. 1. I feel sad much of the time.</p><p>2. I am sad all the time. 3. I am so sad or unhappy that I can't stand it.</p><p>A different questionnaire was developed by Radloff <ref type="bibr" target="#b21">[22]</ref>. It consists of 20 questions, dealing with the frequency of various symptoms of depression. This ques-tionnaire is called the CES-D Scale (Center for Epidemiologic Studies Depression Scale). This self-report depression scale has been revised in 2004 (DESD-R) <ref type="bibr" target="#b11">[12]</ref>. Instead of relying on self-report, Eichstaedt et al.</p><p>[13] used medical codes from an electronic medical report (EMR) of a patient to establish the depression diagnosis <ref type="bibr">[13]</ref>. The researchers then analysed the patients' Facebook posts that were created before the diagnosis in the EMR. Besides the textual post content, they also used the post length, the frequency of posting, the temporal posting patterns, as well as the demographic information to predict the future diagnosis of depression in the EMR. Overall, language features outperformed all other features considered. They could also show, that their approach resulted in a prediction accuracy comparable to validated self-report depression scales.</p><p>The examples above show that getting meaningful data can be a difficult and time, labor and cost consuming task, which also relates to the sensitivity of the topic. This becomes apparent in the study of Eichstaedt and colleagues, for which they asked 11,224 patients of an emergency department of a hospital of which only 1,175 agreed to participate fully in the study <ref type="bibr">[13]</ref>. However, Shen et al. made the point that the DSM, for instance, took over a decade to evolve from fourth to fifth edition and is so relatively slow in updating depression criteria, especially those that are conveyed by the behavioral patters in social media <ref type="bibr" target="#b25">[26]</ref>. Automatically analyzing the online behavior and language on social media therefore can help in early detection of mental disorders like for instance depression.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Early risk prediction on the Internet (eRisk)</head><p>The eRisk task is an evaluation lab as part of the CLEF initiative. Its main objective is to examine evaluation methodologies, effectiveness and performance metrics, as well as practical applications and the building of test collections related to early risk detection on the internet. Technologies that can detect disorders at an early stage can be applied to variety of different cases and can be especially useful in those associated with safety and health. For instance, notifications can be sent when sex offenders start interacting with children. Besides potential paedophiles other examples encompass stalkers, or persons with suicidal thoughts or those with tendencies to depression or other mental disorders <ref type="bibr" target="#b15">[16]</ref>. In 2018, two tasks were organized by the lab: 1) Early Detection of Signs of Depression and 2) Early Detection of Signs of Anorexia. The lab in 2019 organized three tasks: 1) Early Detection of Signs of Anorexia (continuation of eRisk 2018's T2 task), 2) Early Detection of Signs of Self-harm (this is a new task in 2019) and 3) Measuring the Severity or Strength of the Signs of Depression (this is a new task in 2019). The test collections for task one and two of both years have the same format as described in the overview paper <ref type="bibr" target="#b14">[15]</ref>. They consist of writings (post and comments) from social media authors. For evaluating the performance of the systems in the tasks, standard measures like F 1 , Precision and Recall have been used. They do not take the decision making time into account, so that the organizers proposed the ERDE (early risk detection error) measure <ref type="bibr" target="#b14">[15]</ref>. Early detection is rewarded, meaning the fewer posts required to detect e.g. anorexia the better the system is considered to be. The measure is parameterised to control the place in the X axis where the cost (the delay in detecting true positives) grows more quickly. ERDE 5 therefore is very demanding with decision delays, because if a system needs more than 5 writings the value for ERDE 5 quickly decreases. However, ERDE 50 is less strict with decision delays <ref type="bibr" target="#b15">[16]</ref>. The ERDE measure is in the range [0, 1] <ref type="bibr" target="#b14">[15]</ref>. In 2018, the best results for ERDE 5 were achieved by flexible temporal variation of terms (FTVT) and sequential incremental classification (SIC) <ref type="bibr" target="#b13">[14]</ref>. In case of ERDE 50 as well as F 1 word embeddings and linguistic metadata led to the best results <ref type="bibr" target="#b27">[28]</ref>. The highest precision was achieved by using effective machine learning algorithms (a bag of words model has been used to perform ada boost, random forest, logistic regression and support vector machine classifiers) <ref type="bibr" target="#b19">[20]</ref>. Fidel and colleagues obtained the highest recall by applying two independent models (one trained to predict depression cases, the other one to predict non-depression cases) with two variants: Duplex Model Chunk Dependent (DMCD) and Duplex Model Writing Dependent (DMWD) <ref type="bibr" target="#b9">[10]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Measuring the Severity or Strength of the Signs of Depression (eRisk 2019 task 3)</head><p>The third task in eRisk 2019 is an exploratory new task in eRisk. Participants of the challenge have to build an algorithm that estimates the level of depression of a user based on a history of postings. Depending on these, the participants of the eRisk lab have to fill in the questionnaire BDI for each user. This means that the task consists of predicting how a user would fill in the questionnaire given her or his texts <ref type="bibr" target="#b16">[17]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Data Set</head><p>The data set consists of BDI questionnaires that were filled in by social media users along with each user's history of writings. After submitting the BDI, the user's writings were extracted right after. These original questionnaires are the ground truth data for task 3 and were used to evaluate the performance of the lab participants' systems. The participants were given a data set of 20 social media authors' writing history. They were then asked to develop an algorithm that produces the following structure: username1 answer1 answer2 .... answer21 username2 .... ....</p><p>Each line identifies the author and the estimated answers to the questions in the BDI. The ground truth data has the same format <ref type="bibr" target="#b16">[17]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Evaluation Measures</head><p>The task employs a variety of evaluation metrics to measure the success of algorithms. Losada et al. <ref type="bibr" target="#b16">[17]</ref> define them as follows:</p><p>Hit Rate (HR) HR determines how often the prediction was correct, compared to the real questionnaire and gives the ratio. For instance, a prediction where 5 of the 21 questions of the BDI for correct get an HR value of 5/21.</p><p>Average Hit Rate (AHR) AHR is HR, but averaged over all users.</p><p>Closeness Rate (CR) CR considers the ordinal scale underlying the questions in the BDI. For each question an absolute difference (ad) between the actual answer and the predicted one. A system that is farther away from the answer than a second system should be penalized for this greater distance. For that the measure is build like this:</p><formula xml:id="formula_0">CR = (mad − ad) mad<label>(1)</label></formula><p>Here, mad stands for the maximum absolute difference (number of possible answers minus one).</p><p>Average Closeness Rate (ACR) ACR is CR, but averaged over all users. However, the questions #16 and #18 have seven possibilities to answer, where for answers 1 to 3 two possible options (a and b) are available. However, those options were considered equal, since they represent the same level of depression.</p><p>Difference between overall depression levels (DODL) This measure does not take into account the system's correct predictions on question-level, but gives the overall depression level based on the sum of all answers for the real and system generated BDI. Furthermore, the absolute difference (ad overall) between the real and the predicted depression score is calculated.</p><p>A depression level is an integer between 0 and 63. These numbers are derived from adding the numbers of the answers from the BDI. For example, considering question #1 (see section 2.1), if the answer was option 1, the depression level integer is raised by 1. This way, the following four categories are associated with the respective depression levels:</p><p>1. Minimal depression (depression levels 0-9) 2. Mild depression (depression levels 10-18)</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Moderate depression (depression levels 19-29) 4. Severe depression (depression levels 30-63)</head><p>These levels are widely accepted in the psychological literature <ref type="bibr" target="#b7">[8]</ref>.</p><p>The DODL measure is finally normalized into [0, 1] in the following way:</p><formula xml:id="formula_1">DODL = (63 − ad overall) 63<label>(2)</label></formula><p>Average DODL (ADODL) ADODL is DODL, but averaged over all users.</p><p>Depression Category Hit Rate (DCHR) DCHR computes the fraction of cases, in which the system generated BDI led to the same depression category obtained from the real author's questionnaire.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Processing Approaches</head><p>We experimented with several heterogeneous ad-hoc information retrieval approaches for depression prediction. That way, a variety of parameter settings can be explored. An important research question is, whether such processing without additional resources can compete with deep learning approaches for a domain with relatively little text volume.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1">Ad-hoc Retrieval Approaches</head><p>We considered the posts given for each user and the BDI as a document corpus and as traditional ad-hoc information retrieval queries. Each answer of a BDI question is treated as a query. Each set of user posts is treated as a document collection and indexed. This allows to retrieve (compute a query document similarity score) documents and produce the result as quickly as possible. The main concept behind our approach is as follows: The post "p i " (i = 1, 2..k, k is total number of posts by user "u") of an user "u" which is returned with the maximum similarity value for a BDI answer with number 1.j (j=0,1,2,3 here. See example query number 1) from a question set "1" determines the answer. For the user "u", "j" is the result of query set 1. In the example, question number 1 is concerned with the concept "sadness", so for user "u" j is the "sadness" label predicted. This approach allows the use of information retrieval technology for the task. It also enables a completely unsupervised approach which does not require additional resources. Due to the nature of text on social media microblogs, it seems unclear whether stop word removal and stemming as traditional pre-processing methods are beneficial for the task. Consequently, we conducted experiments with and without both techniques. documents by stemming and stop word removal, and no stemming and no stop word removal</p><p>The following experiments with different retrieval models and parameter settings were carried out with Lucene as the basic search engine:</p><p>-TF-IDF -BM25 <ref type="bibr" target="#b23">[24]</ref> </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2">Deep Representations for Matching</head><p>Recently, deep representations based on word embedding have received much attention, in particular for supervised learning. Based on our approach described above, further experiments were done with word embedding representations. For that, we used the word2vec pre-trained model <ref type="bibr" target="#b18">[19]</ref>[18] and represented a document as a vector using Equation-3. In this case, − → d is the document vector of document d, − → w id is the vector for the i th word (or term) from document d.</p><p>Equation-4, describes how query and document similarities were calculated. This method was used by Bandyopadhyay et al. <ref type="bibr" target="#b5">[6]</ref> in a retrieval approach for tweet classification during natural disasters. In Equation-4 − → q is the query vector of a query q. CosSim(</p><formula xml:id="formula_2">− → q , − → d ) is cosine similarity of − → q , − → d . − → d = |W d | i=1 − → w id<label>(3)</label></formula><formula xml:id="formula_3">Sim(q, d) = CosSim( − → q , − → d ) = − → q • − → d || − → q || • || − → d ||<label>(4)</label></formula><p>We used Google's pre-trained word2vec vectors <ref type="bibr" target="#b0">[1]</ref> and the GloVe pre-trained <ref type="bibr" target="#b20">[21]</ref>( Table <ref type="table" target="#tab_0">1</ref>) word vectors to compute our document vectors using Equation-3 formula. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Results</head><p>This section shows the results of our experiments and compares them to the outcomes of the submitted runs for the task at CLEF eRisk. The experiment LM-d λ = 1.0 returns the best value for the measures ACR. BM25 k1 = 1.2, b = 0.75 is best for ACR and TF-IDF for DCHR. The language model was used for experiments with query expansion (QE). In Table-2 query expansion results are given. In Table-2 "D"= number of top docs used in QE. "T"= number of top terms used in QE and "RM3" <ref type="bibr" target="#b2">[3]</ref> = value of qmix used in RM3 QE.  </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Discussion</head><p>The results of our experiments are not far behind the supervised approaches submitted at CLEF. This shows that straightforward approaches using only IR technologies currently perform almost as good as advanced algorithms.</p><p>The measure DODL and ADODL need to be interpreted with care. They are a very useful measure as they consider the depression level of one user overall. However, it can even out bad results from individual questions. An approach to trick ADODL would give results in the middle of the answer range. In this case, ADODL would be 50 per cent for an even distribution. Consider that this would be better then all submitted experiments which have higher (worse) values. For an uneven or highly skewed distribution, even better (lower) values could be obtained by appropriate guessing. In a realistic scenario, such a classification would probably need to find out the few cases with depression from many users. In such a case, the set of individual with and without depression are likely to be highly imbalanced. This needs to be taken into consideration when developing classifiers for realistic scenarios.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7">Conclusion</head><p>Traditional IR methods including query expansion do not perform best for the eRisk depression severity detection. However, the performance is not much worse when compared to the submitted runs.</p><p>In order to improve performance, we need to further analyze why IR methods are not doing well. One of the reason might be the BDI question length. Average question length is 8.45 (in words) when no stemming is used or no stop words are removed. When we remove stop words and stems (porter) BDI query, the average query length becomes 3.57 (in words).</p><p>There are many directions for future research. It is necessary to obtain on the one hand a better understanding of the models for professionals in the field and reach some sort of transparency for them. The type transparency and how it can be reached is a new research area. Maybe the performance of different sub-classes of depression can be a first step towards that goal.</p><p>On the other hand, experts need to be able to feed their expertise into the systems and improve their performance. The society overall needs to find ethical ways to handle such technology. It seems important that citizens are more aware of the information they are providing to readers by writing online text which can be analyzed easily. Basically, they might reveal much about their psychological traits without being aware of it. One important tool would be a classifier available to everyone, such that citizens can test the predictions gained from their texts. This gives users back some of their informational autonomy.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head></head><label></label><figDesc><ref type="bibr" target="#b24">[25]</ref> [23] ( 3 ISIKol-bm25-1.2-0.75-5000-Dtac-Qtac ): BM25 model with parameter settings as follows: k1 = 1.2 and b = 0.75 -Language Model -Divergence from Randomness with second normalization model (DFR) [4] -LM-dir ( 3 ISIKol-lm-d-1.0-5000-Dtac-Qtac): Language model with Dirichlet prior smoothing with µ = 1.0. -Multi-Similarity ( 3 ISIKolmultiSimilarity-5000-Dtac-Qtac): This experiment represents a fusion appraoch with the combined sum of a Language model with Dirichlet prior smoothing (LM-d) with µ = 1.0, Language model with Jelinek-Mercer Smoothing (LM-jm) with λ = 0.5, DFR with second normalization model (DFR) [4] and a BM25 model with k1 = 1.2 and b = 0.75.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 .</head><label>1</label><figDesc>Results for the experiments with stemming and without stemming as well as with stop word removal and without stop word removal.</figDesc><table><row><cell>Results</cell><cell>no stemming, no stop word removal</cell><cell>stemming, stop word removal</cell></row><row><cell></cell><cell>AHR ACR ADODL DCHR AHR</cell><cell>ACR ADODL DCHR</cell></row><row><cell>BM25</cell><cell></cell><cell></cell></row><row><cell>k1 = 1.2,</cell><cell cols="2">29.29% 59.37% 73.02% 25.0% 32.38% 60.00% 72.38% 20.0%</cell></row><row><cell>b = 0.75</cell><cell></cell><cell></cell></row><row><cell>TF-IDF</cell><cell cols="2">32.14% 63.10% 74.59% 40.0% 30.48% 59.76% 71.98% 20.0%</cell></row><row><cell cols="3">DFR-I(n)-L-2 30.48% 60.16% 73.65% 25.0% 32.10% 60.95% 73.02% 25.0%</cell></row><row><cell>LM-d λ = 1.0</cell><cell cols="2">29.52% 61.67% 75.48% 25.0% 31.67% 61.03% 74.68% 25.0%</cell></row><row><cell>LM-jm µ = 0.5</cell><cell cols="2">28.33% 61.11% 74.92% 10.0% 31.43% 60.95% 74.44% 20.0%</cell></row><row><cell>Multi Similarity</cell><cell cols="2">30.71% 61.75% 74.71% 25.0% 31.14% 60.35% 73.84% 25.0%</cell></row><row><cell>Google</cell><cell cols="2">23.10% 55.00% 77.22% 05.00% 19.29% 62.38% 70.79% 40.00%</cell></row><row><cell>GloVe</cell><cell cols="2">25.71% 59.76% 80.24% 30.00% 20.16% 61.35% 76.41% 30.00%</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 .</head><label>2</label><figDesc>Experiments with RM3 query expansion based on the baseline LM-d model.</figDesc><table><row><cell>Results D T RM3</cell><cell>AHR</cell><cell cols="3">ACR ADODL DCHR</cell></row><row><cell>10 10 0.5</cell><cell cols="4">30.48% 60.87% 71.19% 30.0%</cell></row><row><cell>20 10 0.3</cell><cell cols="4">30.00% 60.71% 70.87% 20.0%</cell></row><row><cell>20 10 0.7</cell><cell cols="4">31.43% 61.59% 72.38% 20.0%</cell></row><row><cell>20 10 0.9</cell><cell cols="4">31.90% 61.51% 74.21% 30.0%</cell></row><row><cell>20 15 0.9</cell><cell cols="4">31.90% 61.90% 74.76% 35.0%</cell></row><row><cell>20 20 0.9</cell><cell>32.38% +7.9%</cell><cell>61.98% +6.9%</cell><cell>74.68% +0.7%</cell><cell>35.0% +40.0%</cell></row><row><cell>30 10 0.9</cell><cell cols="4">31.90% 61.51% 74.21% 30.0%</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3 .</head><label>3</label><figDesc>Results of participants in the submitted runs for the task.</figDesc><table><row><cell>Run</cell><cell>AHR</cell><cell>ACR ADODL DCHR</cell></row><row><cell>BioInfo@UAVR</cell><cell cols="2">34.05% 66.43% 77.70% 25.00%</cell></row><row><cell>BiTeM</cell><cell cols="2">32.14% 62.62% 72.62% 25.00%</cell></row><row><cell>CAMHGPTnearestunsupervised</cell><cell cols="2">23.81% 57.06% 81.03% 45.00%</cell></row><row><cell>CAMHGPTsupervised.181features.58hr</cell><cell cols="2">35.47% 68.33% 75.63% 20.00%</cell></row><row><cell>CAMHGPTsupervised.769features.55hr</cell><cell cols="2">36.43% 67.22% 72.30% 20.00%</cell></row><row><cell>CAMHGPTsupervised.949features.75hr</cell><cell cols="2">36.91% 69.13% 75.63% 15.00%</cell></row><row><cell>CAMHLIWCsupervisedSVM</cell><cell cols="2">35.95% 66.59% 75.48% 25.00%</cell></row><row><cell>Fazl</cell><cell cols="2">22.38% 56.27% 72.78% 5.00%</cell></row><row><cell>Illinois</cell><cell cols="2">22.62% 56.19% 66.35% 40.00%</cell></row><row><cell cols="3">ISIKolmultiSimilarity-5000-Dtac-Qtac 29.76% 57.94% 74.13% 25.00%</cell></row><row><cell cols="3">ISIKol-bm25-1.2-0.75-5000-Dtac-Qtac 29.76% 57.06% 72.78% 25.00%</cell></row><row><cell>ISIKol-lm-d-1.0-5000-Dtac-Qtac</cell><cell cols="2">30.00% 57.94% 73.02% 15.00%</cell></row><row><cell>Kimberly</cell><cell cols="2">38.33% 64.44% 66.19% 20.00%</cell></row><row><cell>UNSLA</cell><cell cols="2">37.38% 67.94% 72.86% 30.00%</cell></row><row><cell>UNSLB</cell><cell cols="2">36.93% 70.16% 76.83% 30.00%</cell></row><row><cell>UNSLC</cell><cell cols="2">41.43% 69.13% 78.02% 40.00%</cell></row><row><cell>UNSLD</cell><cell cols="2">38.10% 67.22% 78.02% 30.00%</cell></row><row><cell>UNSLE</cell><cell cols="2">40.71% 71.27% 80.48% 35.00%</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table - 2</head><label>-</label><figDesc>shows, that these experiments show slightly better results.</figDesc><table /></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="8">Acknowledgements</head><p>This work was carried out during a stay of the first author at the University of Hildesheim in Germany. The work was partially sponsored by the federal state Niedersachsen and the Institute of Information Science and Language Technology (IWIST) at the University of Hildesheim.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<ptr target="https://code.google.com/archive/p/word2vec/" />
		<title level="m">Google pre-trained word vector</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title level="m">American Psychiatric Association: Diagnostic and statistical manual of mental disorders : DSM-5</title>
				<meeting><address><addrLine>Arlington, VA</addrLine></address></meeting>
		<imprint>
			<publisher>American Psychiatric Association</publisher>
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
	<note>5th ed. edn.</note>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Umass at trec 2004: Novelty and hard</title>
		<author>
			<persName><forename type="first">N</forename><surname>Abdul-Jaleel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Allan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">B</forename><surname>Croft</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Diaz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Larkey</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">D</forename><surname>Smucker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Wade</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of TREC</title>
				<meeting>TREC</meeting>
		<imprint>
			<date type="published" when="2004">2004</date>
			<biblScope unit="volume">13</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Probabilistic models of information retrieval based on measuring the divergence from randomness</title>
		<author>
			<persName><forename type="first">G</forename><surname>Amati</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">J</forename><surname>Van Rijsbergen</surname></persName>
		</author>
		<idno type="DOI">10.1145/582415.582416</idno>
		<ptr target="http://doi.acm.org/10.1145/582415.582416" />
	</analytic>
	<monogr>
		<title level="j">ACM Trans. Inf. Syst</title>
		<imprint>
			<biblScope unit="volume">20</biblScope>
			<biblScope unit="page" from="357" to="389" />
			<date type="published" when="2002-10">October 2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">Predicting big five personality traits of microblog users</title>
		<author>
			<persName><forename type="first">S</forename><surname>Bai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Hao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Yuan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Gao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Zhu</surname></persName>
		</author>
		<idno type="DOI">10.1109/WI-IAT.2013.70</idno>
		<ptr target="https://doi.org/10.1109/WI-IAT.2013.70" />
		<imprint>
			<date type="published" when="2013">11. 2013</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="501" to="508" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">An Embedding Based IR Model for Disaster Situations</title>
		<author>
			<persName><forename type="first">A</forename><surname>Bandyopadhyay</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Ganguly</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Mitra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">K</forename><surname>Saha</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">J</forename><surname>Jones</surname></persName>
		</author>
		<idno type="DOI">10.1007/s10796-018-9847-6</idno>
		<ptr target="https://doi.org/10.1007/s10796-018-9847-6" />
	</analytic>
	<monogr>
		<title level="j">Information Systems Frontiers</title>
		<imprint>
			<biblScope unit="volume">20</biblScope>
			<biblScope unit="issue">5</biblScope>
			<biblScope unit="page" from="925" to="932" />
			<date type="published" when="2018-10">October 2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Overview of the FIRE 2018 track: Information retrieval from microblogs during disasters</title>
		<author>
			<persName><forename type="first">M</forename><surname>Basu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Ghosh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Ghosh</surname></persName>
		</author>
		<idno type="DOI">10.1145/3293339.3293340</idno>
		<ptr target="https://doi.org/10.1145/3293339.3293340" />
	</analytic>
	<monogr>
		<title level="j">irmidis</title>
		<imprint>
			<biblScope unit="page" from="1" to="5" />
			<date type="published" when="2018">12 2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Psychometric properties of the beck depression inventory: Twenty-five years of evaluation</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">T</forename><surname>Beck</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">A</forename><surname>Steer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">G</forename><surname>Carbin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Clinical psychology review</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="77" to="100" />
			<date type="published" when="1988">1988</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">An inventory for measuring depression</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">T</forename><surname>Beck</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">H</forename><surname>Ward</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Mendelson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Mock</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Erbaugh</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Archives of general psychiatry</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="561" to="571" />
			<date type="published" when="1961">1961</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<title level="m" type="main">Analysis and experiments on early detection of depression</title>
		<author>
			<persName><forename type="first">F</forename><surname>Cacheda</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Fernandez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">J</forename><surname>Novoa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Carneiro</surname></persName>
		</author>
		<ptr target="http://ceur-ws.org/Vol-2125/paper69.pdf" />
		<editor>Cappellato et al.</editor>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="page" from="10" to="21" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Working Notes of CLEF 2018 -Conference and Labs of the Evaluation Forum</title>
		<ptr target="http://ceur-ws.org/Vol-2125/" />
	</analytic>
	<monogr>
		<title level="m">CEUR Workshop Proceedings</title>
				<editor>
			<persName><forename type="first">L</forename><surname>Cappellato</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">N</forename><surname>Ferro</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><forename type="middle">Y</forename><surname>Nie</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><surname>Soulier</surname></persName>
		</editor>
		<meeting><address><addrLine>Avignon</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2018">2125. 2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">CESD-R)</title>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">W</forename><surname>Eaton</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Smith</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Ybarra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Muntaner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Tien</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Center for Epidemiologic Studies Depression Scale: review and revision</title>
				<imprint>
			<publisher>CESD</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Facebook language predicts depression in medical records</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">C</forename><surname>Eichstaedt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">J</forename><surname>Smith</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">M</forename><surname>Merchant</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">H</forename><surname>Ungar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Crutchley</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Preot ¸iuc-Pietro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">A</forename><surname>Asch</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">A</forename><surname>Schwartz</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Proceedings of the National Academy of Sciences</title>
		<imprint>
			<biblScope unit="volume">115</biblScope>
			<biblScope unit="issue">44</biblScope>
			<biblScope unit="page" from="11203" to="11208" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<title level="m" type="main">UNSLs participation at eRisk 2018 Lab</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">G</forename><surname>Funez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J G</forename><surname>Ucelay</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">P</forename><surname>Villegas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">G</forename><surname>Burdisso</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">C</forename><surname>Cagnina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Montes-Y Gómez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">L</forename><surname>Errecalde</surname></persName>
		</author>
		<ptr target="http://ceur-ws.org/Vol-2125/paper137.pdf" />
		<editor>Cappellato et al</editor>
		<imprint>
			<biblScope unit="volume">11</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">A test collection for research on depression and language use</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">E</forename><surname>Losada</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Crestani</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference of the Cross-Language Evaluation Forum for European Languages</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="28" to="39" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">E</forename><surname>Losada</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Crestani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Parapar</surname></persName>
		</author>
		<ptr target="http://ceur-ws.org/Vol-2125/invitedpaper1.pdf" />
		<title level="m">Overview of erisk 2018: Early risk prediction on the internet (extended lab overview</title>
				<editor>
			<persName><surname>Cappellato</surname></persName>
		</editor>
		<imprint>
			<biblScope unit="volume">11</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Overview of eRisk 2019: Early Risk Prediction on the Internet</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">E</forename><surname>Losada</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Crestani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Parapar</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Experimental IR Meets Multilinguality, Multimodality, and Interaction. 10th International Conference of the CLEF Association, CLEF 2019</title>
				<meeting><address><addrLine>Lugano, Switzerland</addrLine></address></meeting>
		<imprint>
			<publisher>Springer International Publishing</publisher>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Linguistic Regularities in Continuous Space Word Representations</title>
		<author>
			<persName><forename type="first">T</forename><surname>Mikolov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Yih</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Zweig</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">NAACL HLT 2013</title>
				<imprint>
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Distributed representations of words and phrases and their compositionality</title>
		<author>
			<persName><forename type="first">T</forename><surname>Mikolov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Sutskever</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">S</forename><surname>Corrado</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Dean</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. NIPS &apos;13</title>
				<meeting>NIPS &apos;13</meeting>
		<imprint>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="3111" to="3119" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<title level="m" type="main">Early detection of signs of anorexia and depression over social media using effective machine learning frameworks</title>
		<author>
			<persName><forename type="first">S</forename><surname>Paul</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">S</forename><surname>Kalyani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Basu</surname></persName>
		</author>
		<ptr target="http://ceur-ws.org/Vol-2125/paper182.pdf" />
		<editor>Cappellato et al.</editor>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="page" from="10" to="21" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Glove: Global vectors for word representation</title>
		<author>
			<persName><forename type="first">J</forename><surname>Pennington</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Socher</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">D</forename><surname>Manning</surname></persName>
		</author>
		<ptr target="http://www.aclweb.org/anthology/D14-1162" />
	</analytic>
	<monogr>
		<title level="m">Empirical Methods in Natural Language Processing (EMNLP)</title>
				<imprint>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page" from="1532" to="1543" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">The CES-D scale: A self-report depression scale for research in the general population</title>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">S</forename><surname>Radloff</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Applied psychological measurement</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="385" to="401" />
			<date type="published" when="1977">1977</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Some simple effective approximations to the 2-poisson model for probabilistic weighted retrieval</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">E</forename><surname>Robertson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Walker</surname></persName>
		</author>
		<ptr target="http://dl.acm.org/citation.cfm?id=188490.188561" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval</title>
				<meeting>the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval<address><addrLine>New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Springer-Verlag New York, Inc</publisher>
			<date type="published" when="1994">1994</date>
			<biblScope unit="page" from="232" to="241" />
		</imprint>
	</monogr>
	<note>SIGIR &apos;94</note>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">The probabilistic relevance framework: Bm25 and beyond</title>
		<author>
			<persName><forename type="first">S</forename><surname>Robertson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Zaragoza</surname></persName>
		</author>
		<idno type="DOI">10.1561/1500000019</idno>
		<ptr target="http://dx.doi.org/10.1561/1500000019" />
	</analytic>
	<monogr>
		<title level="j">Found. Trends Inf. Retr</title>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="333" to="389" />
			<date type="published" when="2009-04">Apr 2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<monogr>
		<title level="m" type="main">Simple bm25 extension to multiple weighted fields</title>
		<author>
			<persName><forename type="first">S</forename><surname>Robertson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Zaragoza</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Taylor</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Depression detection via harvesting social media: A multimodal dictionary learning solution</title>
		<author>
			<persName><forename type="first">G</forename><surname>Shen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Jia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Nie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Feng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Hu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">S</forename><surname>Chua</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Zhu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IJCAI</title>
				<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="3838" to="3844" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<monogr>
		<title level="m" type="main">Identifying unclear questions in community question answering websites</title>
		<author>
			<persName><forename type="first">J</forename><surname>Trienes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Balog</surname></persName>
		</author>
		<idno>CoRR abs/1901.06168</idno>
		<ptr target="http://arxiv.org/abs/1901.06168" />
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<monogr>
		<title level="m" type="main">Word Embeddings and Linguistic Metadata at the CLEF 2018 Tasks for Early Detection of Depression and Anorexia</title>
		<author>
			<persName><forename type="first">M</forename><surname>Trotzek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Koitka</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">M</forename><surname>Friedrich</surname></persName>
		</author>
		<ptr target="http://ceur-ws.org/Vol-2125/paper68.pdf" />
		<editor>Cappellato et al</editor>
		<imprint>
			<biblScope unit="volume">11</biblScope>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
