<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Task 3 Patient-Centred Information Retrieval: Team CUNI</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Shadi</forename><surname>Saleh</surname></persName>
							<email>saleh@ufal.mff.cuni.cz</email>
						</author>
						<author>
							<persName><forename type="first">Pavel</forename><surname>Pecina</surname></persName>
							<email>pecina@ufal.mff.cuni.cz</email>
						</author>
						<author>
							<affiliation key="aff0">
								<orgName type="department" key="dep1">Faculty of Mathematics</orgName>
								<orgName type="department" key="dep2">Physics</orgName>
								<orgName type="institution">Charles University</orgName>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff1">
								<orgName type="department">Institute of Formal and Applied Linguistics</orgName>
								<address>
									<country key="CZ">Czech Republic</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Task 3 Patient-Centred Information Retrieval: Team CUNI</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">ED8B9FA2DC762907EF318BFED2B2D5F7</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T20:30+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Multilingual information retrieval</term>
					<term>Machine Translation</term>
					<term>Machine learning</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This paper describes our systems that we submitted to the 2017 CLEF eHealth information retrieval (IR) task. We submitted runs to the monolingual and multilingual tasks. In the monolingual task, we investigate the performance of two IR models: probabilistic model and a model based on language-model. In addition, we experiment query expansion based on blind relevance feedback. In the multilingual task, we submitted runs for all the languages. We employ a Statistical Machine Translation (SMT) system to translate the given queries into English and get the n-best-list. Then we use this list of translations for our baseline system by getting 1-best-list to generate queries, we also use n-bestlist reranker that was developed by us to predict 1-best-list for better IR performance. Finally, we present our method for query expansion approach based on a machine learning model that predicts a term from a translation pool to be added to the original query.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Internet searches for medical topics had been increasing recently, and have gotten the attention of information retrieval researchers. Fox <ref type="bibr" target="#b1">[2]</ref> reported that about 80% of Internet users in the United States look for medical information online. The main challenge in the medical information retrieval systems that people with different experience, express their information need in different way <ref type="bibr" target="#b11">[12]</ref>. Laypeople express their medical information need using non-medical terms, while medical experts express it using specific medical terms, thus, information retrieval systems need to be stable for such different query variations.</p><p>The significant increasing of non-English digital content on the World Wide Web has been followed by an increase in looking for this information by internet users. Grefenstette and Nioche <ref type="bibr" target="#b4">[5]</ref> presented an estimation of language size in 1996, late 1999 and early 2000 for documents captured from the internet. Their study showed that the English content has grown 800%, German 1500%, and Spanish 1800% in the same period. Further more, users started to look for information needs represented in documents which are not available in their native languages. The system that searches for information in a language different from the one of user is called Cross-Lingual (multilingual) Information Retrieval (CLIR) system. It enables users to write queries (information need) represented in a language (lang. A), and returns results from a document collection written in a different language (lang. B).</p><p>Usually, the baseline system in CLIR is to take the 1-best-list translation returned by a statistical machine translation (SMT) system and perform the retrieval as shown in the CLEF eHealth Information Retrieval tasks before <ref type="bibr" target="#b2">[3]</ref>. However, researchers recently started to investigate looking inside the box of the machine translation system rather than using it as a black box <ref type="bibr" target="#b16">[17,</ref><ref type="bibr" target="#b5">6]</ref> and showed that involving the internal components of the SMT in the retrieval process significantly improved the baseline system.</p><p>Nikoulina et al. <ref type="bibr" target="#b7">[8]</ref> presented an approach to develop Cross-lingual information retrieval (CLIR) system which is based on reranking the hypotheses given from the SMT system. Saleh and Pecina <ref type="bibr" target="#b15">[16]</ref> considered Nikoulina's work as a starting point and expanded it by adding a rich set of features for training. They presented approach covered translating queries from Czech, French and German into English and rerank the alternative translations to predict the hypothesis that gives better CLIR performance.</p><p>In this paper, we describe our participation at the 2017 CLEF eHealth Information Retrieval Task <ref type="bibr" target="#b12">[13,</ref><ref type="bibr" target="#b3">4]</ref>. In the IRTask1, participants were provided with English queries representing medical information need and were asked to provide ranked list of documents from the ClueWeb collection sorted by their relevance. While IRTask4 is a multilingual IR task, the original English queries were translated into seven languages: Czech, French, Hungarian, German, Polish, Spanish and Swedish by medical native speakers. Participants in this task were required to provide a ranked list of relevant documents from the English collection. We focus in our participation in the multilingual IR Task. We present our machine learning model which reranks the alternative translations given by the machine translation system for better IR results. We also present our new approach to expand translated queries using our machine learning model.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">System description</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Retrieval model</head><p>In our experiment we use ClueWeb12 collection indexed and released by the orgnisers of this task. The index was created using Terrier open source engine <ref type="bibr" target="#b10">[11]</ref>. We use mainly BM25 as a retrieval model. Documents in this model are ranked for a given query as shown in Equation <ref type="formula" target="#formula_0">1</ref>. k 1 and k 3 are tuning parameters, and we leave these parameters as their default values in Terrier. While tf d is the normalised term frequency in document d, normalised by Equation <ref type="formula">2</ref>. dl and avg dl are document length and the average of document length in the collection respectively. b is a free parameter, we tune this parameter using the 2016 CLEF eHealth IR monolingual queries and the provided assessment information, then we set this parameter to 0.6.</p><formula xml:id="formula_0">RSV (d, q) = t∈ d q (k 1 + 1)tf d K + tf d * (k 3 + 1) * tf q k 3 + tf q * idf (t)<label>(1)</label></formula><formula xml:id="formula_1">tf d = tf (1 + b) + b * dl avg dl (2)</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Translation System</head><p>We employ Khresmoi statistical machine translation (SMT) system <ref type="bibr" target="#b0">[1]</ref>, for language pairs: Czech-English, French-English, German-English, Hungarian-English, Polish-English, Spanish-English and Swedish-English, to translate the queries into English. Khresmoi SMT system was trained to translate queries, where most general SMT systems fail, and tuned on parallel and monolingual data taken from the medical domain resources like Wikipedia, UMLS concept descriptions and UMLS metathesaurus. Such domain specific data made Khresmoi perform well when translating sentences in the medical domain like the queries in our case. Generally, feature weights in SMT systems are tuned toward BLEU <ref type="bibr" target="#b13">[14]</ref> , a method for automatic evaluation of SMT systems correlates with human judgments. It is not necessary to have correlation between the quality of general SMT system and the quality of CLIR performance <ref type="bibr" target="#b14">[15]</ref>; therefore Khresmoi SMT system was tuned using MERT <ref type="bibr" target="#b9">[10]</ref> towards PER (position-independent word error rate) because it does not penalise word reorder; which is not important for the performance of IR systems.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Hypothesis reranking</head><p>For each input sentence, Khresmoi SMT system returns a list of alternative translations in the target language, we refer to this list as an n-best-list. Saleh and Pecina <ref type="bibr" target="#b15">[16]</ref> presented an approach to rerank an n-best-list and predict a translation that gives the best retrieval performance in terms of P@10. The reranker is a generalized linear regression model that uses a set of features which can be divided according to their sources into: 1) The SMT system: This includes features that are derived from the verbose output of the Khresmoi SMT system (e.g. phrase translation model, the target language model, the reordering model and word penalty). 2) Document collection: The collection is employed to derive features like IDF scores and features that are based on the blindrelevance feedback approach. 2) External resources: Resources like Wikipedia articles, document collection and UMLS metathesaurus are employed to create a rich set of features for each query hypothesis. 3) Retrieval status value: This feature is used to involve the retrieval model in the reranking. It is based on how the Dirichlet model scores the retrieved documents for a given query. This approach is similar to the work of Nottelman et al. <ref type="bibr" target="#b8">[9]</ref>, where they investigated the correlation between the RSV and the probability of relevance.</p><p>To train the model, we used queries and assessment information from the 2016 CLEF eHealth IR task.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Query expansion</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1">Blind relevance feedback</head><p>Query expansion is defined as the procedure of reformulating a user's query for better retrieval efficiency. Blind Relevance Feedback (BRF), also known as Pseudo Relevance Feedback, is the process of automatically expand user's query. It considers the top k documents as relevant to the original query, and then expands the query with terms from these documents. However, the assumption of considering these documents as relevant is risky, because they might not be relevant, thus resulting the original query to be drifted way from its information need. The top k documents are chosen from an initial retrieval that is done using the original query. From these documents we create bag-of-words (BOW) and then we choose from this BOW m terms to be added to the original query. These terms are chosen based on their inverse document frequency from the collection and their frequencies in this BOW. Both k and m need to be tuned based on the used collection and using test queries and assessment information. We use Terrier implementation of BRF and tune k and m using the 2016 CLEF eHealth IR task queries and their assessment information and then based on the results we set k = 3 and m = 10.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2">Term reranking</head><p>In this experiment, we present our approach for query expansion in the multilingual task. When we translate the query into English using SMT system, we get n-best-list translations. These translations contain different synonyms in the target language for a give term in the source language. The motivation of this experiment is that using more than one of these synonyms, and expanding the original query, could lead to improved retrieval. One of the feature we use in this model is based on the word2vec open source tool developed by <ref type="bibr" target="#b6">[7]</ref>. They presented two models: Continuous Bag-of-Words Model (CBOW) and Continuous Skip-gram model. These models showed very powerful ability to measure the similarity between words in the collection. We used for our experiment trained model of word2vec on 25 millions articles from PubMed using their titles and abstracts, the model available online 1 . To investigate the hypothesis of expanding queries from the translation pool, we use the queries that were provided in CLEF eHealth IR task 2013-2015 by translating them into English and then: 1-) Get 20-best-list translations for each query. 2-) Create a translation pool as bagof-words from these translations. 3-) Then we use 1-best-list translation as an original query, and expand it with one term from the translation pool. 4-) Then we run the retrieval using our baseline setting using the expanded queries. After evaluating the results and collecting the expanded queries that give maximum P @10 among all the other expanded queries, we find that the results from expanded queries outperform significantly the results when using only the original queries. To expand the original query with a term from the translation pool, we build regression model that predicts the change of P @10 when a term is added to the original query. In order to train the model we present set of features for each term as follows:</p><p>-IDF: Inverse document frequency of that term from the indexed collection.</p><p>-RSV: First we conduct retrieval using the original query and then we take the RSV of the document that is ranked firstly using our baseline setting, then we add a term to that original query, and conduct the retrieval again, then the feature value is the difference of these two RSVs. -Similarity: First we use word2vec to get word embeddings for each term in the original query and we sum these embeddings to get vector that represents the entire query. Then we take the embeddings for the candidate term and we calculate the cosine similarity between the query vector and the term vector.</p><p>The model is built to predict a term that will give the highest P @10 when it is added to the original query, and trained on test queries that are taken from CLEF eHealth IR task 2013-2015.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Experiments</head><p>This year we submit runs to the Ad-Hoc task in its monolingual and multilingual subtask.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.1">Monolingual Ad-Hoc search</head><p>Run1 This run uses Terrier implementation of BM25 IR model, with normalisation parameter b tuned and set to 0.6.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Run2</head><p>For comparison with BM25 model ( a probabilistic IR model), we submit this run based on Terrier implementation of Dirichlet Bayesian smoothed model (language-model based IR model).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Run3</head><p>In this run, we use Terrier implementation of Blind relevance feedback (Bo1) where k is set to 3 documents and m is set to 10 terms.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.2">Multilingual task</head><p>Run1 In this run, we translate the query variant into English using Khresmoi SMT then we take only the 1-best-list to generate the topics, then we perform the retrieval using BM25 model.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Run2</head><p>First we translate the query into English and take the 15-best-list translations, then the reranker with all features predicts the translation that gives the highest P@10, the predicted translations are used next to generate the topics and perform the retrieval using BM25 model.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Run3</head><p>First we use 1-best-list to generate queries then we add to each query one term from the translation pool as described in Section 5.2.</p><p>Run4 This run uses 1-best-list English translations to generate queries, then we conduct the retrieval after doing query expansion using Terrier implementation of BRF approach.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7">Conclusion and future work</head><p>In this paper we presented our participation in the CLEF eHealth 2017 Task3 Patient-Centred Information Retrieval as the team of Charles university. We submitted runs into the Ad-hoc task including its monolingual and multilingual subtasks. For the monolingual task, we investigated the performance when using probabilistic IR model (BM25) and language-model based IR model, also we submitted run based on BRF approach. We tuned all the parameters for these models using queries and assessment information from the 2016 CLEF eHealth IR task. While for the multilingual task, we employ an SMT system to translate the queries into English and use 1-best-list to generate queries for our baseline system. We also used our reranker to predict new 1-best-list for better IR performance. We presented new approach to expand queries with a term from the translation pool using machine learning model.</p></div>			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">https://www.ncbi.nlm.nih.gov/CBBresearch/Wilbur/IRET/DATASET/</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>This research was supported by the Czech Science Foundation (grant n. P103/12/G084) and the EU H2020 project KConnect (contract n. 644753).</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Machine translation of medical texts in the Khresmoi project</title>
		<author>
			<persName><forename type="first">O</forename><surname>Dušek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Hajič</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Hlaváčová</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Novák</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Pecina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Rosa</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Ninth Workshop on Statistical Machine Translation</title>
				<meeting>the Ninth Workshop on Statistical Machine Translation<address><addrLine>Baltimore, USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page" from="221" to="228" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title level="m" type="main">Health Topics: 80% of internet users look for health information online</title>
		<author>
			<persName><forename type="first">S</forename><surname>Fox</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2011">2011</date>
		</imprint>
		<respStmt>
			<orgName>Pew Research Center</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Tech. rep.</note>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Overview of the CLEF eHealth evaluation lab</title>
		<author>
			<persName><forename type="first">L</forename><surname>Goeuriot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Kelly</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Suominen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Hanlen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Néváol</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Grouin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Palotti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Zuccon</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The 6th Conference and Labs of the Evaluation Forum</title>
				<meeting><address><addrLine>Berlin, Germany</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2015">2015. 2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">CLEF 2017 eHealth evaluation lab overview</title>
		<author>
			<persName><forename type="first">L</forename><surname>Goeuriot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Kelly</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Suominen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Nvol</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Robert</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Kanoulas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Spijker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Palotti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Zuccon</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">CLEF 2017 -8th Conference and Labs of the Evaluation Forum</title>
		<title level="s">Lecture Notes in Computer Science (LNCS</title>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2017-09">September 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Estimation of english and non-english language use on the www</title>
		<author>
			<persName><forename type="first">G</forename><surname>Grefenstette</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Nioche</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Content-Based Multimedia Information Access</title>
				<meeting><address><addrLine>Paris, France, France</addrLine></address></meeting>
		<imprint>
			<publisher>LE CENTRE DE HAUTES ETUDES INTERNATIONALES D&apos;INFORMATIQUE DOCUMENTAIRE</publisher>
			<date type="published" when="2000">2000</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="237" to="246" />
		</imprint>
	</monogr>
	<note>RIAO &apos;00</note>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Should MT systems be used as black boxes in CLIR?</title>
		<author>
			<persName><forename type="first">W</forename><surname>Magdy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Jones</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Advances in Information Retrieval</title>
				<editor>
			<persName><forename type="first">P</forename><surname>Clough</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">C</forename><surname>Foley</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">C</forename><surname>Gurrin</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">G</forename><surname>Jones</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">W</forename><surname>Kraaij</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">H</forename><surname>Lee</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">V</forename><surname>Mudoch</surname></persName>
		</editor>
		<meeting><address><addrLine>Berlin, Germany</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2011">2011</date>
			<biblScope unit="volume">6611</biblScope>
			<biblScope unit="page" from="683" to="686" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">Efficient estimation of word representations in vector space</title>
		<author>
			<persName><forename type="first">T</forename><surname>Mikolov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Corrado</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Dean</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1301.3781</idno>
		<imprint>
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Adaptation of statistical machine translation model for cross-lingual information retrieval in a service context</title>
		<author>
			<persName><forename type="first">V</forename><surname>Nikoulina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Kovachev</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Lagos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Monz</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics</title>
				<meeting>the 13th Conference of the European Chapter of the Association for Computational Linguistics<address><addrLine>Avignon, France</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2012">2012</date>
			<biblScope unit="page" from="109" to="119" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">From retrieval status values to probabilities of relevance for advanced IR applications</title>
		<author>
			<persName><forename type="first">H</forename><surname>Nottelmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Fuhr</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Information retrieval</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="page" from="363" to="388" />
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Minimum error rate training in statistical machine translation</title>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">J</forename><surname>Och</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 41st Annual Meeting on Association for Computational Linguistics</title>
				<meeting>the 41st Annual Meeting on Association for Computational Linguistics<address><addrLine>Sapporo, Japan</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2003">2003</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="160" to="167" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Terrier: A high performance and scalable information retrieval platform</title>
		<author>
			<persName><forename type="first">I</forename><surname>Ounis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Amati</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Plachouras</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>He</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Macdonald</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Lioma</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of Workshop on Open Source Information Retrieval</title>
				<meeting>Workshop on Open Source Information Retrieval<address><addrLine>Seattle, WA, USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">How users search and what they search for in the medical domain -understanding laypeople and experts through query logs</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">R M</forename><surname>Palotti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Hanbury</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Müller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">E K</forename><surname>Jr</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Inf. Retr. Journal</title>
		<imprint>
			<biblScope unit="volume">19</biblScope>
			<biblScope unit="issue">1-2</biblScope>
			<biblScope unit="page" from="189" to="224" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">CLEF 2017 task overview: The IR Task at the eHealth evaluation lab</title>
		<author>
			<persName><forename type="first">J</forename><surname>Palotti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Zuccon</surname></persName>
		</author>
		<author>
			<persName><surname>Jimmy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Pecina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Lupu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Goeuriot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Kelly</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Hanbury</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Working Notes of Conference and Labs of the Evaluation (CLEF) Forum. CEUR-WS</title>
				<meeting><address><addrLine>Dublin, Ireland</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">BLEU: A method for automatic evaluation of machine translation</title>
		<author>
			<persName><forename type="first">K</forename><surname>Papineni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Roukos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Ward</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">J</forename><surname>Zhu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 40th annual meeting on Association for Computational Linguistics</title>
				<meeting>the 40th annual meeting on Association for Computational Linguistics<address><addrLine>Philadelphia, USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2002">2002</date>
			<biblScope unit="page" from="311" to="318" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Adaptation of machine translation for multilingual information retrieval in the medical domain</title>
		<author>
			<persName><forename type="first">P</forename><surname>Pecina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Dušek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Goeuriot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Hajič</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Hlavářová</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">J</forename><surname>Jones</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Artificial Intelligence in Medicine</title>
		<imprint>
			<biblScope unit="volume">61</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="165" to="185" />
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Reranking hypotheses of machine-translated queries for crosslingual information retrieval</title>
		<author>
			<persName><forename type="first">S</forename><surname>Saleh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Pecina</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Experimental IR Meets Multilinguality, Multimodality, and Interaction. The 7th International Conference of the CLEF Association, CLEF 2016</title>
				<meeting><address><addrLine>Évora, Portugal</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Looking inside the box: Context-sensitive translation for cross-language information retrieval</title>
		<author>
			<persName><forename type="first">F</forename><surname>Ture</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Lin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">W</forename><surname>Oard</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval</title>
				<meeting>the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval<address><addrLine>Portland, Oregon, USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2012">2012</date>
			<biblScope unit="page" from="1105" to="1106" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
