<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Using Discharge Summaries to Improve Information Retrieval in Clinical Domain</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Dongqing Zhu</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Wu Stephen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Masanz James</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ben Carterette</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hongfang Liu</string-name>
          <email>liu.hongfangg@mayo.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Mayo Clinic</institution>
          ,
          <addr-line>200 First Street SW, Rochester, MN 55905</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Delaware</institution>
          ,
          <addr-line>101 Smith Hall, Newark, DE 19716</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Task 3 of the 2013 ShARe/CLEF eHealth Evaluation Lab simulated web searches for health information by patients. The web searches were designed to be connected to hospital discharge summaries from the patient's Electronic Medical Record (EMR), thus effectually modeling a post-visit information need. We primarily investigated three research questions about the retrieval of medical information from the web: 1) to what degree retrieval techniques effective in searching Electronic Medical Records (EMRs) could aid in finding medical web documents; 2) to what degree medical web retrieval would benefit from natural language processing (NLP) techniques that extract information from text based on medical knowledge; and 3) how to leverage contextual information in the patient's discharge summaries to improve retrieval. We submitted seven runs to ShARe/CLEF eHealth. Our best run used effective EMR-based IR techniques, NLP-produced information, and information in patients' discharge summaries to achieve precision at 10 (P@10) scores at or above the CLEF median for all but 2 of 50 test queries.</p>
      </abstract>
      <kwd-group>
        <kwd>language models</kwd>
        <kwd>mixture of relevance models</kwd>
        <kwd>Markov Random Field</kwd>
        <kwd>semantic concepts</kwd>
        <kwd>UMLS Metathesaurus</kwd>
        <kwd>MeSH</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Task 3 of the 2013 ShARe/CLEF eHealth Evaluation Lab [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] simulated web
searches for health information by patients. The web searches were designed
to be connected to hospital discharge summaries from the patient’s Electronic
Medical Record (EMR), thus effectually modeling a post-visit consumer
information need. This constrains the ad-hoc retrieval of webpages to a limited number
of medical pages; it also differs from recent medical retrieval tasks like patient
cohort identification from EMR text [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] in that it is web search.
This paper describes our participation in Task 3 of 2013 ShARe/CLEF eHealth
Evaluation Lab. For this retrieval task, we primarily investigated three research
questions: 1) to what degree retrieval techniques effective in searching
Electronic Medical Records (EMRs) could aid in finding medical web documents;
2) to what degree medical web retrieval would benefit from natural language
processing (NLP) techniques that extract information from text based on
medical knowledge; and 3) how to leverage contextual information in the patient’s
discharge summaries to improve retrieval.
      </p>
      <p>In particular, we used a two-step ranking strategy. In Step 1, we performed
retrieval in the text space, where the documents and queries were in their raw text
form, by using the MRF, MRM, and MeSH-based query expansion — the first
research question. In Step 2, we re-ranked the output from Step 1 in the concept
space where everything was represented by the medical concepts. In addition,
we produced contextual artifacts of the medical concepts in documents and
discharge summaries by using a clinical NLP annotation tool called MedTagger,
and incorporated those artifacts in our retrieval models.</p>
      <p>To derive the MRM model, we used several external sources: the TREC 2011
Medical Records Track test collection, the TREC 2007 Genomics Track test
collection, and a subset of Mayo Clinic clinical notes collected between 2001–
2010. To apply domain knowledge and construct the concept space, we adopted
the Concept Unique Identifier (CUI) in the Unified Medical Language System
(UMLS) for representing medical concepts.</p>
      <p>We submitted seven runs to ShARe/CLEF eHealth. When evaluated by
precision at 10 (P@10), our best run, which used the two-step ranking strategy and
leveraged information in patients’ discharge summaries, were above or at the
CLEF median for all but 2 of 50 test queries.</p>
      <p>The rest of the paper proceeds as follows: Sect. 2 presents the overview of our
system and follows it with detailed description about the underlying retrieval
models and medical-specific features. Then, Sect. 3 describes different system
settings for our submitted runs, and shows the official evaluation results. Finally,
Sect. 4 concludes our research findings.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Retrieval System</title>
      <p>In this section, we first present an overview of our retrieval system, and then
detail its underlying retrieval models and domain-specific features.
2.1</p>
      <sec id="sec-2-1">
        <title>System Overview</title>
        <p>Indexing I</p>
        <p>Index T
Query
2</p>
        <p>Retrieving I</p>
        <p>2
1
4
2
5</p>
        <p>Text</p>
        <p>Space
Concept
Space</p>
        <p>Rank
List I
6
6
Reranking</p>
        <p>Rank</p>
        <p>List II
Indexing II</p>
        <p>Index C
5</p>
        <p>Retrieving II
5
Discharge
Summary
3
3
CUI Discharge</p>
        <p>Summary</p>
        <p>Docs
3
3
CUI
Docs
5
1
3
3
4
CUI</p>
        <p>
          Query
NLP Concept Extraction3
1. Indexing I: we used Indri3 [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] to build the index T for the target collection
(i.e., the collection of CLEF shared task) in the text space. In particular, we
formulated each query based only on the information of the ‘title’ field in
the query file. After html cleaning with Boilerpipe4 [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], we stemmed words
in both text documents and queries by Porter stemmer, and used a standard
medical stoplist [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] for stopping words in queries only.
2. Retrieving I: we obtained intermediate retrieval results (i.e., rank list 1 in
Fig. 1) by querying against the index T. We will detail the retrieval models
in Sect. 2.2.
3. NLP Concept Extraction: we used MedTagger [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] to convert text into the
concept space, i.e., CUIs. MedTagger extracts and annotates concepts from
clinical text by leveraging large corpus statistics, machine learning,
knowledge bases, and syntactic parsing. The output of MedTagger includes CUIs
as well as their attributes, such negation, uncertainty, semantic group, etc.
Documents in rank list 1, text queries, and the discharge summaries
associated with each query were all mapped into the concept space in this manner.
4. Indexing II: we built another index for the CUI documents, again using Indri.
        </p>
        <p>However, we did not stem the CUIs since they are all unique. Due to the
computational cost of NLP concept extraction on 1.6 million documents, we
3 http://www.lemurproject.org/indri/
4 http://code.google.com/p/boilerpipe/
only indexed the 61,273 documents that were present in retrieved results of
the 50 test queries.
5. Retrieving II: the query likelihood model was used to retrieve CUI-only
documents based on CUI-only queries. The concepts from discharge summaries
were also included for the relevant submissions. We will elaborate on this in
Sect. 2.3 where we describe our retrieval models in the concept space.
6. Re-ranking: we weight the text-based retrieval rankings against the
conceptbased retrieval rankings. Because retrieving in the concept space is done
on the intermediate text-based output (Rank List 1), it is essentially a
reranking procedure.
2.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Text-based Retrieval</title>
        <p>In this section, we describe our text-based retrieval models.</p>
        <p>
          Query Likelihood Model. We used the query likelihood language model [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]
as our baseline. This model scores documents for queries with the intuition that
a user will think of the words in a document and try to write a query that is
similar. Formally, the scoring function is a sum of the logarithms of smoothed
probabilities:
score(D; Q) = log P (QjD) =
n
∑ log
i=1
tfqi;D +
jDj +
tfqi;C
jCj ;
(1)
where qi is the ith term in query Q, n is the total number of terms in Q, jDj
and jCj are the document and collection lengths in words respectively, tfqi;D
and tfqi;C are the document and collection term frequencies of qi respectively,
and is the Dirichlet smoothing parameter. Smoothing is a common technique
for estimating the probability of unseen words in the documents [
          <xref ref-type="bibr" rid="ref8 ref9">8, 9</xref>
          ]. An Indri
query using this query likelihood model looks like the following:
        </p>
        <p>#combine(shortness breath swelling).</p>
        <p>Note that we formulated the above query by removing the stop words ‘of’ and
‘and’ from its original version ‘shortness of breath and swelling’.</p>
        <p>
          This model has been shown to be effective in web search and in medical records;
here, we test the intersection of the two domains — medical web texts.
State-ofthe-art performance on medical records is possible with language models when
augmented with advanced retrieval models, which we turn to now.
Markov Random Field Model. In the query likelihood model, it is a strong
assumption that query terms are generated independently from the document
language model. In reality, related terms are likely to occur in close proximity to
each other. The Markov random field (MRF) model [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] improves upon query
likelihood model by incorporating term proximity information. It works by first
constructing a graph that contains a document node, one node per query term,
and edges that represent dependencies among nodes. Then, MRF models the
joint distribution over the document random variable and query term random
variables. The ranking function of the MRF model is of the form:
        </p>
        <p>P (QjD) ra=nk ∑</p>
        <p>T fT (c) + ∑</p>
        <p>OfO(c) + ∑</p>
        <p>
          U fU (c);
(2)
c2T
c2O
c2O[U
where T is defined to be the set of 2-cliques containing the document node and
a query term node, O is the set of cliques involving the document node and two
or more query terms that appear contiguously in the query, and U is the set of
cliques involving the document node and two or more query terms that appear
non-contiguously within the query. f (c) is the feature function over clique c
and ’s are the feature weights. MRF model has been shown to consistently
outperform the standard unigram model across a range of TREC test collections [
          <xref ref-type="bibr" rid="ref10 ref11">10,
11</xref>
          ].
        </p>
        <p>
          Following Metzler and Croft [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], we set the feature weights ( T ; O; U ) to (0.8,
0.1, 0.1). An Indri query using MRF model looks like:
#weight( 0.8 #combine(shortness breath swelling)
0.1 #combine( #1(breath swelling) #1(shortness breath) )
0.1 #combine( #uw8(breath swelling) #uw8(shortness breath) ) ),
where each ‘#1()’ phrase specifies the ordered query terms, and each ‘#uw8()’
phrase include two query terms that can occur within a text span of 8 terms.
Mixture of Relevance Models. We also used the mixture of relevance models
(MRM) [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] to expand the query with related terms for reducing the vocabulary
gap between query language and document language. Previous work has shown
that MRM model significantly improve the retrieval in general web domain [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]
as well as in the clinical domain [
          <xref ref-type="bibr" rid="ref13 ref14">13, 14</xref>
          ].
        </p>
        <p>
          In particular, we selected several external collections along with the target
collection (i.e., CLEF) for building our MRM model. Our preliminary work suggests
that the domain and characteristics of the supporting external collections have an
effect on retrieval performance, thus we choose the medically oriented resources
in Table 1, which are shown with their collection statistics. The Medical and
Genomics collections are from TREC 2011 Medical Records Track [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] and TREC
2007 Genomics Track [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] respectively. The MayoClinic collection is a subset of
Mayo Clinic clinical notes collected between 2001–2010, retrieved from the Mayo
Clinic Life Sciences System (MCLSS). This includes data from a comprehensive
snapshot of Mayo Clinic’s service areas, excluding only microbiology,
radiology, ophthamology, and surgical reports. This corpus has been characterized for
its clinical information content (namely, medical concepts [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] and terms [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ])
and compared to other corpora, such as the 2011 MEDLINE/PubMed Baseline
Repository and the 2010 i2b2 NLP challenge dataset [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ].
where the ‘#combine()’ phrase corresponds to the original query, and the four
inner ‘#weight()’ phrases correspond to the relevance models derived respectively
from four individual expansion collections listed in Table 1.
        </p>
        <p>
          Query Expansion with MeSH. Previous work has shown that expanding
queries with related Medical Subject Headings (MeSH) enhances retrieval
performance significantly in an EMR cohort identification setting [
          <xref ref-type="bibr" rid="ref13 ref19">19, 13</xref>
          ]. Furthermore,
since MeSH terms were designed to categorize scientific and medical literature,
we believe MeSH is a good fit for the problem of searching the web for medical
reference information. Thus, for each MetaMap-detected MeSH concept [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ] in
the query we included its entry terms and descendant nodes in the MeSH
hierarchy for expansion [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]. An Indri query using MeSH-based query expansion
looks like:
#weight( 0.7 #combine(shortness breath swelling)
0.3 #weight( 0.1 #uw16(dyspnea paroxysmal) 0.1 edema
0.1 hydrops 0.1 #uw16(edema cardiac) 0.1 #1(hydrops fetalis)
0.1 anasarca 0.1 dropsy 0.1 #1(shortness breath)
0.1 dyspneas 0.1 #1(breath shortnesses)) ),
where the ‘#combine()’ phrase contains the original query, and the inner ‘#weight()’
phrase contains MeSH expansion terms associated with MeSH terms detected in
the original query.
2.3
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>Concept-based Retrieval for Re-ranking</title>
        <p>In this section, we describe several concept-based retrieval models. We used these
models for re-ranking the output from the text-based retrieval. The motivation of
this re-ranking process was our second research question, namely, whether NLP
approaches for concept identification could incorporate some medical domain
knowledge and further improve the results from text-based retrieval.
Concept Unique Identi ers. In the concept space as shown in Fig. 1, we
represented everything by UMLS CUI. These shallow semantics are a common
normalization target for medical NLP systems, since extensive work has gone into
the curation of the UMLS Metathesaurus in reconciling between many different
source ontologies, terminologies, and vocabularies. We used the query likelihood
model (see Eq. 1) for computing the relevance score. An Indri query using this
model will look like:</p>
        <p>#combine(C0225386 C0347940),
where the C0225386 is the CUI related to breath or respiratory air, and C0347940
is related to chest swelling.</p>
        <p>Discharge Summary. Since queries were generated based on discharge
summaries (DS), we hypothesized that DS could contain ‘hidden’ concepts that did
not appear in the query but were related to query concepts. Psychologically, if
a patient finishes reading his/her medical record, these ‘hidden’ concepts might
serve as semantically priming for the patient’s mind when he/she formulates the
query. This may be valuable context that informs us about the patient’s actual
information need.</p>
        <p>To identify these ‘hidden’ concepts and use them for query expansion, we again
used MedTagger to convert text DS to CUI DS, and then weighted each CUI by
its term frequency, and finally took the top 20 CUIs with their weights for query
expansion. An Indri query formulated based on CUI DS looks like:
20 C1269008 20 C0869781 20 C0205042 12 C0018802 11 C1278960
11 C0729538 11 C0398102 11 C0392907 11 C0042449 11 C0036186
10 C0030685 9 C2926602 9 C0460139 9 C0234222 9 C0012621
8 C1548828 ) ),
where expansion CUI terms (e.g., those related to heart and artery: C0018787,
C0003842, and C0205042) are potentially useful for retrieving information
related to chest swelling and shortness of breath.</p>
        <p>Attributes. In addition to content artifacts (CUIs), Medtagger also produced
contextual attributes of CUIs, such as negation, uncertainty, semantic group,
and experiencer. We used negation and uncertainty attributes in our submitted
system. In particular, we experimentally assigned weights to different ‘values’ of
CUI attribute, as illustrated in Table 2. Those weights embody the importance
of specific attribute values of CUIs in terms of ranking documents. Note that
Table 2 corresponds to the weight assignment when there are no negated CUIs
in the original query and the status of each query CUI is ‘confirmed’.
Thus, an Indri query using CUI attributes will look like:
#weight(0.475 C0225386.sta0,pos, 0.0475 C0225386.sta1,pos,
0.095 C0225386.sta2,pos, 0.3325 C0225386.sta3,pos,
0.025 C0225386.sta0,neg, 0.0025 C0225386.sta1,neg,
0.005 C0225386.sta2,neg, 0.0175 C0225386.sta3,neg),
where we applied the attributes values as field restrictions in Indri, e.g., only
negated C0225386 with ‘probable’ status in a document can match the Indri
query term ‘C0225386.sta3,neg’, and consequently contribute with a weight value
of 0.0175 to the relevance score of that document. Note that we used only one
CUI in the above example for a simple demonstration. Longer CUI attribute
queries can be formulated in a similar way.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Experiments and Results</title>
      <p>We submitted 7 runs based on different settings of our retrieval system. For all
the runs, we experimentally set the Dirichlet smoothing parameter to 2500. In
this section, we describe each run in more detail, and present the corresponding
evaluation results.
3.1</p>
      <sec id="sec-3-1">
        <title>Submitted Runs</title>
        <p>Run 4: On top of the setting for run 3, we further applied the CUI attributes
(see Sect 2.3) for re-ranking documents in the concept space. Tests the third
research question (usefulness of discharge summaries) when compared to run
#7.</p>
        <p>Run 5: We used the same setting as run 3 for the text-based retrieval, i.e.,
we leveraged MRF, MRM, and MeSH-based query expansion. However, we
skipped the re-ranking process and used the initial ranking (i.e, rank list 1
in Fig. 1) for submission. Tests the first research question (robustness of IR
methods from medical records) when compared to the baseline.</p>
        <p>Run 6: On top of the setting for run 5, we further re-ranked the top 1000
retrieved documents in the concept space by using CUIs only (see Sect. 2.3).
Tests the second research question (usefulness of NLP methods) when
compared to run #5.</p>
        <p>Run 7: On top of the setting for run 6, we further exploited CUI attributes
(see Sect. 2.3) for documents re-ranking. Tests the second research question
(usefulness of NLP methods) when compared to run #5.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Results and Discussion</title>
        <p>Table 3 shows the official evaluation results for our submitted runs on the
primary evaluation metrics P@10 and NDCG@10. Run 2 obtained the best overall
performance among the seven runs. We further compare the P@10 performance
of run 2 with the medians of all submitted runs of CLEF. As we can see in Fig. 2,
for 14 out of the 50 test queries, run 2 obtained the best P@10 scores. Run 2 is
also at or above the median for all but two of the queries (Topic 32 and Topic
49).</p>
        <p>In error analysis, we found that the top 10 retrieved for Topic 32 (which contains
a single query term ‘sob’, i.e., shortness of breath) are all non-English documents
which are automatically considered as irrelevant according to the task guideline.
This points to possible improvement through modifying the preprocessing
algorithm, including a language filter. Our system performed reasonably on Topic
49, but other systems in the Task 3 pool outperformed it.</p>
        <p>The comparison between the pairs (run 3 vs. 4, and run 6 vs. 7) suggests that the
CUI attributes hurt the performance. This might be because due to the heuristic
attribute weight settings (see Table 2), an issue due to the limited training data
(only 5 sample queries were available for training).</p>
        <p>The comparison between runs 3 and 6 and between 4 and 7 are insignificantly
pessimistic about incorporating information in the discharge summaries into our
retrieval model.</p>
        <p>Furthermore, the performance of run 5 is comparable to run 2, suggesting that
the text-based retrieval model alone is quite competitive. It is unclear whether
the improved performance in run 2 is due to the absence of the MeSH terms, or
the presence of the concept space.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>We have shown that the text-based retrieval leveraging the well-tested MRF
and MRM presented relatively strong performance for this CLEF retrieval task.
The best-performing system also included NLP results and query expansion
through discharge summaries. We have also found, however, that named entity
attributes should not be used with untested heuristics. Future work includes a
more principled inclusion of named entity attributes, different preprocessing of
web text, and cross-validation to determine the stability of the results.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgements</title>
      <p>The work was supported by ABI: 0845523 from United States National Science
Foundation, R01LM009959 from United States National Institute of Health. The
challenge was organized by the Shared Annotated Resources (ShARe) project,
funded by the United States National Institutes of Health with grant number
R01GM090187.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Suominen</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , Salanter¨a,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Velupillai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Chapman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.W.</given-names>
            ,
            <surname>Savova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Elhadad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            ,
            <surname>Mowery</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Leveling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Goeuriot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Kelly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Martinez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Zuccon</surname>
          </string-name>
          ,
          <string-name>
            <surname>G.</surname>
          </string-name>
          :
          <article-title>Overview of the ShARe/CLEF eHealth Evaluation Lab 2013</article-title>
          .
          <source>In: CLEF 2013. Lecture Notes in Computer Science (LNCS)</source>
          , Springer (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Voorhees</surname>
            ,
            <given-names>E.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tong</surname>
            ,
            <given-names>R.M.:</given-names>
          </string-name>
          <article-title>DRAFT: Overview of the TREC 2011 medical records track</article-title>
          .
          <source>In: TREC</source>
          . (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Strohman</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Metzler</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Turtle</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Croft</surname>
          </string-name>
          , W.B.:
          <article-title>Indri: A language model-based search engine for complex queries</article-title>
          .
          <source>In: Proceedings of the International Conference on Intelligent Analysis</source>
          .
          <article-title>(</article-title>
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4. Kohlschu¨tter,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Fankhauser</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Nejdl</surname>
          </string-name>
          ,
          <string-name>
            <surname>W.</surname>
          </string-name>
          :
          <article-title>Boilerplate detection using shallow text features</article-title>
          .
          <source>In: Proceedings of the third ACM international conference on Web search and data mining</source>
          ,
          <source>ACM</source>
          (
          <year>2010</year>
          )
          <fpage>441</fpage>
          -
          <lpage>450</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Hersh</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          :
          <article-title>Information Retrieval: A Health and Biomedical Perspective</article-title>
          .
          <article-title>Third edn</article-title>
          .
          <source>Health Informatics</source>
          . Springer (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>S.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jonnalagadda</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sohn</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wagholikar</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haug</surname>
            ,
            <given-names>P.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huff</surname>
            ,
            <given-names>S.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chute</surname>
            ,
            <given-names>C.G.</given-names>
          </string-name>
          :
          <article-title>Towards a semantic lexicon for clinical natural language processing</article-title>
          .
          <source>In: AMIA Annual Symposium Proceedings. Volume</source>
          <year>2012</year>
          ., American Medical Informatics Association (
          <year>2012</year>
          )
          <fpage>568</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Ponte</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Croft</surname>
          </string-name>
          , W.B.:
          <article-title>A language modeling approach to information retrieval</article-title>
          .
          <source>In: Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval</source>
          ,
          <source>ACM</source>
          (
          <year>1998</year>
          )
          <fpage>275</fpage>
          -
          <lpage>281</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>S.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goodman</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>An empirical study of smoothing techniques for language modeling</article-title>
          .
          <source>In: Proceedings of the 34th annual meeting on Association for Computational Linguistics</source>
          , Association for Computational Linguistics (
          <year>1996</year>
          )
          <fpage>310</fpage>
          -
          <lpage>318</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Zhai</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lafferty</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>A study of smoothing methods for language models applied to ad hoc information retrieval</article-title>
          .
          <source>In: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval</source>
          ,
          <source>ACM</source>
          (
          <year>2001</year>
          )
          <fpage>334</fpage>
          -
          <lpage>342</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Metzler</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Croft</surname>
          </string-name>
          , W.B.:
          <article-title>A markov random field model for term dependencies</article-title>
          .
          <source>In: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval</source>
          ,
          <source>ACM</source>
          (
          <year>2005</year>
          )
          <fpage>472</fpage>
          -
          <lpage>479</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Metzler</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Croft</surname>
          </string-name>
          , W.B.:
          <article-title>Linear feature-based models for information retrieval</article-title>
          .
          <source>Information Retrieval</source>
          <volume>10</volume>
          (
          <year>June 2007</year>
          )
          <fpage>257</fpage>
          -
          <lpage>274</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Diaz</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Metzler</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Improving the estimation of relevance models using large external corpora</article-title>
          .
          <source>In: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval</source>
          , New York, NY, USA, ACM (
          <year>2006</year>
          )
          <fpage>154</fpage>
          -
          <lpage>161</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carterette</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Exploring evidence aggregation methods and external expansion sources for medical record search</article-title>
          .
          <source>In: Proceedings of The 21th Text REtrieval Conference</source>
          . (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carterette</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Combining multi-level evidence for medical record retrieval</article-title>
          .
          <source>In: Proceedings of the 2012 International Workshop on Smart Health and Wellbeing</source>
          , Maui, USA, ACM (
          <year>October 2012</year>
          )
          <fpage>49</fpage>
          -
          <lpage>56</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Hersh</surname>
            ,
            <given-names>W.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cohen</surname>
            ,
            <given-names>A.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ruslen</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roberts</surname>
            ,
            <given-names>P.M.:</given-names>
          </string-name>
          <article-title>TREC 2007 genomics track overview</article-title>
          . In: TREC. (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , Liu, H.:
          <article-title>Semantic Characteristics of NLP-extracted Concepts in Clinical Notes vs</article-title>
          .
          <source>Biomedical Literature. In: Proceedings of AMIA</source>
          <year>2011</year>
          .
          <article-title>(</article-title>
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , Liu,
          <string-name>
            <given-names>H.</given-names>
            ,
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Tao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Musen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Chute</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Shah</surname>
          </string-name>
          ,
          <string-name>
            <surname>N.</surname>
          </string-name>
          <article-title>: UMLS Term Occurrences in Clinical Notes: A Large-scale Corpus Analysis</article-title>
          .
          <source>In: Proceedings of the AMIA Joint Summit on Clinical Research Informatics</source>
          . (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Uzuner</surname>
            ,
            <given-names>O</given-names>
          </string-name>
          ¨ .,
          <string-name>
            <surname>South</surname>
            ,
            <given-names>B.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shen</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , DuVall, S.L.:
          <year>2010</year>
          i2b2/
          <article-title>va challenge on concepts, assertions, and relations in clinical text</article-title>
          .
          <source>Journal of the American Medical Informatics Association</source>
          <volume>18</volume>
          (
          <issue>5</issue>
          ) (
          <year>2011</year>
          )
          <fpage>552</fpage>
          -
          <lpage>556</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carterette</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Improving health records search using multiple query expansion collections</article-title>
          .
          <source>In: 2012 IEEE International Conference on Bioinformatics and Biomedicine</source>
          , Philadelphia, USA (
          <year>October 2012</year>
          )
          <fpage>1</fpage>
          -
          <lpage>7</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Aronson</surname>
            ,
            <given-names>A.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lang</surname>
            ,
            <given-names>F.M.:</given-names>
          </string-name>
          <article-title>An overview of MetaMap: historical perspective and recent advances</article-title>
          .
          <source>Journal of the American Medical Informatics Association (JAMIA) 17(3)</source>
          (
          <year>2010</year>
          )
          <fpage>229</fpage>
          -
          <lpage>36</lpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>