<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Task 2: ShARe/CLEF eHealth Evaluation Lab 2013</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Danielle L. Mowery</string-name>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Brett R. South</string-name>
          <email>brett.south@hsc.utah.edu</email>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lee Christensen</string-name>
          <email>leenlp@q.com</email>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Laura-Maria Murtola</string-name>
          <email>laura-maria.murtola@utu.fi</email>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sanna Salantera</string-name>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hanna Suominen</string-name>
          <email>hanna.suominen@nicta.com.au</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>David Martinez</string-name>
          <email>david.martinez@nicta.com.au</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Noemie Elhadad</string-name>
          <email>noemie@dbmi.columbia.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sameer Pradhan</string-name>
          <email>sameer.pradhan@childrens.harvard.edu</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Guergana Savova</string-name>
          <email>guergana.savova@childrens.harvard.edu</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Wendy W. Chapman</string-name>
          <email>wwchapman@ucsd.edu</email>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Columbia University</institution>
          ,
          <addr-line>NY</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Harvard University</institution>
          ,
          <addr-line>MA</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>NICTA and The Australian National University</institution>
          ,
          <addr-line>ACT</addr-line>
          ,
          <country country="AU">Australia</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>NICTA and The University of Melbourne</institution>
          ,
          <addr-line>VIC</addr-line>
          ,
          <country country="AU">Australia</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>University of California</institution>
          ,
          <addr-line>San Diego, CA</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff5">
          <label>5</label>
          <institution>University of Pittsburgh</institution>
          ,
          <addr-line>PA</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff6">
          <label>6</label>
          <institution>University of Turku</institution>
          ,
          <country country="FI">Finland</country>
        </aff>
        <aff id="aff7">
          <label>7</label>
          <institution>University of Utah, UT</institution>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this pilot study, we aimed to generate a reference standard of clinical acronyms and abbreviations normalized to concepts from a standardized, medical vocabulary for the ShARe/CLEF eHealth 2013 challenge. In this paper, we review prior text normalization shared tasks, reference standard generation approaches, and recent clinical acronym and abbreviation normalization research. We report inter-annotator agreement for the reference standard and performance for participant systems.</p>
      </abstract>
      <kwd-group>
        <kwd>Natural Language Processing</kwd>
        <kwd>Text Normalization</kwd>
        <kwd>Reference Standard Generation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        Health care organizations are shifting towards a patient centered approach in
care delivery. One aspect of this approach is patient access to personal health
information (PHI) including their clinical reports. Allowing patients access to
their PHI should increase patient knowledge of their own health status, enhance
patient involvement in care related decision-making, and improve
communication between the patients and care providers [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. However, patients that have
accessed their PHI experience worry and confusion due to use of medical jargon
? WWC, BRS, and DLM led the task, WWC, BRS, DLM, NE, SP, and GS de ned
the task, DLM, BRS, LMM and SS led the annotation e ort, HS and SS co-chaired
the lab, DLM, BRS, LC, and DM processed and distributed the dataset, and DLM,
DM and WWC led result evaluations
such as unfamiliar concepts and abbreviations [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ],[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Indeed, a lack of medical
language understanding can contribute to poor post-encounter care adherence
when patients can not understand their discharge summary instructions [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>Natural Language Processing (NLP) can help patients understand their health
status by enriching PHI with meta-data (presenting patient-speci c words and
de nitions for unfamiliar concepts and abbreviations) that assists them in
understanding the content of clinical reports.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Background</title>
      <p>2.1</p>
      <sec id="sec-2-1">
        <title>Shared Task Annotations</title>
        <p>
          Annotated datasets are often used to train NLP systems to convert narrative
texts into machine computable representations. The annotated datasets serve as
a reference standard (also known as gold standard or ground truth) and
supports both system development and evaluation [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. The reference standard must
be both reliable and valid to provide the most optimal training and evaluation
data. Since 2006, various shared-task challenges have provided reference
standards for the clinical NLP community, including the CCHMC Computational
Medicine Challenge and the i2B2 Shared Tasks. Topics for these shared tasks
include assigning discharge diagnosis codes to radiology reports [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ], for
identifying clinical events and their relations [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ], for nding personally identi able
information [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ], for classifying patient smoking status [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ], for determining
obesity comorbidities [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], and for extracting medication mentions [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]. Reference
standards are usually created by annotations of multiple domain experts with
separate adjudication for disagreements.
        </p>
        <p>
          Previous large-scale annotation e orts include the Message Understanding
Conference (MUC) [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ], Text Retrieval Conference (TREC) [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ], [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ], Clinical
E-Science Framework (CLEF) [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ], [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ], GENIA [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ], [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ], and Penn Treebank
[
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]. Work by Roberts [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ], [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ], Savova [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ], [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ], Chapman [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ], [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ], [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ],
Uzuner [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ], and previous i2b2 challenges [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ], [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ], [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ] provide context and
motivation for this year's rst ShARe/CLEF eHealth shared task including the
development of the annotation guidelines, annotation schema, and evaluation
of participant systems against the resulting reference standard. Continuing the
tradition of shared tasks providing annotated data for development and
evaluation of NLP systems for potentially useful applications, this year's ShARe/CLEF
eHealth shared task focused on facilitating understanding of information in
narrative clinical reports, such as discharge summaries, by identifying and
normalizing disease/disorders (Task 1), normalizing acronym/abbreviations (Task 2),
and retrieving documents from the health and medicine websites for addressing
questions patients may have about the disease/disorders in the clinical notes
(Task 3). The ShARe/CLEF eHealth Evaluation Lab is the rst step towards
a shared task that evaluates our ability to help patients and family members
understand their clinical records. In this paper, we discuss Task 2.
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>NLP for Acronym and Abbreviation</title>
        <p>
          AAs occurring in clinical texts present unique challenges to patient readers
due to genre-speci c senses (e.g., in an echocardiogram, \BP" likely represents
\blood pressure" rather than \Bell's Palsy"), lack of parenthetical de nitions
(e.g., \HTN" (Hypertension)), and ambiguous uses or word senses (e.g., \MS"
can mean \mental status" or \multiple sclerosis" even in the same report genre)
[
          <xref ref-type="bibr" rid="ref27">27</xref>
          ]. To potentially aid patient readers in understanding clinical reports, Task 2
involved normalizing pre-annotated Acronyms and Abbreviations (AAs) to the
Uni ed Medical Language System (UMLS) [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ].
        </p>
        <p>
          Researchers have developed systems to normalize AAs in clinical texts for
information extraction, information retrieval, and document summarization
applications. Wu [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ] compared the performance of current, existing clinical NLP tools
- MetaMap, MedLEE, and cTAKES - for identifying boundaries of and
normalizing clinical AAs in discharge summaries, with performances ranging from
coverage: 0.37-0.59 (boundary detection) and F-scores: 0.21-0.71 (normalization). The
MedLEE system outperformed MetaMap and cTAKES for all tasks; poor
performances by MetaMap and cTAKES were attributed to a lack of clinical sense
inventories or disambiguation modules. Automated disambiguations methods
developed by Moon [
          <xref ref-type="bibr" rid="ref30">30</xref>
          ] showed promise for developing e ective acronym sense
disambiguation solutions using minimal training data. Moon achieved
accuracies greater than 0.90 using a support vector machine trained on 125 samples
encoded with words, part of speech tags, MetaMap concept unique identi ers,
and sections.
        </p>
        <p>
          Our long-term goal is to facilitate development and evaluation of automated
NLP tools for enriching clinical reports with meta-data that assists patients,
providers, and family members in understanding the content of the reports. An
important foundational step towards this goal is mapping acronyms and
abbreviations to their de nitions and potentially to consumer-oriented dictionaries
like the Consumer Health Vocabulary [
          <xref ref-type="bibr" rid="ref31">31</xref>
          ]. Next, we describe the annotation
schema, the dataset, the annotation process, and the evaluation methods used
for the ShARe/CLEF eHealth Evaluation Lab Task 2.
3
3.1
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Methods</title>
      <sec id="sec-3-1">
        <title>Annotation Schema</title>
        <p>
          We developed annotation schema guidelines based on the study by Xu [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ],
iterative annotation of 10 development reports, and discussions among the
coauthors. Similar to Xu [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ], we instructed annotators to only annotate clinically
relevant AAs. For instance, annotators did not include general English terms
such as salutations (\Mr.") or time (\am"), but could include services (\EMS"),
locations (\ICU"), section headers (\HEENT"), and medications (see Ex. 1-3
below). Once an AA was annotated, we instructed annotators to select the
closest UMLS concept sense.
Ex 1: He was given Vanco. \Vanco" is a mention of type Acronym/Abbreviation
with CUI C0042313 (UMLS preferred term is \Vancomycin")
Ex 2: Patient has breast ca. \ca" is a mention of type Acronym/Abbreviation
with CUI C0006826 (UMLS preferred term is \Malignant Neoplasms")
Ex 3: Mitral Valve: Trivial MR. \MR" is a mention of type Acronym/Abbreviation
with CUI C0026266 (UMLS preferred term is \Mitral Valve Insu ciency")
3.2
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>Dataset</title>
        <p>
          We annotated AAs on top of the ShARe (Shared Annotated Resources) dataset,
a strati ed subset of 300 de-identi ed clinical reports from over 30,000 ICU
patients stored in the MIMIC (Multiparameter Intelligent Monitoring in Intensive
Care) II database [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ]. The ShARe corpus consists of discharge summary,
electrocardiogram, echocardiogram, and radiology report types annotated for
disease/disorders, corresponding SNOMED codes, and attributes such as negation
and severity. For Task 1, disease/disorder named entities and their SNOMED
codes were released for the Evaluation Lab. For Task 2, we maintained the
training (n=200 reports) and test (n=100 reports) dataset splits from Task 1.
        </p>
        <p>
          To characterize our dataset, we split and tokenized sentences in the reports
using NLTK (Natural Language ToolKit) [
          <xref ref-type="bibr" rid="ref33">33</xref>
          ]. We measured average report
length by count of sentences in a report, average sentence length by count of
tokens in a sentence, and average token count by count of tokens in a report.
3.3
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>Annotation Access and Process</title>
        <p>
          Due to the nature of sensitive, patient-oriented information stored in clinical
reports, a data access procedure was implemented. After registration for
annotation with the University of California NLP Annotation Registry [
          <xref ref-type="bibr" rid="ref34">34</xref>
          ], annotators
were required to obtain permission to access the ShARe dataset, which included
(1) a CITI [
          <xref ref-type="bibr" rid="ref35">35</xref>
          ] or NIH [
          <xref ref-type="bibr" rid="ref36">36</xref>
          ] Training certi cate in Human Subjects Research, (2)
registration on the Physionet.org site [
          <xref ref-type="bibr" rid="ref37">37</xref>
          ], (3) signing a Data Use Agreement to
access the Mimic II data. See the ShARe website [
          <xref ref-type="bibr" rid="ref38">38</xref>
          ] for details.
        </p>
        <p>
          For annotation training, annotators were provided an annotation kit
consisting of 1) the eHOST (extensibleHuman Oracle Suite of Tools) [
          <xref ref-type="bibr" rid="ref39">39</xref>
          ] for annotating
the text, 2) a quick start guide for using the tool, 3) a Camtasia video for
training the annotators, and 4) an annotation guideline for learning the task. These
materials were reviewed with each annotator through an interactive annotation
training session using join.me.
        </p>
        <p>We incorporated both clinical professionals and informatics experts to
generate a reference standard re ecting both domain knowledge and NLP annotation
expertise. We recruited a total of 15 annotators; 11 completed the data access
procedure and attempted to annotate the dataset.</p>
        <p>The reference standard was annotated in three steps:</p>
        <p>Step 1) 9 Finnish nursing professionals, 1 Australian nurse, and 1 Australian
biomedical informatician were provided pre-annotated disease/disorder
annotations from Task 1. They were instructed to span each AAs, then map each
concept to one CUI (concept unique identi er) from the UMLS. If a CUI did
not exist in the vocabulary for the AA, the annotator was instructed to assign
the label \CUI-less".</p>
        <p>Step 2) One US biomedical informatician reviewed and adjudicated the
annotated spans from Step 1 annotators.</p>
        <p>Step 3) One US respiratory therapist reviewed and adjudicated the annotated
spans from Step 2.</p>
        <p>Annotators for Steps 2 and 3 were instructed to delete spurious, modify
existing, and add missing AA spans as well as correct their CUI mappings.
3.4</p>
      </sec>
      <sec id="sec-3-4">
        <title>Participant Recruitment and Registration</title>
        <p>To recruit participants, we sent emails to relevant listservs, including Corpora,
SigIR, BioNLP, AMIA NLP Working Group, and CLEF. After registration for
tasks through the CLEF Evaluation Lab, participants were required to take
the same steps as annotators to obtain permission to access the ShARe/CLEF
eHealth dataset.
3.5</p>
      </sec>
      <sec id="sec-3-5">
        <title>Evaluation Metrics</title>
        <p>We calculated inter-annotator agreement for the reference standard annotations
and calculated accuracy of the participant systems when compared against the
reference standard.</p>
        <p>
          Annotator Agreement We determined inter-annotator agreement by
comparing annotations resulting from Step 2 against those resulting from Step 3
using the Evaluation Workbench [
          <xref ref-type="bibr" rid="ref40">40</xref>
          ]. Since the number of strings not annotated
as AAs (i.e., true negatives (TN)) is very large, we followed [
          <xref ref-type="bibr" rid="ref41">41</xref>
          ] in
calculating F1-score as a surrogate for kappa. F1-score is the harmonic mean of recall
and precision, calculated from true positive, false positive, and false negative
annotations, which were calculated as follows:
true positive (TP) = the annotation from Step 2 had overlapping character
o sets with the annotation from Step 3 and was assigned the same CUI
false positive (FP) = an annotation from Step 2 did not exist in Step 3
annotations
false negative (FN) = an annotation from Step 3 did not exist in Step 2
annotations
Recall =
Precision =
F1-score =
        </p>
        <p>T P
(T P + F N )</p>
        <p>T P
(T P + F P )
(3)
System Performance We evaluated performance of participating systems by
calculating the accuracy of system performance against the manual AA
annotations as follows: Accuracy = count of correct AAs divided by total count of
AAs. We calculated Strict Accuracy based on the AA annotations resulting from
Step 3 review. Because there is sometimes more than one CUI that matches an
annotated AA, we also calculated Relaxed Accuracy by de ning a correct AA
annotation as a match with the CUI assigned during Step 3 review or during
Step 2 review.</p>
        <p>
          We evaluated system performance with random shu ing [
          <xref ref-type="bibr" rid="ref42">42</xref>
          ], a non-parametric
statistical signi cance test, to compare the Accuracy scores between
participating systems.
4
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Results</title>
      <p>4.1</p>
      <sec id="sec-4-1">
        <title>Dataset</title>
        <p>The dataset showed an overall average report length of 39 sentences, sentence
length of 20 tokens, and token count of 683 tokens. We also characterized the
dataset by report type illustrated in Table 2. We observed similar distributions
of report length, sentence length, and token counts for both training and test
sets, in spite of the di ering distributions of report types.
Inter-annotator agreement scores between Step 2 and Step 3 annotations was
0.85 for the training set and 0.91 for the test set.
We received a total of 56 data requests for individual researchers.
Participating teams included between 3-7 people and were comprised of scientists,
engineers, professors, post doctoral fellows, and graduate students. Our participants
competed from the US, Australia, France, and China. Participants represented
academic and industrial institutions including University of Texas, Vanderbilt
University, Massachusetts Institute of Technology, West Virginia University,
University of Sydney, Computer Sciences Laboratory for Mechanics and Engineering
Sciences, Tsinghua University, Canon Information Technology, and M*Modal.</p>
        <p>In total, ve teams submitted systems for Task 2. Two system submissions
used external annotations for training; three system submissions used no external
annotations for training. As shown in Table 3, the UTHealthCCB team system
had the highest performance, with accuracies of 0.72 for strict and 0.73 for
relaxed accuracy. Using the majority class \CUI-less", a baseline system evaluated
with strict accuracy only achieved 0.06 accuracy (not shown below).
There are several limitations to this study. We only focused on clinical AAs
from four report types using a convenience corpus. An NLP system may need to
disambiguate AAs from a variety of other report types to aid patients. Although,
our annotators represented a variety of clinical and domain expertise, we did
not evaluate inter-annotator agreement between Step 1 annotators, nor did we
evaluate whether there were di erences between annotators with clinical vs
nonclinical training.
6</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Discussion</title>
      <p>
        Using a step-wise annotation process that incorporated clinical and NLP
expertise, we were able to generate a reference standard with high inter-annotator
agreement. By developing automated systems, participants demonstrated that
an NLP system can interpret clinical AAs with reasonably high accuracy. We
observed only between 4-6% of AAs were \CUI-less" in the strict training and
test sets suggesting reasonable UMLS coverage of clinical AA terms. This
nding demonstrates signi cant improvements since the 2007 study by Xu. Xu [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ]
evaluated coverage of the UMLS and Medline's ADAM vocabulary for
abbreviations, acronyms, shortened words, and contractions annotated in admission
notes. In particular, Xu observed low to moderate coverage of abbreviations
(56-67%), senses (24-38%), and ambiguities (33-71%). However, the UMLS is
primarily used to normalize words to a medical vocabulary. In order to improve
understanding of clinical text by patients, we plan to evaluate the coverage of
clinical AAs from the task dataset against the Consumer Health Vocabulary.
Future tasks will build on the annotations in this task and include more user-based
evaluation metrics, such as how well users understand the content of clinical
reports with meta-data such as de nitions of acronyms and abbreviations.
      </p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>We greatly appreciate the hard work and feedback of our program committee
members and annotators, including, but not limited to Qing Zeng, Tyler
Forbush, Jianwei Leng, Maricel Angel, Eriikka Siirala, Helja Lundgren-Laine, Jenni
Lahdenmaa, Marita Ritmala-Castren, Riitta Danielsson-Ojala, Saija Heikkinen,
and Sini Koivula.</p>
      <p>This shared task was partially supported by NICTA, funded by the
Australian Government as represented by the Department of Broadband,
Communications and the Digital Economy and the Australian Research Council through
the ICT Centre of Excellence program, the CLEF Initiative, the European
Science Foundation (ESF) project ELIAS, the Khresmoi project, funded by the
European Union Seventh Framework Programme (FP7/2007-2013) under grant
agreement no 257528, the ShARe project funded by the United States National
Institutes of Health (R01GM090187), the US Department of Veterans A airs
(VA) Consortium for Healthcare Informatics Research (CHIR), the US O ce
of the National Coordinator of Healthcare Technology, Strategic Health IT
Advanced Research Projects (SHARP) 90TR0002, the Vardal Foundation
(Sweden), and the National Library of Medicine 5T15LM007059.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Medical</given-names>
            <surname>Protection</surname>
          </string-name>
          <article-title>Society: Online medical records and the doctor-patient partnership</article-title>
          .
          <source>MPS research report</source>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Ross</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>The e ects of promoting patient access to medical records: A review</article-title>
          .
          <source>J Am Med Inform Assoc</source>
          <volume>10</volume>
          (
          <issue>2</issue>
          ) (
          <year>2003</year>
          )
          <volume>129</volume>
          {
          <fpage>138</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Delbanco</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Walker</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bell</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Darer</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Elmore</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Farag</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Feldman</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mejilla</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ngo</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ralston</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ross</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Trivedi</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vodicaka</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leveille</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Inviting patients to dread their doctors' notes: a quasi-experimental study and look ahead</article-title>
          .
          <source>Ann Intern Med</source>
          <volume>157</volume>
          (
          <issue>7</issue>
          ) (
          <year>2012</year>
          )
          <volume>461</volume>
          {
          <fpage>470</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Engel</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Buckley</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Forth</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McCarthy</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ellison</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schmidt</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Adams</surname>
          </string-name>
          , J.:
          <article-title>Patient understanding of emergency department discharge summary instructions: Where are knowledge de cits greatest?</article-title>
          <source>Acad Emerg Med</source>
          <volume>19</volume>
          (
          <issue>9</issue>
          ) (
          <year>2012</year>
          )
          <article-title>E1035</article-title>
          {
          <fpage>E1044</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Meystre</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Savova</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kipper-Schuler</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hurdle</surname>
          </string-name>
          , J.:
          <article-title>Extracting information from textual documents in the electronic health record: a review of recent research</article-title>
          .
          <source>Yearb Med Inform</source>
          <volume>35</volume>
          (
          <year>2008</year>
          )
          <volume>128</volume>
          {
          <fpage>44</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Pestian</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brew</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Matykiewicz</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hovermale</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , Johnson, N.,
          <string-name>
            <surname>Cohen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Duch</surname>
            ,
            <given-names>W.:</given-names>
          </string-name>
          <article-title>A shared task involving multi-label classi cation of clinical free text</article-title>
          .
          <source>BioNLP</source>
          <year>2007</year>
          :
          <article-title>Biological, translational, and clinical language processing (</article-title>
          <year>2007</year>
          )
          <volume>97</volume>
          {
          <fpage>104</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Uzuner</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mailoa</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ryan</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sibanda</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Semantic relations for problemoriented medical records</article-title>
          .
          <source>Artif Intell Med</source>
          <volume>50</volume>
          (
          <issue>2</issue>
          ) (
          <year>October 2010</year>
          )
          <volume>63</volume>
          {
          <fpage>73</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Uzuner</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Luo</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Szolovits</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Viewpoint paper: Evaluating the state-of-the-art in automatic de-identi cation</article-title>
          .
          <source>J Am Med Inform Assoc</source>
          <volume>14</volume>
          (
          <issue>5</issue>
          ) (
          <year>2007</year>
          )
          <volume>550</volume>
          {
          <fpage>563</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Uzuner</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goldstein</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Luo</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kohane</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Identifying patient smoking status from medical discharge records</article-title>
          .
          <source>J Am Med Inform Assoc</source>
          <volume>15</volume>
          (
          <year>2008</year>
          )
          <volume>14</volume>
          {
          <fpage>24</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Uzuner</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>Recognizing obesity and co-morbidities in sparse data</article-title>
          .
          <source>J Am Med Inform Assoc</source>
          <volume>16</volume>
          (
          <issue>4</issue>
          ) (
          <year>2009</year>
          )
          <volume>561</volume>
          {
          <fpage>570</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Uzuner</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Solti</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cadag</surname>
          </string-name>
          , E.:
          <article-title>Extracting medication information from clinical text</article-title>
          .
          <source>J Am Med Inform Assoc</source>
          <volume>17</volume>
          (
          <issue>5</issue>
          ) (
          <year>2010</year>
          )
          <volume>514</volume>
          {
          <fpage>518</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Grishman</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sundheim</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          : Message Understanding Conference-
          <volume>6</volume>
          :
          <article-title>a brief history</article-title>
          .
          <source>In: Proceedings of the 16th conference on Computational linguistics - Volume 1. COLING '96</source>
          ,
          <string-name>
            <surname>Stroudsburg</surname>
          </string-name>
          , PA, USA, Association for Computational Linguistics (
          <year>1996</year>
          )
          <volume>466</volume>
          {
          <fpage>471</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Hersh</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bhupatiraju</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corley</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Enhancing access to the Bibliome: the TREC Genomics Track</article-title>
          .
          <source>Stud Health Technol Inform 107(Pt 2)</source>
          (
          <year>2004</year>
          )
          <volume>773</volume>
          {
          <fpage>7</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Jones</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Re ections on TREC</article-title>
          .
          <source>In: Information Processing &amp; Management</source>
          <volume>31</volume>
          (
          <issue>3</issue>
          ). (
          <year>1995</year>
          )
          <volume>29131</volume>
          {
          <fpage>4</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Roberts</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gaizauskas</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hepple</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Davis</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Demetriou</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guo</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kola</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roberts</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Setzer</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tapuria</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wheeldin</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>The CLEF Corpus: Semantic Annotation of Clinical Text</article-title>
          .
          <source>In: AMIA Annu Symp Proc. (</source>
          <year>2007</year>
          )
          <volume>625</volume>
          {
          <fpage>629</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Roberts</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gaizauskas</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hepple</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Davis</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Demetriou</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guo</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roberts</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Setzer</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Building a semantically annotated corpus of clinical texts</article-title>
          .
          <source>J Biomed Inform</source>
          <volume>42</volume>
          (
          <issue>5</issue>
          ) (
          <year>2009</year>
          )
          <volume>950</volume>
          {
          <fpage>66</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ohta</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tateisi</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tsujii</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>GENIA corpus - a semantically annotated corpus for bio-textmining</article-title>
          .
          <source>In: ISMB (Supplement of Bioinformatics)</source>
          . (
          <year>2003</year>
          )
          <volume>180</volume>
          {
          <fpage>182</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ohta</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tsujii</surname>
          </string-name>
          , J.:
          <article-title>Corpus annotation for mining biomedical events from literature</article-title>
          .
          <source>BMC Bioinformatics</source>
          <volume>9</volume>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Marcus</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Beatrice</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marcinkiewicz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Building a large annotated corpus of english: The Penn Treebank</article-title>
          .
          <source>Computational Linguistics</source>
          <volume>19</volume>
          (
          <issue>2</issue>
          ) (
          <year>1993</year>
          )
          <volume>313</volume>
          {
          <fpage>330</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Savova</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Coden</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sominsky</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          , Johnson, R.,
          <string-name>
            <surname>Ogren</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Groen</surname>
          </string-name>
          , P.d.,
          <string-name>
            <surname>Chute</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Word sense disambiguation across two domains: Biomedical literature and clinical notes</article-title>
          .
          <source>J Biomed Inform</source>
          <volume>41</volume>
          (
          <issue>6</issue>
          ) (
          <year>December 2008</year>
          )
          <volume>1088</volume>
          {
          <fpage>1100</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Savova</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ogren</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Du</surname>
            <given-names>y</given-names>
          </string-name>
          , P.,
          <string-name>
            <surname>Buntrock</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chute</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Technical brief: Mayo Clinic NLP system for patient smoking status identi cation</article-title>
          .
          <source>J Am Med Inform Assoc</source>
          <volume>15</volume>
          (
          <year>2008</year>
          )
          <volume>25</volume>
          {
          <fpage>28</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Chapman</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dowling</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hripcsak</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>Evaluation of training with an annotation schema for manual annotation of clinical conditions from emergency department reports</article-title>
          .
          <source>Int J Med Inform</source>
          <volume>77</volume>
          (
          <issue>2</issue>
          ) (
          <year>2008</year>
          )
          <volume>107</volume>
          {
          <fpage>13</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Chapman</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bridewell</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hanbury</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cooper</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Buchanan</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>A simple algorithm for identifying negated ndings and diseases in discharge summaries</article-title>
          .
          <source>J Biomed Inform</source>
          <year>2001</year>
          (
          <year>2001</year>
          )
          <volume>34</volume>
          {
          <fpage>301</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Chapman</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haug</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Comparing expert systems for identifying chest x-ray reports that support pneumonia</article-title>
          .
          <source>In: AMIA Annu Symp Proc. (</source>
          <year>1999</year>
          )
          <volume>216</volume>
          {
          <fpage>220</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Uzuner</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Solti</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xia</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cadag</surname>
          </string-name>
          , E.:
          <article-title>Community annotation experiment for ground truth generation for the i2B2 medication challenge</article-title>
          .
          <source>J Am Med Inform Assoc</source>
          <volume>17</volume>
          (
          <issue>5</issue>
          ) (
          <year>2010</year>
          )
          <volume>519</volume>
          {
          <fpage>523</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Uzuner</surname>
            ,
            <given-names>O</given-names>
          </string-name>
          .,
          <string-name>
            <surname>South</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shen</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , DuVall, S.:
          <year>2010</year>
          i2B2/
          <article-title>VA challenge on concepts, assertions, and relations in clinical text</article-title>
          .
          <source>J Am Med Inform Assoc</source>
          <volume>18</volume>
          (
          <issue>5</issue>
          ) (
          <year>2011</year>
          )
          <volume>552</volume>
          {
          <fpage>556</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stetson</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Friedman</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>A study of abbreviations in clinical notes</article-title>
          .
          <source>In: AMIA Annu Symp Proc. (</source>
          <year>2007</year>
          )
          <volume>821</volume>
          {
          <fpage>825</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Campbell</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oliver</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , Shortli e, E.:
          <article-title>The Uni ed Medical Language System: Towards a collaborative approach for solving terminologic problems</article-title>
          .
          <source>J Am Med Inform Assoc</source>
          <volume>5</volume>
          (
          <issue>1</issue>
          ) (
          <year>1998</year>
          )
          <volume>12</volume>
          {
          <fpage>16</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Denny</surname>
            ,
            <given-names>J.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosenbloom</surname>
            ,
            <given-names>S.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miller</surname>
            ,
            <given-names>R.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Giuse</surname>
            ,
            <given-names>D.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          :
          <article-title>A comparative study of current clinical natural language processing systems on handling abbreviations in discharge summaries</article-title>
          .
          <source>AMIA Annu Symp Proc</source>
          <year>2012</year>
          (
          <year>2012</year>
          )
          <volume>997</volume>
          {
          <fpage>1003</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>Moon</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pakhomov</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Melton</surname>
          </string-name>
          , G.:
          <article-title>Automated disambiguation of acronyms and abbreviations in clinical texts: Window and training size considerations</article-title>
          .
          <source>In: AMIA Annu Symp Proc. (</source>
          <year>2012</year>
          )
          <volume>1310</volume>
          {
          <fpage>1319</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31. CHV: Consumer Health Vocabulary. http://consumerhealthvocab.org/ Accessed:
          <fpage>2013</fpage>
          -06-30.
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          32.
          <string-name>
            <surname>Saeed</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lieu</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Raber</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mark</surname>
          </string-name>
          , R.:
          <article-title>MIMIC II: a massive temporal ICU patient database to support research in intelligent patient monitoring</article-title>
          .
          <source>Comput Cardiol</source>
          <volume>29</volume>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          33. NLTK:
          <article-title>Natural Language ToolKit</article-title>
          . http://nltk.org/ Accessed:
          <fpage>2013</fpage>
          -06-30.
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          34.
          <string-name>
            <surname>Fana</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>NLP Ecosystem: Annotation Registry</article-title>
          . http://nlpecosystem.ucsd.edu/annotators Accessed:
          <fpage>2013</fpage>
          -06-30.
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          35. CITI:
          <article-title>Collaborative Institutional Training Initiative</article-title>
          . https://www.citiprogram.org/ Accessed:
          <fpage>2013</fpage>
          -06-30.
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          36. NIH:
          <article-title>National Institute of Health - ethics training module</article-title>
          . http://ethics.od.nih.gov/Training/AET.htm Accessed:
          <fpage>2013</fpage>
          -06-30.
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          37. Physionet:
          <article-title>Physionet site</article-title>
          . https:http://www.physionet.org/ Accessed:
          <fpage>2013</fpage>
          -06-30.
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          38.
          <string-name>
            <surname>ShARe: ShARe CLEF eHealth</surname>
          </string-name>
          <article-title>Website</article-title>
          . https://sites.google.com/site/shareclefehealth/data#
          <string-name>
            <surname>TOCObtaining-Datasets-Tasks-</surname>
          </string-name>
          1
          <string-name>
            <surname>-</surname>
          </string-name>
          and-2-/ Accessed:
          <fpage>2013</fpage>
          -06-30.
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          39.
          <string-name>
            <surname>South</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shen</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leng</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>TB</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>DuVall</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chapman</surname>
            ,
            <given-names>W.:</given-names>
          </string-name>
          <article-title>A prototype tool set to support machine-assisted annotation</article-title>
          .
          <source>In: Proceedings of the 2012 Workshop on Biomedical Natural Language Processing. BioNLP '12</source>
          ,
          <string-name>
            <surname>Stroudsburg</surname>
          </string-name>
          , PA, USA, Association for Computational Linguistics (
          <year>2012</year>
          )
          <volume>130</volume>
          {
          <fpage>139</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          40.
          <string-name>
            <surname>Christensen</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Evaluation Workbench</article-title>
          . http://nlpecosystem.ucsd.edu/content/documentation Accessed:
          <fpage>2013</fpage>
          -06-30.
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          41.
          <string-name>
            <surname>Hripcsak</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rothschild</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Agreement, the f-measure, and reliability in information retrieval</article-title>
          .
          <source>J Am Med Inform Assoc</source>
          <volume>12</volume>
          (
          <issue>3</issue>
          ) 296{
          <fpage>8</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          42.
          <string-name>
            <surname>Yeh</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>More accurate tests for the statistical signi cance of result di erences</article-title>
          .
          <source>In: Proceedings of the 18th Conference on Computational Linguistics (COLING)</source>
          , Saarbrucken,
          <string-name>
            <surname>Germany</surname>
          </string-name>
          (
          <year>2000</year>
          )
          <volume>947</volume>
          {
          <fpage>953</fpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>