<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>TeamUEvora at CLEF eHealth 2014 Task2a</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jo~ao Sequeira</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nuno Miranda</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Teresa Goncalves</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Paulo Quaresma</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Computer Science Department, School of Science and Technology University of Evora</institution>
          ,
          <addr-line>Evora</addr-line>
          ,
          <country country="PT">Portugal</country>
        </aff>
      </contrib-group>
      <fpage>156</fpage>
      <lpage>166</lpage>
      <abstract>
        <p>We present our rst participation in a ShARe/CLEF eHealth Lab contributing for task 2a. Task 2 is an extension of the 2013 lab task 1 and consists of information extraction from clinical texts for Disease/Disorder Template Filling; task 2a aims at predicting each attribute's normalization value. This work constitutes a preliminary approach to the problem of extracting and handling information from clinical texts. More than getting a good result, our priority was to get a rst hint on the questions and problems that are posed within this area. For that, we developed a system that combines information from cTAKES output and the training corpus. The performance was measured using accuracy. Our system ranked 7th with an accuracy of 0.802, a F1 of 0.214, a precision of 0.217 and a recall value of 0.211.</p>
      </abstract>
      <kwd-group>
        <kwd>Clinical texts</kwd>
        <kwd>Template lling</kwd>
        <kwd>Text normalization</kwd>
        <kwd>cTAKES</kwd>
        <kwd>Medical Informatics</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The ShARe/CLEF eHealth Lab 20141 [
        <xref ref-type="bibr" rid="ref1 ref2">1,2</xref>
        ] task 2 is an extension of the task 1
of the same lab from 2013 [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and consists of information extraction from clinical
texts with the goal of disease/disorder template lling. For each disease/disorder
present in each clinical report there is a template with ten di erent attributes
and participants have to predict the value for each attribute. There are two
subtasks: 2a) assign normalization values to the ten attributes; 2b) assign cue
values to the nine attributes with cues.
      </p>
      <p>This is our rst participation in a ShARe/CLEF eHealth Lab and we
contributed to subtask 2a, building a system that uses previous implemented
technologies. Being this the rst time we work with medical information, our main
priority is to understand the problems associated with the extraction of
information in the area. In this paper we present the system architecture and the
decisions made; we also present and analyse the experimental results on the
training and test corpora.</p>
      <p>The paper has the following structure: Section 2 introduces the task, the
training and test corpora in detail and Section 3 presents the implemented
system. The results are discussed in Section 4 and conclusions and a glimpse of
future work are presented in Section 5.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Task</title>
      <p>As said in Section 1, task 2 is an extension of the 2013 task 1 lab aiming at
lling templates with attributes values and cues.</p>
      <p>Files with empty templates for each disease/disorder (mentioned in the
corresponding clinical text) were provided to the participants. Each template
indicates the Uni ed Medical Language System Concept Unique Identi er (CUI),
mention boundaries and the ten attributes needed to be lled. Each attribute
has two slot types: the normalized value and the lexical cue from the sentence
where the normalized value occurred. Task 2a evaluates the systems' ability to
predict the normalized value for each attribute and task 2b the ability to nd
the right cue slot value for each attribute.</p>
      <p>Since we participated only on task 2a (that was mandatory), our templates
have default values in all the cue slots. Table 1 presents template information: a
header with the le name, the cue slot of the disease/disorder and its CUI, the
nine modi ers associated with the disease/disorder with normalized values (task
2a) and cue slots (task 2b) plus the DocTime modi er that only has a normalized
value.
2.1</p>
      <sec id="sec-2-1">
        <title>Description of the training and test corpora</title>
        <p>The train and test corpora provided are composed of clinical texts from four
di erent types: discharge summary, ECG report, ECHO report and radiology
report. Their distribution in each corpus is presented in Table 2.</p>
        <p>Analysing both corpora we can observe some di erences. In the training
corpus the Discharge summary type has 45.82% of documents while the remaining
classes have an equal number, 18.06%; in the test corpus there are only Discharge
summary documents.
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>System Architecture</title>
      <p>This section presents the implementation of our system and the approaches taken
to tackle the modi ers.
3.1</p>
      <p>
        cTAKES
As said before, our system uses previous implemented technologies for clinical
texts analysis and information extraction (this method was also used in task
1 [
        <xref ref-type="bibr" rid="ref10 ref11 ref12 ref13 ref14 ref6 ref7 ref8 ref9">6,7,8,9,10,11,12,13,14</xref>
        ] of 2013 ShARe/CLEF eHealth Lab).
      </p>
      <p>Header
File name</p>
      <p>Cue slot
Concept Unique Identi er (CUI)</p>
      <p>Modi ers
2a) Normalized values 2b) Cue slot</p>
      <p>yes/no* if value is yes
patient*, family member, other, null, if di erent
donor family member, donor other of patient</p>
      <p>yes/no* if value is yes
unmarked*, changed, increased, decreased, if di erent
improved, worsened, resolved of unmarked
unmarked*, severe, if di erent
slight, moderate of unmarked
true/false* if value is true
true/false* if value is true
NULL*, CUI, if di erent</p>
      <p>CUI-less of NULL
unknown*, before, after, no
overlap, before-overlap slot
none*, date, time, if di erent</p>
      <p>duration, set of none</p>
      <p>
        We used the output of the clinical Text Analysis and Knowledge Extraction
System (cTAKES) [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] (version 3.1.1). cTAKES2 is a open source linguistic tool
kit from the Apache Software Foundation. Some operations done by cTAKES
include:
Negation and Uncertainty Indicators, Subject and Conditional Classes
and Body Location. For the modi ers NI, SC, UI, CO and BL we extracted the
information from the cTAKES output. Among the attributes related with the
diseases/disorders identi ed by cTAKES we found information that could be
directly used for some of the modi ers: we used the polarity attribute from
cTAKES to identify if the diseases/disorders were negated and assigning a value
to NI; for the SC, UI and CO modi ers, cTAKES have attributes with the same
name and we only needed to convert that information into the normalized values
of the task modi ers.
      </p>
      <p>For the BL modi er we used a set of rules to know if there were body
locations in the same sentence of the identi ed disease/disorders and extracted the
respective CUI. We tried to extract the CUI of the most speci c body location
possible, so we searched the expression with a bigger number of words, using the
premise that more information means more speci city.</p>
      <sec id="sec-3-1">
        <title>Course Class and Severity Class. For the CC and SV modi ers we used a</title>
        <p>mapping approach. From the 299 clinical texts that compose the training corpus,
we extracted expressions (without repetition) related to each modi er value.</p>
        <p>When using expressions from a mapping approach, there is the risk of
identifying equal expressions from the text but not in the correct context. To determine
if the modi ers CC and SV had this problem we checked the expressions in each
mappings le and concluded that the expressions were not too common and the
probability of identifying wrong expressions was acceptable for our objectives.
Generic Class. The GC modi er had a particular characteristic { there was no
example of it in the training corpus; assuming that the test corpus would follow
this, few to none appearances of this modi er expressions would appear. Based
on this assumption we used the default value (false) in every template.
2 http://ctakes.apache.org/
DocTime. The DT modi er expresses the temporal relation between the
disease/disorder and the time when the clinical text was written. It can have the
following values:
{ Before-overlaps: disease/disorder identi ed in the past and still present;
{ Before: disease/disorder identi ed and treated in the past;
{ Overlap: disease/disorder present but there is no information about when
it was diagnosed or when it will pass;
{ After: one action or event that it is still to come;
{ Unknown: no temporal relation information.</p>
        <p>For this modi er we used a purely statistic approach, meaning that, for each
template we selected the most common value presented in the training corpus {
Overlap.</p>
        <p>Table 3 presents occurrence percentage for training corpus for each possible
DT value; it can be noticed that more than half of the occurrences (56.35%) has
the Overlap value, so this one was chosen to ll all the templates. The Before
value had also an expressive number, but Overlap more than doubles it.
Temporal Expressions. To identify dates and hours we used regular
expressions. At rst we thought of using a mapping approach too, but dates and hours
are very speci c and if an expression appear in the same format but with one
day apart, that expression wouldn't be identi ed.</p>
        <p>Based in the training corpus, we created four regular expressions aiming to
identify DATE and two regular expressions to identify Time:
{ DATE</p>
        <p>Day/Month/Year (dd/mm/yyyy);
Day-Month-Year (dd-mm-yyyy);
Year-Month-Day (yyyy-mm-dd);</p>
        <p>Month-Year (mm-yy).
{ TIME
24 hours time (hh:mm);
12 hours time (hh:mm am/pm)</p>
        <p>We didn't consider the identi cation of expressions associated with the
remaining values of the modi er { duration and set.
3.3</p>
      </sec>
      <sec id="sec-3-2">
        <title>Implementation</title>
        <p>Our system was implemented using the Java programming language. Figure 1
presents the system's architecture { it uses mapping les, regular expressions,
decisions based on the training corpus and cTAKES.</p>
        <p>XML les were generated from cTAKES, and from them we extracted
information using a parser and applied the procedures described in the last
subsection. With the obtained information, the system updated the modi ers' values
and printed the templates with the nal result.</p>
        <p>Next we explain the steps necessary to get the lled templates:
1. run cTAKES with the clinical texts as input;
2. load information from templates, namely the header (because the rest are
the default values), and the map les built for CC and SV;
3. process the XML from cTAKES using a set of rules to extract information;
4. use the information previously gathered to substitute the default values from
the templates.</p>
        <p>Step 1. The rst step can be also called a pre-processing one { the generation
of the XML les using cTAKES. It generates a XML le for each clinical text.
cTAKES has a large set of speci c analysis engines and a set of aggregate ones
that combine the speci c ones. These aggregate engines describe how particular
annotators can be combined using a set of rules that describe how each annotator
uses the analysis of the previous one.</p>
        <p>Several aggregate engines were tested and the one that o ered the best results
(and was used for the participation run) was AggregatePlaintextUMLSProcessor.
Step 2. On startup, the system loads the mapping les of CC and SV modi ers
obtained from the training corpus. It also loads the templates information into
a data structure that the system can use during all process.</p>
        <p>Step 3. After steps 1 and 2, the system processes the XML les. We used
xPath expressions to extract the information considered necessary to task 2a;
this information was stored in data structures suited for being subsequently
processed. The information is extracted using two approaches:
{ the 'strict' one, where the system searches diseases/disorders with a perfect
match the information gathered from cTAKES;
{ the 'relaxed' one, that is used in case the 'strict' fails. This one, although less
accurate, veri es if the boundaries of the disease/disorder from the template
header are inside the ones of the chunk identi ed by cTAKES.</p>
        <p>The CUI of the body locations associated to the disease/disorder is obtained
using a set of rules that joins information from the di erent data structures
mantained. In order to reach the most speci c CUI, the system chooses the
longest body location term from the cTAKES output.</p>
        <p>Step 4. The nal step gathers all information from the previous steps, relying
mainly in the coordinates of the diseases/disorders in text.</p>
        <p>To extract the modi ers information, the system searches the sentences where
the diseases/disorders were identi ed, looks for the cTAKES gathered
information, replaces the info in the respective template, searches for terms in the
mapping, applies the regular expressions and writes the found info in the template.
Finally it writes the info for the DT and GC modi ers (that is equal for all
templates).
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Results</title>
      <p>0.071 and TE an improvement of 0.142. For DT modi er, the training presents a
better result with an improvement of 0.57 over the test corpus.</p>
      <p>Comparing the test corpus results with the best accuracy reported in task
2a we notice that in some modi ers like SC, UI, CO and TE the di erence is lower
than 0.2 and the values for class GC are equal; for modi ers BL, DT and CC there
is a bigger discrepancy between the results. Nevertheless, in overall our system
stood behind 0.082 when compared with the overall value calculated.</p>
      <p>Table 5 presents the F1, precision and recall values for both the train and
test corpora. There we can see that the values are not so di erent between the
train and test corpora among most of the modi ers. Modi ers like SC, UI, CO, BL
and TE have better results in the test corpus; on the other side NI, CC, SV and
DT modi ers have better results in the training corpus.</p>
      <p>The DT modi er obtained widely di erent values with a F1 of 0.592 in the
train and a corresponding value of 0.024 in the test corpus. This can be explained
because the value of this modi er is always the same for every template of
the output; this decision was based on the modi er statistics from the training
corpus.</p>
      <p>We ranked seventh among all the participants of task 2a, as showed in Table
6. The best system had an overall accuracy of 0.868 and our system obtained an
overall accuracy of 0.802. This value is lower than the average accuracy value of
all participants. Our system also obtained values below the average in the F1,
precision and recall.</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusions and Future work</title>
      <p>This paper presents the design and the implementation of our system,
developed for participating in the task 2a of 2014 ShARe/CLEF eHealth Lab. The
task's main goal was to obtain normalized attributes values for disease/disorder
template lling.
5.1</p>
      <sec id="sec-5-1">
        <title>Conclusions</title>
        <p>Our participation's main goal was to understand the problems associated with
the design and implementation of a system to extract information from medical
data. The system gathers knowledge from already implemented technology in
the clinical area, namely cTAKES; it also uses resources based on the training
corpus, regular expressions and decisions based on modi ers statistics.</p>
        <p>Between 14 participants, it ranked 7th, with an accuracy value of 0.802.
Taking into account our goal, we consider this a good result; nevertheless there
is much space for improvement.
5.2</p>
      </sec>
      <sec id="sec-5-2">
        <title>Future work</title>
        <p>
          cTakes is one of the resources of our system and we intend to add more sources
of knowledge of the medical area so we can improve our system. One hypothesis
is MetaMap[
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], widely used in task 1 of 2013 Lab. Last year, some participants
used only cTAKES [
          <xref ref-type="bibr" rid="ref6 ref8">6,8</xref>
          ], others used only MetaMap [
          <xref ref-type="bibr" rid="ref10 ref11 ref12 ref7 ref9">7,9,10,11,12</xref>
          ] and others
used a joint approach [
          <xref ref-type="bibr" rid="ref13 ref14">13,14</xref>
          ].
        </p>
        <p>On the other hand, we intend to complement or substitute the approach
taken to some modi ers:
{ for Course and Severity we want to try a machine learning approach;
{ for temporal expressions, we want to improve the system by also identifying
duration and set expressions. For that we intend to use technologies in the
area of clinical time identi cation;
{ for DocTime we intend to incorporate knowledge in order to give di erent
values to di erent examples (instead of using the same value for all of them):
{ for Generic modi er, we aim to develop a more automatic way to detect
this class. Nevertheless, to do that we need some examples of this modi er
in the training corpus.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Kelly</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goeuriot</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leroy</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suominen</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schreck</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mowery</surname>
            <given-names>D. L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Velupillai</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chapman</surname>
            ,
            <given-names>W. W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zuccon</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Palotti</surname>
          </string-name>
          , J.:
          <source>Overview of the ShARe/CLEF eHealth Evaluation Lab 2014</source>
          . Springer-Verlag.
          <article-title>(</article-title>
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Elhadad</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chapman</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>O'Gorman</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Palmer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Savova</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>The ShARe Schema for the Syntactic and Semantic Annotation of Clinical Texts</article-title>
          . (
          <year>2014</year>
          ).
          <article-title>(Under Review)</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Suominen</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , Salantera,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Velupillai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Chapman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. W.</given-names>
            ,
            <surname>Savova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Elhadad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            ,
            <surname>Pradhan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>South</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. R.</given-names>
            ,
            <surname>Mowery</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. L.</given-names>
            ,
            <surname>Jones</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. J.</given-names>
            ,
            <surname>Leveling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Kelly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Goeuriot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Martinez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Zuccon</surname>
          </string-name>
          ,
          <string-name>
            <surname>G.</surname>
          </string-name>
          :
          <article-title>Overview of the ShARe/CLEF eHealth Evaluation Lab 2013</article-title>
          .
          <article-title>CLEF 2013, Valencia</article-title>
          , Spain: Springer Berlin Heidelberg. In: Proceedings of ShARe/CLEF eHealth Evaluation Labs (
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Savova</surname>
            ,
            <given-names>G.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Masanz</surname>
            ,
            <given-names>J.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ogren</surname>
            ,
            <given-names>P.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zheng</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sohn</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kipper-Schuler</surname>
            ,
            <given-names>K.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chute</surname>
            ,
            <given-names>C.G.</given-names>
          </string-name>
          :
          <article-title>Mayo clinical text analysis and knowledge extraction system (ctakes): architecture, component evaluation and applications</article-title>
          .
          <source>In: Journal of the American Medical Informatics Association</source>
          <volume>17</volume>
          (
          <year>2010</year>
          )
          <fpage>507</fpage>
          -
          <lpage>513</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Aronson</surname>
            ,
            <given-names>A.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lang</surname>
            ,
            <given-names>F.M.:</given-names>
          </string-name>
          <article-title>An overview of MetaMap: historical perspective and recent advances</article-title>
          .
          <source>JAMIA</source>
          <volume>17</volume>
          (
          <issue>3</issue>
          ) (
          <year>2010</year>
          )
          <fpage>229</fpage>
          -
          <lpage>236</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Cogley</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stokes</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carthy</surname>
          </string-name>
          , J.:
          <article-title>Medical Disorder Recognition with Structural Support Vector Machines</article-title>
          .
          <source>In: Proceedings of ShARe/CLEF eHealth Evaluation Labs</source>
          (
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Leaman</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khare</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lu</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          : NCBI at 2013 ShARe/CLEF eHealth Shared Task:
          <article-title>Disorder Normalization in Clinical Notes with DNorm</article-title>
          .
          <source>In: Proceedings of ShARe/CLEF eHealth Evaluation Labs</source>
          (
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Gung</surname>
          </string-name>
          , J.:
          <article-title>Using Relations for Identication and Normalization of Disorders: Team CLEAR in the ShARe/CLEF 2013 eHealth Evaluation Lab</article-title>
          .
          <source>In: Proceedings of ShARe/CLEF eHealth Evaluation Labs</source>
          (
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Hervas</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mart nez</surname>
          </string-name>
          , V.,
          <string-name>
            <surname>Sanchez</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>D</given-names>
            <surname>az</surname>
          </string-name>
          , A.:
          <article-title>UCM at CLEF eHealth 2013 Shared Task1</article-title>
          .
          <source>In: Proceedings of ShARe/CLEF eHealth Evaluation Labs</source>
          (
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Osborne</surname>
            ,
            <given-names>J. D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gyawali</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Solorio</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Evaluation of YTEX and MetaMap for clinical concept recognition</article-title>
          .
          <source>In: Proceedings of ShARe/CLEF eHealth Evaluation Labs</source>
          (
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Akella</surname>
          </string-name>
          , R.:
          <article-title>UCSC's System for CLEF eHealth 2013 Task 1</article-title>
          . In: Proceedings of ShARe/CLEF eHealth Evaluation Labs (
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Zuccon</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Holloway</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koopman</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nguyen</surname>
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Identify Disorders in Health Records using Conditional Random Fields and Metamap; AEHRC at ShARe/CLEF 2013 eHealth Evaluation Lab Task 1</article-title>
          . In: Proceedings of ShARe/CLEF eHealth Evaluation Labs (
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Bodnari</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Deleger</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lavergne</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Neveol</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zweigenbaum</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>A Supervised Named-Entity Extraction System for Medical Text</article-title>
          .
          <source>In: Proceedings of ShARe/CLEF eHealth Evaluation Labs</source>
          (
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Xia</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhong</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tan</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Na</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Combining MetaMap and cTAKES in Disorder Recognition: THCIB at CLEF eHealth Lab 2013 Task 1</article-title>
          . In: Proceedings of ShARe/CLEF eHealth Evaluation Labs (
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>