<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The Biomedical Abbreviation Recognition and Resolution (BARR) track: benchmarking, evaluation and importance of abbreviation recognition systems applied to Spanish biomedical abstracts</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ander Intxaurrondo</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Martin Pe´rez-Pe´rez</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gael Pe´rez-Rodr´ıguez</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jose Antonio Lo´ pez-Mart´ın</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jesus Santamar´ıa</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Santiago de la Pen˜ a</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marta Villegas</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Saber Ahmad Akhondi</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alfonso Valencia</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Analia Lourenc¸o</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Martin Krallinger</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Barcelona Supercomputing Center</institution>
          ,
          <addr-line>BSC</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Centro Nacional de Investigaciones Oncolo ́gicas</institution>
          ,
          <addr-line>CNIO</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>ESEI - Department of Computer Science, University of Vigo</institution>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Elsevier Content &amp; Innovation</institution>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Hospital 12 de Octubre - Madrid</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2017</year>
      </pub-date>
      <fpage>230</fpage>
      <lpage>246</lpage>
      <abstract>
        <p>Healthcare professionals are generating a substantial volume of clinical data in narrative form. As healthcare providers are confronted with serious time constraints, they frequently use telegraphic phrases, domain-specific abbreviations and shorthand notes. Efficient clinical text processing tools need to cope with the recognition and resolution of abbreviations, a task that has been extensively studied for English documents. Despite the outstanding number of clinical documents written worldwide in Spanish, only a marginal amount of studies has been published on this subject. In clinical texts, as opposed to the medical literature, abbreviations are generally used without their definitions or expanded forms. The aim of the first Biomedical Abbreviation Recognition and Resolution (BARR) track, posed at the IberEval 2017 evaluation campaign, was to assess and promote the development of systems for generating a sense inventory of medical abbreviations. The BARR track required the detection of mentions of abbreviations or short forms and their corresponding long forms or definitions from Spanish medical abstracts. For this track, the organizers provided the BARR medical document collection, the BARR corpus of manually annotated abstracts labelled by domain experts and the BARR-Markyt evaluation platform. A total of 7 teams submitted 25 runs for the two BARR subtasks: (a) the identification of mentions of abbreviations and their definitions and (b) the correct detection of short formlong form pairs. Here we describe the BARR track setting, the obtained results and the methodologies used by participating systems. The BARR task summary, corpus, resources and evaluation tool for testing systems beyond this campaign are available at: http://temu.inab.org.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        There is an increasing adoption of electronic health records (EHRs) in the European
Union, promoted both by national plans as well as European initiatives like the 2
billion Euro public-private partnership Innovative Medicines Initiative [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. Electronic
health records encompass a substantial amount of unstructured clinical texts, a key
information source for clinical decision support (CDS), patient cohort stratification, and
disease/adverse drug event surveillance or population health management [
        <xref ref-type="bibr" rid="ref26 ref8">8, 26</xref>
        ]. For
instance, according to estimates provided by the Galician Healthcare Service,
approximately 80 percent of the content of EHRs generated during the last years by this Spanish
region is available as unstructured data, i.e. clinical texts. When considering the amount
of unstructured clinical information generated only by the Galician health system, about
200.000 clinical notes are produced on average just on a single day. Enabling a better
exploitation of the information contained in clinical notes would empower mechanisms
for improving patient assistance.
      </p>
      <p>
        The expansion of EHRs within healthcare systems promoted the development of
clinical natural language processing techniques. Such systems have the aim to assist in
the process of transforming clinical text written by healthcare professionals into
structured clinical data representations. Clinical text mining and natural language processing
(NLP) systems have been applied to a considerable number of different tasks [
        <xref ref-type="bibr" rid="ref26 ref46 ref8">26, 8, 46</xref>
        ].
Among the most prevalent clinical text processing tasks are (a) automated clinical
coding [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], (b) automatic de-identification/anonymization of clinical notes[
        <xref ref-type="bibr" rid="ref45">45</xref>
        ], (c)
recognition of clinical entities or concepts in running text [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ], (d) negation detection [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ],
(e) experiencer detection (subject identification) [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], (f) temporal status (temporality)
classification [
        <xref ref-type="bibr" rid="ref42">42</xref>
        ], (g) coreference resolution/anaphoric relations extraction[
        <xref ref-type="bibr" rid="ref37 ref51">51, 37</xref>
        ] or
(h) detecting adverse drug events [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Although clinical text is a very abundant type of
health data, it is also the most arduous to explore computationally. There are several
inherent particularities underlying clinical texts that cause important complications to
automatic text processing attempts not encountered in well-written prose or scientific
literature. Clinical notes are generally far less structured and frequently do not follow
normal grammar, containing ungrammatical expressions, lack of punctuation marks and
accentuation as well as presence of conjoined words. Due to time constraints, clinical
writings frequently contain spelling and typing errors together with author- and
domainspecific idiosyncratic, often cryptic expressions.
      </p>
      <p>
        Despite the heavy use of formal and domain specific terminologies in clinical notes,
these are generally used in a rather informal and unsorted way. Moreover, in Spanish
clinical texts, some studies also point out issues related to the incorrect use of Spanish
medical expressions due to false friends and wrong translations from English medical
terms into Spanish [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Benavent and Iscla describe commonly encountered cases of
incorrect use of Spanish medical language [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>Processing of clinical texts heavily relies on the performance of pre-processing
modules such as tokenization, spell checking and sentence boundary recognition.
Another key component for almost any clinical text processing task is the correct
identification and resolution of abbreviations. Acronyms can be seen even in Roman and Greek
inscriptions, and were commonly used during the Roman Empire and the middle Ages.</p>
      <p>
        Abbreviations, acronyms and symbols constitute a widely used strategy to produce
a more reduced and compact representation of written medical expressions [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
Therefore, clinical narratives show a heavy use of shorthand lexical units, abbreviations and
acronyms, including cases of local or even misspelled abbreviations. The correct
interpretation of abbreviations is a challenge even for health care practitioners themselves
and can potentially result in medical errors due to wrong interpretations of highly
ambiguous cases [
        <xref ref-type="bibr" rid="ref12 ref43">43, 12</xref>
        ]. Some estimates on English biomedical texts showed that
acronyms are overloaded 33% of the time, and often correspond to highly ambiguous
cases, even given contextual information [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ].
      </p>
      <p>
        Abbreviations are heavily used in Spanish EHRs, as highlighted by several
studies both in Spain as well as in Latin America. For instance, Plasencia and Moliner
found almost 22 abbreviations per record when examining nursing notes, discharge and
emergency discharge reports [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ]. They also pointed out that many of the examined
abbreviations did have more than one potential meaning or interpretation and that in some
cases abbreviations where wrongly used. In another study carried out by Benavent and
colleagues, on average a total of 14,7 abbreviations were detected per document when
dealing with emergency notes, discharge reports and clinical reports from specialized
healthcare services [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ]. Abbreviations are also very common in EHRs written in
English, as described by [
        <xref ref-type="bibr" rid="ref48">48</xref>
        ], which detected on average 35 abbreviations in discharge
summaries from the Vanderbilt Medical Center.
      </p>
      <p>
        Abbreviations are being widely used in scientific texts, not only in case of English
documents but also in Spanish medical literature. Especially publications belonging to
biomedical and clinical disciplines are particularly overloaded with abbreviations and
acronyms [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ].
      </p>
      <p>
        Clinical texts generally do not contain explicit mentions of contractions or
abbreviations (short forms) together with their corresponding full versions or descriptive forms
(long forms), also known as abbreviation-definition pairs. In order to be able to resolve
abbreviations and to build sense inventories the availability of resources covering
abbreviations and their definitions is critical. Although many lexical resources do exist
for English biomedical abbreviations, only few manually constructed lexical resources
have been generated for Spanish that might serve as resources to interpret abbreviations.
These are designed mainly for human consumption and are generally not distributed in
machine-readable formats. Navarro presented a manually constructed resource of
abbreviations and acronyms commonly used in Spanish medical texts that serves as a
valuable aid for interpreting abbreviations [
        <xref ref-type="bibr" rid="ref28 ref49">28, 49</xref>
        ]. Also lexical resources such as the
Unified Medical Language System (UMLS) have been explored to extract
automatically a considerable number of abbreviation definitions, i.e. abbreviation - full form
pairs [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ].
      </p>
      <p>
        Another important resource for abbreviations is the medical literature. Often key
elements of clinical studies can in fact also be found in medical abstracts [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Many
scientific publications do require that authors provide a definition of abbreviations and
acronyms the first time they are used in the text. According to some estimates, using
English scholarly publications, 25 percent of abbreviations were explicitly defined in
biomedical articles [
        <xref ref-type="bibr" rid="ref50">50</xref>
        ]. When examining different manually annotated abbreviation
corpora for English biomedical abstracts, on average between 0.80 to 1.43 abbreviation
definitions could be found per abstract [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ].
      </p>
      <p>
        The recognition and disambiguation of biomedical abbreviations is an intensively
studied research topic in English [
        <xref ref-type="bibr" rid="ref44">44</xref>
        ]. A range of different methods have been
applied to address this problem, including alignment-based approaches described by [
        <xref ref-type="bibr" rid="ref39">39</xref>
        ],
machine learning techniques tested by [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], or rule-based approaches explored by [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
In order to be able to evaluate and develop abbreviation recognition tools for English
biomedical texts, several manually annotated corpora have been constructed, i.e. the
MEDSTRACT, Ab3P, BOADI and Schwartz and Hearst corpora (see [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] and [
        <xref ref-type="bibr" rid="ref40">40</xref>
        ] for
more details).
      </p>
      <p>
        Unfortunately, far less research has been performed on Spanish medical
abbreviation recognition, despite the volume of EHR written in that language and the existence
of a considerable number of Spanish medical publications. Rubio-Lopez et al. can be
counted among the few published attempts that handled acronym disambiguation in
Spanish EHRs [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ]. This is partially due to the lack of annotated resources and
available corpora covering abbreviation annotations.
      </p>
      <p>
        A mechanism to promote the development of biomedical/medical text mining and
natural language processing systems, as well as to determine the state of the art
techniques and performance to address a particular task is through shares tasks and
community challenges [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. A considerable number of challenges have been organized so far
for English biomedical texts, serving as an important driving force for generating the
necessary resources for the implementation of medical text processing systems [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
      </p>
      <p>We have thus organized a track specifically devoted to the automatic processing
of Spanish medical literature, focusing on an important building block task, namely
the automatic detection of abbreviations. The Biomedical Abbreviation Recognition
and Resolution (BARR) track had the aim to promote the development and evaluation
of biomedical abbreviation identification systems by providing Gold Standard training,
development and test corpora manually annotated by domain experts with
abbreviationdefinition pairs within abstracts of biomedical documents written in Spanish.</p>
      <p>The proposed Biomedical Abbreviation Recognition and Resolution (BARR) track
has the aim to promote the development and evaluation of biomedical abbreviation
identification systems by providing Gold Standard training, development and test
corpora manually annotated by domain experts with abbreviation-definition pairs within
abstracts of biomedical documents written in Spanish.</p>
      <p>This paper describes the data used to support the BARR track, as well as the results
obtained for this track. Section 2 describes BARR track setting and posed tasks. Section
3 provides a sort summary of the corpus and resources provided for the BARR track. In
section 4 we give a brief explanation of the Markyt benchmark platform and the used
evaluation measures. In section 5 we focus on the methods used by the participants
while section 6 provides an overview of the obtained results. Finally, section 7 offers
concluding remarks.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Task Description</title>
      <p>
        The BARR track was one of the five tasks of the IberEval 2017 evaluation campaign,
which had the aim to promote the development of language technologies for Iberian
languages. The purpose of the BARR track was to explore settings that are relevant for
processing both medical texts as well as clinical research narratives. The underlying
assumption here was that techniques tailored to medical literature could be potentially
adapted for processing clinical texts [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ]. Figure 1 provides a general overview of the
BARR task setting.
      </p>
      <p>In essence, the BARR track evaluated systems that are able to detect mentions
of abbreviation-definition pairs, i.e. short form-long form mentions that co-occurred
within sentences found in abstracts of Spanish medical articles. This implied that,
instead of requesting the detection and resolution of all the abbreviations found in an
abstract, in this first edition of BARR, the focus was only on the discovery of
abbreviation that were explicitly defined through their long forms in the same sentence. In line
with some of the previously proposed resources we refer to an abbreviation as a Short
Form (SF) that is a shorter term that denotes a longer word or phrase. On the other hand,
the definition (the Long Form, LF) refers to the corresponding definition found in the
same sentence as the SF.</p>
      <p>
        The BARR task was divided into two separate subtasks, which were carried out on
the same datasets [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. The first subtask focused on the detection of mentions, in terms
of their corresponding character offsets, of both short forms as well as long forms.
This implied that participating teams had to detect correctly the start and end indices
corresponding to all the short forms and long forms mentioned in titles and abstracts of
Spanish medical articles. The second, and main subtask, focused on the detection of the
actual pairs of short-form/long-from mentions in running text.
      </p>
      <p>
        Participating systems were provided with a training set to construct their predictor
or system during the training phase [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. All abbreviations used for the training and
test set collections were generated through an exhaustive manual annotation by domain
experts, following well-defined annotation guidelines. At a later stage, a blinded test
set was released for which they were asked to submit predictions that were evaluated
against manual annotations. For evaluation purposes we only examined exact matches
of automatically produced annotations against manual ones. For the BARR track we
used a particular setting related to the test set release, similar to the evaluation
scenario used for the CHEMDNER for chemical named entity recognition systems [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ].
      </p>
      <p>
        The documents released during the test phase included, in addition to the Gold Standard
evaluation test set used to assess the performance of participating systems, an additional
larger collection of documents to explore robustness and scalability of the systems and
to make sure that any manual revision or correction of results prior to submission would
be unfeasible. Each participating BARR teams was allowed to submit for each of the
sub-tasks a total of up to five predictions (runs). In order to coordinate the
participation and evaluation of teams of the BARR track, the BARR task organizers offered,
in addition to a general task website, an evaluation and benchmark platform adapted
to this task, the BARR-Markyt platform [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ]. This platform enables user registration
and notification, as well as management, evaluation, visualization and benchmarking of
systems and their annotations. Systems had to upload their predictions to the
BARRMarkyt platform in a predefined format to be considered for evaluation.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Data Sets and Resources</title>
      <p>A considerable barrier for the development of Spanish medical text processing systems
is the lack of a unified repository or publication aggregator of all medical and
biomedical literature published in Spanish. Note that the centralized citation repository PubMed
does provide article abstracts only in English for a large number of articles written
originally in Spanish.</p>
      <p>
        The underlying scenario for Spanish medical literature is rather fragmented, with
multiple (partially overlapping) resources offering abstracts and/or publications of
medical literature written in Spanish [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. In order to provide a large document collection of
Spanish medical abstracts for the BARR track, going beyond the actual training and test
documents, we constructed the BARR document collection by integrating abstracts and
titles from multiple sources including a specially constructed set of records provided by
the publisher Elsevier specifically for this track. The BARR document or background
collection contained a total of 155,538 records, out of which Elsevier provided 41,760
documents, while the rest corresponded to publications combined from multiple
different sources. To distinguish these two collections they will be referred to as BARR
background 1 (background set without the Elsevier collection) and background 2 (entire
background set including the Elsevier collection).
      </p>
      <p>A subgroup from the BARR document collection was used to construct a manually
labelled training and test set. The training and the test set were random subsamples
from the same document set to avoid selection bias during the evaluation step. The
entire BARR training set contained 1,050 abstracts, while the BARR Gold Standard
test set had a total of 600 abstracts.</p>
      <p>Table1 provides a statistical overview of the BARR datasets, covering basic corpus
statistics of the abstracts and titles of the training set, background sets, and the test set;
we also included information about Elsevier BARR documents alone.</p>
      <p>The BARR document corpus was released in form of plain-text, UTF8-encoded
abstracts in a simple tab-separated format with columns corresponding to the document
identifier, an ISO 639-1 language two-letter code corresponding to the language of the
abstract body, title of the record and abstract of the record.</p>
      <p>
        The manual labelling of abbreviation mentions of the BARR corpus was done using
a customized version of AnnotateIt. During a second follow up annotation step, a
customized version of the Markyt annotation system was used to manually revised mention
annotations and to annotate the relations between short forms and long forms (as well
as short forms and nested mentions) co-mentioned in sentences [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ].
      </p>
      <p>To provide a richer annotated corpus, not limited only to SF-LF pairs, the BARR
corpus provided annotations for a total of eight different abbreviation-related mention
types, which are summarized in figure 2 (A) together with an example abstract (B). All
annotations were annotated by biomedical experts and supervised by a practising
oncologist with an additional degree in bioinformatics and basic knowledge in text mining.</p>
      <p>Table 2 shows the frequency of entity types and relation types in the training set and
the background set organized by titles and abstracts.</p>
      <p>The average frequency of abbreviations found per abstracts depended heavily on
the considered mention type. For instance short forms occurred in the training set 1.41
times per abstract, while multiple mentions occurred 2.84 times and global
abbreviations (those missing an explicit long form in the abstract) occurred on average 3.37
times.</p>
      <p>A more exhaustive examination of the BARR document collection showed that a
small set of abstracts did correspond to records that were in fact not written in Spanish
but in another language, mostly English. Therefore all the records in the training and test
set were manually classified into its corresponding language, being only the abstracts
written in Spanish considered for evaluation purposes, while for the background sets,
an automatic language detection algorithm was used to assign language codes.</p>
      <p>Finally, the task organizers provided a list of additional resources including tutorials,
results of available abbreviation extraction systems adapted to Spanish and a collection
of software, datasets, and lexical resources relevant to the BARR track.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Evaluation Measures</title>
      <p>
        The Markyt web-based benchmarking platform supported the evaluation of the
predictions of the participating teams [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ]. The primary evaluation metric used for the BARR
track consisted in micro-average F-measure.
      </p>
      <p>
        Markyt provided micro/macro-average standard performance statistics, such as
recall, precision and F-score, and enabled the examination of annotation mismatches (see
[
        <xref ref-type="bibr" rid="ref41">41</xref>
        ] and [
        <xref ref-type="bibr" rid="ref33">33</xref>
        ]).
      </p>
      <p>Correspondingly, recall (Eq. 1) is the percentage of correctly labelled positive
results over all positive cases, being a measure of the ability of a system to identify
positive cases.</p>
      <p>T P
recall = (1)</p>
      <p>T P + F N</p>
      <p>Precision (Eq. 2) represents the percentage of correctly labelled positive results over
all positive labelled results, i.e. it is a measure of the reproducibility of a classifier of
the positive results.</p>
      <p>T P
precision = (2)</p>
      <p>T P + F P</p>
      <p>Lastly, F-score (or balanced F-measure), the primary evaluation metric, stands for
the harmonic mean between precision and recall (Eq. 3).</p>
      <p>F</p>
      <p>Score =
2 (precision recall)
precision + recall
(3)</p>
      <p>Micro-average statistics were calculated globally by counting the total true
positives, false negatives and false positives. Conversely, macro-average statistics were
calculated by counting the true positives, false negatives and false positives on a
perdocument basis and then, averaged across documents.</p>
      <p>During the test phase, teams were requested to generate predictions for a blinded
collection of documents, and they had to upload their submission to the BARR-Markyt
system within a short period of time. Teams could submit up to five prediction files
(runs). Additionally, three main result types were examined: false negative (FN) results
corresponding to incorrect negative predictions (i.e. cases that were part of the gold
standard, but missed by the automatic system), false positive (FP) results being cases
of incorrect positive predictions (i.e. wrong results predicted by the automatic system
that had no corresponding annotation in the gold standard) and true positive (TP) results
consisting of correct positive predictions (i.e. correct predictions matching exactly with
the gold standard annotations). The micro-averaged recall, precision and F-score
statistics were used for final prediction scoring, and F-score was selected as main evaluation
metric. Figure 3 illustrates schematically the submission process using Markyt.</p>
    </sec>
    <sec id="sec-5">
      <title>Overview of the Submitted Approaches</title>
      <p>
        A total of 17 teams registered for the BARR track through the BARR-Markyt team
registration page, whereof 7 teams returned correctly submission before the team results
submission due. The evaluated BARR teams submitted a total of 25 runs, 13 for the
subtask of recognizing abbreviation mentions and 12 for the detection of mentions of
short form-long form pairs. Below we summarize briefly the most relevant aspects of
the used methodology by participating teams, while additional details can be found in
the BARR team technical note papers [
        <xref ref-type="bibr" rid="ref1 ref15 ref27 ref34 ref36">15, 27, 34, 36, 1</xref>
        ].
      </p>
      <p>
        The IBI-UPF team used an approach structured into two sequential steps, namely an
entity spotting phase and a relation extraction phase [
        <xref ref-type="bibr" rid="ref34">34</xref>
        ]. For the frist step they trained
three token-based Random Forest classifiers that modelled both the token itself as well
as a context window of 2 characters: (1) an abbreviation token classifier, (2) a long form
token classifier and (3) an abbreviation type classifier, i.e. whether the abbreviation is a
short form or another type of abbreviation. During the second phase, they applied a set
of heuristics and a scoring function to identify SF-LF pair relations.
      </p>
      <p>
        The system used by the UC-III team was based on a rule-based approach [
        <xref ref-type="bibr" rid="ref36">36</xref>
        ]. They
implemented a technique consisting of an adaptation of an algorithm originally
proposed by Schwartz and Hearst [
        <xref ref-type="bibr" rid="ref38">38</xref>
        ] to handle English biomedical abbreviations and
tailored it to Spanish texts together with additional modifications. Their strategy assumed
that long forms are mentioned in the same sentence and before their corresponding short
forms. Abbreviations were extracted through pattern rules, imposing maximum length
cut-offs to identify the short forms as well as long forms. Short forms had to start with
an alphanumeric character and had to contain at least one letter. Moreover, the first
character of the short form had to match the character of the initial position of the first
word of the detected long form. The system of UC-III team was particularly fast both
for the abbreviation recognition as well as for the SF-LF relation detection.
      </p>
      <p>
        The IXA-UPV/EHU team addressed this task as a classical NER scenario by
applying two standard, domain agnostic, machine-learning based NER taggers for the
recognition of abbreviation mentions [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. They managed to submit results for one of their
systems, namely a perceptron tagger based on sparse, shallow features. Their system
relied on the IXA pipes tokenizer for pre-processing and tokenization, together with
the ixa-pipe-nerc for short form and long form mention recognition. This team used
the BARR background set to induce clusters and word embeddings for training both
of their systems. For the ixa-pipe-nerc tagger they used a customized version of the
Apache OpenNLP implementation of the perceptron algorithm.
      </p>
      <p>The CNIO team provided a comparative benchmarking of three publicly available
state-of-the-art biomedical abbreviation detection and recognition systems. These
systems, based on heuristics, were originally developed for English biomedical texts and
thus had to be adapted to handle Spanish documents. Tailoring to Spanish covered
essentially processing of accentuated characters. The three approaches that were
benchmarked by this team included the Ab3P, ADRS and BADREX taggers. These tools
exploit heuristics related to the presence of parentheses and the detection of
abbreviation candidates within them, together with the exploration of the nearby context for
detecting SF-LF pairs. These taggers analyse characters inside parentheses, and detect
words that correspond to long form candidates by matching these characters in certain
positions.</p>
      <p>
        The EHU team adapted a system that relied on the FreelingMed analyser to find
abbreviations [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ]. This system incorporated a dictionary contained in FreelingMed
that comprised abbreviations and their expanded forms appearing in SNOMED CT.
Furthermore, this team also used heuristics to recognize word-forms based on
particular patterns that were indicative for abbreviations. Finally, through Freeling they could
mark abbreviations referring to units of weight, length or time.
      </p>
      <p>
        The UPC team used three different tactics for detecting acronym-expansion pairs
[
        <xref ref-type="bibr" rid="ref27">27</xref>
        ]. Their similarity-based strategy relied on 13 hand crafted extraction rules that were
applied in decreasing order of confidence. These rules took into account aspects such
as word shape features observed in training set annotations. Their gazetteer-based
strategy relied on a large medical terminological resource integrating entries from
multiple sources. This terminology contained 14,360 short forms-long form pairs that were
detected in the terminology using the similarity-based method. Finally, their
distancebased strategy was based on a list of patterns covering acronym-expansion pairs
cooccurring closely and frequently in sentences of the training data.
      </p>
      <p>
        The UNED team participated through a two-step approach that identified first the
abbreviation candidates and then tried to match potential definitions [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ]. For the
identification of abbreviations they essentially considered upper case terms, terms with
combinations of upper case letters and other characters as well as the presence of
parenthesis. Also this team employed an adaptation of the Schwartz and Hearst algorithm for
detecting SF-LF pairs in running text [
        <xref ref-type="bibr" rid="ref38">38</xref>
        ]. Their extension of this algorithm
incorporated cases where the characters of the short form did not appear in the identical order
than the corresponding long form matching characters. Moreover, the number of words
of long forms was not allowed to exceed the double of the characters of its
corresponding short form. They also constructed a set of patterns to handle frequent special cases.
When the pattern based-strategy did not detect a valid definition they used a dictionary
of 7,916 entries to match possible long forms.
      </p>
      <p>
        The last three teams (EHU, UPC and UNED) are described in a joined working
notes paper exploring output combinations of these strategies [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ]. According to their
conclusion, combinations did not improve substantially the results when compared to
the single best system from the UNED team.
6
      </p>
    </sec>
    <sec id="sec-6">
      <title>Results and Discussion</title>
      <p>We received a total of 25 runs from the seven teams that submitted results for the
subtasks of the BARR track. Dividing the BARR track into two sub-task allowed us to
carry out a more granular evaluation analysis and enabled participants to improve their
systems precisely for each of the underlying components.</p>
      <p>We evaluated a total of 13 runs for the first sub-task, the detection of mention
offsets of abbreviation short forms and abbreviation definitions, i.e. long forms and nested
long forms. Table 3 illustrates the obtained performance for each of the evaluated
submissions for this sub-task. The top scoring prediction was provided by the UNED team,
obtaining an F-measure of 74.91%, followed by the UC-III team with an F-measure
of 72.05%. The UC-III team, which relied on a customized version of the Schwartz
and Hearst algorithm, obtained the highest recall, corresponding to 73.47%. The
second highest recall was obtained by run 4 of the IBI-UPF team (69.85%). All
submissions showed a consistently higher precision when compared to the recall results. The
top scoring precision was reached by the AB3P benchmarking run of the CNIO team
Abbreviation team runs Precision Recall F1-score
(87.95%) followed by the UNED submission (86.84%). These results indicate that rule
based approaches did obtain a competitive performance for this subtask.</p>
      <p>In summary, the achieved results indicate that participating systems were able to
obtain a reasonable performance for this subtask. Nonetheless, these results also suggest
that there is room for additional performance improvement. These results also imply
that this task was considerably more difficult when compared to analogous approaches
tested on English biomedical abstracts. In order to determine potential aspects
affecting the abbreviation mention recognition systems performance we carried out a
detailed analysis of annotations that were often wrongly predicted across multiple runs.
When differentiating the results of different mention types, it became clear that short
forms were considerable easier to detect when compared to long forms or nested long
form mentions. Many of the errors in long form recognition referred to incorrect
mention boundary detection, particularly in cases of very long definitions. The presence of
hyphens, numbers, certain non-alphanumeric characters or accentuated characters was
also observed frequently within missed mentions, of both short forms and long forms.
Long forms with conjoined words were another difficult case for participating teams.</p>
      <p>A frequently missed type of short form corresponded to single letter abbreviations
(e.g. T - telaprevir) or abbreviations that contained punctuation marks (e.g. P.M.M.).
Another particularly difficult type of abbreviation, also affecting the SF-LF relation
extraction subtask, corresponded to the special case of non-Spanish short forms, whereas
the corresponding co-mentioned long form description was written in Spanish. This
special scenario is particularly widespread in the biomedical and medical literature
due to the influence of English academic and technical terminology. Most of the
nonSpanish abbreviations corresponded to English terms, while a minor fraction
corresponded to abbreviations in another language such as Latin. This special ”bi-lingual
short form-long form pairs” were kept due to their practical relevance. For instance
abbreviations such as PSA (prostate specific antigen) is more often used in the Spanish
medical texts as abbreviation for ant´ıgeno prosta´tico espec´ıfico instead of its Spanish
counterpart (APE). The detection of this kind of SF-LF pairs would require the use
of either bilingual abbreviation definition lexical resources or even machine translation
techniques.</p>
      <p>The second, and main subtask of the BARR track, focused on the detection of pairs
of short-form/long-from mentions in running text. This task was obviously directly
dependent on the results of the previous sub-task. Table 4 provides the evaluation results
of the performance of all received submissions for this sub-task.</p>
      <p>The run of the UNED team obtained the highest F-measure (67.74%) followed
by the UC-III submission (66.59%). The UC-III run reached the top scoring recall
(61.78%) whereas the UNED run obtained the second highest recall score (60.93%).
When examining precision, the UPC run obtained the highest score (97.67%), but with
a very low recall (recall 8.91%). The CNIO-AB3P obtained the second highest
precision score (84.23%). An examination of prediction errors for the SF-LF pair task
showed that many mismatches were caused by incorrect boundary recognition of the
long form portion. We observed among the types of frequently missed SF-LF pairs,
relations where the short form was stated in the text before the long form (and not
afterwards). Other difficult cases corresponded to pairs where several characters of the
short form appeared in a single word of its long form (e.g. CTP - cate´ter telescopado, or
CMV cytomegalovirus). Long forms with permuted word order not following its short
form character order were problematic for many teams, while in some cases also pairs
that contained SFs or LFs with accentuated characters were not detected correctly.
Curiously SF-LF pairs corresponding to substances and chemical compounds seemed to
imply some difficulty. Finally, cases where parentheses serving as abbreviation markers
were missing caused also in a drop in performance.
7</p>
    </sec>
    <sec id="sec-7">
      <title>Conclusions</title>
      <p>
        The BARR track was able to promote the development of resources, corpora and
processing tools for a key task of medical text mining, the recognition of abbreviations. The
BARR track was promoted by the Plan for the Advancement of Language Technology,
a Spanish national plan to encourage the development of natural language technologies
for Spanish and Iberian languages [
        <xref ref-type="bibr" rid="ref47">47</xref>
        ]. The obtained results highlight that participating
teams were able to implement systems that can be valuable for the development of
lexical resources for disambiguating abbreviations. Improvement of future systems might
require access, not only to additional corpora, but also to better basic medical language
processing and pre-processing components. Follow up tasks should address the
disambiguation of all abbreviations beyond SF-LF pairs, including also other document types
such as clinical notes as well as identifying the knowledge domain to which an
abbreviation belongs [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ]. Several teams continued the improvement of their systems after the
test phase. In order to encourage further improvements and the implementation of new
systems beyond the BARR test phase we plan the construction of an additional blinded
Gold Standard dataset that can be used to assess future tools through the BARR-Markyt
system. We will also publish the results of the BARR inter-annotator agreement study
on the track web. We expect that the BARR corpus should be distributed in
alternative formats such as the popular BioC format [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Future tasks should also align with
interoperability aspects to facilitate integration of the tools of participating teams into
platforms such as the OpenMinted infrastructure1 in addition to technical assessment as
done by benchmarking initiatives like BeCalm2 [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ].
8
      </p>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgments</title>
      <p>We acknowledge the Encomienda MINETAD-CNIO/OTG Sanidad Plan TL and
OpenMinted (654021) H2020 project for funding.
1 http://openminted.eu/
2 http://www.becalm.eu/</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Agerri</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rigau</surname>
          </string-name>
          , G.:
          <article-title>Applying existing named entity taggers at barr ibereval 2017 task</article-title>
          .
          <source>SEPLN</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Aleixandre-Benavent</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <article-title>Bolan˜os-</article-title>
          <string-name>
            <surname>Pizarro</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>de Dios</surname>
            ,
            <given-names>J.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Navarro-Molina</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Fuentes de informaci o´n bibliogra´fica (ii). bases de datos bibliogra´ficas espan˜olas en ciencias de la salud de intere´s en pediatr´ıa: Ime, ibecs y medes/sources of bibliographic information (ii). spanish bibliographic database in health sciences of interest in pediatrics. databases of ime, ibecs and medes</article-title>
          .
          <source>Acta Pediatrica Espanola</source>
          <volume>69</volume>
          (
          <issue>4</issue>
          ),
          <volume>177</volume>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Ao</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Takagi</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>ALICE: an algorithm to extract abbreviations from MEDLINE</article-title>
          .
          <article-title>Journal of the American Medical Informatics Association : JAMIA (</article-title>
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Benavent</surname>
            ,
            <given-names>R.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Iscla</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          :
          <article-title>Problemas del lenguaje me´dico actual.(ii) abreviaciones y epo´nimos</article-title>
          .
          <source>Papeles Me´d 10(4)</source>
          ,
          <fpage>170</fpage>
          -
          <lpage>6</lpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <issue>5</issue>
          .
          <string-name>
            <surname>Chang</surname>
            ,
            <given-names>J.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schtze</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Altman</surname>
          </string-name>
          , R.B.:
          <article-title>Creating an online dictionary of abbreviations from medline</article-title>
          .
          <source>Journal of the American Medical Informatics Association</source>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Comeau</surname>
            ,
            <given-names>D.C.</given-names>
          </string-name>
          , Islamaj Do g˘an, R.,
          <string-name>
            <surname>Ciccarese</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cohen</surname>
            ,
            <given-names>K.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krallinger</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leitner</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lu</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peng</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rinaldi</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Torii</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , et al.:
          <article-title>Bioc: a minimalist approach to interoperability for biomedical text processing</article-title>
          .
          <source>Database</source>
          <year>2013</year>
          , bat064 (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Dawes</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pluye</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shea</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grad</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Greenberg</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nie</surname>
            ,
            <given-names>J.Y.</given-names>
          </string-name>
          :
          <article-title>The identification of clinically important elements within medical journal abstracts: Patient population problem, exposure intervention, comparison, outcome, duration and results (pecodr)</article-title>
          .
          <source>Journal of Innovation in Health Informatics</source>
          <volume>15</volume>
          (
          <issue>1</issue>
          ),
          <fpage>9</fpage>
          -
          <lpage>16</lpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Demner-Fushman</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chapman</surname>
            ,
            <given-names>W.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McDonald</surname>
            ,
            <given-names>C.J.:</given-names>
          </string-name>
          <article-title>What can natural language processing do for clinical decision support? Journal of biomedical informatics 42(5</article-title>
          ),
          <fpage>760</fpage>
          -
          <lpage>772</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Friedman</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Discovering novel adverse drug events using natural language processing and mining of the electronic health record</article-title>
          .
          <source>Artificial Intelligence in Medicine</source>
          pp.
          <fpage>1</fpage>
          -
          <lpage>5</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Friedman</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shagina</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lussier</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hripcsak</surname>
          </string-name>
          , G.:
          <article-title>Automated encoding of clinical documents based on natural language processing</article-title>
          .
          <source>Journal of the American Medical Informatics Association</source>
          <volume>11</volume>
          (
          <issue>5</issue>
          ),
          <fpage>392</fpage>
          -
          <lpage>402</lpage>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Harkema</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dowling</surname>
            ,
            <given-names>J.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thornblade</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chapman</surname>
            ,
            <given-names>W.W.</given-names>
          </string-name>
          :
          <article-title>Context: an algorithm for determining negation, experiencer, and temporal status from clinical reports</article-title>
          .
          <source>Journal of biomedical informatics 42(5)</source>
          ,
          <fpage>839</fpage>
          -
          <lpage>851</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12. on Accreditation of Healthcare Organizations,
          <string-name>
            <surname>J.C.</surname>
          </string-name>
          , et al.:
          <article-title>Medication errors related to potentially dangerous abbreviations</article-title>
          .
          <source>Sentinel Event Alert</source>
          <volume>23</volume>
          ,
          <fpage>1</fpage>
          -
          <lpage>4</lpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>C.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lu</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>Community challenges in biomedical text mining over 10 years: success, failure and the future</article-title>
          .
          <source>Briefings in bioinformatics 17(1)</source>
          ,
          <fpage>132</fpage>
          -
          <lpage>144</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lowe</surname>
            ,
            <given-names>H.J.:</given-names>
          </string-name>
          <article-title>A novel hybrid approach to automated negation detection in clinical radiology reports</article-title>
          .
          <source>Journal of the American Medical Informatics Association</source>
          <volume>14</volume>
          (
          <issue>3</issue>
          ),
          <fpage>304</fpage>
          -
          <lpage>311</lpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Intxaurrondo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krallinger</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Cnio at barr ibereval 2017: exploring three biomedical abbreviation identifiers for spanish biomedical publications</article-title>
          .
          <source>SEPLN</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Islamaj</surname>
            <given-names>Doan</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Comeau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.C.</given-names>
            ,
            <surname>Yeganova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Wilbur</surname>
          </string-name>
          , W.J.:
          <article-title>Finding abbreviations in biomedical literature: three bioc-compatible modules and four bioc-formatted corpora</article-title>
          .
          <source>Database : the journal of biological databases and curation</source>
          <year>2014</year>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Islamaj</surname>
            <given-names>Doan</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Comeau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.C.</given-names>
            ,
            <surname>Yeganova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Wilbur</surname>
          </string-name>
          , W.J.:
          <article-title>Finding abbreviations in biomedical literature: three bioc-compatible modules and four bioc-formatted corpora</article-title>
          .
          <source>Database</source>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Jensen</surname>
            ,
            <given-names>P.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jensen</surname>
            ,
            <given-names>L.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brunak</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Mining electronic health records: towards better research applications and clinical care</article-title>
          .
          <source>Nature Reviews Genetics</source>
          <volume>13</volume>
          (
          <issue>6</issue>
          ),
          <fpage>395</fpage>
          -
          <lpage>405</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Jiang</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosenbloom</surname>
            ,
            <given-names>S.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mani</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Denny</surname>
            ,
            <given-names>J.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          :
          <article-title>A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries</article-title>
          .
          <source>Journal of the American Medical Informatics Association</source>
          <volume>18</volume>
          (
          <issue>5</issue>
          ),
          <fpage>601</fpage>
          -
          <lpage>606</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Krallinger</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Intxaurrondo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lopez-Martin</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , de la Pen˜a,
          <string-name>
            <surname>S.</surname>
          </string-name>
          , Pe´rez-Pe´rez,
          <string-name>
            <given-names>M.</given-names>
            , Pe´rezRodr´ıguez, G., Santamar´ıa, J.,
            <surname>Villegas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Akhondi</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.</surname>
          </string-name>
          , Lourenc¸o,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Valencia</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.</surname>
          </string-name>
          :
          <article-title>Resources for the extraction of abbreviations and terms in spanish from medical abstracts: the barr corpus, lexical resources and document collection</article-title>
          .
          <source>SEPLN</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Krallinger</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rabal</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leitner</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vazquez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Salgado</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lu</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leaman</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ji</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lowe</surname>
            ,
            <given-names>D.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sayle</surname>
            ,
            <given-names>R.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Batista-Navarro</surname>
            ,
            <given-names>R.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rak</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huber</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rocktschel</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Matos</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Campos</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tang</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Munkhdalai</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ryu</surname>
            ,
            <given-names>K.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ramanan</surname>
            ,
            <given-names>S.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nathan</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , itnik, S.,
          <string-name>
            <surname>Bajec</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weber</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Irmer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Akhondi</surname>
            ,
            <given-names>S.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kors</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>An</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sikdar</surname>
            ,
            <given-names>U.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ekbal</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yoshioka</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dieb</surname>
            ,
            <given-names>T.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Choi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Verspoor</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khabsa</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Giles</surname>
            ,
            <given-names>C.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ravikumar</surname>
            ,
            <given-names>K.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lamurias</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Couto</surname>
            ,
            <given-names>F.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dai</surname>
            ,
            <given-names>H.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tsai</surname>
          </string-name>
          , R.T.H.,
          <string-name>
            <surname>Ata</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Can</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Usi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alves</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Segura-Bedmar</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Martnez</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oyarzabal</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valencia</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>The chemdner corpus of chemicals and drugs and its annotation principles</article-title>
          .
          <source>Journal of cheminformatics 7</source>
          ,
          <issue>S2</issue>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lussier</surname>
            ,
            <given-names>Y.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Friedman</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>A study of abbreviations in the umls</article-title>
          .
          <source>Proceedings. AMIA Symposium</source>
          pp.
          <fpage>393</fpage>
          -
          <lpage>397</lpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aronson</surname>
            ,
            <given-names>A.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Friedman</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>A study of abbreviations in medline abstracts</article-title>
          .
          <source>Proceedings. AMIA Symposium</source>
          pp.
          <fpage>464</fpage>
          -
          <lpage>468</lpage>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , Johnson, S.B.,
          <string-name>
            <surname>Friedman</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Automatic resolution of ambiguous terms based on machine learning and conceptual relations in the umls</article-title>
          .
          <source>Journal of the American Medical Informatics Association : JAMIA 9</source>
          ,
          <fpage>621</fpage>
          -
          <lpage>636</lpage>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lussier</surname>
            ,
            <given-names>Y.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Friedman</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Disambiguating ambiguous biomedical terms in biomedical narrative text: an unsupervised method</article-title>
          .
          <source>Journal of biomedical informatics 34(4)</source>
          ,
          <fpage>249</fpage>
          -
          <lpage>261</lpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Meystre</surname>
            ,
            <given-names>S.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Savova</surname>
            ,
            <given-names>G.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kipper-Schuler</surname>
            ,
            <given-names>K.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hurdle</surname>
            ,
            <given-names>J.F.</given-names>
          </string-name>
          , et al.:
          <article-title>Extracting information from textual documents in the electronic health record: a review of recent research</article-title>
          .
          <source>Yearb Med Inform</source>
          <volume>35</volume>
          (
          <issue>128</issue>
          ),
          <volume>44</volume>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Montalvo</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oronoz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , Rodr´ıguez, H., Mart´ınez, R.:
          <article-title>Biomedical abbreviation recognition and resolution by prosa-med</article-title>
          .
          <source>SEPLN</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Navarro</surname>
            ,
            <given-names>F.A.</given-names>
          </string-name>
          :
          <article-title>Repertorio de siglas, acro´nimos, abreviaturas y s´ımbolos utilizados en los textos me´dicos en espan˜ol</article-title>
          .
          <source>Panace</source>
          <volume>9</volume>
          (
          <issue>27</issue>
          ) (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Pe</surname>
          </string-name>
          <article-title>´rez-Pe´rez, M., Pe´rez-</article-title>
          <string-name>
            <surname>Rodrıguez</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blanco-Mıguez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fdez-Riverola</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valencia</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krallinger</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lourenco</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Benchmarking biomedical text mining web servers at biocreative v. 5: the technical interoperability and performance of annotation servers-tips track</article-title>
          .
          <source>Proceedings of the BioCreative 5</source>
          ,
          <fpage>12</fpage>
          -
          <lpage>21</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30. Pe´rez-Pe´rez,
          <string-name>
            <given-names>M.</given-names>
            , Pe´rez-Rodr´ıguez, G.,
            <surname>Rabal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            ,
            <surname>Vazquez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Oyarzabal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Fdez-Riverola</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Valencia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Krallinger</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          , Lourenc¸o,
          <string-name>
            <surname>A.</surname>
          </string-name>
          :
          <article-title>The markyt visualisation, prediction and benchmark platform for chemical and gene entity recognition at biocreative/chemdner challenge</article-title>
          .
          <source>Database</source>
          <year>2016</year>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31. Pin˜ero, L.,
          <string-name>
            <surname>Aleixandre-Benavent</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , Iba´n˜ez, A.B.:
          <article-title>Uso y abuso de abreviaturas y siglas entre atencio´n primaria, especializada y hospitalaria</article-title>
          .
          <source>Papeles me´dicos 15(2)</source>
          ,
          <volume>29</volume>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          32. Plasencia Mart´ınez,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Moliner</surname>
          </string-name>
          <string-name>
            <surname>Lahoz</surname>
          </string-name>
          , J.:
          <article-title>Uso y abuso de las abreviaturas en los informes del hospital cl´ınico universitario lozano blesa</article-title>
          .
          <source>Rev Paraninfo Digital</source>
          <volume>16</volume>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          33.
          <string-name>
            <surname>Rijsbergen</surname>
            ,
            <given-names>C.J.V.: Information</given-names>
          </string-name>
          <string-name>
            <surname>Retrieval. Butterworth-Heinemann</surname>
          </string-name>
          (
          <year>1979</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          34.
          <string-name>
            <surname>Ronzano</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Furlong</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Ibi-upf at barr-2017: learning to identify abbreviations in biomedical literature. system description</article-title>
          .
          <source>SEPLN</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          35. Rubio-Lo´pez, I.,
          <string-name>
            <surname>Costumero</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ambit</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , Gonzalo-Mart´ın,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Menasalvas</surname>
          </string-name>
          ,
          <string-name>
            <surname>E.</surname>
          </string-name>
          , Rodr´ıguez, G.A.:
          <article-title>Acronym disambiguation in spanish electronic health narratives using machine learning techniques</article-title>
          .
          <source>Studies in health technology and informatics 235</source>
          ,
          <issue>251</issue>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          36.
          <string-name>
            <surname>Sa</surname>
            ´nchez,
            <given-names>C.</given-names>
          </string-name>
          , Mart´ınez, P.:
          <article-title>A proposed system to identify and extract abbreviation definitions in spanish biomedical texts for the biomedical abbreviation recognition and resolution (barr) 2017</article-title>
          . SEPLN (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          37.
          <string-name>
            <surname>Savova</surname>
            ,
            <given-names>G.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chapman</surname>
            ,
            <given-names>W.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zheng</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Crowley</surname>
            ,
            <given-names>R.S.:</given-names>
          </string-name>
          <article-title>Anaphoric relations in the clinical narrative: corpus creation</article-title>
          .
          <source>Journal of the American Medical Informatics Association</source>
          <volume>18</volume>
          (
          <issue>4</issue>
          ),
          <fpage>459</fpage>
          -
          <lpage>465</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          38.
          <string-name>
            <surname>Schwartz</surname>
            ,
            <given-names>A.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hearst</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          :
          <article-title>A simple algorithm for identifying abbreviation definitions in biomedical text</article-title>
          .
          <source>Pacific Symposium on Biocomputing. Pacific Symposium</source>
          on Biocomputing pp.
          <fpage>451</fpage>
          -
          <lpage>462</lpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          39.
          <string-name>
            <surname>Schwartz</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hearst</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>A simple algorithm for identifying abbreviation definitions in biomedical text</article-title>
          .
          <source>In: In Proceedings of Pacic Symposium on Biocomputing</source>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          40.
          <string-name>
            <surname>Sohn</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Comeau</surname>
            ,
            <given-names>D.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wilbur</surname>
          </string-name>
          , W.J.:
          <article-title>Abbreviation definition identification based on automatic precision estimates</article-title>
          .
          <source>BMC bioinformatics</source>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          41.
          <string-name>
            <surname>Sokolova</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lapalme</surname>
          </string-name>
          , G.:
          <article-title>A systematic analysis of performance measures for classification tasks</article-title>
          .
          <source>Inf. Process. Manage</source>
          . (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          42.
          <string-name>
            <surname>Sun</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rumshisky</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Uzuner</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>Evaluating temporal relations in clinical text: 2012 i2b2 challenge</article-title>
          .
          <source>Journal of the American Medical Informatics Association</source>
          <volume>20</volume>
          (
          <issue>5</issue>
          ),
          <fpage>806</fpage>
          -
          <lpage>813</lpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          43. Tena Marsa`,
          <string-name>
            <surname>X.</surname>
          </string-name>
          :
          <article-title>El culto a las abreviaciones: idolatr´ıa o virtud</article-title>
          .
          <source>Reumatolog´ıa Cl´ınica 8(2)</source>
          ,
          <fpage>54</fpage>
          -
          <lpage>55</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          44.
          <string-name>
            <surname>Torii</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>A comparison study of biomedical short form definition detection algorithms</article-title>
          .
          <source>In: Proceedings of the 1st International Workshop on Text Mining in Bioinformatics</source>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          45.
          <string-name>
            <surname>Uzuner</surname>
            ,
            <given-names>O¨ .</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Luo</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Szolovits</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Evaluating the state-of-the-art in automatic deidentification</article-title>
          .
          <source>Journal of the American Medical Informatics Association</source>
          <volume>14</volume>
          (
          <issue>5</issue>
          ),
          <fpage>550</fpage>
          -
          <lpage>563</lpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          46.
          <string-name>
            <surname>Velupillai</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mowery</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>South</surname>
            ,
            <given-names>B.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kvist</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dalianis</surname>
          </string-name>
          , H.:
          <article-title>Recent advances in clinical natural language processing in support of semantic analysis</article-title>
          .
          <source>Yearbook of medical informatics 10(1)</source>
          ,
          <volume>183</volume>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          47.
          <string-name>
            <surname>Villegas</surname>
            , M., de la Pen˜a,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Intxaurrondo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , Santamar´ıa, J.,
          <string-name>
            <surname>Krallinger</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Esfuerzos para fomentar la miner´ıa de textos en biomedicina ma´s alla´ del ingle´s: el plan estrate´gico nacional espan˜ol para las tecnolog´ıas del lenguaje</article-title>
          .
          <source>SEPLN</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          48.
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Denny</surname>
            ,
            <given-names>J.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosenbloom</surname>
            ,
            <given-names>S.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miller</surname>
            ,
            <given-names>R.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Giuse</surname>
            ,
            <given-names>D.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          :
          <article-title>A comparative study of current clinical natural language processing systems on handling abbreviations in discharge summaries</article-title>
          .
          <source>In: AMIA annual symposium proceedings</source>
          . vol.
          <year>2012</year>
          , p.
          <fpage>997</fpage>
          . American Medical Informatics Association (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref49">
        <mixed-citation>
          49.
          <string-name>
            <given-names>Yetano</given-names>
            <surname>Laguna</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.</surname>
          </string-name>
          , Alberola Cu n˜at, V.:
          <article-title>Diccionario de siglas me´dicas</article-title>
          . SEDOM, Sociedad Espan˜ola de Documentacio´
          <article-title>n Me´dica (</article-title>
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref50">
        <mixed-citation>
          50.
          <string-name>
            <surname>Yu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hripcsak</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Friedman</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Mapping abbreviations to full forms in biomedical articles</article-title>
          .
          <source>Journal of the American Medical Informatics Association : JAMIA 9</source>
          ,
          <fpage>262</fpage>
          -
          <lpage>272</lpage>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref51">
        <mixed-citation>
          51.
          <string-name>
            <surname>Zheng</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chapman</surname>
            ,
            <given-names>W.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Crowley</surname>
            ,
            <given-names>R.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Savova</surname>
            ,
            <given-names>G.K.</given-names>
          </string-name>
          :
          <article-title>Coreference resolution: A review of general methodologies and applications in the clinical domain</article-title>
          .
          <source>Journal of biomedical informatics 44(6)</source>
          ,
          <fpage>1113</fpage>
          -
          <lpage>1122</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>