<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>ICB-UMA at CANTEMIST 2020: Automatic ICD-O Coding in Spanish with BERT</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Guillermo López-García</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>José M. Jerez</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nuria Ribelles</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Emilio Alba</string-name>
          <email>ealbac@uma.es</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Francisco J. Veredas</string-name>
          <email>franveredas@uma.es</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Departamento de Lenguajes y Ciencias de la Computación, ETSI Informática, Universidad de Málaga</institution>
          ,
          <addr-line>Málaga</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Unidad de Gestión Clínica Intercentros de Oncología, Instituto de Investigación Biomédica de Málaga (IBIMA), Hospitales Universitarios Regional y Virgen de la Victoria</institution>
          ,
          <addr-line>Málaga</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <fpage>468</fpage>
      <lpage>476</lpage>
      <abstract>
        <p>This working notes paper presents our contribution to the CANTEMIST track. Our team has participated in the CANTEMIST-CODING subtask, the first shared task consisted in the automatic assignment of ICD-O-3 codes to Spanish oncology clinical cases. We addressed the task as a multi-label text classification problem using BERT model [1]. In order to leverage all the language modelling capabilities of the BERT architecture when applied to the CANTEMIST corpus, we have used an enhanced version of our fragment-based classification approach initially developed to tackle the CodiEsp-D task [2]. Hence, applying the improved version of our fragment-based strategy to the CANTEMIST corpus, we produced short fragments of text comprising a sequence of sentences from the long clinical documents present in the oncology corpus, and used them as input to the model. Two diferent versions of the BERT-Base model, namely the Multilingual BERT [3] and the BERT-SciELO [4] models, were fine-tuned on the CANTEMIST-CODING annotated corpus. The Multilingual BERT model further pre-trained on an unlabeled Spanish corpus of oncology clinical cases retrieved from Galén [5], achieved the highest classification performance among our five submitted systems, obtaining a final Mean Average Precision (MAP) score of 0.847 on the evaluation set.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Clinical NLP</kwd>
        <kwd>BERT</kwd>
        <kwd>Spanish oncology clinical cases</kwd>
        <kwd>Automatic clinical coding</kwd>
        <kwd>Transfer learning</kwd>
        <kwd>Text classification</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        There is a growing interest in processing clinical documents using text mining and Natural
Language Processing (NLP) techniques, giving rise to the birth of a new scientific subdiscipline
situated at the intersection between Medicine, Linguistics and Computer Science, namely
Clinical NLP [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. One of the most active areas of research in Clinical NLP is the development of
tools that perform automatic clinical coding, i.e. the task of autonomously extracting valuable
structured information contained in the unstructured medical notes, following standardised
coding terminologies [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>
        Historically, Clinical NLP researchers have focused mainly on English text, generating and
exploiting clinical coding resources in the English language [
        <xref ref-type="bibr" rid="ref10 ref8 ref9">8, 9, 10</xref>
        ]. With 483 million native
speakers [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], there exists a noteworthy interest in processing medical documents in Spanish.
However, the lack of clinical linguistic resources for non-English languages makes it specially
arduous to develop tools tailored to the Spanish clinical documents.
      </p>
      <p>
        With the aim of promoting the application of Clinical NLP techniques to Spanish medical
texts, the CANcer TExt MIning Shared Task (CANTEMIST) [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] has been organised in the
context of the Iberian Languages Evaluation Forum (IberLEF 2020). CANTEMIST is the first
shared task consisted in the automatic clinical coding of oncology medical cases written in
Spanish. The CANTEMIST corpus comprises 1.3 clinical cases manually annotated by
experts in oncology using the Spanish version (eCIE-O-3.1) of the International Classification of
Diseases for Oncology (ICD-O-3) codes. The CANTEMIST track is composed of three distinct
subtasks: CANTEMIST-NER, CANTEMIST-NORM and CANTEMIST-CODING.
CANTEMISTNER subtrack requires identifying tumour morphology mentions contained in a free-text clinical
case, whereas in CANTEMIST-NORM subtask tumour morphology mentions must be both
identified and normalised by assigning their corresponding ICD-O-3 codes. On the other hand,
CANTEMIST-CODING task requires assigning a set of ICD-0-3 codes to each medical document
in the corpus.
      </p>
      <p>
        In this work, we present our contribution to the IberLEF 2020, where our team has
participated in the CANTEMIST-CODING subtask. We have addressed the task as a multi-label text
classification problem using BERT [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], a Transformer-based [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] language model that achieved
state-of-the-art results on several diferent NLP tasks. BERT was initially designed to receive
short fragments of text as input to the model, as opposed to the long oncology texts present
in the CANTEMIST corpus. In order to leverage all the language modelling capabilities of the
BERT architecture when applied to the CANTEMIST corpus, we have employed an improved
version of our fragment-based classification approach originally developed for the CodiEsp-D
task [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Hence, using the annotations available for the CANTEMIST-NORM subtask, we turned
the CANTEMIST-CODING multi-label document classification problem into a multi-label
shortfragment classification task, producing annotated short fragments of text with a full semantic
meaning. Furthermore, we experimented with two diferent versions of the BERT-Base model:
the Multilingual version [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], and the BERT-SciELO [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] model. Besides, diferent alternatives
were explored to adapt the two models to the clinical domain, by further pre-training their
weights on medical corpora before fine-tuning the models on the CANTEMIST corpus.
      </p>
      <p>For reproducibility purposes, the implementation of our approach is publicly available at
https://github.com/guilopgar/CANTEMIST-2020.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Materials and Methods</title>
      <sec id="sec-2-1">
        <title>2.1. Corpora</title>
        <sec id="sec-2-1-1">
          <title>2.1.1. Clinical corpora</title>
          <p>
            In this work, we experimented with two diferent BERT-Base models, namely Multilingual
BERT and BERT-SciELO. Multilingual BERT was pre-trained on an extensive multilingual
general domain corpus comprising texts from 104 diferent languages [
            <xref ref-type="bibr" rid="ref3">3</xref>
            ], including Spanish.
On the other hand, BERT-SciELO model was pre-trained on a corpus of biomedical articles in
Spanish [
            <xref ref-type="bibr" rid="ref4">4</xref>
            ]. With the aim of adapting both models to the distinctive features of the Spanish
clinical texts domain, we decided to further pre-train the models on a corpus of de-identified
medical texts in Spanish retrieved from the Galén Oncology Information System [
            <xref ref-type="bibr" rid="ref5">5</xref>
            ]. The corpus
comprises 30.9 unlabeled documents containing oncology clinical notes written by physicians
from the Oncology Departments of the Hospital Universitario Virgen de la Victoria (HUVV)
and the Hospital Regional Universitario (HRU) in Málaga, Spain.
          </p>
          <p>
            Moreover, in order to exploit the cross-lingual features extracted by the Multilingual BERT
model, in addition to the corpus of medical texts retrieved from Galén, we also used a clinical
corpus in English to pre-train the Multilingual BERT model. In this way, we joined the Galén
oncology documents and the discharge summaries from the MIMIC-III database [
            <xref ref-type="bibr" rid="ref14">14</xref>
            ] in a single
bilingual clinical corpus, used to perform the unsupervised pre-training of the multilingual
model. From the multiple categories of documents stored in the MIMIC-III database, e.g.
radiology, nursing, nutrition and pharmacy reports, we only selected the discharge summaries
texts, given the high similarity between the medical texts from Galén and the content of the
discharge summaries. In Table 1, a brief description of both the Galén oncology corpus and the
discharge summaries corpus retrieved from the MIMIC-III database is given.
          </p>
        </sec>
        <sec id="sec-2-1-2">
          <title>2.1.2. CANTEMIST corpus</title>
          <p>The CANTEMIST corpus contains 1.3 clinical cases manually curated by oncology experts,
covering a wide variety of cancer types. For the CANTEMIST-CODING subtask, the documents
from the corpus were annotated with ICD-O-3 codes. The entire corpus was divided into
four distinct subsets of annotated texts, the training (501 documents), development-1 (249
documents), development-2 (250 documents) and test (300 documents) sets. Teams participating
in the CANTEMIST-CODING subtrack were evaluated on the test set.</p>
          <p>In Table 2, a basic description of the CANTEMIST-CODING corpus is given. As it can be seen
from the table, the number of codes annotations is scarce considering the limited number of
documents contained in the corpus, having a low average number of texts where each ICD-O-3
code is present. This results in an imbalanced multi-label classification problem, in which for
each code, the annotated texts (positive samples) are clearly outnumbered by the documents
where the code is not present (negative samples).</p>
        </sec>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Classification system</title>
        <p>
          We have tackled the CANTEMIST-CODING challenge using BERT model [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ]. One of the
characteristic features of BERT is that its Transformer-based architecture is designed to process
an input sequence of WordPiece [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] sub-tokens with a limited length 
(

original implementation of BERT). This entails an important constraint when dealing with
long= 512 in the
document classification tasks such as CANTEMIST-CODING, in which most of the clinical cases
exhibit a WordPiece sub-tokens sequence length high above the maximum length supported by
BERT.
        </p>
        <p>
          In order to overcome this limitation, we have applied our fragment-based classification
approach initially developed to address the CodiEsp-D task [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. Given the high correspondence
between CodiEsp-D and CANTEMIST-CODING subtasks, the three-phases custom approach
was applied in a straightforward manner. In this way, we firstly split each clinical document of
the CANTEMIST corpus into short fragments of text. Subsequently, using the ICD-O-3 codes
annotations available for the CANTEMIST-NORM subtask, we annotated each fragment with
the oncology codes exclusively occurring within the fragment. Then, we used the annotated
fragments to perform the supervised fine-tuning of the BERT model on a fragment-level
multilabel classification task. Finally, since the evaluation of the CANTEMIST-CODING participating
systems was performed at document level, we post-processed the probabilities predicted by
the model at fragment level using a maximum probability criterion, producing a list of codes
ordered by relevance for each clinical document [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ].
        </p>
        <p>
          Nevertheless, in this work, we have not directly applied the first phase of the fragment-based
classification approach described above. Instead, we have modified the first of the three-phases
forming the fragment-based strategy, performing the text segmentation at the sentence level.
Concretely, during the splitting phase, for each clinical case, the text was firstly split into
sentences using the SPACCC Sentence Splitter tool [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. Then, the WordPiece tokenization
was performed on each sentence, producing a sequence of sub-tokens   = (  1, ..., 
1 ) for
every sentence, with length  . For each sentence   , if  &gt; 
− 2 (considering that BERT
always adds the sub-tokens [CLS] and [SEP] at the start and end positions, respectively, of
an input sequence), we split   into ⌈ /(
produced sentence had a maximum length of 
 = ( 1, ...,   ) = (( 11, ..., 
1 ), ..., (  1, ...,
        </p>
        <p>)) of 
− 2)⌉ further sentences, ensuring that each finally
− 2 sub-tokens. Hence, a final sequence
sub-tokens sentences was generated.</p>
        <p>
          Lastly, we split the sequence  into a sequence of fragments of contiguous sentences  =
( 1,  2, ...,   ) = (( 1, ...,   ), (  +1, ...,   )..., (  , ...,   )) using a simple greedy strategy: each fragment
  contained the maximum number of adjacent sentences such that ∑  ∈  |  | ≤  − 2. Using this
enhanced version of the splitting phase, along with the other two stages of the original version
of our fragment-based classification approach [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ], we could produce annotated short fragments
of text comprising a sequence of sentences with full semantic meaning.
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. Experiments</title>
        <p>
          We used two diferent versions of the BERT-Base model, namely the Multilingual BERT and the
BERT-SciELO models. To perform the unsupervised pre-training of both models on the clinical
corpora described in Section 2.1.1, we made use of the original TensorFlow implementation of
BERT [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]. As the vocabulary of the BERT-SciELO model does not account for any punctuation
character, we performed a pre-processing procedure consisted in substituting all punctuation
marks contained in the Galén oncology corpus (see Section 2.1.1) by spaces, and used the
pre-preprocessed version of the corpus to pre-train the weights of the model. Once pre-trained,
the model was fine-tuned on the CANTEMIST-CODING task, applying the same pre-processing
procedure to the CANTEMIST corpus before training the architecture. In the case of the
Multilingual BERT model, since punctuation marks are considered in its vocabulary, the raw
texts of both the clinical corpora and the CANTEMIST corpus were employed to pre-train
and then fine-tune the architecture, respectively. Regarding the models hyper-parameters, for
ifne-tuning, we used a maximum input sequence length of  = 100 for the Multilingual BERT
and a value of  = 72 for the BERT-SciELO model; for both models, we used RAdam [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] with
a learning rate of 3 × 10−5, a batch size of 16 and the number of epochs were experimentally
determined on the CANTEMIST-CODING development-2 subset using early-stopping, with an
upper limit of 40 epochs. Finally, with respect to the hardware resources, all experiments were
executed on a single GeForce GTX 1080 Ti 11 GB GPU.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Results</title>
      <p>In this section, we describe the results obtained by our team, ICB-UMA, at the
CANTEMISTCODING task. We submitted five diferent runs of our fragment-based classification system. The
ifrst (ICB-UMA run1) and the fourth (ICB-UMA run4) submissions corresponded to the original
Multilingual BERT and BERT-SciELO models, respectively, fine-tuned on the
CANTEMISTCODING training, development-1 and development-2 corpora. Submissions ICB-UMA run2 and
ICB-UMA run 5 contained the codes predicted by the Multilingual BERT and the BERT-SciELO
models, respectively, further pre-trained on the Galén oncology corpus (see section 2.1.1) and
then fine-tuned on the CANTEMIST-CODING corpus. Finally, submission ICB-UMA run3
corresponded to the Multilingual BERT model further pre-trained on the bilingual clinical
corpus comprising the texts from the Galeń oncology corpus and the discharge summaries from
the MIMIC-III database, and subsequently fine-tuned on the CANTEMIST-CODING corpus. To
ifne-tune the weights of each of the submitted BERT models, the representation generated by
BERT for the initial [CLS] sub-token was fed into an output multi-label classification layer of
743—the number of unique codes present in the training, development-1 and development-2
CANTEMIST-CODING corpora (see Table 2)—units.</p>
      <p>Submission
ICB-UMA run1
ICB-UMA run2
ICB-UMA run3
ICB-UMA run4
ICB-UMA run5</p>
      <p>MAP</p>
      <p>
        In Table 3 and Table 4, we show the classification performance of our five submitted runs
on the CANTEMIST-CODING test corpus. In particular, Table 3 describes the results obtained
according to the main evaluation metric of the CANTEMIST-CODING subtask, i.e. the Mean
Average Precision (MAP) [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. The second column of the table shows the MAP values computed
considering all codes contained in the CANTEMIST-CODING test annotated corpus, whereas
the values presented in the last column (MAP No-Code) were calculated without considering
the overrepresented metastasis ICD-O-3 code (8000/6). According to the results observed in
Table 3, the Multilingual version of BERT outperformed the BERT-SciELO model, as ICB-UMA
run1, ICB-UMA run2 and ICB-UMA run3 systems achieved higher values than ICB-UMA run4
and ICB-UMA run5 systems for the two analysed metrics in the table. The best performance is
obtained by the Multilingual BERT model pre-trained on the Galén oncology corpus (ICB-UMA
run2), followed by the same model pre-trained on the bilingual medical corpus (ICB-UMA run3).
If we compare ICB-UMA run4 and ICB-UMA run5 systems, we can see that the BERT-SciELO
model further pre-trained on the Galén oncology corpus (ICB-UMA run5) outperformed the
original version of the BERT-SciELO model (ICB-UMA run4). Thus, the obtained results in
this work demonstrate that, both the Multilingual BERT and the BERT-SciELO models, when
adapted to the Spanish clinical texts domain by means of further pre-training their weights on
an unlabeled medical corpus in Spanish, outperformed the original version of the models on
the CANTEMIST-CODING task.
      </p>
      <p>
        On the other hand, to perform a more extensive analysis of the obtained results, the
organisers of the CANTEMIST track evaluated the classification performance of the submitted
systems according to a set of additional metrics. Hence, in Table 4, the second, third and fourth
columns present the computed values using precision (P), recall (R) and the F-score (F1) metrics,
respectively, taking into consideration all codes contained in the CANTEMIST-CODING test
subset, while the last three columns (P No-Code, R No-Code and F1 No-Code) show the results
calculated using the same three metrics but without considering the 8000/6 ICD-O-3 code. As
it can be seen from Table 4, our five submitted systems obtained really poor values for both
precision and F-score metrics, while for the recall metric the obtained values were unusually
high. The reason is that, with the goal of maximising the score obtained for the main evaluation
MAP metric, as performed in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], for each clinical case from the test corpus, we submitted
all ICD-O-3 codes considered by the classification system— 743, the number of units of the
output classification layer of the models—sorted by their predicted probability of occurrence in
descending order. On the contrary, if we had maximised precision, recall and F-score metrics,
instead of submitting all considered codes, we would have defined a classification threshold to
select only a subset of the codes according to their predicted probabilities.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusion</title>
      <p>
        In this paper, we present our contribution to the CANTEMIST-CODING subtask from the
CANTEMIST track [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], in the context of the IberLEF 2020. This shared task consisted in the
automatic assignment of ICD-O-3 codes to oncology clinical cases written in Spanish. We have
addressed the task as a multi-label text classification problem using BERT model [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. With the
goal of adapting BERT to the distinctive features of the CANTEMIST-CODING corpus, we have
applied an improved version of our fragment-based classification approach initially developed
for the CodiEsp-D task [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. In this way, using the available information for the
CANTEMISTNORM subtask, we converted the CANTEMIST-CODING multi-label long-text classification
task into a multi-label short-text classification problem, generating short fragments of text with
full semantic meaning annotated with ICD-O-3 codes. We experimented with two diferent
versions of the BERT-Base model, namely the Multilingual BERT [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and the BERT-SciELO [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]
models. The best classification performance among our five submitted systems was achieved
by the Multilingual BERT model further pre-trained on a medical corpus of Spanish oncology
clinical cases, obtaining a MAP score of 0.847 on the evaluation set. Besides, both Multilingual
BERT and BERT-SciELO models further pre-trained on a medical corpus outperformed the
original versions of the models on the CANTEMIST-CODING subtask, reinforcing the idea that
a clinical domain version of BERT achieves higher performance on medical classification tasks
that a non-clinical domain version of the model.
      </p>
      <p>
        In future works, given the promising results obtained by the Multilingual BERT model when
applied to the CANTEMIST corpus, we will tackle other Spanish medical text classification
tasks, such as CodiEsp-D subtask [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], using the Multilingual BERT fragment-based classification
approach developed in this work. Furthermore, we will investigate whether further pre-training
the model architecture using not only Spanish and English medical texts, but also French, Italian
or German clinical documents, could leverage all the cross-lingual features extracted by the
Multilingual BERT model and improve the results presented in this work for a Spanish oncology
text classification task.
      </p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>This work was partially supported by the project TIN2017-88728-C2-1-R, MINECO, Plan Nacional
de I+D+I, and I Plan Propio de Investigación y Transferencia of the Universidad de Málaga.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J.</given-names>
            <surname>Devlin</surname>
          </string-name>
          , M.-
          <string-name>
            <given-names>W.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Toutanova</surname>
          </string-name>
          , BERT:
          <article-title>Pre-training of deep bidirectional transformers for language understanding</article-title>
          ,
          <source>in: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</source>
          , Volume
          <volume>1</volume>
          , Association for Computational Linguistics, Minneapolis, Minnesota,
          <year>2019</year>
          , pp.
          <fpage>4171</fpage>
          -
          <lpage>4186</lpage>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>N19</fpage>
          -1423.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>G.</given-names>
            <surname>López-García</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Jerez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. J.</given-names>
            <surname>Veredas</surname>
          </string-name>
          ,
          <string-name>
            <surname>ICB-UMA at CLEF</surname>
          </string-name>
          e-Health
          <source>2020 Task</source>
          <volume>1</volume>
          :
          <string-name>
            <surname>Automatic</surname>
            <given-names>ICD</given-names>
          </string-name>
          <article-title>-10 coding in Spanish with BERT</article-title>
          , in: Working Notes of Conference and
          <article-title>Labs of the Evaluation Forum (CLEF)</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Google</given-names>
            <surname>Research</surname>
          </string-name>
          ,
          <string-name>
            <surname>Multilingual</surname>
            <given-names>BERT</given-names>
          </string-name>
          ,
          <year>2019</year>
          . URL: https://github.com/google-research/bert/ blob/master/multilingual.md.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>L.</given-names>
            <surname>Akhtyamova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Martínez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Verspoor</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Cardif</surname>
          </string-name>
          ,
          <article-title>Testing Contextualized Word Embeddings to Improve NER in Spanish Clinical Case Narratives, Preprint (Version 1</article-title>
          ) available at Research Square (
          <year>2020</year>
          ). doi:
          <volume>10</volume>
          .21203/rs.2.22697/v1.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>N.</given-names>
            <surname>Ribelles</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Jerez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Urda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Subirats</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Márquez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Quero</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Torres</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Franco</surname>
          </string-name>
          , E. Alba, Galén: Sistema de Información para la gestión y coordinación de procesos en un servicio de Oncología, FeSALUD
          <volume>6</volume>
          (
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>K.</given-names>
            <surname>Kreimeyer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Foster</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Pandey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Arya</surname>
          </string-name>
          , G. Halford,
          <string-name>
            <given-names>S. F.</given-names>
            <surname>Jones</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Forshee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Walderhaug</surname>
          </string-name>
          , T. Botsis,
          <article-title>Natural language processing systems for capturing and standardizing unstructured clinical information: A systematic review</article-title>
          ,
          <source>Journal of Biomedical Informatics</source>
          <volume>73</volume>
          (
          <year>2017</year>
          )
          <fpage>14</fpage>
          -
          <lpage>29</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.jbi.
          <year>2017</year>
          .
          <volume>07</volume>
          .012.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M. H.</given-names>
            <surname>Stanfill</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Williams</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. H.</given-names>
            <surname>Fenton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. A.</given-names>
            <surname>Jenders</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. R.</given-names>
            <surname>Hersh</surname>
          </string-name>
          ,
          <article-title>A systematic literature review of automated clinical coding and classification systems</article-title>
          ,
          <source>Journal of the American Medical Informatics Association</source>
          <volume>17</volume>
          (
          <year>2010</year>
          )
          <fpage>646</fpage>
          -
          <lpage>651</lpage>
          . doi:
          <volume>10</volume>
          .1136/jamia.
          <year>2009</year>
          .
          <volume>001024</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Pestian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Brew</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Matykiewicz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. J.</given-names>
            <surname>Hovermale</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Johnson</surname>
          </string-name>
          , K. B.
          <string-name>
            <surname>Cohen</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          <string-name>
            <surname>Duch</surname>
            ,
            <given-names>A Shared</given-names>
          </string-name>
          <string-name>
            <surname>Task</surname>
          </string-name>
          <article-title>Involving Multi-Label Classification of Clinical Free Text</article-title>
          ,
          <source>in: Proceedings of the Workshop on BioNLP</source>
          <year>2007</year>
          :
          <article-title>Biological, Translational, and Clinical Language Processing</article-title>
          , BioNLP '07,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computational Linguistics, USA,
          <year>2007</year>
          , p.
          <fpage>97</fpage>
          -
          <lpage>104</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A.</given-names>
            <surname>Perotte</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Pivovarov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Natarajan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Weiskopf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Wood</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Elhadad</surname>
          </string-name>
          ,
          <article-title>Diagnosis code assignment: models and evaluation metrics</article-title>
          ,
          <source>Journal of the American Medical Informatics Association</source>
          <volume>21</volume>
          (
          <year>2013</year>
          )
          <fpage>231</fpage>
          -
          <lpage>237</lpage>
          . doi:
          <volume>10</volume>
          .1136/amiajnl-2013-002159.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>M.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Fei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zeng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Pan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <source>Automated ICD-9 Coding via A Deep Learning Approach, IEEE/ACM Transactions on Computational Biology and Bioinformatics</source>
          <volume>16</volume>
          (
          <year>2019</year>
          )
          <fpage>1193</fpage>
          -
          <lpage>1202</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>D. F.</given-names>
            <surname>Vítores</surname>
          </string-name>
          ,
          <article-title>El español: una lengua viva</article-title>
          .
          <source>Informe</source>
          <year>2019</year>
          . Instituto Cervantes,
          <year>2019</year>
          . URL: https://www.cervantes.es/imagenes/File/espanol_lengua_viva_
          <year>2019</year>
          .pdf.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>A.</given-names>
            <surname>Miranda-Escalada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Farré</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Krallinger</surname>
          </string-name>
          ,
          <article-title>Named entity recognition, concept normalization and clinical coding: Overview of the CANTEMIST track for cancer text mining in Spanish, Corpus, Guidelines, Methods and Results</article-title>
          ,
          <source>in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF</source>
          <year>2020</year>
          ),
          <source>CEUR Workshop Proceedings</source>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>A.</given-names>
            <surname>Vaswani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Shazeer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Parmar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Uszkoreit</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Jones</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. N.</given-names>
            <surname>Gomez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Kaiser</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Polosukhin</surname>
          </string-name>
          ,
          <article-title>Attention is all you need</article-title>
          , in: I. Guyon,
          <string-name>
            <given-names>U. V.</given-names>
            <surname>Luxburg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bengio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wallach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Fergus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Vishwanathan</surname>
          </string-name>
          , R. Garnett (Eds.),
          <source>Advances in Neural Information Processing Systems</source>
          <volume>30</volume>
          ,
          <string-name>
            <surname>Curran</surname>
            <given-names>Associates</given-names>
          </string-name>
          , Inc.,
          <year>2017</year>
          , pp.
          <fpage>5998</fpage>
          -
          <lpage>6008</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A. E. W.</given-names>
            <surname>Johnson</surname>
          </string-name>
          , T. J.
          <string-name>
            <surname>Pollard</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Shen</surname>
            ,
            <given-names>L. H.</given-names>
          </string-name>
          <string-name>
            <surname>Lehman</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Feng</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Ghassemi</surname>
            , B. Moody, P. Szolovits,
            <given-names>L. A.</given-names>
          </string-name>
          <string-name>
            <surname>Celi</surname>
          </string-name>
          , R. G.
          <article-title>Mark, MIMIC-III, a freely accessible critical care database</article-title>
          ,
          <source>Scientific Data</source>
          <volume>3</volume>
          (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>M.</given-names>
            <surname>Johnson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Schuster</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q. V.</given-names>
            <surname>Le</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Krikun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Thorat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Viégas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Wattenberg</surname>
          </string-name>
          , G. Corrado,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hughes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Dean</surname>
          </string-name>
          ,
          <source>Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation, Transactions of the Association for Computational Linguistics</source>
          <volume>5</volume>
          (
          <year>2017</year>
          )
          <fpage>339</fpage>
          -
          <lpage>351</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Plan</surname>
            <given-names>TL</given-names>
          </string-name>
          -Sanidad,
          <article-title>The Sentence Splitter (SS) for Clinical Cases</article-title>
          Written in Spanish,
          <year>2019</year>
          . doi:
          <volume>10</volume>
          .5281/zenodo.2586995.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Google</surname>
            <given-names>Research</given-names>
          </string-name>
          , BERT,
          <year>2019</year>
          . URL: https://github.com/google-research/bert.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>L.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Gao</surname>
          </string-name>
          , J. Han,
          <article-title>On the Variance of the Adaptive Learning Rate and Beyond (</article-title>
          <year>2019</year>
          ). arXiv:
          <year>1908</year>
          .03265.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>C. D. Manning</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Raghavan</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Schütze</surname>
          </string-name>
          , Introduction to Information Retrieval, Cambridge University Press, USA,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>