<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Extracting Neoplasms Morphology Mentions in Spanish Clinical Cases through Word Embeddings</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Pilar López-Úbeda</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Manuel C. Díaz-Galiano</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>M. Teresa Martín-Valdivia</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>L. Alfonso Ureña-López</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>SINAI research group, University of Jaén</institution>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <fpage>324</fpage>
      <lpage>334</lpage>
      <abstract>
        <p>Biomedicine is an ideal environment for the use of Natural Language Processing (NLP), due to the huge amount of information processed and stored in electronic format. This information can be managed in diferent ways by NLP tasks such as the Named Entity Recognition (NER). To address this task, CANTEMIST is the first challenge specifically focusing on NER and named entity normalization of a critical type of concept related to cancer. For the entity standardization, the challenge proposes to use the ICD-O codes (International Classification of Diseases for Oncology, 3rd Edition - ICD-O-3). In this paper, we present an automated system based on neural networks for the extraction of tumor morphology mentions in Spanish clinical cases. In particular, we use a Bidirectional variant of Long Short Term Memory (BiLSTM) neural network with a Conditional Random Fields (CRF) layer. The input to this network is a combination of diferent word embeddings. In the NER task we achieved encouraging results obtaining 85.5% of F1-score. Moreover, a dictionary-based system is used to subsequently assign an ICD-O code to each annotated entity. For this subtask our group achieved 75.9% of F1-score.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Named Entity Recognition</kwd>
        <kwd>named entity normalization</kwd>
        <kwd>tumor morphology</kwd>
        <kwd>word embeddings</kwd>
        <kwd>neural networks</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>CANTEMIST is the first shared task specifically focusing on named entity recognition of a
critical type of concept related to cancer, namely tumor morphology. The CANTEMIST task is
structured into three independent subtasks, each taking into account a particular important use
case scenario:
• CANTEMIST-NER track: This subtask consists of automatically finding tumor morphology
mentions.
• CANTEMIST-NORM track: The second subtask requires returning all tumor morphology
entity mentions together with their corresponding ICD-O codes (Spanish version:
eCIEO-3.1), i.e. finding and normalizing tumor morphology mentions. This subtask is also
known as clinical concept normalization or named entity normalization.
• CANTEMIST-CODING track: The last subtask requires returning for each of document a
ranked list of its corresponding ICD-O-3 codes.</p>
      <p>This paper describes the system presented by the SINAI team for the first two subtasks
proposed by the organizers of the CANTEMIST challenge: NER track and named entity
normalization track.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related work</title>
      <p>
        In the biomedical domain, NER systems identify entities from clinical patient reports. Several
NER systems have been developed using NLP-based systems such as MedLEE [2], MetaMap [
        <xref ref-type="bibr" rid="ref1">3</xref>
        ]
and cTAKES [
        <xref ref-type="bibr" rid="ref2">4</xref>
        ]). Most of these are rule-based systems that use extensive medical vocabularies.
      </p>
      <p>
        Approaches for NER can be classified into diferent categories [
        <xref ref-type="bibr" rid="ref3">5</xref>
        ]: dictionary-based,
rulebased and machine learning-based. Specifically, our group has experience in the NER task in
the biomedical domain using diferent methodologies such as traditional machine learning
[
        <xref ref-type="bibr" rid="ref4">6</xref>
        ], Recurrent Neural Networks (RNNs) [
        <xref ref-type="bibr" rid="ref5">7</xref>
        ] and unsupervised machine learning [
        <xref ref-type="bibr" rid="ref6 ref7">8, 9</xref>
        ], among
others.
      </p>
      <p>
        Current state-of-the-art approaches to the NER task propose the use of RNNs to learn useful
representations automatically because they facilitate the modeling of long-distance dependencies
between words in a sentence. These networks usually rely on word embeddings, which represent
words as vectors of numbers. There are diferent types of word embeddings: classical [
        <xref ref-type="bibr" rid="ref8 ref9">10, 11</xref>
        ],
character-level [
        <xref ref-type="bibr" rid="ref10">12</xref>
        ] and contextualized [
        <xref ref-type="bibr" rid="ref11">13</xref>
        ] which are commonly pre-trained over very large
corpora to capture latent syntactic and semantic similarities between words.
      </p>
      <p>
        Based on the success of the previous challenges focused on the NER task in the
biomedical domain (PharmaCoNER [
        <xref ref-type="bibr" rid="ref12">14</xref>
        ], eHealth-KD [
        <xref ref-type="bibr" rid="ref13">15</xref>
        ], eHealth CLEF [16], MEDDOCAN [17],
CHEMDNER [18], i2b2 [19]), CANTEMIST is the first shared task specifically focusing on the
extraction of a critical type of concept related to cancer, namely tumor morphology or neoplasms
morphology.
      </p>
      <p>Following the neural network proposed by Huang et al [20], our work uses the Bidirectional
variant of Long Short Term Memory along with a stacked Conditional Random Fields decoding
layer (BiLSTM-CRF) to extract the tumor morphology mentions in Spanish biomedical literature.
We also evaluate the usefulness of some word embedding in two diferent ways: independently
and in combination. Subsequently, we use an unsupervised dictionary-based method in order
to automatically assign a ICD-O code to each entity detected.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Dataset</title>
      <p>The CANTEMIS corpus was manually annotated by clinical experts following the guidelines1.
The CANTEMIST corpus is composed of clinical cases and is divided into diferent sets: training,
development and test set. The task organizers have also provided two development sets (dev 1
and dev 2) so that participants can train their systems more accurately. Some statistics of the
CANTEMIST corpus can be found in Table 1. In this table you can see information related to
the number of annotated entities, number of ICD-O codes, information related to sentences and
vocabularies, among others.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Methodology</title>
      <p>The workflow to address the proposed task in CANTEMIST challenge consists of two sequential
steps, first detecting tumor morphology mentions in Spanish clinical documents, and
subsequently the extracted entities must be assigned to a unique identifier code using ICD-O
terminology. This section is organized according to each subtask carried out by our team: NER
track and named entity normalization track.</p>
      <sec id="sec-4-1">
        <title>4.1. Named entity recognition</title>
        <p>The approach used to address in the first subtask of the CANTEMIST challenge is based on
deep learning by implementing the RNN proposed by Huang et al. [20]. Specifically, we have
used a BiLSTM with a sequential CRF.</p>
        <sec id="sec-4-1-1">
          <title>4.1.1. Word embedding</title>
          <p>RNNs generally use an embedding layer as an input, which makes it possible to represent words
and documents using a dense vector representation. Word embeddings are a type of word
representation that allows words with similar meanings to have a similar representation.</p>
          <p>Our first approach involves combining diferent word embeddings to form the input layer to
the proposed deep neural network. Each word representation used is explained in detail below:
- FastText embeddings trained over Wikipedia. This type of word embeddings are
considered classic because they are static and word-level, meaning that each distinct word
receives exactly one pre-computed embedding. Our experiments use FastText2 embeddings
trained over Spanish Wikipedia and size 100.</p>
          <p>- Spanish Medical Embeddings (SME). Although there are available biomedical word
embeddings for Spanish [21, 22, 23], we have tried to generate new ones from existing corpora
related to the biomedical domain in Spanish. For this purpose, firstly we extracted the Spanish
corpus from MeSpEN [24]. Later, extra information in Spanish from diferent clinical information
websites such as Mayo Clinic [25], World Health Organization [26], and WebMD [27] was added
to the former corpus. The pre-processing carried out to train the word embeddings consisted
of converting the text to lowercase, removing the URLs, and removing the multi-lines. Finally,
FastText [28] was used to perform the training by applying the following setup: skip-gram
model, 0.05 for the learning rate, size of 300 for the word vectors, 10 for the number of epochs
and 5 for the minimal number of word occurrences.</p>
          <p>- Contextual word embedding (CWE). This word embeddings capture latent
syntacticsemantic information that goes beyond standard word embeddings [29]. This representation
treats text as distributions over characters and is capable of generating embeddings for any
string of characters within any textual context, in other words, the same word will have
diferent embeddings depending on its contextual use. For our experiments, we used the pooled
contextualized embeddings proposed by Akbik et al. [30] to help with the recognition of tumor
morphology mentions.</p>
        </sec>
        <sec id="sec-4-1-2">
          <title>4.1.2. BiLSTM-CRF model</title>
          <p>For the entity recognition task, the annotations provided were encoded by using the BIO tagging
scheme. Thus each token in a sentence was labeled with O (non-entity), B (beginning token of
an entity), or I (inside token of an entity). This scheme is the most popular in the NER task.</p>
          <p>The architectures of BiLSTM-CRF model is illustrated in Figure 1. This architecture is similar
to the proposal by Huang et al. [20], Lample et al. [31], Ma and Hovy [32]. This model is
described in a layered way:
1 Embedding combination layer. Firtly, each sentence is divided by tokens using the SPACCC
POS Tagger tool3. The sentence is represented as a sequence of words  = ( 1,  2,  3, ...,   ),
where  is the length of the sentence. In addition, each word is represented as a vector
of concatenated embeddings. In our approach we use the word embeddings mentioned
previously: FastText, SME and CWE.</p>
          <p>2https://fasttext.cc
3DOI: 10.5281/zenodo.2560344
CRF layer</p>
          <p>Tahn layer
BiLSTM layer</p>
          <p>CWE layer
SME layer
GloVe layer
Carcinoma microcítico
de
pulmón
...</p>
          <p>tumor
.
2 BiLSTM layer. The embeddings are given as input to a BiLSTM layer. In this layer, BiLSTM
layer is to split the neurons of a regular LSTM into two directions, a forward state computes a
representation ℎ⃖⃖⃗ of the sequence from left to right at every word  , and another for negative
time direction (backward states) computes a representation ⃖ℎ⃖⃖ . Using two time directions,
input information from the past and future of the current time frame can be used to better
understand the context of the medical report. The representation of a word ℎ = [ ℎ⃖⃖⃗; ⃖ℎ⃖⃖ ] is
obtained by concatenating its forward and backward states[31].
3 Tahn layer. Next, tahn layer is used to predict confidence scores for each word according to
the existing labels in the corpus.
4 CRF layer. Finally, CRF layer combines the advantage of graphical modeling to predict
multivariate output, in other words, this layer decodes the best label in all possible labels
[33].</p>
          <p>For the implementation, we employed Flair [34]. Flair is a simple framework for NLP tasks
including NER. Flair is used with the following configuration: learning rate as 0.1, dropout as
0.5, maximum epoch as 150, 300 neurons with tahn activation function and a batch size of 32.</p>
        </sec>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Named entity normalization</title>
        <p>The second subtask consists of assigning a unique ICD-O-10 identifier to each entity
annotated in subtask one. For this subtask we use an unsupervised dictionary-based method. The
methodology followed for the selection of an identifier is shown below:
1 ICD-O dictionary creation.</p>
        <p>In order to generate the dictionary, we use the descriptions and ICD-0 codes obtained from
the oficial website 4 and the training set provided by the organizers. The main purpose was
to generate a tuple containing the ICD-O code and the description as shown above:
&lt; 8000/6, neoplasia metastásica &gt;
&lt; 8000/6, émbolo tumoral &gt;
&lt; 8000/6, tumor metastásico &gt;
&lt; 8000/3, neoplasia maligna &gt;
&lt; 8000/3, cáncer &gt;
&lt; 8000/3, malignidad &gt;
As we can see, the same code corresponds to several descriptions, which means that these
are synonymous. All descriptions are in lowercase in order to obtain a better future match.
2 Levenshtein distance.</p>
        <p>The second step is to compute the Levenshtein distance between the recognized entity and
each dictionary description. Levenshtein distance is the minimum number of operations
required to transform one character string into another. For instance, for the entity cáncer,
the output provided by this step would be as shown below::
&lt; 8000/6, neoplasia metastásica, 20 &gt;
&lt; 8000/6, émbolo tumoral, 13 &gt;
&lt; 8000/6, tumor metastásico, 16 &gt;
&lt; 8000/3, neoplasia maligna, 17 &gt;
&lt; 8000/3, cáncer, 0 &gt;
&lt; 8000/3, malignidad, 9 &gt;
According to our previous example, cáncer gets a value of 0 with the code 8000/3 - cáncer.
However, we can see that the entity cáncer gets other values for each description, for example,
the distance between cáncer and émbolo tumoral is 13.
3 Code ranking.</p>
        <p>The last step involves choosing from all the calculated distance values. To do this, we sort
the dictionary by Levenshtein distance and choose the first ICD-O code from the list. In this
way we assign a unique identifier to the entity.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Experimental setup and results</title>
      <p>The task organizers provided several datasets (training, development and test) to allow the
participants to train their systems properly. For all scenarios, we used the training set and
the number 1 development set (22,022 + 10,847 sentences respectively) for training, while the
number 2 development set (9,917 sentences) was used to validate our system. We decided to
include the number 1 development set in the training since it has a greater number of annotated
entities than the number 2 development set.</p>
      <p>4https://eciemaps.mscbs.gob.es/ecieMaps/browser/index_o_3.html</p>
      <p>Precision (%) Recall (%) F1-score (%)
System
FastText
SME
CWE
SME + CWE
SME + CWE + FastText</p>
      <p>The metrics defined by the CANTEMIST challenge to evaluate the submitted experiments are
those commonly used for some NLP tasks such as NER or text classification, namely precision,
recall, and F1-score. Table 2 and 3 shows the results obtained by the SINAI team for the first
and the second subtask respectively.</p>
      <p>As we can see in Table 2, the results are encouraging. The combination of diferent word
embeddings improves the use of each one of them separately. This means that each word
embeddings used provides a meaningful representation of each word, which helps the neural
network to learn to detect the entity.</p>
      <p>The use of the classic word embeddings trained on wikipedia (FastText) obtains the worst
value of precision, recall and F1. On the other hand, the use of the word embeddings trained on
a corpus related to medicine improves and achieves 84.5 of F1 and gets the best recall (85.5%).
SME word embeddings also improve CWE results, although in this case the diference is not very
significant. These results demonstrate that using resources trained on a specific domain helps
the task to be solved. In our case, training some word embeddings related to the biomedical
domain helps the recognition of named entities.</p>
      <p>Our best result (taking into account the F1 measure) was obtained by combining two types of
word embeddings: SME + CWE. With this combination we achieved an 85.5% of F1, an 85.9% of
precision and an 85.1% of recall.</p>
      <p>In addition, we were also able to check the use of FastText word embeddings trained with
Wikipedia in their combination (SME + CWE + FastText) but they reached worse values than
without them.</p>
      <p>Table 3 summarizes the results obtained for the second subtask (named entity normalization)
by the SINAI team. In this subtask, we use the output of the previous task to standardize the
recognized entities. Taking into account the F1-score for the extraction of neoplastic tumours,
we found that the combination of word embeddings obtains the same percentage (95.9%). In
contrast, if we consider the measure F1 for non-metastasis recognition, the best value of F1 was
obtained using the combination of the three word representations (SME + CWE + FastText).</p>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusion and future work</title>
      <p>This paper presents the participation of the SINAI group in the CANTEMIST challenge. Our
group has participated in two subtasks proposed by the challenge: NER track and named entity
normalization track. Both subtasks are considered relevant in the NLP because they are related
to information extraction and standardization in the biomedical domain through an oncology
dictionary. More specifically, the first subtask addresses the task of extracting mentions of
neoplasms morphology mentions, and the second subtask involves assigning an ICD-O code to
each previously recognized entity.</p>
      <p>In the first subtask our proposal follows a deep learning-based approach for NER in Spanish
clinical cases. This methodology is focused on the use of a BiLSTM-CRF where diferent
word embeddings are combined as input to the architecture. Our main goal was to prove the
performance of diferent types of word embeddings for the NER task in the medical domain:
own-generated medical embeddings (SME), contextualized embeddings (CWE) and FastText
embeddings trained over Wikipedia. The obtained results are encouraging for the NER task
achieving 85.5% of F1-score, 85.9% of precision and 85.1% of recall. On the other hand, in
the second subtask we use an unsupervised dictionary-based method. In this scenario, the
combination of diferent word representations also obtained the best value achieving 75.9% of
F1-score, 76.4% of precision and 75.4% of recall. For future work we plan to improve our entity
detection system using new transfer learning techniques. In addition, there are available
pretrained models for the biomedical domain such as BioBERT that could be taken into consideration.
Although BioBERT is in English, an ideal scenario would be the generation of a new model for
Spanish through a large biomedical corpus.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments References</title>
      <p>This work has been partially supported by LIVING-LANG project (RTI2018-094653-B-C21) from
the Spanish Government and Fondo Europeo de Desarrollo Regional (FEDER).</p>
      <p>[1] A. Miranda-Escalada, E. Farré, M. Krallinger, Named entity recognition, concept
normalization and clinical coding: Overview of the cantemist track for cancer text mining in
spanish, corpus, guidelines, methods and results, in: Proceedings of the Iberian Languages
Evaluation Forum (IberLEF 2020), CEUR Workshop Proceedings, 2020.
[2] C. Friedman, Towards a comprehensive medical language processing system: methods and
issues., in: Proceedings of the AMIA annual fall symposium, American Medical Informatics
Association, 1997, p. 595.
[16] L. Kelly, H. Suominen, L. Goeuriot, M. Neves, E. Kanoulas, D. Li, L. Azzopardi, R. Spijker,
G. Zuccon, H. Scells, et al., Overview of the clef ehealth evaluation lab 2019, in:
International Conference of the Cross-Language Evaluation Forum for European Languages,
Springer, 2019, pp. 322–339.
[17] M. Marimon, A. Gonzalez-Agirre, A. Intxaurrondo, H. Rodríguez, J. A. Lopez Martin,
M. Villegas, M. Krallinger, Automatic de-identification of medical texts in spanish: the
meddocan track, corpus, guidelines, methods and evaluation of results, in: Proceedings of
the Iberian Languages Evaluation Forum (IberLEF 2019), volume TBA, CEUR Workshop
Proceedings (CEUR-WS.org), Bilbao, Spain, 2019, p. TBA. URL: TBA.
[18] M. Krallinger, F. Leitner, O. Rabal, M. Vazquez, J. Oyarzabal, A. Valencia, Chemdner: The
drugs and chemical names extraction challenge, Journal of cheminformatics 7 (2015) S1.
[19] W. Sun, A. Rumshisky, O. Uzuner, Evaluating temporal relations in
clinical text: 2012 i2b2 Challenge, Journal of the American
Medical Informatics Association 20 (2013) 806–813. URL: https://doi.org/
10.1136/amiajnl-2013-001628. doi:10.1136/amiajnl-2013-001628.
arXiv:https://academic.oup.com/jamia/article-pdf/20/5/806/17374624/20-5-806.pdf.
[20] Z. Huang, W. Xu, K. Yu, Bidirectional lstm-crf models for sequence tagging, arXiv preprint
arXiv:1508.01991 (2015).
[21] F. Soares, M. Villegas, A. Gonzalez-Agirre, M. Krallinger, J. Armengol-Estapé, Medical
word embeddings for Spanish: Development and evaluation, in: Proceedings of the
2nd Clinical Natural Language Processing Workshop, Association for Computational
Linguistics, Minneapolis, Minnesota, USA, 2019, pp. 124–133. URL: https://www.aclweb.
org/anthology/W19-1916. doi:10.18653/v1/W19-1916.
[22] S. Santiso, A. Casillas, A. Pérez, M. Oronoz, Word embeddings for negation
detection in health records written in Spanish, Soft Computing (2019). doi:10.1007/
s00500-018-3650-7.
[23] I. Segura-Bedmar, P. Martínez, Simplifying drug package leaflets written in Spanish
by using word embedding, Journal of Biomedical Semantics (2017). doi:10.1186/
s13326-017-0156-7.
[24] M. Villegas, A. Intxaurrondo, A. Gonzalez-Agirre, M. Marimon, M. Krallinger, The MeSpEN
resource for English-Spanish medical machine translation and terminologies: census of
parallel corpora, glossaries and term translations, LREC MultilingualBIO: Multilingual
Biomedical Text Processing (Malero M, Krallinger M, Gonzalez-Agirre A, eds.) (2018).
[25] Mayo clinic, 1998-2020. URL: https://www.mayoclinic.org/es-es.
[26] Organización mundial de la salud, 2020. URL: https://www.who.int/es.
[27] Webmd - better information. better health., 2005-2020. URL: https://www.webmd.com/.
[28] P. Bojanowski, E. Grave, A. Joulin, T. Mikolov, Enriching word vectors with subword
information, Transactions of the Association for Computational Linguistics 5 (2017)
135–146.
[29] A. Akbik, D. Blythe, R. Vollgraf, Contextual string embeddings for sequence labeling, in:
Proceedings of the 27th International Conference on Computational Linguistics, 2018, pp.
1638–1649.
[30] A. Akbik, T. Bergmann, R. Vollgraf, Pooled contextualized embeddings for named entity
recognition, in: Proceedings of the 2019 Conference of the North American Chapter
of the Association for Computational Linguistics: Human Language Technologies,
Volume 1 (Long and Short Papers), Association for Computational Linguistics,
Minneapolis, Minnesota, 2019, pp. 724–728. URL: https://www.aclweb.org/anthology/N19-1078.
doi:10.18653/v1/N19-1078.
[31] G. Lample, M. Ballesteros, S. Subramanian, K. Kawakami, C. Dyer, Neural architectures
for named entity recognition, arXiv preprint arXiv:1603.01360 (2016).
[32] X. Ma, E. Hovy, End-to-end sequence labeling via bi-directional lstm-cnns-crf, arXiv
preprint arXiv:1603.01354 (2016).
[33] J. Laferty, A. McCallum, F. C. Pereira, Conditional random fields: Probabilistic models for
segmenting and labeling sequence data (2001).
[34] A. Akbik, T. Bergmann, D. Blythe, K. Rasul, S. Schweter, R. Vollgraf, FLAIR: An easy-to-use
framework for state-of-the-art NLP, in: Proceedings of the 2019 Conference of the North
American Chapter of the Association for Computational Linguistics (Demonstrations),
Association for Computational Linguistics, Minneapolis, Minnesota, 2019, pp. 54–59. URL:
https://www.aclweb.org/anthology/N19-4010. doi:10.18653/v1/N19-4010.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A. R.</given-names>
            <surname>Aronson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.-M.</given-names>
            <surname>Lang</surname>
          </string-name>
          ,
          <article-title>An overview of metamap: historical perspective and recent advances</article-title>
          ,
          <source>Journal of the American Medical Informatics Association</source>
          <volume>17</volume>
          (
          <year>2010</year>
          )
          <fpage>229</fpage>
          -
          <lpage>236</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>G. K.</given-names>
            <surname>Savova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. J.</given-names>
            <surname>Masanz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. V.</given-names>
            <surname>Ogren</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sohn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. C.</given-names>
            <surname>Kipper-Schuler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. G.</given-names>
            <surname>Chute</surname>
          </string-name>
          ,
          <article-title>Mayo clinical text analysis and knowledge extraction system (ctakes): architecture, component evaluation and applications</article-title>
          ,
          <source>Journal of the American Medical Informatics Association</source>
          <volume>17</volume>
          (
          <year>2010</year>
          )
          <fpage>507</fpage>
          -
          <lpage>513</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>D.</given-names>
            <surname>Campos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Matos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Oliveira</surname>
          </string-name>
          ,
          <article-title>Biomedical named entity recognition: a survey of machine-learning tools, Theory and Applications for Advanced Text Mining (</article-title>
          <year>2012</year>
          )
          <fpage>175</fpage>
          -
          <lpage>195</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>P. L.</given-names>
            <surname>Úbeda</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. C. D. Galiano</surname>
            ,
            <given-names>M. T.</given-names>
          </string-name>
          <string-name>
            <surname>Martín-Valdivia</surname>
            ,
            <given-names>L. A. U. Lopez,</given-names>
          </string-name>
          <article-title>Using machine learning and deep learning methods to find mentions of adverse drug reactions in social media</article-title>
          ,
          <source>in: Proceedings of the Fourth Social Media Mining for Health Applications (# SMM4H) Workshop &amp; Shared Task</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>102</fpage>
          -
          <lpage>106</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>P. L.</given-names>
            <surname>Úbeda</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. C. D. Galiano</surname>
            ,
            <given-names>L. A. U.</given-names>
          </string-name>
          <string-name>
            <surname>Lopez</surname>
            ,
            <given-names>M. T.</given-names>
          </string-name>
          <string-name>
            <surname>Martín-Valdivia</surname>
          </string-name>
          ,
          <article-title>Using Snomed to recognize and index chemical and drug mentions</article-title>
          ,
          <source>in: Proceedings of The 5th Workshop on BioNLP Open Shared Tasks</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>115</fpage>
          -
          <lpage>120</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>P.</given-names>
            <surname>López-Ubeda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Dıaz-Galiano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. T.</given-names>
            <surname>Martın-Valdivia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. A.</given-names>
            <surname>Urena-López</surname>
          </string-name>
          ,
          <article-title>Sinai en tass 2018 task 3. clasificando acciones y conceptos con umls en medline</article-title>
          ,
          <source>Proceedings of TASS 2172</source>
          (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>P.</given-names>
            <surname>López-Úbeda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Díaz-Galiano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Montejo-Ráez</surname>
          </string-name>
          , M.-
          <string-name>
            <given-names>T.</given-names>
            <surname>Martín-Valdivia</surname>
          </string-name>
          ,
          <string-name>
            <surname>L. A.</surname>
          </string-name>
          <article-title>UreñaLópez, An Integrated Approach to Biomedical Term Identification Systems</article-title>
          ,
          <source>Applied Sciences</source>
          <volume>10</volume>
          (
          <year>2020</year>
          )
          <fpage>1726</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J.</given-names>
            <surname>Pennington</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Socher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Manning</surname>
          </string-name>
          , GloVe:
          <article-title>Global vectors for word representation</article-title>
          ,
          <source>in: EMNLP 2014 - 2014 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference</source>
          ,
          <year>2014</year>
          . doi:
          <volume>10</volume>
          .3115/v1/d14-
          <fpage>1162</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>T.</given-names>
            <surname>Mikolov</surname>
          </string-name>
          , I. Sutskever,
          <string-name>
            <given-names>K.</given-names>
            <surname>Chen</surname>
          </string-name>
          , G. Corrado,
          <string-name>
            <given-names>J.</given-names>
            <surname>Dean</surname>
          </string-name>
          ,
          <article-title>Distributed representations ofwords and phrases and their compositionality</article-title>
          ,
          <source>in: Advances in Neural Information Processing Systems</source>
          ,
          <year>2013</year>
          . arXiv:
          <volume>1310</volume>
          .
          <fpage>4546</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>G.</given-names>
            <surname>Lample</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ballesteros</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Subramanian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Kawakami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Dyer</surname>
          </string-name>
          ,
          <article-title>Neural architectures for named entity recognition, in: 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</article-title>
          ,
          <source>NAACL HLT 2016 - Proceedings of the Conference</source>
          ,
          <year>2016</year>
          . doi:
          <volume>10</volume>
          .18653/v1/n16-
          <fpage>1030</fpage>
          . arXiv:
          <volume>1603</volume>
          .
          <fpage>01360</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>M. E.</given-names>
            <surname>Peters</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Neumann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Iyyer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gardner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Clark</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zettlemoyer</surname>
          </string-name>
          ,
          <article-title>Deep contextualized word representations</article-title>
          ,
          <source>in: NAACL HLT</source>
          <year>2018</year>
          <article-title>- 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies -</article-title>
          <source>Proceedings of the Conference</source>
          ,
          <year>2018</year>
          . doi:
          <volume>10</volume>
          .18653/v1/n18-
          <fpage>1202</fpage>
          . arXiv:
          <year>1802</year>
          .05365.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A. G.</given-names>
            <surname>Agirre</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Marimon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Intxaurrondo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Rabal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Villegas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Krallinger</surname>
          </string-name>
          , Pharmaconer:
          <article-title>Pharmacological substances, compounds and proteins named entity recognition track</article-title>
          ,
          <source>in: Proceedings of The 5th Workshop on BioNLP Open Shared Tasks</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>10</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>A.</given-names>
            <surname>Piad-Morfis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Gutiérrez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Estevez-Velarde</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Almeida-Cruz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Muñoz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Montoyo</surname>
          </string-name>
          ,
          <source>Overview of the eHealth Knowledge Discovery Challenge at IberLEF</source>
          <year>2020</year>
          , in:
          <source>Proceedings of the Iberian Languages Evaluation Forum (IberLEF</source>
          <year>2020</year>
          ),
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>