<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>LinkMed: Entity Recognition and Relation Extraction from Clinical Notes in Spanish</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Carlos Muñoz-Castro</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andrés Carvallo</string-name>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Matías Rojas</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Claudio Aracena</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rodrigo Guerra</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Benjamín Pizarro</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jocelyn Dunstan</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Center for Mathematical Modeling, Universidad de Chile</institution>
          ,
          <country country="CL">Chile</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Computer Science, Pontificia Universidad Católica de Chile</institution>
          ,
          <country country="CL">Chile</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Faculty of Physical and Mathematical Sciences, Universidad de Chile</institution>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Institute for Mathematical and Computational Engineering, Pontificia Universidad Católica de Chile</institution>
          ,
          <country country="CL">Chile</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Millenium Institute Foundational Research on Data</institution>
          ,
          <country country="CL">Chile</country>
        </aff>
        <aff id="aff5">
          <label>5</label>
          <institution>National Center for Artificial Intelligence</institution>
          ,
          <country country="CL">Chile</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <abstract>
        <p>Relation extraction is an essential component of Natural Language Processing (NLP) and significantly influences information retrieval and structured information extraction. Within clinical notes, the task is needed to establish connections among illnesses, therapies, indications, and other medical concepts. Motivated by the above, in this work, we propose a two-step model approach for entity linking; in the first step, we solve entity recognition, and in the second, a relation classification approach. We evaluated our approach in a Spanish corpus of the TESTLINK challenge in IberLEF2023 (Iberian Languages Evaluation Forum), comprising 81 clinical notes to train and 80 clinical notes to test. Our results show competitive performance with a precision of 0.47, recall of 0.43, and F1-score of 0.45, presenting an efective strategy for relation extraction from clinical notes in Spanish.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Natural Language Processing</kwd>
        <kwd>Link prediction</kwd>
        <kwd>Named Entity Recognition</kwd>
        <kwd>Clinical Text</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>The increasing complexity of healthcare services has accentuated the importance of clinical
notes as indispensable sources of insights into patients’ health conditions. These documents
contain data from clinical visits, physical examinations, diagnoses, and follow-up treatments
and often encompass critical outcomes of laboratory tests and measurements - integral elements
in disease and disorder diagnosis. However, the extraction of these pertinent data, particularly
from Spanish-language documents, has yet to be explored, and research on this field, especially
in the Spanish language, still needs to be explored.</p>
      <p>
        This paper introduces LinkMed, our proposed method, to the TESTLINK task at IberLEF2023
[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. This task, grounded in clinical cases from the E3C corpus, poses a challenge of relation
extraction from clinical narratives. It demands identifying test results and measurements within
the text, establishing links between these results, and the textual mentions of the corresponding
laboratory tests and measurements.
      </p>
      <p>Difering from a conventional Named Entity Recognition (NER) task, this challenge requires
the interpretation of numeric values and ranges, establishing its identity as a Relation Extraction
(RE) task that considers elements involved in the relation and its directionality.</p>
      <p>Our approach, LinkMed, introduces a two-step solution to this challenge. Initially, a Named
Entity Recognition (NER) model is used to identify potential entities of interest in the clinical
notes that could be linked. The NER approach is followed by a relation classification system on
all the combinations of found pairs to ascertain the existence of a valid relation between those
identified entities. Our method mainly addresses the entity-linking problem in Spanish clinical
notes.</p>
      <p>This paper presents an in-depth description of our proposed solution to the TESTLINK
task. While the challenge encompasses both Spanish and Basque languages, our work focuses
explicitly on the former, thus enriching resources available for Spanish-language clinical
documents and promoting thorough patient care and clinical decision-making processes within this
linguistic context.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related work</title>
      <p>
        Entity linking for medical text analysis presents a unique challenge in non-English corpora,
exacerbated by the scarcity of resources available for other languages [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. The methodologies
applied to address this task predominantly fall into two distinct categories: rule-based systems
and machine learning-based approaches.
      </p>
      <p>
        The source of rule-based systems, initially designed to facilitate medical evidence searches in
databases like MEDLINE by identifying specific medical terms in texts [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], marked a significant
breakthrough. Systems such as CLARIT [4], SAPHIRE [5], and MetaMap [6] capitalized on
linguistic rules and dictionaries to map concept mentions to Medical Subject Headings (MeSH)
terms, significantly enhancing the interpretability and accessibility of medical text data.
Subsequent systems such as CHARTLINE [7] and MedLEE [8] expanded upon these ideas, employing
dictionary-matching techniques to extract and link entities within clinical reports to the Unified
Medical Language System (UMLS). Innovations like REX [9] pushed boundaries by linking
clinical note mentions to ICD-9-CM codes, thereby aiding medical record coding. However, the
main limitation of rule-based systems is their struggle with semantic understanding and the
diverse terminology present in clinical narratives [10, 11, 12].
      </p>
      <p>
        On the other hand, machine learning-based methods transitioned entity linking from a mere
matching problem to a complex mapping task, leveraging numerical representations of mentions
and concepts [13]. The emergence of deep learning techniques and contextual embeddings, such
as ELMo [14] and BERT [15], caused a paradigm shift in entity-linking research. Currently, the
majority of state-of-the-art systems employ deep contextualized embeddings, combining these
with a variety of methods, including binary [16], multi-class [17], and clustering approaches
[
        <xref ref-type="bibr" rid="ref4">18</xref>
        ]. However, a persistent challenge in this domain is the scarcity of resources for efective
training of entity-linking models.
      </p>
      <p>
        Despite significant advances in Spanish language models such as BETO [
        <xref ref-type="bibr" rid="ref5">19</xref>
        ] and DistillBETO
[
        <xref ref-type="bibr" rid="ref6">20</xref>
        ], along with the development of their evaluation frameworks for both general-domain
[
        <xref ref-type="bibr" rid="ref7 ref8">21, 22</xref>
        ] and biomedical [
        <xref ref-type="bibr" rid="ref10 ref11 ref9">23, 24, 25</xref>
        ] contexts, the specific problem of entity linking in medical
texts remains relatively unexplored.
      </p>
      <p>This work proposes a methodology that integrates the strengths of machine-learning
focused on natural language processing and a pairwise text classification approach. In detail,
we first utilize a transformer-based Named Entity Recognition (NER) model to identify potential
entities, followed by a relation classification method to determine potential links. This novel
approach addresses the present limitations in entity-linking solutions and contributes to the
underexplored area of Spanish entity-linking in medical texts.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Dataset</title>
      <p>
        We assessed the method used for clinical cases extracted from the E3C corpus [
        <xref ref-type="bibr" rid="ref12">26</xref>
        ], featured in
the TESTLINK challenge in IberLEF2023 [
        <xref ref-type="bibr" rid="ref1 ref13 ref14">1, 27, 28</xref>
        ], with a primary emphasis on documents in
the Spanish language. This clinical corpus represents a medical history of diferent anonymized
patients, where we can find; a general patient description, the reason for a visit, the medical
history associated with the consultation, the diagnosis, the results of treatments, and more.
      </p>
      <p>
        The composition of the dataset is 81 documents for training and 80 for testing; the first has
597, and the second has 668 annotated relations for humans. Both are under PubTator format
[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], indicating an ordered pair of entity mentions (i.e., RML, EVENT ). RML (Resource Mapping
Language) is a tag created to mark test results, and the tag EVENT corresponds to activities,
conditions, and situations significant to an individual’s medical history.
      </p>
      <p>100001 |t| Paciente de 65 a. de edad, que presentaba una elevación progresiva de las cifras de PSA
desde 6 ng/ml a 12 ng/ml en el último año. Dicho paciente había sido sometido un año antes a una biopsia
transrectal de próstata ecodirigida por sextantes que fue negativa. Se decide, ante la elevación del PSA,
realizar una E-RME previa a la 2ª biopsia transrectal, en la que se objetiva una lesión hipointensa que
abarca zona central i periférica del ápex del lóbulo D prostático. El estudio espectroscópico de ésta lesión
mostró una curva de colina discretamente más elevada que la curva de citrato, con un índice de Ch-Cr/Ci
&gt; 0,80, que sugería la presencia de lesión neoplásica, por lo que se biopsia dicha zona por ecografía
transrectal. La AP de la biopsia confirmó la presencia de un ADK próstata Gleason 6.</p>
      <p>
        An example of the task is shown in Figure 1. It can be seen the task of entity linking within
clinical narratives in Spanish presents intricate complexity. One element contributing to this
complexity is the variable context size surrounding an entity pair; it may be either broad
or minimal. Moreover, the entities may comprise multiple tokens, adding further dificulty.
Additionally, the directionality of relationships between these entities is subject to change,
further complicating the task. The multifaceted structure of this task underscores the fact that
the TESTLINK challenge [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] is far from a simple problem to solve, instead manifesting as a
compelling challenge within the realm of Clinical Natural Language Processing (NLP). Thus, it
is essential to explore and decipher these complexities to advance in the field and improve the
eficiency and accuracy of clinical data interpretation.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. Proposed Model</title>
      <p>To address the relation extraction task, we propose LinkMed, a deep learning model based on
two sequential steps; entity recognition and the relation classification modules. Specifically,
two models were created: a NER and a relation classification model. The NER model obtains
mentions of tag events and their corresponding results. Then, the classification model takes
those pair of mentions and predicts whether there is a relation between them. Figure 2 depicts
the relation between the two used modules in LinkMed.</p>
      <p>text</p>
      <p>LinkMed
step 1: NER module
..........................................
..........................................
..........................................
..........................................
..........................................
..........................................
..........................................</p>
      <p>step 2: Classif. module
..........................................
..........................................
..........................................
..........................................
..........................................
..........................................
..........................................</p>
      <p>text +
relations</p>
      <p>
        Our approach for solving the Teslink task is shown in Figure 2, which indicates that the task
consists of two consecutive components: a NER module and a relationship classification module
between the found entities. Regarding the NER model, we solved the task using the FLERT
model [
        <xref ref-type="bibr" rid="ref15">29</xref>
        ], which has shown outstanding results in Spanish NER tasks [
        <xref ref-type="bibr" rid="ref16">30</xref>
        ]. This approach
consists of fine-tuning a transformer-based model, not at the sentence level but at the document
level. In other words, the input of the model considers not only the sequence of tokens of the
current sentence but also a window of tokens of the previous and following sentences, thus
incorporating more context. For our experiments, we tested three language models; biomedical
and clinical versions of RoBERTa [
        <xref ref-type="bibr" rid="ref17">31</xref>
        ], and the Spanish version of BERT [
        <xref ref-type="bibr" rid="ref5">19</xref>
        ].
      </p>
      <p>Then the model used for the relation extraction module followed the same architecture. We
ifne-tuned a domain-specific language model to create contextual representations of the spans
found in the NER module. Then, these representations are fed into a linear layer to determine
whether there is a relation between both entities. We are once again contemplating a focus on
the level of documents. In experiments using the validation partition, our NER model obtained
a mean of 0.84 according to the F1-score using the biomedical version of RoBERTta, and the
combination of both modules obtained a 0.51 F1-score using the oficial evaluation script. In
both modules, the language model that obtained the best results was the biomedical version of
RoBERTa.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Results</title>
      <p>After obtaining the results of our proposed solution, we employed a range of metrics to measure
our model’s performance. These include False Positives (FP), False Negatives (FN ), precision,
recall, and the F1-score as shown in Table 1. The model’s eficacy was assessed utilizing a test
dataset containing distinct entities and their corresponding relationships.</p>
      <p>Concerning the general performance of the model shown in the last row (all) of Table 1,
the system registered 326 False Positive instances, denoting scenarios where it incorrectly
recognized a relationship between two entities within the test dataset. In addition, it failed
to identify 379 existing relationships, classifying them as False Negatives. The precision of
the model, a ratio representing accurately identified relationships over the sum of identified
relationships, approximates 0.47. This suggests that nearly 47% of the relationships the system
identified align with those defined within the test set. The model’s recall, calculated as the
proportion of accurately identified relationships and overall actual relationships in the test set,
approximates 0.43. This inference reveals that the system correctly identified around 43% of all
actual relationships embedded within the test set context. Finally, the F1-score, representing the
harmonic mean of precision and recall, was determined to be approximately 0.45. This score
embodies a trade-of between precision and recall, indicating a need for further refinement in the
model’s performance. Collectively, these findings elucidate both the promising prospects and
inherent challenges of entity-relationship recognition within defined contexts. The occurrence
of both false positives and false negatives pinpoints areas for potential enhancement in the
model’s performance.</p>
      <p>token division
single
two
multiple
all
# relations
326
250
92
668
count</p>
      <p>TP
170
76
43
289</p>
      <p>In order to clarify the overall results shown in Table 1, we divided the data into three
classifications (first three rows of Table 1) based on the number of tokens present in the RML
entity: single token, two tokens, and multiple tokens. As the event entity (EVENT ) invariably
contained only one token, it did not facilitate the creation of these categories.</p>
      <p>The classification of the single token obtained a precision of 0.49, a recall of 0.52, and an
F1-score of 0.51. Interestingly, the single-token category outperformed the general results and
other classifications across all the metrics evaluated, thereby indicating a robust structure for
resolving the composite entity comprising one token.</p>
      <p>In contrast, comparing the two-token and multiple-token results reveals an interesting pattern,
deviating from the established trend. The performance in the multiple-token category surpassed
that of the two-token category, yielding a recall of 0.47 compared to 0.30 and an F1-score of
0.45 compared to 0.36. This suggests that the model employed in this study demonstrated a
heightened performance under more complex circumstances. Specifically, there was a relative
increase in True Positives (TP) and a relative decrease in False Negatives (FN ) concerning the
evaluated category. This unique performance underlines the capability of the model to adapt
and perform proficiently, even under more challenging conditions.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Limitations</title>
      <p>Our proposed model, while achieving commendable performance in various scenarios for section
identification in Electronic Clinical Narratives (ECNs), has its limitations. One of the most
significant restrictions stems from the text-chunking module. When this component incorrectly
identifies a section, the error tends to propagate to subsequent sections due to the sequential
nature of the task, where all parts of the text have a designated section. This has a cascading
efect on the precision of the classification for the entire ECN.</p>
      <p>Another limitation relates to the interdependency of the two modules of our model. As the
performance of the section classification module is inherently dependent on the accuracy of the
text-chunking module, any classification error can negatively impact the overall matching
accuracy. Thus, the combined performance of both modules is a critical factor for the efectiveness
of our model.</p>
    </sec>
    <sec id="sec-7">
      <title>7. Conclusions and Future Work</title>
      <p>In conclusion, our hybrid approach efectively drives the challenges of entity linking in the
Spanish clinical domain. Combining the strengths of a transformer-based Named Entity
Recognition (NER) model with a pair-wise classification module, we successfully identify relevant
entities and their potential relationships within clinical notes. However, there is still space for
improvement, particularly within the pair-wise classification component. Its current design
has limitations in capturing semantic relatedness between entities and the directional nature
of their relationships, which can impede overall performance. Future work should enhance
this component by integrating a method capable of discerning semantic relatedness and
directionality among identified entities. Ultimately, such advancements will improve the model’s
performance and further contribute to exploring the entity linking task in Clinical-NLP.</p>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgements</title>
      <p>This work was funded by ANID Chile: Basal Funds for Center of Excellence FB210017 (CENIA),
FB210005 (CMM); Millennium Science Initiative Program ICN17_002 (IMFD) and ICN2021_004
(iHealth), Fondecyt grant 11201250, and National Doctoral Scholarships 21211659 (Claudio
Aracena) and 21221155 (Carlos Muñoz-Castro). This research was partially supported by the
supercomputing infrastructure of the NLHPC (ECM-02) and the Patagón supercomputer of
Universidad Austral de Chile (FONDEQUIP EQM180042).
ceedings of the Annual Symposium on Computer Application in Medical Care, American
Medical Informatics Association, 1987, p. 717.
[4] D. A. Evans, K. Ginther-Webster, M. Hart, R. G. Leferts, I. A. Monarch, Automatic indexing
using selective nlp and first-order thesauri, in: Intelligent Text and Image Handling-Volume
2, 1991, pp. 624–643.
[5] W. Hersh, T. Leone, The saphire server: a new algorithm and implementation., in:
Proceedings of the Annual Symposium on Computer Application in Medical Care, American
Medical Informatics Association, 1995, p. 858.
[6] A. R. Aronson, Efective mapping of biomedical text to the umls metathesaurus: the
metamap program., in: Proceedings of the AMIA Symposium, American Medical
Informatics Association, 2001, p. 17.
[7] R. A. Miller, F. M. Gieszczykiewicz, J. K. Vries, G. F. Cooper, Chartline: providing
bibliographic references relevant to patient charts using the umls metathesaurus knowledge
sources., in: Proceedings of the Annual Symposium on Computer Application in Medical
Care, American Medical Informatics Association, 1992, p. 86.
[8] C. Friedman, G. Hripcsak, W. DuMouchel, S. B. Johnson, P. D. Clayton, Natural language
processing in an operational clinical information system, Natural Language Engineering 1
(1995) 83–108.
[9] J. Friedlin, C. J. McDonald, A natural language processing system to extract and code
concepts relating to congestive heart failure from chest radiology reports, in: AMIA annual
symposium proceedings, volume 2006, American Medical Informatics Association, 2006, p.
269.
[10] J. D’Souza, V. Ng, Sieve-based entity linking for the biomedical domain, in: Proceedings
of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th
International Joint Conference on Natural Language Processing (Volume 2: Short Papers),
2015, pp. 297–302.
[11] H. Liu, S. T. Wu, D. Li, S. Jonnalagadda, S. Sohn, K. Wagholikar, P. J. Haug, S. M. Huf, C. G.</p>
      <p>Chute, Towards a semantic lexicon for clinical natural language processing, in: AMIA
Annual Symposium Proceedings, volume 2012, American Medical Informatics Association,
2012, p. 568.
[12] A. Leal, B. Martins, F. M. Couto, Ulisboa: Recognition and normalization of medical
concepts, in: proceedings of the 9th International Workshop on Semantic Evaluation
(SemEval 2015), 2015, pp. 406–411.
[13] R. Leaman, R. Islamaj Doğan, Z. Lu, Dnorm: disease name normalization with pairwise
learning to rank, Bioinformatics 29 (2013) 2909–2917.
[14] M. E. Peters, M. Neumann, M. Iyyer, M. Gardner, C. Clark, K. Lee, L. Zettlemoyer, Deep
contextualized word representations. arxiv preprint, arXiv preprint arXiv:1802.05365
(2018).
[15] J. Devlin, M.-W. Chang, K. Lee, K. Toutanova, Bert: Pre-training of deep bidirectional
transformers for language understanding, arXiv preprint arXiv:1810.04805 (2018).
[16] J. Lee, W. Yoon, S. Kim, D. Kim, S. Kim, C. H. So, J. Kang, Biobert: a pre-trained biomedical
language representation model for biomedical text mining, Bioinformatics 36 (2020)
1234–1240.
[17] W.-T. Kao, H.-y. Lee, Is bert a cross-disciplinary knowledge learner? a surprising finding</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1] IberLEF, TESTLINK@IberLEF
          <year>2023</year>
          ,
          <year>2023</year>
          . URL: https://e3c.fbk.eu/testlinkiberlef.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>E.</given-names>
            <surname>French</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. T.</given-names>
            <surname>McInnes</surname>
          </string-name>
          ,
          <article-title>An overview of biomedical entity linking throughout the years</article-title>
          ,
          <source>Journal of Biomedical Informatics</source>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>H. J.</given-names>
            <surname>Lowe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. O.</given-names>
            <surname>Barnett</surname>
          </string-name>
          ,
          <article-title>Micromesh: a microcomputer system for searching and exploring the national library of medicine's medical subject headings (mesh) vocabulary, in: Proof pre-trained models' transferability</article-title>
          ,
          <source>arXiv preprint arXiv:2103.07162</source>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>A.</given-names>
            <surname>Subakti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Murfi</surname>
          </string-name>
          ,
          <string-name>
            <surname>N. Hariadi,</surname>
          </string-name>
          <article-title>The performance of bert as data representation of text clustering</article-title>
          ,
          <source>Journal of big Data</source>
          <volume>9</volume>
          (
          <year>2022</year>
          )
          <fpage>1</fpage>
          -
          <lpage>21</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>J.</given-names>
            <surname>Cañete</surname>
          </string-name>
          , G. Chaperon,
          <string-name>
            <given-names>R.</given-names>
            <surname>Fuentes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.-H.</given-names>
            <surname>Ho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Kang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Pérez</surname>
          </string-name>
          ,
          <article-title>Spanish pre-trained bert model and evaluation data</article-title>
          ,
          <source>Pml4dc at iclr</source>
          <year>2020</year>
          (
          <year>2020</year>
          )
          <fpage>1</fpage>
          -
          <lpage>10</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>J.</given-names>
            <surname>Cañete</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Donoso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Bravo-Marquez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Carvallo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Araujo</surname>
          </string-name>
          ,
          <article-title>Albeto and distilbeto: Lightweight spanish language models</article-title>
          ,
          <source>arXiv preprint arXiv:2204.09145</source>
          (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>V.</given-names>
            <surname>Araujo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Carvallo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kundu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Cañete</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mendoza</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. E.</given-names>
            <surname>Mercer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Bravo-Marquez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.-F.</given-names>
            <surname>Moens</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Soto</surname>
          </string-name>
          ,
          <article-title>Evaluation benchmarks for spanish sentence representations</article-title>
          ,
          <source>arXiv preprint arXiv:2204.07571</source>
          (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>C.</given-names>
            <surname>Aspillaga</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Carvallo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Araujo</surname>
          </string-name>
          ,
          <article-title>Stress test evaluation of transformer-based models in natural language understanding tasks</article-title>
          , arXiv preprint arXiv:
          <year>2002</year>
          .
          <volume>06261</volume>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>V.</given-names>
            <surname>Araujo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Carvallo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Aspillaga</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Thorne</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Parra</surname>
          </string-name>
          ,
          <article-title>Stress test evaluation of biomedical word embeddings</article-title>
          ,
          <source>arXiv preprint arXiv:2107.11652</source>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>V.</given-names>
            <surname>Araujo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Carvallo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Aspillaga</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Parra</surname>
          </string-name>
          ,
          <article-title>On adversarial examples for biomedical nlp tasks</article-title>
          , arXiv preprint arXiv:
          <year>2004</year>
          .
          <volume>11157</volume>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>A.</given-names>
            <surname>Carvallo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Parra</surname>
          </string-name>
          ,
          <article-title>Comparing word embeddings for document screening based on active learning</article-title>
          .,
          <source>in: BIRNDL@ SIGIR</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>100</fpage>
          -
          <lpage>107</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>B.</given-names>
            <surname>Magnini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Altuna</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lavelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Speranza</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Zanoli</surname>
          </string-name>
          ,
          <article-title>The e3c project: Collection and annotation of a multilingual corpus of clinical cases</article-title>
          ,
          <source>Proceedings of the Seventh Italian Conference on Computational Linguistics</source>
          CLiC-it
          <year>2020</year>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Jiménez-Zafra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Rangel</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Montes-y Gómez, Overview of IberLEF 2023: Natural Language Processing Challenges for Spanish and other Iberian Languages, in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), co-located with the 39th Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), CEURWS</article-title>
          .org,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>B.</given-names>
            <surname>Altuna</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Agerri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Salas-Espejo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. J.</given-names>
            <surname>Saiz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Zanoli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Speranza</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Magnini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lavelli</surname>
          </string-name>
          , G. Karunakaran, Overview of TESTLINK at IberLEF 2023:
          <article-title>Linking Results to Clinical Laboratory Tests</article-title>
          and Measurements,
          <source>Procesamiento del Lenguaje Natural</source>
          <volume>71</volume>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>S.</given-names>
            <surname>Schweter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Akbik</surname>
          </string-name>
          , Flert:
          <article-title>Document-level features for named entity recognition</article-title>
          , ArXiv abs/
          <year>2011</year>
          .06993 (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>M.</given-names>
            <surname>Rojas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Barros</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Martin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Araneda-Hernandez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Dunstan</surname>
          </string-name>
          ,
          <string-name>
            <surname>PLN</surname>
          </string-name>
          <article-title>CMM at SocialDisNER: Improving Detection of Disease Mentions in Tweets by Using Document-Level Features</article-title>
          ,
          <source>in: Proceedings of The Seventh Workshop on Social Media Mining for Health Applications</source>
          , Workshop &amp; Shared Task, Association for Computational Linguistics, Gyeongju, Republic of Korea,
          <year>2022</year>
          , pp.
          <fpage>52</fpage>
          -
          <lpage>54</lpage>
          . URL: https://aclanthology.org/
          <year>2022</year>
          .smm4h-
          <fpage>1</fpage>
          .
          <fpage>15</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>C. P.</given-names>
            <surname>Carrino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Llop</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Pàmies</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gutiérrez-Fandiño</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Armengol-Estapé</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. SilveiraOcampo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Valencia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gonzalez-Agirre</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Villegas</surname>
          </string-name>
          ,
          <article-title>Pretrained Biomedical Language Models for Clinical NLP in Spanish</article-title>
          ,
          <source>in: Proceedings of the 21st Workshop on Biomedical Language Processing</source>
          , Association for Computational Linguistics, Dublin, Ireland,
          <year>2022</year>
          , pp.
          <fpage>193</fpage>
          -
          <lpage>199</lpage>
          . URL: https://aclanthology.org/
          <year>2022</year>
          .bionlp-
          <volume>1</volume>
          .19. doi:
          <volume>10</volume>
          .18653/v1/
          <year>2022</year>
          . bionlp-
          <volume>1</volume>
          .
          <fpage>19</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>