<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Jaén, Spain
* Corresponding author.
$ mchizhik@ujaen.es (M. Chizhikova); mcdiaz@ujaen.es (M. C. Díaz-Galiano); laurena@ujaen.es
(L. A. Ureña-López); maite@ujaen.es (M. T. M. Valdivia)</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Automatic Segmentation of Clinical Narratives in Sections with Pre-Trained Clinical Transformer Models</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mariia Chizhikova</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Manuel Carlos Díaz-Galiano</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Luis Alfonso Ureña-López</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>María Teresa Martín Valdivia</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, University of Jaén</institution>
          ,
          <addr-line>Campus Las Lagunillas, s/n, Jaén, 23071</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>This paper presents the participation of the SINAI team in the ClinAIS shared task at IberLEF 2023, focusing on the task of section identification in clinical reports. The approach involves a multiclass token classification framework for section boundary detection, utilizing two system variants tuned for detecting one-token and three-token long boundaries. The system is built upon a RoBERTa architecture model pre-trained on biomedical and clinical corpora, and fine-tuned for the token classification task through hyperparameter optimization trials.The results show that fine-tuning for longer boundaries improved performance (0.6766 vs 0.6986 weighted B2 score). Error analysis revealed challenges in detecting "DERIVED_FROM/TO" and "EVOLUTION" sections due to class imbalance and semantic confusion.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Clnical Natural Language Processing</kwd>
        <kwd>Section Identification</kwd>
        <kwd>RoBERTa language model</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        entities and map them into codes from a controlled vocabulary like SNOMED-CT [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] or classify
texts into categories [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] among others methods.
      </p>
      <p>
        The main challenges of clinical NLP involve dealing with the overall linguistic complexity of
an EHR: negated expressions, co-references, misspellings, abbreviations, anaphoric relations
and so on [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. The task of Section Identification (SI) consists in detecting boundaries of text
sections and adding semantic annotations to the divided text. This task can be considered as a
step towards addressing the aforementioned problems by identification and disambiguation of
the narrative structure underlying to each clinical report.
      </p>
      <p>
        The realization of SI was proven to be beneficial for the performance in clinical information
extraction tasks such as entity recognition in Chinese clinical reports [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and temporal relation
extraction [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Many works on this specific task focused on clinical narratives written in English:
Li et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] implemented a Hidden Markov Model based algorithm trained on a corpus of
9,679 clinical notes from New York-Presbyterian Hospital. More recently, Sadoughi et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]
proposed a neural network based approach to SI that trains a model with a Long Short-Term
Memory (LSTM) layer in its core. This work also proves that SI is beneficial for the quality of
the post-processing of automatically transcribed clinical reports.
      </p>
      <p>
        However, the capability to analyze clinical text in languages other than English holds
significant potential for accessing crucial medical data pertaining to patient cohorts treated in
countries where English is not the oficial language[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. When it comes to processing clinical
narratives written in Spanish, SI appears to be a not so widely explored task, despite the fact of
Spanish being the fourth most spoken language in the world. Goenaga et al. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] evaluate three
diferent approaches to automatically standardizing Spanish electronic discharge summaries
from two diferent hospitals following the HL7 Clinical Document Architecture and state that
transfer-learning approaches show the best performance compared to other ML and rule-based
techniques.
      </p>
      <p>
        ClinAIS shared task at the Iberian Languages Evaluation Forum (IberLEF) 2023 aims to impulse
the research in the automatic SI applied to the Spanish language by providing a dataset of 1,038
clinical reports annotated with seven predefined medical sections: Present Illness, Derived
from/to, Past Medical History, Family history, Exploration, Treatment and Evolution [
        <xref ref-type="bibr" rid="ref11 ref12">11, 12</xref>
        ].
      </p>
      <p>
        The main objective of this paper is to describe the system presented by the SINAI team at
the ClinAIS shared task. Our approach follows the line of the best performed system described
in Goenaga et al. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. We tackle the Section Identification task as a token classification problem
that intends to detect the borders of each of the seven sections.
      </p>
      <p>The remainder of the paper is organized as follows: Section 2 provides a detailed description
of the datasets that were made available by the organizers of the competition, Section 3 is
dedicated to the system we developed to tackle the task of automatic SI, Section 3.1 outlines
the specific procedures and configurations used during the experiments, Section 4 discloses the
results obtained by the presented system during the oficial evaluation that are subsequently
analyzed in Section 5 which critically examines the overall system performance and investigates
potential sources of errors. Finally, Section 6 summarizes the main aspects of our contribution
and ofers insights to enhance future iterations of the automatic section identification system.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Data</title>
      <p>
        ClinAIS corpus [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] is a collection of 1,038 clinical case reports from diferent medical specialties
that were obtained by randomly sampling the CodiEsp corpus [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
      </p>
      <p>The data was divided into three subsets in a stratified manner so that the section distribution
is similar in all three sets: training, development and test. The proportion of notes in each is
0.75, 0.125 and 0.125 respectively. It is worth mentioning that the evaluation set was released
alongside 2,751 unannotated documents in order to prevent the participants from performing
manual corrections and/or annotations.</p>
      <p>Each corpus entry was annotated with borders of sections from the following list: Present
Illness, Derived from/to, Past Medical History, Family history, Exploration, Treatment and
Evolution. Figure 1 shows the class label distribution across training and development datasets.
The sections Derived from/to and Family are scarcely represented compared to other labels.</p>
      <p>As it can be inferred from the Table 1 which summarizes some relevant statistics of the
data subsets, the average report length (measured in tokens produced by RoBERTa tokenizer)
doesn’t exceed 512, the maximum input length of the majority of pre-trained transformer
models available for Spanish, in all of the subsets. We can also point out that the data is quite
homogeneous across subsets in terms of the average of the annotated sections.</p>
      <p>Notably, section borders in many cases do not match with sentence starts which makes it
unviable to rely on a sentence classification approach.
Number of reports
Avg. report length (STD)
Avg. sections (STD)
Avg. unique sections (STD)</p>
    </sec>
    <sec id="sec-3">
      <title>3. System Description</title>
      <p>
        Fine-tuning large pre-trained transformer models was proven to be an efective approximation to
many NLP tasks like text classification [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] or NER [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Our approach follows this methodology
because we tackle the task of automatic SI as a multiclass token classification problem by
finetuning a large language model to detect section boundaries.
      </p>
      <p>In order to convert the corpus into a format that would be coherent with our problem
formulation, we performed pre-tokenization by splitting the corpus by white space and treating
the punctuation marks as separate units. As for the token labeling, we compared the approach
of marking only the boundary word as an entity to the method of selecting the first three words
of each section and labeling these according to the BIO (Beginning, Inside and Outside) scheme.</p>
      <p>
        We opted for building our system with a RoBERTa architecture model [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] pre-trained on a
combination of biomedical and clinical corpora [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. This model was fine-tuned for the token
classification task by adding a dropout and a linear layer on top of the original architecture.
3.1. Experimental Setup
We conducted two experiments in order to compare the aforementioned labeling strategies. In
order to maximize the resulting performance of the systems, we carried out optimization of
hyperparameters for model fine-tuning. This process relied on the Optuna framework which
provides eficient trial pruning and parameter sampling strategies [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ].
      </p>
      <p>For each system, we performed 5 trials on a single NVIDIA Ampere A100 GPU. The
hyperparameter search space was defined as follows:
• Learning rate: a float value between 3 − 5 and 5 − 5
• Per device train batch size: either 8 or 16
• Weight decay: a float value between 1 − 12 and 1 − 1
• Adam epsilon: a float value between 1 − 10 and 1 − 6
• Warmup steps: an integer value between 0 and 1000</p>
      <p>The number of training epochs was selected by making use of the Early Stopping strategy
that interrupts fine-tuning when the reference metric doesn’t improve during 3 epochs. Table 2
presents the hyperparameters selected for each of the experiments.</p>
      <p>Single word boundary</p>
      <p>Three word boundary
Learning rate
Training epochs
Batch size
Weight decay
AdamW epsilon
Warmup steps</p>
    </sec>
    <sec id="sec-4">
      <title>4. Results</title>
      <p>
        Our team submitted a total of two runs: one per each of the described pre-processing
methodologies. Identification of sections within unstructured clinical notes poses numerous challenges,
thereby impeding the comprehensive assessment of this task. Prominent among these challenges
is the inherent interconnection between the end of one section and the start of another. Another
issue is related to the fact that sections are not delimited by paragraphs, lines or phrases. To
address this the organizers designed the B2 evaluation metric, which is an adaptation of the
boundary distance [
        <xref ref-type="bibr" rid="ref19 ref20">19, 20</xref>
        ]. Table 3 displays the results obtained during the oficial evaluation
of the two presented system variants.
      </p>
    </sec>
    <sec id="sec-5">
      <title>5. Performance Analysis</title>
      <p>This section attempts to provide insights regarding the overall system performance and
investigate the possible reasons for its errors. As shown in Table 3, the variant that involved
recognition of the first three tokens of each section was proven to be the best option scoring
0.6986 weighed B2 during the oficial evaluation on the test set. This performance improvement
can be explained by the fact that the majority of the words at the beginning of each section are
adpositions (e.g. con (with), en (in)), determiners (e.g. el/la (the)) and pronouns (e.g. su (his/her)).
This type of words lack of semantic information almost completely which has a negative impact
on the amount of information that can be captured by an embedding of a contextual model
such as RoBERTa clinical. Therefore, the classifier layer cannot learn and generalize correctly
distinctive features of such tokens which leads to classification errors. Figure 2 displays the
frequency of apparition of diferent parts-of-speech at the beginning of the sections.</p>
      <p>Moreover, we carried out an error analysis using the predictions made with the best
performing variant of our system on the development subset and evaluated with the oficial evaluation
script scoring 0.7477 weighted B2. We distinguish between four types of errors: additions (when
the system identifies a section that is not present in the report), deletions (when the system is
not capable to detect a section), transpositions (the section was identified, but the extracted
span is transposed) and substitutions (the identified section is assigned a wrong label). Figure 3
summarizes the error type and rate per each section type.</p>
      <p>The highest number of matches with respect to a total number of reference annotations (169
out of 218) is shown for the label “PRESENT_ILLNESS” that is the most commonly seen across
the training dataset. Nevertheless, there is no clear direct relation between the most represented
labels in the dataset and the quality of predictions: the categories with the worst
matches/reference annotations ratio are “DERIVED_FROM/TO” - 10 matches out of 24 reference annotations
and “EVOLUTION” - 81 matches out of 198 references. Notably, the “DERIVED_FROM/TO”
class is the only one where the number of additions (11) is higher than the matches. As for the
identification of the “EVOLUTION” sections, the frequency of deletions (70) is worth mentioning.</p>
      <p>As for the substitutions, 14 errors of this type were detected when evaluating our best
performing system on the development dataset. The most confounded label resulted to be
“EVOLUTION” which was substituted with “EXPLORATION” 4 times and once with
“TREATMENT”. Moreover, it substituted the label “TREATMENT” twice and “PRESENT_ILLNESS” once.
The “EVOLUTION” was defined in the annotation guidelines as the “ Evolution of the patient’s
health status. It may include diferential diagnoses ”, which means it can include references to the
explorations performed and its results, as well as the response to treatments and updates on the
present illness, so semantically this section might be close to many other sections depending on
the report and section position within it.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusions and Future Work</title>
      <p>This paper covers the participation of the SINAI team at the ClinAIS shared task held on IberLEF
2023. We describe our approach to the task of section identification in clinical reports. Our
problem formulation follows a multiclass token classification approach for section boundary
detection and we compare two variants of systems: one tuned to detect one-token boundary
and the other tuned to recognize three-token-long boundaries. We based our system on a
RoBERTa architecture model pre-trained on a combination of biomedical and clinical corpora
that was fine-tuned for the token classification task with hyperparameters selected during a 5
trial optimization.</p>
      <p>The approach of fine-tuning a model to detect longer section boundaries performed better
scoring 0.6986 weighted B2 score. This improvement is probably due to the fact that the majority
of sections start with a function word like a preposition or a determiner.</p>
      <p>In order to shed light on the errors made by our system we conducted an error analysis of
the predictions of our best performing system on the development set. The two sections that
resulted to be the hardest to detect have proven to be “DERIVED_FROM/TO”, one of the less
represented classes in the training set, and “EVOLUTION” which might be semantically close
to other sections like “TREATMENT”, “PRESENT_ILLNESS” or “EXPLORATION” and is thus
frequently confounded or omitted.</p>
      <p>With the purpose of dealing with these issues, an implementation of a weighted loss function
can be proposed as a way of mitigating the impact of class imbalance. With the same aim
data augmentation techniques like back translation or generative language models can be
used. A more in-depth dataset analysis can provide more detailed information regarding the
linguistic characteristics of each section which can be employed as an extension to the contextual
embedding during the classification process.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>This work has been partially supported by WeLee project (1380939, FEDER Andalucía 2014-2020)
funded by the Andalusian Regional Government, and projects CONSENSO
(PID2021-122263OBC21), MODERATES (TED2021-130145B-I00), SocialTOX (PDC2022-133146-C21) funded by Plan
Nacional I+D+i from the Spanish Government, and project PRECOM (SUBV-00016) funded by
the Ministry of Consumer Afairs of the Spanish Government.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rastegar-Mojarad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Moon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Shen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Afzal</surname>
          </string-name>
          , S. Liu,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zeng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mehrabi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sohn</surname>
          </string-name>
          , et al.,
          <article-title>Clinical information extraction applications: a literature review</article-title>
          ,
          <source>Journal of biomedical informatics 77</source>
          (
          <year>2018</year>
          )
          <fpage>34</fpage>
          -
          <lpage>49</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.</given-names>
            <surname>Miranda-Escalada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Gascó</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lima-López</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Farré-Maduell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Estrada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Nentidis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Krithara</surname>
          </string-name>
          , G. Katsimpras, G. Paliouras,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Krallinger, Overview of distemist at bioasq: Automatic detection and normalization of diseases from clinical texts: results, methods, evaluation and multilingual resources</article-title>
          , in: Working Notes of Conference and
          <article-title>Labs of the Evaluation (CLEF) Forum</article-title>
          .
          <source>CEUR Workshop Proceedings</source>
          ,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M.</given-names>
            <surname>Chizhikova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>López-Úbeda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Collado-Montañez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Martín-Noguerol</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. C.</given-names>
            <surname>DíazGaliano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Luna</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. A.</given-names>
            <surname>Ureña-López</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. T.</given-names>
            <surname>Martín-Valdivia</surname>
          </string-name>
          ,
          <article-title>Cares: A corpus for classiifcation of spanish radiological reports</article-title>
          ,
          <source>Computers in Biology and Medicine</source>
          <volume>154</volume>
          (
          <year>2023</year>
          )
          <fpage>106581</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Pomares-Quimbaya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kreuzthaler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Schulz</surname>
          </string-name>
          ,
          <article-title>Current approaches to identify sections within clinical narratives from electronic health records: a systematic review</article-title>
          ,
          <source>BMC medical research methodology</source>
          <volume>19</volume>
          (
          <year>2019</year>
          )
          <fpage>1</fpage>
          -
          <lpage>20</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J.</given-names>
            <surname>Lei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Tang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Lu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Gao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <article-title>A comprehensive study of named entity recognition in chinese clinical text</article-title>
          ,
          <source>Journal of the American Medical Informatics Association</source>
          <volume>21</volume>
          (
          <year>2014</year>
          )
          <fpage>808</fpage>
          -
          <lpage>814</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>S.</given-names>
            <surname>Kropf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Krücken</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Mueller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Denecke</surname>
          </string-name>
          ,
          <article-title>Structuring legacy pathology reports by openehr archetypes to enable semantic querying</article-title>
          ,
          <source>Methods of information in medicine 56</source>
          (
          <year>2017</year>
          )
          <fpage>230</fpage>
          -
          <lpage>237</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. Lipsky</given-names>
            <surname>Gorman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Elhadad</surname>
          </string-name>
          ,
          <article-title>Section classification in clinical notes using supervised hidden markov model</article-title>
          ,
          <source>in: Proceedings of the 1st ACM International Health Informatics Symposium</source>
          , IHI '10,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery, New York, NY, USA,
          <year>2010</year>
          , p.
          <fpage>744</fpage>
          -
          <lpage>750</lpage>
          . URL: https://doi.org/10.1145/1882992.1883105. doi:
          <volume>10</volume>
          .1145/1882992. 1883105.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>N.</given-names>
            <surname>Sadoughi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. P.</given-names>
            <surname>Finley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Edwards</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Robinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Korenevsky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Brenndoerfer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Axtmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Miller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Suendermann-Oeft</surname>
          </string-name>
          ,
          <article-title>Detecting section boundaries in medical dictations: Toward real-time conversion of medical dictations to clinical reports</article-title>
          , in: Speech and Computer: 20th International Conference,
          <string-name>
            <surname>SPECOM</surname>
          </string-name>
          <year>2018</year>
          , Leipzig, Germany,
          <source>September 18-22</source>
          ,
          <year>2018</year>
          , Proceedings 20, Springer,
          <year>2018</year>
          , pp.
          <fpage>563</fpage>
          -
          <lpage>573</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A.</given-names>
            <surname>Névéol</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Dalianis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Velupillai</surname>
          </string-name>
          , G. Savova,
          <string-name>
            <surname>P. Zweigenbaum,</surname>
          </string-name>
          <article-title>Clinical natural language processing in languages other than english: opportunities and challenges</article-title>
          ,
          <source>Journal of biomedical semantics 9</source>
          (
          <year>2018</year>
          )
          <fpage>1</fpage>
          -
          <lpage>13</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>I.</given-names>
            <surname>Goenaga</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Lahuerta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Atutxa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Gojenola</surname>
          </string-name>
          ,
          <article-title>A section identification tool: towards hl7 cda/ccr standardization in spanish discharge summaries</article-title>
          ,
          <source>Journal of Biomedical Informatics</source>
          <volume>121</volume>
          (
          <year>2021</year>
          )
          <fpage>103875</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>I. de la Iglesia</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Vivó</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Chocrón</surname>
            , G. de Maeztu,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Gojenola</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Atutxa</surname>
          </string-name>
          , Overview of ClinAIS at IberLEF 2023:
          <article-title>Automatic Identification of Sections in Clinical Documents in Spanish</article-title>
          ,
          <source>Procesamiento del Lenguaje Natural</source>
          <volume>71</volume>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Jiménez-Zafra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Rangel</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Montes-y Gómez, Overview of IberLEF 2023: Natural Language Processing Challenges for Spanish and other Iberian Languages, in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), co-located with the 39th Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), CEURWS</article-title>
          .org,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>A.</given-names>
            <surname>Miranda-Escalada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gonzalez-Agirre</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Armengol-Estapé</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Krallinger, Overview of automatic clinical coding: Annotations, guidelines, and solutions for non-english clinical cases at codiesp track of CLEF ehealth 2020</article-title>
          , in: L.
          <string-name>
            <surname>Cappellato</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Eickhof</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Ferro</surname>
            ,
            <given-names>A</given-names>
          </string-name>
          . Névéol (Eds.), Working Notes of CLEF 2020 -
          <article-title>Conference and Labs of the Evaluation Forum</article-title>
          , Thessaloniki, Greece,
          <source>September 22-25</source>
          ,
          <year>2020</year>
          , volume
          <volume>2696</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2020</year>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2696</volume>
          /paper_263.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>P.</given-names>
            <surname>López-Ubeda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Díaz-Galiano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. A. U.</given-names>
            <surname>López</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. T.</given-names>
            <surname>Martín-Valdivia</surname>
          </string-name>
          , T. MartínNoguerol, A. Luna,
          <article-title>Transfer learning applied to text classification in spanish radiological reports</article-title>
          ,
          <source>in: Proceedings of the LREC 2020 Workshop on Multilingual Biomedical Text Processing (MultilingualBIO</source>
          <year>2020</year>
          ),
          <year>2020</year>
          , pp.
          <fpage>29</fpage>
          -
          <lpage>32</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>A.</given-names>
            <surname>Miranda-Escalada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Farré</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Krallinger</surname>
          </string-name>
          ,
          <article-title>Named entity recognition, concept normalization and clinical coding: Overview of the cantemist track for cancer text mining in spanish, corpus, guidelines, methods and results</article-title>
          .,
          <source>IberLEF@ SEPLN</source>
          (
          <year>2020</year>
          )
          <fpage>303</fpage>
          -
          <lpage>323</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ott</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Goyal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Du</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Joshi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Levy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zettlemoyer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Stoyanov</surname>
          </string-name>
          ,
          <article-title>Roberta: A robustly optimized bert pretraining approach</article-title>
          , arXiv preprint arXiv:
          <year>1907</year>
          .
          <volume>11692</volume>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>C. P.</given-names>
            <surname>Carrino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Armengol-Estapé</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gutiérrez-Fandiño</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Llop-Palao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Pàmies</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gonzalez-Agirre</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Villegas</surname>
          </string-name>
          ,
          <article-title>Biomedical and clinical language models for spanish: On the benefits of domain-specific pretraining in a mid-resource scenario</article-title>
          ,
          <source>arXiv preprint arXiv:2109.03570</source>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>T.</given-names>
            <surname>Akiba</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Yanase</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Ohta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Koyama</surname>
          </string-name>
          ,
          <article-title>Optuna: A next-generation hyperparameter optimization framework</article-title>
          ,
          <source>in: Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery &amp; data mining</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>2623</fpage>
          -
          <lpage>2631</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>C.</given-names>
            <surname>Fournier</surname>
          </string-name>
          ,
          <article-title>Evaluating text segmentation using boundary edit distance</article-title>
          ,
          <source>in: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume</source>
          <volume>1</volume>
          :
          <string-name>
            <surname>Long</surname>
            <given-names>Papers)</given-names>
          </string-name>
          ,
          <source>Association for Computational Linguistics</source>
          , Sofia, Bulgaria,
          <year>2013</year>
          , pp.
          <fpage>1702</fpage>
          -
          <lpage>1712</lpage>
          . URL: https://aclanthology.org/P13-1167.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>I. de la Iglesia</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Vivó</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Chocrón</surname>
            , G. de Maeztu,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Gojenola</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Atutxa</surname>
          </string-name>
          ,
          <article-title>An Open Source Corpus and Automatic Tool for Section Identification in Spanish Health Records</article-title>
          ,
          <source>Journal of Biomedical Informatics</source>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>