<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Automatic De-Identi cation of Medical Texts in Spanish: the MEDDOCAN Track, Corpus, Guidelines, Methods and Evaluation of Results</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Montserrat Marimon</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Aitor Gonzalez-Agirre</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ander Intxaurrondo</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Heidy Rodr guez</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jose Antonio Lopez Martin</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marta Villegas</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Martin Krallinger</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Barcelona Supercomputing Center</institution>
          ,
          <addr-line>BSC</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Centro Nacional de Investigaciones Oncologicas</institution>
          ,
          <addr-line>CNIO</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Hospital 12 de Octubre - Madrid</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <fpage>618</fpage>
      <lpage>638</lpage>
      <abstract>
        <p>There is an increasing interest in exploiting the content of electronic health records by means of natural language processing and text-mining technologies, as they can result in resources for improving patient health/safety, aid in clinical decision making, facilitate drug repurposing or precision medicine. To share, re-distribute and make clinical narratives accessible for text mining research purposes, it is key to fulll legal conditions and address restrictions related data protection and patient privacy. Thus, clinical records cannot be shared directly "as is". A necessary precondition for accessing clinical records outside of hospitals is their de-identi cation or exhaustive removal/replacement of all mentioned privacy related protected health information phrases. Providing a proper evaluation scenario for automatic anonymization tools is key for approval of data redistribution. The construction of manually de-identi ed medical records is currently the main rate and cost-limiting step for secondary use applications of clinical data. This paper summarizes the settings, data and results of the rst shared track on anonymization of medical documents in Spanish, the MEDDOCAN (Medical Document Anonymization) track. This track relied on a carefully constructed synthetic corpus of clinical case documents, the MEDDOCAN corpus, following annotation guidelines for sensitive data based on the analysis of the EU General Data Protection Regulation. A total of 18 teams (from the 51 registrations) submitted 63 runs for rst sub-track 1 and 61 systems for the second sub-track. The top scoring systems were based on sophisticated deep learning approaches, representing strategies that can signi cantly reduce time and costs associated to accessing textual data containing privacy-related sensitive information. The results of this track might help in lowering the clinical data access hurdle for Spanish language technology developers, showing also potentials for similar settings using data in other languages or from di erent domains.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>1</p>
    </sec>
    <sec id="sec-2">
      <title>Introduction</title>
      <p>
        There is an increasing interest in exploiting the content of unstructured clinical
narratives by means of language technologies. Therefore, and because there is
clear interest in the health sector by the language technology industry, one of
the agship projects of the Spanish National Plan for the Advancement of
Language Technology (Plan TL4) is related to the clinical and biomedical eld. The
Plan TL has promoted the generation of a collection of resources for Spanish
biomedical NLP5, including corpora [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ], gazetteers [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ], components [
        <xref ref-type="bibr" rid="ref19 ref2">2, 19</xref>
        ] and
tools, as well as evaluation e orts [
        <xref ref-type="bibr" rid="ref11 ref12 ref18">18, 11, 12</xref>
        ]. Due to their central role in
fostering language technology resources, the promotion of shared tasks and evaluation
campaigns is of particular relevance for the Plan TL, being considered a key
instrument for: (1) independent quality evaluation of components, (2) promotion
of standards, interoperability and harmonization of resources, (3) generation of
new systems, tools and software components, (4) promotion of con dence by end
users, investors and commercial partners in language technologies, (5)
promoting new start ups and innovative ideas, (6) improving access to data, (7) create
collaborative research interactions and networks and (8) serve as a knowledge
transfer and learning experience engaging both academia and industry.
Structured clinical data, in the form of codi ed clinical information using controlled
indexing vocabulary such as ICD10, only covers a fraction of the medically
relevant information stored in electronic health records (EHRs) and clinical texts.
Complex relations such as drug-related allergies, constituting a serious health
risk, cannot be captured well by the coding schemes followed typically by
clinical documentalists and, thus, require direct processing of clinical narrative texts.
      </p>
      <p>Being able to transform automatically clinical documents into some
structured representations is nonetheless needed to enable secondary use of EHRs to
carry out population and epidemiological studies, to detect medication-related
adverse events or for monitoring systematically treatment-related responses, just
to name a few.</p>
      <p>
        To be able to share, re-distribute and make clinical narratives accessible for
text mining and natural language processing (NLP) purposes, it is key to ful ll
legal conditions and address restrictions related data protection and patient
privacy legislations [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Some e orts have been made to examine GDPR demands
Copyright c 2019 for this paper by its authors. Use permitted under Creative
Commons License Attribution 4.0 International (CC BY 4.0). IberLEF 2019, 24
September 2019, Bilbao, Spain.
4 https://www.plantl.gob.es
5 https://github.com/PlanTL-SANIDAD
      </p>
      <p>
        MEDDOCAN: Automatic de-identi cation of medical texts in Spanish
for the construction of de-identi ed textual corpora for research purposes [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ].
Thus, clinical records with protected health information (PHI) cannot be directly
shared "as is", due to privacy constraints, making it particularly cumbersome
to carry out NLP research in the medical domain. A necessary precondition for
accessing clinical records outside of hospitals is their de-identi cation, i.e., the
exhaustive removal (or replacement) of all mentioned PHI phrases.
      </p>
      <p>
        Studies describing services for pseudonymization of EHRs based on
standards such as the ISO/EN 13606 were previously published for data in Spanish
[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], but are generally limited to the structured elds of the clinical documents,
have not been evaluated against any particular Gold Standard dataset (i.e. lack
proper evaluation), and, most importantly, are not accessible or released on
public software repositories, making it impossible to actually carry out a proper
independent benchmark study. Providing a proper evaluation scenario of
automatic anonymization tools, with well-de ned sensitive data types, is crucial for
approval of data redistribution consents signed by ethical committees of
healthcare institutions. It is important to highlight that the construction of manually
de-identi ed medical records is currently the main rate and cost-limiting step for
secondary use applications. Moreover, such settings also require very carefully
designed annotation guidelines and interfaces to assure that there is no leak of
sensitive information from clinical records and that the resulting de-identi ed
datasets are compliant with all legal constraints.
      </p>
      <p>
        The practical relevance of anonymization or de-identi cation of clinical texts
motivated the proposal of two shared tasks, the 2006 and 2014 de-identi cation
tracks [
        <xref ref-type="bibr" rid="ref21 ref24">24, 21</xref>
        ], organized under the umbrella of the i2b2 (i2b2.org) community
evaluation e ort. The i2b2 e ort has deeply in uenced the clinical NLP
community worldwide, but was focused on documents in English and covering
characteristics of US-healthcare data providers. Systems used for de-identifying
English clinical texts like Carafe, based on Conditional Random Fields or MIST
(the MITRE Identi cation Scrubber Toolkit) have bene ted from i2b2 shared
tasks to improve, evaluate and analyze these tools. The interest in automated
de-identi cation and anonymization systems is not limited to data in English,
and there is also a growing awareness in developing such systems for other
languages, such as French [
        <xref ref-type="bibr" rid="ref7 ref9">9, 7</xref>
        ], German [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ], Dutch [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ], Portuguese [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], Danish
[
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], Swedish [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] or Norwegian [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ].
      </p>
      <p>
        In case of texts in Spanish, there has been so far a rather limited attempt
in developing and characterizing automatic de-identi cation strategies [
        <xref ref-type="bibr" rid="ref10 ref14 ref25 ref6">10, 14,
25, 6</xref>
        ], even though some in house tools, such as the AEMPS anonymizer or
a recent publication by Medina and Turmo [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] show that e orts in this
direction are being made and such tools are already explored in practice. We,
therefore, organized the rst community challenge track speci cally devoted to
the anonymization of medical documents in Spanish, called the MEDDOCAN
(Medical Document Anonymization) track, as part of the IberLEF evaluation
initiative.
2
2.1
      </p>
    </sec>
    <sec id="sec-3">
      <title>Methods</title>
      <sec id="sec-3-1">
        <title>Track Description</title>
        <p>The MEDDOCAN track was one of the nine challenge tracks of the Iberian
Languages Evaluation Forum (IberLEF 2019)6 evaluation campaign, which had
the goal of promoting the development of language technologies for Iberian
languages. MEDDOCAN was the rst community challenge track speci cally
devoted to the anonymization of medical documents in Spanish and it evaluated the
performance of the systems for identifying and classifying sensitive information
in clinical case studies written in Spanish.</p>
        <p>The evaluation of automatic predictions for this track had two di erent
scenarios or sub-tracks:
1. NER o set and entity type classi cation : the rst sub-track was focused
on the identi cation and classi cation of sensitive information (e.g., patient
names, telephones, addresses, etc.).
2. Sensitive span detection: the second sub-track was focused on the detection
of sensitive text more speci c to the practical scenario necessary for the
release of de-identi ed clinical documents, where the objective is to identify
and to mask con dential data, regardless of the real type of entity or the
correct identi cation of PHI type.
2.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Track data</title>
        <p>For this track, we prepared a synthetic corpus of clinical cases enriched with
PHI expressions, named the MEDDOCAN corpus. The MEDDOCAN corpus,
of 1,000 clinical case studies, was selected manually by a practicing physician and
augmented with PHI phrases by health documentalists, adding PHI information
from discharge summaries and medical genetics clinical records.</p>
        <p>
          To carry out the manual annotation, we constructed the rst public
guidelines for PHI in Spanish [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ], following the speci cations derived from the
General Data Protection Regulation (GDPR) of the EU, as well as the annotation
guidelines and types de ned by the i2b2 de-identi cation tracks, based on the US
Health Insurance Portability and Accountability Act (HIPAA). The
construction of these annotation guidelines involved active feedback over a six-month
period from a hybrid team of nine persons with expertise in both healthcare and
NLP, resulting in a 28-page document that has been distributed along with the
corpus. Along with the annotation rules, illustrative examples were provided to
make the interpretation and use of the guidelines as easy as possible.
        </p>
        <p>The MEDDOCAN corpus was randomly sampled into three subset: the train
set, which contained 500 clinical cases, and the development and test sets of 250
clinical cases each. These clinical cases were manually annotated using a
customized version of AnnotateIt. Then, the BRAT annotation toolkit was used to
6 http://hitz.eus/sepln2019/?q=node/21
correct errors and add missing annotations, achieving an inter-annotator
agreement (IAA) of 98% (calculated with 50 documents). Together with the test set,
we released an additional collection of 3,501 documents (background set7) to
make sure that participating teams were not able to do manual corrections and
also to promote that these systems would potentially be able to scale to larger
data collections.</p>
        <p>The MEDDOCAN annotation guidelines de ned a total of 29 entity types.
track and the number of occurrences among the training, development and test
sets.</p>
        <p>The MEDDOCAN corpus was distributed in plain text in UTF-8 encoding,
where each clinical case was stored as a single le, while PHI annotations were
released in the BRAT format, which makes visualization of results
straightforward, as you can see in Fig. 1 For this track, we also prepared a conversion script8
between the BRAT annotation format and the annotation format used by the
7 The background set included the train, development and test sets, and an additional
collection of 2,751 clinical cases (totalling 3,751 clinical cases).
8 https://github.com/PlanTL-SANIDAD/MEDDOCAN-Format-Converter-Script</p>
        <p>Marimon et al.
previous i2b2 e ort, to make comparison and adaptation of previous systems
used for English texts easier.
We developed an evaluation script that supported the evaluation of the
predictions of the participating teams. For both sub-tracks the primary evaluation
metrics used consisted of standard measures from the NLP community, namely
micro-averaged precision, recall, and balanced F-score, being the last one the
only o cial evaluation measure of both sub-tracks:</p>
        <p>T P
Precision: P = T P +F P</p>
        <p>T P
Recall: R = T P +F N</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Participation and Results</title>
      <sec id="sec-4-1">
        <title>Participation</title>
        <p>To participate in the MEDDOCAN track it was necessary to register both on
the o cial website9 and in the CodaLab competition10. Training and
development sets were made available for download on the o cial website11, and the
evaluation script was uploaded to GitHub12, to ensure a transparent evaluation.</p>
        <p>Submissions had to be provided in a prede ned prediction format (BRAT
or i2b2). The participants had a period of almost two months to develop their
system. In the middle of this period, the text and background sets were released
with the 3,751 documents that the participants had to process and label,
although the nal evaluation was done on the 250 documents of the test set. As
we have mentioned, the participants could submit a maximum of 5 system runs,
and, once the submission deadline expired, we published the Gold Standard
annotations of the test set, in order to ensure a transparent evaluation process.</p>
        <p>A total of 18 teams participated in the track, submitting a total of 63 systems
for sub-track 1 and 61 systems for sub-track 2. Teams from eight di erent
nationalities participated in the track: ten from Spain, two from the United States,
and one from Argentina, China, Germany, Italy, Japan, and Russia. Among all
the participants, only one belonged to an institution of a commercial nature.
Table 2 summarizes the most relevant information about the participants.
3.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Baseline system</title>
        <p>We produced a baseline system using a vocabulary transfer approach. Each
annotation from the train and development datasets was transferred to the test
dataset using strict string matching. For those cases where the text was the
same, but the entity type was di erent, we decided to annotate all entity types
that matched that text.
9 http://temu.bsc.es/meddocan/
10 https://competitions.codalab.org/competitions/22643
11 http://temu.bsc.es/meddocan/index.php/data/
12
https://github.com/PlanTL-SANIDAD/MEDDOCAN-CODALAB-EvaluationScript
Marimon et al.</p>
        <p>MEDDOCAN: Automatic de-identi cation of medical texts in Spanish
set and entity type classi cation.
with a recall of 0.98335, lukas.lange, with a recall of 0.98264, and, mhjabreel, with
a recall of 0.97471.</p>
        <p>An analysis of errors showed that some of the annotations in the Gold
Standard (GS) corpus were not detected by any of the systems (at least not exactly).
Some of them are listed here:
{ HOSPITAL: Hospital General de Agudos P. Pin~ero
{ FAMILIARES SUJETO</p>
        <sec id="sec-4-2-1">
          <title>ASISTENCIA: tres hermanos varones sordomudos y otro con baja vision</title>
          <p>{ OTROS SUJETO</p>
        </sec>
        <sec id="sec-4-2-2">
          <title>ASISTENCIA: estudiante de administracion de empresas</title>
          <p>On the contrary, some systems annotated entities that were not in the GS but
probably should be. For instance, "ex-operario de la industria textil " was
annotated as PROFESION</p>
          <p>by jiangdehuan, jimblair, and Jordi, but this annotation
was not in the GS.
MEDDOCAN: Automatic de-identi cation of medical texts in Spanish
One of the primary goals of this track was to develop systems capable of
completely de-identifying sensitive information from clinical documents. However,
none of submitted systems managed to obfuscate all the sensitive information.
In this section, we present two experiments we performed that evaluated the
performance of combined systems to de-identify the test dataset without leaks.
The rst experiment was based on a joint system, the second experiment, on a
voting system.</p>
          <p>Joint system The goal of this experiment was to nd the combination of
individual systems that achieved the best possible performance. For this, rst,
we ranked all the systems by F-score, and then we joined the annotations of the
two best system. If the performance of the Joint system improved, we continued
with the next best system, if not, we kept the previous system (or the previous
joint system). We repeated this until no systems were left. We measured the
performance of the joint system using three metrics:
1. Best F1: If the F-score of the joint system improved when we added the
annotations from the next system, we updated the joint system with the
new one. If the F-score did not improve, but it was maintained and the
recall was better, we also updated the joint system with the new one (same
F-score, better recall, worse precision).
2. Best Recall: If the recall of the joint system improved, we updated the joint
system, regardless of the drop in the F-score. It tried to maximize the chances
of completely de-identifying the documents.
3. Balanced: If the recall of the joint system improved, we updated the joint
system only if the decrease of the F-score was at much four times the increase</p>
          <p>MEDDOCAN: Automatic de-identi cation of medical texts in Spanish</p>
          <p>Marimon et al.
of the recall. That it, for every point of increase in recall, we allowed 4 point
of decrease in F-score, but not more. It tried to increase the recall, but
without hurting the F-Score too much.</p>
          <p>The systems that were used to achieve the best results for these metrics were
the following:
{ Best F1:
{ Recall:
lukas.lange/run3 improves the F-score from 0 a 0.96961.
lukas.lange/run2 improves the F-score from 0.96961 a 0.96997.
lukas.lange/run1 improves the F-score from 0.96997 a 0.97033.
lukas.lange/run3 improves the recall from 0 to 0.96944.
lukas.lange/run2 improves the recall from 0.96944 to 0.97209.
lukas.lange/run1 improves the recall from 0.97209 to 0.97492.
lukas.lange/run4 improves the recall from 0.97492 to 0.97562.
Fadi/15-7 improves the recall from 0.97562 to 0.97898.</p>
          <p>Fadi/14-5 improves the recall from 0.97898 to 0.97951.</p>
          <p>Fadi/17-3 improves the recall from 0.97951 to 0.98022.</p>
          <p>Fadi/16-3 improves the recall from 0.98022 to 0.98039.
nperez/ncrfpp improves the recall from 0.98039 to 0.98181.</p>
          <p>FSL/run1 improves the recall from 0.98181 to 0.98393.</p>
          <p>FSL/run2 improves the recall from 0.98393 to 0.9841.
nperez/sp-test-03-empty improves the recall from 0.9841 to 0.98516.
mhjabreel/run3 improves the recall from 0.98516 to 0.98551.
mhjabreel/run2 improves the recall from 0.98551 to 0.98569.
jiangdehuan/run3 improves the recall from 0.98569 to 0.98693.
jiangdehuan/run2 improves the recall from 0.98693 to 0.9871.
jimblair/run2 improves the recall from 0.9871 to 0.98763.
jimblair/run3 improves the recall from 0.98763 to 0.98781.
jiangdehuan/run1 improves the recall from 0.98781 to 0.98816.
Jordi/run3 improves the recall from 0.98816 to 0.98869.</p>
          <p>vcotik/run5 improves the recall from 0.98869 to 0.98887.
{ Balanced:
lukas.lange/run3 improves the recall from 0 to 0.96944 (+0.96944)
without losing too much F-score: 0.96961 (-0.96961).
lukas.lange/run2 improves the recall from 0.96944 to 0.97209 (+0.00265)
without losing too much F-score: 0.96841 (0.00112).
lukas.lange/run1 improves the recall from 0.97209 to 0.97492 (+0.00283)
without losing too much F-score: 0.96647 (0.00194).</p>
          <p>Fadi/15-7 improves the recall from 0.97492 to 0.97863 (+0.00371)
without losing too much F-score: 0.96181 (0.00466).</p>
          <p>Fadi/17-3 improves the recall from 0.97863 to 0.97951 (+0.00088)
without losing too much F-score: 0.95868 (0.00313).
nperez/ncrfpp improves the recall from 0.97951 to 0.98128 (+0.00177)
without losing too much F-score: 0.95308 (0.00560).</p>
          <p>FSL/run1 improves the recall from 0.98128 to 0.98375 (+0.00247)
without losing too much F-score: 0.94342 (0.00966).</p>
          <p>MEDDOCAN: Automatic de-identi cation of medical texts in Spanish
Voting The combination of individual systems from the previous experiment
was done directly on the test set. It is very di cult for a given combination of
systems to be transferable from one data set to another. Therefore, it should
be taken as only an approximation of the upper bound that can be obtained
by combining individual systems. In this experiment, we combined the systems
using a voting scenario: we accepted as good the annotations that had predicted
by N systems.</p>
          <p>We created 50 systems for sub-track 1. The rst system accepted all the
annotations predicted by, at least, one of the systems, while the last one accepted
only the annotations that were predicted by, at least, 50 systems. The results of
this experiment is shown in Table 9. As expected, as the value of N increased (we
increased the number of required votes), the recall got worse and the precision
improved. The maximum value of F-score on the train and development sets was
obtained combining 17 systems (F-score of 0.9942). When we used the train and
development sets as train corpus to select the optimal value of N and used this
value on the test set, we obtained an F-score of 0.9757. This score was lower than
the best one that could be obtained (0.9768, with N = 23), but the di erence
was (in practice) negligible.</p>
          <p>Comparing the results of the two experiments, we see that the voting system
improved the joint system by 0.54 points. In addition, as we see in the Table
9, the values were very stable and a non-optimal choice of the value N did not
vary much the result. The negative part was that the voting scenario required
many systems to obtain this result (17 systems out of 63 had to agree in order
to accept an annotation), while the joint system was a combination of only 3
systems. The voting system matched the performance of the joint system when
N is 13, scoring 0.9701 (the joint system scored 0.9703) .</p>
          <p>For reasons of space, we do not include the results of this experiment for
sub-tracks 2A and 2B, but they showed a very similar behavior.
In this section we analyze the performance of the systems on the di erent data
sets. As we have said, the background set included, the train set and the
develMarimon et al.
opment set, which allowed us to measure the F-score of all the systems on the
train, development and test set, and to analyze their behavior.</p>
          <p>All the scores of this analysis are shown in table 10, where the drop column
indicates the di erence of performance in the test set with respect to the
development set (a negative value indicates a lower performance on the test set). There
were two teams that achieved a F-score of 1.0 in both train and development
set: jimblair (in all tracks) and m. domrachev (in sub-tracks 1 and 2A). The
former had a performance drop of 6.25 points, and the latter of 9.99 points in
the test set, probably because both systems of these competitors memorized the
train and development data, obtaining a perfect score, incurring in over tting.
This also suggested that they could have used the development set to train the
system, and not just to tune it.</p>
          <p>In contrast to this, we see that lukas.lange, which was rst team on the test
set for sub-track 1, was also the rst on the development set (without taking
into account those who had scored 1.0), but third on the train set (without
taking into account those who scored 1.0). The performance of their system only
dropped 0.14 points in the test set with respect to the development set. Probably
they used the train set to build the system and the development only for tuning,
not incurring in over tting. This demonstrated that the ability of the systems
to generalize was very important.</p>
          <p>Taking into account all the sub-tracks, the maximum performance drop was
su ered by m.domrachev, losing 9.99 points in sub-track 1. Without taking into
account those who had scores 1.0 on the development set, the system that lost
more points was the one submitted by Jordi, which lost 5.25 points on track
2B (0.33 points in sub-track 1 ,and 0.29 points in sub-track 2A). The next
participants with the highest loss of performance were VSP and FSL.</p>
          <p>The maximum improvement in the test set with respect to the development
set was 3.32 points, corresponding to the system submitted by jiangdehuan, in
track 2A.</p>
          <p>As a curiosity, ccolon scored exactly the same result on the development and
test set. However, its performance decreased with respect to the train set (by
3.77 points).
4</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Discussion</title>
      <p>The MEDDOCAN track attracted a considerable number of teams, not only from
Spain, but also from other countries, stressing the global interest in solving the
clinical data access hurdles and assuring patient data privacy requirements.
Compared to previous e orts for English, namely the i2b2 de-identi cation tracks,
MEDDOCAN could even reach a higher number of participation. It is
important to point out that the MEDDOCAN track bene ted signi cantly from the
experiences, setting and annotation process pioneered by the i2b2 e orts.</p>
      <p>
        In case of the 2006 i2b2 shared task [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ], a total of 7 teams participated in
the track, providing 16 systems. The ve best systems scored above 0.95 for the
entity detection track and equaled or exceeded an F-score of 0.95 for the
tokenbased evaluation. The 2014 i2b2 de-identi cation shared task [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] had 10 teams,
submitting 22 runs. The top team reached an F-score of 0.9360 for the entity
detection track, and 0.9611 for the evaluation based on tokens. It is important to
mention that in case of MEDDOCAN a synthetic corpus was used so the results
might not be directly comparable to i2b2. Also, it is well known that there is
a considerable variability in density, distribution and characteristics of sensitive
information even between di erent types of clinical records.
      </p>
      <p>De-identi cation is still a very hard task, because for the special
characteristics of clinical texts and the importance of recall, i.e. avoiding leakage of sensitive
information. The top three teams are above 0.96 in F-score, for the track based
on entity detection.</p>
      <p>The top scoring systems make use of the most cutting-edge NLP techniques,
i.e. exploiting Deep Learning. Their results are comparable to single manual
anonymization done by humans. Automatic anonymization with manual revision
to detect potential leakages might result in anonymized Spanish clinical records
that allow data redistribution. Nevertheless, a follow up task, using real EHRs
from various healthcare institutions, and assessing the practical user scenario
with experts in the loop would be desirable to quantify also cost reduction and
bene ts of the quality of anonymization strategies assisted by automated tools.
5</p>
    </sec>
    <sec id="sec-6">
      <title>Conclusions</title>
      <p>
        The results of the MEDDOCAN shared task and evaluation e ort on automatic
de-identi cation of sensitive information from texts in Spanish show that
advanced deep learning approaches in combination with rule based systems and
gazetteer resources can provide very competitive results when a high quality
manually labeled dataset is available. The construction of Gold Standard corpora
is key and require very detailed annotation guidelines and a carefully designed
corpus generation process with involvement of clinical domain experts. We
expect that such a corpus and evaluation will also be carried out for data in other
languages and that automatic anonymization and de-identi cation systems will
be bene cial beyond EHRs, such as medical surveys [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] or legal- nancial
documents [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. In order to improve the impact of future shared tasks on
anonymization, the involvement should not be limited to academic groups on language
technologies, but also directly data providers (health institutions), legal experts
and national and European institutions. For instance, the European Medicines
Agency (EMA) has launched a Technical Anonymisation Group (TAG)
consisting of a group of experts in data anonymisation to help further develop best
practices for the anonymisation of clinical reports. Moreover, we also would like
to stress the key importance of making the systems code or developed participant
tools accessible/available and the need to explore strategies to promote start-ups
and commercialization of solutions resulting from shared tasks and evaluation
campaigns.
      </p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgements</title>
      <p>We acknowledge the Encargo of Plan TL (SEAD) to CNIO and BSC for funding,
and the scienti c committee for their valuable comments and guidance. We would
also like to thank Siamak Barzegar for his help in setting up MEDDOCAN at
CodaLab, and Felipe Soares for input in preparing the manuscript and task.</p>
      <p>MEDDOCAN: Automatic de-identi cation of medical texts in Spanish</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Alfalahi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brissman</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dalianis</surname>
          </string-name>
          , H.:
          <article-title>Pseudonymisation of personal names and other phis in an annotated clinical swedish corpus</article-title>
          .
          <source>In: Third Workshop on Building and Evaluating Resources for Biomedical Text Mining (BioTxtM</source>
          <year>2012</year>
          )
          <article-title>Held in Conjunction with LREC</article-title>
          . pp.
          <volume>49</volume>
          {
          <issue>54</issue>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Amengol-Estape</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Soares</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marimon</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krallinger</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Pharmaconer tagger: a deep learning-based tool for automatically nding chemicals and drugs in spanish medical texts</article-title>
          .
          <source>Genomics &amp; Informatics</source>
          <volume>17</volume>
          (
          <issue>2</issue>
          ) (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Bick</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barreiro</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Automatic anonymisation of a new portuguese-english parallel corpus in the legal- nancial domain</article-title>
          .
          <source>Oslo Studies in Language</source>
          <volume>7</volume>
          (
          <issue>1</issue>
          ) (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Cristobal</surname>
            ,
            <given-names>R.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carrero</surname>
            ,
            <given-names>A.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carrasco</surname>
            ,
            <given-names>M.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rodr</surname>
            <given-names>guez</given-names>
          </string-name>
          , M.C.,
          <string-name>
            <surname>Mendez</surname>
            ,
            <given-names>J.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>de Mingo</surname>
            ,
            <given-names>M.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tello</surname>
            ,
            <given-names>J.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>de</surname>
            <given-names>Madariaga</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>R.S.</given-names>
            ,
            <surname>Serrano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.C.</given-names>
            ,
            <surname>Aza</surname>
          </string-name>
          ,
          <string-name>
            <surname>I.V.</surname>
          </string-name>
          , et al.:
          <article-title>Sistema anonimizador conforme a la norma une-en iso 13606 (</article-title>
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Fernandez-Aleman</surname>
            ,
            <given-names>J.L.</given-names>
          </string-name>
          , Sen~or,
          <string-name>
            <given-names>I.C.</given-names>
            ,
            <surname>Lozoya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.A.O.</given-names>
            ,
            <surname>Toval</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.</surname>
          </string-name>
          :
          <article-title>Security and privacy in electronic health records: A systematic literature review</article-title>
          .
          <source>Journal of biomedical informatics 46(3)</source>
          ,
          <volume>541</volume>
          {
          <fpage>562</fpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Garc</surname>
          </string-name>
          <article-title>a Sardin~a, L.: Automating the anonymisation of textual corpora (</article-title>
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Gaudet-Blavignac</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fou</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wehrli</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lovis</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>De-identi cation of french medical narratives</article-title>
          .
          <source>Swiss Medical Informatics</source>
          <volume>34</volume>
          (
          <issue>00</issue>
          ) (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Gentili</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hajian</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Castillo</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>A case study of anonymization of medical surveys</article-title>
          .
          <source>In: Proceedings of the 2017 International Conference on Digital Health</source>
          . pp.
          <volume>77</volume>
          {
          <fpage>81</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Grouin</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Neveol</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>De-identi cation of clinical notes in french: towards a protocol for reference corpus development</article-title>
          .
          <source>Journal of biomedical informatics 50</source>
          ,
          <volume>151</volume>
          {
          <fpage>161</fpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Hassan</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Domingo-Ferrer</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Soria-Comas</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Anonimizacin de datos no estructurados a travs del reconocimiento de entidades nominadas</article-title>
          . In: Actas de la XV Reunin Espaola sobre Criptologa y Seguridad de la Informacin - RECSI
          <year>2018</year>
          . pp.
          <volume>102</volume>
          {
          <issue>106</issue>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Intxaurrondo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marimon</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gonzalez-Agirre</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lopez-Martin</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rodriguez</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Santamaria</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Villegas</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krallinger</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Finding mentions of abbreviations and their de nitions in spanish clinical cases: The barr2 shared task evaluation results</article-title>
          .
          <source>In: IberEval@ SEPLN</source>
          . pp.
          <volume>280</volume>
          {
          <issue>289</issue>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Intxaurrondo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Perez-Perez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Perez-Rodr guez</surname>
          </string-name>
          , G.,
          <string-name>
            <surname>Lopez-Mart n</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Santamaria</surname>
            , J., de la Pena,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Villegas</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Akhondi</surname>
            ,
            <given-names>S.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valencia</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lourenco</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kralllinger</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The biomedical abbreviation recognition and resolution (barr) track: benchmarking, evaluation and importance of abbreviation recognition systems applied to spanish biomedical abstracts</article-title>
          .
          <source>SEPLN</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Mamede</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baptista</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dias</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Automated anonymization of text documents</article-title>
          .
          <source>In: 2016 IEEE Congress on Evolutionary Computation (CEC)</source>
          . pp.
          <volume>1287</volume>
          {
          <fpage>1294</fpage>
          .
          <string-name>
            <surname>IEEE</surname>
          </string-name>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Medina</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Turmo</surname>
          </string-name>
          , J.:
          <article-title>Building a spanish/catalan health records corpus with very sparse protected information labelled</article-title>
          .
          <source>In: LREC</source>
          <year>2018</year>
          : Workshop MultilingualBIO: Multilingual Biomedical Text Processing: proceedings. pp.
          <volume>1</volume>
          {
          <issue>7</issue>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Megyesi</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Granstedt</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Johansson</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Prentice</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosen</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , Schenstrom, C.J.,
          <string-name>
            <surname>Sundberg</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wiren</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Volodina</surname>
          </string-name>
          , E.:
          <article-title>Learner corpus anonymization in the age of gdpr: Insights from the creation of a learner corpus of swedish</article-title>
          .
          <source>In: Proceedings of the 7th workshop on NLP for Computer Assisted Language Learning</source>
          . pp.
          <volume>47</volume>
          {
          <issue>56</issue>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Mota</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mart n</surname>
          </string-name>
          , N.,
          <string-name>
            <surname>Moreno</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ferrete</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Santamar</surname>
            <given-names>a</given-names>
          </string-name>
          , J.,
          <string-name>
            <surname>Marimon</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Intxaurrondo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gonzalez-Agirre</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Villegas</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krallinger</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          : Gu as de anotacion de informacion de salud protegida (
          <year>Oct 2018</year>
          ), http://temu.bsc.es/meddocan/wp-content/uploads/2019/02/gu as-de
          <article-title>-anotacionde-informacion-de-salud-protegida</article-title>
          .pdf
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Pantazos</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lauesen</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lippert</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Preserving medical correctness, readability and</article-title>
          consistency in de-identi ed health records.
          <source>Health informatics journal 23(4)</source>
          ,
          <volume>291</volume>
          {
          <fpage>303</fpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Perez-Perez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Perez-Rodr guez</surname>
          </string-name>
          , G.,
          <string-name>
            <surname>Blanco-M guez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fdez-Riverola</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valencia</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krallinger</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lourenco</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Next generation community assessment of biomedical entity recognition web servers: metrics, performance, interoperability aspects of becalm</article-title>
          .
          <source>Journal of Cheminformatics</source>
          <volume>11</volume>
          (
          <issue>1</issue>
          ),
          <volume>42</volume>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Santamar</surname>
            <given-names>a</given-names>
          </string-name>
          , J.,
          <string-name>
            <surname>Krallinger</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Construccion de recursos terminologicos medicos para el espan~ol: el sistema de extraccion de terminos cutext y los repositorios de terminos biomedicos</article-title>
          .
          <source>Procesamiento del Lenguaje Natural</source>
          <volume>61</volume>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Scheurwegs</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Luyckx</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Van der Schueren</surname>
          </string-name>
          , F.,
          <string-name>
            <surname>Van</surname>
          </string-name>
          den Bulcke, T.:
          <article-title>Deidenti cation of clinical free text in dutch with limited training data: a case study</article-title>
          .
          <source>In: Proceedings of the Workshop on NLP for Medicine and Biology associated with RANLP</source>
          <year>2013</year>
          . pp.
          <volume>18</volume>
          {
          <issue>23</issue>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Stubbs</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kot la</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Uzuner</surname>
            ,
            <given-names>O</given-names>
          </string-name>
          .
          <article-title>: Automated systems for the de-identi cation of longitudinal clinical narratives: Overview of 2014 i2b2/uthealth shared task track 1</article-title>
          .
          <source>Journal of biomedical informatics 58 Suppl, S11{9</source>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Tomanek</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Daumke</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Enders</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huber</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Theres</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , Muller,
          <string-name>
            <surname>M.:</surname>
          </string-name>
          <article-title>An interactive de-identi ca-tion-system</article-title>
          .
          <source>In: Proceedings of SMBM 2012-The 5th International Symposium on Semantic Mining in Biomedicine</source>
          . pp.
          <volume>82</volume>
          {
          <issue>86</issue>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Tveit</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Edsberg</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rost</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Faxvaag</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nytro</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nordgard</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ranang</surname>
            ,
            <given-names>M.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grimsmo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Anonymization of general practioner medical records</article-title>
          . In: second HelsIT Conference (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Uzuner</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Luo</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Szolovits</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Evaluating the State-of-the-Art in Automatic De-identi cation</article-title>
          .
          <source>Journal of the American Medical Informatics Association</source>
          <volume>14</volume>
          (
          <issue>5</issue>
          ),
          <volume>550</volume>
          {
          <volume>563</volume>
          (09
          <year>2007</year>
          ). https://doi.org/10.1197/jamia.M2444, https://doi.org/10.1197/jamia.M2444
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Vico</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , et al.: De nicion de una arquitectura de referencia para anonimizar documentos (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Villegas</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Intxaurrondo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gonzalez-Agirre</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marimon</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krallinger</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The mespen resource for english-spanish medical machine translation and terminologies: census of parallel corpora, glossaries and term translations</article-title>
          .
          <source>In: Proceedings of the LREC 2018 Workshop MultilingualBIO: Multilingual Biomedical Text Processing</source>
          , Paris, France.
          <source>European Language Resources Association (ELRA)</source>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>