<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>MAMTRA-MED at CLEF eHealth 2018: A Combination of Information Retrieval Techniques and Neural Networks for ICD-10 Coding of Death Certi cates</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mario Almagro</string-name>
          <email>malmagro@lsi.uned.es</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Soto Montalvo</string-name>
          <email>soto.montalvo@urjc.es</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>A. D az de Ilarraza</string-name>
          <email>a.diazdeilarraza@ehu.eus</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>A. Perez</string-name>
          <email>alicia.perez@ehu.eus</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Universidad Nacional de Educacion a Distancia (UNED)</institution>
          ,
          <addr-line>Madrid 28040</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Universidad Rey Juan Carlos (URJC)</institution>
          ,
          <addr-line>Madrid 28933</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of the Basque Country UPV/EHU (IXA NLP)</institution>
          ,
          <addr-line>Bilbao 48013</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>University of the Basque Country UPV/EHU (IXA NLP)</institution>
          ,
          <addr-line>Donostia-San Sebastian 20018</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper describes the systems proposed by LSI UNED team in Task 1 of the CLEF eHealth 2018 challenge. The main objective is the automatic coding of death certi cates in French, Italian and Hungarian languages according to the ICD-10. This task has been tackled through supervised learning methods such as neural networks, and techniques based on Information Retrieval (IR) systems. The rst approach has been implemented by training one model for each of the most frequent ICD-10 codes in the corpus. For this purpose, a bag-of-words approach has been applied using the TF-BNS value for terms contained in death certi cate statements. As for the IR approach, Lucene has been used as a search engine, indexing dictionaries and the content of the death certi cates in the training corpus. Finally, a combination of both methods has been proposed to balance precision and recall, using the IR system for diseases not classi ed by any learning model. Similar F1 scores are obtained on the test datasets of each language by supervised methods and the combined system giving the latter greater recall values.</p>
      </abstract>
      <kwd-group>
        <kwd>ICD-10 Coding ICD-10 Codes Neural Networks Deep Learning CepiDC CLEF eHealth</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The amount of health text available in electronic format is immense |scienti c
papers, websites, forums, social networks or Electronic Health Records (EHRs)|
so managing health information to support medical decisions is no easy task. An
example of this complexity is the analysis of numerous clinical texts generated
by health care centres, which requires a large amount of resources that are often
unavailable. The 2018 CLEF eHealth Evaluation Lab [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] is intended to address
these challenges through di erent tasks aimed at facilitating access to health
information.
      </p>
      <p>
        Our proposals have focused on the resolution of the CLEF task [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] for the
ICD-10 coding of death certi cates in French, Italian and Hungarian. The
classication of clinical texts according to the International Classi cation of Diseases
(ICD) is one of the most pressing problems in hospital management due to its
statistical purposes for morbidity and mortality. The 10th version of this coding
assigns a unique identi er of between 3 and 7 alphanumeric symbols to disorders,
grouping together nearly 16,000 possible diagnoses with a wealth of nuances.
      </p>
      <p>
        As a possible resolution to the described task, di erent approaches supported
by supervised learning and search engines have been proposed in this paper.
Due to the nature of the ICD-10 codes, the data generally present a very biased
distribution, with a small set of frequently occurring diagnoses [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The same
distribution can be seen in the corpus used in this task, and therefore, it is
considered the combination of approaches that could try to maximize precision
|such as machine learning approximations| and methods that tend to
maximize recall |such as the search engines on which Information Retrieval (IR)
systems are built| could be of great interest. These two aspects will give rise
to joint proposals.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>In general, the approaches used in the state of the art for the recommendation
and assignment of ICD-10 codes can be divided into two groups: those based on
medical language processing (MLP) and those based on classi cation techniques.</p>
      <p>
        The rst ones use unsupervised techniques to nd correspondences between
the concepts in standard descriptions and health concepts identi ed through
medical knowledge bases and ontologies in health documents. For example, Ning
et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] apply an example-based model generated from a Chinese terminology
containing correspondences with 4-digit ICD-10 codes, thus taking advantage of
the hierarchical structure in the standard coding. Chen et al. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] explore semantic
similarity by applying the Longest Common Subsequence (LCS) method to the
diagnoses and names given by ICD-10 codes. Other systems following this trend
have participated in previous versions of the CLEF task [
        <xref ref-type="bibr" rid="ref10 ref3">3, 10</xref>
        ].
      </p>
      <p>
        On the other hand, the second approaches generate classi ers by using
supervised learning algorithms. Zweigenbaum and Lavergne [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] apply two classi ers:
one trained with a set of EHRs, and the other trained with di erent medical
dictionaries; Miftakhutdinov and Tutubalina [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] use word embeddings trained
from a corpus of medical user opinions, along with recurrent neural networks to
assign codes.
      </p>
      <p>
        At the same time, mixed approaches combining both methods can be found.
For example, Seva et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] use an IR approach to search for possible candidate
ICD-10 codes in di erent dictionaries, along with several classi ers to lter them.
Jatunarapit et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] employ English corpus-based classi ers and a set of IR
techniques to establish similarities with Thai terms.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Proposed Approach</title>
      <p>In this paper we have explored two alternative approaches for the assignment of
ICD-10 codes to death certi cates. On the one hand, a supervised approach is
proposed through Vector Support Machines (SVMs) and neural networks that
aims to take advantage of the training corpus by generating One-Vs-Rest (OVR)
models for the most frequent ICD-10 codes. The dependence on examples makes
this approach less robust in the face of the possibility of coding new diseases,
given the immense number of ICD-10 codes with little or no representation in
the corpus. For this reason, it is also proposed to complement learning models
with an unsupervised approach based on IR techniques to achieve greater recall.</p>
      <p>The machine learning approach is based on the training of a binary model for
each of the target ICD-10 codes, indicating the presence or absence of the code.
As this is a multiclass and multilabel problem, the coding of diseases is carried
out by grouping the results of all the binary classi ers. The particularization of a
model for each ICD code allows the processing of the data adapted to each class,
as will be seen later when applying the weighting with Bi-Normal Separation
(BNS). To implement these models, di erent con gurations have been developed
with linear SVMs and Multi-Layer Perceptrons (MLPs).</p>
      <p>The proposed IR approach consists of a search engine in which information
relating to codes has been indexed, both terminology from provided dictionaries
and associated sentences from training data. In this way, the coding of death
certi cates is reduced to the generation of queries based on their terms, choosing
the result with the highest score. As a drawback, retrieving a xed number of
results (in this case only one) implies the loss of the ability to adapt the number of
codes assigned to a line in death certi cates, which may contain several disorders.</p>
      <p>These two approaches yield di erent results. As expected in the
experimentation, while the supervised approach achieves higher precision values, a hybrid
method involving IR techniques ensures a better balance between the correct
coding rate and the number of di erent codes capable of coding.
4
4.1</p>
    </sec>
    <sec id="sec-4">
      <title>Experiments</title>
      <sec id="sec-4-1">
        <title>Datasets</title>
        <p>The training data is organized in three separate corpus, one for each language:
French, Italian and Hungarian. Although each corpus is structured in two modes
|aligned, for line-level annotation, and raw, for document-level annotation|,
line-level annotation is only available for the French corpus. For this reason, this
proposal has focused only on coding at document level for all three languages.</p>
        <p>Each corpus has di erent metadata on diagnoses, dictionaries with
equivalences between ICD-10 codes and terms, the text of death certi cates and
linelevel equivalences between its processed content and ICD-10 codes.</p>
        <p>Italian Hungarian French Total
Number of certi cates in training 14,502 84,702 65,843 165,047
Number of certi cates in test 3,617 21,175 24,375 49,167
Number of ICD-10 codes 60,954 392,019 527,940 980,913
Number of unique ICD-10 codes 1,442 3,123 3,829 5,011
Overlapping with Italian codes 100% 34% 30%
Overlapping with Hungarian codes 73% 100% 57%</p>
        <p>Overlapping with French codes 79% 70% 100%</p>
        <p>A general summary of the amount of data grouped by corpus is given in
Table 1. Although the Hungarian corpus contains more death certi cates, the
number of ICD-10 codes present in the French corpus far exceeds that of the
rest. As can be seen, most of the ICD-10 codes in the Italian corpus are also
present in some of the other corpora, with an average overlap of 76%. Given
this overlapping, a large part of the results achieved on the Italian corpus could
be considered extrapolable since the model is expected to behave similarly in at
least the same codes. For this reason, the experimentation shown in this paper
is only carried out on it, taking advantage of its lower volume.</p>
        <p>In terms of distribution, the frequency of codes follows a power law, with
most of the entries corresponding to a small group of codes. This implies that
a supervised approach alone has a more restrictive limit to improvement than
other techniques.
4.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Experimental Setting</title>
        <p>Regardless of the approach used, a common pre-processing has been developed.
A lowercase conversion and accent removal has been applied, as well as a stop
word lter and a stemming process for each language.</p>
        <p>Supervised approach Here the problem has been addressed through the
combination of di erent binary classi ers, each one determining the presence or
absence of a speci c code. Due to the scarcity of data in training collections on
some ICD-10 disorders, model generation has been limited to only those ICD
codes that appear more than a certain number of times. With this it is
understood that the rest of the ICD codes (those absent in the corpora or with little
presence) cannot be represented by supervised models since data lack su cient
examples with which to abstract the corresponding patterns.</p>
        <p>In order to nd the con guration that best suits this task, multilayer
perceptrons with di erent numbers of neurons and hidden layers have been
implemented, as well as variations of the rest of the hyperparameters. In addition,
linear SVMs have been trained to compare the e ciency of both models in
ICD10 coding.</p>
        <p>As for the input data, once the pre-processing has been applied, di erent
textual representations have been used. On the one hand, the models have been
generated with the frequency of terms weighted with Inverse Document Frequency
(IDF) and Bi-Normal Separation (BNS) values. IDF is calculated at document
level, determining the relevant terms based on the number of documents in which
they appear. This may penalize those terms relevant to a class but too frequent.
For this reason, the BNS feature is introduced in the experimentation, since it
estimates the representation of terms at class level, avoiding this type of error. This
weight is de ned as BN S = jIcdf ( P (W j class +) ) Icdf ( P (W j class ) )j,
where Icdf is the inverse cumulative distribution function, P (W j class +) is the
probability of nding a word in the positive classes and P (W j class ) is the
probability of nding a word in the negative classes. Based on both measures,
n-grams of two and three words have been considered. On the other hand, a
feature ltering has been performed using Chi-Square ( ~2). In Table 2 di erent
con gurations implemented in the experimentation are presented, which include
the di erent options mentioned above. The structure of the MLPs shown consists
of 4 hidden layers and 80 neurons each.
Unsupervised IR approach This approach uses Lucene as the search engine.
To enrich the indexes, the diseases present in the CepiDC dictionaries have been
added as well as the content of the death certi cates in the training corpus for
each language. The aim is to make each of the possible ICD-10 codes accessible
through a set of descriptions and associated terms. There are fewer descriptions
for less common codes, so it has been decided to remove duplicate descriptions
during indexing to avoid penalties. In addition, it has been considered to
include the o cial description of codes as it appears in the ICD standard, taking
advantage of the electronic versions provided by some governments.</p>
        <p>Each query has been generated from the terms contained in each line of the
death certi cates. As it is a multilabel problem, the number of classes assigned
to a document line varies. Since Lucene's output consists of a ranking of results,
the evaluation has been carried out according to the number of results chosen
for each query (1 or 2). The di erent con gurations are presented in Table 3.
Combined approach The method based on the combination of supervised
models together with search engine aims to take advantage of the e ectiveness
of the rst ones with an increase in robustness for codes that are absent or
hardly present in the training corpus. Since less common diseases |no learning
model| should be left without ICD-10 code assigned after applying multiple
trained models, it seems reasonable to use search engines only with those death
certi cate statements not classi ed by the supervised approach. Table 4 shows
the combinations of learning model and IR system con gurations that give the
best results.
The results of the con gurations are only shown in the Italian corpus, as this
represents to a large extent the type of disorders present in the other corpora. This
choice is based on the lower number of certi cates and the higher percentage of
common diseases in other corpora. Di erent con gurations have been evaluated
using a k-fold cross-validation of 5 folds and a 94/6 split. The results are shown
in Table 5.</p>
        <p>The use of o cial descriptions in the IR system worsens both Precision and
Recall, which could be an indication of how di erent the diagnoses in practice and
descriptions in the standard are. Thus, although the use of o cial descriptions
does not seem advisable in itself due to noise, it would be interesting to use
synonyms to enrich them. The con guration with the highest F1 score is the
combination of MLPs models to assign ICD-10 codes with frequencies greater
than 100 occurrences on the training corpus, and search engines selecting only
the result with the highest a nity. The models chosen have been trained with
the Tf-BNS of the 1,000 most relevant features.</p>
        <p>
          Finally, the proposals S5 and S16 |called LSI UNED-run2 and LSI
UNEDrun1 respectively|- have been used on the o cial test dataset provided by
each language. S6 has been chosen as the system o ering the best results in the
supervised approach. In [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] you can see the ranking of the task published. Our
results are shown in Table 6.
        </p>
        <p>In principle, it appears from this data that combining IR techniques with
supervised approaches decreases the Precision value to the same extent as it
increases the Recall value, so the F1 score does not change. Nevertheless, in our
opinion the combination of both approaches results in a more robust system
compared to other possible distributions with a greater number of infrequent
codes, so it would be preferable to a single approach based on Machine Learning.
Di erent methods have been proposed for the automatic coding of diseases
according to the ICD-10 standard. Although a supervised learning approach seems
an appropriate solution at rst glance, we understand that in a distribution as
complex as the one presented by the data, it is necessary to extend these models
with other techniques that o er greater coverage, such as the IR approach.</p>
        <p>The development of automatic systems for coding death certi cates can
provide a major boost to health administrations in managing their resources. And
to this end, the results published in the CLEF task seem promising.</p>
        <p>One of the main problems of natural language processing in health scope
is multilingualism, since it is a very broad and specialized domain, and at the
same time it requires a large amount of textual resources that do not yet exist for
certain languages. Therefore, in the near future we hope to limit the dependence
on these textual resources by improving IR techniques and advance in di erent
ways of combining the methods described.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgements</title>
      <p>This work has been supported by the Spanish Ministry of Science and Innovation
MAMTRA-MED Project (TIN2016-77820-C3-2-R).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Almagro</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mart</surname>
            <given-names>nez</given-names>
          </string-name>
          , R.,
          <string-name>
            <surname>Fresno</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Montalvo</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Estudio preliminar de la anotacion automatica de codigos CIE-10 en informes de alta hospitalarios</article-title>
          .
          <source>Procesamiento del Lenguaje Natural, Revista no 60</source>
          , pp.
          <fpage>45</fpage>
          -
          <lpage>52</lpage>
          . DOI 10.26342/2018- 60-5.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Automatic ICD-10 coding algorithm using an improved longest common subsequence based on semantic similarity</article-title>
          .
          <source>In PloS one</source>
          , vol.
          <volume>12</volume>
          (
          <issue>3</issue>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Ho-Dac</surname>
            ,
            <given-names>L. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fabre</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Birski</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boudraa</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bourriot</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cassier</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Delvenne</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garcia-Gonzalez</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kang</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Piccinini</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rohrbacher</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Seguier</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>LITL at CLEF eHealth2017: automatic classi cation of death reports</article-title>
          .
          <article-title>CLEF 2017 Evaluation Labs</article-title>
          and Workshop: Online Working Notes, CEUR-WS.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Jatunarapit</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Piromsopa</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Charoeanlap</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>Development of thai textmining model for classifying ICD-10 TM</article-title>
          .
          <source>In Proceedings of ECAI 2016</source>
          , pages
          <issue>1</issue>
          {
          <fpage>6</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Miftakhutdinov</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tutubalina</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Kfu at clef ehealth 2017 task 1: Icd10 coding of english death certi cates with recurrent neural networks</article-title>
          .
          <article-title>CLEF 2017 Evaluation Labs</article-title>
          and Workshop: Online Working Notes, CEUR-WS.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Neveol</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Robert</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grippo</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lavergne</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morgand</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Orsi</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pelikan</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ramadier</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rey</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zweigenbaum</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>CLEF eHealth 2018 Multilingual Information Extraction task Overview: ICD10 Coding of Death Certi cates in French, Hungarian and Italian</article-title>
          .
          <article-title>CLEF 2018 Evaluation Labs</article-title>
          and Workshop: Online Working Notes, CEUR-WS.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Ning</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>A hierarchical method to automatically encode Chinese diagnoses through semantic similarity estimation</article-title>
          .
          <source>In BMC Medical Informatics and Decision Making</source>
          , vol.
          <volume>1</volume>
          :
          <issue>16</issue>
          {
          <fpage>30</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Seva</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kittner</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roller</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leser</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Multi-lingual ICD-10 coding using a hybrid rule-based and supervised classi cation approach at CLEF eHealth 2017</article-title>
          .
          <article-title>CLEF 2017 Evaluation Labs</article-title>
          and Workshop: Online Working Notes, CEUR-WS.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Suominen</surname>
            , Hanna, Kelly, Liadh, Goeuriot, Lorraine, Kanoulas, Evangelos, Azzopardi, Leif, Spijker, Rene, Li, Dan, Neveol, Aurelie, Ramadier, Lionel, Robert, Aude, Palotti, Joao, Jimmy, Zuccon,
            <given-names>Guido.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Overview of the CLEF eHealth Evaluation Lab 2018</article-title>
          .
          <source>CLEF 2018 - 8th Conference and Labs of the Evaluation Forum, Lecture Notes in Computer Science (LNCS)</source>
          , Springer.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>van Mulligen</surname>
            ,
            <given-names>E. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Afzal</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Akhondi</surname>
            ,
            <given-names>S. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vo</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kors</surname>
            ,
            <given-names>J. A.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <source>Erasmus MC at CLEF eHealth</source>
          <year>2016</year>
          :
          <article-title>Concept Recognition and Coding in French Texts</article-title>
          .
          <article-title>CLEF 2017 Evaluation Labs</article-title>
          and Workshop: Online Working Notes, CEUR-WS.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Zweigenbaum</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lavergne</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Multiple methods for multi-class, multilabel ICD-10 coding of multi-granularity, multilingual death certi cates</article-title>
          .
          <article-title>CLEF 2017 Evaluation Labs</article-title>
          and Workshop: Online Working Notes, CEUR-WS.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>