<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Fusion Methods for ICD10 Code Classi cation of Death Certi cates in Multilingual Corpora</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mike Ebersbach</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Robert Herms</string-name>
          <email>robert.herms@cs.tu-chemnitz.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maximilian Eibl</string-name>
          <email>maximilian.eibl@cs.tu-chemnitz.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Chair Media Informatics, Chemnitz University of Technology</institution>
          ,
          <addr-line>09107 Chemnitz</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this working notes paper, we present our methodology and the results for Task 1 of the CLEF eHealth Evaluation Lab 2017. This benchmark addresses information extraction in written text with focus on unexplored languages corpora, speci cally English and French. The goal is to automatically assign codes (ICD10) to text content of death certi cates. Our approach is focused on fusion methods in conjunction with support vector machines for ICD10 code classi cation. First, we composed a large scale feature set comprising more than 40k features based on bag of words, bag of 2-grams, bag of 3-grams, latent Dirichlet allocation, and the ontologies of WordNet and UMLS. In the development phase, we evaluated three di erent methods: each feature type separately (no fusion), early feature-level fusion, and late fusion including the rules majority vote, maximum, and average. For the English test set, the best F-measure was 0.8187 using early fusion. For the two French test sets, we achieved 0.6692 and 0.7216 using late fusion in connection with the rule average for bag of words and bag of 2-grams.</p>
      </abstract>
      <kwd-group>
        <kwd>Natural language processing</kwd>
        <kwd>Clinical texts</kwd>
        <kwd>ICD10 coding</kwd>
        <kwd>Death certi cates</kwd>
        <kwd>Machine learning</kwd>
        <kwd>Fusion</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The amount of digital medical documents expands over the years, which is a
major challenge regarding data processing and management in clinical institutions.
However, state-of-the-art technologies can assist work ows including verbal
handover supplemented with written material. For instance, the work of [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] applied
automatic speech recognition to transform verbal clinical information into
written free-text records. These records can then be structured by automatically
identifying relevant text-snippets (e.g., [2{4]). A further aspect in hospitals and
clinical institutions involves the assignment of ICD codes to reports of diseases,
disorders, injuries and other related health conditions. ICD { the International
Classi cation of Diseases system { is published by the World Health
Organisation (WHO). Some previous work has been done for the processing of medical
text corpora in conjunction with ICD codes (e.g., [5{9]). In this context, the
CLEF eHealth Evaluation Lab 2017 [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] aims to ease patients and nurses in
understanding and accessing eHealth information. Task 1 (Multilingual
Information Extraction - ICD10 coding) [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] of this benchmark addresses information
extraction in written text with focus on unexplored languages corpora, speci
cally English and French. The goal of this task is to automatically assign codes
(ICD10) to text content of death certi cates. This challenge can be regarded as
a classi cation task.
      </p>
      <p>
        In this working notes paper, we present our methodology and the results for
Task 1 of the CLEF eHealth Evaluation Lab 2017. Our approach is focused on
the investigation of fusion methods for multilingual text classi cation
regarding ICD10 codes. Hence, we implemented di erent fusion techniques to evaluate
which method leads to the best result in conjunction with support vector
machines. First, we composed a large scale feature set comprising more than 40k
features based on the types bag of words, bag of 2-grams, bag of 3-grams, latent
Dirichlet allocation [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], and the ontologies of WordNet [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] and UMLS [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. In
the development phase, we evaluated three di erent methods: each feature type
separately (no fusion), early feature-level fusion, and late fusion including the
rules majority vote, maximum, and average.
      </p>
      <p>This paper is organized as follows: In the next section, we introduce the
dataset. Our approach including feature extraction and fusion methods are
proposed in Section 3. In Section 4, the experimental setup and the evaluation
results are described. Finally, we conclude the paper in Section 5 and give some
future directions.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Dataset</title>
      <p>The used dataset is divided into two parts regarding the language: the CepiDc
corpus (French) and the CDC corpus (English). The documents comprise
freetext descriptions of causes of death as reported by physicians in standardized
forms. Each document was manually labeled with one or more ICD10 codes.
Two di erent formats are considered, the so called raw and aligned format. For
English, only the raw format is included whereas the French version consists
of the raw and the aligned format. Altogether, we used all three di erent data
subsets for the evaluation. The data is partitioned into training sets (English
raw with 1,073 classes and French aligned with 3,232 classes), development sets
(English raw with 663 classes and French aligned with 2,363 classes), and test
sets (English raw, French raw, and French aligned).
3</p>
    </sec>
    <sec id="sec-3">
      <title>Methods</title>
      <p>In this work, ICD10 code assignment to text content of death certi cates is
regarded as a classi cation task. Machine learning is performed using support
vector machine (SVM). Moreover, each language is treated separately, i.e.,
training, development, and testing is performed on the basis of the same language.
The following subsections introduce the features and the applied fusion methods.
3.1</p>
      <sec id="sec-3-1">
        <title>Feature Extraction</title>
        <p>In the preprocessing phase, all terms were stemmed and transformed to lower
case and all special characters like punctuation or brackets were removed. For
the French dataset, we transformed typical su xes to their English counterpart.
Furthermore, infrequent terms were removed to reduce the number of features.
Subsequently, the following features were extracted:
{ Bag of n-grams: The tf-idf (term frequency - inverse document frequency)
of the terms from all documents is calculated. Feature vectors were created
for bag of words (about 9k features for the French corpus and about 2k for
the English corpus), bag of 2-grams, and bag of 3-grams (both with about
14k features for the French and about 3k features for the English corpus).
{ Latent Dirichlet allocation (LDA) features: Similarities between the
documents were determined by categorizing them to a preset number of
topics. The con dence values of the topic assignments were used as features.</p>
        <p>
          For our experiments we used a number of 20 topics.
{ WordNet features: Related terms of words in the documents were
extracted to enrich the feature set with semantic information. In more detail,
the rst synonym and hypernym of a word (noun, verb, adjective, and
adverb) ranked by WordNet was added to the feature set. The search was
repeated concerning hypernyms to nd more general hypernyms which were
also added to the feature set. In summary, 2,784 features were extracted for
the French dataset and 1,704 features for the English dataset.
{ UMLS features: Semantic types of health vocabulary were extracted from
the Uni ed Medical Language System (UMLS) using MetaMap [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. There
are 133 semantic types described in the UMLS. As not all types appear in
the dataset, we considered a subset of 107 types (features). A feature vector
was then created where each feature represents the number of search results
for a particular semantic type.
3.2
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>Fusion</title>
        <p>We implemented an analysis framework to investigate two fusion methods: early
fusion to combine features before classi cation and late fusion to combine the
outputs after classi cation. These fusion methods are illustrated in Fig. 1.</p>
        <p>Early fusion is performed on feature-level. In this case, the feature vectors
from di erent sources are concatenated into one large feature vector which will
then be used for classi cation. As this vector consists of many features, training
and classi cation time will increase. However, a large scale feature vector in
conjunction with suitable learning methods can lead to much better performance
in the end. Furthermore, only one learning phase is needed.</p>
        <p>
          Late Fusion (or decision-level fusion) indicates combining the outputs after
classi cation. This process predicts the nal output by considering the
individual labels (hard level) or scores (soft level) of the involved classi ers [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. The
following decision rules were used: majority vote (most represented class label),
maximum (class label with the highest con dence), and average (class label with
the highest averaged con dence).
        </p>
        <sec id="sec-3-2-1">
          <title>Bag of Words</title>
        </sec>
        <sec id="sec-3-2-2">
          <title>Bag of 2-Grams</title>
        </sec>
        <sec id="sec-3-2-3">
          <title>Bag of 3-Grams LDA</title>
        </sec>
        <sec id="sec-3-2-4">
          <title>UMLS</title>
        </sec>
        <sec id="sec-3-2-5">
          <title>WordNet Early Fusion SVM SVM</title>
          <p>SVM
SVM
SVM
SVM
SVM</p>
        </sec>
        <sec id="sec-3-2-6">
          <title>Late Fusion</title>
          <p>In this section, we describe the setup for the experiments. Afterwards, we report
the results obtained using the six feature types and the fusion methods early
feature-level fusion and late fusion.
The system performance is assessed by precision, recall, and F-measure (F1)
for ICD10 code assignment. For development, we used only the F1 score as a
reference for the best methods.</p>
          <p>
            Classi cation was performed using SVM; the LIBLINEAR library [
            <xref ref-type="bibr" rid="ref17">17</xref>
            ] is
used for model training. In the development phase we optimized the complexity
parameter C of the SVM classi er only for early fusion. The goal is to observe
the generalization performance of the classi er. We used six di erent values of
C (1, 0.1, 0.01, 0.001, 0.0001, 0.00001). The evaluation of other methods was
performed using complexity C = 1.
          </p>
          <p>The two best performing methods (no fusion, early fusion, or late fusion)
of each dataset version (language) in the development phase are applied on the
corresponding test set.
A series of experiments was carried out for the automatic classi cation of ICD10
codes in medical text corpora. Table 1 summarizes the development set results
for the English and French dataset. Although our criterion for the selection of
the best two methods of each language is the F1 score, the results of precision
and recall are shown for comparison purposes.</p>
          <p>In the feature type experiments without fusion, the best F1 results were
obtained by bag of 2-grams (Bo2G) for both languages; 0.7694 for English and
0.7667 for French. In contrast, the highest recall measure for French (0.6796)
was achieved with bag of words (BoW).</p>
          <p>For early fusion, the best F1, precision, and recall measures were obtained
using SVM complexity C = 1 concerning both languages (F1 is 0.7847 for English
and 0.7549 for French). With C &lt; 1, the values are too small which results in
over-generalization, i.e., under tting of the SVM model.</p>
          <p>The late fusion scheme has been applied to all feature types. Additionally,
the top three feature types were selected to investigate the results without
features that have a low classi cation performance (threshold is F1 = 0.7). As a
consequence, the top three features types are bag of words (BoW), bag of
2grams (Bo2G), and bag of 3-grams (Bo3G). For the English language, the best
F1 score is 0.7684 using BoW+Bo2G+Bo3G in connection with majority vote.
However, the best precision with 0.8807 was achieved using BoW+Bo2G+Bo3G
and the rule average. In case of French, BoW+Bo2G and the rule average was
superior with a F1 score of 0.7775 whereas the best precision with 0.8931 was
obtained using BoW+Bo2G+Bo3G and the rule maximum.</p>
          <p>The two best performing methods of each language in the development phase
were then applied on the corresponding test sets. The results are shown in Table
2. The main evaluation reference for the task refers to all ICD10 codes.
Additionally, external causes, characterized by the codes V01 to Y98, are considered
as a secondary reference. In this case, the evaluation addresses a speci c type of
deaths such as violent deaths which are avoidable.</p>
          <p>Regarding the English test set, the best method was early fusion which
achieved a F1 score of 0.8187 (all ICD codes) and 0.2914 (external causes).
For the French test set, the highest F1 score was obtained using late fusion of
BoW+Bo2G in connection with the rule average (raw format: 0.6692 for all ICD
codes and 0.4232 for external cases; aligned format: 0.7216 for all ICD codes and
0.4515 for external cases). However, the best results for the French test set are
non-o cial, because they were submitted after the task deadline. Consequently,
as shown in Table 2, the only o cial result for the French test set is obtained
using the feature type Bo2G with a F1 score of 0.7191 (all ICD codes) and 0.4450
(external causes).
5</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusions</title>
      <p>We presented our methodology for Task 1 of the CLEF eHealth Evaluation Lab
2017 where the goal is to automatically assign codes (ICD10) to text content of
death certi cates. The corpus is made of two versions regarding the language:
English and French.</p>
      <p>Our approach is focused on fusion methods in conjunction with support
vector machines for ICD10 code classi cation. We composed a set of features based
on bag of words, bag of 2-grams, bag of 3-grams, latent Dirichlet allocation, and
the ontologies of WordNet and UMLS. Three di erent methods were evaluated:
each feature type separately (no fusion), early feature-level fusion, and late
fusion. For the English test set, the best F-measure was 0.8187 using early fusion.
For the two French test sets, we achieved 0.6692 and 0.7216 using late fusion in
connection with the rule average for bag of words and bag of 2-grams.</p>
      <p>However, further improvements could be achieved by more knowledge bases
and other appropriate features from the eld of Natural Language Processing.
Moreover, the holistic system could bene t from other machine learning methods
such as arti cial neural networks, Naive Bayes, or k-nearest neighbors. Finally,
fusion schemes can be optimized by input weights and the consideration of
correlations between the inputs.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Herms</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Richter</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eibl</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ritter</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Unsupervised language model adaptation using utterance-based web search for clinical speech recognition</article-title>
          .
          <article-title>CLEF 2015 Evaluation Labs</article-title>
          and Workshop: Online Working Notes, CEUR-WS,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Song</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Luo</surname>
          </string-name>
          , G.:
          <article-title>ECNU at 2016 eHealth Task 1: Handover Information Extraction</article-title>
          .
          <article-title>CLEF 2016 Evaluation Labs</article-title>
          and Workshop: Online Working Notes, CEUR-WS,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Quiroz</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mennes</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dehghani</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kanoulas</surname>
          </string-name>
          , E.:
          <article-title>Distributional Semantics for Medical Information Extraction</article-title>
          .
          <article-title>CLEF 2016 Evaluation Labs</article-title>
          and Workshop: Online Working Notes, CEUR-WS,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Ebersbach</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Herms</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lohr</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eibl</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Wrappers for Feature Subset Selection in CRF-based Clinical Information Extraction</article-title>
          .
          <article-title>CLEF 2016 Evaluation Labs</article-title>
          and Workshop: Online Working Notes, CEUR-WS,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Dermouche</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Looten</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Flicoteaux</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chevret</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Velcin</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taright</surname>
          </string-name>
          , N.:
          <article-title>ECSTRA-INSERM@ CLEF eHealth2016-task 2: ICD10 code extraction from death certi cates</article-title>
          .
          <article-title>CLEF 2016 Evaluation Labs</article-title>
          and Workshop: Online Working Notes, CEUR-WS,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Mottin</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gobeill</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mottaz</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pasche</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gaudinat</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ruch</surname>
            ,
            <given-names>P.:</given-names>
          </string-name>
          <article-title>BiTeM at CLEF eHealth Evaluation Lab 2016 Task 2: Multilingual Information Extraction</article-title>
          .
          <article-title>CLEF 2016 Evaluation Labs</article-title>
          and Workshop: Online Working Notes, CEUR-WS,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Zweigenbaum</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lavergne</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>LIMSI ICD10 coding experiments on CepiDC death certi cate statements</article-title>
          .
          <article-title>CLEF 2016 Evaluation Labs</article-title>
          and Workshop: Online Working Notes, CEUR-WS,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Cabot</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Soualmia</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dahamna</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Darmoni</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>SIBM at CLEF eHealth Evaluation Lab 2016: Extracting Concepts in French Medical Texts with ECMT and CIMIND</article-title>
          .
          <article-title>CLEF 2016 Evaluation Labs</article-title>
          and Workshop: Online Working Notes, CEUR-WS,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Lohr</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Herms</surname>
          </string-name>
          , R.:
          <article-title>A Corpus of German Clinical Reports for ICD and OPS-based Language Modeling</article-title>
          .
          <source>Proceedings of the Controlled Language Applications Workshop (CLAW) at LREC</source>
          <year>2016</year>
          , pages
          <fpage>20</fpage>
          {
          <fpage>23</fpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Suominen</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kelly</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goeuriot</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Neveol</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Robert</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kanoulas</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Spijker</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zuccon</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Palotti</surname>
          </string-name>
          , J.:
          <article-title>Overview of the CLEF eHealth Evaluation Lab 2017</article-title>
          .
          <source>CLEF 2017 - 8th Conference and Labs of the Evaluation Forum, Lecture Notes in Computer Science (LNCS)</source>
          , Springer, September,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Neveol</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Anderson</surname>
            ,
            <given-names>R. N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cohen</surname>
            ,
            <given-names>K. B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grouin</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lavergne</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rey</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Robert</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zweigenbaum</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <string-name>
            <surname>CLEF eHealth 2017 Multilingual Information</surname>
          </string-name>
          <article-title>Extraction task overview: ICD10 coding of death certi cates in English and French</article-title>
          .
          <article-title>CLEF 2017 Evaluation Labs</article-title>
          and Workshop: Online Working Notes, CEUR-WS,
          <year>September</year>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Blei</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ng</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jordan</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Latent dirichlet allocation</article-title>
          .
          <source>Journal of machine Learning research</source>
          ,
          <volume>3</volume>
          (Jan):
          <volume>993</volume>
          {
          <fpage>1022</fpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Miller</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>WordNet: a lexical database for English</article-title>
          .
          <source>Communications of the ACM</source>
          ,
          <volume>38</volume>
          (
          <issue>11</issue>
          ):
          <volume>39</volume>
          {
          <fpage>41</fpage>
          ,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Bodenreider</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>The uni ed medical language system (UMLS): integrating biomedical terminology</article-title>
          .
          <source>Nucleic acids research</source>
          ,
          <volume>32</volume>
          (
          <issue>suppl 1</issue>
          ):D267{
          <fpage>D270</fpage>
          , Oxford Univ Press,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Aronson</surname>
          </string-name>
          , A.:
          <article-title>E ective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program</article-title>
          .
          <source>Proceedings of the AMIA Symposium</source>
          , American Medical Informatics Association,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Castellano</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gunes</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peters</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schuller</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Multimodal a ect recognition for naturalistic human-computer and human-robot interactions</article-title>
          .
          <source>The Oxford handbook of a ective computing</source>
          , Oxford University Press, USA,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Fan</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chang</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hsieh</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>LIBLINEAR: A library for large linear classi cation</article-title>
          .
          <source>Journal of machine learning research</source>
          ,
          <volume>9</volume>
          (Aug):
          <year>1871</year>
          {
          <year>1874</year>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>