<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>SIBM at CLEF e-Health Evaluation Lab 2015</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Lina F. Soualmia</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Chloé Cabot</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Badisse Dahamna</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stéfan J. Darmoni</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>French National Institute for Health</institution>
          ,
          <addr-line>INSERM, LIMICS UMR-1142</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Normandie Univ., SIBM - TIBS - LITIS EA 4108, Rouen University and Hospital</institution>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper, we report on our participation in the clinical named entity recognition task of the CLEF eHealth 2015 evaluation initiative i.e. to fully automatically identify clinically relevant entities in medical text in French. We address the task by using several biomedical knowledge organization systems (KOS) containing terms and their variations already in French or that we have partially translated in the context of existing projects. The extraction method is available online in the form a web-based service that requests the KOS to extract clinical concepts from Electronic Health Records. It is also available via a user-friendly interface developed for clinicians. Our system has not obtained good results in inexact matching against the gold standard. However, this first participation allowed us to analyze our system and method and will allow us to improve it.</p>
      </abstract>
      <kwd-group>
        <kwd>Information extraction</kwd>
        <kwd>Bagging</kwd>
        <kwd>Lexical semantics</kwd>
        <kwd>Natural Language Processing</kwd>
        <kwd>Information storage and retrieval</kwd>
        <kwd>Vocabulary controlled</kwd>
        <kwd>Systematized Nomenclature of Medicine</kwd>
        <kwd>Medical Subject Headings</kwd>
        <kwd>International Classification of Diseases</kwd>
        <kwd>Unified Medical Language System</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        With the increasing development of Electronic Health Records (EHRs) in hospitals
and healthcare institutions [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], the amount of clinical documents, such as discharge
summaries, in electronic format is also growing [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. The retrieval of such documents
is important in clinical and research tasks such as cohort studies or decision support in
personalized medicine, a medicine tailored to each patient by considering genomic
and clinical contexts of individuals. Indeed, these clinical documents are not only
important to clinicians in daily use but also valuable to researchers and administrators.
EHRs generate large amount of data that offer new opportunities to gain insight into
clinical care. Particularly, EHR repositories enable to compose patient cohorts for the
study of clinical hypotheses, hard to test experimentally, such as for example
individual variability in drug responses. However, to compose those cohorts, efficient and
user-friendly information retrieval systems are needed. To improve the performance
of these systems, it is mandatory to develop an automatic indexing system that gives
as output the representative index of an EHR. The latter should be represented by
clinical related terms even if the discharge summaries are composed by free terms.
      </p>
      <p>
        Since 1995, the department of BioMedical Informatics of the Rouen University
Hospital (SIBM; URL: www.cismef.org) is working on developing tools to access
health knowledge (information retrieval and automatic indexing) in French [
        <xref ref-type="bibr" rid="ref3 ref4 ref5 ref6 ref7 ref8">3-8</xref>
        ].
SIBM is a multidisciplinary team composed by physicians, medical informaticians,
computer scientists, R&amp;D engineers, librarians, postdoctoral and PhD students
(n=21). SIBM is part of the Computer Science, Information Processing, and Systems
Laboratory (LITIS-EA 4108), in Rouen, Normandy, France. Until recently, SIBM is
working on the evaluation of health information systems and information retrieval and
indexing in EHR [
        <xref ref-type="bibr" rid="ref10 ref9">9-10</xref>
        ]. In this context, a user-friendly tool and a web-based service
ECMT (Extracting Concepts with Multiple Terminologies) is developed. It has been
included in several projects subsidized by the French national research agency
[1112]. To evaluate the precision of ECMT, SIBM participated for the first time to the
CLEF eHealth evaluation initiative [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. The main motivation in participating is to
improve the functionalities of the tool. The clinical named entity recognition task is
retained [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. It aims to fully automatically identify clinically relevant entities in
medical texts in French. ECMT uses natural language processing (NLP), patterns and
exploit several biomedical knowledge organization systems (KOS).
      </p>
      <p>The rest of paper is organized as follows. In Section 2 we present related work, in
Section 3 we introduce our extraction approach and tool and we describe our
experimental setup. Section 4 reports on our results and on error analysis and reflections.
Finally, Section 5 wraps up concluding remarks and outlines future work.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        Information extraction is the extraction of pre-defined types of information from text
[
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. There are four primary methods available to implement an information
extraction system, including Natural Language Processing (NLP), pattern matching, rules,
and machine learning. The primary disadvantage of machine learning used for
information extraction is that it requires a labeled dataset for training [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. As most clinical
data are stored in free text, the primary means of performing information extraction is
natural language processing [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. Several NLP systems have shown promising results
in extracting information from medical narratives [
        <xref ref-type="bibr" rid="ref18 ref19 ref20 ref21">18-21</xref>
        ]. In [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ], Turchin et al. used
regular expressions (a meta-language which describes string search patterns), to
extract numeric data form free-text. The use of rules and pattern-matching exploits basic
patterns over a variety of structures, such as text strings, part-of-speech tags, semantic
pairs, and dictionary entries [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. Patterns are easily recognized by humans and can be
expressed directly using special purpose representation languages such as regular
expressions. Regular expressions are effective when the structure of the text and the
tokens are consistent, but tend to be one-off methods tailored to the extraction task.
Regular expressions have been used to extract blood pressure values from progress
notes [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]. NLP has been useful for extracting medical information such as principal
diagnosis [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] and medication use [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ] from clinical narratives.
      </p>
      <p>
        Using tools built over ontologies or controlled vocabularies such as the
Systematized NOmenclature of MEDicine-Clinical Terms (SNOMED-CT) or the International
Classification of Diseases-10 (ICD-10) have enabled researchers to automate the
capture of information in clinical narratives [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. Other tools have been developed. For
example Aronson et al. [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ] developed the Medical Text Indexer. It is based on
matching document terms with UMLS terms [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ] using MetaMap, comparing the
phrases of the document with the phrases of the concepts using the trigram method
and extracting MeSH terms from the k-nearest neighbors (kNN) of the document to
be indexed. The indexing method of Névéol et al. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] combines a linguistic method
and kNN. The EAGL method [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] combines the vector space model (VSM) and a
regular expression pattern matcher. BioAnnotator [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ] uses a parser to identify noun
phrases from a document and then matches them to UMLS concepts using a rule
engine. AMTEx (automatic MeSH term extraction) [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ] applies the C/NC value method,
which allows extraction of composed terms from the text combining statistic and
linguistic information and ranks the terms according to the value of C/NC. Only terms
belonging to MeSH terms are kept. Jonquet et al. [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ] applied the Mgrep tool for
extracting concepts using 200 biomedical ontologies and computed a score for each
generated annotation according to its origin (preferred term, non-preferred term,
synonym term…). BioDI [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ] reduces the limitation of partial matching through filtering
MeSH concepts, which are extracted using VSM. MaxMatcher+ [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ] exploits the
BM25 weight for ranking the concepts extracted using MaxMatcher [
        <xref ref-type="bibr" rid="ref33">33</xref>
        ], which
annotates documents with only the most significant words in the UMLS Metathesaurus.
      </p>
      <p>
        All these methods are based on the use of one ore several biomedical KOS which
link health concepts and gives their associated terms, as well as their definition and
code. Such a system may take the form of a terminology, thesaurus, controlled
vocabulary, nomenclature, classification, taxonomy, ontology …etc. Indeed, KOS
facilitates the indexing, coding and annotation of different kinds of documents. In the
health domain, a great number of bio-terminological resources have been developed
for different purposes (the content and structure depending on the purpose to be
served). This proliferation of resources has made finding the correct concept
increasingly complex when using multiple terminologies simultaneously. For example, the
ICD-10 was designed for coding medical reports, the MeSH Thesaurus, for document
indexing, the ATC Classification, for coding drugs, the SNOMED-CT, for semantic
interoperability among EHRs, and the MedDRA for adverse drug events. However,
few of these resources are available for languages other than English [
        <xref ref-type="bibr" rid="ref34">34</xref>
        ]. SIBM
developed and maintains a Health Terminologies and Ontologies Portal (HeTOP) [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ]
that contains 55 KOS in several languages. ECMT relies on the information system of
HeTOP.
3
3.1
      </p>
    </sec>
    <sec id="sec-3">
      <title>Material &amp;Method</title>
      <p>Extracting Concepts with Multiple terminologies : ECMT
ECMT is developed to extract as accurately as possible from texts as input, a list of
candidate health concepts from the 55 KOS included in HeTOP. The extraction is
performed at the phrase level of the text. A SOAP and REST Web services allow to
provide a response in XML for each concept and contains: the offset of the first and
the final word contained in the health concept, and which led to a medical concept in
the final list, the identifier and its semantic type if the health concept is included in the
UMLS Metathesaurus, and the medical specialty of the concept. The latter are based
on manual semantic links between general medical specialties (e.g. dermatology,
oncology …etc.) and the KOS included in HeTOP. ECMT relies on bag-of-words and
also pattern-matching designed for discharge summaries, procedure reports or
laboratory results which contains symbolic data (presence or absence), numerical data and
units of measurement. The method of bag-of-words was developed mainly for
information retrieval and it has been adapted for indexing i.e. only the largest set of words
that maps a concept label is extracted, even if is subsets map other concepts. The
method is considered as being more precise and avoiding noise. The text in input is
normalized and each phrase is processed separately to extract the concepts.</p>
      <p>ECMT has also a user-friendly interface (Fig. 1) accessible after authentication
(http://ecmt.chu-rouen.fr/). Several options are available to index the text:
• "c" : categorizing. If "c=true" the specialties of each extracted concept are given
as output and their UMLS semantic type (default value: "true").</p>
      <p>• "r" : refined. If "r=true" the search is stopped when a concept that matches a
maximum of words is extracted (default value: "true"). For example, for
"cardiopathie hypertensive", if "r=true" only the concept "hypertension artérielle" is returned;
if "r=false", the method returns "hypertension artérielle" and "maladie cardiaque"
(the latter is returned because "cardiopathie" is a synonym of the concept "maladie
cardiaque").</p>
      <p>
        • "sn": semantic network. If "sn=true" the concepts that are related directly
(aligned [
        <xref ref-type="bibr" rid="ref36">36</xref>
        ]) to the concepts of the text are also returned by ECMT; (default value:
"false").
      </p>
      <p>• "e": exclusions. It is a string containing the identifiers of concepts to exclude (a
specialty, a semantic type, a broader or a narrower term…etc.). For example,
"e=CIS_MT_8,UML_ST_T060, MSH_D_C,T_DESC_PHARMA_RACINE" returns
only concepts that are not "chirurgies" (CIS_MT_8) nor "procedures de diagnostic"
(UML_ST_T060) nor MeSH "Maladies" (MSH_D_C) nor "racines de spécialités
pharmaceutiques" (T_DESC_PHARMA_RACINE) (default value:"", all the
categories are returned, the user can filter them after the extraction). In the case of the use of
a father concept, all its descendants are excluded in the output.</p>
      <p>• "fi" : filters. It is a string containing the identifiers of concepts to keep in the
output (same as "e").</p>
      <p>• "a" : ancestors. If "a=true" ECMT returns also the ancestors of each concept
(default value: "false").</p>
      <p>• "d" : descendants. If "d=true" ECMT returns the descendants of each concept
(default value: "false").</p>
      <p>• "at" : alternative terms. If "at=true" the synonyms of the concepts are also
returned in the output (default value: "true").</p>
      <p>The answer of the web-based service is an XML file which serializes the output of
the annotation of the text. The following tags compose it:
• &lt;cis-sentences&gt; : the set of phrases that correspond to the input.
• &lt;timemillis&gt; : processing time in ms.
• &lt;cis-sentence&gt; : a phrase.
• &lt;idsentence&gt; : identifier of the phrase.
• &lt;position&gt; : beginning position of the phrase in the text.
• &lt;start&gt; : beginning position of indexing.
• &lt;end&gt; : end position of indexing.
• &lt;idterm&gt; : concept identifier in the original KOS.
• &lt;offset&gt; : set of the terms positions composing the concept.
• &lt;ter&gt; : acronym of the concept KOS.
• &lt;umlscui&gt; : UMLS concept identifier.
• &lt;matchterms&gt; : set of labels that allowed to retrieve the concept.
• &lt;cis:term&gt; : preferred label of the concept.
• &lt;cis:label&gt; : label.
• &lt;lang&gt; : label language.
• &lt;cis:altterms&gt; : list of alternative labels of the concept.
• &lt;cis:altterm&gt; : alternative label of the concept (synonym).
• &lt;cis:categorization&gt; : list of specialties or semantic types.
• &lt;cis:category&gt; : a specialty or a semantic type.
• &lt;cis:descendants&gt; : list of all descendants of the concept.
• &lt;cis:descendant&gt; : a descendant of the concept.
• &lt;cis:ancestors&gt; : list of all ancestors of the concept.
• &lt;cis:ancestor&gt; : an ancestor of the concept.
• &lt;cis:relateds&gt; : list of all concepts related semantically with the concept.
• &lt;cis:related&gt; : a concept related semantically with the concept.
• &lt;relationLabel&gt; : label of the relation.</p>
      <p>Fig.2. gives an an example of processing the phrase “La contraception par les
dispositifs intra utérins”. ECMT extracts the MeSH terms “dispositifs contracptifs” (CUI
C0009886), “dispositifs intra-utérins” (CUI C0021900) and the ATC term
“contraceptifs intra-utérins” (CUI C3653534). The user can also visualize the alternative
terms and categories (Fig.3).
The information retrieval system of HeTOP, and thus of ECMT, operates on more
than 55 terminologies in both French and English partially or totally translated into
French, aligned with semantic relations. However, for the latest version of ECMT
(v3), the relational database management system is replaced by the distributed cache
Infinispan to allow fast processing of the inputs (the example of Fig.2 is processed in
89 ms). The main objectives are the optimization of the response times and the
dissociation of the search engine from a proprietary RDBMS. The NoSQL solution
Infinispan allows data distribution and calling from several web-based servers. The version
with Hibernate search combined with Apache Lucene for full text indexing is
retained. This configuration allows ECMT the processing of 70,000 electronic health
records per day, using the 55 KOS.</p>
      <p>
        At the date of the challenge of the CLEF-eHealth task 1b [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], seven KOS were
migrated to Infinispan and were available for ECMT: the Medical Subject Headings,
the Anatomical Therapeutic Chemical classification, the Classification Commune des
Actes Médicaux, the Classification Internationale des Maladies - 10ème révision,
MedlinePlus, the Systematized Nomenclature of MEDicine International, and
Pharmacology. Table 1 contains their metrics. Each concept of these KOS, when it is available
in the UMLS, has a Concept Unique Identifier. It is the case for example for the
CIM10 and not for the CCAM.
The data set is the QUAERO French Medical Corpus, which has been developed as a
resource for named entity recognition and normalization in 2013 [
        <xref ref-type="bibr" rid="ref37">37</xref>
        ]. The data set
has been created by Névéol et al. in the wake of the 2013 CLEF-ER challenge, with
the purpose of creating a gold standard set of normalized entities for French
biomedical text. A selection of the MEDLINE titles and EMEA documents used in the 2013
CLEF-ER challenge were selected for human annotation and are used in this
challenge. Annotations are provided in the BRAT1 standoff format and the annotation
process was guided by concepts in the UMLS. Ten types of clinical entities which are
UMLS Semantic Groups were annotated: Anatomy, Chemical and Drugs, Devices,
Disorders, Geographic Areas, Living Beings, Objects, Phenomena, Physiology,
Procedures. The annotations were made in a comprehensive fashion, so that nested
entities were marked, and entities could be mapped to more than one UMLS concept.
      </p>
      <p>In particular: (i) If a mention can refer to more than one Semantic Group, all the
relevant Semantic Groups should be annotated. For instance, the mention “récidive”
(recurrence) in the phrase “prévention des récidives” (recurrence prevention) should
be annotated with the category “DISORDER” (CUI C2825055) and the category
“PHENOMENON” (CUI C0034897); (ii) If a mention can refer to more than one
UMLS concept within the same Semantic Group, all the relevant concepts should be
annotated. For instance, the mention “maniaques” (obsessive) in the phrase “patients
maniaques” (obsessive patients) should be annotated with CUIs C0564408 and
C0338831 (category “DISORDER”); (iii) Entities which span overlaps with that of
another entity should still be annotated. For instance, in the phrase “infarctus du
myocarde” (myocardial infarction), the mention “myocarde” (myocardium) should be
annotated with category “ANATOMY” (CUI C0027061) and the mention “infarctus
du myocarde” should be annotated with category “DISORDER” (CUI C0027051).
                                                                                                           
1 http://brat.nlplab.org/standoff.html</p>
    </sec>
    <sec id="sec-4">
      <title>Results &amp; Discussion</title>
      <p>For each run (MEDLINE and ELMA) the web-based service of ECMT is used.
Before submitting our runs, we have tested ECMT with the default options (described in
the section 3.1) and with the 7 available KOS for extracting entities and normalized
entities. For the concerns of the task and the evaluation, the ECMT output is
converted into the BRAT format. Fig.4. is the annotation file obtained and related to the
phrase of Fig.2. La contraception par les dispositifs intra utérins.</p>
      <p>
        The results obtained for the challenge (exact match precision, recall and F-score)
are presented in tables 3, 5, 7 and 9 below (MEDLINE and EMEA) and are reported
in [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. We also present inexact performance scores in tables 4, 6, 8 and 10.  
      </p>
      <p>TP
680</p>
      <p>FP
2297</p>
      <p>FN
4412</p>
      <p>TP
596</p>
      <p>FP
1990</p>
      <p>FN
3542</p>
      <p>
        The results obtained for the challenge are not satisfactory at all, specifically for the
EMEA corpus. The bad results obtained for the MEDLINE corpus should be
explained by the existing doubloons in the KOS (Tab.11) that decrease the precision,
and by the concepts extracted even if the KOS is not included in the UMLS, and thus
no CUI and no semantic group are available in the output, giving noise. Also, the bad
exact match results, compared to inexact match results, could be explained by slight
differences in terms used. The gold standard uses UMLS labels while ECMT outputs
preferred labels in the original KOS. This leads to minor differences between CLEF
and ECMT outputs, such as douleur in CLEF output vs. douleurs in ECMT output.
Finally, as no specific processing was done to extract overlapping entities as
described for the task [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], several nested entities are missed. For example, in Fig. 4.
only the concept “C0021900” is in common with the gold standard (Fig.5). Other
entities are extracted with ECMT but are not in the gold standard. As they are more
precise, these concepts should not be considered as noise.
      </p>
      <p>Tab.11. Total of terms (distinct) in French (preferred, concept labels, synonyms …etc) of the
KOS used in the task.</p>
      <p>ATC 11,322
CCAM 25,609</p>
      <p>CIM-10 107,790
MelinePlus 877</p>
      <p>MeSH 288,016</p>
      <p>Pharma 34,172</p>
      <p>SNOMeD-Int. 151,407</p>
      <p>The results obtained for the EMEA corpus are null (Tab.5, Tab.6, Tab.9, Tab.10).
These should be explained by the presence of specific characters in the text. Fig. 6
and Fig. 7 give an example the processing of an EMEA document excerpt: “Dans
quel cas Tysabri est-il utilisé ? Tysabri est utilisé dans le traitement des adultes
atteints de sclérose en plaques”, all the rest of the phrase after the character “?” is
ignored. Also, some characters such as “:” “µ” or newlines cause offsets to be shifted,
due to specific ECMT processes, leading to decreased exact match results, especially
in EMEA documents which contain many of those characters.</p>
      <p>After the submission of the runs of ECMT, the migrating process of the 55 KOS of
HeTOP into Infinispan was achieved. A set of 32 KOS in French are available
(Tab.12). We have tested all the training dataset (832 vs. 400 in the first test) by using
the initial 7 KOS used in the challenge and also the 32 ones.</p>
      <p>The obtained results are reported in tables 13, 14, 15 and 16 hereafter. The results
are not null but neither satisfactory. Including several KOS increases the precision
and decreases the recall in exact matching.
 </p>
      <p>TP
80
86
7
225
21
89
2
9
42
86
647</p>
      <p>FP
411
258
32
725
12
205
25
51
118
486
2323</p>
    </sec>
    <sec id="sec-5">
      <title>Perspectives for future work</title>
      <p>
        SIBM participated for the first time to an evaluation challenge. The clinical named
entity recognition task of the CLEF eHealth 2015 evaluation initiative [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] allowed us
to evaluate ECMT in a very specific context (indexing MEDLINE titles and EMEA
documents in French). ECMT is developed to index Electronic Health Records via a
web-based service and also via a user-friendly interface. The actual version of ECMT
(v3) is optimized to process around 70,000 EHR per day. ECMT was not trained with
the training sets of the challenge and it used the default options and the 7 (vs. 55
today) KOS. For this kind of challenge, clinical named entity recognition, it would be
more interesting, in our point of view, having a dataset clinical documents in French
instead MEDLINE titles or EMEA documents with special characters.
      </p>
      <p>The main conclusion of this work and the obtained results is that before running
the datasets we should have studied the training sets and identified for example the
specialized characters that are ignored by ECMT (mainly in the EMEA corpus). We
should have also identified the set of KOS that gives the best results. We should have
also tested the combinations of the options vs. the default values. For instance, for
managing overlapping entities, the value of “r” should be r=false to avoid the
recognition of only the concept that maps the largest bag-of-words. For normalized
entities, the value of the parameter “sn” should be sn=true to exploit all the existing
mappings until recognizing an UMLS concept that belongs to the semantic groups of
the task. We expect doing this tuning parameter in the near future. We project to
participate to other similar challenges but with a better training.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Jha</surname>
            <given-names>AK</given-names>
          </string-name>
          ,
          <string-name>
            <surname>DesRoches</surname>
            <given-names>CM</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kralovec</surname>
            <given-names>PD</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Joshi</surname>
            <given-names>MS</given-names>
          </string-name>
          .
          <article-title>A progress report on electronic health records in US hospital</article-title>
          .
          <source>Health affairs</source>
          <year>2010</year>
          ,
          <volume>29</volume>
          (
          <issue>10</issue>
          ):
          <fpage>1951</fpage>
          -
          <lpage>57</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Schuemie</surname>
            <given-names>MJ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sen</surname>
            <given-names>E</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jong</surname>
            <given-names>GW</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Van Soest</surname>
            <given-names>EM</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sturkenboom</surname>
            <given-names>MC</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kors</surname>
            <given-names>JA</given-names>
          </string-name>
          .
          <article-title>Automating classification of free-text electronic health records for epidemiological studies</article-title>
          .
          <source>Pharmacoepidemiology and drug safety</source>
          <year>2012</year>
          ,
          <volume>21</volume>
          (
          <issue>6</issue>
          ):
          <fpage>651</fpage>
          -
          <lpage>8</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Darmoni</surname>
            <given-names>SJ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thirion</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leroy</surname>
            <given-names>JP</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Douyère</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lacoste</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Godard</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rigolle</surname>
            <given-names>I</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brisou</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Videau</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goupy</surname>
            <given-names>E</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Piot</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Quéré</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ouazir</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Abdulrab</surname>
            <given-names>H.</given-names>
          </string-name>
          <article-title>A search tool based on 'encapsulated' MeSH thesaurus to retrieve quality health resources on the internet</article-title>
          .
          <source>Medical Informatics and the Internet in Medicine</source>
          <year>2001</year>
          ,
          <volume>26</volume>
          (
          <issue>3</issue>
          ):
          <fpage>165</fpage>
          -
          <lpage>178</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Soualmia</surname>
            <given-names>LF</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Darmoni</surname>
            <given-names>SJ</given-names>
          </string-name>
          .
          <article-title>Combining different standards and different approaches for health information retrieval in a quality-controlled gateway</article-title>
          .
          <source>International Journal of Medical Informatics</source>
          <year>2005</year>
          ,
          <volume>74</volume>
          (
          <issue>2-4</issue>
          ):
          <fpage>141</fpage>
          -
          <lpage>50</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Névéol</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rogozan</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Darmoni</surname>
            <given-names>SJ</given-names>
          </string-name>
          .
          <article-title>Automatic indexing of online health resources for a French quality controlled gateway</article-title>
          .
          <source>Information Processing &amp; Management</source>
          <year>2006</year>
          ,
          <volume>42</volume>
          (
          <issue>3</issue>
          ) :
          <fpage>695</fpage>
          -
          <lpage>709</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Soualmia</surname>
            <given-names>LF</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sakji</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Letord</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rollin</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Massari</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Darmoni</surname>
            <given-names>SJ</given-names>
          </string-name>
          .
          <article-title>Improving information retrieval with multiple health terminologies in a quality-controlled gateway</article-title>
          .
          <source>BMC Health Information Science and Systems</source>
          <year>2013</year>
          ,
          <volume>1</volume>
          :
          <fpage>8</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Griffon</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schuers</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Soualmia</surname>
            <given-names>LF</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grosjean</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kerdelhué</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kergoulay</surname>
            <given-names>I</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dahama</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Darmoni SJ</surname>
          </string-name>
          .
          <article-title>A Search Engine to Access PubMed Monolingual Subsets: Proof of Concept - Evaluation in French</article-title>
          .
          <source>Journal of Medical Internet Research</source>
          <year>2014</year>
          ,
          <volume>16</volume>
          (
          <issue>12</issue>
          ) :
          <fpage>e271</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Chebil</surname>
            <given-names>W</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Soualmia</surname>
            <given-names>LF</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Omri</surname>
            <given-names>MN</given-names>
          </string-name>
          , Darmoni,
          <string-name>
            <surname>SJ.</surname>
          </string-name>
          <article-title>Indexing biomedical documents with a possibilistic network</article-title>
          .
          <source>Journal of the Association for Information Science and Technology</source>
          <year>2015</year>
          , in press.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Cabot</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grosjean</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lelong</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lefebvre</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lecroq</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Soualmia</surname>
            <given-names>LF</given-names>
          </string-name>
          , Darmoni, SJ.
          <article-title>Omic Data Modelling for Information Retrieval</article-title>
          .
          <source>Proceedings of the 2nd International WorkConference on Bioinformatics and Biomedical Engineering</source>
          , IWBBIO,
          <year>2014</year>
          , pp.
          <fpage>415</fpage>
          -
          <lpage>424</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Lelong</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Merabti</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grosjean</surname>
            <given-names>J</given-names>
          </string-name>
          , et al.
          <article-title>Moteur de recherche sémantique au sein du dossier du patient informatisé : langage de requêtes spécifique</article-title>
          .
          <source>In proceeding of 15èmesJournées Francophones d'Informatique Médicale</source>
          ,
          <year>2014</year>
          , CEUR Workshop Proceedings Vol :
          <volume>1323</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Dupuch</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Segond</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bittar</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dini</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Soualmia</surname>
            <given-names>LF</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Darmoni</surname>
            <given-names>SJ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gicquel</surname>
            <given-names>Q</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Metzger</surname>
            <given-names>MH</given-names>
          </string-name>
          .
          <article-title>Separate the grain from the chaff: make the best use of language and knowledge technologies to model textual medical data extracted from electronic health records</article-title>
          .
          <source>In proceedings of the 6th Language &amp; Technology Conference</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Thiessard</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mougin</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Diallo</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jouhet</surname>
            <given-names>V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cossin</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garcelon</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Campillo</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jouini</surname>
            <given-names>W</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grosjean</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Massari</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Griffon</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dupuch</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tayalati</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dugas</surname>
            <given-names>E</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Balvet</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grabar</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pereira</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Frandji</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Darmoni</surname>
            <given-names>SJ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cuggia</surname>
            <given-names>M.</given-names>
          </string-name>
          <article-title>RAVEL: Retrieval And Visualization in ELectronic health records</article-title>
          .
          <source>In Studies in Health Technologies and Informatics</source>
          ,
          <year>2012</year>
          ,
          <volume>180</volume>
          :
          <fpage>194</fpage>
          -
          <lpage>8</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Goeuriot</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kelly</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suominen</surname>
            <given-names>H</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hanlen</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Névéol</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grouin</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Palotti</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zuccon</surname>
            <given-names>G</given-names>
          </string-name>
          .
          <article-title>Overview of the CLEF eHealth Evaluation Lab 2015</article-title>
          .
          <source>CLEF 2015 - 6th Conference and Labs of the Evaluation Forum, Lecture Notes in Computer Science (LNCS)</source>
          , Springer,
          <year>September 2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Névéol</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grouin</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tannier</surname>
            <given-names>X</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hamon</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kelly</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goeuriot</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zweigenbaum</surname>
            <given-names>P. CLEF</given-names>
          </string-name>
          <article-title>eHealth Evaluation Lab 2015 Task 1b: Clinical Named Entity Recognition</article-title>
          .
          <source>In CLEF 2015 Online Working Notes. CEUR-WS.</source>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15. DeJong
          <string-name>
            <surname>G.</surname>
          </string-name>
          <article-title>An overview of the FRUMP system</article-title>
          .
          <source>Strategies for natural language processing</source>
          .
          <year>1982</year>
          :
          <fpage>149</fpage>
          -
          <lpage>176</lpage>
          (
          <issue>Chapter 5</issue>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Zweigenbaum</surname>
            ,
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lavergne</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grabar</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hamon</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosset</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grouin</surname>
            <given-names>C.</given-names>
          </string-name>
          <article-title>Combining an expert-based medical entity recognizer to a machine-learning system: methods and a case study</article-title>
          .
          <source>Biomedical Informatics Insights</source>
          ,
          <year>2013</year>
          ,
          <volume>6</volume>
          (
          <issue>Suppl 1</issue>
          ):
          <fpage>51</fpage>
          -
          <lpage>62</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Hayes</surname>
            <given-names>PJ</given-names>
          </string-name>
          , Carbonell J.
          <source>Natural Language Understanding. Encyclopedia of Artificial Intelligence</source>
          <year>1987</year>
          :
          <fpage>660</fpage>
          -
          <lpage>677</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Tange</surname>
          </string-name>
          , H.J,
          <string-name>
            <surname>de</surname>
            <given-names>Hasman</given-names>
          </string-name>
          , PF,
          <string-name>
            <surname>Schouten</surname>
            <given-names>HC</given-names>
          </string-name>
          .
          <article-title>Medical narratives in electronic medical records</article-title>
          .
          <source>International Journal of Medical Informatics</source>
          ,
          <year>1997</year>
          ,
          <volume>46</volume>
          :
          <fpage>7</fpage>
          -
          <lpage>29</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Taira</surname>
            ,
            <given-names>R. K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Soderland</surname>
            <given-names>SG</given-names>
          </string-name>
          .
          <article-title>A statistical natural language processor for medical reports</article-title>
          .
          <source>Proceedings of the American Medical Informatics Association Symposium</source>
          ,
          <year>1999</year>
          :
          <fpage>970</fpage>
          -
          <lpage>4</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Zeng</surname>
          </string-name>
          ,
          <string-name>
            <surname>Qing</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goryachev</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weiss</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sordo</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Murphy</surname>
            <given-names>SN</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ross</surname>
            <given-names>L</given-names>
          </string-name>
          .
          <article-title>Extracting principal diagnosis, co-morbidity and smoking status for asthma research: evaluation of a natural language processing system</article-title>
          .
          <source>BMC Medical Informatics and Decision Making</source>
          ,
          <year>2006</year>
          6:
          <fpage>30</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Voorham</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Denig</surname>
            <given-names>P</given-names>
          </string-name>
          .
          <article-title>Computerized extraction of information on the quality of diabetes care from free text in electronic patient records of general practitioners</article-title>
          .
          <source>Journal of the American Medical Informatics Association</source>
          ,
          <year>2007</year>
          ,
          <volume>14</volume>
          (
          <issue>3</issue>
          ):
          <fpage>349</fpage>
          -
          <lpage>54</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Turchin</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kolatkar</surname>
            <given-names>NS</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grant</surname>
            <given-names>RW</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Makhni</surname>
            <given-names>ML</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pendergrass</surname>
            <given-names>EC</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Einbinder</surname>
            <given-names>JS</given-names>
          </string-name>
          .
          <article-title>Using regular expressions to abstract blood pressure and treatment intensification information from the text of physician notes</article-title>
          .
          <source>Journal of the American Medical Informatics Association</source>
          ,
          <year>2006</year>
          ,
          <volume>13</volume>
          :
          <fpage>691</fpage>
          -
          <lpage>5</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Pakhomov</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Buntrock</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Duffy</surname>
            <given-names>P</given-names>
          </string-name>
          .
          <article-title>High throughput modularized NLP system for clinical text</article-title>
          .
          <source>In proceedings of the Association for Computational Linguistics</source>
          <year>2005</year>
          ,
          <fpage>25</fpage>
          -
          <lpage>8</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Xu</surname>
            <given-names>H</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stenner</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Doan</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Johnson</surname>
            <given-names>KB</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Waitman</surname>
            <given-names>LR</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Denny</surname>
            <given-names>JC.</given-names>
          </string-name>
          <article-title>MedEx: a medication information extraction system for clinical narratives</article-title>
          .
          <source>Journal of the American Medical Informatics Association</source>
          <year>2010</year>
          ,
          <volume>17</volume>
          :
          <fpage>19</fpage>
          -
          <lpage>24</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Aronson</surname>
            <given-names>AR</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mork</surname>
            <given-names>JG</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gay</surname>
            <given-names>CW</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Humphrey</surname>
            <given-names>SM</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rogers</surname>
            <given-names>WJ</given-names>
          </string-name>
          .
          <article-title>The NLM indexing initiative's medical text indexer</article-title>
          .
          <source>Medical Health Informatics</source>
          ,
          <year>2004</year>
          ,
          <volume>11</volume>
          (
          <issue>1</issue>
          ):
          <fpage>268</fpage>
          -
          <lpage>272</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Bodenreider O. The Unified Medical Language</surname>
          </string-name>
          <article-title>System (UMLS): Integrating biomedical terminology</article-title>
          .
          <source>Nucleic Acids Research</source>
          <year>2004</year>
          ,
          <volume>32</volume>
          (
          <issue>4</issue>
          ):
          <fpage>267</fpage>
          -
          <lpage>270</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Ruch</surname>
            <given-names>P</given-names>
          </string-name>
          .
          <article-title>Automatic assignment of biomedical categories: Toward a generic approach</article-title>
          .
          <source>Bioinformatics</source>
          <year>2006</year>
          ,
          <volume>22</volume>
          (
          <issue>6</issue>
          ):
          <fpage>658</fpage>
          -
          <lpage>664</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Mukherjea</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Subramaniam</surname>
            <given-names>SV</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chanda</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sankararaman</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kothari</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Batra</surname>
            <given-names>V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bhardwaj</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Srivastava</surname>
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Enhancing</surname>
          </string-name>
          <article-title>a biomedical information extraction system with dictionary mining and context disambiguation</article-title>
          .
          <source>IBM Journal of Research and Development</source>
          <year>2004</year>
          ,
          <volume>48</volume>
          (
          <issue>5-6</issue>
          ):
          <fpage>693</fpage>
          -
          <lpage>701</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Hliaoutakis</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zervanou</surname>
            <given-names>K</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Petrakis</surname>
            <given-names>EGM</given-names>
          </string-name>
          .
          <article-title>The AMTEx approach in the medical document indexing and retrieval application</article-title>
          .
          <source>Data and Knowledge Engigneering</source>
          <year>2009</year>
          ,
          <volume>68</volume>
          (
          <issue>3</issue>
          ):
          <fpage>380</fpage>
          -
          <lpage>392</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>Jonquet</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lependu</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Falconer</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Coulet</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Noy</surname>
            <given-names>NF</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Musen</surname>
            <given-names>MF</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shah</surname>
            <given-names>NH</given-names>
          </string-name>
          .
          <article-title>NCBO resource index: Ontology-based search and mining of biomedical resources</article-title>
          .
          <source>Journal of Web Semantics</source>
          <year>2011</year>
          ,
          <volume>9</volume>
          (
          <issue>3</issue>
          ):
          <fpage>316</fpage>
          -
          <lpage>324</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31.
          <string-name>
            <surname>Chebil</surname>
            <given-names>W</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Soualmia</surname>
            <given-names>LF</given-names>
          </string-name>
          , Darmoni,
          <string-name>
            <surname>SJ.</surname>
          </string-name>
          <article-title>BioDI: a new approach to improve biomedical documents indexing</article-title>
          .
          <source>Proceedings of the 24th International Conference on Database and Expert Systems Applications</source>
          <year>2013</year>
          :
          <fpage>78</fpage>
          -
          <lpage>87</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          32.
          <string-name>
            <surname>Dinh</surname>
            <given-names>D</given-names>
          </string-name>
          , Tamine L.
          <article-title>Towards a context sensitive approach to searching information based on domain specific knowledge sources</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          <year>2012</year>
          ,
          <fpage>12</fpage>
          -
          <lpage>13</lpage>
          :
          <fpage>41</fpage>
          -
          <lpage>52</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          33.
          <string-name>
            <surname>Zhou</surname>
            <given-names>X</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            <given-names>X</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hu</surname>
            <given-names>X.</given-names>
          </string-name>
          <article-title>MaxMatcher: Biological concept extraction using approximate dictionary lookup</article-title>
          .
          <source>In Pacific Rim International Conferences on Artificial Intelligence</source>
          <year>2006</year>
          :
          <fpage>145</fpage>
          -
          <lpage>149</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          34.
          <string-name>
            <surname>Névéol</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grosjean</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Darmoni</surname>
            <given-names>SJ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zweigenbaum</surname>
            <given-names>P</given-names>
          </string-name>
          .
          <article-title>Language Resources for French in the Biomedical Domain</article-title>
          .
          <source>Language and Resource Evaluation Conference</source>
          ,
          <year>2014</year>
          :
          <fpage>2146</fpage>
          -
          <lpage>2151</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          35.
          <string-name>
            <surname>Grosjean</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Merabti</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dahamna</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kergourlay</surname>
            <given-names>I</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thirion</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Soualmia</surname>
            <given-names>LF</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Darmoni</surname>
            <given-names>SJ</given-names>
          </string-name>
          .
          <article-title>Health Multi-Terminology Portal: a semantics added-value for patient safety</article-title>
          .
          <source>Studies in Health Technology and Informatics</source>
          <year>2011</year>
          , Vol.
          <volume>166</volume>
          :
          <fpage>129</fpage>
          -
          <lpage>138</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          36.
          <string-name>
            <surname>Merabti</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Soualmia</surname>
            <given-names>LF</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grosjean</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Joubert</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Darmoni</surname>
            <given-names>SJ</given-names>
          </string-name>
          .
          <article-title>Aligning Biomedical Terminologies in French: Towards Semantic Interoperability in Medical Applications</article-title>
          . Chapter in Medical Informatics,
          <year>2012</year>
          :
          <fpage>41</fpage>
          -
          <lpage>68</lpage>
          . InTech Publishing.
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          37.
          <string-name>
            <surname>Névéol</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grouin</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leixa</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosset</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zweigenbaum P. The QUAERO French Medical</surname>
          </string-name>
          <article-title>Corpus: A Ressource for Medical Entity Recognition and Normalization</article-title>
          .
          <source>Fourth Workshop on Building and Evaluating Ressources for Health and Biomedical Text Processing - BioTxtM</source>
          <year>2014</year>
          :
          <fpage>24</fpage>
          -
          <lpage>30</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>