<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Multi-Relation Modeling on Multi Concept Extraction LIG participation at ImageClefMed</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Lo¨ıc Maisonnasse</string-name>
          <email>loic.maisonnasse@imag.fr</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Eric Gaussier</string-name>
          <email>eric.gaussier@imag.fr</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jean-Pierre Chevallet</string-name>
          <email>jean-pierre.chevallet@imag.fr</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>General Terms</string-name>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Algorithms, Theory</string-name>
        </contrib>
      </contrib-group>
      <abstract>
        <p>This paper presents the LIG contribution to the CLEF 2008 medical retrieval task (i.e. ImageCLEFmed). The main idea behind our contribution is to incorporate knowledge in the language modeling approach to information retrieval (IR). On ImageCLEFmed our model makes use of the textual part of the corpus and of the medical knowledge found in the Unified Medical Language System (UMLS) knowledge sources. Last year, we used UMLS to create a conceptual representation for each sentence in the corpus, and proposed a language modeling approach on these representations. The use of a conceptual representation allows the system to work at a more abstract semantic level, which solves some of the information retrieval problems, as the one of terminological variation. We also used different concept extraction methods, and tested how to combine these extraction methods on queries. This year, we have extended our previous method in two ways: first, we have used, in addition to relations derived from UMLS, co-occurrence relations; second, we have combined concept extraction methods not only on queries, but also on documents. In this paper, we first detail some IR approaches that use advanced index terms. We then develop the graph model used in our submission to ImageCLEFmed 2008, and the different ways use to combine graphs derived from different concept extraction methods. After this, we present our results on this year collection, showing that combined concept extraction on document improves the MAP results and that relations impact more first results precision. Finally, we conclude this work and present some possible extensions.</p>
      </abstract>
      <kwd-group>
        <kwd>H</kwd>
        <kwd>3 [Information Storage and Retrieval]</kwd>
        <kwd>H</kwd>
        <kwd>3</kwd>
        <kwd>1 Content Analysis and Indexing</kwd>
        <kwd>H</kwd>
        <kwd>3</kwd>
        <kwd>3 Information Search and Retrieval</kwd>
        <kwd>H</kwd>
        <kwd>3</kwd>
        <kwd>4 Systems and Software</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Best performing methods from ImageCLEFmed 2007 used advanced indexing schemes, such as
conceptual or graph index (see [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]) for representing queries and documents. Such indexing schemes
allow one to better capture the content of queries and documents. They also allow matching
documents and queries at an abstract semantic level. However, such representations are sometimes
hard to detect from texts and may contain errors that can lower IR results. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] proposed a
graph language modeling approach that consider terms or concepts with labeled relations between
them. This model takes into account semantic relations provided by an external resource, but not
relations that express lexical links between terms. We extend here the model of [?] by integrating
co-occurrence relations between terms or concepts.
      </p>
      <p>If this model allows one to take in account advanced representations in an efficient IR model,
it does not completely solve the problems associated with the difficulty of detecting such
representations in text. We address here some of these problems by combining different concept extraction
methods, on both queries and documents.</p>
      <p>This paper first presents a short overview on the use of advanced representations in IR. A second
section details the graph model used for our contribution and the methods used to combine these
representations on both queries and documents. We then describe the graph extraction process
used for documents and queries, and finally we present the different results obtained on the CLEF
2008 medical retrieval task.
2</p>
    </sec>
    <sec id="sec-2">
      <title>State of the Art</title>
      <p>
        Using semantic resources for indexing has shown promising results on domain-specific colelctions.
For example, in previous ImageCLEFmed editions, conceptual indexing based on UMLS provided
some of the best systems on text, significantly outperforming standard, keyword indexing (cf.
[
        <xref ref-type="bibr" rid="ref2 ref4">4, 2</xref>
        ]). Similar results have also been obtained on TREC genomics, where[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] uses the Mesh and
Entrez databases to select terms related to concepts from medical publications.
      </p>
      <p>
        Several works went beyond the use of mere concepts by exploiting relations between them.
Some are based on the standard space vector model, as [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] who evaluates the usefulness of UMLS
concepts and semantic relations in medical IR, while others have tried to use more advanced
models, as the language model of [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], to integrate dependencies between index terms in IR. Along
this last line, [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] have proposed extensions of te language modeling approach that can deal
with dependencies, syntactic ones in the case of [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], either syntactic or semantic in the case of [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        The model of [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] relies on a variable L, defined as a ”linkage” over query terms, which is
generated from a document according to P (L|Md), where Md represents a document model. The
query is then generated according to P (Q|L, Md). In principle, the probability of the query,
P (Q|Md), is to be calculated over all linkages Ls, but, for efficiency reasons, the authors make the
standard assumption that these linkages are dominated by a single one, the most probable one:
L = argmaxLP (L|Q). The probability P (Q|M d) is then formulated as:
      </p>
      <p>
        P (Q|Md) = P (L|Md) P (Q|L, Md)
(1)
In the case of a dependency parser, as the one used in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], each term has exactly one governor
in each linkage L. Then the above quantity can be decomposed, leading to a new one with three
terms. This decomposition restricts the use of this model to dependency structure. Furthermore,
[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] shows that this decomposition is not completely satisfactory from a theoretical point of view.
      </p>
      <p>
        The second approach [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] proposes a graph modeling approach where query and documents
are represented as graphs G = (C, E), where C represents the node set of the graph and E the
relation set, that they assumed labeled. The relation E is define by an application that indicates
the labels associated to such relation. The probability that the graph of query Gq is generated by
the model of document Md is then decomposed as:
      </p>
      <p>P (Gq|Md) = P (C|Md) P (E|C, Md)
(2)</p>
      <p>Where P (C|Md) corresponds to the nodes contribution and P (E|c, Md) the edges contribution.
This second approach is well founded theoretically and can handle different types of graphs.</p>
    </sec>
    <sec id="sec-3">
      <title>Graph Model</title>
      <p>
        We improve the graph model proposed in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] in which each relation is labelled with one or more
labels. The next sections shows the different improvements of this model.
3.1
      </p>
      <sec id="sec-3-1">
        <title>Node Contribution</title>
        <p>Assuming that, conditioned on Md, query concepts are independent of one another (a standard
assumption in the language model) the node contribution can be decomposed in two different
ways:</p>
        <p>P (C|Md) =</p>
        <p>Qci∈C PU (ci|Md)</p>
        <p>Qci∈C λtPU (ci|Md) + (1 − λt)Ptr (ci|Md)
where P (ci|Md) is the probability of a concept from the query and Ptr (ci|Md) is a translation
model.</p>
        <p>The first method correspond to the standard language model, and is based on the computation
of PU (ci|Md). The second one correspond to the usual way to incorporate lexical associations in
the language modeling. This method is based on the combination of a standard language model
with a translation model, and allows to take in account lexical relations. In both cases, the
quantity P (ci|Md) of equations 3 is computed through a simple Jelinek-Mercer smoothing:
PU (ci|Md) = (1 − λu) NNdd((c∗i)) + λu ND(∗)</p>
        <p>ND(ci)
where Nd(ci) (respectively ND(ci)) is the number of times that ci appears in the document
d (respectively in the collection), and Nd(∗) (respectively ND(∗)) the number of concepts in
document d (the collection).</p>
        <p>The translation model is computed as:</p>
        <p>Ptr (ci|Md) =</p>
        <p>X P (ci|ct) PU (ct|Md)
ct∈Rl
where Rl is the set of concepts lexically related to ci and P (ci|ct) the probability for a concept
ct to be translated by the query concept ci. The contribution PU (ci|Md) still corresponds to
a standard unigram language model but applied to the translated concept, with a smoothing
parameter different from the one for PU . We will refer to it as λ′u.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Relation Contribution</title>
        <p>We assume that E is an application from C × C in P (L)1 that associates to each relation a set of
labels. Thus the edge contribution can be decomposed as:</p>
        <p>Y
i,j∈C,i≤j
P (E|C, Md) =</p>
        <p>P (E(ci, cj) = L|ci, cj, Md)
(6)
where E(ci, cj) = L indicates that a relation exists between ci and cj and that this relation is
associated to the label set L.</p>
        <p>We furthermore decomposed this probability as:</p>
        <p>P (E(ci, cj) = Lij |ci, cj, Md) =</p>
        <p>P (e(ci, cj) = label|ci, cj, Md)</p>
        <p>Y
label∈Lij
where e(qi, qj) = label indicates that there is a relation between qi and qj, the label set of which
contains label.</p>
        <p>1L is the set of all possible labels for a relationship and P(L) is the set of sets of L.
(3)
(4)
(5)</p>
        <p>An edge probability is thus equal to the product of the corresponding single-label relations.
Following standard practice in language modeling, one can furthermore “smooth” this estimate
by adding a contribution from the collection. This results in:</p>
        <p>P (e(ci, cj) =, label)|ci, cj, Md) = (1 − λe)</p>
        <sec id="sec-3-2-1">
          <title>D(ci, cj, label) D(ci, cj) + λe</title>
        </sec>
        <sec id="sec-3-2-2">
          <title>C(ci, cj, label) C(ci, cj)</title>
          <p>(7)
where D(ci, cj, label) (C(ci, cj, label)) is the number of times ci and cj are linked with a relation
labeled label in the document (collection). D(ci, cj) (C(ci, cj)) is the number of times ci and cj
are observed together in the document.
3.3</p>
        </sec>
      </sec>
      <sec id="sec-3-3">
        <title>Model combinaison</title>
        <p>We present here the methods used to combine different graphs (i.e. different dependency structures
obtained from different analyses of the queries and/or documents) in the model presented above.
First, we group the different analysis of a query. To do so, we assume that a query is represented
by a set of graphs Q = Gq; and that the probability of a set of graphs assuming a document graph
model is computed by the product of the probability of each query graph:</p>
        <p>P (Q = {Gq} |Mg) = Y P (Gq|Md)</p>
        <p>Gq</p>
        <p>This model considers that a relevant document model must generate all the posible analyses
of a query Q. The best probabilities will be obtained for a document model which can generate
all analyses of the query with high probability.</p>
        <p>Second, we group the different analysis of a document. To do so, we assume that a query can be
generated by different models of the same document Md∗ (i.e. a set of models). As a result of this
generation process, we keep the higher probability among the different models of the document:
(8)
(9)
P (C|Md∗) = argmaxMd∈Md∗</p>
        <p>Y P (ci|Md)
ci∈C
!</p>
        <p>With this method, documents are ranked, for a given query, according to their best model.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Graph Extractions</title>
      <p>UMLS is a good candidate as a knowledge source for medical text indexing. It is more than a
terminology because it describes terms with associated concepts. This knowledge is large (more
than 1 million concepts, 5.5 million of terms in 17 languages). UMLS is not an ontology, as there
is no formal description of concepts, but its large set of terms and their variants specific to the
medical domain, enables full scale conceptual indexing. In UMLS, all concepts are assigned to
at least one semantic type from the Semantic Network. This provides consistent categorization
of all concepts in the meta-thesaurus at the relatively general level represented in the Semantic
Network. The Semantic Network also contains relations between concepts, which allows one to
derive relations between concepts in documents (and queries).</p>
      <p>From this information, graphs are produced in two steps: concept detection and then relation
detection.
4.1</p>
      <sec id="sec-4-1">
        <title>Concepts Detection</title>
        <p>The detection of concepts in a document from a thesaurus is a relatively well established process.
It consists of four major steps:
1. Morpho-syntactic Analysis (POS tagging) of document with a lemmatization of inflected
word forms;
2. Filtering empty words on the basis of their grammatical class;
3. Detection in the document of words or phrases appearing in the meta-thesaurus;
4. Possible filtering of concepts identified.</p>
        <p>
          For the first step, various tools can be used depending on the language. We used MiniPar(cf. [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ])
and TreeTagger2.
        </p>
        <p>
          Once the documents are analyzed, the second and third steps are implemented directly, first
by filtering grammatical words (prepositions, determinants, pronouns, conjunctions), and then by
a look-up of word sequences in UMLS. This last step will find all alternatives, present in UMLS,
of a concept. One can certainly improve this simple lookup by identifying potential terminological
variants (see for example [?]). We have not used such a refinement here and merely rely on a
simlpe look-up. It should be noted that we have not used all of UMLS for the third step: the
thesauri NCI and PDQ were not taken into account as they are related to areas different from
the one covered by the collection3. Such a restriction is also used in [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. The fourth step of
the indexing process is to eliminate a number of errors generated by the above steps. However,
the work presented in [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] shows that it is preferable to retain a greater number of concepts for
information retrieval. We thus did not use any filtering here.
        </p>
        <p>We finally obtain two variations of concept detection:
• (MP) uses our term mapping tools with MiniPar.</p>
        <p>• (TT) uses our term mapping tools with TreeTagger.
4.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Relations Detection</title>
        <p>After concept detection, we add conceptual relations between concepts. The relations used are
those defined in the Semantic Network. We made the hypothesis that a relation exists in a
document if two concepts are detected in the same sentence and if a relation between these
concepts is defined in the Semantic Network. For finding relations, we first tag concept with their
semantic type and then add semantic relations that link concepts with corresponding tags. A
sample result of the relation extraction process for a sentence can be viewed on figure 4.2. We do
not make any further disambiguisation on relations. Finally, for each concept extraction method,
we obtain one graph for each document and for each query.
4.3</p>
      </sec>
      <sec id="sec-4-3">
        <title>Coocurence Extractions</title>
        <p>We want here to extract lexical links from the collection. We made the standard assumption that
similar concepts occur in the same context (i.e. they co-occur with the same concepts). Based
2www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/
3This is justified here by the fact that these thesauri focus on specific issues of cancer while the collection is
considered more general and covers all diseases.
on this assumption, a standard method consists in building a context vector for each concept of
the collection, and to compute a similarity between concepts using context vectors. In this work,
we assume that a concept is in the context of another if these two concepts appear in the same
sentence. Thus we compute a context vector for each concept of the collection based on a mutual
information score. The weigth of the dimension cj from the context vector of ci is computed as:
M I (ci, cj) = log(</p>
        <p>P (ci, cj)
P (ci) ∗ P (cj)</p>
        <p>)
=</p>
        <p>N ph(ci, cj) ∗ N ph</p>
        <p>N ph(ci) ∗ N ph(cj)
where N ph(ci, cj) is the number of times that the two concepts ci and cj appear in the same
sentence, N ph(ci) is the number of times that ci appear in a sentence, and N ph is the number
of sentences in the collection. For efficiency and based on experimental results, we only keep the
200 highest dimensions in the context vector. We then calculate the similarity between concepts
throug the cosine of their context vectors. We consider a concept ci related to another concept
cj if ci is in the 200 nearest neighbors (as defined by the cosine similarity) of cj. This method
provides a first set of concepts Rl.</p>
        <p>We used the concepts in Rl to compute the translation probabilities, by dividing the cosine of
the concept by the sum of the cosine of all the retained concepts:</p>
        <p>P (ci|ct) =</p>
        <p>cos(Vctxt(ci), Vctxt(ct))</p>
        <p>Pcj∈Rl cos(Vctxt(ci), Vctxt(cj))
where Vctxt(c) is a context vector built with mutual information, Rl is the list of the N selected
concepts and cos is the cosine between vectors.
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Evaluation</title>
      <p>
        We show here the results obtain for this methods on the corpus CLEFmed 2007 [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and on the
test one the CLEFmed 2008 corpus.
5.1
      </p>
      <sec id="sec-5-1">
        <title>Model Variations</title>
        <p>This year the track ImageCLEFmed is based on a new collection. On this collection, we submit
10 runs these runs explore differents variation of our relational model and the differents analysis
merging methods. Last year results show that merging queries improves the results. As
consequence, this year we do not test query graphs combination and we allways use the two graphs
detected on a query.</p>
        <p>We test 4 model variations :
• (UNI) that only use node contribution (as define in ??).
• (RET) that use the node contribution and a relation contribution.
• (COS) that use the node contribution with translation.
• (RC) that use the node contribution with translation and a relation contribution.</p>
        <p>For each model, we test it on the analysed collection obtain with MiniPar (MP) and on the
collection analysed by MiniPar and TreeTagger (MPTT) using the combination methods proposed
in this paper.</p>
        <p>We also submit two other run, one that use a unigramme model with an extended image
description that integrates the text that corresponds to the paragraph where the image is referred.
A second one use the COS model but use coocurence computed on the previous ImagesCLEFmed
collections.
(10)
(11)
(12)
From each method we use the bests parameters obtained on ImageCLEFmed corpus for MAP and
we use these parameters on the new collection. We compare the variation between the results on
the two corpus for the MAP and the P5D.</p>
        <p>The best results obtained on the new medical collection are those of the unigramme model with
a collection analysis by MiniPar and TreeTagger. On 2008 collection, integrating relations only
improves the results when lexical relations are used on the collection analized by MiniPar. In the
others cases no improvement are obtained with relations and thus combination of the two types of
relation did not improved the results. On the P@5 the use of relations improves the results even
more if the two analysis are used.</p>
        <p>The results of our two other runs show that using coocurence computed on the past collection
gives better results than the coocurences learned on the 2008 collection. This run gives us our
best MAP result (0.2791). The other run that uses part of the article, provides surprising low
results (0.1908). This can be due to the fact that the text added is considered as equivalent to the
image caption, but it can be less precise or less image related. Thus we think taht this approach
could provide good results if we adapt our model to take in account this new text.
6</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>We proposed here a framework for using semantic resources in the medical domain. We describe
a method for using relations in language modeling, and for merging different document or query
versions in this framework. Results show that relation are useful to maintain good results on the
first retrieved documents, when mixing different detection trends to improve the recall. This paper
shows the robustness of our method on a new corpus, where they provide good results.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Jianfeng</given-names>
            <surname>Gao</surname>
          </string-name>
          ,
          <string-name>
            <surname>Jian-Yun</surname>
            <given-names>Nie</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guangyuan Wu</surname>
            , and
            <given-names>Guihong</given-names>
          </string-name>
          <string-name>
            <surname>Cao</surname>
          </string-name>
          .
          <article-title>Dependence language model for information retrieval</article-title>
          .
          <source>In Research and Development in Information Retrieval</source>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Caroline</given-names>
            <surname>Lacoste</surname>
          </string-name>
          ,
          <string-name>
            <surname>Jean-Pierre</surname>
            <given-names>Chevallet</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Joo-Hwee</surname>
            <given-names>Lim</given-names>
          </string-name>
          , Xiong Wei, Daniel Raccoceanu, Diem Le Thi Hoang, Roxana Teodorescu, and
          <string-name>
            <given-names>Nicolas</given-names>
            <surname>Vuillenemot</surname>
          </string-name>
          .
          <article-title>Ipal knowledge-based medical image retrieval in imageclefmed 2006</article-title>
          .
          <source>In Working Notes for the CLEF 2006 Workshop</source>
          ,
          <fpage>20</fpage>
          -
          <lpage>22</lpage>
          September, Alicante, Spain,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>D.</given-names>
            <surname>Lin</surname>
          </string-name>
          .
          <article-title>Dependency-based evaluation of MiniPar</article-title>
          .
          <source>In Workshop on the Evaluation of Parsing Systems</source>
          , Granada, Spain, May. ACM,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Eric</given-names>
            <surname>Gaussier</surname>
          </string-name>
          <article-title>Loc Maisonnasse and Jean Pierre Chevallet</article-title>
          .
          <article-title>Multiplying concept sources for graph modeling</article-title>
          .
          <source>In CLEF 2007, LNCS 5152 proceedings</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Y. Huang HJ</surname>
          </string-name>
          .
          <article-title>Lowe and WR</article-title>
          .
          <string-name>
            <surname>Hersh</surname>
          </string-name>
          .
          <article-title>A pilot study of contextual UMLS indexing to improve the precision of concept-based representation in XML-structured clinical radiology reports</article-title>
          .
          <source>In Conference of the American Medical Informatics Association</source>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Loic</given-names>
            <surname>Maisonnasse</surname>
          </string-name>
          , Eric Gaussier, and
          <string-name>
            <surname>Jean-Pierre Chevallet</surname>
          </string-name>
          .
          <article-title>Revisiting the dependence language model for information retrieval</article-title>
          .
          <source>In Research and Development in Information Retrieval</source>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Henning</given-names>
            <surname>Mu</surname>
          </string-name>
          ¨ller, Thomas Deselaers, Eugene Kim, Jayashree Kalpathy-Cramer,
          <string-name>
            <given-names>Thomas M.</given-names>
            <surname>Deserno</surname>
          </string-name>
          , Paul Clough, and
          <string-name>
            <given-names>William</given-names>
            <surname>Hersh</surname>
          </string-name>
          .
          <article-title>Overview of the ImageCLEFmed 2007 medical retrieval and annotation tasks</article-title>
          .
          <source>In Working Notes of the 2007 CLEF Workshop</source>
          , Budapest, Hungary,
          <year>September 2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Ponte</surname>
          </string-name>
          and
          <string-name>
            <given-names>W. B.</given-names>
            <surname>Croft</surname>
          </string-name>
          .
          <article-title>A language modeling approach to information retrieval</article-title>
          .
          <source>In Research and Development in Information Retrieval</source>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Said</given-names>
            <surname>Radhouani</surname>
          </string-name>
          , Loic Maisonnasse,
          <string-name>
            <surname>Joo-Hwee</surname>
            <given-names>Lim</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thi-Hoang-Diem Le</surname>
          </string-name>
          , and
          <string-name>
            <surname>Jean-Pierre Chevallet</surname>
          </string-name>
          .
          <article-title>Une indexation conceptuelle pour un filtrage par dimensions, experimentation sur la base medicale imageclefmed avec le meta thesaurus umls</article-title>
          . In COnference en Recherche Information et Applications CORIA'
          <year>2006</year>
          , pages
          <fpage>257</fpage>
          -
          <lpage>271</lpage>
          , mars
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>VolkSemantic M. Vintar</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Buitelaar</surname>
            <given-names>P.</given-names>
          </string-name>
          <article-title>Relations in concept-based cross-language medical information retrieval</article-title>
          .
          <source>In Proceedings of the ECML/PKDD Workshop on Adaptive Text Extraction and Mining (ATEM)</source>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>Neil</given-names>
            <surname>Smalheiser Vetle Torvik Jie Hong Wei Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Clement</given-names>
            <surname>Yu</surname>
          </string-name>
          .
          <article-title>Knowledge-intensive conceptual retrieval and passage extraction of biomedical literature</article-title>
          .
          <source>In Research and Development in Information Retrieval</source>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>