<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Text-mess in the Medical Retrieval ImageCLEF08 Task</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>S. Navarro</string-name>
          <email>snavarro1@dlsi.ua.es</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>M.C. D´ıaz</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>R. Mun˜oz</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>M.A. Garc´ıa</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>F. Llopis</string-name>
          <email>llopis5@dlsi.ua.es</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>M.T. Mart´ın</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>L.A. Uren˜a</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>A. Montejo</string-name>
        </contrib>
      </contrib-group>
      <abstract>
        <p>This paper describes our participation in the Medical Retrieval task at ImageCLEF 2008. We present the joint work of two teams belonging to the TEXT-MESS project using a new system that combines the 2 individual systems of these teams. The aim of the experiments performed is to figure out if there are techniques used in one of the two systems which can complement the other system in order to improve their performance. The best results obtained in the training phase and in the competition has been reached with a configuration which uses the IR-n system with a negative query expansion based on the acquisition type of the image mixed with the SINAI system with a MeSH based query expansion. We have obtained a MAP of 0.2777 for our best run, obtaining the 5th place in the ranking of textual participant runs submitted, and the 6th place in the global classification.</p>
      </abstract>
      <kwd-group>
        <kwd>{mcdiaz2</kwd>
        <kwd>magc4</kwd>
        <kwd>maite6</kwd>
        <kwd>laurena7</kwd>
        <kwd>amontejo8}@ujaen</kwd>
        <kwd>es</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>This paper describes our participation in the Medical Retrieval task at ImageCLEF 2008. We
present the joint work of two teams belonging to the TEXT-MESS project using a new system
that combines the 2 individual systems of these teams. The aim of the experiments performed is
to figure out if there are techniques used in one of the two systems which can complement the
other system in order to improve their performance.</p>
      <p>Our experiments has been focused on to use a reranking method combining different
configurations of the individual systems. Moreover, we have made experiments with two different reranking
methods - the standard one and TF-IDF multimodal reranking - in order to perform a multimodal
participation with our joint systems.</p>
      <p>This paper is structured as follows: Firstly, it presents the main characteristics of the SINAI
and IR-n system, then it moves on to explain the experiments we have made to evaluate the join
system, and finally it describes the results and conclusions.
2</p>
    </sec>
    <sec id="sec-2">
      <title>The System</title>
      <p>
        The complete system is composed by two systems adapted to the IR medical domain - SINAI and
IR-n -. They work in a parallel mode. The ranking returned by each one of the systems is merged
following a standard reranking method. Furthermore, for our participation in the mixed modality
- runs that mix a visual and a textual approach - the system uses two modalities for multimodal
reranking. The standard reranking method and the IR-n TF-IDF multimodal rerankig described
in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
2.1
      </p>
      <sec id="sec-2-1">
        <title>The SINAI System</title>
        <p>
          SINAI system use Lemur software1 to textual information retrieval. One of the purposes of the
improvements added to this system is to compare the performance of query expansion using two
different ontologies: MeSH and UMLS. Experiments with the MeSH ontology have been carried
out in the past [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] obtaining good results. The expansion method using MeSH is the same as
presented last year.
        </p>
        <p>On the other hand, the UMLS metathesaurus is a repository of biomedical ontologies and
associated software tools developed by the US National Library of Medicine(NLM)2. It is built
from different source vocabularies. One of the source vocabularies is MeSH ontology.</p>
        <p>
          To expand the queries we have used MetaMap program [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] that was originally developed for
use in information retrieval. In order to reduce the number of terms that could expand the query,
to make it equal to that of MeSH expansion, we have used MetaMap, restricting the semantic
types in the mapped terms [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ].
2.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>The IR-n System</title>
        <p>
          IR-n is an information retrieval system based on passages. Those type of IR systems, unlike
document-based systems, can consider the proximity of words with each other, that appear in a
document in order to evaluate their relevance [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ].
        </p>
        <p>
          This system has added for its current participation in this task a common approach to the
multimodal issue. It allows two working modes. The first one is the standard one for merging two
lists, based on set values to the weighting factor of each list in order to create a joined list. The
second one is the TF-IDF multimodal reranking, it is a variation of the standard one. It bases
the calculus of the relevance of an image on the quantity and the quality of its annotations in
order to decide whether the relevance value returned by the textual IR system is enough to rank
a document or it is needed to add the relevance returned by a CBIR system [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ].
        </p>
        <p>
          Furthermore, in order to adapt the system to this restricted domain, it has used two automatic
query expansion techniques related with the medical domain. On one hand it adds expanded
terms to the queries based on MeSH ontology. And on the other hand when in the query there
are terms related to the type of the images that have to be retrieved, the system uses a negative
term expansion based on the terms related to other types of a taxonomy of types of images. In
order to move away from the top positions of the ranking, those documents which do not belong
to the type/s requested in the query [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ].
        </p>
        <p>
          Finally in this CLEF edition IR-n has added Local Context Analisy (LCA) [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] as alternative
strategy to PRF, in order to compare its behaviour within this restricted domain.
1http://www.lemurproject.org
2http://www.nlm.nih.gov/
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Training</title>
      <p>This section describes the training process that was carried out in order to obtain the best possible
features for improving the performance of the whole system. The collections and resources are
described first, and the next section describes specific experiments.
3.1</p>
      <sec id="sec-3-1">
        <title>Data Collections</title>
        <p>
          For this year task, a new collection has been used in order to evaluate and to compare the
participant systems. This collection is the Goldminer collection 3. The subset used contains all
images from articles published in Radiology and Radiographics including the text of the captions
and a link to the html of the full text articles. We have used two versions of this textual collection
for generate our submitted runs - which were obtained by SINAI group following the method
described in [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] -:
        </p>
        <sec id="sec-3-1-1">
          <title>CT: It contains image captions and article titles.</title>
          <p>CTS: It contains captions, titles and texts of the sections where the images appear.</p>
          <p>
            The training for our participation have been done with the Consolidated Collection. This
collection consists the image collection and topics used in ImageCLEFmed 2005-2007 merged into
one single new collection, with relevance judgments done for all topics based on all collections. We
have used a preprocessed version of this collection which is compounded by a textual document
per image, which has been translated to English when it has been need it [
            <xref ref-type="bibr" rid="ref3">3</xref>
            ] -.
3.2
          </p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>Experiments</title>
        <p>In the training phase, initially for each IR system, we have worked in the selection of those runs
which better results obtained on its training phase. Next, in order to figure out which are the
most suitable weighting values for the re-reanking module, we have carried out a training phase
with each pair of selected runs in the previous step.</p>
        <p>The runs selected using SINAI system have been the following:</p>
        <sec id="sec-3-2-1">
          <title>OnlyText: Baseline experiment. OnlytextMeshSimAceSinrepe: Query expanded with MeSH ontology. OnlytextUmlsFiltered: Query expanded with UMLS metathesaurus using MetaMap program.</title>
        </sec>
        <sec id="sec-3-2-2">
          <title>The runs selected using IR-n system have been the following:</title>
          <p>EXP NEGATIVA: It uses negative expansion based on the acquisition type of the image
EXP NEGATIVA MeSH: It uses negative expansion based on the acquisition type of the
image and query expansion based on MeSH ontology.</p>
          <p>EXP PRF NEGATIVA MESH: It uses negative expansion based on the acquisition type
of the image, query expansion based on MeSH ontology and PRF as relevance feedback
strategy.</p>
          <p>The Table 1, show us the MAP obtained for each standalone run and the MAP obtained with
the best standard reranking run with the weights used.</p>
          <p>We can see that the best results have been obtained combining the IR-n run which have used
the negative expansion based on the acquisition type of the image, with the SINAI runs which have
used MeSH or UMLS for the query expansion. We select these two configurations for our textual
submissions of this year, - we have submitted 2 runs using the CT collection and 2 more using
the CTS collection - . Furthermore we this two configurations to figure out the best parameters
to perform a multimodal reranking in order to merge their output with the ranking list returned
by a CBIR.</p>
          <p>For this training phase we have used the University of Geneva CBIR baseline run for the 2007
query set - in the moment of the experiments we did not have available a CBIR baseline that
answer to the whole consolidated query set -. Since that the reranking results have been obtained
with runs that for the 2007 queries - 30 queries of a total of 85 queries - have used the reranking
technique and for the other queries they have used a purely textual retrieval. Bearing that in mind
we have evaluated the results of the reranking experiments as if its improvement or worsening was
minimized by its partial using with the training query set.</p>
          <p>The Table 2, shows us the resulting MAP obtained for the previous reranked runs, the MAP
of the CBIR baseline run, the multimodal reranking strategy used to combine those runs, the best
configuration of weighting values - for the standard reranking strategy - or the best threshold -for
the TF-IDF reranking - and its MAP result.</p>
          <p>Its important to stand out the extremely low threshold percentage value used in the best
TF-IDF reranking run - specially if we compare it with the 60</p>
          <p>For our participation in the task, we have submitted these 4 multimodal runs within these
mixed modality - mixing a visual and a textual approach - using CT collection and 2 runs more
with the best configuration using the collection CTS.
4</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Results in ImageCLEF Medical Retrieval 2008</title>
      <p>The Tables 3 and 4 shows us the official MAP results, the ranking position within the textual and
the mixed modality respectively and the ranking position within all the participant runs.</p>
      <p>In the Table 3 we can see that our best TEXT-MESS run in the text modality has been ranked
in the 5th place. It is the same configuration that obtained the best results in our training phase.
It uses IR-n with negative expansion based on the acquisition type of the image - Type - and</p>
      <sec id="sec-4-1">
        <title>SINAI with MeSH based expansion - mesh -.</title>
        <p>We can observe that our standard reranking runs has gone down in the ranking, while the
TF-IDF runs are in the top positions of the ranking, but they obtain the same MAP as the MAP
obtained by the same configuration but without multimodal reranking - textual submissions -. It
has happened because the threshold that we have used for TF-IDF reranking strategy is too much
low for the competition collection. It makes that the system treat all the documents retrieved by
the CBIR as if they have enough textual information to perform a suitable retrieval - skipping their
CBIR relevance value-. We also have had a tuning problem with the parameters of the standard
reranking strategy. Since that we only trained it using only a subset of the whole training query
set and that we have used a different collection and a different CBIR from the ones we used in the
competition. All of this has affected negatively to the performance of this strategy with the test
collection.
The major finding in this results has been to check that the performance of the global system can
be improved joining two standalone systems which use complementary and successful methods for
improving the retrieval.</p>
        <p>Furthermore, the fact that the negative query expansion based on the acquisition type of the
image using a non visual approach has had a good behaviour complementing the work done by
the SINAI system using MeSH, is a non expected good result, that we should study and exploit
in the future.</p>
        <p>Finally, in order to avoid the problems experienced with the TF-IDF reranking strategy, we are
planning on to work on finding an alternative method to establish the TF-IDF reranking threshold.
Which instead of use the documents retrieved, use the whole collection to work out this value.
6</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgement</title>
      <p>This research has been partially funded by the Spanish Government within the framework of the
TEXT-MESS (TIN-2006-15265-C06-01 and TIN-2006-15265-C06-03) project.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>M.C. D´</surname>
            ıaz-Galiano,
            <given-names>M.A.</given-names>
          </string-name>
          <string-name>
            <surname>Garc</surname>
          </string-name>
          <article-title>´ıa-</article-title>
          <string-name>
            <surname>Cumbreras</surname>
            ,
            <given-names>M.T.</given-names>
          </string-name>
          <string-name>
            <surname>Mart</surname>
          </string-name>
          <article-title>´ın-</article-title>
          <string-name>
            <surname>Valdivia</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Montejo-Raez</surname>
            , and
            <given-names>L.A.</given-names>
          </string-name>
          <string-name>
            <surname>Uren</surname>
          </string-name>
          <article-title>˜a Lo´pez. SINAI at ImageCLEFmed 2007</article-title>
          . In In on-line Working Notes,
          <string-name>
            <surname>CLEF</surname>
          </string-name>
          <year>2007</year>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>M.C. D´</surname>
            ıaz-Galiano,
            <given-names>M.A.</given-names>
          </string-name>
          <string-name>
            <surname>Garc</surname>
          </string-name>
          <article-title>´ıa-</article-title>
          <string-name>
            <surname>Cumbreras</surname>
            ,
            <given-names>M.T.</given-names>
          </string-name>
          <string-name>
            <surname>Mart</surname>
          </string-name>
          <article-title>´ın-</article-title>
          <string-name>
            <surname>Valdivia</surname>
            ,
            <given-names>L.A.</given-names>
          </string-name>
          <string-name>
            <surname>Uren</surname>
          </string-name>
          <article-title>˜a Lo´pez, and A Montejo-Ra´ez. SINAI at ImageCLEFmed 2008</article-title>
          . In In on-line Working Notes,
          <string-name>
            <surname>CLEF</surname>
          </string-name>
          <year>2008</year>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>M.C. D´</surname>
            ıaz-Galiano,
            <given-names>M.A.</given-names>
          </string-name>
          <string-name>
            <surname>Garc</surname>
          </string-name>
          <article-title>´ıa-</article-title>
          <string-name>
            <surname>Cumbreras</surname>
            ,
            <given-names>L.A.</given-names>
          </string-name>
          <string-name>
            <surname>Uren</surname>
          </string-name>
          <article-title>˜a Lo´pez, M.T. Mart´ın-</article-title>
          <string-name>
            <surname>Valdivia</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Montejo-Raez</surname>
          </string-name>
          .
          <article-title>SINAI at ImageCLEF 2006</article-title>
          . In Working Notes,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Fernando</given-names>
            <surname>Llopis. IR-n: Un Sistema de Recuperacin de Informacin Basado</surname>
          </string-name>
          en Pasajes.
          <source>PhD thesis</source>
          , University of Alicante,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Sergio</given-names>
            <surname>Navarro</surname>
          </string-name>
          , Fernando Llopis, and
          <article-title>Rafael Mun˜oz.</article-title>
          <string-name>
            <given-names>A.R.</given-names>
            <surname>Effective</surname>
          </string-name>
          <article-title>Mapping of Biomedical text to the UMLS Metathesaurus: the MetaMap Program</article-title>
          .
          <source>In Proc.of the AMIA Symposium</source>
          ,
          <source>pages Nov.3-717-21</source>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Sergio</given-names>
            <surname>Navarro</surname>
          </string-name>
          , Fernando Llopis, and
          <article-title>Rafael Mun˜oz. Different Multimodal Approaches using IR-n in ImageCLEFphoto 2008</article-title>
          . In In on-line Working Notes,
          <string-name>
            <surname>CLEF</surname>
          </string-name>
          <year>2008</year>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Jinxi</given-names>
            <surname>Xu</surname>
          </string-name>
          and
          <string-name>
            <given-names>W. Bruce</given-names>
            <surname>Croft</surname>
          </string-name>
          .
          <article-title>Improving the effectiveness of information retrieval with local context analysis</article-title>
          .
          <source>ACM Trans. Inf</source>
          . Syst.,
          <volume>18</volume>
          (
          <issue>1</issue>
          ):
          <fpage>79</fpage>
          -
          <lpage>112</lpage>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>