<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Information Retrieval</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Sara Lana-Serrano</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>José M. Goñi-Menoyo</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>José C. González-Cristóbal</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Universidad Politécnica de Madrid</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>DAEDALUS - Data</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Decisions</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Language</string-name>
        </contrib>
      </contrib-group>
      <pub-date>
        <year>2005</year>
      </pub-date>
      <volume>3237</volume>
      <fpage>210</fpage>
      <lpage>219</lpage>
      <abstract>
        <p>This paper presents the 2005 MIRACLE's team approach to Cross-Language Geographical Retrieval (GeoCLEF). The main goal of the GeoCLEF participation of the MIRACLE team was to test the effect that geographical information retrieval techniques cause to information retrieval. The baseline approach is based on the development of named entity recognition and geospatial information retrieval tools and on its combination with linguistic techniques to perform indexing and retrieval tasks.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <sec id="sec-1-1">
        <title>Geographical IR, geographic entity recognition, spatial retrieval, gazetteer, linguistic engineering, information</title>
        <p>retrieval, trie indexing.</p>
        <p>
          The MIRACLE team is made up of three university research groups located in Madrid (UPM, UC3M and UAM)
along with DAEDALUS, a company founded in 1998 as a spin-off of two of these groups. DAEDALUS is a
leading company in linguistic technologies in Spain and is the coordinator of the MIRACLE team. This is the
third participation in CLEF, after years 2003 and 2004 [4], [5], [7], [8], [9], [
          <xref ref-type="bibr" rid="ref4">10</xref>
          ],[
          <xref ref-type="bibr" rid="ref5">11</xref>
          ], [
          <xref ref-type="bibr" rid="ref11">17</xref>
          ], [
          <xref ref-type="bibr" rid="ref12">18</xref>
          ]. As well as
GeoCLEF tasks, the team has participated in the ImageCLEF, Q&amp;A, WebCLEF and bilingual, monolingual and
cross lingual tracks.
        </p>
        <p>In GeoCLEF task the objective is to evaluate Geographical Information Retrieval (GIR) system involving both
spatial and multilingual aspects. The main challenges in the development of a system of these characteristics are
the side aspects of the main problem of geographical information retrieval in a multilingual environment
(translating locations, ambiguity of geo-references, finding/creating a multilingual gazetteer…) and the inherent
ones to the information retrieval (stemming, transformation, filtering, generation of n-grams, relevance feedback,
indexing…).</p>
        <p>The main objective of the MIRACLE team participation in GeoCLEF task has been to have a first contact with
Geographical Information Retrieval systems, focusing most of the effort on the resolution of problems related to
the geospatial retrieval: creating multilingual gazetteers, geo-entities recognition, processing spatial queries,
document tagging, and document and topic expansion. For information retrieval we have used the set of basic
components developed for MIRACLE team [5]: stemming, transformation (transliteration, elimination of
diacritics and conversion to lowercase) and filtering (elimination of stop and frequent words). A more in-depth
description of the MIRACLE toolbox used for pre-processing and indexing the document collections required
for this track can be found in the paper “MIRACLE’s 2005 Approach to Monolingual Information Retrieval”
that can be found in this on-line documentation.</p>
      </sec>
      <sec id="sec-1-2">
        <title>For this year, we have submitted runs for the following tracks: a) b)</title>
      </sec>
      <sec id="sec-1-3">
        <title>Monolingual English.</title>
      </sec>
      <sec id="sec-1-4">
        <title>Monolingual German.</title>
        <p>1</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>Geo-entity Recognition</title>
      <p>The general task of Named Entity Recognition (NER) involves the identification of proper names in the text and
their classification as different types of named entities (persons, organizations, locations). The lexical resources
that are typically included in a NER system are a lexicon and a grammar. The lexicon stores, using one or more
lists, a set of well-known names classified according to their type. The grammar is used for disambiguating the
entities that match the lexicon entries in more than one list.</p>
      <p>The geo-entity recognition process that we have developed involves a lexicon consisting of a gazetteer list of
geographical resources and several modules for linguistic processing, carrying tasks such as geo-entity
identification and tagging.</p>
      <sec id="sec-2-1">
        <title>Gazetteer creation</title>
        <p>A gazetteer is an index or geographical directory consists of geo-gazetteer entries that define natural and cultural
features with one o more names in one or more languages, sets of coordinates, feature designations, hierarchical
relationships and complementary information.</p>
        <p>
          For lexicon creation we have coalesced two existing gazetteers: the Geographic Names Information System
(GNIS) gazetteer of the U.S. Geographic Survey [
          <xref ref-type="bibr" rid="ref9">15</xref>
          ] and the Geonet Names Server (GNS) gazetteer of the
National Geospatial Intelligence Agency (NGA) [
          <xref ref-type="bibr" rid="ref10">16</xref>
          ]. When used together, they meet the main criteria for
gazetteer selection we have taken into account: world-wide scope, free availability, open format, location using
longitude and latitude coordinates, and homogeneity and high granularity. However, they have some unsuitable
properties for our purposes that we have had to improve:



        </p>
        <sec id="sec-2-1-1">
          <title>They use the geographic area as the only criterion to relate resources. We have provided the gazetteers</title>
          <p>with a flexible structure that allows us to define other types of relationships between resources, for
example based on its language (Latin America, countries Anglo-Saxon) or religion (catholic, protestant,</p>
        </sec>
        <sec id="sec-2-1-2">
          <title>Islamic,...).</title>
        </sec>
        <sec id="sec-2-1-3">
          <title>The top of the hierarchic relationships between resources is the country. It has been necessary to add new features to all the entries to store information about the continent they belong to.</title>
        </sec>
        <sec id="sec-2-1-4">
          <title>The entries are in vernacular language. We have selected the most relevant geographic resources</title>
          <p>(continents, countries, region, counties/provinces and more popular cities) and translated them into</p>
        </sec>
        <sec id="sec-2-1-5">
          <title>English, Spanish and German languages. For this task, manual translating has been combined with the</title>
        </sec>
        <sec id="sec-2-1-6">
          <title>Systran software [13].</title>
          <p>The gazetteer we have been finally working with has 7,323,408 entries, each one characterized by several
features, such as unique identifier, continent, country, longitude, latitude, name, etc.</p>
          <p>
            The information retrieval engine used for indexing and searching the gazetteers has been Lucene [
            <xref ref-type="bibr" rid="ref2">2</xref>
            ]. Lucene is a
freely available open-source from the Apache Jakarta project. Lucene supports a Boolean query language,
performs ranked retrieval using the standard tf.idf weighting scheme with the cosine similarity measure and
allows content tagging by treating documents as collections of fields.
          </p>
        </sec>
      </sec>
      <sec id="sec-2-2">
        <title>Named Geo-entity Identification</title>
        <p>The developed named geo-entity identifier involves several stages: text preprocessing by filtering special
symbols and punctuation marks, initial delimitation by selecting tokens with a starting uppercase letter, token
expansion by searching possible named entities consisting of more than one word, and filtering tokens that do
not match exactly any gazetteer entry.</p>
      </sec>
      <sec id="sec-2-3">
        <title>Named Entity Tagging</title>
        <p>For the geographical entities tagging we have chosen an annotation scheme that allows us to specify the
geographical path to the entity. Each one of the elements of this path provides information of its level in the
geographical hierarchy (continent, country, region…) as well as a unique identifier that distinguishes it from the
rest of geographical resources of the gazetteer.</p>
        <p>

</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Topic expansion</title>
      <sec id="sec-3-1">
        <title>The topic expansion tool developed consists of three functional blocks:</title>
      </sec>
      <sec id="sec-3-2">
        <title>Geo-entity Identifier: identifies geographic entities using the information stored in the gazetteer.</title>
      </sec>
      <sec id="sec-3-3">
        <title>Spatial Relation Identifier: identifies spatial relationships. It can identify the spatial relations defined in a configuration file. Each entry in this file defines both a spatial relationship and its related regular expressions which define patterns for several languages.</title>
      </sec>
      <sec id="sec-3-4">
        <title>Expander: tags and expands the topic in order to identify the spatial relationships and the geo-entities related to them. This block uses a relational database system to compute the points located in a geographic area whose centroid is known.</title>
      </sec>
      <sec id="sec-3-5">
        <title>The expansion made by the algorithm is determined by the type of geographic resource (continent,</title>
        <p>country, region, county, city…) and the associated spatial relation. Table 1 shows the different space
relations supported by the algorithm and the expansion conducted for each one from them.</p>
        <p>All the expansions are based on determining the existing
geographical resources in a space region delimited by, at least,
three of the following points: N, S, E and W, where:





</p>
      </sec>
      <sec id="sec-3-6">
        <title>C is the centroid of the geographic resource.</title>
      </sec>
      <sec id="sec-3-7">
        <title>N is the point locates d km north of C.</title>
      </sec>
      <sec id="sec-3-8">
        <title>S is the point locates d km south of C.</title>
      </sec>
      <sec id="sec-3-9">
        <title>E is the point locates d km east of C.</title>
      </sec>
      <sec id="sec-3-10">
        <title>W is the point locates d km west of C. d depend on the resource and spatial relation.</title>
        <p>3</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Description of the experiments</title>
      <p>The baseline approach to processing documents and topic queries is composed of the following sequence of
steps:</p>
      <sec id="sec-4-1">
        <title>Extraction: ad-hoc scripts are run on the files that contain particular documents or topic queries</title>
        <p>collections, to extract the textual data enclosed in XML marks. We have used HEADLINE and TEXT
marks for document collections and the TITLE, DESC, CONCEPT, SPATIALRELATION and</p>
      </sec>
      <sec id="sec-4-2">
        <title>LOCATION marks for topics. The contents inside these marks were concatenated to feed the followings steps.</title>
      </sec>
      <sec id="sec-4-3">
        <title>Remove accents: all documents words are normalized by eliminating accents in words. In spite of this process provides better results running it before the stemming step, we have had to do in this order because our gazetteer consists of normalized entity names.</title>
      </sec>
      <sec id="sec-4-4">
        <title>Geo-entity Recognition or Topic Expansion: All document collections and topics are parsed and</title>
        <p>tagged using the geo-entity recognition tool and the topic expansion tool introduced in the previous
section.</p>
        <p>Lowercase words: all document words and tags are normalized by changing all uppercase letters to
lowercase.</p>
      </sec>
      <sec id="sec-4-5">
        <title>5. Stopwords filter: all the words known as stop words are eliminated from the document.</title>
      </sec>
      <sec id="sec-4-6">
        <title>6. Stemming: the process known as stemming is applied to each one of the words of the document.</title>
      </sec>
      <sec id="sec-4-7">
        <title>7. Indexing: once all document collections have been processed, they are indexed. For this GeoCLEF edition we have used the two following search engines applying them to different experiments:</title>
        <p>
</p>
      </sec>
      <sec id="sec-4-8">
        <title>Indexing and retrieval system based on the trie [1] data structure developed by</title>
      </sec>
      <sec id="sec-4-9">
        <title>MIRACLE team during the two last years [6].</title>
      </sec>
      <sec id="sec-4-10">
        <title>Lucene system from the Apache Jakarta project. 8.</title>
      </sec>
      <sec id="sec-4-11">
        <title>Retrieval: once all topic queries have been processed and expanded they are fed to the trie or Lucene</title>
        <p>engine for searching the previously built index. In our experiments we have only used OR combinations
on the search terms.</p>
        <p>
          For running most of the previous steps, we have used the set of basic components developed by MIRACLE team
[5] adapting them when needed. We have used Porter [
          <xref ref-type="bibr" rid="ref6">12</xref>
          ] stemmers and some resources from Neuchatel [
          <xref ref-type="bibr" rid="ref8">14</xref>
          ].
For this year, we have submitted only runs for monolingual tracks. In addition to the required experiment
(identified with the suffix NOR in the run identifier) we have defined four additional experiments. They are
differentiated mainly in the search engine used as well as in the topic processing. The experiments whose run
identifier has the prefix GC have used the trie-based search engine whereas these ones whose run identifier has
the prefix LGC have used Lucene system.
        </p>
        <p>The suffix CS and NCS refer to topic processing. For topics processing we have used topic title, topic
description and all the geographical tags provided. In the experiments whose run identifier end in CS, all the
topic text has fed the topic expansion process, whereas for the ones that end in NCS we have used only the text
from the geographical tag for topic expansion.</p>
        <p>The following figures show the results obtained by the experiments in monolingual English (EN) and
monolingual German (DE) tasks.</p>
        <p>Interpolated Recall vs Average Precision - EN</p>
        <p>Interpolated Recall vs Average Precision - DE
65
)60
(%55
s50
eg45
ra40
veA3305
n25
io20
is15
ec10
rP 5
0
If we analyze the individual topic results, we observe that the topic expansion improves slightly the precision
results for some topics, but it gets worse for others. For a topic such as ‘…rice imports in Japan…’, the topic
expansion process, in conjunction with OR based searching, transforms documents with any Japanese resources
into pertinent documents. In other topics, such as topic number 016, in which an ambiguous query (…oil
prospecting in Siberia…) meets a high granularity in the gazetteer, the topic expansion produces considerably
worse results (our gazetteer stores 47 different resources named exactly Siberia).</p>
        <p>We can assert that CS experiments provide worse results than NCS experiments. This fact can be explained since
the geo-entity recognition process do not have the capability to distinguish the class of named entities outcoming
noise.</p>
        <p>30</p>
        <p>40 50 60 70</p>
        <p>Interpolated Recall (%)</p>
        <p>GCenCS GCenNCS LGCenCS
GCenNOR</p>
        <p>LGCenNCS</p>
        <p>GCdeNOR
0
10
20
80</p>
        <p>Precision averages (%) for individual queries - DE</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusions</title>
      <p>The fundamentals of a geographical information system are the Named Entity Recognition System (NER) in
conjunction with the Geographic Information Retrieval (GIR). At this GeoCLEF edition we have tried to attack
both aspects of the problem. In order to obtain a solution that approaches better to all the aspects of the problem
a great human effort is required. For this reason we have obtained only one first approach that will be necessary
improved.</p>
      <p>Nevertheless, in spite of the drawbacks of our solution, we consider that the set of topics selected for the
experiments are not very suitable to evaluate the kindness of such approach, due to the small number of pertinent
documents. This fact has had a negative impact on the evaluation of the performance of the module of geospatial
relationships processing.
5</p>
    </sec>
    <sec id="sec-6">
      <title>Future work</title>
      <sec id="sec-6-1">
        <title>Future work of the MIRACLE team in this task will be directed to several action lines:</title>
        <p>
</p>
      </sec>
      <sec id="sec-6-2">
        <title>Improvement of the named entity recognition system adding to it part of speech tagging, classification of the entities and geo-entity disambiguation.</title>
      </sec>
      <sec id="sec-6-3">
        <title>Incorporation of the improvements obtained by the MIRACLE team, by means of its participation in bilingual, monolingual and cross lingual tracks, by using selective or averaging result combination techniques for information retrieval.</title>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgements</title>
      <p>This work has been partially supported by the Spanish R+D National Plan, by means of the project RIMMEL
(Multilingual and Multimedia Information Retrieval, and its Evaluation), TIN2004-07588-C03-01.
Special mention to our colleagues of the MIRACLE team should be done (in alphabetical order): Ana María
García-Serrano, Ana González-Ledesma, José Mª Guirao-Miras, José Luis Martínez-Fernández, Paloma</p>
      <sec id="sec-7-1">
        <title>Martínez-Fernández, Ángel Martínez-González, Antonio Moreno-Sandoval and César de Pablo-Sánchez.</title>
        <p>GCdeNOR
GCdeCS
GCdeNCS
LGCdeCS
LGCdeNCS</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>Appendix: Tables and figures</title>
      <p>[3] Automatic Trans SL, Spain. Automatic translation server. On line http://www.automatictrans.es [Visited
28/07/2005].
[4] Goñi-Menoyo, José M; González, José C.; Martínez-Fernández, José L.; and Villena, J. MIRACLE’s</p>
      <sec id="sec-8-1">
        <title>Hybrid Approach to Bilingual and Monolingual Information Retrieval. CLEF 2004 proceedings (Peters,</title>
      </sec>
      <sec id="sec-8-2">
        <title>C. et al., Eds.). Lecture Notes in Computer Science, vol. 3491, pp. 188-199. Springer, 2005 (to appear).</title>
        <p>[5] Goñi-Menoyo, José M.; González, José C.; Martínez-Fernández, José L.; Villena-Román, Julio;
García</p>
      </sec>
      <sec id="sec-8-3">
        <title>Serrano, Ana; Martínez-Fernández, Paloma; de Pablo-Sánchez, César; and Alonso-Sánchez, Javier.</title>
      </sec>
      <sec id="sec-8-4">
        <title>MIRACLE’s hybrid approach to bilingual and monolingual Information Retrieval. Working Notes for the</title>
      </sec>
      <sec id="sec-8-5">
        <title>CLEF 2004 Workshop (Carol Peters and Francesca Borri, Eds.), pp. 141-150. Bath, United Kingdom,</title>
        <p>2004.</p>
      </sec>
      <sec id="sec-8-6">
        <title>Martínez, J.L.; Villena-Román, J.; Fombella, J.; García-Serrano, A.; Ruiz, A.; Martínez, P.; Goñi, J.M.;</title>
        <p>and González, J.C. (Carol Peters, Ed.): Evaluation of MIRACLE approach results for CLEF 2003.</p>
      </sec>
      <sec id="sec-8-7">
        <title>Working Notes for the CLEF 2003 Workshop, 21-22 August, Trondheim, Norway.</title>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Aoe</surname>
            , Jun-Ichi; Morimoto, Katsushi; Sato,
            <given-names>Takashi.</given-names>
          </string-name>
          <article-title>An Efficient Implementation of Trie Structures</article-title>
          .
          <source>Software Practice and Experience</source>
          <volume>22</volume>
          (
          <issue>9</issue>
          ):
          <fpage>695</fpage>
          -
          <lpage>721</lpage>
          ,
          <year>1992</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <article-title>[2] Apache Lucene project</article-title>
          . On line http://lucene.apache.
          <source>org [Visited</source>
          <volume>17</volume>
          /08/2005]. [6]
          <string-name>
            <surname>Goñi-Menoyo</surname>
          </string-name>
          , José Miguel;
          <article-title>González-Cristóbal, José Carlos</article-title>
          and
          <string-name>
            <surname>Fombella-Mourelle</surname>
            ,
            <given-names>Jorge.</given-names>
          </string-name>
          <article-title>An optimised trie index for natural language processing lexicons</article-title>
          .
          <source>MIRACLE Technical Report</source>
          . Universidad Politécnica de Madrid,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Martínez-Fernández</surname>
          </string-name>
          , José L.;
          <string-name>
            <surname>García-Serrano</surname>
            , Ana; Villena,
            <given-names>J.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Méndez-Sáez</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ;
          <article-title>MIRACLE approach to ImageCLEF 2004: merging textual and content-based Image Retrieval</article-title>
          .
          <article-title>CLEF 2004 proceedings</article-title>
          (Peters,
          <string-name>
            <surname>C.</surname>
          </string-name>
          et al.,
          <source>Eds.). Lecture Notes in Computer Science</source>
          , vol.
          <volume>3491</volume>
          . Springer,
          <year>2005</year>
          (to appear).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [10]
          <string-name>
            <surname>de Pablo</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Martínez-Fernández</surname>
            ,
            <given-names>J. L.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Martínez</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Villena</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>García-Serrano</surname>
            ,
            <given-names>A. M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Goñi</surname>
            ,
            <given-names>J. M.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>González</surname>
            ,
            <given-names>J. C.</given-names>
          </string-name>
          <article-title>miraQA: Initial experiments in Question Answering</article-title>
          .
          <source>Working Notes for the CLEF 2004 Workshop</source>
          , pp.
          <fpage>405</fpage>
          -
          <lpage>411</lpage>
          (Carol Peters and Francesca Borri, Eds.), pgs.
          <fpage>371</fpage>
          -
          <lpage>376</lpage>
          . Bath, United Kingdom,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [11]
          <string-name>
            <surname>de Pablo</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Martínez-Fernández</surname>
            ,
            <given-names>J. L.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Martínez</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Villena</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>García-Serrano</surname>
            ,
            <given-names>A. M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Goñi</surname>
            ,
            <given-names>J. M.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>González</surname>
            ,
            <given-names>J. C.</given-names>
          </string-name>
          <article-title>miraQA: Initial experiments in Question Answering</article-title>
          .
          <article-title>CLEF 2004 proceedings</article-title>
          (Peters,
          <string-name>
            <surname>C.</surname>
          </string-name>
          et al.,
          <source>Eds.). Lecture Notes in Computer Science</source>
          , vol.
          <volume>3491</volume>
          . Springer,
          <year>2005</year>
          (to appear).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Porter</surname>
            ,
            <given-names>Martin.</given-names>
          </string-name>
          <article-title>Snowball stemmers and resources page</article-title>
          . On line http://www.snowball.tartarus.
          <source>org [Visited</source>
          <volume>13</volume>
          /07/2005].
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>SYSTRAN</given-names>
            <surname>Software</surname>
          </string-name>
          <article-title>Inc</article-title>
          .,
          <source>USA. SYSTRAN 5</source>
          .
          <article-title>0 translation resources</article-title>
          . On line http://www.systransoft.
          <source>com [Visited</source>
          <volume>13</volume>
          /07/2005].
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [14] University of Neuchatel.
          <article-title>Page of resources for CLEF (Stopwords, transliteration</article-title>
          , stemmers …). On line http://www.unine.ch/info/clef
          <source>[Visited</source>
          <volume>13</volume>
          /07/2005].
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>U.S. Geological</given-names>
            <surname>Survey</surname>
          </string-name>
          . On line http://www.usgs.
          <source>gov [Visited</source>
          <volume>17</volume>
          /08/2005].
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>U.S.</given-names>
            <surname>National Geospatial Intelligence Agency</surname>
          </string-name>
          . On line http://www.nga.
          <source>mil [Visited</source>
          <volume>17</volume>
          /08/2005].
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Villena</surname>
          </string-name>
          , Julio; Martínez, José L.;
          <string-name>
            <surname>Fombella</surname>
          </string-name>
          , Jorge; G. Serrano, Ana; Ruiz, Alberto; Martínez, Paloma; Goñi, José M.; and González, José C.
          <article-title>Image Retrieval: The MIRACLE Approach</article-title>
          .
          <article-title>Comparative Evaluation of Multilingual Information Access Systems (Peters, C; Gonzalo</article-title>
          ,
          <string-name>
            <surname>J.</surname>
          </string-name>
          ; Brascher,
          <string-name>
            <surname>M.</surname>
          </string-name>
          ; and Kluck, M., Eds.).
          <source>Lecture Notes in Computer Science</source>
          , vol.
          <volume>3237</volume>
          , pp.
          <fpage>621</fpage>
          -
          <lpage>630</lpage>
          . Springer,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Villena-Román</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Martínez</surname>
            ,
            <given-names>J.L.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Fombella</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>García-Serrano</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Ruiz</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Martínez</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Goñi</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>González</surname>
            ,
            <given-names>J.C.</given-names>
          </string-name>
          (Carol Peters, Ed.);
          <article-title>MIRACLE results for ImageCLEF 2003</article-title>
          .
          <source>Working Notes for the CLEF 2003 Workshop</source>
          ,
          <fpage>21</fpage>
          -
          <lpage>22</lpage>
          August, Trondheim, Norway.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>