<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>IXA at CLEF 2008 Robust-WSD Task: using Word Sense Disambiguation for (Cross Lingual) Information Retrieval</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Arantxa Otegi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Eneko Agirre</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>German Rigau</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Donostia</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Basque Country</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>General Terms</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Robust Retrieval, CLIR, Word Sense Disambiguation, Query Expansion, Structured Queries</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>IXA NLP Group - University of the Basque Country</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Measurement</institution>
          ,
          <addr-line>Performance, Experimentation</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper describes the participation of the IXA NLP group at the CLEF 2008 Robust-WSD Task. This is our rst time at CLEF, and we participated at both the monolingual (English) and the bilingual (Spanish to English) subtasks. We tried several query and document expansion and translation strategies, with and without the use of the word sense disambiguation results provided by the organizers. All expansions and translations were done using the English and Spanish wordnets as provided by the organizers and no other resource was used. We used Indri as the search engine, which we tuned in the training part. Our main goal was to improve (Cross Lingual) Information Retrieval results using WSD information, and we attained improvements in both mono and bilingual subtasks, although the improvement was only signi cant for the bilingual subtask. As a secondary goal, our best systems ranked 4th overall and 3rd overall in the monolingual and bilingual subtasks, respectively.</p>
      </abstract>
      <kwd-group>
        <kwd>H</kwd>
        <kwd>3 [Information Storage and Retrieval]</kwd>
        <kwd>H</kwd>
        <kwd>3</kwd>
        <kwd>1 Content Analysis and Indexing</kwd>
        <kwd>H</kwd>
        <kwd>3</kwd>
        <kwd>3 Information Search and Retrieval</kwd>
        <kwd>I</kwd>
        <kwd>2 [Arti cial Intelligence]</kwd>
        <kwd>I</kwd>
        <kwd>2</kwd>
        <kwd>7 Natural Language Processing</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>The objective of the robust task in previous editions was to give preference to systems which
achieve good stable performance over all queries. This year it also included an additional goal: to
test whether word sense disambiguation (WSD) can be used bene cially by retrieval systems. All
our experiments have been focused on the second goal and we experimented on how WSD data
can be exploited in order to improve retrieval. In this sense, we carried out di erent expansion and
translation strategies of both the topics and documents with and without word sense information.</p>
      <p>
        For this purpose, we used the open source Indri search engine, which is based on the inference
network framework and supports structured queries [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>The remainder of this paper is organized as follows. Section 2 describes the experiments carried
out, Section 3 presents the results obtained and, nally, Section 4 draws the conclusions and future
work.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Experiments</title>
      <p>In short, our main experimentation strategy consisted on trying several expansion and translation
strategies, all of which used the synonyms in the English and Spanish wordnets made available
by the organizers as the sole resources (i.e., we did not use any other external resource), with and
without word sense information. Our runs have consisted of di erent combination of expanded
(translated) topics and documents. The steps of our retrieval system are the following.</p>
      <p>We rst expand and translate the documents and topics. In a second step we index the original,
expanded and translated document collections. Then we tested di erent query expansion and
translation strategies, and nally we search for the queries in the indexes in various combinations.
We will see each in turn.
2.1</p>
      <sec id="sec-2-1">
        <title>Expansion and translation strategies</title>
        <p>WSD data provided to the participants was based on WordNet version 1.6. Each word sense have
a WordNet synset assigned with a score. Using those synset codes and the English and Spanish
wordnets, we expanded both the documents and the topics. In this way, we generated di erent
topic and document collections using di erent approaches of expansion and translation, as follows:
Full expansion of English topics and documents: expansion to all synonyms of all senses.
Best expansion of English topics and documents: expansion to all synonyms of only the
highest scored sense for each word using the two di erent expansion collections using UBC
and NUS disambiguation data (as provided by organizers).</p>
        <p>Full translation of English documents: translation from English to Spanish of all senses.
Best translation of English documents: translation from English to Spanish of only the
highest scored sense for each word using the two di erent translation collections using UBC
and NUS disambiguation data.</p>
        <p>Translation of Spanish topics: translation from Spanish to English of the rst sense for each
word.</p>
        <p>In the subsequent steps, we used di erent combinations of these expanded and translated
collections.
2.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Indexing</title>
        <p>Once the collections had been pre-processed, they were indexed using Indri. While indexing, the
Indri implementation of the Krovetz stemming algorithm was applied to document terms.</p>
        <p>We created several indexes: one with the original collection words, and one with each collection
created after applying di erent expansion (and translation) strategies, as explained in Section 2.1).</p>
        <p>No stopword list was used, but only nouns, adjectives, verbs and numbers were indexed.
2.3</p>
      </sec>
      <sec id="sec-2-3">
        <title>Query construction</title>
        <p>We constructed queries using the title and description topic elds. Based on the training topics, we
excluded some words and phrased from the queries, such as nd, describing, discussing, document,
report for English and encontrar, describir, documentos, noticias, ejemplos for Spanish.</p>
        <p>After excluding those words and taking only nouns, adjectives, verbs and numbers, we
constructed several queries for each topic as follows:
1. Only original words.</p>
        <sec id="sec-2-3-1">
          <title>2. Both original words and all expansions for each word.</title>
          <p>3. Both original words and only expansions for the best sense of each word.
4. Only translated words, using all translations for each word. If a word had not a translation,
that original word was included in the query.
5. Only translated words, using translations for the best sense of each word. If a word had not
a translation, that original word was included in the query.</p>
          <p>The rst three cases are for the monolingual runs, and the last one for the bilingual runs which
translated the query.</p>
          <p>In the rst case, we constructed a simple query combining only the original words using the
Indri operator #combine. In the other cases, we have used some of the operators available in
the structured query language. For example, when we wanted to use original words as well as
synonyms (obtained after expansion) in the same query, we constructed two subqueries (one with
original words, and another one with the expanded words). Then we integrated both subqueries
in the same query using the #weight operator and giving a weight of 0.6 to the original word's
subquery and 0.4 to the other subquery. We used also the #syn operator to join the expanded
words of each sense, as they are meant to be synonyms. In the case of full expansion, instead
of #syn, we used #wsyn (weighted synonym). This operator allows to give di erent weights to
synonyms. So, as in the previous case, we joined synonyms, and also, we weighted each
synonymset with the score that the disambiguation system had assigned to each sense. Finally, multiword
expressions, such as prime minister are added to the query joined with the #1 operator (ordered
window).
2.4</p>
        </sec>
      </sec>
      <sec id="sec-2-4">
        <title>Retrieval</title>
        <p>We carried out several retrieval experiments combining di erent kinds of indexes with di erent
kinds of queries. We used the training data to perform extensive experimentation, and chose
the ones with best MAP results in order to created the test topic runs. The submitted runs are
described in Section 3.</p>
        <p>In some of the experiments we applied pseudo-relevance feedback (PRF) with these default
parameters: fbDocs:10, fbTerms:50, fbMu:0 and fbOrigWeight: 0.5. Unfortunately, we did not
have time to tune those parameters for the o cial deadline.
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Results</title>
      <p>En2EnFullStructTopNusDocsPsrel original and fully expanded terms in topics; both
both original and expanded terms in documents, using best sense according to NUS
word sense disambiguation; PRF.
monolingual no WSD
bilingual
with WSD
no WSD
with WSD</p>
      <p>runId</p>
      <sec id="sec-3-1">
        <title>En2EnNowsd</title>
        <p>En2EnNowsdPsrel
En2EnNusDocsPsrel
En2EnUbcDocsPsrel
En2EnFullStructTopsNusDocsPsrel
Es2EnNowsd
Es2EnNowsdPsrel
Es2EnNusDocsPsrel
Es2EnUbcDocsPsrel
Es2En1stTopsNusDocsPsrel
Es2En1stTopsUbcDocsPsrel
Es2EnNowsd original terms in topics (in Spanish); translated terms in documents (from</p>
        <p>English to Spanish).</p>
        <p>Es2EnNowsdPsrel same as Es2EnNowsd, but with PRF.
bilingual with WSD:
Es2EnNusDocsPsrel original terms in topics (in Spanish); translated terms in documents,
using the best sense according to NUS word sense disambiguation; PRF.</p>
        <p>Es2EnUbcDocsPsrel original terms in topics (in Spanish); translated terms in documents,
using the best sense according to UBC word sense disambiguation; PRF.</p>
        <p>Es2En1stTopsNusDocsPsrel translated terms in topics (from Spanish to English) for
rst sense in Spanish; both original and expanded terms of the best sense according to
NUS disambiguation data; PRF.</p>
        <p>Es2En1stTopsUbcDocsPsrel translated terms in topics (from Spanish to English) for
rst sense in Spanish; both original and expanded terms of the best sense according to
UBC disambiguation data; PRF.</p>
        <p>The results show that the use of WSD data has been e ective. With respect to
monolingual retrieval, En2EnUbcDocsPsrel obtains the best results from our runs, although no
significant di erence is found with respect to En2WnNowsdPsrel1. Regarding the bilingual results,
Es2En1stTopsUbcDocsPsrel is the best, and it is signi cantly better than Es2EnNowsdPsrel.
These results con rm the results we got in the training data. Although not shown here, the
results of using WSD are signi cantly better in the training data with respect to using all senses
(full expansion).</p>
        <p>Although it was not our main goal, our systems ranked high in the exercise, making the 7th
best in the monolingual no-WSD subtask, 9th in monolingual using WSD, 5th best in the bilingual
no-WSD subtask, and 1st in bilingual using WSD. Overall, our systems ranked 4th overall and
3rd overall in the monolingual and bilingual subtasks, respectively.</p>
        <p>After analyzing the experiments and the results, we have found that the approach of expanding
the documents works better than expanding the topics. The extensive experimentation that we
performed on the use of structured queries did not yield better results than just expanding the
documents.</p>
        <p>In our experiments we did not make any e ort to deal with hard topics, and we only paid
attention to improvements in Mean Average Precision (MAP) metric. That is why the Geometric
Mean Average Precision (GMAP) values are lower.</p>
        <p>1Paired Randomization Tests over MAPs with =0.05 have been used along this work</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusions and future work</title>
      <p>We have reported our experiments for the Robust-WSD Track at CLEF. All our runs ended up
in good ranking, taking into account that these have been our rst experiments in the eld of
information retrieval. This is remarkable, as we did not use any external resources, except the
WSD information and Spanish and English wordnets provided by the organizers, and given that
we did not do any proper parameter tuning (e.g. in the relevance feedback step) on the training
part.</p>
      <p>Our main goal was to get better (CL)IR results using WSD and we achieved it, obtaining
remarkable gains in bilingual IR, and smaller gains in monolingual IR. We discovered that using
the WSD information for documents was a good strategy, in contrast to most of previous IR work,
which has focused on WSD of topics.</p>
      <p>For the future we plan to improve the bilingual results, mainly incorporating external resources.
We tried straightforward methods to exploit WSD information for the expansion and indexing of
the documents. We would like to pursue more sophisticated methods.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgements</title>
      <p>This work has been supported by KNOW (TIN2006-15049-C03-01) and KYOTO
(ICT-2007211423). Arantxa Otegi's work is funded by a PhD grant from the Basque Government.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>H.</given-names>
            <surname>Turtle Strohman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Metzler</surname>
          </string-name>
          and
          <string-name>
            <given-names>W.B.</given-names>
            <surname>Croft</surname>
          </string-name>
          .
          <article-title>Indri: A language model-based search engine for complex queries</article-title>
          .
          <source>Proceedings of the International Conference on Intelligence Analysis</source>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>