<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Interactive Cross-Language Searching: phrases are better than terms for query formulation and re nement</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Fernando Lopez-Ostenero</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Julio Gonzalo</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anselmo Pen~as</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Felisa Verdejo</string-name>
        </contrib>
      </contrib-group>
      <abstract>
        <p>This paper summarizes the participation of the UNED group in the CLEF 2002 Interactive Track. We focused on interactive query formulation and re nement, comparing two approaches: a) a reference system that assists the user to provide adequate translations for terms in the query; and b) a proposed system that assists the user to formulate the query as a set of relevant phrases, and to select promising phrases in the documents to enhance the query. All collected evidence indicates that the phrasebased approach is preferable: the o cial F =0:8 measure is 65% better for the proposed system, and all users in our experiment preferred the phrase-based system as a simpler and faster way of searching.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
    </sec>
    <sec id="sec-2">
      <title>Experiment Design</title>
      <sec id="sec-2-1">
        <title>Our experiment consists of:</title>
        <p>Eight native Spanish speakers with null or very low English skills.</p>
        <p>The Spanish version of the four o cial iCLEF topics.</p>
        <p>The English CLEF document collection (LA Times 1994).</p>
        <p>A reference interactive cross-language search system based on assisted term translation
(System WORDS).</p>
        <p>A proposed system based on noun-phrase selections (System PHRASES).
The o cial iCLEF latin square to combine topics, searchers and systems into 32 di erent
searching sessions.</p>
        <p>The o cial iCLEF search procedure.</p>
        <p>In this section we describe the most relevant aspects of the above items.
2.1</p>
        <sec id="sec-2-1-1">
          <title>Reference system</title>
          <p>The reference system (WORDS) uses assisted query term translation and re nement all along
the search process:</p>
          <p>Initial query formulation. The system translates all content words in the iCLEF topic
using a bilingual dictionary, and displays possible English translations to the user. When
the user points to an English term, the system displays inverse translations into Spanish.
This information can be used by the searcher to decide which translations to keep and which
translations to discard before performing the rst search. Figure 1 illustrates this initial
step.</p>
          <p>A) Colour codes in the ranked list indicate already judged documents.</p>
          <p>B) Clicking on a Spanish term in the document takes the user to the source English keyword
matched.
Cross-Language search. The system performs a monolingual search of the LA Times
collection with the English terms selected by the user.</p>
          <p>Ranked document list. The ranked list of documents displays the (translated) title of the
document and a colour code to indicate whether each document has already been marked
as relevant, not relevant or unsure. Figure 2 A shows a retrieved ranked list.
Document selection. Instead of using Machine Translation to display the contents of
a document, the system displays a cross-language summary consisting on the translation
of all noun phrases in the body of the document, plus an MT (Systran Professional 3.0)
translation of the title. The user can select the document as relevant, mark the document
as non-relevant or unsure, or leave it unmarked.</p>
          <p>Query re nement by selection. When a Spanish term in a document translation
corresponds to an original English term already in the query, the user can point to the Spanish
term (highlighted); then the system points to the English query term, allowing for
deselection or selection of the English term (or some of its companion translations) or the
original Spanish term (then all translations are disabled). Figure 2 B illustrates this process.
Additional query re nement. Additionally, the user can also enter a single term at
any time along the search. Again, the system displays its possible translations into the
target language, along with their inverse translations, and permits individual selection and
de-selection of translations.
2.2</p>
        </sec>
        <sec id="sec-2-1-2">
          <title>Phrase-based searching</title>
          <p>Our proposed system uses noun phrasal information all along the Cross-Language assisted search
process:</p>
          <p>Initial query formulation. The system extracts noun phrases from the full iCLEF topic,
lters phrases with optimal translations, and displays the resulting set of phrases for user
selection.</p>
          <p>Cross-Language search. The system translates automatically the phrases selected by the
user, and performs a monolingual search in the document collection.</p>
          <p>Ranked document list. The ranked list is identical for both systems (see reference system
above).</p>
          <p>Document selection. Again, document selection is identical for both systems (see WORDS
system above).</p>
          <p>Query re nement by term suggestion. Optimally translated noun phrases in the
documents can be selected to enrich the original query. When a user clicks on a noun-phrase
in a document, the system automatically translates the noun-phrase and performs a new
monolingual search with the enlarged query, updating the list of ranked documents. This
process is illustrated in Figure 3.</p>
          <p>Additional query re nement. Identical in both systems (see system WORDS above).</p>
          <p>
            In order to achieve such functionalities, there is a pre-processing phase using shallow Natural
Language Processing techniques, which has been described in detail in [
            <xref ref-type="bibr" rid="ref3">3</xref>
            ]. The essential steps are:
Phrase indexing. Shallow parsing of two comparable collections (the CLEF Spanish and
English collections in this case) to obtain an index of all noun phrases in both languages and
their statistics.
          </p>
          <p>
            Phrase Alignment. Spanish and English noun phrases (up to three lemmas) are aligned for
translation equivalents using only a bilingual dictionary and statistical information about
phrases (see [
            <xref ref-type="bibr" rid="ref3">3</xref>
            ] for details). As a result of this step, aligned phrases receive a list of
candidate phrase translations in decreasing order of frequency. The result is a pseudo bilingual
dictionary of phrases that is used in all other translation steps. The statistics for the CLEF
English-Spanish collection can be seen in Table 1.
          </p>
          <p>A) Clicking on best-aligned phrases incorporates them to the query.</p>
          <p>B) Results of clicking the phrase \huelga de hambre en Guatemala". The phrase is added to the
query and a new ranked list is displayed.
Document translation. All noun phrases are extracted and translated. Translation is
performed in two steps: rst, maximal aligned subphrases are translated according to the
alignment information. Then, the rest of the terms are translated using an estimation that
selects target terms which overlap maximally with the set of related subphrases.
Only an additional step is required at searching time:</p>
        </sec>
      </sec>
      <sec id="sec-2-2">
        <title>Phrase set</title>
        <p>
          Spanish, 2 lemmas
Spanish, 3 lemmas
English, 2 lemmas
English, 3 lemmas
Query translation. All Spanish phrases selected by the user are replaced by: 1) the most
frequent aligned English phrase and 2) the second most frequent aligned phrase, if its frequency
reaches a threshold of 80% of the most frequent one. The INQUERY phrase operator is
used to formulate the nal monolingual query with all English phrases. The search is then
performed using the INQUERY search engine.
Every searcher performed 4 searches, one per iCLEF topic, alternating systems and topics
according to the iCLEF latin square design. The time for each search was 20 minutes, and the overall
time per searcher was around three hours, including training, questionnaires and searches (see [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]
for details). For every user/topic/system combination, the following data were collected:
The set of documents retrieved by the user, and the time at which every selection was made.
The ranked lists produced by the system in each query re nement.
        </p>
        <p>The questionnaires lled-in by the user.</p>
        <p>An observational study of the search sessions.
3
3.1</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Results</title>
      <p>O</p>
      <p>cial F =0:8 scores
The o cial iCLEF score for both systems is F =0:8, which combines precision and recall over the
set of manually retrieved documents, favoring precision. The results of our experiment can be
seen in Table 2. Our proposed system (PHRASES) improves the reference system (WORDS) by
a 65% increment. In a more detailed analysis per topic, there can be seen that topic 3 was too
di cult and did not contribute to the results (no searcher found relevant documents with any of
the systems). All the other topics receive a better F measure with the PHRASES system than
with the WORDS system. The di erence is not very high for topics 1 and 2, but it is very accused
for topic 4, which seemed easy for system PHRASES and very di cult for system WORDS.</p>
      <p>The most important expression in Topic 4 is \hunger strikes" (the description is \documents
will report any information relating to a hunger strike attempted in order to attract attention to
a cause"). Searchers using the PHRASES system easily select \huelga de hambre" (the
Spanish equivalent) from the displayed options, and the aligned translation, which is in turn \hunger
strikes", will retrieve useful documents. Searchers using the WORDS system, however, nd that
\huelga" (strike) and \hambre" (hunger) may receive many possible translations into English.
Looking at the average F , it is obvious that they do not manage to nd the appropriate
translations for both terms, failing to match relevant documents.
3.2</p>
      <sec id="sec-3-1">
        <title>Additional data</title>
        <p>Besides the o cial F result, there are many other sources of evidence to compare both systems:
additional quantitative data (time logs, ranked results for every query re nement), questionnaires</p>
        <sec id="sec-3-1-1">
          <title>System WORDS</title>
          <p>PHRASES
lled by participants, and the observation study of their searching sessions.
additional evidence here.</p>
        </sec>
        <sec id="sec-3-1-2">
          <title>We discuss that</title>
          <p>3.2.1</p>
          <p>Searching behavior across time
2
0</p>
          <p>8</p>
          <p>The plot of document selections against time in Figure 4 provides interesting evidence about
searching behavior: Searchers begin selecting documents much faster with the PHRASES system
(8 selections made in minute one) than with the WORDS system (the rst selection is made
in minute 3). The obvious explanation is that initial query formulation is very simple in the
PHRASES system (select a few phrases in the native language), and time consuming in the
WORDS system (examining many foreign-language candidate translations per term and selecting
them using inverse dictionary evidence).</p>
          <p>The initial precision (i.e. the precision after initial query formulation) is not higher for
system WORDS, in spite of the substantially higher time spent by searchers in the rst
query formulation. This con rms that a good initial selection of native-language phrases
can provide good initial translations of the topic terms.</p>
          <p>Searchers perform many more query re nements with the PHRASES system, con rming
that is easier to enhance the query using phrases selected from documents.</p>
          <p>Searchers obtain occasional precision gures of 1, .95, .90, etc. using the PHRASES system,
while the highest precision obtained with WORDS is .75 for topic 1, searcher 1.</p>
          <p>Overall, the additional quantitative data also supports our initial hypothesis.
3.2.3</p>
          <p>Analysis of questionnaires
The answers supplied by the eight searchers strongly support our hypothesis. All of them stated
that the PHRASES system was easier to learn, easier to user and better overall. They appreciated
both the ability of selecting phrases rather than individual terms, and most of them added that it
was much better not to see English terms at any moment. A general claim was that the dictionary
had too many acceptions for each term.
3.2.4</p>
          <p>Observational study
The careful observation of searchers' behavior is in agreement with the above results. Some points
are worth commenting:</p>
          <p>Users get discouraged with terms that have a lot of alternative translations in the WORDS
system. Even if the term is important for the topic, they try to avoid them.</p>
          <p>Selecting foreign-language terms is perceived as a hard task; when no relevant documents
are found after a few iterations, users get discouraged with the WORDS system.
The re nement loop works well for the PHRASES system once relevant documents begin to
appear. However, if relevant documents do not appear soon, the initial query re nements
are not obvious and both systems are equally hard.</p>
          <p>The automatic translation of phrases may be harmful when the aligned equivalent is
incorrect. This is the case of \busqueda de tesoros", which does not receive a correct translation
(\treasure hunting") and it is the most important concept for Topic 2. The problem is that
users do not detect that the translation is incorrect; they simply think that there is no match
in the collection for such concept.</p>
          <p>The di culty of topic 3 (campaigns against racism in Europe) comes from the fact that the
LA Times collection does not refer to any of such campaigns as generically \European", and
the overwhelming majority of documents about racism are US-centered.
4</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusions</title>
      <p>We have obtained multiple evidence (quantitative data, user opinions and observational study)
that a phrase-based approach to cross-language query formulation and re nement, without
userassisted translation, can be easier to use and more e ective than assisted term by term translation.
Of course, this is not an absolute conclusion, if only because our reference system o ered only crude
help for term-by-term translation (inverse translations using a bilingual dictionary). Probably a
more sophisticated translation assistance would stretch the di erences between approaches. But
we believe that a valid conclusion, in any case, is that language barriers are perceived as a strong
impediment by users, and it is worth studying strategies of Cross-Language Search Assistance
keeping a monolingual perspective from the user.
100
90
80 75
70
60
50
40
30
20
10
0</p>
    </sec>
    <sec id="sec-5">
      <title>Topic 1: Iterative rankings</title>
      <p>5
re nements</p>
      <p>4
re nements
0
1
2
3
4
5
Searcher 2
Searcher 7
0
1</p>
      <p>2
re nements
0
re nements
0
re nements
0
1</p>
      <p>2
re nements
System PHRASES
System WORDS</p>
    </sec>
    <sec id="sec-6">
      <title>Topic 2: iterative rankings</title>
      <p>5
re nements</p>
      <p>5
re nements
15
3
4
re nements
5
7
40</p>
      <p>2
re nements
Searcher 7</p>
      <p>5
re nements</p>
      <p>2
re nements</p>
      <p>4
re nements
System WORDS</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>G.</given-names>
            <surname>Erbach</surname>
          </string-name>
          , Gunter Neumann, and
          <string-name>
            <given-names>Hans</given-names>
            <surname>Uszkoreit</surname>
          </string-name>
          . Mulinex:
          <article-title>Multilingual intexing, navigation and editing extensions for the world-wide web</article-title>
          .
          <source>In AAAI Symposium on Cross-Language Text and Speech Retrieval</source>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J.</given-names>
            <surname>Gonzalo</surname>
          </string-name>
          and
          <string-name>
            <given-names>D.</given-names>
            <surname>Oard</surname>
          </string-name>
          .
          <article-title>The clef 2002 interactive track</article-title>
          .
          <source>In Proceedings CLEF</source>
          <year>2002</year>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>F.</given-names>
            <surname>Lopez-Ostenero</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Gonzalo</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.</surname>
          </string-name>
          <article-title>Pen~as, and</article-title>
          <string-name>
            <given-names>F.</given-names>
            <surname>Verdejo</surname>
          </string-name>
          .
          <article-title>Noun-phrase translations for crosslanguage document selection</article-title>
          .
          <source>In Proceedings of CLEF</source>
          <year>2001</year>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>W.</given-names>
            <surname>Ogden</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Cowie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Davis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Ludovic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Nirenburg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Molina-Salgado</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>N.</given-names>
            <surname>Sharples</surname>
          </string-name>
          . Keizai:
          <article-title>An interactive cross-language text retrieval system</article-title>
          .
          <source>In Proceeding of the MT SUMMIT VII Workshop on Machine Translation for Cross Language Information Retrieval</source>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>