<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>How do Named Entities Contribute to Retrieval Effectiveness?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Thomas Mandl</string-name>
          <email>mandl@uni-hildesheim.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Christa Womser-Hacker</string-name>
          <email>womser@uni-hildesheim.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Topic Properties</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Hildesheim, Information Science</institution>
          ,
          <addr-line>Marienburger Platz 22 D-31141 Hildesheim</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Named entities in topics are a major factor contributing to the quality of retrieval results. In this paper, we report on an analysis on the correlation between the number of named entities present in a topic and the retrieval quality achieved for these topics by retrieval systems within CLEF. We found that a medium positive correlation exists for German, English and Spanish topics. Furthermore, we analyze the effect of the document or target language on the retrieval quality.</p>
      </abstract>
      <kwd-group>
        <kwd>Results per</kwd>
        <kwd>Run and Topic</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>&lt;top&gt;
&lt;num&gt; C001
&lt;S-ti&lt;tloep&gt;
Arqui&lt;tneucmt&gt;urCa00e1n Berlín
&lt;S-de&lt;sSc-&gt;ti&lt;tloep&gt;&gt;
EnconAtrrqauri&lt;tndeuocmct&gt;umrCea0n0teon1sBseorblríen la
ea&lt;anrSrq-qgunueiai&lt;LE&lt;ntrtSonSeere-sc-rc&gt;cnodattdne&lt;EA&lt;lóurotsSnrS,nrrcrc-q-ia&gt;ua&gt;doutscmreniiooeestttbsnndcrelrto&gt;aceedBocrt&gt;eesulrmuoBlrerdseínaornltcrl.eoueaívsmnsnaegnsnootots,ebsroeesntrlsaaotbarne, la
Los daorcqumietnetcotsurraeleenvaBnetrelsínBt.erartlaínn,
partiecnulgaern,eqrusaioltb,ercestolbuareareleconosnBsretarsuglcoícsni.ón
arqui&lt;tSe-cntaórnri&gt;cos de Berlín o, en
partiLcousladro,cusmobernetolsa rreelcoenvsatnrtuecsción
tratan, en general, sobre los
rasgos arquitectónicos de Berlín</p>
      <sec id="sec-1-1">
        <title>Stop Stem</title>
        <p>worSdtsopmeSrtemIndex
worSdtsopmeSrtemIndex</p>
      </sec>
      <sec id="sec-1-2">
        <title>Index</title>
        <sec id="sec-1-2-1">
          <title>Wewigohrtdss BmRFer</title>
        </sec>
      </sec>
      <sec id="sec-1-3">
        <title>Weights BRF</title>
      </sec>
      <sec id="sec-1-4">
        <title>Weights BRF</title>
        <p>Systems</p>
        <p>
          Patterns
Our current analysis concentrates on named entities within the topics of CLEF. Named entities frequently occur
in CLEF as part of the topic formulation. Table 1 gives an overview.
The large number of named entities in the topic set shows that they are a subject worth studying. The large
number may be due to the fact that the document corpus for CLEF consists of newspaper texts. We can also
observe an increase of named entities per topic in 2002 compared to 2001. Because of the effect of named
entities on retrieval performance
          <xref ref-type="bibr" rid="ref3 ref4 ref5 ref6 ref8">(Mandl &amp; Womser-Hacker 2004c)</xref>
          , the number of named entities needs to be
carefully monitored. Table 2 shows how the named entities are distributed over groups with different numbers of
named entities and shows the tasks analyzed in this paper.
In a study presented at CLEF in 2003, we showed a correlation between the number of named entities present in
topics and the systems’ performance for these topics
          <xref ref-type="bibr" rid="ref3 ref4 ref5 ref6 ref8">(Mandl &amp; Womser-Hacker 2004b)</xref>
          . In this paper, we extend
the analysis to Spanish and monolingual tasks. In our earlier analysis, the relation was shown for English and
German. Including Spanish will show, whether this effect can be revealed for another topic language. By
including monolingual tasks, we may be able to compare the strength of the effect between cross- and
monolingual retrieval tasks.
        </p>
        <p>
          Named entities were intellectually assessed according to the schema of
          <xref ref-type="bibr" rid="ref9">Sekine et al. 2002</xref>
          . The performance of
the systems was extracted from the CLEF proceedings. The average precision for a topic is calculated as the
average precision of all systems for a individual topic. From the average precision for a topic, we can calculate
the average of all topics which contain n named entities. Figure 2 and 3 show the average precision for topics
with n named entities for tasks in CLEF 3 and CLEF 4.
0,6
n
iso0,5
i
c
e
rP0,4
e
g
a
rev0,3
A
0,2
0,1
0
0.8
0.7
0.6
ion 0.5
s
i
c
e
rp 0.4
e
g
a
rve 0.3
A
0.2
0.1
0
0
Monolingual English
Monolingual German
Monolingual Spanisch
Bilingual Topic Language
German
Multilingual Spanisch
Multilingual English
0
1
2
        </p>
        <p>3 4
Number of named entities
5
6</p>
        <p>7
In figure 2 and 3 we can observe that monolingual tasks generally result in higher average precision than
crosslingual tasks. The precision tends to be better when more named entities are present.</p>
        <sec id="sec-1-4-1">
          <title>The relation previously observed for German and English can also be seen for Spanish.</title>
          <p>2</p>
          <p>4
N um ber of n am ed en tities
6
8</p>
          <p>M onolingual G erm an
M onolingual S panisch
B ilingual Topic Language
E nglish
M ultilingual E nglish
M ultilingual G erm an
We also calculate the correlation between the number of named entities and the average precision per topic for
each of the tasks. The results are presented in table 3 and 4.
The systems tested at CLEF perform differently well for topics with different numbers of named entities.
Although proper names make topics easier in general and for almost all runs, the performance of systems varies
within the three classes of topics based on the number of named entities. We distinguished three classes of
topics, (a) the first class with no proper names called none, (b) the second class with one and two named entities
called few and (c) one class with three or more named entities called lots. The patterns of the systems are
strikingly different for the three classes. As a consequence, there seems to be potential to improve system by
fusion based on the number of named entities in a topic. Many systems already apply fusion techniques.
We propose a simple fusion rule. First, the number of named entities is determined for each topic. Subsequently,
this topic is channeled to the system with the best performance for this named entity class. The best system is a
combination of at most three runs. Each category of topics is answered by the optimal system within a group of
systems for that number of named entities. The groups were selected from the original CLEF ranking of the runs
in one task. We used a window of five runs. That means, five neighboring runs by systems which perform
similarly well overall are grouped and fused by our approach. Table 5 shows the improvement by the fusion
based on the optimal selection of a system for each category of topics.</p>
          <p>The highest levels of improvement are achieved for the topic language English. For 2002, we observe the highest
improvement of 10% for the bilingual runs.</p>
          <p>CLEF
year
2001
2001
2002
2002
2003
2003
2003</p>
        </sec>
        <sec id="sec-1-4-2">
          <title>Run type</title>
        </sec>
        <sec id="sec-1-4-3">
          <title>Bilingual</title>
        </sec>
        <sec id="sec-1-4-4">
          <title>Multilingual</title>
        </sec>
        <sec id="sec-1-4-5">
          <title>Bilingual</title>
        </sec>
        <sec id="sec-1-4-6">
          <title>Multilingual</title>
        </sec>
        <sec id="sec-1-4-7">
          <title>Bilingual</title>
        </sec>
        <sec id="sec-1-4-8">
          <title>Bilingual</title>
        </sec>
        <sec id="sec-1-4-9">
          <title>Multilingual</title>
          <p>This approach regards the systems as black boxes and requires no knowledge about the treatment of named
entities within the systems. Considering the linguistic processing within the systems might be even more
rewarding. Potentially, further analysis might reveal which approaches, which components and which parameters
are especially suited for topics with and without named entities.</p>
          <p>
            This analysis shows that the performance of retrieval systems can be optimized by channeling topics to the
systems best appropriated for topics without, with one or two and with three and more names. Certainly, the
application of this fusion on the past results approach is artificial and the number of topics in each subgroup is
not sufficient for a statistically reliable result
            <xref ref-type="bibr" rid="ref10 ref11">(Voorhees &amp; Buckley 2002)</xref>
            . Furthermore, in our study, the number
of named entities was determined intellectually. However, this mechanism can be easily implemented by using
an automatic named entity recognizer. We intend to apply this fusion technique in an upcoming CLEF task as
one element of the fusion framework MIMOR
            <xref ref-type="bibr" rid="ref3 ref4 ref5 ref6 ref8">(Mandl &amp; Womser-Hacker 2004a, Hackl et al 2004)</xref>
            .
          </p>
          <p>Named Entities in Topics and Retrieval Performance for Target Languages
So far, our studies have been focused to the language of the initial topic which participants used for their
retrieval efforts. Additionally, we have analyzed the effect of the target or document language. In this case, we
cannot consider the multilingual tasks where there are several target languages. The monolingual tasks have
already been analyzed in section 2 and are also considered here. Therefore, this analysis is targeted at bilingual
retrieval tasks. We grouped all bilingual runs with English, German and Spanish as document language. The
correlation between the number of named entities in the topics and the average precision of all systems for that
topic was calculated. The average precision may be interpreted as the difficulty of the topic. The following table
shows the results of this analysis.
In this paper a strong relation between named entities in topics and the performance of retrieval systems for these
topics was confirmed. This finding allows us to formulate a hint for searchers and users of retrieval systems:
Whenever you can think of a name related to your retrieval problem, consider including it in the query.
In addition, our results encourage further analysis of other topic features. We are especially considering a part of
speech (POS) analysis of the CLEF topics.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>Acknowledgements</title>
      <p>We would like to thank Martin Braschler for providing the crucial data for our study. Furthermore, we
acknowledge the work of several students from the University of Hildesheim who contributed to this analysis as
part of their course work.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Braschler</surname>
          </string-name>
          , Martin; Peters,
          <string-name>
            <surname>Carol</surname>
          </string-name>
          (
          <year>2004</year>
          ):
          <article-title>Cross-Language Evaluation Forum: Objectives, Results, Achievements</article-title>
          . In: Information Retrieval. no.
          <issue>7</issue>
          . pp.
          <fpage>7</fpage>
          -
          <lpage>31</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Fuhr</surname>
          </string-name>
          ,
          <string-name>
            <surname>Norbert</surname>
          </string-name>
          (
          <year>2003</year>
          )
          <article-title>(ed</article-title>
          .):
          <article-title>Initiative for the Evaluation of XML Retrieval (INEX) : INEX 2003 Workshop Proceedings</article-title>
          , Dagstuhl, Germany, December 15-
          <issue>17</issue>
          ,
          <year>2003</year>
          . http://purl.oclc.org/NET/duett-07012004-093151
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Hackl</surname>
          </string-name>
          , René; Kölle, Ralph; Mandl, Thomas; Ploedt, Alexandra; Scheufen,
          <string-name>
            <surname>Jan-Hendrik;</surname>
          </string-name>
          Womser-Hacker,
          <source>Christa</source>
          (
          <year>2004</year>
          )
          <article-title>: Multilingual Retrieval Experiments with MIMOR at the University of Hildesheim</article-title>
          . To appear in: Peters, Carol; Braschler, Martin; Gonzalo, Julio; Kluck, Michael (eds.):
          <article-title>Evaluation of CrossLanguage Information Retrieval Systems</article-title>
          .
          <source>Proceedings of the CLEF 2003 Workshop</source>
          . Berlin et al.: Springer [Lecture Notes in Computer Science]
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Mandl</surname>
          </string-name>
          , Thomas; Womser-Hacker,
          <article-title>Christa (2004a): A Framework for long-term Learning of Topical User Preferences in Information Retrieval</article-title>
          . In: New Library World. vol.
          <volume>105</volume>
          (
          <issue>5</issue>
          /6). pp.
          <fpage>184</fpage>
          -
          <lpage>195</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Mandl</surname>
          </string-name>
          , Thomas; Womser-Hacker,
          <article-title>Christa (2004b): Analysis of Topic Features in Cross-Language Information Retrieval Evaluation</article-title>
          .
          <source>In: 4th International Conference on Language Resources</source>
          and
          <string-name>
            <surname>Evaluation (LREC) Lisbon</surname>
          </string-name>
          , Portugal, May
          <volume>24</volume>
          -30. Workshop Lessons Learned from Evaluation:
          <article-title>Towards Transparency and Integration in Cross-Lingual Information Retrieval (LECLIQ)</article-title>
          . pp.
          <fpage>17</fpage>
          -
          <lpage>19</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Mandl</surname>
          </string-name>
          , Thomas; Womser-Hacker,
          <article-title>Christa (2004c): Proper Names in the Multilingual CLEF Topic Set</article-title>
          . To appear in: Peters, Carol; Braschler, Martin; Gonzalo, Julio; Kluck, Michael (eds.):
          <article-title>Evaluation of CrossLanguage Information Retrieval Systems</article-title>
          .
          <source>Proceedings of the CLEF 2003 Workshop</source>
          . Berlin et al.: Springer [Lecture Notes in Computer Science] Preprint: http://clef.iei.pi.cnr.it:2002/2003/WN_web/53.pdf
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Oyama</surname>
          </string-name>
          , Keizo; Ishida, Emi; Kando,
          <string-name>
            <surname>Noriko</surname>
          </string-name>
          (
          <year>2003</year>
          )
          <article-title>(eds</article-title>
          .):
          <source>NTCIR Workshop3 Proceedings of the Third NTCIR Workshop on research in information Retrieval</source>
          ,
          <article-title>Automatic Text Summarization and Question Answering (September2001-October2002</article-title>
          ) http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings3/index.html
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Schneider</surname>
          </string-name>
          , René; Mandl, Thomas; Womser-Hacker,
          <source>Christa</source>
          (
          <year>2004</year>
          ): Workshop LECLIQ:
          <article-title>Lessons Learned from Evaluation: Towards Integration and Transparency in Cross-Lingual Information Retrieval with a special Focus on Quality Gates</article-title>
          .
          <source>In: 4th International Conference on Language Resources</source>
          and
          <string-name>
            <surname>Evaluation (LREC) Lisbon</surname>
          </string-name>
          , Portugal, May
          <volume>24</volume>
          -30. Workshop Lessons Learned from Evaluation:
          <article-title>Towards Transparency and Integration in Cross-Lingual Information Retrieval (LECLIQ)</article-title>
          . pp.
          <fpage>1</fpage>
          -
          <lpage>4</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Sekine</surname>
          </string-name>
          , Satoshi; Sudo, Kiyoshi; Nobata,
          <string-name>
            <surname>Chikashi</surname>
          </string-name>
          (
          <year>2002</year>
          )
          <article-title>: Extended Named Entity Hierarchy</article-title>
          .
          <source>In: Proceedings of Third International Conference on Language Resources and Evaluation (LREC</source>
          <year>2002</year>
          )
          <article-title>; Las Palmas</article-title>
          , Canary Islands, Spain.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Voorhees</surname>
          </string-name>
          , Ellen; Buckland,
          <string-name>
            <surname>Lori</surname>
          </string-name>
          (
          <year>2002</year>
          )
          <article-title>(eds</article-title>
          .):
          <source>The Eleventh Text Retrieval Conference (TREC</source>
          <year>2002</year>
          ).
          <source>NIST Special Publication 500-251. National Institute of Standards and Technology. Gaithersburg, Maryland. November</source>
          <year>2002</year>
          . http://trec.nist.gov/pubs/trec11/t11_proceedings.html
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Voorhees</surname>
          </string-name>
          , Ellen; Buckley,
          <string-name>
            <surname>Chris</surname>
          </string-name>
          (
          <year>2002</year>
          ):
          <article-title>The Effect of Topic Set Size on Retrieval Experiment Error</article-title>
          .
          <source>In: Proc Annual Intl. ACM Conference on Research and Development in Information Retrieval (SIGIR '02) Tampere</source>
          , Finland. pp.
          <fpage>316</fpage>
          -
          <lpage>323</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Womser-Hacker</surname>
          </string-name>
          ,
          <source>Christa</source>
          (
          <year>2002</year>
          )
          <article-title>: Multilingual Topic Generation within the CLEF 2001 Experiments</article-title>
          . In: Peters, Carol; Braschler, Martin; Gonzalo, Julio; Kluck, Michael (eds.):
          <source>Evaluation of Cross-Language Information Retrieval Systems</source>
          . Springer [LNCS 2406] pp.
          <fpage>389</fpage>
          -
          <lpage>393</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>