<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Review of User-centred Information Retrieval Tasks</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>CCS Concepts</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Ali Hosseinzadeh vahid</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Roghaiyeh Gachpaz hamed</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>Although, most of the recent studies within the IR domain tend to target how users behave while addressing their information needs. However, even though the collection of document sets and user pro ling is a top research problem, it holds inherent di culties for the establishment of a comparative task to evaluate various approaches. Also nding a comprehensive metrics to evaluate di erent a ects of various aspects, that play a signi cant role in satisfaction of users of personalized IR systems, seems to be another noticeable issue. With the review of related tasks in well-known IR evaluation communities, this paper will discuss the way of gathering users pro les and objects of their interest in last 5 years of IR evaluation campaigns.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>
        Evaluation tasks are well-known and innovative ways of
providing the infrastructure necessary for stimulating,
demonstrating and evaluating substantial improvements of
information retrieval methodologies. \Future information retrieval
systems must anticipate user needs and respond with
information appropriate to the current context."[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] These
systems are challenged in three stages: (1) How to model the
information about the user, task, and context; (2) How to
nd and acquire "objects of interest" and (3) how to
exploit this information in order to retrieve the most relevant
results, which satisfy the users information needs. In
recent years di erent IR evaluation campaigns have focused
on the development of a variety of task for the exchange
of research ideas on user-centred IR systems. This paper
investigates and compares the objectives, approaches and
impacts of such tasks with the hope to nd more
appropriate approaches to evaluate improved user-dependent and
context-aware IR systems.
2.
      </p>
    </sec>
    <sec id="sec-2">
      <title>USER-CENTRED IR TASKS</title>
      <p>Considering the daily-growing tendency to user-centred
IR systems, evaluation campaigns are promoted to the
exploration of new evaluation methodologies for such systems.
Nevertheless, due to the complexity of the evaluation of
personalized IR systems and the involved potential costs of
evaluation, some IR evaluation forums such as NII Testbeds
and Community for Information access Research (NTCIR)1
and Forum for Information Retrieval Evaluation (FIRE)2
have not focused on user-centred tasks yet.So, this section
describes a user-oriented and context-based task approach
which has been provided in other IR evaluation
communities.
2.1</p>
    </sec>
    <sec id="sec-3">
      <title>Contextual Suggestion Task of TREC</title>
      <p>Starting in 1992, the Text REtrieval Conference (TREC)3,
co-sponsored by the National Institute of Standards and
Technology (NIST) and U.S. Department of Defense, is the
most popular IR evaluation campaign which has also
provided the rst large-scale evaluations of cross-lingual and
multilingual document retrieval tasks. TREC has also
introduced evaluations for open-domain question answering,
content-based retrieval of digital video and retrieval of
recordings of speech.</p>
      <p>
        The goal of the contextual suggestion track [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] is to
evaluate the search techniques for complex information needs of
users with respect to context and their point of interest.
Introduced in TREC 2012, this track investigates to develop a
system that is able to make suggestions of sites with the goal
to explore an unknown city based upon the users personal
interests in the users home city. A set of user preferences,
example suggestions and a set of contexts are given to
participants as inputs: Constant number of manually gathered
suggestions consist of a title, a short description and a
website URL of di erent attractions within speci c, prede ned
regions have been recommended to a user as something they
nd interesting. Pro les are built by conducting a survey
advertised to crowdsourcing workers to indicate their
preferences to the set of above mentioned example suggestions.
1http://research.nii.ac.jp/ntcir/index-en.html
2http:// re.irsi.res.in/
3http://trec.nist.gov/
These assessors asked to give two ratings for each
attraction: 1) How interesting the suggested attraction seemed to
them based on its description and 2) based on its website,
respectively. Contexts describe which city a user is currently
located in. There were 50 cities chosen randomly from the
list of primary cities in metropolitan areas in the United
States from Wikipedia. Each submitted run consists of up
to 50 ranked suggestions for each pro le-context pair, with
formatting similar to that of the sample suggestions.
Participants have been able to gather suggestions from either
the open web, ClueWeb124, or xed set of documents.
Precision at Rank 5 (P@5), Mean Reciprocal Rank (MRR) and
a modi ed version of Time-Biased Gain (TBG)[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] are used
to rank runs of participants.
2.2
      </p>
    </sec>
    <sec id="sec-4">
      <title>Social Book Search Task of CLEF</title>
      <p>Conference and Labs of the Evaluation Forum, formerly
known as Cross-Language Evaluation Forum (CLEF)5
promotes development of information access systems with an
emphasis on multilingual and multimodal information with
various levels of structure.</p>
      <p>
        The Social Book Search [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] investigates Evaluation
methodologies for book search task using a combination of various
aspects of retrieval and recommendations dealing with
professional and user-generated meta-data.
      </p>
      <p>As a continuation of the INEX SBS Track that ran from
2011 up to 2014, the task is targeted to returning a list of
recommended books in reply to a user request posted on
a LibraryThing 6(LT) discussion forums by matching the
user's information need. A set of book requests and a set
of user pro les have been assumed as inputs of the task and
a submitted ranked list of recommended books has been
evaluated as the result of participant's system.</p>
      <p>
        The test collection [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] consists of 2.8 million book records
from Amazon, extended with social metadata from LT. Each
book record is an XML le with elds like isbn, title, author,
publisher, dimensions, numberofpages and publicationdate.
The social metadata from Amazon and LT is stored in the
tag, rating, and review elds. To improve the quality of the
meta-data, they are extended with library catalogue records
from the Library of Congress (LoC) and the British Library
(BL).
      </p>
      <p>The topic set is focused on requests which are provided as
a narrative description of the information need of a user and
one or more example books to guide the suggestions. Users
typically describe what they are looking for, give examples
of what they like and do not like, indicate which books they
already know and ask other members for recommendations.
There are also annotated elds by crowdsourcing workers
to indicate whether the example book had been read by
requester and to judge his/her attitude about the book.</p>
      <p>The books suggested by members, which are directly linked
to their corresponding records on Amazon, have been used
as initial relevance judgements for evaluation of participated
systems in the Suggestion Track.</p>
      <p>The rich user pro les of the topic creators and other LT
users have been used as valuable resources of User pro les
and personal catalogues. These pro les generally contain
information on which books they have in their personal
catalogue on LT, which ratings and tags they assigned to them
and a social network of friendship relations, interesting
library relations and group memberships.</p>
      <p>The o cial evaluation measure for this task is nDCG@10.
It takes graded relevance values into account and is designed
for evaluation based on the top retrieved results. In addition,
P@10, MAP and MRR scores will also be reported, with the
evaluation results.
3.</p>
    </sec>
    <sec id="sec-5">
      <title>CONCLUSION</title>
      <p>Although de ning tasks and scenarios for evaluation
purposes in IR domain is one of the most common ways for the
exploration of new methodologies and innovative ways in
using and discussing experimental data, it has to be noted that
the comparison of approaches, the exchange of ideas and
transfer of knowledge has been considered a valuable
contribution to evaluation tasks during last decades. However,
the tendency of modern user-centred IR system for taking
user preferences and interests into account through
information seeking process also has changed the identi cation,
setting and evaluation of shared tasks. This paper reviewed
and compared existing user tasks and described their way of
collecting task resources, methods and metrics with hope to
help improving them with combining/ summarising them to
propose new user centered tasks in future.
4.</p>
    </sec>
    <sec id="sec-6">
      <title>ACKNOWLEDGMENTS</title>
      <p>"The ADAPT Centre for Digital Content Technology is
funded under the SFI Research Centres Programme (Grant
13/RC/2106) and is co-funded under the European Regional
Development Fund."
5.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J.</given-names>
            <surname>Allan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Croft</surname>
          </string-name>
          ,
          <string-name>
            <surname>A</surname>
          </string-name>
          . Mo at, and
          <string-name>
            <given-names>M.</given-names>
            <surname>Sanderson</surname>
          </string-name>
          . Frontiers, challenges, and
          <article-title>opportunities for information retrieval: Report from swirl 2012 the second strategic workshop on information retrieval in lorne</article-title>
          .
          <source>In ACM SIGIR Forum</source>
          , volume
          <volume>46</volume>
          , pages
          <fpage>2</fpage>
          <lpage>{</lpage>
          32. ACM,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>T.</given-names>
            <surname>Beckers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Fuhr</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Pharo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Nordlie</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K. N.</given-names>
            <surname>Fachry</surname>
          </string-name>
          .
          <article-title>Overview and results of the inex 2009 interactive track</article-title>
          .
          <source>In Research and Advanced Technology for Digital Libraries</source>
          , pages
          <volume>409</volume>
          {
          <fpage>412</fpage>
          . Springer,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Dean-Hall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. L.</given-names>
            <surname>Clarke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kamps</surname>
          </string-name>
          , P. Thomas,
          <string-name>
            <given-names>N.</given-names>
            <surname>Simone</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E.</given-names>
            <surname>Voorhees</surname>
          </string-name>
          .
          <article-title>Overview of the trec 2013 contextual suggestion track</article-title>
          .
          <source>Technical report, DTIC Document</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M.</given-names>
            <surname>Koolen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Bogers</surname>
          </string-name>
          , M. Gade, M. Hall,
          <string-name>
            <given-names>H.</given-names>
            <surname>Huurdeman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kamps</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Skov</surname>
          </string-name>
          , E. Toms, and
          <string-name>
            <given-names>D.</given-names>
            <surname>Walsh</surname>
          </string-name>
          .
          <article-title>Overview of the clef 2015 social book search lab</article-title>
          .
          <source>In Experimental IR Meets Multilinguality, Multimodality, and Interaction</source>
          , pages
          <volume>545</volume>
          {
          <fpage>564</fpage>
          . Springer,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M. D.</given-names>
            <surname>Smucker</surname>
          </string-name>
          and
          <string-name>
            <given-names>C. L.</given-names>
            <surname>Clarke</surname>
          </string-name>
          .
          <article-title>Time-based calibration of e ectiveness measures</article-title>
          .
          <source>In Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval</source>
          , pages
          <volume>95</volume>
          {
          <fpage>104</fpage>
          . ACM,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>