<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Towards User-Centered Retrieval Algorithms</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Manuel J. Fonseca</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science and Engineering INESC-ID/IST/Technical University of Lisbon R. Alves Redol</institution>
          ,
          <addr-line>9, 1000-029 Lisboa</addr-line>
          ,
          <country country="PT">Portugal</country>
        </aff>
      </contrib-group>
      <fpage>2</fpage>
      <lpage>4</lpage>
      <abstract>
        <p>Nowadays almost all retrieval algorithms (for text, images, drawings, etc.) are mainly concerned in achieving good system-centered measures, such as precision and recall. However, these systems are used by users, who try to achieve goals through the execution of tasks. To better satisfy the users' needs we must involve them in the development process of the retrieval systems. In this paper, we argue that a user-centered approach, where users are included in the development cycle of the overall retrieval system, can lead to improved retrieval algorithms and also to a better user satisfaction while using the system.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>The majority of the retrieval algorithms, whether they
are for text, images, drawings, 3D objects, audio, video, etc.,
are mainly interested in performing well for system-centered
measures, like for instance precision and recall. However,
these systems are used by users who want to perform
speci c tasks and achieve speci c goals. We can develop a good
retrieval system, that performs well against a prede ned
ground truth, but when we delivery it to users they may
Copyright c 2011 for the individual papers by the papers’ authors.
Copying permitted only for private and academic purposes. This volume is
published and copyrighted by the editors of euroHCIR2011.</p>
      <p>EuroHCIR ’11 Newcastle, UK
.
not be able to nd what they want or they may not even be
able to submit a query to the system.</p>
      <p>For illustration purposes let us consider the following
hypothetic scenario: \We developed a system for retrieving
generic complex vector drawings, like for instance
technical drawings, architectural plants or clipart drawings. We
evaluated it using query-by-example and a set of prede ned
drawings, achieving a good precision and recall measure.
Afterwards, when we delivered the system to users, we noticed
that they were not able to use it, because they could not nd
the ( rst) drawing that they must use as query to nd the
desired drawing. Moreover, users do not want to search for
the complete drawing, but only by a subpart of the drawing."</p>
      <p>This scenario could be avoided if before we developed the
retrieval system we asked users what were their needs, what
did they want to perform on the system and how they want
to do it. To collect all this information we need to apply
a user-centered approach where users are involved in the
development of the retrieval system and algorithms.</p>
      <p>In this paper we defend an user-centered approach as a
way to create better retrieval algorithms and improve the
overall retrieval system. We start by shortly describe the
user-centered approach and the iterative cycle used in the
user interface design. In Section 3 we describe our
application of the user-centered approach in the development of
retrieval algorithms. Finally, we present some conclusions.
2.</p>
    </sec>
    <sec id="sec-2">
      <title>USER-CENTERED DESIGN</title>
      <p>The user-centered design (UCD) is a design methodology,
where the needs, skills and limitations of the users are taken
into account during all stages of the development of the
system. The key premise of the user-centered design is that
the active involvement of the users in the development
process as well as in the evaluation of the interactive products
can lead to well-designed systems that best meet the desired
usability goals. These systems will take advantage of users
skills, will be relevant to their work and activities, and will
help them rather than constrain their actions.</p>
      <p>
        One of the principles from the UCD [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] states that we
rst need to identify who the users will be (pro le, skills
limitations, etc.) and what tasks they perform and/or wish
to perform. The second principle mentions that the systems
should be exposed to users in the early stages of development
to collect feedback from them. Finally, the third principle is
iterative design. The results and feedback from user testing
should be used to x and improve the system. The UCD
assumes an iterative cycle with identi cation of the users'
needs, design of the solution and evaluation, repeated as
often as necessary, as depicted in Figure 1.
      </p>
      <p>!"#$%&amp;'(%)&amp;"*%</p>
      <p>+'&amp;,-"."%</p>
    </sec>
    <sec id="sec-3">
      <title>USER-CENTERED RETRIEVAL</title>
      <p>Typically when we want to develop a new retrieval
approach, we look at the media to retrieve (text, audio, video,
drawings, images, etc.), identify the features that better
describe the media, create a matching algorithm and nally
we compute precision and recall. Although this
methodology allows us to create retrieval systems, we believe that by
including the user in the development cycle will allow us to
deliver better and more usable retrieval systems, that will
allow users to achieve their goals and not only systems that
have a good precision and recall performance.</p>
      <p>Moreover, we should not develop retrieval systems, and
that includes descriptor computation, matching algorithms
and presentation of the results, without rst identifying a
set of user needs and functional requirements ( rst step in
the user-centered design). We need to know our users, their
skills, their background, their pro le. We must identify their
needs and requirements, their goals and how they achieve
them. In summary, we need to do an user and task analysis
before we start developing our retrieval system. User and
task analysis should not only in uence the design of the
user interface, but also the design of the retrieval approach
or algorithm.</p>
      <p>For instance, users could use various strategies to perform
a search in a drawing retrieval system. They could use a
drawing that they already have, in a le, to search for
similar drawings using query-by-example, or they could draw
a sketch of the drawing that they want to nd. As we can
see, the retrieval solution (feature extraction, indexing and
matching algorithms) will be di erent on each case. While
in the rst case we only need to compare two drawings of
the same complexity and with the same characteristics (sets
of lines and polygons), in the second case we need to
compare complex drawings with sketches (typically simpler and
with less elements). Thus, the way users perform the task
to achieve their goal in uence the retrieval approach that
we should develop.</p>
      <p>After developing the retrieval solution based on the user
requirements, we should evaluate the retrieval system, using
not only system-centered measures, but also user-centered
measures, such as time to complete tasks, error rates,
satisfaction, etc. As in the user-centered design of interactive
systems, results from the evaluation of the retrieval system
(system and user centered measures) should be used to
improve the system and to re ne the user and functional
requirements of the retrieval system.</p>
      <p>One of the things that we observed in one evaluation
session with users, was that users did not care about where
in the order of retrieval the intended drawing appears, the
important fact being that it was there. One of the users
produced this comment \It [the system] found it [the drawing]!
That is what counts!" However, when we evaluate retrieval
systems, the majority of the existing measures and ground
truth datasets privilege precision. Of course this
systemcentered evaluation is important, but we should also take
into account the users perspective, where they privilege
recall.
3.1</p>
    </sec>
    <sec id="sec-4">
      <title>An Example</title>
      <p>
        Involving the users can a ect the way we develop the
retrieval algorithms. In recent years we developed a generic
approach for complex vector drawing retrieval, based on the
topology and geometry of the elements present in the
drawing. These two features were used to describe the content
of the drawings, and during matching, we rst compare the
drawings using topology and them we compare the
geometry of those with similar topologies, giving the same weigh
to both features (for more details see [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]). This generic
retrieval approach was used to develop one system for
retrieving technical drawings [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and another for retrieving clipart
drawings [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>Before we developed this solution and the two retrieval
systems, we performed user and task analysis to understand
how users wanted to make queries to this type of systems.
We notice that they prefer to draw sketches of the drawing
that they were looking for than to submit an existing
drawing to perform a query-by-example. Moreover, most of the
times they do not have a drawing similar to the one that
they are looking for.</p>
      <p>
        The two systems were both evaluated with users, and from
those evaluations we observed that the way users search for
technical drawings was di erent from the way they search
for clipart drawings [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. While in the case of technical
drawings users draw more complete sketches with several visual
elements, and consequently de ning a richer topological
con
      </p>
      <p>guration, as illustrated in Figure 2; for clipart drawings,
users produced simpler sketches, with fewer elements and
with a poorer topological description (see Figure 3).</p>
      <p>
        Due to this observation during tests with users, we re ne
our retrieval algorithm for retrieving clipart drawings [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ],
putting more emphasis on the geometry than on topology.
With this change we were able to achieve a better precision
and recall measure for clipart drawings, and we adapted our
retrieval system to the users' way of sketching queries.
3.2
      </p>
    </sec>
    <sec id="sec-5">
      <title>Discussion</title>
      <p>We can not develop our retrieval algorithms without
involving our users into the development cycle. As in the
design of interactive systems, also in the development of
retrieval systems we must involve the users.</p>
      <p>They must be involved in the initial phase, so we can
understand how they search for the information, what are
their knowledge, what are their limitations and what is their
pro le. With this we are able to identify users needs and
functional requirements.</p>
      <p>Later on, during the development of the algorithms we
should take into account this input and adapt the algorithms
to provide \good results" for "our" users, and not for the users
in general, or for the system.</p>
      <p>Finally, during the evaluation stage, besides computing
the traditional system-centered measures, for a set of datasets
de ned as ground truth, we should also involve users in the
evaluation to collect quantitative and qualitative measures.
Information gather during evaluation should be used to
improve the retrieval algorithms and the overall retrieval
system, in the next iteration of the iterative cycle of the
usercentered approach.</p>
    </sec>
    <sec id="sec-6">
      <title>CONCLUSIONS</title>
      <p>In this paper we defended a user-centered approach for
the development of retrieval systems. As in the case of user
interfaces design, also for retrieval systems is important to
know our users, adapt the algorithms to them, and involve
the users in the evaluation of the system.</p>
      <p>We believe, and we had con rmed, that the involvement
of the user in the development cycle of retrieval systems can
conduct to better systems that satisfy users needs and are
more adapted to them.</p>
    </sec>
    <sec id="sec-7">
      <title>ACKNOWLEDGMENTS</title>
      <p>This work was supported by FCT through the PIDDAC
Program funds (INESC-ID multiannual funding) and the
Crush project, PTDC/EIA-EIA/108077/2008.
5.
6.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Fonseca</surname>
          </string-name>
          .
          <article-title>Sketch-Based Retrieval in Large Sets of Drawings</article-title>
          .
          <source>PhD thesis</source>
          , Instituto Superior Tecnico / Technical University of Lisbon,
          <year>July 2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Fonseca</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Barroso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Ribeiro</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Jorge</surname>
          </string-name>
          .
          <article-title>Retrieving clipart images by content</article-title>
          .
          <source>In Proceedings of the 3rd International Conference on Image and Video Retrieval (CIVR'04)</source>
          , volume
          <volume>3115</volume>
          of Lecture Notes in Computer Science, pages
          <volume>500</volume>
          {
          <fpage>507</fpage>
          . Springer-Verlag, Dublin, Ireland,
          <year>July 2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Fonseca</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ferreira</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Jorge</surname>
          </string-name>
          .
          <article-title>Content-based retrieval of technical drawings</article-title>
          .
          <source>International Journal of Computer Applications in Technology (IJCAT)</source>
          ,
          <volume>23</volume>
          (
          <issue>2</issue>
          {4):
          <volume>86</volume>
          {
          <fpage>100</fpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>J. D.</given-names>
            <surname>Gould</surname>
          </string-name>
          and
          <string-name>
            <given-names>C.</given-names>
            <surname>Lewis</surname>
          </string-name>
          .
          <article-title>Designing for usability: key principles and what designers think</article-title>
          .
          <source>Commun. ACM</source>
          ,
          <volume>28</volume>
          (
          <issue>3</issue>
          ):
          <volume>300</volume>
          {
          <fpage>311</fpage>
          ,
          <year>1985</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>P.</given-names>
            <surname>Sousa</surname>
          </string-name>
          and
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Fonseca</surname>
          </string-name>
          .
          <article-title>Geometric matching for clip-art drawing retrieval</article-title>
          .
          <source>Journal of Visual Communication and Image Representation (JVCI)</source>
          ,
          <volume>20</volume>
          (
          <issue>2</issue>
          ):
          <volume>71</volume>
          {
          <fpage>83</fpage>
          ,
          <year>February 2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>P.</given-names>
            <surname>Sousa</surname>
          </string-name>
          and
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Fonseca</surname>
          </string-name>
          .
          <article-title>Sketch-based retrieval of drawings using spatial proximity</article-title>
          .
          <source>Journal of Visual Languages and Computing (JVLC)</source>
          ,
          <volume>21</volume>
          (
          <issue>2</issue>
          ):
          <volume>69</volume>
          {
          <fpage>80</fpage>
          ,
          <string-name>
            <surname>April</surname>
          </string-name>
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>