=Paper= {{Paper |id=None |storemode=property |title=Towards User-Centered Retrieval Algorithms |pdfUrl=https://ceur-ws.org/Vol-763/posterA.pdf |volume=Vol-763 |dblpUrl=https://dblp.org/rec/conf/eurohcir/Fonseca11 }} ==Towards User-Centered Retrieval Algorithms== https://ceur-ws.org/Vol-763/posterA.pdf
                  Towards User-Centered Retrieval Algorithms

                                                             Manuel J. Fonseca
                                          Department of Computer Science and Engineering
                                            INESC-ID/IST/Technical University of Lisbon
                                            R. Alves Redol, 9, 1000-029 Lisboa, Portugal
                                                                mjf@inesc-id.pt


ABSTRACT                                                                    not be able to find what they want or they may not even be
Nowadays almost all retrieval algorithms (for text, images,                 able to submit a query to the system.
drawings, etc.) are mainly concerned in achieving good                        For illustration purposes let us consider the following hy-
system-centered measures, such as precision and recall. How-                pothetic scenario: “We developed a system for retrieving
ever, these systems are used by users, who try to achieve                   generic complex vector drawings, like for instance techni-
goals through the execution of tasks. To better satisfy the                 cal drawings, architectural plants or clipart drawings. We
users’ needs we must involve them in the development pro-                   evaluated it using query-by-example and a set of predefined
cess of the retrieval systems.                                              drawings, achieving a good precision and recall measure. Af-
   In this paper, we argue that a user-centered approach,                   terwards, when we delivered the system to users, we noticed
where users are included in the development cycle of the                    that they were not able to use it, because they could not find
overall retrieval system, can lead to improved retrieval algo-              the (first) drawing that they must use as query to find the
rithms and also to a better user satisfaction while using the               desired drawing. Moreover, users do not want to search for
system.                                                                     the complete drawing, but only by a subpart of the drawing.”
                                                                              This scenario could be avoided if before we developed the
                                                                            retrieval system we asked users what were their needs, what
Categories and Subject Descriptors                                          did they want to perform on the system and how they want
H.3.3 [Information Storage and Retrieval]: Information                      to do it. To collect all this information we need to apply
Search and Retrieval; H.5.2 [Information Interfaces and                     a user-centered approach where users are involved in the
Presentation]: User Interfaces - Graphical user interfaces                  development of the retrieval system and algorithms.
(GUI)                                                                         In this paper we defend an user-centered approach as a
                                                                            way to create better retrieval algorithms and improve the
                                                                            overall retrieval system. We start by shortly describe the
General Terms                                                               user-centered approach and the iterative cycle used in the
Design, Human Factors                                                       user interface design. In Section 3 we describe our appli-
                                                                            cation of the user-centered approach in the development of
Keywords                                                                    retrieval algorithms. Finally, we present some conclusions.
User-Centered Design, User-centered approach, Retrieval al-
gorithms                                                                    2.   USER-CENTERED DESIGN
                                                                               The user-centered design (UCD) is a design methodology,
1.    INTRODUCTION                                                          where the needs, skills and limitations of the users are taken
   The majority of the retrieval algorithms, whether they                   into account during all stages of the development of the sys-
are for text, images, drawings, 3D objects, audio, video, etc.,             tem. The key premise of the user-centered design is that
are mainly interested in performing well for system-centered                the active involvement of the users in the development pro-
measures, like for instance precision and recall. However,                  cess as well as in the evaluation of the interactive products
these systems are used by users who want to perform spe-                    can lead to well-designed systems that best meet the desired
cific tasks and achieve specific goals. We can develop a good               usability goals. These systems will take advantage of users
retrieval system, that performs well against a predefined                   skills, will be relevant to their work and activities, and will
ground truth, but when we delivery it to users they may                     help them rather than constrain their actions.
                                                                               One of the principles from the UCD [4] states that we
                                                                            first need to identify who the users will be (profile, skills
                                                                            limitations, etc.) and what tasks they perform and/or wish
                                                                            to perform. The second principle mentions that the systems
                                                                            should be exposed to users in the early stages of development
                                                                            to collect feedback from them. Finally, the third principle is
Copyright c 2011 for the individual papers by the papers’ authors. Copy-    iterative design. The results and feedback from user testing
ing permitted only for private and academic purposes. This volume is pub-   should be used to fix and improve the system. The UCD
lished and copyrighted by the editors of euroHCIR2011.
EuroHCIR ’11 Newcastle, UK                                                  assumes an iterative cycle with identification of the users’
.                                                                           needs, design of the solution and evaluation, repeated as
often as necessary, as depicted in Figure 1.                      (system and user centered measures) should be used to im-
                                                                  prove the system and to refine the user and functional re-
                         !"#$%&'(%)&"*%                           quirements of the retrieval system.
                           +'&,-"."%                                 One of the things that we observed in one evaluation ses-
                                                                  sion with users, was that users did not care about where
                                                                  in the order of retrieval the intended drawing appears, the
                                                                  important fact being that it was there. One of the users pro-
                                                                  duced this comment “It [the system] found it [the drawing]!
                                                                  That is what counts!” However, when we evaluate retrieval
                                                                  systems, the majority of the existing measures and ground
            89&,1&20'%                      /0,120'%              truth datasets privilege precision. Of course this system-
                                          3#".4'%&'(%             centered evaluation is important, but we should also take
                                          5$0606-7.'4%            into account the users perspective, where they privilege re-
                                                                  call.

                                                                  3.1   An Example
                                                                     Involving the users can affect the way we develop the re-
     Figure 1: User-centered design iterative cycle.
                                                                  trieval algorithms. In recent years we developed a generic
                                                                  approach for complex vector drawing retrieval, based on the
                                                                  topology and geometry of the elements present in the draw-
3.    USER-CENTERED RETRIEVAL                                     ing. These two features were used to describe the content
   Typically when we want to develop a new retrieval ap-          of the drawings, and during matching, we first compare the
proach, we look at the media to retrieve (text, audio, video,     drawings using topology and them we compare the geome-
drawings, images, etc.), identify the features that better de-    try of those with similar topologies, giving the same weigh
scribe the media, create a matching algorithm and finally         to both features (for more details see [1]). This generic re-
we compute precision and recall. Although this methodol-          trieval approach was used to develop one system for retriev-
ogy allows us to create retrieval systems, we believe that by     ing technical drawings [3] and another for retrieving clipart
including the user in the development cycle will allow us to      drawings [2].
deliver better and more usable retrieval systems, that will          Before we developed this solution and the two retrieval
allow users to achieve their goals and not only systems that      systems, we performed user and task analysis to understand
have a good precision and recall performance.                     how users wanted to make queries to this type of systems.
   Moreover, we should not develop retrieval systems, and         We notice that they prefer to draw sketches of the drawing
that includes descriptor computation, matching algorithms         that they were looking for than to submit an existing draw-
and presentation of the results, without first identifying a      ing to perform a query-by-example. Moreover, most of the
set of user needs and functional requirements (first step in      times they do not have a drawing similar to the one that
the user-centered design). We need to know our users, their       they are looking for.
skills, their background, their profile. We must identify their      The two systems were both evaluated with users, and from
needs and requirements, their goals and how they achieve          those evaluations we observed that the way users search for
them. In summary, we need to do an user and task analysis         technical drawings was different from the way they search
before we start developing our retrieval system. User and         for clipart drawings [6]. While in the case of technical draw-
task analysis should not only influence the design of the         ings users draw more complete sketches with several visual
user interface, but also the design of the retrieval approach     elements, and consequently defining a richer topological con-
or algorithm.
   For instance, users could use various strategies to perform
a search in a drawing retrieval system. They could use a
drawing that they already have, in a file, to search for sim-
ilar drawings using query-by-example, or they could draw
a sketch of the drawing that they want to find. As we can
see, the retrieval solution (feature extraction, indexing and
matching algorithms) will be different on each case. While
in the first case we only need to compare two drawings of
the same complexity and with the same characteristics (sets
of lines and polygons), in the second case we need to com-
pare complex drawings with sketches (typically simpler and
with less elements). Thus, the way users perform the task
to achieve their goal influence the retrieval approach that
we should develop.
   After developing the retrieval solution based on the user
requirements, we should evaluate the retrieval system, using
not only system-centered measures, but also user-centered
measures, such as time to complete tasks, error rates, sat-
isfaction, etc. As in the user-centered design of interactive     Figure 2: Sketch specifying a query to find a tech-
systems, results from the evaluation of the retrieval system      nical drawing.
                                                                   4.   CONCLUSIONS
                                                                      In this paper we defended a user-centered approach for
                                                                   the development of retrieval systems. As in the case of user
                                                                   interfaces design, also for retrieval systems is important to
                                                                   know our users, adapt the algorithms to them, and involve
                                                                   the users in the evaluation of the system.
                                                                      We believe, and we had confirmed, that the involvement
                                                                   of the user in the development cycle of retrieval systems can
                                                                   conduct to better systems that satisfy users needs and are
Figure 3: Sketch specifying a query to find a clipart              more adapted to them.
drawing.
                                                                   5.   ACKNOWLEDGMENTS
                                                                     This work was supported by FCT through the PIDDAC
figuration, as illustrated in Figure 2; for clipart drawings,      Program funds (INESC-ID multiannual funding) and the
users produced simpler sketches, with fewer elements and           Crush project, PTDC/EIA-EIA/108077/2008.
with a poorer topological description (see Figure 3).
   Due to this observation during tests with users, we refine
our retrieval algorithm for retrieving clipart drawings [5],       6.   REFERENCES
putting more emphasis on the geometry than on topology.            [1] M. J. Fonseca. Sketch-Based Retrieval in Large Sets of
With this change we were able to achieve a better precision            Drawings. PhD thesis, Instituto Superior Técnico /
and recall measure for clipart drawings, and we adapted our            Technical University of Lisbon, July 2004.
retrieval system to the users’ way of sketching queries.           [2] M. J. Fonseca, B. Barroso, P. Ribeiro, and J. A. Jorge.
                                                                       Retrieving clipart images by content. In Proceedings of
3.2    Discussion                                                      the 3rd International Conference on Image and Video
   We can not develop our retrieval algorithms without in-             Retrieval (CIVR’04), volume 3115 of Lecture Notes in
volving our users into the development cycle. As in the                Computer Science, pages 500–507. Springer-Verlag,
design of interactive systems, also in the development of re-          Dublin, Ireland, July 2004.
trieval systems we must involve the users.                         [3] M. J. Fonseca, A. Ferreira, and J. A. Jorge.
   They must be involved in the initial phase, so we can               Content-based retrieval of technical drawings.
understand how they search for the information, what are               International Journal of Computer Applications in
their knowledge, what are their limitations and what is their          Technology (IJCAT), 23(2–4):86–100, 2005.
profile. With this we are able to identify users needs and         [4] J. D. Gould and C. Lewis. Designing for usability: key
functional requirements.                                               principles and what designers think. Commun. ACM,
   Later on, during the development of the algorithms we               28(3):300–311, 1985.
should take into account this input and adapt the algorithms       [5] P. Sousa and M. J. Fonseca. Geometric matching for
to provide “good results” for ”our” users, and not for the users       clip-art drawing retrieval. Journal of Visual
in general, or for the system.                                         Communication and Image Representation (JVCI),
   Finally, during the evaluation stage, besides computing             20(2):71–83, February 2009.
the traditional system-centered measures, for a set of datasets    [6] P. Sousa and M. J. Fonseca. Sketch-based retrieval of
defined as ground truth, we should also involve users in the           drawings using spatial proximity. Journal of Visual
evaluation to collect quantitative and qualitative measures.           Languages and Computing (JVLC), 21(2):69–80, April
Information gather during evaluation should be used to im-             2010.
prove the retrieval algorithms and the overall retrieval sys-
tem, in the next iteration of the iterative cycle of the user-
centered approach.