<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Search System Functions for Supporting Search Modes</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Thomas Beckers</string-name>
          <email>thomas.beckers@uni-due.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Norbert Fuhr</string-name>
          <email>norbert.fuhr@uni-due.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>The 2nd European Workshop on Human-Computer Inter-</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Information Engineering, University of Duisburg-Essen</institution>
          ,
          <addr-line>Duisburg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>action and Information Retrieval (EuroHCIR)</institution>
          ,
          <addr-line>Nijmegen</addr-line>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Tasks in web search are often rather simple, e.g. navigating to an already known web page or looking up a fact. However, tasks in other domains are usually more complex and diverse. Thus, we discuss various search modes of tasks and how they might be supported by functions of a search system. We give examples of the required search functions of di erent search modes and describe the implications for the design of search systems.</p>
      </abstract>
      <kwd-group>
        <kwd>system functions</kwd>
        <kwd>search modes</kwd>
        <kwd>user interfaces</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION AND RELATED WORK</title>
      <p>
        While tasks in web search are often rather simple [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] (e.g.
navigating to an already known web page or looking up a
fact), tasks in other domains (e.g. searches for scienti c
literature or patents) are usually more complex and diverse.
A set of search system functions that is well-suited for these
simple tasks is not appropriate for other more complex task
types. In our opinion, each type of task requires a di erent
set of search system functions. Thus, we argue that a \one
size ts all" approach (that is, using a search systems with
functions e.g. optimized for web search for di erent tasks in
other domains) does not allow the user to search e ectively
and e ciently. We propose a model of search functions that
allows mapping of search activities (search tasks) to
necessary system functions comprising the entire search activity.
      </p>
      <p>
        Hughes-Morgan and Wilson [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] have examined whether
improvements of an interactive search system are due to the
Presented at EuroHCIR2012. Copyright ' 2012 for the individual
papers by the papers’ authors. Copying permitted only for private and
academic purposes. This volume is published and copyrighted by its
editors.
newly introduced meta-data or to new search functionality.
They conclude that users can bene t from improved search
features while still using the same meta-data.
      </p>
      <p>
        Russel-Rose et al. developed a taxonomy for enterprise
search and site search by analyzing real-world scenarios [
        <xref ref-type="bibr" rid="ref10 ref11 ref12">11,
12, 10</xref>
        ] based on three top-level categories of search activities
originally proposed by Marchionini [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]:
      </p>
      <sec id="sec-1-1">
        <title>Lookup a) Locate b) Verify c) Monitor</title>
      </sec>
      <sec id="sec-1-2">
        <title>Learn a) Compare b) Comprehend c) Explore</title>
      </sec>
      <sec id="sec-1-3">
        <title>Investigate a) Analyze b) Evaluate c) Synthesize</title>
        <p>
          These categories are orthogonal to each other.
RusselRose et al. [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] introduce the notion of search modes. A
search mode is a concrete value of a search activity
category. Search modes can be combined to longer sequences or
networks. For enterprise search Locate is far less common
then Analyze and Evaluate. In the domain of site search the
emphasis is on Locate and Explore.
        </p>
        <p>In the remainder of this paper, we rst describe the
functional level of search systems. We show how search functions
can be mapped to di erent search modes by giving
examples to illustrate how search systems can support each mode
and its associated search functions. Subsequently, we
describe the implications for designing and developing search
systems. Finally, we give an outlook on future work and a
conclusion.
2.</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>SEARCH SYSTEM FUNCTIONS</title>
      <p>Higher Level Search Functions
Select/Organize/Project</p>
      <p>Session Support and</p>
      <p>Information Management</p>
      <p>
        We divide the functionality of an IR system into three
different groups depicted in Fig. 1: i) Select/Organize/Project
(SOP) ii) Session Support and Information Management
(SSIM) and iii) Higher Level Search Functions (HLSF). In
our notion a search function is a functionality of the
system with which the user can interact or that is xed by the
system designer. A more detailed explanation of the
latter two groups and an overall architectural view is given by
Beckers and Fuhr [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. We will concentrate on SOP and (to
a lesser degree) on HLSF in the following. In doing so, we
will focus on system functions and not discuss their concrete
visualizations in the user interface.
      </p>
      <sec id="sec-2-1">
        <title>Select functions</title>
        <p>Select (S) comprises functions for selecting (searching)
possibly relevant items.</p>
        <p>
          Ranking method Retrieval functions/ranking methods may
be more precision- or more recall-oriented, or they
may consider di erent sources of additional
information (like e.g. page-rank). Mutschke et al. [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] showed
that search in scienti c literature can be improved by
considering information about the author, the
publication venue or related terms from a thesaurus.
        </p>
        <p>Ranking principle The nal ranking might regard each
document in isolation, or consider all items above the
current one in the output ranking (like e.g. in diversity
ranking).</p>
        <p>Query language The query structure can be very simple
(e.g. a list of terms) or more powerful and expressive,
e.g. by supporting simple (boolean) and more complex
(wildcards, word distances, etc.) query operators as
well as elds and data types.</p>
        <p>Formal lter conditions The result set can be ltered by
some formal criteria (e.g. by data type, source, date)
which is usually done without a ecting the RSV.
Query formulation Queries can be formulated a priori as
in most systems but also by referring to one or more
given items (e.g. query by example, similarity search).</p>
      </sec>
      <sec id="sec-2-2">
        <title>Organize functions</title>
        <p>Organize (O) functions deal with the way how the set of
result items is structured and organized logically.
Sorting The results can be sorted according to one or more
criteria. When searching the best o er for a new
smartphone the items may be sorted by price and the
trustworthiness or customer ratings. While sorting
usually is a one-dimensional organization, also two- or
three-dimensional organizations may be helpful,
provided that appropriate visualizations are available in
the user interface.</p>
        <p>
          Grouping The results can be grouped according to a
simple criterion (e.g. grouping by release date, author,
source) or according to several facets, as in faceted
search [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ].
        </p>
        <p>
          Clustering While grouping is based on some formal
criteria, clustering focuses on content aspects based on a
some sort of similarity [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. Although users might have
problems interpreting the cluster structure, they might
also gain new insights about the result set.
        </p>
        <p>Linking In case there are (explicit or implicit) links
between the answer items, the resulting tree or graph
structure might be of interest (e.g. Web links, co-author
relationships in scienti c literature, or friendship
connections in social networks).</p>
      </sec>
      <sec id="sec-2-3">
        <title>Project functions</title>
        <p>Project (P) comprises functions for the construction of the
surrogates to be presented in the results.</p>
        <p>Selection Surrogates consists of speci c elds of the
result items (like e.g. title, author and year in literature
search).</p>
        <p>Summarization Either unbiased or query-biased summaries
(extracts) of the answer documents (or speci c elds
thereof) can be generated.</p>
        <p>Aggregation This function generates a single entry
representing several items di ering in formal aspects (e.g.
mirrors of a web page, various editions of a book) or
content (e.g. di erent reviews of a book in an online
store).</p>
        <p>Faceting For displaying facets with their existing values
and corresponding frequencies, the system must
support projection on single facets along with counting
values. From the point of view of a relational database,
if F denotes a facet/attribute, then the system has to
process the SQL query "select F, count(*) from R
where ... group by F" for each facet. Query
conditions and restrictions wrt. to the values of a facet then
a ect the where part of the query.</p>
        <p>Enrichment By using external data sources the results can
be enriched with additional data (e.g. on a product
review site, linking to online stores for each product).
Extracting The items can be used to extract new data
characterizing the whole result set, e.g. common terms
in the documents or frequent authors.</p>
      </sec>
      <sec id="sec-2-4">
        <title>Higher level search functions</title>
        <p>
          According to Bates [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] a system should not only o er basic
functionality. It should also provide support for search
tactics, stratagems and strategies. In our model a HLSF is a
function that uses lower level SOP and/or SSIM functions
(called moves regarding Bates' terminology) for providing
tactical and strategic support. For example, when
searching for relevant literature about a certain topic a stratagem
consisting of two tactics would be to i) search for documents
that contain some terms describing the topic and then ii)
using a function for exploring references and citations of
documents to nd related documents. An ideal system should
also be able to support these kinds of search functions.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. SUPPORTING SEARCH MODES</title>
      <p>We think that the search mode taxonomy is exible and
general enough to be also well-suited for many other
domains. We regard a search mode or a sequence of search
modes (just called search mode in the following for the sake
of simpli cation) as a higher level search function (or task)
as de ned by Bates. In the following we will give
examples which functions are particularly required for supporting
certain search modes. Functions from all three groups are
required of course but we will focus on those that are the
most important and distinctive ones. These requirements
are listed in Table 1 and will be explained in more detail in
the following.</p>
      <p>Investigate: Analyze Analyzing items to identify patterns
and relationships is a very complex task. Thus, the
system should o er several versatile and powerful
organization functions.</p>
      <p>
        There are several functions that may be helpful for
the user here, such as i) (multi-dimensional) sorting,
ii) grouping, iii) clustering and iv) linking of the result
items. Sorting result items allows the user to inspect
the items by the priority of one or more sorting criteria.
The HyperScatter component of the visual information
seeking system MedioVis [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] would be a proper
visualization and interaction technique. Especially,
multidimensional sorting might help in understanding the
relationship between facets (e.g. when buying a digital
camera, the user might want to learn which features
have a strong in uence on the camera price).
Functions for grouping may help the user in gaining new
insights or getting an overview of the result items (see
preceding search modes). Clustering the result items
may be helpful for nding previously unknown
similarities by creating groups of items with an unknown
meaning. Additionally, a clustered result set may
support the user in getting an overview of the found items
easier. Functions for linking the items can produce tree
or graph structures of the result set. These functions
can be used for creating e.g. networks based on some
kind of relationship.
      </p>
      <p>Investigate: Evaluate For judging the value of an item
concerning a speci c goal or purpose the system should
be able to let the user organize the result items
according to the important criterion, e.g. by sorting.</p>
      <p>Investigate: Synthesize This search mode occurs when
the user is creating new objects from the found result
items. We envisage that a system may support this
by o ering a join function similar to joins in relational
databases.</p>
      <p>
        The system does not have to allow the user to perform
all functions that are theoretically possible. Instead, the
system should perform certain functions automatically and
should use suitable preadjustments and defaults (see levels
of system involvement by Bates [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]). Which functions the
user should interact with depends on the search modes and
the domain in which the system is actually used.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. IMPLICATIONS FOR THE DESIGN OF</title>
    </sec>
    <sec id="sec-5">
      <title>SEARCH SYSTEMS</title>
      <p>
        In the previous section we provided examples which
functions are required for di erent search modes. An ideal search
system should be exible enough to support a broad variety
of search modes. Which set of functions is exactly required
certainly depends on the context the system is used within
and the tasks a user typically performs. Adding as many
functions as possible may leads to a feature-bloated system.
Instead, only the appropriate functions should be o ered to
the user. Richer functionality requires increased user
expertise. Thus, the interaction and visualization techniques have
to be chosen carefully to provides an easy-to-use system.
Further open research issues concerning rich functionality
have been described by Beckers and Fuhr [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>The discussion in this paper has shown that the ideal
search system extends classical IR functionality with typical
database functions, as well as more advanced IR functions.
Thus, typical IR systems as well as relational database
systems are both far away from the ideal system. An XQuery
system with full-text search might come closest today, but
it lacks all the more advanced IR functions. Whatever the
resulting query language might look like, however, it should
be clear that it mainly targets at the application developer,
who speci es the functionality needed, which is then mapped
onto a user-friendly interface.</p>
    </sec>
    <sec id="sec-6">
      <title>CONCLUSION AND OUTLOOK</title>
      <p>We demonstrated how di erent search modes require
different search functions of the system. Thus, an ideal search
system suitable for various search modes should not only
support classic search functions for ad-hoc retrieval (e.g.
ordinary web search) but also more advanced functions
described in this paper. Our grouping of search functions
allows the identi cation of functions possibly required for a
certain search mode. Previous research in this area can be
categorized and integrated.</p>
      <p>Further empirical research is necessary to validate our
proposed mapping from search modes to search functions. A
rst step may be to show exemplarily that for a particular
search tasks the users can bene t from improved and
suitable functionality by controlling the other variables.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Bates</surname>
          </string-name>
          .
          <article-title>Information search tactics</article-title>
          .
          <source>Journal of the American Society for Information Science</source>
          ,
          <volume>30</volume>
          (
          <issue>4</issue>
          ):
          <volume>205</volume>
          {
          <fpage>214</fpage>
          ,
          <year>1979</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Bates</surname>
          </string-name>
          .
          <article-title>Where should the person stop and the information search interface start? Information Processing</article-title>
          and Management,
          <volume>26</volume>
          (
          <issue>5</issue>
          ):
          <volume>575</volume>
          {
          <fpage>591</fpage>
          ,
          <year>1990</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>T.</given-names>
            <surname>Beckers</surname>
          </string-name>
          and
          <string-name>
            <given-names>N.</given-names>
            <surname>Fuhr</surname>
          </string-name>
          .
          <article-title>Towards the systematic design of IR systems supporting complex search tasks</article-title>
          .
          <source>In Proceedings of the Task Based and Aggregated Search Workshop @ ECIR</source>
          <year>2012</year>
          ,
          <year>April 2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Broder</surname>
          </string-name>
          .
          <article-title>A taxonomy of web search</article-title>
          .
          <source>SIGIR Forum</source>
          ,
          <volume>36</volume>
          :3{
          <fpage>10</fpage>
          ,
          <year>September 2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>N.</given-names>
            <surname>Fuhr</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lechtenfeld</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Gollub</surname>
          </string-name>
          .
          <article-title>The optimum clustering framework: Implementing the cluster hypothesis</article-title>
          .
          <source>Information Retrieval</source>
          ,
          <volume>15</volume>
          :
          <fpage>93</fpage>
          {
          <fpage>115</fpage>
          ,
          <year>2012</year>
          . DOI:
          <volume>10</volume>
          .1007/s10791-011-9173-9.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Heilig</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Demarmels</surname>
          </string-name>
          , W. A. Konig, J. Gerken,
          <string-name>
            <given-names>S.</given-names>
            <surname>Rexhausen</surname>
          </string-name>
          , H.
          <string-name>
            <surname>-C. Jetter</surname>
            , and
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Reiterer</surname>
          </string-name>
          .
          <article-title>Mediovis: visual information seeking in digital libraries</article-title>
          .
          <source>In Proceedings of the working conference on Advanced visual interfaces</source>
          ,
          <source>AVI '08</source>
          , pages
          <fpage>490</fpage>
          {
          <fpage>491</fpage>
          , New York, NY, USA,
          <year>2008</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>K.</given-names>
            <surname>Hughes-Morgan</surname>
          </string-name>
          and
          <string-name>
            <given-names>M. L.</given-names>
            <surname>Wilson</surname>
          </string-name>
          .
          <article-title>Information vs interaction { examining di erent interaction models over consistent metadata</article-title>
          .
          <source>In Proceedings of the IIiX conference</source>
          ,
          <year>2012</year>
          . To be published.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>G.</given-names>
            <surname>Marchionini</surname>
          </string-name>
          .
          <article-title>Exploratory search: from nding to understanding</article-title>
          .
          <source>Commun. ACM</source>
          ,
          <volume>49</volume>
          (
          <issue>4</issue>
          ):
          <volume>41</volume>
          {
          <fpage>46</fpage>
          ,
          <string-name>
            <surname>Apr</surname>
          </string-name>
          .
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>P.</given-names>
            <surname>Mutschke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mayr</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Schaer</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Sure</surname>
          </string-name>
          .
          <article-title>Science models as value-added services for scholarly information systems</article-title>
          . Scientometrics,
          <volume>89</volume>
          (
          <issue>1</issue>
          ):
          <volume>349</volume>
          {
          <fpage>364</fpage>
          ,
          <string-name>
            <surname>Oct</surname>
          </string-name>
          .
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>T.</given-names>
            <surname>Russell-Rose</surname>
          </string-name>
          .
          <article-title>A taxonomy of site search</article-title>
          . Talk at Enterprise Search Europe, UK, May
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>T.</given-names>
            <surname>Russell-Rose</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lamantia</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Burrell</surname>
          </string-name>
          .
          <article-title>A taxonomy of enterprise search</article-title>
          .
          <source>In Proceedings of euroHCIR</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>T.</given-names>
            <surname>Russell-Rose</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lamantia</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Burrell</surname>
          </string-name>
          .
          <article-title>A taxonomy of enterprise search and discovery</article-title>
          .
          <source>In Proceedings of HCIR</source>
          <year>2011</year>
          ,
          <year>October 2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>D.</given-names>
            <surname>Tunkelang</surname>
          </string-name>
          .
          <source>Faceted Search. Number 5 in Synthesis Lectures on Information Concepts</source>
          , Retrieval, and Services. Morgan &amp; Claypool Publishers,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>