<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Cluster Manipulation Paradigm for Mobile Web Search Interaction</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>CNR-IDPA Politecnico di Milano Università di Bergamo Via Pasubio 5 DEI Facoltà di Ingegneria 24044</institution>
          <addr-line>Dalmine (BG) Piazza L. da Vinci 32 Viale Marconi 5 Italy 20133 Milano 24044 Dalmine, BG</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2010</year>
      </pub-date>
      <fpage>27</fpage>
      <lpage>28</lpage>
      <abstract>
        <p>This paper describes a new interaction paradigm well suited to perform web searches though a mobile device. The prototypal system that implements this novel interaction framework is named Matrioshka, that is a multi-modal system. In this paper we focus on the interaction framework and will introduce brie°y an overview of the mobile version of Matrioshka. This framework is based on cluster manipulation operations. The results of a user request, yielded by one or more search engines, are organized into labelled clusters. Then, some manipulation operators can be applied to rerank clusters or to combine them to generate new clusters. These facilities allow the user to capture the relevant documents hidden in the large set of retrieved ones in the ¯rst ranked clusters.</p>
      </abstract>
      <kwd-group>
        <kwd>Web searches</kwd>
        <kwd>mobile information retrieval</kwd>
        <kwd>results clustering</kwd>
        <kwd>ranking strategies</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>The large di®usion of Internet connections from anywhere
at anytime has arisen the problem of more e®ective ways
of searching the Web from mobile devices. In this paper,
a mobile interaction framework for web meta-searching is
proposed, whose de¯nition is motivated by the observation
that the visualization method based on the ranked list of
web pages is too long to ¯t small screens such as those of
mobile devices. Further, with the aid of a mobile keyboard,
the usual way of interacting with search engines based on
repeated cycles of query reformulation imposes too much
burden to the user. At the same time, it is too expensive in
terms of the high cost of mobile connections. In fact, if users
do not ¯nd what they are looking for in the ¯rst one or two
result pages, they are more keen to reformulate a new query
than to analyze successive pages, or to submit the current
query to another search engine.</p>
      <p>
        To overcome these drawbacks, some search services such
as vivisimo, clusty, Snaket, Ask.com (at [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]), MS AdCenter
Labs Search Result Clustering, etc., proposed to cluster the
results of Web searches. W.r.t. the ranked list, clustered
results are more compact and o®er an overview of the main
topics dealt with in much more documents than those
contained in the ¯rst few pages, that would be missed otherwise
[
        <xref ref-type="bibr" rid="ref11 ref16 ref8">8, 16, 11</xref>
        ]. As far as we know, in the literature we found
only one academic mobile search engine, named Credino [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]
that exploits clustered results.
      </p>
      <p>On the other side, one problem users encounter with such
clustered results, is the inability of fully understanding the
contents of the clusters. This is mainly due to the short and
sometimes bad quality of the clusters' labels, which generally
consist of a few terms, or individual short phrases, which are
automatically extracted from the documents within clusters.
Often, several clusters have similar labels, which di®er just
for a single term. To e®ectively explore the cluster contents,
users have no other means than clicking on the cluster labels
and browsing the clusters themselves. On a mobile device,
this modality would again require too much scrolling.</p>
      <p>The idea of our proposal is to maintain the result
clustering paradigm, and to provide users with a language to
manipulate clusters. Both several ranking criteria to
differently order the clusters, and operators to combine the
clusters themselves are de¯ned whose ¯nal aim is to make
possible the exploration of the retrieved contents.</p>
      <p>
        The literature on mobile search engines mainly focuses on
modelling the user context, considering primarily the user
geographic location, in order to ¯lter the retrieved results
[
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]; other topics are the summarization of documents [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ],
and the de¯nition and use of data visualization schemes [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
In [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] the clustering of retrieved results is proposed as a
useful way of presenting the search results on small screens, but,
to the best of our knowledge, only the mobile search engine
Credino [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] performs clustering.
      </p>
      <p>
        The manipulation language as a basis for a °exible
interaction makes our proposal substantially di®erent from Credino
[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], where the focus is the clustering algorithm it adopts
w.r.t. other clustering methods, and does not o®er criteria
to explore the cluster contents.
      </p>
      <p>
        A motivation of utility of the manipulation language can
be found in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] which advocates the need of tools for
giving the user more immediate control over the clusters of
retrieved web documents. Our proposal can be particularly
useful when groups of clusters with same or almost same
labels are generated by distinct requests or by the same query
submitted to distinct search engines. In such situations it
becomes necessary to explore the contents of the clusters
and their relationships in terms of number of contained
documents, relevance of contents, homogeneity of contents, or
common and distinct contents with other clusters. This
task an exploratory task, that may last for a long time, and
may require to reuse the intermediate results several times.
For this reason, storing of the intermediate results into a
database is essential for successive manipulation.
Furthermore, the local manipulation of results avoids the useless
overloading of both the network and the search engines. In
fact, in current practices, several modi¯ed queries are
submitted to the search engines, trying to capture relevant
documents in the ¯rst positions of the ranked list; note that
most of these documents were already retrieved by the
previous queries, although hidden to the user since they did not
occur in the ¯rst positions.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], we proposed and de¯ned the operators for
combining the clusters for revealing their implicit
relationships. In [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] a prototypal mobile meta search system was
proposed that allows easily using the combination operators.
      </p>
      <p>In this paper we propose an extension of the manipulation
language by introducing a ranking operator that makes
possible the exploration of the cluster contents based on distinct
properties of the clusters.
2.</p>
    </sec>
    <sec id="sec-2">
      <title>THE INTERACTION FRAMEWORK</title>
      <p>Data Model. Here we describe the data model on which
the proposed interaction framework is based. We start
considering a query q submitted to a search engine; its result is
a ranked list of documents, that we call items.</p>
      <p>De¯nition 1: Item An item i represents (an instance of)
a document retrieved by a web search. It is described by
the following attributes: uri, which is the Uniform Resource
Identi¯er of the ranked web document; title and snippet
which are, respectively, the document title and snippet1;
¯nally, irank is a score (in the range [0; 1]) that expresses
the estimated relevance of the retrieved document w.r.t. the
query. 2</p>
      <p>
        The same document (web page) may be represented by
distinct items in distinct result lists. In facts, we assume
that a document is uniquely identi¯ed by its uri [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], while it
may have distinct snippets, irank and title, when retrieved
by di®erent search services (or by di®erent queries). We
assume that irank is a function of the position of the item
in the query result list.
      </p>
      <p>In our system, the results of a user request (or exploration)
are not simply a ranked list of documents, but they are
gathered in ordered clusters.</p>
      <p>De¯nition 2: Cluster A cluster c is a set of items, having
a rank. It is de¯ned by two attributes: label is a set of
terms that semantically synthesize the main content of the
cluster; crank is a score (in the range [0; 1]) depending on
some property of the cluster. 2</p>
      <p>
        A cluster label is automatically generated by a speci¯c
labelling algorithm on the basis of frequent terms in cluster
items [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>At this point we de¯ne the main element of the data model.
De¯nition 3: Group A group g is a non empty, ordered
set of clusters. It is described by the following attributes:
1The snippet is an excerpt of the document, made by a set
of sentences that may contain the keywords of the query
label, a set of terms that semantically synthesizes the main
content of the group; s, the name of the search engines used
to retrieve the items in the clusters of the group. 2</p>
      <p>Finally we de¯ne the users' History repository.</p>
      <p>De¯nition 4: History A history H is a set of items. It
can be the empty set, at the beginning of a search session,
and it can be updated by explicit action of the user when
he/she decides to save a retrieved document. 2
Manipulating Clusters. The procedure that generates
a group is initially activated by a search operator, named
CQuery, that allows users to query a search engine (e.g.,
Google, Yahoo!, MSN Search) and to cluster the results. In
the implementation we considered a maximum of N
documents, with n ¸ 30, i.e., a number of documents greater
than that retrieved in the ¯rst three pages, those usually
analyzed by a common user.</p>
      <p>On this basis, for each retrieved document, the operator
builds an item i, whose irank value depends on the position
of the document in the result list: i:irank = (N ¡ P os(d) +
1)=N (where P os(d) is the position of the document in the
query result list). In this way, a document in the ¯rst
positions has a rank r:irank very close to 1. This is done in order
to achieve independence and comparability of the ranking
produced by distinct search engines.</p>
      <p>
        The ranked list obtained as a result by the search
operator, is then clustered by applying the Lingo algorithm [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
Lingo is used to perform a °at crisp clustering of the query
results on the basis of their snippets and titles. Once
clusters are obtained, they are labelled. Finally also the groups
are labelled (see [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] for the labelling algorithm) to synthesize
the most central contents retrieved by all their clusters.
      </p>
      <p>Successively, one can decide either to explore the groups of
clusters retrieved by a single query by applying some ranking
operation described in 2.1 which evaluates a cluster property,
or one can generate other groups by combining the obtained
ones through the operators de¯ned in Section 2.2.
2.1</p>
    </sec>
    <sec id="sec-3">
      <title>Cluster Ranking Methods</title>
      <p>Once the results of a query are obtained as a group of
ranked clusters, in which the default crank score is computed
as the average of the irank of its documents, the user has
the possibility to re-rank the clusters based on the evaluation
of some other clusters' property. This allows to obtain, in
the ¯rst positions of the ranked list of clusters, those clusters
that previously could appear in the last positions. This is the
novel contribution of the paper w.r.t. our previous work: the
user is this way provided with the possibility of evaluating
groups by di®erent perspectives.</p>
      <p>The cluster properties that can be considered for the
ranking are the following:
² Relevance: this is de¯ned as the average of the relevance
scores of documents belonging to the cluster and is the
default property for the ordering of clusters; the relevance
scores of clusters are the irank values computed as previously
de¯ned from the documents' positions in the ranked list
returned by the search engine. Ordering clusters by decreasing
values of their relevance means being interested primarily in
the relevance of documents contained in the clusters.
² Ponderosity : this is de¯ned as the cluster cardinality, and
it measures how many documents belong to the clusters; the
ranking of clusters in decreasing order of their ponderosity
can be useful for users interested in high recall.
² Heterogeneity : this is de¯ned as the variance of the
documents vectors, represented in the space of index terms
extracted from their titles and snippets, and weighted by their
relative frequency, w.r.t. the cluster centroid vector, de¯ned
as the average vectors of all the documents vectors belonging
to the cluster. The greater the variance the more
heterogeneous is the cluster: by choosing to rank clusters in
increasing order of their heterogeneity means being interested in
contents focalized on the speci¯c meaning expressed by the
label of the cluster, since the cluster label is generated from
its centroid vector. This can be useful in target searches.
Conversely, by choosing to rank clusters in decreasing
order of their heterogeneity means being more tolerant on the
meaning expressed by the cluster label; this can be useful
when one is unsure to have expressed by the query the actual
information needs and wants to soften the selection
conditions.
² Novelty: this is de¯ned as the proportion of novel
documents contained in the cluster w.r.t. previously already seen
documents, that the user has saved in the history
repository; choosing a novelty ranking means being interested in
new documents on the topics of a search and can be useful
in the context of bibliographic surveys.</p>
      <p>In order to rank clusters of a group based on one of the
above properties the operation ClusterRank is de¯ned:
g0 = ClusterRank(g; property; order)
in which g and g0 are the input and output groups of clusters,
property takes values in a set of strings fRelevance,
Ponderosity, Etherogeneity, Noveltyg denoting a cluster
property; order 2 fincreasing, decreasing g indicates the desired
ordering, i.e., increasing and decreasing w.r.t. the value of
the speci¯ed cluster property, respectively.
g0 has the same label of g and contains the same clusters of
g with the only di®erence that the clusters' crank scores are
computed based on the speci¯ed proverty of the clusters:
property(ci)
cranki = MAXk(property(ck))
2.2</p>
    </sec>
    <sec id="sec-4">
      <title>Combining Groups of Clusters</title>
      <p>The system provides users with the possibility to interact
with the results of search services organized in groups of
clusters, in order to get more satisfactory and re¯ned results
to their needs. To this aim, the user can choose to apply
di®erent sequences of operators on selected groups, in order
to recombine (modify, explore) their structure and content.</p>
      <p>
        The operators that we are going to illustrate are formally
de¯ned in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]; they are inspired by the operators provided by
the Relational Algebra (i.e. intersection, join, union etc.),
thought they are speci¯cally de¯ned for groups of clusters.
They generate, starting from two input groups g1 and g2,
one group g0 that may contain one ore more clusters; it can
also be empty, in the case no common items are detected.
      </p>
      <p>First of all, we describe two basic operations that combine
items belonging to two input clusters to get a new cluster.
We de¯ne two basic operations: Cluster Intersection and
Cluster Union. They work on the uri of the items of two
input clusters, assuming that uri is the document's unique
identi¯er. The rationale of this assumption is the fact that
the same document, retrieved by two di®erent search
services, may have di®erent title and snippet, but maintains
the same uri. Consider the intersection of two clusters c1
and c2, denoted as:
c0 = ClusterIntersection(c1; c2).</p>
      <p>The irank of i0 2 c0, the cluster resulting from the
intersection, is de¯ned as the minimum irank value of i1 and i2.2
In the case of cluster union, denoted as</p>
      <p>c0 = ClusterU nion(c1; c2),
the irank of i0 is the maximum irank value of i1 and i2.3 In
both cluster intersection and union, the title and the snippet
of the resulting items are obtained by selecting either i1:title
or i2:title, and either i1:snippet or i2:snippet, respectively.</p>
      <p>In particular, to obtain the title and the snippet of the
items belonging to the clusters of the resulting groups we
select as resulting title and snippet, those belonging to the
document having the smallest (in the case of Cluster
Intersection) or the greatest (in the case of Cluster Union) value
of irank, without making any changes. The rationale of this
choice is the fact that in the aggregation based on the
intersection (union), we want to represent the document by
its worst (best) representative, in accordance with the
modelling of the AND and the OR within fuzzy set theory.
2.2.1</p>
      <p>Group Operators</p>
      <p>The ¯rst group operators we describe are not properly
combination operators: they are the Group Selection and
the Group Deletion. The Group Selection operator allows
to select the clusters in a group. In the resulting group, the
selected clusters maintain the original order.</p>
      <p>Similarly, the Group Deletion operator allows the user to
delete clusters. Like for the Cluster Selection operator, the
original order is maintained in the resulting group.</p>
      <p>The following operators combine and generate groups.
Group Intersection. Group Intersection is de¯ned to
support the straightforward wish of users to intersect clusters in
two groups, to ¯nd more speci¯c clusters. The assumption is
that the more search services (or the more distinct queries)
retrieve the same document, the more the document content
is worth analyzing.</p>
      <p>
        De¯nition 5: The Group Intersection operator generates
a new group composed of all the combination of clusters in
the original groups having a not empty intersection.
In particular, given g1 and g2 the groups of cluster to
intersect, the resulting group g0 is composed of all the clusters c0
such that: c0= ClusterIntersect (c1, c2) with jc0j 6= 0. 2
Group Join A key operator of the language, closely related
to the previous one, is the Group Join. It lets the user
expand the original clusters in a group with clusters, possibly
belonging to another group, that share one or more
documents. The group Join operator can be used to explicit
indirect correlations between the topics represented by the
clusters in the two input groups. The basic idea underlying
its de¯nition is that if two clusters have a non empty
intersection (i.e. have some common items), this means that the
texts of their items are related with both topics represented
by the clusters. This may hint the existence of an implicit
relationship between the topics of the two clusters.
By merging the two overlapping clusters into a single one,
the more general topic representing the whole content of the
new cluster can be revealed, which subsumes, as more
speci¯c topics, those of the original clusters.
2This de¯nition is consistent with the de¯nition of the
intersection operation between fuzzy sets [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ].
3This is also consistent with the de¯nition of union of fuzzy
sets.
      </p>
      <p>De¯nition 6: The Group Join operator allows the user
to obtain, from two or more input groups, a resulting group
composed by the union of all those pairs of original clusters
that present a not empty intersection.</p>
      <p>In particular, given g1 and g2 the input groups, for each pair
of clusters c1 2 g1 and c2 2 g2, the cluster</p>
      <p>c' = ClusterU nion(c1; c2) 2 g0,
if and only if ClusterIntersection(c1; c2) 6= ;, with g0 the
resulting group. 2
Group Re¯nement The Group Re¯nement operator is
aimed at re¯ning clusters in a group, based on clusters in
another group. While the group join operator generates a
cluster representing a more general topic than the topics in
both the original clusters, the re¯nement operator can be
regarded as generating clusters specializing the topics of the
clusters in the ¯rst group on the basis of the topics of any
cluster in the second group. The idea underlying this
operator is that we want to collect, in a unique cluster, the items
(that are considered by the user as more interesting) which
belong to both a cluster c1 of the ¯rst group g1 and any of
the second group g2. This way, by eliminating some items
from c1, we generate a cluster representing a more speci¯c
topic w.r.t. c1, but not necessarily more speci¯c w.r.t. the
clusters of the second group.</p>
      <p>De¯nition 7: The Group Re¯nement operator allows the
user to keep, from the original group g1, only the clusters ci
containing documents presents in at least one of the clusters
cj of the most interesting group g2.</p>
      <p>In particular, given g1 (group of clusters to re¯ne) and g2
(interesting group), and being c1 a cluster such that c1 2 g1,
for each cluster cj 2 g2 we compute the cluster union of the
intersections cj, cj = ClusterIntersection(c1; cj).
If the union c0 of cj is not empty, then c0 2 g0. 2</p>
      <p>The operators so far introduced constitute the core of our
proposal; the others are sketched hereafter.</p>
      <p>Group Union. The Group Union operator unites together
two groups. It generates the resulting group g0 in such a
way it contains all clusters in the input groups g1 and g2.
Group Coalescing. Complex processing of retrieved
documents may need to be performed by fusing all clusters in
a group into one global cluster. The Group Coalescing
operator generates a resulting group g0 in such a way that
g0 contains only one cluster, obtained by uniting together all
clusters in the input group g.</p>
      <p>Reclustering. After complex transformations, it might be
necessary to reapply the clustering method to a group. In
fact, reclustering documents in a group may let new and
unexpected semantic information emerge.</p>
      <p>
        The Reclustering operator coalesces all clusters in the input
group g and generates a new group g0 in such a way that it
contains all the clusters obtained by clustering all items.
The Closure Property of Group Operators holds: operators
are de¯ned on groups and generate groups [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
    </sec>
    <sec id="sec-5">
      <title>THE MOBILE SYSTEM MATRIOSHKA</title>
      <p>The interaction framework introduced in the previous
section has been implemented in the mobile version of the
prototypal system Matrioshka.</p>
      <p>It is constituted by three main parts: the client side
components handle the user interaction; the server side
component interfaces the search engines and executes the
clustering and the manipulation operations speci¯ed by the user;
¯nally, the Communication Layer dispatches the messages
between client and server. Speci¯cally, the client provides
a query editor for the user, the server either executes the
queries and builds the groups of clusters or executes the
operations on previously generated groups of clusters. Let us
describe the functionality of each architectural component.</p>
      <p>On the client side the Matrioshka User Interface
collects users requests, displays the results of queries and/or
the application of manipulation operations. The
Clientside components are thin clients compliant, and
communicate with the server-side by exchanging XML messages.
Speci¯cally, the component for mobile devices (called
Mobile Matrioshka), is a Javascript application based on the
AJAX (Asynchronous JavaScript and XML) web
development technique.</p>
      <p>The Server Side exposes a web service interface, based
on XML messages: it receives requests to perform queries
on search services, or to apply the operators; it replies with
groups of clusters. All the data received from the search
engines, and those resulting from the operations, are stored
in an XML native database; this way, the entire process is
stored and can be accessed to carry on the exploratory task.
The server side is entirely implemented in the Java
Language. The interaction with search services usually exploits
web service APIs provided by the search engines, otherwise
the standard HTTP interaction model is exploited.</p>
      <p>Document clustering is performed on the indexes extracted
from the titles and snippets of retrieved documents
(generated by using Lucene functions): the Lingo multilingual
algorithm, provided by the Carrot2 libraries is used.</p>
      <p>The interpreter of the combination operators has been
implemented from scratch.</p>
      <p>The Communication Layer is a pool of JSP scripts,
executed on top of the Tomcat web server. It carries out
the client/server communication through XML format
messages, according with AJAX web development techniques,
and by the support of the Tomcat Java servlet container.</p>
      <p>When the user logs into the system, a speci¯c instance of
the database is created, in which the entire exploratory
process performed by the user will be stored. When logged-in,
the user has the possibility to submit queries to the chosen
search engine (as shown in the left-hand side of Figure 1).</p>
      <p>In order to organize a trip to visit London, let us submit
the query "visit London" to the search engines Google,
Yahoo! and MSN search. Groups g1, g2, g3 in Figure 2 are the
resulting groups clusters; the three groups being generated
by the same query "Visit London" have the same label.</p>
      <p>Terminated the inspection of clusters in the groups, we
can interactively ask for executing some operators, in an
attempt of obtaining clusters with labels that more closely
g1"Visit London"
cl.1: Visit London
cl.2: When to visit London
cl.3: Destination marketing
cl.4: London tourist information
cl.5: Visit London services
cl.6: The Royal Parks
cl.7: London Theater Guides
g2"Visit London"
cl.1: Visit London
cl.2: Visit London-o±cial web site
cl.3: Attractions in London
cl.4: London City Guide 2008
cl.5: Family-Visit London
cl.6: Visit London Organizers
cl.7: London Travel Maps
cl.8: Business-Visit London
g3"Visit London"
cl.1: Travel - Visit London
cl.2: Visit London Organizers
cl.3: Special O®ers - Visit London
cl.4: London Accommodation Guide
cl.5: Visit London Corporate
cl.6: London Maps - Visit London
cl.8: Places to go - Visit London
g4"Visit London"
cl.1: Visit London
cl.2: Visit London-o±cial website
cl.3: Visit London-o±cial website
g5"Mayor of London"
cl.1: Visit London
cl.2: London Accommodation Guide
cl.3: Mayor of London
meet our needs. At ¯rst, we ask to intersect the three groups
to retrieve the most reliable documents. By observing
clusters in the resulting group g4, we then decide to request a
join of the three original groups g1 g2 and g3, in order to
expand the contents obtained by the intersection (see the
screen shots in Figure 1). A new group g5 is generated with
more populous clusters: these clusters are the union of the
original clusters that share some common document. We
can see that the obtained clusters are identi¯ed by labels
which hints the presence of new correlated contents w.r.t.
the labels of the clusters obtained by the intersections of the
same groups (see groups g4 vs group g5 in Figure 2).</p>
    </sec>
    <sec id="sec-6">
      <title>CONCLUSIONS</title>
      <p>In this paper, we described a novel interaction framework
for web searches implemented by the prototypal mobile
version of the system Matrioshka.</p>
      <p>The features that make this framework particularly suitable
for mobile searches are several: ¯rst, it presents clustered
results of the searches so as to better render them on the
small screen of mobile devices; it makes available ranking
and combination operators de¯ned for clusters
manipulation which allow easily exploring the retrieved results, thus
alleviating network overloading caused by the submission of
repeated re¯ned queries to search engines. The large number
of documents retrieved by such engines constitute a serious
obstacle for users of mobile devices, who generally engages
long trial and error query reformulation phases to retrieve
relevant results in ¯rst few positions.</p>
      <p>The operator provided by the interaction framework are the
basis for complex exploratory tasks; users can issue
operations through the mobile interface, but certainly they must
be skilled users; certainly, generic users are in troubles.
Currently we are performing an evaluation study to understand
the e®ectiveness for end users, in order to de¯ne novel, more
user friendly interaction paradigms on the client side, more
suitable for generic users.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Ask</surname>
          </string-name>
          .com clustering. www.ask.com/reference/dictionary/49514/cluster.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <article-title>[2] The GRASS system</article-title>
          . http://credino.dimi.uniud.it/.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>G.</given-names>
            <surname>Bordogna</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Campi</surname>
          </string-name>
          , G. Psaila, and
          <string-name>
            <given-names>S.</given-names>
            <surname>Ronchi</surname>
          </string-name>
          .
          <article-title>An interaction framework for mobile web searches results</article-title>
          . MOMM-
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>G.</given-names>
            <surname>Bordogna</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Campi</surname>
          </string-name>
          , G. Psaila, and
          <string-name>
            <given-names>S.</given-names>
            <surname>Ronchi</surname>
          </string-name>
          .
          <article-title>A language for manipulating clustered web documents results</article-title>
          . CIKM-
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>G.</given-names>
            <surname>Bordogna</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Campi</surname>
          </string-name>
          , G. Psaila, and
          <string-name>
            <given-names>S.</given-names>
            <surname>Ronchi</surname>
          </string-name>
          .
          <article-title>A °exibkle language for exploring clusterd search results</article-title>
          .
          <source>Scalable Fuzzy Algs. for Data Mgmt. and Anal.</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>G.</given-names>
            <surname>Buchnan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Jones</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Marsden</surname>
          </string-name>
          .
          <article-title>Exploring small screen digital library access with the greenstone digital library</article-title>
          .
          <source>Eur. Conf. on Digital Libraries</source>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>O.</given-names>
            <surname>Buyukkokten</surname>
          </string-name>
          and et al.
          <article-title>E±cient web browsing on handheld devices using page and form summarization</article-title>
          .
          <source>ACM Trans. on Inf. Systems</source>
          ,
          <volume>20</volume>
          (
          <issue>1</issue>
          ):
          <volume>82</volume>
          {
          <fpage>115</fpage>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>H.</given-names>
            <surname>Chen</surname>
          </string-name>
          and
          <string-name>
            <given-names>S.</given-names>
            <surname>Dumais</surname>
          </string-name>
          .
          <article-title>Bringing order to the web: Automatically categorizing search results</article-title>
          .
          <source>SIGCHI Conf. on Human factors in computing systems</source>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>T.</given-names>
            <surname>Coates</surname>
          </string-name>
          and al. Uris, urls, and urns:
          <source>Clari¯cations and recommendations 1.0. W3C Tech. report</source>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>F.</given-names>
            <surname>Crestani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dunlop</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Jones</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Jones</surname>
          </string-name>
          , and S. Mizzaro (eds.).
          <article-title>Special issue on interactive mobile inf</article-title>
          .
          <source>access. J. of Personal and Ubiquitous Comp</source>
          .,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Hearst</surname>
          </string-name>
          and
          <string-name>
            <given-names>J. O.</given-names>
            <surname>Pederson</surname>
          </string-name>
          .
          <article-title>Reexamining the cluster hypothesis: Scatter/gather on retrieval results</article-title>
          .
          <source>Conf. on Research and Devel. in Inf. Retrieval</source>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>A. V.</given-names>
            <surname>Leouski</surname>
          </string-name>
          and
          <string-name>
            <given-names>W. B.</given-names>
            <surname>Croft</surname>
          </string-name>
          .
          <article-title>An evaluation of techniques for clustering search results</article-title>
          .
          <source>Tech. Rep. of the Dept. of Computer Science</source>
          f University of Massachusetts at Amherst, IR-
          <volume>76</volume>
          :
          <fpage>122</fpage>
          {
          <fpage>133</fpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Noirhomme-Fraiture and al. Data visualizations on small and very small screens</article-title>
          .
          <source>Conf. on Applied Stocastics Models and Data Analysis</source>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>S.</given-names>
            <surname>Osinski</surname>
          </string-name>
          .
          <article-title>An algorithm for clustering of web search results</article-title>
          .
          <source>Master's thesis</source>
          ,
          <year>2009</year>
          , Poznan' Univ. of Tech.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>L.</given-names>
            <surname>Zadeh</surname>
          </string-name>
          .
          <article-title>Fuzzy sets</article-title>
          .
          <source>Information and control</source>
          ,
          <year>1965</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>O.</given-names>
            <surname>Zamir</surname>
          </string-name>
          and
          <string-name>
            <given-names>O.</given-names>
            <surname>Etzioni</surname>
          </string-name>
          .
          <article-title>Grouper: a dynamic clustering interface to web search results</article-title>
          .
          <source>WWW</source>
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>