<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The University of Glasgow at ImageClefPhoto 2009</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Guido Zuccon</string-name>
          <email>Id@1</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Teerapong Leelanupab</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anuj Goyal</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Martin Halvey</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>P. Punitha</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Joemon M. Jose</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>General Terms</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Interdependent Document Relevance, Diversity, Novelty</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Algorithms</institution>
          ,
          <addr-line>Measurement, Performance, Experimentation</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Glasgow</institution>
          ,
          <addr-line>Glasgow, G12 8RZ</addr-line>
          ,
          <country country="UK">United Kingdom</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper we describe the approaches adopted to generate the ve runs submitted to ImageClefPhoto 2009 by the University of Glasgow. The aim of our methods is to exploit document diversity in the rankings. All our runs used text statistics extracted from the captions associated to each image in the collection, except one run which combines the textual statistics with visual features extracted from the provided images. The results suggest that our methods based on text captions signi cantly improve the performance of the respective baselines, while the approach that combines visual features with text statistics shows lower levels of improvements.</p>
      </abstract>
      <kwd-group>
        <kwd>H</kwd>
        <kwd>3 [Information Storage and Retrieval]</kwd>
        <kwd>H</kwd>
        <kwd>3</kwd>
        <kwd>1 Content Analysis and Indexing</kwd>
        <kwd>H</kwd>
        <kwd>3</kwd>
        <kwd>3 Information Search and Retrieval</kwd>
        <kwd>H</kwd>
        <kwd>3</kwd>
        <kwd>4 Systems and Software</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        In this paper we provide an overview of our participation in ImageClefPhoto 2009 and describe the
methods adopted to generate the submitted runs. The ultimate aim of this year's retrieval task
was to promote diversity against redundancy in document retrieval ranking. The need for diversity
in a result list is empirically motivated by several studies [
        <xref ref-type="bibr" rid="ref1 ref11 ref4">1, 4, 11</xref>
        ]. Addressing diversity issues
aims to allow retrieval systems coping with poorly speci ed or ambiguous queries, maximizing the
chance to retrieve relevant images to the user's information need.
      </p>
      <p>We recognize that diversity is a broad and under-speci ed concept: it is possible to refer to
topical diversity, source diversity, presentation diversity, etc. In the context of ImageClefPhoto
2009 we identi ed the following types of diversity: (i) topical diversity, (ii) semantic diversity, (iii)
visual diversity.</p>
      <p>In four out of ve submitted runs, we have tackled the problem of diversity considering
topicality and semantics. Under these approaches, the content of the documents have been assumed
to coincide with the text (the caption) associated with each image. All the four runs based on
text aim to exploit the statistical features associated to the captions, promoting topically and
semantically di erent documents. In the last of the ve runs, however, we considered also visual
features, combining these with the topical information provided by the matching between the
textual queries associated to the topics and the textual captions. Thus, in this approach both topical
and visual diversity were captured in our ranking function.</p>
      <p>
        The evaluation of the participants' results is a central step in a retrieval campaign. In order
to evaluate diversity, several measures have been proposed, such as s-recall (also called cluster
recall), s-precision, ws-precision [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], -NCGD [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], subMRR [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Of these, only s-recall has been
used in ImageClefPhoto 2009. Submitted runs have been discriminated by the performances they
obtained in s-recall at 10 and precision at 10. We believe, however, that a thorough evaluation
of the runs by means of the diversity measures suggested in the literature would provide more
insights for the understanding of how well systems are diversifying document rankings.
      </p>
      <p>The paper continues as follows. In section 2 we describe the approaches implemented to
generate the submitted runs. Section 3 reports the results obtained by evaluating the runs against
the ground truth and further discussions related to our approaches. Finally, the paper concludes
in section 4 where we state the achievements obtained by participating to ImageClefPhoto2009
and future lines of research.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Approaches</title>
      <p>In the following we illustrate the approaches adopted to tackle the challenging task of promoting
diversity in this year's ImageClefPhoto. All our approaches assume a common initial document
retrieving and ranking step. In this step, a query is matched against each document in the
collection and a ranked list is returned along with the scores associated to each reported document.
We refer to this step as the initial ranking process. Afterwards each method processes the common
input in order to generate a re-ranking of the initial results. Our approach based on the Quantum
Probability Ranking Principle (QPRP) is motivated employing quantum theory, while the others
start from empirical observation to derive a model. The approach called \VisualDiversity" is
our only method which uses both the visual features of the images and the text features of the
associated captions. The remaining methods instead employ exclusively statistics drawn from the
provided textual captions.</p>
      <p>
        The remaining of this section is structured as follows. In section 2.2 we illustrate the rst of
our approaches based exclusively on textual features, the Maximal Marginal Relevance (MMR),
derived from [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Afterwards we present the methods implemented in Glasgow run 3 and
Glasgow run 4 (section 2.3), which rely on a clustering procedure to individuate topically
and semantically diverse facets of the retrieved documents and re-ranks the initial ranking
combining those evidences and the MMR methodology. The main intuitions behind the QPRP and
details of the suggested ranking procedure are presented in section 2.4. Finally, in section 2.5 the
method called \VisualDiversity", which employs statistics derived both from text captions and
from images, is illustrated.
2.1
      </p>
      <sec id="sec-2-1">
        <title>Initial Ranking</title>
        <p>In this section, we brie y describe the data and method employed to generate an initial ranking
for the topics of ImageClefPhoto 2009. For each of the 50 topics, a set of subtopics have been
individuated by the organizers. For example, topic 16, which title is \queen" has as associated
subtopics \queen silvia", \queen rania", \queen so a", and an \other" cluster, which contains
facets not related to the previous clusters. Although the topics where provided with a description
and an image example, we decided to use merely the topic's titles as queries. Doing so, we aimed
to capture the situation of a user posing a very broad or ambiguous query. The topics have been
matched against a collection of nearly 500,000 pairs of images and textual annotations from the
Belga News Agency.</p>
        <p>
          In order to retrieve the initial document ranking for our runs Glasgow - run - 1,3,4,5 we
employed Terrier1 and we developed our method using Java. For the Glasgow - run - 2, instead,
we obtained the initial document ranking from Lemur2, and the re-ranking procedure has been
implemented in C++. In both cases, stemming and stop word removal has been applied using
the same dataset, and the internal implementation of the Okapi's retrieval function [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] has been
used to retrieve the initial set of documents and to estimate the probability of relevance of each
document. The initial ranking has been also used as baseline to evaluate the performances gains
our methods obtained.
2.2
        </p>
        <p>
          MMR
Maximal Marginal Relevance (MMR) has been one of the rst approaches that aimed to exploit
the diversity between documents by combining it with the relevance score of each document [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ].
We implemented the MMR approach in Glasgow - run - 1 with the intention of using this as a
benchmark against the approaches we propose in the following. More recent and sophisticated
methods, such as [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ], might have been implemented as well. However, they require a thorough
and complex exploration of the parameters space. This is not possible in the context of
ImageClefPhoto2009 since there are not either a training set or the subtopical qrels to be employed in
the learning of the optimum parameter values. The MMR approach we adopted is characterized
by the following equation:
        </p>
        <p>M M RJ+1
argmax[ S(xi; q) + (1
xi2InJ
)D(xi; (x1; :::xJ ))]
(1)
where I is the set of initial results retrieved by the IR system; J is a set of re-ranked results at
iteration J ; q is a query; xi is a candidate document in I nJ , which is the set of documents that have
not been ranked yet; and xJ is a document in J , i.e. the set of documents that have been already
ranked. Function S(xi; q) is a normalised similarity metric used for document retrieval, such as
Okapi BM25, while D(xi; (x1; :::xJ )) is a diversity metric, which we implemented as the opposite
of the cosine similarity between document vectors which components are the BM25 weights of the
respective terms. Intuitively, the MMR method maximizes marginal relevance and diversity in
document retrieval at each iteration by linearly combining the relevance of the documents with
the dissimilarity to previous retrieved documents. The parameter &gt; 0:5 means that similarity
to the query is more important than novelty/diversity, while &lt; 0:5 represents situations where
novelty/diversity is more important than relevance to the query. In previous experiments, we found
that = 0:3 yields satisfying results since the obtained ranking still contains a great number of
relevant documents. We therefore selected this value as a constant for all our runs based on MMR.</p>
        <p>Following the MMR method, the rst image to be inserted in the nal ranking is the one that is
most similar to the query, since no previous results are in the ranking and the diversity component
of eq. 1 does not contribute to the document's nal score. Afterwards, the image xi 2 InJ which
obtains the highest MMR score in each iteration is added to the results list until all images are
selected. In our Glasgow - run - 1, we re-ranked the top 500 images.
2.3
2.3.1</p>
      </sec>
      <sec id="sec-2-2">
        <title>Semantic Clustering</title>
        <sec id="sec-2-2-1">
          <title>Clustering as pre-processing for MMR improvement</title>
          <p>Although MMR can adequately combine relevance and diversity in the ranking results, the selection
of the rst document to retrieve might represent a problem. In fact, the score for a new document
to be added depends upon the documents ranked at the previous steps. As a result, an unfavorable
choice at an early position in the result list may adversely deteriorate the entire ranking. The
reranking procedure might get trapped in a local maximum of distance (diversity). MMR suggests
that the rst document retrieved should be the one with highest similarity score. However, there
have been several methods proposed to leverage the problem of non-optimal ranking due to initial
1http://ir.dcs.gla.ac.uk/terrier/.</p>
          <p>
            2http://lemurproject.org/.
bad selection such as dynamic programming [
            <xref ref-type="bibr" rid="ref3">3</xref>
            ], requiring high computation cost. In our Glasgow
- run - 3, we strive to deal with this problem using a light-weight approach by incorporating within
a clustering technique.
          </p>
          <p>In our experiment, we assumed that top 100 ranked documents from an initial set of results are
highly relevant to the user's information need. Note that this is in some extent similar to pseudo
relevance feedback, but in our approach re-ranking is performed without an extended interaction.
We then cluster the top 100 documents using Hierarchical Agglomerative Clustering due to its
low computational cost. Subsequently, we consider only the biggest cluster in order to extract the
rst document to rank. This choice is motivated by the assumption that the biggest cluster in
the top list could be used to characterize the most popular subtopic users identify. The medoid
of the biggest cluster is selected as a representative and as an initial choice for re-ranking. Later
re-ranking iterations are processed following the MMR method on the top 500 images from the
initial ranking obtained by Okapi.
2.3.2</p>
        </sec>
        <sec id="sec-2-2-2">
          <title>Revisiting the integration of Clustering and MMR techniques</title>
          <p>
            A popular method to obtain diverse but relevant results in image and information retrieval is
re-ranking based on clustering techniques. Most methods deal with the diversity problem using
a two-step approach. In the rst step the set of potentially relevant documents is acquired using
standard retrieval methods. In the second step these documents are clustered, assuming that
individual clusters have the potential to represent di erent sub-topics of results amongst the rst
positions. To build the nal ranked list based on this approach, a representative document is then
taken from each cluster in a round-robin fashion and added to the result list [
            <xref ref-type="bibr" rid="ref5">5</xref>
            ].
          </p>
          <p>
            There are three common approaches used to select the representative document of each cluster.
The rst is to select the cluster's document, with the highest similarity score to the given query [
            <xref ref-type="bibr" rid="ref3">3</xref>
            ].
In this case, relevance is considered solely after clustering. A second approach is to select the
medoid3, which is assumed being the best representative of the cluster [
            <xref ref-type="bibr" rid="ref14 ref9">9, 14</xref>
            ]. The third method
selects the most similar document to the members of the selected cluster [
            <xref ref-type="bibr" rid="ref6">6</xref>
            ]. Therefore, all the
presented methods merely rely on clustering and consequently ignore the diversity of documents
in the nal list. As a result, the top ranked results are still similar or semantically close to each
other. To the best of our knowledge, no existing cluster-based methods consider the diversity and
relevance criterion aiming to nd the optimal balance between similarity of the results to queries
and diversity between candidate documents within the documents ranking.
          </p>
          <p>In this paper, we propose an approach that takes into consideration the diversity criterion on
the selection of documents within clusters. This approach has been implemented in Glasgow - run
- 4 on the top 500 images from the initial ranking employing the Expectation Maximisation (EM)
clustering algorithm, which has been observed to perform best in clustering similar images in our
previous experiments. We set the number of clusters to be 20, aiming at producing a ranking that,
in the top 20, holds as many relevant images that are representative of the di erent sub-topics
within the results. The standard deviation is set at 6 10 6. The seed number is set at 100 with
100 maximum iterations. After clustering the dataset of the initial ranking, clusters are ranked
according to average relevance score of instances within clusters as de ned by
S(Ck; q) =
1 I</p>
          <p>X s(xki; q)
Ik i=1
(2)
where S(Ck; q) is the average similarity of the cluster Ck to query q; Ik is the number of
documents in cluster Ck; and X = fx1; :::; xng is the initial set of potential relevant documents.
Subsequently, the retrieval consists of two steps: in the rst step, we select clusters according to the
means of their similarity scores. We used a round robin approach to simplify the selection at cluster
level in our preliminary experiment. In the second step, MMR is employed to select a document
to be added to the results list from a particular cluster in each iteration. We assume that applying
diversity based re-ranking on clusters can enhance the overall diversity of the document ranking,
3The document closest to the centroid of the cluster.
yet maintaining relevancy to the query. In each iteration, the number of candidate documents to
be compared is reduced from the entire set of initial rank to a set of a speci c cluster. MMR can
be used to fuse similarity and diversity:</p>
          <p>R(xk; q; (xr1; :::xrJ )) = S(xk; q) + (1
)D(xk; (xr1; :::xrJ ))
where R is the retrieval score for a candidate document xk in a speci c cluster k given query
q and S is the similarity of the candidate document given the query. is a weight used to
balance similarity and diversity, and it has been set to 0.3 in this experiment. For partial results
(xr1; :::xrJ ) in the nal ranked list at iteration J , we de ne the dissimilarity score for a candidate
image xk to be inserted in the ranking at iteration J+1 as</p>
          <p>D(xk; (xr1; :::xrJ )) = 1 XJ d(xk; xrj)</p>
          <p>J
j=1
(3)
(4)
2.4</p>
        </sec>
      </sec>
      <sec id="sec-2-3">
        <title>QPRP</title>
        <p>
          For the run Glasgow run 2 we employed an approach based on the theoretical framework
of the Quantum Probability Ranking Principle (QPRP), introduced in [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]. The framework has
justi cation from quantum probability theory and posits that documents have to be ranked not
only based on the probability of relevance of each document to the user's information need, but
also according to the interdependent relationships between documents at relevance level. This
approach then aims to capture and model interdependent document relevance and diversity in
document ranking from rst principles. Although the QPRP presents still several open questions,
in particular related to the estimation of the probability of relevance and the interference e ects,
we identi ed in ImageClefPhoto 2009 a suitable testing scenario for the current developments
accomplished towards a full understanding of the capabilities of the QPRP. For a complete
presentation of the QPRP framework and of the open questions related to it the reader is referred
to [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]. In this paper, instead, we provide a brief introduction to the principle and the main
intuitions underlying it.
        </p>
        <p>Consider the following scenario. A set of documents has been retrieved in response to an
information need and a probability of relevance to the information need is attached to each document.
The system is asked to provide a documents ranking to present to the user. Assume the document
with highest probability of relevance to the information need is ranked at the rst position4.</p>
        <sec id="sec-2-3-1">
          <title>Which document the system should rank next? The widely used Probability Ranking</title>
          <p>
            Principle (PRP) [
            <xref ref-type="bibr" rid="ref13">13</xref>
            ] posits that documents should be ranked in descendent probability of relevance
to the information need. Thus, according to the PRP the system should present at the second
position in the rank the document with the second highest probability of relevance. Consider this
example. The set of retrieved documents is R = fd1; d2; d3g and the probability of relevance of
each document to the information need is respectively P (d1) = 0:9, P (d2) = 0:8, P (d3) = 0:7.
Following our assumption, d1 is presented at the rst rank; a system implementing the PRP ranks
at the second position in the rank document d2, since P (d2) &gt; P (d3).
          </p>
          <p>Conversely, the QPRP suggests that interdependent document relationships at relevance level
should be accounted for in the document ranking. In particular, the QPRP ranks d2 at the
second position in the rank if and only if P (d2) + Id1;d2 &gt; P (d3) + Id1;d3 , where Id1;di is the
quantum interference between the documents that have been already ranked, d1 in our example,
and document di. The QPRP posits that the interference between documents occurs at relevance
level and that it models interdependent document relevance.</p>
        </sec>
        <sec id="sec-2-3-2">
          <title>Regarded to what do PRP's and QPRP's rankings di er? Intuitively, the QPRP is</title>
          <p>a generalization of the PRP: in particular when the interference term equals zero for each pair
of documents, then the two principles provide the same ranking list. However, when Id1;d2 (or
4This is a quite intuitive assumption that is justi ed both from the PRP and the QPRP.
equally Id1;d3 ) is not null, the ranking suggested by the PRP is subverted by the interference term
if</p>
          <p>Id1;d3 &gt; P (d2)</p>
          <p>
            P (d3) + Id1;d2
Although there is no guarantee that this happens in document ranking, a follow-up paper to [
            <xref ref-type="bibr" rid="ref17">17</xref>
            ]
will empirically illustrate the presence of such phenomena and thus that PRP and QPRP provide
signi cant di erent rankings.
          </p>
          <p>Ranking documents with the QPRP. In the following we describe the complete ranking
strategy suggested by the QPRP. The rst document that is ranked is the one that has higher
probability of relevance given the information need. This is the same as the PRP suggests, and
is also the best we can do from a utility theory point of view: given the evidence that we have
before starting to rank, the best document to be returned to the user at rank one is the one
that is expected to be most relevant to his information need. We indicate this document as d@1,
meaning the document that is ranked at position 1. The document that has to be returned at
second position in the ranking is the one which maximizes
with di 2 RE n fd@1g, being RE the set of documents retrieved in response to the query5. Let RA
contain the documents that have been already ranked; then the document that has to be returned
at rank n is the one that maximizes
(5)
(6)
(7)
(8)
with di 2 RE n RA.</p>
          <p>
            The interference term. At this stage it appears clear the central role of the interference
term in the formulation of the QPPR. However, what is the interference term governed by? The
interference term is expressed (refer to [
            <xref ref-type="bibr" rid="ref17">17</xref>
            ] for the mathematical justi cation) as:
q
Idx;dy = 2
          </p>
          <p>P (dx)P (dy)cos dx;dy
P (di) +</p>
          <p>X
dx2RA</p>
          <p>Idx;di
where dx;dy is the di erence of the phases6 associated to the probabilities of dx and dy. Thus
the interference's behaviour depends from the phase di erence . A correct estimation of either
or of the probability (of relevance) amplitudes associated to documents is then essential for an
e ective modeling of interdependent document relevance using the QPRP framework: this is still
argument of research. Thus, we employed a rather nave estimation of based on the opposite of
the Pearson's correlation between vectors associated to documents. Each component of the vectors
is associated to a term of the collection's vocabulary and is characterized by the correspondent
Okapi weight.</p>
          <p>
            In Glasgow - run - 2 we implemented the QPRP paradigm, by re-ranking an initial document
ranking containing the 100 most relevant documents to a query retrieved using Lemur. The
parameters of the Okapi's score function were set to the values suggested in [
            <xref ref-type="bibr" rid="ref12">12</xref>
            ].
2.5
          </p>
        </sec>
      </sec>
      <sec id="sec-2-4">
        <title>Visual Diversity</title>
        <p>In Glasgow run 5 we combined text statistic with visual features. Before generating a ranking
encoding diversity exploiting the visual content of the images, we processes for each query the
top 100 documents retrieved employing the Okapi scoring function. Afterwards we apply factor
analysis and bi-clustering to the visual features of the gathered documents. These two techniques
are presented in the next sections.</p>
        <p>
          5Thus RE n fd@1; : : : ; d@xg represents the set of documents retrieved but not yet ranked after x positions.
6Recall that the QPRP assumes the presence of a complex probability (of relevance) amplitude i associated to
each document di, where probability amplitudes and probabilities are related by P (di) = j ij2.
In the following, we illustrate factor analysis in details. Factor analysis (FA) is a linear method
for dimensionality reduction based on the second-order data summaries (covariances) [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. The
underlying assumption of Fa is that measured variables depend on some unknown, and often
unmeasurable, common factors. The aim of FA is to uncover such relations, and thus it can be
employed to reduce dataset's dimensions.
        </p>
        <p>The main intuition under FA is that a set of p standardized variables (in our case images in
the result set) x = fx1; ::::; xpg can be represented as a linear combination of latent orthogonal
uncorrelated variables f = ff1; ::::; fmg called common factors, summed to unique (speci c) factors
e = fe; : : : ; epg. The unique factors e1; : : : ; ep express the part of x that cannot be explained by
the common factors. Formally,</p>
        <p>xi = i1f1 + i2f2 + :::: + imfm + ei
where coe cients ij ; i 2 f1; :::; pgj 2 f1; :::; mg; are called factor loadings and represent the
correlation between variable xi and factor fj . Each common factor represents some common
characteristics of the data under consideration.</p>
        <p>Eq. (9) can be rewritten in matrix form, with obvious notation as
where x = [x1 x2 x3 :::: xp]T ; e = [e1 e2 e3 :::: ep]T , and
x =</p>
        <p>f + e
0
is also called the loading matrix.</p>
        <p>The communality in FA represents the extent of overlap between the variables and the factors.
In detail, communality is the proportion of variance of a particular variable (i.e. images in the
initial ranking) that is due to common factors, that is, the proportion of variance that each variable
has in common with other variables. The communality for each variable is equal to the sums of
squares of the loadings for the variable. Let the communality of xi is ci then:
ci =
q i21 + i22 + ::: + 2
im
2.5.2</p>
        <sec id="sec-2-4-1">
          <title>Bi-clustering</title>
          <p>Bi-clustering is a technique for two-way data analysis. A two-way dataset is a matrix with rows ri,
column cj and entries aij . The purpose of bi-clustering is to nd subgroups of rows and columns,
which are as similar as possible to each other and as di erent as possible to the rest.</p>
          <p>Data analysis by using simple clustering techniques such as K-Means clustering makes several
a-priori assumptions that may not be adequate in all circumstances. In fact, clustering can be
applied to either images or factors, implicitly directing the analysis to a particular aspect of the
system (e.g. groups of factors or groups of images). Also, clustering algorithms usually try to get
disjoint sets of elements, constraining an image or factor to belong exclusively to one cluster.</p>
          <p>The concept of a bi-cluster allows for a more exible framework for data analysis. A bi-cluster
is de ned as a submatrix containing a set/subset of images and a set/subset of factors. Given a
loading matrix, we can distinguish the data patterns it represents by a collection of bi-clusters,
each representing a di erent type of joint characteristics of a subset of images in a corresponding
subset of factors. In brief, there are no a-priori constraints on the organization of bi-clusters and
in particular, images or factors can be part of more than one bi-cluster or of no bi-cluster.</p>
          <p>
            Many algorithms like: BiMax, Plaid, Spectral, XMotifs have been proposed for bi-clustering;
a comparison of these algorithms is provided in [
            <xref ref-type="bibr" rid="ref10 ref7">7, 10</xref>
            ]. In our diversity based retrieval technique,
the BiMax algorithm is implemented because of its lower computational complexity. In a binary
matrix, the BiMax algorithm nds sub-matrix where all entries are one. The algorithm iterates
through the following two steps:
1. Rearrange the rows and columns to concentrate ones in the upper right of the matrix;
2. Divide the matrix into two submatrices.
          </p>
          <p>Whenever only ones are found in one of the submatrices, this submatrix is returned. In order to
obtain satisfying results the method is restarted several times with di erent starting points.</p>
          <p>In the next section we detail our proposed approach for re-ranking of text based results to
promote visual diversity in document ranking.
2.5.3</p>
        </sec>
        <sec id="sec-2-4-2">
          <title>Visual Features based Diversity</title>
          <p>Our main goal is to re-rank text based results to make the ranked list more diverse based on
visual features. For this purpose, we use two low-level features de ned in MPEG-7 standards i.e.
Color Structure and Edge Histogram. While color structure gives information about the color
distribution, edge histogram provides information about edge distribution. We combined these
two features into one feature by a concatenation to process an image further. To apply factor
analysis, a matrix F is generated by using the top 100 results from text based retrieval, in which
each row contains a particular feature component of all top 100 images. Let the component number
(dimensionality) in a low-level feature be d, the size of F will be (d 100).</p>
          <p>Factor analysis is applied on the covariance matrix of F which generates the loading matrix .
In the loading matrix, each row represents an image by its factor loadings for all common factors
from 100 top results. Further, we calculate communality for each image by using eq. 11. The
Communality tells us about the characteristics of an image which are common with other images
in the result set.</p>
          <p>Since each common factor represents some speci c visual characteristics of the result set,
combinations of di erent common factors are also an e cient representation of result set characteristics.
For diversity based retrieval, our main goal is to create subgroups of the loading matrix such that
a xed number of factors behave in a similar pattern for all images in that subgroup. A similar
pattern means that a factor is either highly correlated or less correlated with all components in a
subgroup. This is a two-fold requirement: (1) images within a subgroup will represent some
common characteristic; and (2) images from two di erent subgroups will contain somewhat di erent
characteristics.</p>
          <p>We can employ three di erent methods to cluster the loading matrix. The image clustering
only considers the overall distance between two images by using all factors loadings, which is
questionable as we have to think about factor combinations as well. In factor clusters, we have a
group of di erent factors which behave almost the same for all images. However, it will miss some
factor combinations that behave similarly only for some images, due to the constraint that each
cluster should contain all images.</p>
          <p>To overcome the problems in these one-way clustering, we turn to bi-clustering over the loading
matrix. Bi-clustering results in subgroups in which some particular factors are highly correlated
with all images in that subgroup. Each bi-cluster hence contains factors or factor components as
wished. Moreover, each bi-cluster represents distinct characteristics of the result set.</p>
          <p>
            We used the BiMax algorithm [
            <xref ref-type="bibr" rid="ref10">10</xref>
            ] for bi-clustering the loading matrix . BiMax works on a
binary matrix. Let aij be an original element in before binarization and bij the binary value,
the binarizarion function is de ned as follows:
bij =
0 if ij Median of all loadings for factor j
1 if ij &gt; Median of all loadings for factor j
(12)
Factors will be encoded as a 1 if they have loading greater than the median, 0 otherwise. The
median is used for partitioning the loading columns in two equal parts.
          </p>
          <p>After bi-clustering, each bi-cluster contains some distinct characteristics of the result set which
is used to generate diversity in the nal ranking. To create a nal document ranking, we rst
calculate the ranking within a cluster based on the ranking in text based results and the communality
of each image. The text ranking (Rtext) characterizes relevance while the communality based rank
(Rcomm) returns the most common images from that cluster. Finally, we re-rank images within a
cluster in increasing order of their nal rank Rf :</p>
          <p>Rf = ( )Rtext + (1
)Rcomm
(13)</p>
          <p>Lesser the value of Rf , higher the image will be in the nal document ranking. Note that a
high means that similarity to the query is more important than diversity, while low represents
situations where diversity is more important than similarity. After some experiments we xed
value of to 0:4.</p>
          <p>To generate a nal document ranking, we rst rank the clusters based on the best Rtext among
all images within a cluster and then we select one image from each cluster in descending order
of cluster ranks repeatedly till all clusters are completely exhausted. To select an image from a
cluster, we just choose the image having minimum Rf in that cluster. In this way we generate a
nal document ranking containing more visual diversity than the initial ranking.
3</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Results and Discussions</title>
      <p>
        The performances of our runs against the values obtained by the initial rankings given as input to
each approach are reported in Tables 1, 2, and 3. Since the ultimate aim of this year's campaign
was to promote diversity in document ranking, we report and discuss only the values relative to
cluster recall [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. This measure is de ned as follow:
      </p>
      <p>CR@k = [ik=1subtopics(di)
nA
(14)
where the function subtopics(di) returns those subtopics or facets that are covered by document
di, nA is the total number of subtopics for the given topic and k is a rank position. The evaluation
of our runs using the usual IR measures such as precision, recall, MAP, etc, can be found in the
campaign summary released by the organizers. Note that the results reported for Glasgow run 2
di er from the one reported by the ImageClefPhoto's organizers. The submitted run, in fact,
contained a mistake being thus wrongly evaluated. We corrected this problem after the submission
and the organizers kindly agreed to evaluate the correct rankings against the ground truth for the
subtopics. This evaluation is reported in Table 2, together with the evaluation of the initial ranking
obtained with Okapi (which has been given as input to our approach). The values reported in
the table, however, cannot be fairly compared to the one obtained in the other runs or by other
participants, since those rankings provided by the QPRP were not included in the pool that have
been manually judged by the ImageClefPhoto assessors.</p>
      <sec id="sec-3-1">
        <title>Glasgow 2 baseline CR@5</title>
        <p>
          The performances obtained by Glasgow run 1 (the MMR approach) are superior to the one
obtained by the relative initial ranking. However, the empirical evaluation shows that our runs
combining MMR and two di erent clustering approaches (Glasgow run 3 and Glasgow run 4)
out-perform both the baseline and the original MMR approach. Also the approach based on the
quantum probability framework (QPRP, Glasgow run 2), obtains signi cantly better values
of s-recall with respect to the initial ranking obtained employing the Okapi retrieval schema.
Conversely, the results obtained in Glasgow run 5 are not consistently superior to the relative
baseline, although improvements are obtained for cluster recall at ranks less than 30. We still have
to examine if this happens because of a ow in our methodology or because the way we combine
visual features and text statistics is not e ective for diversify document rankings. However, we
hypothesize that the poor results obtained by our visual diversity run suggest that the diversity
in the ImageClefPhoto 2009 dataset is preeminently topical rather than visual.
In this paper we have presented a summary of our participation in ImageClefPhoto 2009 evaluation
campaign. The methods we proposed aimed to obtain diversi ed ranking in function of topical,
semantical and visual diversity. We have proposed four new approaches (one of them combined
both visual and textual feature) and we have tested a well known method for combining relevance
and document dissimilarity (MMR). The obtained results are promising, in particular for the
runs exploiting only textual statistics. However, we recognize the need to perform a through
evaluation of our approaches employing a series of evaluation measures tailored to capture diversity
and novelty citeClarke:2008,Zhai:2003. This will be subject of future work. Moreover, for each
approach proposed in this paper, we have identi ed the following future directions of research:
Semantic clustering: In our clustering and MMR approaches, the documents were
represented by the Okapi weights associated to the terms occurring in each document. We
claim that in doing so the semantic of each documents has been taken in consideration when
diversity of the document ranking was required. Although we obtained a good level of
diversity, e ectiveness might be improved by employing a di erent document representation. In
particular, we propose to employ a document representation based on co-occurrence
statistics and adopt the methodology proposed in [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] to capture semantic relationships between
documents so as to increase document diversity at semantic level.
        </p>
        <p>
          QPRP: although the run based on the QPRP framework provides satisfying results in terms
of diversity, the research on the framework itself is still at its early stage of development.
Several research questions have been stated during this paper and in [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]. The most stringent
ones relate to the estimation of the initial probability (and in particular of the complex
amplitudes) or alternatively to the approximation of the interference term.
        </p>
        <p>VisualDiversity: In Glasgow run 5 we combined visual and textual features to
rerank the results retrieved using the Okapi schema, aiming at diversifying the document
rankings. In particular, during the re-ranking process, we used factor analysis, bi-clustering
and successively from each bi-cluster we selected an image based both on its relevance and
communality. Although the results from the proposed approach are not completely satisfying
(especially for cluster recall at ranks higher than 30), there might be some chances to improve
performances. A possible solution would be to employ di erent criteria for selecting an image
from a bi-cluster. In particular, the information from re-ranked images can be exploited to
select the next best image from a bi-cluster.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgments</title>
      <p>The authors are grateful to the ImageClefPhoto 2009 organizers, and in particular to Monica L.
Paramita, for the additional evaluation of the initial rankings (baselines) and for the results from
the corrected Glasgow run 2.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J.</given-names>
            <surname>Carbonell</surname>
          </string-name>
          and J.
          <string-name>
            <surname>Goldstein</surname>
          </string-name>
          .
          <article-title>The use of MMR, diversity-based reranking for reordering documents and producing summaries</article-title>
          .
          <source>In SIGIR '98: Proc. 21st Int. ACM SIGIR Conf. on Research and Development in IR</source>
          , pages
          <volume>335</volume>
          {
          <fpage>336</fpage>
          , New York, NY, USA,
          <year>1998</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>C. L.A.</given-names>
            <surname>Clarke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kolla</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. V.</given-names>
            <surname>Cormack</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Vechtomova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ashkan</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.</surname>
          </string-name>
          <article-title>Buttcher, and I. MacKinnon. Novelty and diversity in information retrieval evaluation</article-title>
          .
          <source>In SIGIR '08: Proc. 31st Int. ACM SIGIR Conf. on Research and Development in IR</source>
          , pages
          <volume>659</volume>
          {
          <fpage>666</fpage>
          , New York, NY, USA,
          <year>2008</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>T.</given-names>
            <surname>Deselaers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Gass</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Dreuw</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Ney</surname>
          </string-name>
          .
          <article-title>Jointly optimising relevance and diversity in image retrieval</article-title>
          .
          <source>In CIVR '09: Proc. 8th ACM Int. Conf. on Image and Video Ret</source>
          .,
          <string-name>
            <surname>Santorini</surname>
          </string-name>
          , Greece,
          <year>July 2009</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M.</given-names>
            <surname>Eisenberg</surname>
          </string-name>
          and
          <string-name>
            <given-names>C.</given-names>
            <surname>Barry</surname>
          </string-name>
          .
          <article-title>Order e ects: A study of the possible in uence of presentation order on user judgments of document relevance</article-title>
          .
          <source>JASIS</source>
          ,
          <volume>39</volume>
          (
          <issue>5</issue>
          ):
          <volume>293</volume>
          {
          <fpage>300</fpage>
          ,
          <year>1988</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.</given-names>
            <surname>Ferecatu</surname>
          </string-name>
          and
          <string-name>
            <given-names>H.</given-names>
            <surname>Sahbi</surname>
          </string-name>
          . Telecom paristech at imageclefphoto 2008:
          <article-title>Bi-modal text and image retrieval with diversity enhancement</article-title>
          .
          <source>In Working Notes for the CLEF 2008 workshop</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Halvey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Punitha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Hannah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Villa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Hopfgartner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Goyal</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Jose</surname>
          </string-name>
          . Diversity, assortment, dissimilarity, variety
          <article-title>: A study of diversity measures using low level features for video retrieval</article-title>
          .
          <source>In ECIR '09: Proc. 31st Eu. Conf. on Research on Advances in IR</source>
          , pages
          <volume>126</volume>
          {
          <fpage>137</fpage>
          , Berlin, Heidelberg, Germany,
          <year>2009</year>
          . Springer-Verlag.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Kluger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Basri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. T.</given-names>
            <surname>Chang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Gerstein</surname>
          </string-name>
          .
          <article-title>Spectral biclustering of microarray data: coclustering genes and conditions</article-title>
          .
          <source>Genome Res</source>
          ,
          <volume>13</volume>
          (
          <issue>4</issue>
          ):
          <volume>703</volume>
          {
          <fpage>716</fpage>
          ,
          <string-name>
            <surname>April</surname>
          </string-name>
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>L.</given-names>
            <surname>Lebart</surname>
          </string-name>
          .
          <source>Multivariate Descriptive Statistical Analysis (Probability &amp; Mathematical Statistics)</source>
          . John Wiley &amp; Sons Inc, May
          <year>1984</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>T.</given-names>
            <surname>Leelanupab</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Hopfgartner</surname>
          </string-name>
          , and
          <string-name>
            <surname>J. M. Jose.</surname>
          </string-name>
          <article-title>User centred evaluation of a recommendation based image browsing system</article-title>
          .
          <source>In IICAI '09: Proc. 4th Indian Int. Conf. on AI</source>
          ,
          <year>December 2009</year>
          . to appear.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>S.C.</given-names>
            <surname>Madeira</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.L.</given-names>
            <surname>Oliveira</surname>
          </string-name>
          .
          <article-title>Biclustering algorithms for biological data analysis: a survey</article-title>
          .
          <source>Computational Biology and Bioinformatics</source>
          , IEEE/ACM Transactions on,
          <volume>1</volume>
          (
          <issue>1</issue>
          ):
          <volume>24</volume>
          {
          <fpage>45</fpage>
          ,
          <string-name>
            <surname>Jan</surname>
          </string-name>
          .- March
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>F.</given-names>
            <surname>Radlinski</surname>
          </string-name>
          and
          <string-name>
            <given-names>S.</given-names>
            <surname>Dumais</surname>
          </string-name>
          .
          <article-title>Improving personalized web search using result diversi cation</article-title>
          .
          <source>In SIGIR '06: Proc. 29th Int. ACM SIGIR Conf. on Research and Development in IR</source>
          , pages
          <volume>691</volume>
          {
          <fpage>692</fpage>
          , New York, NY, USA,
          <year>2006</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>S.</given-names>
            <surname>Robertson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Walker</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. M. Beaulieu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Gatford</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Payne</surname>
          </string-name>
          .
          <article-title>Okapi at trec-4</article-title>
          . In NIST Special Publ.
          <fpage>500</fpage>
          -
          <lpage>236</lpage>
          :
          <article-title>The 4th Text REtrieval Conf</article-title>
          .
          <source>(TREC-4)</source>
          , pages
          <fpage>73</fpage>
          {
          <fpage>96</fpage>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>S. E. Robertson.</surname>
          </string-name>
          <article-title>The probability ranking principle in IR</article-title>
          , pages
          <volume>281</volume>
          {
          <fpage>286</fpage>
          . Morgan Kaufmann Publishers Inc., San Francisco, CA, USA,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>T.</given-names>
            <surname>Urruty</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Hopfgartner</surname>
          </string-name>
          , Hannah D.,
          <string-name>
            <given-names>D.</given-names>
            <surname>Elliott</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Jose</surname>
          </string-name>
          .
          <article-title>Supporting aspect-based video browsing - analysis of a user study</article-title>
          .
          <source>In CIVR '09: Proc. 8th ACM Int. Conf. on Image and Video Ret</source>
          .,
          <string-name>
            <surname>Santorini</surname>
          </string-name>
          , Greece,
          <year>July 2009</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhu</surname>
          </string-name>
          .
          <article-title>Portfolio theory of information retrieval</article-title>
          .
          <source>In SIGIR '09: Proc. 32nd Int. ACM SIGIR Conf. on Research and Development in IR</source>
          , pages
          <volume>115</volume>
          {
          <fpage>122</fpage>
          , New York, NY, USA,
          <year>2009</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>C.X.</given-names>
            <surname>Zhai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. W.</given-names>
            <surname>Cohen</surname>
          </string-name>
          , and
          <string-name>
            <surname>J.</surname>
          </string-name>
          <article-title>La erty. Beyond independent relevance: methods and evaluation metrics for subtopic retrieval</article-title>
          .
          <source>In SIGIR '03: Proc. 26th Int. ACM SIGIR Conf. on Research and Development in IR</source>
          , pages
          <volume>10</volume>
          {
          <fpage>17</fpage>
          , New York, NY, USA,
          <year>2003</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>G.</given-names>
            <surname>Zuccon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Azzopardi</surname>
          </string-name>
          , and
          <string-name>
            <surname>C.J. van Rijsbergen.</surname>
          </string-name>
          <article-title>The quantum probability ranking principle</article-title>
          .
          <source>In ICTIR '09: Proc. 2nd Int. Conf. on the Theory of IR</source>
          ,
          <year>2009</year>
          . to appear.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>G.</given-names>
            <surname>Zuccon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. A.</given-names>
            <surname>Azzopardi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Rijsbergen</surname>
          </string-name>
          .
          <article-title>Semantic spaces: Measuring the distance between di erent subspaces</article-title>
          .
          <source>In QI '09: Proc. 3rd Int. Symp. on Quantum Interaction</source>
          , pages
          <volume>225</volume>
          {
          <fpage>236</fpage>
          , Berlin, Heidelberg,
          <year>2009</year>
          . Springer-Verlag.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>