<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Rank Fusion and Multimodal Per-topic Adaptiveness for Diverse Image Retrieval</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Rodrigo T. Calumby</string-name>
          <email>rtcalumby@ecomp.uefs.br</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Iago B. A. C. Araujo</string-name>
          <email>ibacaraujo@ecomp.uefs.br</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Felipe S. Cordeiro</string-name>
          <email>fscordeiro@ecomp.uefs.br</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fabiana C. Bertoni</string-name>
          <email>fabianabertoni@ecomp.uefs.br</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sérgio Canuto</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fabiano Belém</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marcos A. Gonçalves</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Icaro Dourado</string-name>
          <email>icaro.dourado@ic.unicamp.br</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Javier A. V. Munoz</string-name>
          <email>javier.munoz@ic.unicamp.br</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lin Tzy Li</string-name>
          <email>lintzyli@ic.unicamp.br</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ricardo da S. Torres</string-name>
          <email>rtorres@ic.unicamp.br</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Federal University of Minas Gerais</institution>
          ,
          <country country="BR">Brazil</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Campinas</institution>
          ,
          <country country="BR">Brazil</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Feira de Santana</institution>
          ,
          <country country="BR">Brazil</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2017</year>
      </pub-date>
      <fpage>13</fpage>
      <lpage>15</lpage>
      <abstract>
        <p>This paper presents the MultiBrasil team experience in the Retrieving Diverse Social Images Task at MediaEval 2017. The teams were required to develop a diversification approach for social image retrieval, enhanced with visual summarization. Our proposal for relevance improvement relies on text and credibility-based reranking and rank aggregation. For diversification, we use diversity-oriented reranking and also propose a clustering-based query-adaptive diversity promotion approach. We applied Genetic Programming and Genetic Algorithm-based approaches for combining textual, visual, and user credibility information.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        Beyond relevance, for many search tasks the coverage of diferent
query aspects/intents in the retrieved set has great impact on
fuliflling user needs [
        <xref ref-type="bibr" rid="ref11 ref5">5, 11</xref>
        ]. Promoting diversity has been shown to
positively impact the user search experience specially for
ambiguous, underspecified, and visual summarization queries [
        <xref ref-type="bibr" rid="ref1 ref2 ref3 ref4">1–4</xref>
        ].
      </p>
      <p>
        However, tackling the balance between relevance and diversity
is still a great challenge. The Retrieving Diverse Social Images Task
2017 [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] models it into a general ad-hoc image retrieval challenge
in which systems are supposed to handle complex and
generalpurpose multi-concept queries. This paper describes the MultiBrasil
team proposals based on reranking and query-adaptive diversity
promotion boosted by multimodal rank fusion.
      </p>
    </sec>
    <sec id="sec-2">
      <title>PROPOSED APPROACH</title>
      <p>The proposed approach consists in improving the original Flickr
ranking for relevance-based filtering followed by a diversification
step with diversity-oriented reranking or query-adaptive
clusteringbased summarization. By improving the original ranking and
keeping only the top-ranked items, we intend to construct a more
relevant subset, which may reduce data noise for the subsequent
diversity promotion step. In turn, for diversification, we have evaluated
two approaches: (i) relevance-diversity balancing via reranking; and
(ii) representative image selection via query-adaptive clustering
metric learning.
2.1</p>
    </sec>
    <sec id="sec-3">
      <title>Relevance Enhancement and Filtering</title>
      <p>For improving the original ranking of the images from each topic,
we explored textual and credibility-based ranking. For text ranking,
we used the original topic terms as query and for each flickr image
obtained for that topic, the title, description, and tag data were
concatenated before preprocessing (see Section 3.2). For
credibilitybased ranking, the user credibility scores were used as relevance of
her uploaded images (see Section 3.3).</p>
      <p>
        Additionally, for the aggregation of multiple rankings, we applied
the Genetic Programming approach from GPAgg [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], which
combines several well-known rank aggregation methods. This method
was trained using the development data and integrated order-based
(MRA [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], RRF [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], and BordaCount [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]) and score-based
(CombMIN, CombMAX, CombSUM, ComMED, CombANZ [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], and
RLSim [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]) rank fusion methods.
      </p>
      <p>As a relevance-based filtering, from the final aggregated list, the
top-ranked images were selected as input for the diversification step.
We evaluated multiple cutof points with best results achieved by
using only the 200-top images (run 1) and the 50-top images (runs 2
to 5). Considering the diversification approaches, keeping more than
50 images degraded the final ranking by pushing more non-relevant
images to be reranked and consequently also degrading diversity.
Since we did not visually reranked the images for run 1, a deeper
reranking had to be considered to allow better diversification, which
in turn may negatively impact final relevance.
2.2</p>
    </sec>
    <sec id="sec-4">
      <title>Diversification</title>
      <p>
        For diversification, we tested two methods. First, a reranking method
following the traditional Maximal Marginal Relevance [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] approach
considering multiple features. For this method, the feature
combination is performed by averaging the individual similarity scores.
The relevance-diversity trade-of adjustment was selected based on
the best results on the development set.
      </p>
      <p>Alternatively, our query-adaptive clustering method seeks to
construct a more suitable clustering structure based on an
evolutionary weight adjustment metric learning method guided by
intrinsic clustering fitness evaluation. Here, we used a Genetic
Algorithm (GA) for per-query feature weight learning. For
combination compatibility, min-max normalization was applied for all
distance matrices.</p>
      <p>As an unsupervised optimization criteria, we evaluated the
clustering quality of the discovered functions by clustering the
50top images using agglomerative hierarchical clustering
(averagelinkage). For metric learning and final diversification, we used 25
clusters. The clusters were ranked according to their sizes in
descending order and intra-cluster sorting was applied using the
images original ranking positions. The final ranking was constructed
in a round-robin fashion from the final clusters.
3
3.1</p>
    </sec>
    <sec id="sec-5">
      <title>FEATURES</title>
    </sec>
    <sec id="sec-6">
      <title>Visual features</title>
      <p>We evaluated only the provided visual descriptors for run 1.
Additionally, the combination of each visual feature with the best text
similarity measures was evaluated for the remaining runs. For the
diversification step, the features were combined by averaging the
respective similarity scores.
3.2</p>
    </sec>
    <sec id="sec-7">
      <title>Text similarity</title>
      <p>
        For text-only and multimodal reranking (runs 2 to 5), the text-based
scores were computed as the similarity between the text vectors
associated with the images and the original query terms. As text
preprocessing, we applied stopwords removal1 and stemming [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
We evaluated several similarity scores: BM25, Cosine, Dice, Jaccard,
and TF-IDF. These scores were also evaluated for the diversification
procedure.
3.3
      </p>
    </sec>
    <sec id="sec-8">
      <title>Credibility</title>
      <p>The user credibility scores were individually used for ranking. In
this step, we first evaluated the ranking quality of each score
individually and finally aggregated the ranking for: bulkProportion,
meanImageTagClarity, meanTagRank, meanTagsPerPhoto,
meanTitleWordCounts, photoCount, uniqueTags and uploadFrequency.
Additionally, we have also created a ranking considering a linear
combination of such scores with the weights empirically adjusted,
which here we name linearCred.
4</p>
    </sec>
    <sec id="sec-9">
      <title>RUN CONFIGURATIONS</title>
      <p>We submitted 5 runs. In Table 1, GPAgg (in run 4) is the GP-based
rank aggregation of BM25, Jaccard, DICE, and linearCred rankings.
For all runs, we have used only the data provided by the task, and
the parameters and features used were chosen according to the best
results yielded from development set.
5</p>
    </sec>
    <sec id="sec-10">
      <title>RESULTS AND DISCUSSION</title>
      <p>Regarding the test set, the text-only run achieved superior
efectiveness than the visual-only run, which is a direct consequence of
the textual reranking and filtering of the input list. The visual-only
run handled more non-relevant images, which impacted the final
relevance and diversity.</p>
      <p>In general, all multimodal runs (3, 4, and 5) achieved similar
efectiveness, with run 3 being slightly superior. Nevertheless,
considering F1@20 for the test set, although run 3 outperformed run 5,
in a per-query analysis, we noticed that run 5 outperforms for ∼40%
of the topics. Moreover, the absolute diference between runs 3 and 5
was 0.0714 in terms of F1@20. Furthermore, even though runs 3
and 4 rely on the same diversification method, the average F1@20
diference was 0.0571, with run 4 outperforming for roughly 43% of
the topics. Hence, we highlight the opportunity for further
improvement with per-query adaptiveness, for instance, by selecting the
most suitable diversification model or even dynamically combining
them.
6</p>
    </sec>
    <sec id="sec-11">
      <title>CONCLUSIONS</title>
      <p>In our experiments, we have combined traditional reranking and
clustering-based diversification methods along with ranking fusion
and per-query adaptive feature fusion for clustering. Even though
traditional methods slightly outperformed our more complex
proposals, we found the results to be satisfactory. In this case, the small
training corpus is considered a challenging factor for the learning
strategies. Moreover, our results have shown the importance of
improving the original ranking for allowing better results, both in
terms of relevance and diversity. We have also shown that properly
selecting the most suitable diversification approach or integrating
alternative methods may lead to further improvements.</p>
    </sec>
    <sec id="sec-12">
      <title>ACKNOWLEDGMENTS</title>
      <p>The authors thank FAPESP, FAPEMIG, CAPES, CNPq, and FAPESB
(grant #4098/2016) for partially funding this work.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Fabiano</surname>
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Belém</surname>
          </string-name>
          , Carolina S. Batista, Rodrygo L. T. Santos,
          <string-name>
            <surname>Jussara M. Almeida</surname>
          </string-name>
          , and
          <string-name>
            <surname>Marcos</surname>
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Gonçalves</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Beyond Relevance: Explicitly Promoting Novelty and Diversity in Tag Recommendation</article-title>
          .
          <source>ACM Transactions on Intelligent Systems and Technology 7</source>
          ,
          <issue>3</issue>
          (
          <year>2016</year>
          ),
          <volume>26</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>26</lpage>
          :
          <fpage>34</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Jingwen</surname>
            <given-names>Bian</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yang</surname>
            <given-names>Yang</given-names>
          </string-name>
          , Hanwang Zhang, and
          <string-name>
            <surname>Tat-Seng Chua</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Multimedia Summarization for Social Events in Microblog Stream</article-title>
          .
          <source>IEEE Transactions on Multimedia 17</source>
          ,
          <issue>2</issue>
          (
          <year>2015</year>
          ),
          <fpage>216</fpage>
          -
          <lpage>228</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>R. T.</given-names>
            <surname>Calumby</surname>
          </string-name>
          , R. da
          <string-name>
            <given-names>S.</given-names>
            <surname>Torres</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Gonçalves</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Diversitydriven Learning for Multimodal Image Retrieval with Relevance Feedback</article-title>
          .
          <source>In Proceedings of the 21st IEEE International Conference on Image Processing</source>
          .
          <fpage>2197</fpage>
          -
          <lpage>2201</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>R. T.</given-names>
            <surname>Calumby</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Gonçalves</surname>
          </string-name>
          , and R. da
          <string-name>
            <given-names>S.</given-names>
            <surname>Torres</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>On Interactive Learning-to-rank for IR: Overview, recent advances, challenges, and directions</article-title>
          .
          <source>Neurocomputing</source>
          <volume>208</volume>
          (
          <year>2016</year>
          ),
          <fpage>3</fpage>
          -
          <lpage>24</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>R. T.</given-names>
            <surname>Calumby</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Gonçalves</surname>
          </string-name>
          , and R. da
          <string-name>
            <given-names>S.</given-names>
            <surname>Torres</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Diversitybased interactive learning meets multimodality</article-title>
          .
          <source>Neurocomputing</source>
          <volume>259</volume>
          (
          <year>2017</year>
          ),
          <fpage>159</fpage>
          -
          <lpage>175</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J.</given-names>
            <surname>Carbonell</surname>
          </string-name>
          and J.
          <string-name>
            <surname>Goldstein</surname>
          </string-name>
          .
          <year>1998</year>
          .
          <article-title>The use of MMR, diversity-based reranking for reordering documents and producing summaries</article-title>
          .
          <source>In Proceedings of the 21st Anual International ACM SIGIR Conference on Research and Development in Information Retrieval</source>
          .
          <fpage>335</fpage>
          -
          <lpage>336</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Gordon</surname>
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Cormack</surname>
          </string-name>
          ,
          <string-name>
            <surname>Charles L A Clarke</surname>
            , and
            <given-names>Stefan</given-names>
          </string-name>
          <string-name>
            <surname>Buettcher</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Reciprocal Rank Fusion Outperforms Condorcet and Individual Rank Learning Methods</article-title>
          .
          <source>In Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval</source>
          .
          <fpage>758</fpage>
          -
          <lpage>759</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Ronald</given-names>
            <surname>Fagin</surname>
          </string-name>
          , Ravi Kumar, and
          <string-name>
            <given-names>D.</given-names>
            <surname>Sivakumar</surname>
          </string-name>
          .
          <year>2003</year>
          .
          <article-title>Eficient Similarity Search and Classification via Rank Aggregation</article-title>
          .
          <source>In Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data</source>
          .
          <volume>301</volume>
          -
          <fpage>312</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Daniel</given-names>
            <surname>Carlos Guimarães Pedronette</surname>
          </string-name>
          and
          <string-name>
            <given-names>R. da S.</given-names>
            <surname>Torres</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Image Re-ranking and Rank Aggregation Based on Similarity of Ranked Lists</article-title>
          .
          <source>In Proceedings of the 14th International Conference on Computer Analysis of Images and Patterns - Volume Part I</source>
          .
          <fpage>369</fpage>
          -
          <lpage>376</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>M. F.</given-names>
            <surname>Porter</surname>
          </string-name>
          .
          <year>1980</year>
          .
          <article-title>An algorithm for sufix stripping</article-title>
          .
          <source>Program 14</source>
          ,
          <issue>3</issue>
          (
          <year>1980</year>
          ),
          <fpage>130</fpage>
          -
          <lpage>137</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Rodrygo</surname>
            <given-names>L. T.</given-names>
          </string-name>
          <string-name>
            <surname>Santos</surname>
            , Craig Macdonald, and
            <given-names>Iadh</given-names>
          </string-name>
          <string-name>
            <surname>Ounis</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Search Result Diversification</article-title>
          .
          <source>Foundations and Trends in Information Retrieval 9</source>
          ,
          <issue>1</issue>
          (
          <year>2015</year>
          ),
          <fpage>1</fpage>
          -
          <lpage>90</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Joseph</surname>
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Shaw</surname>
          </string-name>
          , Edward A.
          <string-name>
            <surname>Fox</surname>
          </string-name>
          , Joseph A.
          <string-name>
            <surname>Shaw</surname>
          </string-name>
          , and
          <string-name>
            <surname>Edward</surname>
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Fox</surname>
          </string-name>
          .
          <year>1994</year>
          .
          <article-title>Combination of Multiple Searches</article-title>
          .
          <source>In The Second Text REtrieval Conference (TREC-2)</source>
          .
          <fpage>243</fpage>
          -
          <lpage>252</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Javier</surname>
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Vargas</surname>
            , R. da
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Torres</surname>
          </string-name>
          , and
          <string-name>
            <surname>Marcos</surname>
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Gonçalves</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>A Soft Computing Approach for Learning to Aggregate Rankings</article-title>
          .
          <source>In Proceedings of the 24th ACM International Conference on Conference on Information and Knowledge Management.</source>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>H. P.</given-names>
            <surname>Young</surname>
          </string-name>
          .
          <year>1974</year>
          .
          <article-title>An axiomatization of Borda's rule</article-title>
          .
          <source>Journal of Economic Theory</source>
          <volume>9</volume>
          ,
          <issue>1</issue>
          (
          <year>1974</year>
          ),
          <fpage>43</fpage>
          -
          <lpage>52</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Maia</surname>
            <given-names>Zaharieva</given-names>
          </string-name>
          , Bogdan Ionescu, Alexandru Lucian Gînscă, Rodrygo L.T. Santos, and
          <string-name>
            <given-names>Henning</given-names>
            <surname>Müller</surname>
          </string-name>
          .
          <year>2017</year>
          . Retrieving Diverse Social Images at MediaEval 2017:
          <article-title>Challenges, Dataset and Evaluation</article-title>
          .
          <source>In Proceedings of the MediaEval 2017 Workshop</source>
          . Dublin, Ireland, Sept.
          <fpage>13</fpage>
          -
          <lpage>15</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>