<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>RECOD @ Placing Task of MediaEval 2016: A Ranking Fusion Approach for Geographic-Location Prediction of Multimedia Objects</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Javier A. V. Muñoz</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lin Tzy Li</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ícaro C. Dourado</string-name>
          <email>icaro.dourado@students.ic.unicamp.br</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Keiller Nogueira</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Samuel G. Fadel</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Otávio A. B. Penatti</string-name>
          <email>o.penatti@samsung.com</email>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jurandy Almeida</string-name>
          <email>jurandy.almeida@unifesp.br</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Luís A. M. Pereira</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rodrigo T. Calumby</string-name>
          <email>rtcalumby@ecomp.uefs.br</email>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jefersson A. dos Santos</string-name>
          <email>jeferssong@dcc.ufmg.br</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ricardo da S. Torres</string-name>
          <email>rtorresg@ic.unicamp.br</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, Universidade Federal de Minas Gerais</institution>
          ,
          <addr-line>UFMG</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>GIBIS Lab, Institute of Science and Technology, Federal University of Sa~o Paulo</institution>
          ,
          <addr-line>UNIFESP</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>RECOD Lab, Institute of Computing, University of Campinas</institution>
          ,
          <addr-line>UNICAMP</addr-line>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>SAMSUNG Research Institute</institution>
          <country country="BR">Brazil</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>University of Feira de Santana</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2016</year>
      </pub-date>
      <fpage>20</fpage>
      <lpage>21</lpage>
      <abstract>
        <p>We describe the approach proposed by the RECOD team for the estimation-based sub-task of Placing Task at MediaEval 2016. Our approach uses genetic programming (GP) to combine ranked lists de ned in terms of textual and visual descriptors to automatically assign geographic locations to images and videos.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>
        By having multimedia content annotated with geographic
information, we can provide richer services for users such
as placing information on maps and providing geographic
searches. Since 2011, the Placing Task [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] at MediaEval
has been challenging participants to assign the geographical
locations to images and videos automatically.
      </p>
      <p>
        Here we present our approach for the estimation-based
subtask of the Placing Task 2016. It combines textual,
audio, and/or visual descriptors by applying rank
aggregation and ranked list density analysis to combine
multimodal information encoded in ranked lists. We evaluated
new features and a genetic programming (GP) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] approach
for multimodal geocoding. GP provides a good framework
for modeling optimization problems even when the variables
are functions. We applied combinations of rank
aggregation methods de ned by a GP framework. The idea is to
automatically select a set of suitable features and rank
aggregation functions that yield the best result according to
a given tness function. Previous works [
        <xref ref-type="bibr" rid="ref16 ref8">8, 16</xref>
        ] have shown
that combining rank aggregated lists and rank aggregation
functions [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] yields very e ective results.
      </p>
    </sec>
    <sec id="sec-2">
      <title>PROPOSED APPROACH</title>
      <p>
        Our approach estimates location based on rank
aggregation of a multitude of ranked lists and their top-K density
analysis [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. We extracted a large set of features from the
data, derived their ranked lists, and combined them using
rank aggregation methods which in turn are selected and
fused by the GP-based framework proposed in [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]
(GPAgg).
      </p>
      <p>
        For evaluation purposes in the training phase (as in
2015 [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]) we split the whole training set into two parts: (i)
a validation set; and (ii) a sub-training set. The validation
set has 4,674 images and 903 videos, while the sub-training
set has 12,935 videos and 4,188,484 images.
2.1
      </p>
    </sec>
    <sec id="sec-3">
      <title>Features</title>
      <p>
        Textual . The title, description, and tags of photos/videos
were concatenated as a single eld. The text was stemmed
and stopwords were removed. We used BM25, TF-IDF
(cosine), information-based similarity (IBSimilarity - IBS)
and language modelling similarity (LMDirichletSimilarity
LMD), which are similarity measures implemented in the
Lucene package [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
      <p>
        Audio/Visual . For visual place recognition of images, we
used the provided features: edgehistogram (EHD),
scalablecolor (SCD), GIST (static feature), cedd, col, jhist, and
tamura. We also extracted BIC [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] and deep-learning based
features (GoogleNet) [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. For video data, due to time and
infrastructure constraints for extracting features for new
videos in test set, we were only able to use features of
histograms of motion patterns (HMP) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
2.2
      </p>
    </sec>
    <sec id="sec-4">
      <title>GP-based Rank aggregation &amp; Geocoding</title>
      <p>
        We used the full training set as geo-pro les and each test
item was compared to the whole training set for each feature
independently. For a given test item, a ranked list for each
feature was generated. Then, these ranked lists were
aggregated through the GP-Agg framework [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Given the
improvements obtained in the last year by applying the ranked
list density analysis (RLDA) over the nal combined ranked
list [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], we explored the idea of including this RLDA
function into the GP-Agg framework: both in the tness function
evaluation and in the tree structure of GP's individuals (as
an unary and binary operator). In this way, the GP-Agg
framework was able to apply the RLDA density function in
previous steps of the combination, which improved the
results. Including the RLDA density function in the set of
rank aggregation functions turns it in the unique function
that uses geo-localization in the combination, whereas the
other classic approaches only use similarity or rank position.
      </p>
      <p>
        The GP-Agg method uses genetic programming to
combine a set of methods for rank aggregation in an
agglomerative way, in order to improve the results of the isolated
methods [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. We used this method to combine the
textual and visual ranked lists generated for various descriptors.
This method was chosen because in [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] the authors showed
that GP-Agg produced better or equal results than the best
supervised technique in a wide range of rank aggregation
techniques (supervised and unsupervised). Moreover, it
required a reasonable time for training (a couple of hours), and
it was relatively fast to apply the best individual (discovered
function) on the test set.
      </p>
      <p>
        The GP-Agg method was trained using 400 queries from
the validation set (randomly chosen) and their ranked lists.
We stopped the evolution process at the 20th generation.
We used the tness function, genetic operators, and rank
aggregation techniques that yielded the best results in [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ].
The GP-Agg parameters are shown in Table 1.
      </p>
      <p>For the training phase of GP-Agg, an element of a ranked
list was considered relevant if it is located no farther than
1 km from the ground truth location of the query element.
The best individuals discovered in the training phase were
applied to combine the ranked lists of test set. The
predicted lat/long for an test-set element is obtained by picking
the lat/long of the rst element of its respective combined
ranked list (which could be the single result of RLDA).</p>
      <p>
        Among the di erent tness functions tested, the best
results (more precise) were achieved with the WAS [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and
FFP1 [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
    </sec>
    <sec id="sec-5">
      <title>3. OUR SUBMISSIONS &amp; RESULTS</title>
      <p>Based on parameters of our best results in the evaluation
phase, our submissions were con gured as shown in Table 2.
For each Run, it shows the combination function applied on
the test set, some of them discovered by the GP-Agg
framework and others we choose based on experimental results, as
it will be explained in next paragraphs. Runs 1 and 4 were
based on textual-only descriptors, Run 2 was visual-only,
and Run 3 was our multi-modal submission. For textual
and multimodal runs, we set the K-top parameter of RLDA
at 5, and for the visual ones at 100. No extra crawled
material or gazetteers were used in our submissions.</p>
      <p>In the case of photos, for Runs 1-3, we used the GP-Agg
framework to discover a semi-optimal combination of rank
aggregation functions and ranked lists. For the Run 4, we
used the con guration with which we got the best results
in the past year. Results in Table 3 show slight
improvements at including RLDA in GP-Agg framework (Run 1 vs.
Run 4).</p>
      <p>As shown in Table 3, most of our best results were from
Run 1, where GP-Agg applied rank aggregation for textual
descriptors. For visual run (Run 2), combining rank
aggregation functions and di erent visual features, including
GoogleNet, improved our results over last year's.</p>
      <p>The results for videos are presented in Table 4. As in
the case of images, the best video results were obtained by
applying GP-Agg over textual ranked lists. For Run 1 and
Run 3, we combined the ranked lists using individual found
by GP-Agg. We were unable to use the GP-Agg for Run
2 (visual) because we had only the HMP descriptor, thus
we applied RLDA over it. In Run 4 we used only the best
textual descriptor, since the best con guration of past year
decreased the precision of video results. We can observe
in Table 4 signi cant improvements in the combination of
textual ranked lists through GP-Agg framework over the
best textual descriptor (Run 1 vs. Run 4).</p>
      <p>In both cases, for photos and videos, results obtained show
no gain in the combination of textual and visual information
(Run 3) through GP-Agg. It is explained due to the fact that
the visual ranked list has signi cantly lower precision than
textual ranked lists, and it is hard to nd complementary
between these types of lists by just applying classical rank
aggregation methods.</p>
    </sec>
    <sec id="sec-6">
      <title>4. FUTURE WORK</title>
      <p>
        We plan to evaluate more textual and visual descriptors
and give them as input to GP-Agg to select descriptors
and rank aggregation methods. For example: (a) a
textual descriptor that combines graph representation [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] with
a framework for graph-to-vector synthesis [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]; (b)
applying results from works that tackle the problem of visual
place recognition [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] and of geolocation with Convolutional
Neural Networks [
        <xref ref-type="bibr" rid="ref17 ref2">2, 17</xref>
        ]; (c) extracting visual features using
GoogleNet and BIC for video frames.
We thank FAPESP, CNPq, CAPES, and Samsung.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J.</given-names>
            <surname>Almeida</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. J.</given-names>
            <surname>Leite</surname>
          </string-name>
          , and R. da Silva Torres.
          <article-title>Comparison of video sequences with histograms of motion patterns</article-title>
          .
          <source>In ICIP</source>
          , pages
          <volume>3673</volume>
          {
          <fpage>3676</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>R.</given-names>
            <surname>Arandjelovic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Gronat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Torii</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Pajdla</surname>
          </string-name>
          , and
          <string-name>
            <surname>J. Sivic.</surname>
          </string-name>
          <article-title>NetVLAD: CNN architecture for weakly supervised place recognition</article-title>
          .
          <source>In Computer Vision and Pattern Recognition (CVPR).</source>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>J.</given-names>
            <surname>Choi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Hau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O. V.</given-names>
            <surname>Laere</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B.</given-names>
            <surname>Thomee</surname>
          </string-name>
          .
          <article-title>The Placing Task at MediaEval 2016</article-title>
          .
          <source>In Working Notes Proc. MediaEval Workshop</source>
          , Hilversum, Netherlands, Oct.
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>W.</given-names>
            <surname>Fan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. A.</given-names>
            <surname>Fox</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Pathak</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Wu</surname>
          </string-name>
          .
          <article-title>The e ects of tness functions on genetic programming-based ranking discovery for web search</article-title>
          .
          <source>Journal of the American Society for Information Science and Technology</source>
          ,
          <volume>55</volume>
          (
          <issue>7</issue>
          ):
          <volume>628</volume>
          {
          <fpage>636</fpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J. R.</given-names>
            <surname>Koza</surname>
          </string-name>
          .
          <article-title>Genetic Programming: On the Programming of Computers by Means of Natural Selection</article-title>
          . MIT Press, Cambridge, MA, USA,
          <year>1992</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>L. T.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Mun</surname>
          </string-name>
          <article-title>~oz</article-title>
          , J. Almeida,
          <string-name>
            <given-names>R. T.</given-names>
            <surname>Calumby</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O. A.</given-names>
            <surname>Penatti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. C.</given-names>
            <surname>Dourado</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Nogueira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. R. M.</given-names>
            <surname>Junior</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. A.</given-names>
            <surname>Pereira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. C.</given-names>
            <surname>Pedronette</surname>
          </string-name>
          , et al.
          <article-title>Recod@ placing task of mediaeval 2015</article-title>
          .
          <source>In Working Notes Proc. MediaEval Workshop</source>
          , volume
          <volume>15</volume>
          , page 2.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>L. T.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. C. G.</given-names>
            <surname>Pedronette</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Almeida</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O. A. B.</given-names>
            <surname>Penatti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. T.</given-names>
            <surname>Calumby</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R. da Silva</given-names>
            <surname>Torres</surname>
          </string-name>
          .
          <article-title>A rank aggregation framework for video multimodal geocoding</article-title>
          .
          <source>Mult. Tools and App.</source>
          , pages
          <volume>1</volume>
          {
          <fpage>37</fpage>
          ,
          <year>2013</year>
          . http://dx.doi.org/10.1007/s11042-013-1588-4.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>L. T.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O. A. B.</given-names>
            <surname>Penatti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Almeida</surname>
          </string-name>
          , G. Chiachia, R. T. Calumby,
          <string-name>
            <given-names>P. R. M.</given-names>
            <surname>Junio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. C. G.</given-names>
            <surname>Pedronette</surname>
          </string-name>
          , and R. da
          <string-name>
            <given-names>S.</given-names>
            <surname>Torres</surname>
          </string-name>
          .
          <article-title>Multimedia geocoding: The RECOD 2014 approach</article-title>
          .
          <source>In Working Notes Proc. MediaEval Workshop</source>
          , volume
          <volume>1263</volume>
          , page 2,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A.</given-names>
            <surname>Lucene</surname>
          </string-name>
          . Apache Lucene Core. Web Site. http://lucene.apache.org/core/. As of Sept.
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>A.</given-names>
            <surname>Schenker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Bunke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Last</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Kandel</surname>
          </string-name>
          .
          <article-title>Graph-Theoretic Techniques for Web Content Mining</article-title>
          . World Scienti c Publishing Co., Inc., NJ, USA,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>F. B.</given-names>
            <surname>Silva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Tabbone</surname>
          </string-name>
          , and R. d. S. Torres.
          <article-title>BoG: A New Approach for Graph Matching</article-title>
          .
          <source>In ICPR</source>
          , pages
          <volume>82</volume>
          {
          <fpage>87</fpage>
          . IEEE, Aug.
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>R. d. O.</given-names>
            <surname>Stehling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Nascimento</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Falca</surname>
          </string-name>
          <article-title>~o. A compact and e cient image retrieval approach based on border/interior pixel classi cation</article-title>
          .
          <source>In Proceedings of the 11th International Conference on Information and Knowledge Management</source>
          ,
          <source>CIKM '02</source>
          , pages
          <fpage>102</fpage>
          {
          <fpage>109</fpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>C.</given-names>
            <surname>Szegedy</surname>
          </string-name>
          , W. Liu,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Jia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Sermanet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Reed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Anguelov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Erhan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Vanhoucke</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Rabinovich</surname>
          </string-name>
          .
          <article-title>Going deeper with convolutions</article-title>
          .
          <source>In Computer Vision and Pattern Recognition (CVPR)</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A.</given-names>
            <surname>Torii</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Arandjelovic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Sivic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Okutomi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Pajdla</surname>
          </string-name>
          .
          <volume>24</volume>
          /
          <article-title>7 place recognition by view synthesis</article-title>
          .
          <source>In Proceedings of the IEEE Conference on Computer Vision</source>
          and Pattern Recognition,
          <source>CVPR '15</source>
          , pages
          <year>1808</year>
          {
          <year>1817</year>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Vargas</surname>
          </string-name>
          <article-title>Mun~oz</article-title>
          , R. da Silva Torres, and
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Goncalves</surname>
          </string-name>
          .
          <article-title>A soft computing approach for learning to aggregate rankings</article-title>
          .
          <source>In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management</source>
          ,
          <source>CIKM '15</source>
          , pages
          <fpage>83</fpage>
          {
          <fpage>92</fpage>
          , New York, NY, USA,
          <year>2015</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>M. N.</given-names>
            <surname>Volkovs</surname>
          </string-name>
          and
          <string-name>
            <given-names>R. S.</given-names>
            <surname>Zemel</surname>
          </string-name>
          .
          <article-title>CRF framework for supervised preference aggregation</article-title>
          .
          <source>In Proceedings of the 22Nd ACM International Conference on Conference on Information; Knowledge Management, CIKM '13</source>
          , pages
          <fpage>89</fpage>
          {
          <fpage>98</fpage>
          , New York, NY, USA,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>T.</given-names>
            <surname>Weyand</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Kostrikov</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Philbin</surname>
          </string-name>
          .
          <article-title>Planet - photo geolocation with convolutional neural networks</article-title>
          .
          <source>In European Conference on Computer Vision (ECCV)</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>