<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The Xtrieval Framework at CLEF 2008: ImageCLEF Wikipedia MM task</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Thomas Wilhelm, Jens Kürsten, and Maximilian Eibl Chemnitz University of Technology Faculty of Computer Science, Dept.</institution>
          <addr-line>Computer Science and Media 09107 Chemnitz, Germany [ thomas.wilhelm</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper describes our participation at the ImageCLEF Wikipedia MM task. We used our Xtrieval framework for the preparation and execution of the experiments. We submitted 4 experiments in total. The results of these experiments were mixed. The text-only experiment scored second best with a mean average precision (MAP) of 0.2166. In combination with image based features the MAP dropped to 0.2138. With the addition of our thesaurus based query expansion it scored best with a MAP of 0.2195. Without query expansion and with the inclusion of the provided concepts the lowest MAP of 0.2048 was achieved, but there were 23 more relevant documents retrieved than in all 3 other experiments. Furthermore, the retrieval speed and comparison operations for vectors could be speeded up by implementing an interface to the PostgreSQL database.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Evaluation</kwd>
        <kwd>Cross-Language Information Retrieval</kwd>
        <kwd>Content-based Image Retrieval</kwd>
        <kwd>Query Expansion</kwd>
        <kwd>Experimentation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>For the ImageCLEF 2008: Wikipedia MM task a large set of data is supplied. Additional to the images and texts
a 120 dimensional feature vector and concepts were available. This encouraged us to find a solution for vector
storage, retrieval and comparison. Therefore we implemented an interface to index and search data into a
PostgreSQL database. Additional to indexing the supplied feature vector we also indexed the MPEG-7
descriptors computed by Caliph &amp; Emir.</p>
    </sec>
    <sec id="sec-2">
      <title>2 Experiment Setup</title>
      <p>
        We used our base system from the last year (see [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]) with the following setup: Apache Lucene,
customized analyzer with positional stopword1 removal and Snowball stemmer2. For the content-based image
retrieval we used Caliph &amp; Emir as described below.
      </p>
      <p>For query expansion a thesaurus was used. The parameters of the last year were further tuned to reduce unfitting
synonyms. As source for the thesauri we still use OpenOffice.org3.</p>
      <sec id="sec-2-1">
        <title>1 http://members.unine.ch/jacques.savoy/clef/index.html 2 http://snowball.tartarus.org/ 3 http://wiki.services.openoffice.org/wiki/Dictionaries</title>
        <p>
          The MPEG-7 features were calculated by Caliph &amp; Emir (see [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]). Contrary to our experiments of the last year
the MPEG-7 descriptors were not stored as text representations in Lucene4, but as vectors in a PostgreSQL5
database. PostgreSQL was chosen because it supports arrays as data types. In fact it is not necessary to know the
actual size of the arrays at design time. This approach is expected to achieve a much higher retrieval speed and it
is possible to use descriptors of Caliph &amp; Emir which has no string representation implemented (e.g. the
dominant color descriptor). The supplied 120 dimensional vectors are stored in the PostgreSQL database as well.
The computations of the distance measures were externalized into the PostgreSQL database by implementing the
algorithms as stored procedures in PL/pgSQL6. PL/pgSQL is an internal programming language of PostgreSQL
which adds support for additional logic to SQL such as control structures. The following algorithms are
implemented so far: cosine similarity, Dice coefficient, Euclidean metric, intersection, Jaccard similarity
coefficient. The main advantage is the reduction of extra round trips between our application and the database
server. On the other hand the speed could be reduced by the fact that PL/pgSQL is an interpreted language.
All topics were preprocessed ad-hoc to retrieve all needed resources to perform the experiments. Especially the
example images were retrieved and analyzed in advance.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3 Results</title>
      <p>The results in table 1 show that the text-only results can only be marginally improved by additional data. Just
experiment “cut-mix-concepts” retrieved more relevant documents in total, but with a lower MAP than all other
experiments.
Our baseline is “cut-txt-a”, which retrieved a total of 3111 relevant documents and reached a mean average
precision (MAP) of 0.2166. By adding the content-based image features, which consist of the 120 dimensional
feature vector and four MPEG-7 descriptors (scalable color, edge histogram, color layout and dominant color
descriptor), the MAP decreased to 0.2138. This is a hint towards the low visual similarity between relevant
pictures, which attract our attention during the relevance assessment process. After the preprocessing of the
topics with our query expansion our highest MAP of 0.2195 was achieved. The inclusion of the concepts scored
the worst MAP of 0.2048, but retrieved 23 more relevant documents than any other of our experiments.</p>
    </sec>
    <sec id="sec-4">
      <title>4 Future Work</title>
      <p>The PostgreSQL database support many other programming languages to implement stored procedures, i.e.
PL/Tcl, PL/Perl, PL/Python and PL/Java7. Because the retrieval system itself is written in Java it would be
suitable to use PL/Java as programming language.</p>
      <sec id="sec-4-1">
        <title>4 http://lucene.apache.org 5 http://www.postgresql.org 6 http://www.postgresql.org/docs/current/static/plpgsql.html 7 http://pgfoundry.org/projects/pljava/</title>
        <p>We also intend to implement our own content-based image retrieval algorithms on the basis of vectors stored in a
PostgreSQL database.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>T.</given-names>
            <surname>Wilhelm</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kürsten</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Eibl</surname>
          </string-name>
          , “
          <article-title>Experiments for the ImageCLEF 2007 Photographic Retrieval Task”</article-title>
          ; http://clef-campaign.org/2007/working_notes/wilhelmCLEF2007.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J.</given-names>
            <surname>Kürsten</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Wilhelm</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Eibl</surname>
          </string-name>
          , “
          <article-title>The xtrieval framework at clef 2007: Domain-specific track,” LNCS - Advances in Multilingual</article-title>
          and Multimodal Information Retrieval,
          <string-name>
            <given-names>C.</given-names>
            <surname>Peters</surname>
          </string-name>
          et al., ed., Berlin: Springer Verlag,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>T.</given-names>
            <surname>Wilhelm</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kürsten</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Eibl</surname>
          </string-name>
          , “
          <article-title>Extensible retrieval and evaluation framework: Xtrieval,” LWA 2008: Lernen - Wissen -</article-title>
          <string-name>
            <surname>Adaption</surname>
          </string-name>
          , Würzburg:
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M.</given-names>
            <surname>Lux</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Klieber</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Granitzer</surname>
          </string-name>
          , “Caliph &amp;
          <article-title>Emir: Semantics in Multimedia Retrieval</article-title>
          and Annotation,” 19th
          <source>International CODATA Conference</source>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>