<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Sabanci-Okan System at ImageClef 2013 Plant Identi cation Competition</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Berrin Yanikoglu</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Erchan Aptoula</string-name>
          <email>erchan.aptoula@okan.edu.tr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>S. Tolga Yildiran</string-name>
          <email>stolgayg@sabanciuniv.edu</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Okan University</institution>
          ,
          <addr-line>Istanbul, Turkey, 34959</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Sabanci University</institution>
          ,
          <addr-line>Istanbul, Turkey 34956</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2013</year>
      </pub-date>
      <fpage>2</fpage>
      <lpage>7</lpage>
      <abstract>
        <p>We describe our participation in the plant identi cation task of ImageClef 2013. We submitted one fully automatic run that uses di erent features for the uniform background (isolated leaves) and natural background (unconstrained photos) categories. Besides the category information, meta-data was only used in the natural background category. Our approach employs a variety of shape, texture and color descriptors. As in the previous years, we used shape and texture only for isolated leaves and observed them to be very e ective. Our system obtained the best results in this category with a score of 0.607 which is the inverse rank of the retrieved class, averaged over all queried photos and users. As for the natural background category, we used a limited approach using a restricted set of features that were extracted globally due to lack of time, and obtained a score of 0.181.</p>
      </abstract>
      <kwd-group>
        <kwd>Plant identi cation</kwd>
        <kwd>mathematical morphology</kwd>
        <kwd>support vector machines</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The ImageCLEF plant identi cation competition is organized every year since
2011 and aims to benchmark progress in the area of plant identi cation from
photographs [
        <xref ref-type="bibr" rid="ref2 ref3 ref4">3, 4, 2</xref>
        ]. Similar to the previous years, the competition in 2013
consisted of identifying images of plants that were captured by di erent means:
isolated leaves that were scanned or photographed on a uniform background
comprised the SheetAsBackground category. Parts or full images of a plant taken
on a natural background formed the NaturalBackground category. This category
was further sub-divided as ower, fruit, entire, leaf and stem categories.
      </p>
      <p>The organizers collected a large set of data from 250 di erent plant species
over the course of several years. Part of this data formed the training set that
was distributed to the participants along with the corresponding groundtruth.
The remaining data was shared with the participants in order to collect their
systems' responses, while the corresponding groundtruth was kept sequestered.</p>
      <p>
        Submitted systems were scored in terms of the inverse average rank of
the correct class for each submitted query. The details of this competition are
described in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
    </sec>
    <sec id="sec-2">
      <title>Overview of the System</title>
      <p>As a collaboration from two universities in Istanbul, we submitted a single fully
automatic run (Sabanci-Okan-Run1) that uses di erent features for the uniform
background (isolated leaves) category and natural background (unconstrained
photos) sub-categories. The category information was obtained from the
metadata of the query image. This handling of queries in di erent categories was done
to select the appropriate feature set for each group, but it also helped with the
handling of this large task.</p>
      <p>
        As in the previous years, we used shape and texture only for isolated
leaves and observed them to be very e ective. We had the best average score
overall last year in both the automatic and manual categories [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and this year
we obtained the best score on the isolated leaf (uniform background) category.
      </p>
      <p>For the natural background category, we used texture and color features
for the ower, fruit and entire sub-categories; shape and texture for the leaf
category; and only texture features for the stem category. The feature group
selection was done based on our previous experiences in this problem and in
order to increase generalization performance; it also helped reduce the time
spent in feature extraction.</p>
      <p>Meta-data was used only in the natural background category; speci cally
the month information was used to narrow down successfully the alternatives
for fruit and ower categories.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Segmentation</title>
      <p>
        Although segmentation is of crucial signi cance for content description, it
has been used in our system only for isolated leaves and stems. In contrast,
segmentation of photographs with a natural background is either not meaningful
(i.e. the whole picture contains some part of the plant) or not an easy problem
even though the background is well-de ned (e.g. a plant photographed with the
forest ground). In ImageCLEF 2012, we had used an approach where photos
were aggressively segmented to leave only a single leaf in the image, in order to
channel photographs to our successful isolated leaf recognition system [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. While
we believe that this is an interesting and complementary approach to one based
on local invariants, it is limited in its potential as much information is discarded.
This approach was skipped altogether this year due to lack of time.
      </p>
      <p>
        Isolated leaves usually possess an uniform background, often with uneven
illumination and sometimes shadow. Their segmentation has been conducted as
in the past, using edge preserving morphological simpli cation by means of area
attribute lters, followed by an adaptive threshold [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Moreover, contrary to
owers and fruit, it has been observed that the stem category contains mostly
vertical or horizontal tree trunks that often occupy the majority of the image
surface's center. Hence, in order to reliably obtain a background-free sub-image,
we rst determined the stem's orientation by controlling the horizontal and
vertical derivatives' maxima, followed by cropping the corresponding central two
third's of the image surface.
      </p>
    </sec>
    <sec id="sec-4">
      <title>Preprocessing</title>
      <p>Preprocessing stages were present only for the isolated leaves, in the form of size
and orientation normalization. Speci cally, we align the leaves' major axis with
the vertical and normalize their height to 600 pixels, preserving the aspect ratio.
Orientation normalization is realized through principal component analysis, with
additional correction coming from the leaf petiole's location.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Features</title>
      <p>
        Given the high visual variability of this year's dataset categories as well as
the number of classes, feature extraction has become more challenging than
ever before. Consequently, a large spectrum of descriptors has been evaluated,
including shape, texture, color and local invariants. Moreover, considering the
strong relation between seasons and image categories such as fruit and owers,
meta-data have also been exploited with great success. Here we summarize only
the new descriptors, while the others have been explained in detail at the previous
working notes [
        <xref ref-type="bibr" rid="ref8 ref9">8, 9</xref>
        ].
      </p>
      <p>In particular, following the success of our past systems with scan and
scan-like data (isolated leaves), it has been chosen not to greatly modify their
descriptor set; instead we mainly optimized their parameters in order to cope
with the higher class count. In addition, only one new descriptor was included
in the feature extraction set: the edge background/foreground histogram. It is
computed on the binary mask of its input and it consists in calculating the ratio
of background to foreground pixels in a subwindow centered on each edge pixel.
The normalized histogram of the said ratios constitutes the end feature vector.</p>
      <p>
        As far as photographs are concerned, given the extreme variation of
viewpoint and scale (especially w.r.t. the category \entire"), we resorted to using
rather traditional, yet still reliable color descriptors. In particular, we employed
the color autocorrelogram [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], computed in the LSH color space after a
nonuniform subquantization to 63 colors (7 levels for hue, 3 for saturation and 3
for luminance). The color autocorrelogram describes the spatial correlation of
colors. It consists of a table where the entry (i; j) denotes the probability of
encountering two pixels of color i at a distance of j pixels.
      </p>
      <p>
        We further employed the saturation-weighted hue histogram [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], where the
total value of each bin W ; 2 [0; 360] is calculated as:
      </p>
      <p>
        W = X Sx Hx
x
(1)
where Hx and Sx are the hue and saturation values at position x and ij the
Kronecker delta function. As far as the color space is concerned, we have used
LSH [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] since it provides a saturation representation independent of luminance.
      </p>
      <p>And last, in order to exploit the e ect of seasons on fruit and owers, it
has been decided to use the meta-data accompanying the visual samples, and
speci cally the month of acquisition.</p>
      <p>Data</p>
    </sec>
    <sec id="sec-6">
      <title>Classi er Training and Evaluation</title>
      <p>The competition data consisted of a training set which was made available to all
the participants, along with the corresponding groundtruth les, and a test set
whose groundtruth was kept sequestered. The distribution of the data in each
category and in each of these sets is shown in Table 1.</p>
      <p>
        We split the available training data shown in Table 1 into train and
validation subsets. The training set was used in training the corresponding
classi er and the validation set was used as our internal test data for evaluating
di erent features and algorithms. In order to help with the generalization
capability, we tried to avoid having very similar images in the train and validation
splits. Speci cally, pictures from an individual plant were put in either the train
or validation subset. The selection of the samples was done as described in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
As a result of this split, we obtained the train/validation subsets as shown in
Table 2.
We used shape and texture only for isolated leaves in the SheetAsBackground
category and observed them to be very e ective. The length of the feature
vector was 156 for this case, consisting of Fourier descriptors (50 of them),
in addition to various area and contour-based shape descriptors, and texture
descriptors (106 altogether), many of them used in our previous system [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. In
the NaturalBackground category, we only used color features for the ower, fruit
and entire sub-categories (autocorrelogram, saturation-weighted hue histogram
and the month the picture was taken, for a total of 265 dimensions); shape and
texture for the leaf category (same classi er as for isolated leaves); and only
texture features for the stem category.
      </p>
      <p>Feature extraction was done from the whole picture, except for the
case of leaf images in the SheetAsBackground and the NaturalBackground
categories, where segmentation step preceded feature extraction. The approach
of using global features or using only color features is clearly not su cient for
unconstrained photos (e.g. ower, fruit, entire categories), however we did not
have time to incorporate other methods based on local features.</p>
      <p>The classi ers used for di erent categories were all trained with the
training portion of the available data shown in Table 2, except for the leaf
sub-category of NaturalBackground photographs. For this group, we used the
same system developed for recognizing the SheetAsBackground category, after
a simple segmentation of the image.</p>
      <p>As classi er, we used a Support Vector Machine (SVM) classi er based
on their good performance in many object recognition problem and used the
SMO classi er inside the WEKA toolbox. The parameters for the SVM was set
asC = 10 and a polynomial kernel of degree 2 after some limited tests with the
validation set.</p>
      <p>
        In Table 3, we give the cross-validation accuracy obtained while training
a classi er using 10-fold cross-validation, as well as the accuracy of the same
classi er on the validation subset. In the last column of this table, we also include
the average inverse rank results published by the competition organizers for each
category [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Here, a score of 1 indicates that all queries return the correct class
as the top guess, while a score near 0 means the correct class is returned much
later in rank.
      </p>
    </sec>
    <sec id="sec-7">
      <title>Summary and Discussion</title>
      <p>Participation into the ImageCLEF Plant Identi cation competition is an arduous
task, especially when done in collaboration, with di erent people working in
di erent parts of the problem. Last year we had to transfer partial results
back and forth, since alternating steps of segmentation, preprocessing, feature
extraction, and classi cation were done by di erent people in our small group.
This year we streamlined this process a little better and concentrated on what
we could accomplish the best. For that reason, we worked on isolated leaves the
most, while some categories received minimal attention (e.g. leaves under the
NaturalBackground category).</p>
      <p>
        As the o cial results indicate, we obtained the best results in recognizing
isolated leaves (SheetAsBackground category), with an average inverse rank of
0.607. This score roughly indicates that that the correct class was returned as
top-1 or top-2 alternative for the majority of queries, which is a promising result
for the plant retrieval problem. In recognizing the unconstrained photographs in
the NaturalBackground category, we started working on a system based on SIFT
features [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]; although the initial results have been encouraging, the allocated time
has not been su cient for nalizing this module before the submission.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>E.</given-names>
            <surname>Aptoula</surname>
          </string-name>
          and
          <string-name>
            <given-names>S.</given-names>
            <surname>Lefevre</surname>
          </string-name>
          .
          <source>On the morphological processing of hue. Image and Vision Computing</source>
          ,
          <volume>27</volume>
          (
          <issue>9</issue>
          ):
          <volume>1394</volume>
          {
          <fpage>1401</fpage>
          ,
          <year>August 2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>B.</given-names>
            <surname>Caputo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Muller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Thomee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Villegas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Paredes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Zellhofer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Goeau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Joly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bonnet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. Martinez</given-names>
            <surname>Gomez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. Garcia</given-names>
            <surname>Varea</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Cazorla</surname>
          </string-name>
          .
          <source>Imageclef</source>
          <year>2013</year>
          :
          <article-title>the vision, the data and the open challenges</article-title>
          .
          <source>In Proc. CLEF</source>
          <year>2013</year>
          , LNCS,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3. H. Goeau,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bonnet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Joly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Boujemaa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Barthelemy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Molino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Birnbaum</surname>
          </string-name>
          , E. Mouysset, and
          <string-name>
            <given-names>M.</given-names>
            <surname>Picard</surname>
          </string-name>
          .
          <article-title>The clef 2011 plant image classi cation task</article-title>
          .
          <source>In CLEF 2011 working notes</source>
          , Amsterdam, The Netherlands,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4. H. Goeau,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bonnet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Joly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Yahiaoui</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Barthelemy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Boujemaa</surname>
          </string-name>
          , , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Molino</surname>
          </string-name>
          .
          <article-title>The ImageClef 2012 plant identi cation task</article-title>
          .
          <source>In CLEF 2011 working notes</source>
          , Rome, Italy,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanbury</surname>
          </string-name>
          . Circular statistics applied to colour images.
          <source>In Computer Vision Winter Workshop</source>
          , pages
          <volume>55</volume>
          {
          <fpage>60</fpage>
          ,
          <string-name>
            <surname>Valtice</surname>
          </string-name>
          , Czech Republic,
          <year>February 2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>J.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. R.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mitra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. J.</given-names>
            <surname>Zhu</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Zabih</surname>
          </string-name>
          .
          <article-title>Image indexing using color correlogram</article-title>
          .
          <source>In CVPR</source>
          , pages
          <volume>762</volume>
          {
          <fpage>768</fpage>
          ,
          <string-name>
            <surname>San</surname>
            <given-names>Juan</given-names>
          </string-name>
          , Puerto Rico,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>David</surname>
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Lowe.</surname>
          </string-name>
          <article-title>Object recognition from local scale-invariant features</article-title>
          .
          <source>In ICCV</source>
          , pages
          <volume>1150</volume>
          {
          <fpage>1157</fpage>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>B.</given-names>
            <surname>Yanikoglu</surname>
          </string-name>
          , E. Aptoula, and
          <string-name>
            <given-names>C.</given-names>
            <surname>Tirkaz</surname>
          </string-name>
          .
          <article-title>Sabanci-Okan system at ImageClef 2011: Plant identi cation task</article-title>
          .
          <source>In CLEF (Notebook Papers/Labs/Workshop)</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>B.</given-names>
            <surname>Yanikoglu</surname>
          </string-name>
          , E. Aptoula, and
          <string-name>
            <given-names>C.</given-names>
            <surname>Tirkaz</surname>
          </string-name>
          .
          <article-title>Sabanci-Okan system at ImageClef 2012: Combining features and classi ers for plant identi cation</article-title>
          .
          <source>In CLEF (Notebook Papers/Labs/Workshop)</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>