<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Impact of Histogram Subset Selection on Classi cation using Multi-scale LBP-Operators</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Sebastian Hegenbart</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andreas Uhl</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andreas V´ecsei</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Sciences, University of Salzburg</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>St. Anna Children's Hospital</institution>
          ,
          <addr-line>Vienna</addr-line>
        </aff>
      </contrib-group>
      <fpage>359</fpage>
      <lpage>363</lpage>
      <abstract>
        <p>Multi-scale Local Binary Pattern based operators are used to extract features from duodenal texture patches with histological ground truth in case of pediatric celiac disease. The multi-scale LBP combined with color channels and possibly other filters lead to a high number of computed histograms. The impact of histogram subset selection on the overall classification rates using two feature subset selection algorithms (SFS and SBS) with three LBP-based operators is analyzed and the applicability of these techniques validated.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Celiac disease is a complex autoimmune disorder in genetically predisposed
individuals of all age groups after introduction of gluten containing food. During
the course of the disease, hyperplasia of the enteric crypts occurs and the
mucosa eventually looses its absorptive villi thus leading to a diminished ability to
absorb nutrients. People with untreated celiac disease are at risk for developing
various complications like osteoporosis, infertility and other autoimmune
diseases including type 1 diabetes. Endoscopy with biopsy is currently considered
the gold standard for the diagnosis of celiac disease. During endoscopy at least
four duodenal biopsies are taken. Microscopic changes within these specimen
are classified by a histological analysis according to a classification scheme by
Oberhuber et al. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The benefits of an automated support tool for diagnosis are
many. Among them are an improved reliability of diagnosis, supported targeting
of biopsies and more efficient use of time and manpower.
      </p>
      <p>
        The Local Binary Pattern (LBP) operator is invariant to monotonic
intensity variations which is beneficial to texture classification in environments with
varying lighting conditions. This property makes the method interesting for
classifying endoscopic images. In the context of LBP many modifications and
related operators have been suggested over the years. A prominent modification
that is often neglected across the literature is the multi-scale approach
suggested by Ma¨enpa¨a¨ [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. This approach is based on low-pass filtering combined
with appropriate filter sizes and operator radii to improve the operators’ spatial
support area. Using this extension, combined with color channels and possibly
other filters, the number of computed histograms is considerably higher than the
Pars Descendens
common approach of using a combination of two or three different
parameterizations of the operator. It is unclear how a high number of histograms affects the
classification. To study the effects we use two feature subset selection schemes
to find optimal suitable combinations of histograms. We analyze the impact of
the subset selection and validate the applicability of these techniques using two
distinct image sets.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Materials and Methods</title>
      <p>
        The image test set used, contains images taken during duodenoscopies at the St.
Anna Children’s Hospital using pediatric gastroscopes without magnification.
Images were recorded by using the modified immersion technique, which has been
shown to be beneficial to automated classification by Hegenbart et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. There
are two duodenal regions with completely different geometric properties, i.e. the
duodenal Bulb and the Pars Descendens. Accordingly, we chose to separate the
images into two distinct sets. Texture patches with a fixed size of 128 128 pixels
were extracted from the full sized frames, a size which turned out to be optimally
suited in previous experiments [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The ground truth for the texture patches used
in experimentation was determined by histological examination of biopsies from
corresponding regions. In the following, we aim at a two class problem with the
classes Class0 as the class representing healthy tissue and Class1 representing
texture patches showing villous atrophy. Table 1 shows the number of images
available per considered class. For evaluation two distinct set of images for both
duodenal regions denoted as Set-1 and Set-2 were assembled. This happened at
two different points in time, the specific sets reflect the time intervals where the
images were captured.
2.1
      </p>
      <sec id="sec-2-1">
        <title>Feature Extraction</title>
        <p>
          The basic LBP operator was introduced to the community by Ojala et al. [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ].
We use three operators that are based on LBP to conduct our experiments. The
operators are LTP (Local Ternary Patterns, [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]), ELBP (extended Local Binary
Patterns, [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]), and the LBP operator combined with a contrast measure (LBPC,
[
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]). The entire family of operators is used to model a pixel neighborhood in
terms of pixel intensity differences. The operators assign a binary label to each
possible pixel neighborhood. The distributions of these labels are then used
as features. The distributions are represented by histograms. We compute the
pattern distributions for each color channel (RGB), each LBP-Scale (1-3) as well
as filter orientation (in case of the extended LBP based operators: horizontal,
vertical and diagonal). In total this is 9-histograms for LTP and LBPC, and
27histograms for ELBP. For each histogram, only a subset of dominant patterns
known as the uniform patterns [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] which make up the majority of discriminative
patterns is used. This subset consists of 58-patterns for 8 considered neighbors.
2.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Histogram Subset Selection</title>
        <p>Depending on the specific operator, at least 9 and at maximum 27 histograms
are computed for a single image. A single LBP histogram can be interpreted
as a macro feature. Therefore the terms histogram subset selection and feature
subset selection share the same meaning. Feature subset selection techniques are
usually applied for two reasons.</p>
        <p>{ Result Optimization: Probably not all parameters combinations are equally
well suited for describing the specific textural properties. Even more, when
computing a large number of histograms, this set could contain a few “bad”
histograms which reduce the discriminative power.
{ Reduction of Dimensionality: Depending on the chosen classification method
large feature vectors might be suboptimal in terms of computational
complexity and classification performance. Feature subset selection can be used
to reduce the number of considered histograms and therefore the final feature
vector dimensionality.</p>
        <p>
          The applied algorithms were the Sequential Forward Selection algorithm
(SFS, [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]) and the Sequential Backward Selection algorithm (SBS, [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]). Please
note, that due to the imbalance of image number in the specific classes among
the two image sets we chose the average classification rate of both classes as
optimization criterion.
2.3
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>Classi cation</title>
        <p>The k-nearest neighbors (kNN) classifier was used for classification. A rather
weak classifier was chosen to give more emphasis on the selected histogram
combinations. After the histogram subset selection the candidate histograms were
combined and treated as a single histogram. The classification is based on the
histogram intersection distance between two histograms. The optimal k-value
was found in a range from 1 to 25.
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Results</title>
      <p>Tables 2 and 3 demonstrate the effect of using subset selection on the set of
histograms. For each experiment the entire set of histograms was computed using
the specific operator and both image sets (Set-1 and Set-2). The algorithms
mentioned in section 2.2 were then used to select subsets for each image set. The sets
were optimized until no new local maximum considering the classification rate
could be found. The found subsets of Set-1 were then used to classify the images
from Set-2 and vice versa. We compare the overall classification rates of these
experiments with the rates gained by using the entire set of histograms without
performing histogram subset selection (column ∆All) and the rates gained by
optimizing the feature subset for the specific image set the classification is
actually performed on (we expect this to be over fitted, column ∆Set1 or ∆Set2).
We denote an increase in overall classification rate with a elqq +” and a decrease
with a “–”.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Discussion</title>
      <p>We can see that using feature subset selection algorithms to find a reliable subset
of histograms in case of multi-scale LBP is reasonable in case of the duodenal
Bulb. The final feature vector dimensionality could be reduced and most
classification rates be improved. The SBS method provides slightly more reliable
results in terms of classification rates but SFS is more efficient in terms of
feature vector dimensionality reduction. Comparing the results with the optimized
results for the specific datasets, we see that the average loss in classification rate
is approximately 1.86 %. This indicates that the optimized subsets are slightly
over fitted. In contrast to the result of the Bulbus-experiments the results of the
Pars-experiments show a general decrease in overall classification rate. Again
the SBS method provided more reliable results as compared to SFS, however
in general no reliable subsets of histograms could be found to guarantee stable
classification rates. Compared to the Bulbus-experiments the histogram subsets
are even more over fitted. The average loss in classification rate is over 3.86 % in
this case. The Pars-set contains two different types of images (and perspectives),
namely the classical perspective perpendicular to the mucosa and the
perspective into the direction of the center of the lumen (Fig. 1). Images exhibiting
the latter perspective cannot be well described by LBP based operators.This
leads to unreliable histograms and affects the subset selection. To overcome this
limitation, a pre-classification of the images contained in the Pars-set, combined
with a separate classification of each type of perspective could be introduced.</p>
      <p>We see that histogram subset optimization can be a feasible option for both,
reducing feature vector dimensionality and improving classification performance.
By using distinct test- and training-sets over fitting can be avoided.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Oberhuber</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Granditsch</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vogelsang</surname>
            <given-names>H.</given-names>
          </string-name>
          <article-title>The histopathology of coeliac disease: time for a standardized report scheme for pathologists</article-title>
          .
          <source>Eur J Gastroenterol Hepatol</source>
          .
          <year>1999</year>
          ;
          <volume>11</volume>
          :
          <fpage>1185</fpage>
          -
          <lpage>1194</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. Ma¨
          <article-title>enpa¨a¨ T. The Local Binary Pattern Approach to Texture Analysis: Extensions and Applications</article-title>
          [PhD Thesis]. University of Oulu;
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Hegenbart</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kwitt</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liedlgruber</surname>
            <given-names>M</given-names>
          </string-name>
          , et al.
          <article-title>Impact of duodenal image capturing techniques and duodenal regions on the performance of automated diagnosis of celiac disease</article-title>
          .
          <source>Proc ISPA</source>
          .
          <year>2009</year>
          ; p.
          <fpage>718</fpage>
          -
          <lpage>723</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Ojala</surname>
            <given-names>T</given-names>
          </string-name>
          , Pietika¨inen M,
          <string-name>
            <surname>Harwood</surname>
            <given-names>D.</given-names>
          </string-name>
          <article-title>A comparative study of texture measures with classification based on feature distributions</article-title>
          .
          <source>Pattern Recognit</source>
          .
          <year>1996</year>
          ;
          <volume>29</volume>
          (
          <issue>1</issue>
          ):
          <fpage>51</fpage>
          -
          <lpage>59</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Tan</surname>
            <given-names>X</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Triggs</surname>
            <given-names>B</given-names>
          </string-name>
          .
          <article-title>Enhanced local texture feature sets for face recognition under difficult lighting conditions</article-title>
          . Lect Notes Computer Sci.
          <year>2007</year>
          ;
          <volume>4778</volume>
          :
          <fpage>168</fpage>
          -
          <lpage>182</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Huang</surname>
            <given-names>X</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            <given-names>Y</given-names>
          </string-name>
          .
          <article-title>Shape localization based on statistical method using extended local binary pattern</article-title>
          .
          <source>Proc ICIG</source>
          .
          <year>2004</year>
          ; p.
          <fpage>1</fpage>
          -
          <lpage>4</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7. Ma¨enpa¨a¨ T,
          <string-name>
            <surname>Ojala</surname>
            <given-names>T</given-names>
          </string-name>
          , Pietika¨inen M, et al.
          <article-title>Robust texture classification by subsets of local binary patterns</article-title>
          .
          <source>Proc ICPR</source>
          .
          <year>2000</year>
          ;
          <volume>3</volume>
          :
          <fpage>3947</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Jain</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zongker</surname>
            <given-names>D</given-names>
          </string-name>
          .
          <article-title>Feature selection: evaluation, application, and small sample performance</article-title>
          .
          <source>IEEE Trans Pattern Anal Mach Intell</source>
          .
          <year>1997</year>
          ;
          <volume>19</volume>
          :
          <fpage>153</fpage>
          -
          <lpage>158</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>