<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Testing Pear Disease Diagnosis Assuming Gaussian Distribution of Feature Values</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Nikos Petrellis</string-name>
          <email>npetrellis@uop.gr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Electrical and Computer Engineering, University of Peloponnese</institution>
          ,
          <addr-line>Patra</addr-line>
          ,
          <country country="GR">Greece</country>
        </aff>
      </contrib-group>
      <fpage>122</fpage>
      <lpage>129</lpage>
      <abstract>
        <p>A plant disease diagnosis method based on processing images that display a sick plant part, is tested here on four pear diseases. In the original classification method employed, an extensible feature ranking method had been adopted. Specifically, a discrete grade was assigned to each invariant feature depending on whether its value was found within predefined strict or loose limits. The potential classes that may correspond to the test sample, were sorted according to the sum of the grades of the features. In the current implementation, a Gaussian behavior is assumed and an analog feature grade is assigned depending on the distance of the extracted feature value from the mean. Modeling feature values with Gaussian distribution would eliminate the need of heuristic selection of the strict/loose feature limits but would make more difficult the extension of the supported set of diseases by the end user. However, the experimental results show that the feature values do not follow a Gaussian distribution since comparable classification precision results are obtained only when a normalization scheme is applied to the input image.</p>
      </abstract>
      <kwd-group>
        <kwd>plant disease diagnosis</kwd>
        <kwd>Gaussian distribution</kwd>
        <kwd>normalization</kwd>
        <kwd>image processing</kwd>
        <kwd>smart phone application</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        Plant monitoring is important in precision agriculture and similar Internet of Things
(IoT) applications [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ][
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Plant pathogen diagnostics are reviewed in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. A review of
machine vision techniques employed for the inspection of citrus fruits with accuracy
ranging between 60% and 100% is presented in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Several monitoring techniques for
specific plants have been recently proposed for garden strawberry diseases [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], rice
blast [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], Fusarium infections on wheat [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], etc. Image processing in color scale
different than Red-Green-Blue (RGB) has also been employed for plant disease
diagnosis. For example, CIE L*a*b color scale is employed in [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] and a neural network
achieves 91% classification accuracy. The use of hyperspectral data and neural
networks in plant disease diagnosis is reviewed in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Mobile phone applications for
the recognition of plant diseases have also been presented in [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] (Purdue Plant Doctor
implemented for iOS and Android) and [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] for Android.
      </p>
      <p>
        In this paper, an alternative implementation of the smart phone application
originally presented in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] and extended in [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] to support several color normalization
techniques, is used. At the input of this smart phone application, an image is used of
the plant part displaying the symptoms of a disease as a number of lesion spots. The
image is segmented to the background (ignored), the normal leaf or fruit, the spots and
halo around the spots. A number of invariant features concerning the 3 Regions of
Interest (ROI) are used for the classification. In the previous versions, a small number
of representative training images were used for each disease to arbitrarily extract strict
and loose limits for each feature. During the test phase, the extracted new feature value
was compared to the extracted limits defined for a specific disease and different grade
was assigned according to whether the new feature value resided within the strict or
loose limits. The weighted sum of all feature grades was used to sort the potential
diseases that may have infected the plant. This method will be called henceforth Limit
Test (LT).
      </p>
      <p>The alternative classification method tested here, assumes that the values of each
feature found in images displaying the same disease, follow a Gaussian Distribution
(GD). During the training phase, the various values of a specific feature are used to
estimate its mean and variance that are stored in the signature (recognition rules) of a
specific disease. The grade assigned to each feature depends, in this case, on the
distance of the current feature value from the mean. Two cases are examined: a) use
of the original images without normalization and b) use of Linear Dynamic Range
Expansion (LDRE normalization) for each one of the RGB colors to moderate the
differences in the light exposure of each photograph. Sensitivity, specificity, precision
and accuracy metrics are used to evaluate the LT and GD classification methods. The
pear diseases tested with the methods described above include: Fire Blight, Pear Scab,
Mycosphaerella and Powdery Mildew. Fifty photographs displaying the upper surface
of a pear leaf have been tested from each disease.</p>
      <p>The LT, GD classification methods are described in Section 2. The experimental
results are presented and discussed in Section 3.</p>
    </sec>
    <sec id="sec-2">
      <title>2 Plant Disease Recognition Methods</title>
      <p>
        The photographs of plant parts captured by the user must display indicative lesions
(spots) of a specific disease. The plant parts can be leaves or fruits or even roots,
branches, flowers, etc. The user initially selects the user interface language, the image
normalization type and the halo zone around the lesion spots with its thickness
expressed as a number of pixels (see Fig. 1a). Then, the photograph is analyzed by
potentially configuring the thresholds for the separation of the background, the normal
plant part and the lesion. In the present application version, separate thresholds are used
for each color component of the RGB color scale (red, green, blue) as shown in Fig. 1b.
Additional information can be given by the user to assist the disease classification (e.g.,
weather data of the specific rural region where the plant exists, located by Global
Positioning System-GPS). The smart phone application was initially developed for
Windows Phone and then ported to Android platforms. The classification methods are
described in detail and the reader can refer to [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] for more implementation details
concerning the application.
(a)
(b)
(c)
(d)
      </p>
      <p>The following invariant features are extracted by the segmented photograph:
number of lesion spots, their area, the average gray level of each region (normal plant
part, spots, halo). A separate histogram is generated for each color component (red,
green, blue) and each ROI (normal, spots, halo). It is assumed that photographs
displaying the same disease would have similar features like the ones listed above.
Instead of attempting to match full histograms, their similarity is simply checked by
comparing where they start, where they end and the where their peak is (the histograms
usually consist of a single lobe).</p>
      <p>
        When the input photograph is analyzed two triplets of thresholds are used to
separate the background and the spots. If all the red, green, blue values of a pixel (pr,
pg, pb, respectively) are higher than the corresponding background thresholds (BGr,
BGg, BGb) then, this pixel is mapped to the background since the background in this
implementation is simply assumed to be much brighter than the plant part. The rest of
the pixels are initially mapped as normal plant part (e.g., the leaf in the cases examined
here). The lesion spots can either be darker or brighter (the user indicates this fact to
the application through the Invert checkbox shown in Fig. 1b), than the normal leaf. If
all the pr, pg, pb values of a leaf pixel are lower than the corresponding thresholds THr,
THg, THb and the lesion is darker than the normal leaf color, this pixel is mapped to
the lesion region. If the pr, pg, pb values of a leaf pixel are higher than the corresponding
thresholds THr, THg, THb and the lesion is brighter than the normal leaf, again this
pixel is mapped to the lesion region. Finally, the pixels existing in a zone around the
spots with thickness defined by the corresponding field shown in Fig. 1a, are mapped
to the halo region. The user can also select an image normalization method from the
page shown in Fig. 1a. The supported normalization methods are described in detail in
[
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. In this paper, the normalization Type 1 (no normalization) and Type 2 (LDRE in
all RGB colors) are tested since Type 2 seems to achieve a better classification
accuracy [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] compared with the other supported normalization schemes.
      </p>
      <p>The matrix BGW1 is defined with the same size as the original image and each one
of its cells can have four distinct values corresponding to normal leaf, spot, hallo and
background. Matrix BGW2 can be derived from BGW1 by marking pixels belonging
to the same spot with the same identity. BGW2 can be used to estimate all of the
features used, in a simple way. The maximum spot identity is the number of spots
while the area covered by the spots is estimated by the number of spot pixels divided
by the total number of pixels belonging to the plant part. The average gray level of
each region can be estimated by averaging the gray level of the pixels mapped to this
region in BGW2. Spots consisting of a very small number of pixels (less than the
threshold “Min Area” of Fig. 1a) are considered as noise and are not taken into
consideration.</p>
      <p>During the training phase of the application for a new disease, a small number of
representative photos with similar sick plant parts can be analyzed even by an end user
that is not aware of the application architecture and implementation details. The range
of each feature fi as determined by the training photographs is used to define the strict
limits [fi_s1..fi_s2]. This narrow range can be heuristically extended in order to define the
loose limits [fi_l1..fi_l2] of this feature for a specific disease. The application lists all these
limits to the end user as shown in Fig. 1d, to assist the definition of new diseases or for
the customization of the existing diseases. All the feature strict and loose limits are
stored in a disease signature exploited during the test phase. When a new photograph is
examined in the test phase, all the features fi are extracted and are compared to the
predefined limits stored in each disease signature. A rank Gr is estimated for each
potential disease using the following equation:
1,  !_#% ≤ ! ≤ !_#&amp;.</p>
      <p>0, ℎ
!_$ = (
1,  !_$% ≤ ! ≤ !_$&amp;.</p>
      <p>0, ℎ</p>
      <p>The parameters wi_l and wi_s are the individual grades (weights) assigned to feature
fi if it is found within the loose or strict limits, respectively of a specific disease. The
diseases are sorted according to the rank Gr that they have received for a specific
photograph. The three diseases with the highest rank are listed. The simple classification
method described above is the LT.</p>
      <p>The new classification method (GD) tested in this paper assumes that the feature
values that are extracted from the photographs of a disease training set, follow a
Gaussian distribution. The mean feature value fm,i, is estimated by the training samples.
The rank Gi given for this feature is inversely proportional to the distance of the value
of this feature from fm,i. If σ2 is the variance for this feature (also estimated by the
training samples), then the disease rank Gr can be estimated by:</p>
      <p>'! 
 = ∑!+-,%% (&amp;)*"
,($!%$&amp;,!)"
") " .</p>
      <p>(1)
(2)
(3)
(4)</p>
      <p>The LDRE normalization employed in the experiments performed in the context of
this paper, stretches the values of each color component in the RGB scale according to
the following equation:
 =
 − 
 − 
 + 
(5)</p>
      <p>PrevCol is the original color value, MinGray and MaxGray are the minimum and
maximum pixel values of the gray version of the image (excluding the background),
Range is the desired final range and StartCol is the desired offset where Range starts.</p>
    </sec>
    <sec id="sec-3">
      <title>3 Experimental Results</title>
      <p>Indicative photographs from the pear diseases that have been tested in the framework
of this paper are shown in Fig. 2. The metrics used to compare the LT and GD
classification methods are the sensitivity, specificity, precision and accuracy. Let True
Positives (TP) be the number of photographs that are correctly recognized as positive
to a disease. True Negatives (TN) are the number of photographs that are correctly
recognized as negative to a disease. False Positives (FP) are the photographs that are
falsely recognized as positives and False Negatives (FN), the ones that are falsely
recognized as negatives to a disease. The aforementioned four metrics are defined as:
 =
 =
 + 


 +</p>
      <p>+ 
 + 
(6)
(7)
(8)
(9)
 =</p>
      <p>+  +  +</p>
      <p>As already mentioned, 50 photographs of each pear disease have been tested. They
have been retrieved from pear trees in the Achaia prefecture of Greece. About half of
these photographs have been captured under sunlight and the rest, under a canopy in
order to test different light exposure. Eight representative (training) photographs have
been used to extract the disease recognition signatures in either LT (containing the
loose and strict limits of each feature), or GD (containing the mean and variance of
each feature). In the first test case, the photographs have not been normalized. The
experimental results for this case are shown in Tables 1 and 2.</p>
      <p>As can be seen from the results listed in Tables 1 and 2, GD is much worse than
LT since 2 of the 4 diseases (fire blight and powdery mildew) cannot be recognized at
all: their sensitivity is 0 although the specificity achieved is more acceptable. Using
LT as the classification method, only pear scab has a low sensitivity but all the average
metrics are better than GD.</p>
      <p>If image normalization with LDRE is used, the results obtained are listed in Tables
3 and 4. Although the average values of the 4 metrics obtained for GD are still worse
than the ones achieved by LT, they are much closer with LDRE normalization. In any
case, these results show that although Gaussian seems a more natural way to describe
the distribution of the various feature values, the classification accuracy achieved by
GD is worse than the one achieved by LT. The positive aspect of this conclusion is
that the rules in the disease signatures used by LT can be defined in a simpler way by
an end-user that is not familiar with the implementation details of the developed
application: it is easier for an end-user to determine the limits in a list of feature values
than estimate the mean values and variances required by the GD. For this reason, the
employed LT classification method allows the extension of the supported set of
diseases by an end user that is not particularly qualified in computer science or
statistics.</p>
      <p>Two classification methods are tested in the framework of a mobile phone
application capable of recognizing pear diseases. The experimental results showed that
the classification method based on the assumption that the invariant image feature
values follow a Gaussian distribution in images displaying the same disease, does not
achieve a better accuracy than the heuristic classification method that compares if the
feature values reside within predefined strict and loose limits. The benefit from the
classification method based on Gaussian distribution is that it favors the extensibility
of the supported set of diseases by a non-expert. Image normalization significantly
improves the accuracy of this classification method.</p>
      <p>Based on the experimental results presented in this paper, future work may focus
on supporting multiple ranges of feature values for comparison e.g., too strict, strict,
loose, too loose, etc, in order to improve further the accuracy.</p>
      <p>Acknowledgments. This work is protected by the provisional patents
1009346/13-82018 and 1008484/12-5-2015 (Greek Patent Office). The author wishes to thank the
student Olympia Sakorafa for her contribution in the experiments carried out for this
paper.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Lara</surname>
            ,
            <given-names>J.C.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gutierrez</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rodríguez</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          (
          <year>2019</year>
          )
          <article-title>Low Cost Greenhouse Monitoring System Based on Internet of Things</article-title>
          .
          <source>Proc. Of the 2019 IEEE International Conference on Engineering Veracruz (ICEV)</source>
          ,
          <year>Mexico</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Nayak</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kavitha</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Rao</surname>
            .
            <given-names>C.M.</given-names>
          </string-name>
          (
          <year>2020</year>
          )
          <article-title>IoT-Enabled Agricultural System Applications, Challenges and Security Issues. IoT and Analytics for Agriculture</article-title>
          , DOI: 10.1007/
          <fpage>978</fpage>
          -981-13-9177-
          <issue>4</issue>
          _
          <fpage>7</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Rani</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Donovan</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Mantri</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          (
          <year>2019</year>
          )
          <article-title>Review: The future of plant pathogen diagnostics in a nursery production system</article-title>
          .
          <source>Elsevier Biosensors and Bioelectronics</source>
          . Available online.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Cubero</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>W.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aleixos</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Albert</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Blasco</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2016</year>
          )
          <article-title>Automated Systems Based on Machine Vision for Inspecting Citrus Fruits from the Field to Postharvest-a Review</article-title>
          .
          <source>Food and Bioprocess Technology</source>
          ,
          <volume>9</volume>
          (
          <issue>10</issue>
          ), p.
          <fpage>1623</fpage>
          -
          <lpage>1639</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Aleinikov</surname>
            ,
            <given-names>A.F.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Method of Non-Invasive Determination of Fungal Diseases of Common Garden Strawberry</article-title>
          .
          <source>Siberian Herald of Agricultural Science</source>
          ,
          <volume>48</volume>
          (
          <issue>3</issue>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>W.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>Y.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ng</surname>
            ,
            <given-names>F.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>C.Y.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>Y.W.</given-names>
          </string-name>
          (
          <year>2019</year>
          )
          <article-title>RiceTalk: Rice Blast Detection using Internet of Things and Artificial Intelligence Technologies</article-title>
          . IEEE Internet of Things Journal
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Bauriegel</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Herppich</surname>
            ,
            <given-names>W.B.</given-names>
          </string-name>
          (
          <year>2014</year>
          )
          <article-title>Hyperspectral and Chlorophyll Fluorescence Imaging for Early Detection of Plant Diseases, with Special Reference to Fusarium spec</article-title>
          .
          <source>Infections on Wheat. Agriculture</source>
          ,
          <volume>4</volume>
          , p.
          <fpage>32</fpage>
          -
          <lpage>57</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Kulkarni</surname>
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Patil</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2012</year>
          )
          <article-title>Applying Image Processing Technique to Detect Plant Diseases</article-title>
          .
          <source>International Journal of Modern Engineering Research</source>
          ,
          <volume>2</volume>
          (
          <issue>5</issue>
          ), p.
          <fpage>3361</fpage>
          -
          <lpage>3364</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Siva</surname>
            ,
            <given-names>K.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ganesan</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Pradhan</surname>
            ,
            <given-names>V.B.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>A Review of Neural Networks in Plant Disease Detection using Hyperspectral Data</article-title>
          .
          <source>Elsevier Information Processing in Agriculture</source>
          ,
          <volume>5</volume>
          (
          <issue>3</issue>
          ), p.
          <fpage>354</fpage>
          -
          <lpage>371</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Luke</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Beckerman</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sadof</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Richmond</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McClure</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hill</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Lu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          (
          <year>2017</year>
          )
          <article-title>Purdue Plant Doctor App Suite</article-title>
          . Purdue University, https://www.purdueplantdoctor.com/.
          <source>Accessed 22 August</source>
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Lakshmi</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Sundareshwaran</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          (
          <year>2019</year>
          )
          <article-title>An Android based Image Processing Application to Detect Plant Disease</article-title>
          .
          <source>International Research Journal of Engineering and Technology (IRJET)</source>
          ,
          <volume>6</volume>
          (
          <issue>5</issue>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Petrellis</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          (
          <year>2017</year>
          )
          <article-title>A Windows Phone Application for Plant Disease Diagnosis</article-title>
          .
          <source>Proc. of the 8th HAICTA</source>
          , Chania, Greece.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Petrellis</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          (
          <year>2019</year>
          )
          <article-title>Plant Disease Diagnosis with Color Normalization</article-title>
          .
          <source>Proc. of the IEEE MOCAST</source>
          <year>2019</year>
          , Thessaloniki, Greece
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>