<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Comparative analysis of segmentation algorithms for the allocation of microcalcifications on mammograms</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Yu A Podgornova</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>S S Sadykov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Murom Institute (branch) Federal state budgetary Educational Institution of Higher Education" Vladimir State University named after Alexader Grigoryevich and Nickolay Grigoryevich Stoletovs"</institution>
          ,
          <addr-line>Orlovskaya street, 23, Murom, Russia, 602264</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <fpage>121</fpage>
      <lpage>127</lpage>
      <abstract>
        <p>Breast cancer is the most common disease of the current century in the female population of the world. The main task of the research of most scientists is the detection of this pathology at an early stage (the tumor size is less than 7 mm) when a woman can still be helped. An indicator of this disease is the presence of small-point microcalcifications, located in groups within or in the immediate circle of the tumor. Microcalcification is a small-point character at cancer, reminding grains of sand of irregular shape which sizes are from 100 to 600 microns. The probability of breast cancer increases with the increase in the number of microcalcifications per unit area. So, the probability of cancer is 80% if more than 15 microcalcifications on 1 sq. cm. The microcalcifications are often the only sign of breast cancer, therefore, their detection even in the absence of a tumor node could be a harbinger to cancer. Image segmentation is one way to identify microcalcifications. The conducted research allowed us to choose the optimal segmentation algorithms of mammograms to highlight areas of microcalcifications for further analysis of their groups, sizes, and so on.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        The mammary gland is a complex, sensitive organ that requires constant monitoring due to the annual
increase in the incidence of breast cancer and its "rejuvenation" [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Often a symptom of serious
diseases is small-point calcification, called microcalcifications (deposits of calcium) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Usually,
isolated microcalcifications or clusters are small in size, so they do not self-identify. For detection of
this pathology is required to carry out hardware diagnostics such as ultrasound and mammography.
      </p>
      <p>
        Mammography is a noninvasive method for the detection of pathologies of mammary glands [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        Microcalcinates differ in localization, size, shape, concentration, quantity. Examples of
microcalcifications are presented in figure 1. To assess all of these parameters will need to find and
select the picture mammography. To evaluate all the specified parameters, it is necessary to find and
highlight them in the mammography image. The process of finding homogeneous areas in an image is
called segmentation. It is the first step in image analysis. Thus, segmentation [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] plays an important
role in the processing of medical images. The main idea of the segmentation process is as follows:
each pixel of the image can be associated with some visual properties, such as brightness, color, and
texture. Within one object or one part of an object, these attributes change relatively little, whereas
when crossing the border from one object to another, there is usually a significant change in the above
attributes.
      </p>
      <p>
        At the moment many image segmentation algorithms are developed [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], therefore the main task of
this paper is the analysis of existing methods of segmentation and selection of an optimal algorithm for
detection of microcalcifications in mammographic images.
      </p>
      <p>
        There is the following classification of image segmentation algorithms [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]: threshold methods,
region-based methods, edge detection methods, and clustering-based algorithms. Edge detection
methods are not used in this work, as most are used to highlight the contours of the image. This article
explores the following segmentation algorithms: algorithm FloodFill, the watersheds, MeanShift, and
k-means.
      </p>
      <p>Figure 1 shows examples of mammograms with different microcalcifications, in Figure 1(c), in
addition to microcalcifications, there is also a malignant neoplasm that has fuzzy spiciform contours,
i.e. a star-shaped knot with thin strands extending from it, which are called spicules.
c)</p>
      <p>Figure 1. Examples of forms, localization, and the number of microcalcifications on mammograms.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Overview of segmentation algorithms</title>
      <sec id="sec-2-1">
        <title>2.1. Watershed method</title>
        <p>
          The concept of watershed [
          <xref ref-type="bibr" rid="ref10 ref7 ref8 ref9">7, 8, 9, 10</xref>
          ] is based on the representation of the image as a
threedimensional surface defined by two spatial coordinates and the level of brightness as the height of the
surface (relief). In this "topographical" interpretation, three types of points are considered: (a) points
of the local minimum; (b) points located on the slope, i.e. from which water rolls down to the same
local minimum; and (C) points located on the crest or peak, i.e. from which water is equally likely to
roll down more than one such minimum. When applied to a specific local minimum, a set of points
that satisfy condition (b) is called a basin (or catchment area) of this minimum. The sets of points that
satisfy condition (c) form the ridge lines on the surface of the relief and are called the watershed lines.
        </p>
        <p>One of the most important applications of segmentation by watersheds is the selection of objects of
uniform brightness in the background (in the form of spots). Areas characterized by small changes in
brightness have small gradient values. Therefore, in practice, there is often a situation when the
method of segmentation by watershed is applied not to the image itself, but to the gradient of this
image. Under such conditions, the local minima of the basins agree well with the small gradient
values, which usually corresponds to the objects of interest.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. MeanShift segmentation</title>
        <p>
          The main idea of this method [
          <xref ref-type="bibr" rid="ref11 ref12">11, 12</xref>
          ] is that the input image can be used to construct a nuclear
estimate for the probability density of data distribution in the RGBXY feature space. Next, a natural
assumption is made that the local maxima of the probability density corresponding to the cluster
centers. From the necessary condition of a local extremum, an expression is determined for the shift
vector m(p) of the feature space p ∈ RGBXY, applying which iteratively to the point p we get a
sequence of points converging to the local maximum of the probability density estimate (i.e. to the
center of the nearest cluster):
m( p) = i=1
        </p>
        <p>− p,
n
∑ pi ⋅ g i
n
∑ g i
i=1
where g i = g ( p −h pi ) 2  , g (v) = −k ′(v) , h is smoothing parameter, K(v)=ck(v) is kernel
estimates of a density.</p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. FloodFill algorithm</title>
        <p>
          Using the FloodFill method [
          <xref ref-type="bibr" rid="ref11 ref13 ref14">11, 13, 14</xref>
          ] it is possible to select areas of uniform color. To do this,
select the starting pixel and set the interval for changing the color of neighboring pixels relative to the
original. The interval can be asymmetric. The algorithm will combine pixels into one segment (filling
them with one color) if they fall within the specified range. The output is a segment filled with a
certain color and its area in pixels.
        </p>
        <p>
          Such an algorithm can be useful for filling an area with weak color swings with a homogeneous
background. One of the ways to use FloodFill is to detect damaged edges of the object. For example, if
the algorithm fills neighboring regions by filling homogeneous areas with a certain color, then the
integrity of the border between these areas is violated.
2.4. k-means algorithm
k-means segmentation [
          <xref ref-type="bibr" rid="ref15 ref16 ref17 ref18">15, 16, 17, 18</xref>
          ] is the most popular clustering method. The algorithm is aimed
at minimizing the total quadratic deviation of cluster points from the centers of these clusters. Thus,
this is an iterative algorithm that divides a given set of pixels into k clusters of points, which are as
close as possible to their centers, and the clustering itself occurs due to the displacement of these same
centers. It is necessary to take into account the fact that the k-means method is very sensitive to noise,
which can significantly distort the results of clustering.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3 Experimental results</title>
      <p>As criteria for evaluation of work of algorithms of segmentation, it is possible to use quality of
background suppression and selection of objects in the form of connected areas.</p>
      <p>Because microcalcifications are a complex object, it is impossible to demand accurate
determination of the object consisting of several parts of different brightness as a single connected
region.</p>
      <p>For the analysis methods were taken real pictures of microcalcifications on mammograms. Images
differ in the number and type of microcalcifications, brightness of the background and objects, and the
presence of repetitive textures.</p>
      <p>Figures 2-4 show the results of the algorithms on the original images. Only the FloodFill algorithm
coped with the allocation of microcalcifications, all other algorithms have identified too many
connected regions. Such experimental results suggest the need for using pre-processing methods
before using segmentation.</p>
      <p>
        Next to the images were applied contrasting methods, described in detail in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>Figures 5-7 show examples of studies of segmentation algorithms on contrasted mammograms.
The results of the experiments are as follows:
1) the watershed algorithm is not suitable for the solution of a task at all;
2) the MeanShift algorithm is able to allocate the required objects only in images without tumors
on the background of fatty involution;
3) the k-means algorithm showed results similar to the previous method.</p>
      <p>a) b) c) d)
Figure 2. Examples of image segmentation shown in Figure 1(a): (a) watershed method, (b)
MeanShift algorithm, (c) FloodFill algorithm, (d) k-means segmentation.</p>
      <p>a) b) c) d)
Figure 3. Examples of image segmentation shown in Figure 1(b): (a) watershed method, (b)
MeanShift algorithm, (c) FloodFill algorithm, (d) k-means segmentation.</p>
      <p>a) b) c) d)
Figure 4. Examples of image segmentation shown in Figure 1(c): (a) watershed method, (b)</p>
      <p>MeanShift algorithm, (c) FloodFill algorithm, (d) k-means segmentation.</p>
      <p>a) b) c) d)
Figure 5. Examples of image segmentation shown in Figure 1(a): (a) changing the brightness/contrast
of the image, (b) the watershed algorithm, (c) – the MeanShift algorithm, (d) the k-means
segmentation.</p>
      <p>a) b) c) d)
Figure 6. Examples of image segmentation shown in Figure 1(b): (a) changing the brightness/contrast
of the image, (b) the watershed algorithm, (c) the MeanShift algorithm, (d) the k-means segmentation.</p>
      <p>a) b) c) d)
Figure 7. Examples of image segmentation shown in Figure 1(c): (a) changing the brightness/contrast
of the image, (b) the watershed algorithm, (c) the MeanShift algorithm, (d) the k-means segmentation.</p>
      <p>
        The study of the work of the algorithms was carried out on 250 mammograms from the MIAS [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]
database. Empirically managed to achieve the results presented in Table 1 and 2.
      </p>
      <sec id="sec-3-1">
        <title>Watershed algorithm MeanShift algorithm FloodFill algorithm k-means segmentation</title>
      </sec>
      <sec id="sec-3-2">
        <title>Watershed algorithm</title>
        <p>MeanShift algorithm
k-means segmentation</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusion</title>
      <p>Comparative analysis of different methods of image segmentation applied to the problem of allocation
of microcalcifications in mammographic images. To compare segmentation methods, criteria were
used based on the expert's assessment (based on visual analysis) of the quality of background
suppression and the selection of objects as connected areas. Through experimental studies, it was
found that the method of watersheds incorrectly finds the boundaries of objects and is not acceptable
in solving the problem.</p>
      <p>The best segmentation result was obtained using the FloodFill algorithm, which consists of the
selection of areas of uniform color. During the experiments, it was found that to improve the quality of
mammogram segmentation, it is advisable to pre-process the images. It provides a reduction in the
number of analyzed areas by combining segments and removing irrelevant fragments from the point of
view of the problem. Using the same segmentation algorithms after processing the images showed that
the MeanShift algorithms and k-means are able to highlight microcalcifications only on the images
without tumors on the background of fatty involution.</p>
      <p>It should be noted that further research is needed to improve the methods of thematic segmentation,
taking into account the spatial properties of areas and providing the best compromise between
insufficient and excessive segmentation.</p>
      <p>The obtained results allow us to outline the prospects of using segmentation algorithms in the
construction of automatic cancer detection systems on mammograms at an early stage.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>Podgornova Yu A expresses her sincere gratitude to her academic advisor, Dr. of Tech. Sci., Professor
Sadykov S S for his help at the research conduct, valuable recommendations in relation to their
planning and article preparation as well as for his moral support.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Korzhenkova G P 2004 Comprehensive</surname>
            <given-names>X</given-names>
          </string-name>
          <article-title>-ray Sonographic Diagnosis of Breast Diseases (Moscow: Firma STROM</article-title>
          ) p
          <fpage>128</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Sadykov</surname>
            <given-names>S S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bulanova</surname>
            <given-names>Yu A</given-names>
          </string-name>
          and
          <string-name>
            <surname>Zaharova</surname>
            <given-names>E A</given-names>
          </string-name>
          <year>2014</year>
          <article-title>Computer diagnosis of tumors in mammograms</article-title>
          <source>Computer Optics</source>
          <volume>38</volume>
          (
          <issue>1</issue>
          )
          <fpage>131</fpage>
          -
          <lpage>138</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Eddaoudi</surname>
            <given-names>F</given-names>
          </string-name>
          and
          <article-title>Regragui F 2011 Microcalcifications detection in mammographic images using texture coding</article-title>
          <source>Applied Mathematical Sciences</source>
          <volume>5</volume>
          <fpage>381</fpage>
          -
          <lpage>393</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Panchenko D S and Putyatin E P 1999</surname>
          </string-name>
          <article-title>Comparative analysis of image segmentation methods Radio electronics</article-title>
          and informatics
          <volume>4</volume>
          <fpage>109</fpage>
          -
          <lpage>114</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Whitey D J and Koles Z J 2008</surname>
          </string-name>
          <article-title>A review of Medical Image segmentation: Methods and</article-title>
          available software
          <source>International Journal of Bioelectromagnetism</source>
          <volume>10</volume>
          <fpage>125</fpage>
          -
          <lpage>148</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Doskolovich</surname>
            <given-names>L L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kharitonov</surname>
            <given-names>S I</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Petrova</surname>
            <given-names>O I</given-names>
          </string-name>
          and
          <string-name>
            <surname>Soifer</surname>
            <given-names>V A</given-names>
          </string-name>
          <year>1998</year>
          <article-title>A gradient method for design of multiorder varied-depth binary diffraction gratings - a comparison Opt</article-title>
          . And Lasers in Eng.
          <volume>29</volume>
          <fpage>249</fpage>
          -
          <lpage>259</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Sadykov</surname>
            <given-names>S S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bulanova Yu</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zaharova</surname>
            <given-names>E A</given-names>
          </string-name>
          and
          <string-name>
            <surname>Yashkov</surname>
            <given-names>V S</given-names>
          </string-name>
          <year>2013</year>
          <article-title>Marker watershed study to isolate the breast cancer area Algorithms</article-title>
          ,
          <source>methods and data processing systems</source>
          <volume>1</volume>
          <fpage>56</fpage>
          -
          <lpage>64</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Hagyard</surname>
            <given-names>D</given-names>
          </string-name>
          and
          <string-name>
            <surname>Razaz</surname>
            <given-names>M 1996</given-names>
          </string-name>
          <article-title>Analysis of watershed algorithms for gray scale images</article-title>
          <source>IEEE conf on image processing</source>
          <volume>3</volume>
          <fpage>41</fpage>
          -
          <lpage>44</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Gauch J M 1999</surname>
          </string-name>
          <article-title>Image segmentation and analysis via multiscale gradient watersheds</article-title>
          <source>IEEE trans on image processing</source>
          <volume>8</volume>
          <fpage>69</fpage>
          -
          <lpage>79</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Myasnikov</surname>
            <given-names>E V</given-names>
          </string-name>
          <year>2017</year>
          <article-title>Hyperspectral image segmentation using dimensionality reduction and classical segmentation approaches</article-title>
          <source>Computer Optics</source>
          <volume>41</volume>
          (
          <issue>4</issue>
          )
          <fpage>564</fpage>
          -
          <lpage>572</lpage>
          DOI: 10.18287/
          <fpage>2412</fpage>
          -6179- 2017-41-4-
          <fpage>564</fpage>
          -572
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Comaniciu</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ramesh</surname>
            <given-names>V</given-names>
          </string-name>
          and
          <string-name>
            <surname>Meer P 2000 Real-Time Tracking</surname>
          </string-name>
          of Non-Rigid
          <source>Objects Using Mean Shift Conference on CVPR</source>
          <volume>2</volume>
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Dingding</surname>
            <given-names>Liu</given-names>
          </string-name>
          , Bilge Soran, Gregg Petrie, and
          <article-title>Linda Shapiro 2019 A Review of Computer Vision Segmentation Algorithms</article-title>
          (Washington: University of Washington)
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Comaniciu</surname>
            <given-names>D</given-names>
          </string-name>
          and
          <string-name>
            <surname>Meer P 2002 Mean Shift</surname>
          </string-name>
          :
          <article-title>A Robust Approach toward Feature Space Analysis IEEE Trans</article-title>
          .
          <source>Pattern Analysis and Machine Intelligence</source>
          <volume>24</volume>
          <fpage>603</fpage>
          -
          <lpage>619</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Charles</surname>
            <given-names>J J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kuncheva</surname>
            <given-names>L I</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wells</surname>
            <given-names>B</given-names>
          </string-name>
          and
          <string-name>
            <surname>Lim</surname>
            <given-names>I S 2006</given-names>
          </string-name>
          <article-title>An Evaluation Measure of Image Segmentation Based on Object Centres International Conference Image Analysis and Recognition ICIAR 2006: Image Analysis</article-title>
          and
          <source>Recognition</source>
          (Springer: Verlag Berlin Heidelberg)
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Inaba</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Katoh</surname>
            <given-names>N</given-names>
          </string-name>
          and
          <string-name>
            <surname>Imai</surname>
            <given-names>H 1994</given-names>
          </string-name>
          <article-title>Applications of weighted Voronoi diagrams and randomization to variance-based k-clustering</article-title>
          <source>Proceedings of 10th ACM Symposium on Computational Geometry 332-339 DOI:10.1145/177424</source>
          .178042
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Steven</surname>
            <given-names>J 2002</given-names>
          </string-name>
          <article-title>Acceleration of K-Means</article-title>
          and
          <source>Related Clustering Algorithms Lecture Notes in Computer Science</source>
          (Springer Berlin Heidelberg)
          <fpage>166</fpage>
          -
          <lpage>177</lpage>
          DOI:10.1007/3-540-45643-0_
          <fpage>13</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Nameirakpam</surname>
            <given-names>D 2015</given-names>
          </string-name>
          <string-name>
            <surname>Image Segmentation Using K-means Clustering</surname>
          </string-name>
          Algorithm and
          <source>Subtractive Clustering Algorithm Procedia Computer Science</source>
          <volume>54</volume>
          <fpage>764</fpage>
          -
          <lpage>771</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Tatarnikov</surname>
            <given-names>V V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pestunov</surname>
            <given-names>I A</given-names>
          </string-name>
          and
          <string-name>
            <surname>Berikov</surname>
            <given-names>V B</given-names>
          </string-name>
          <year>2017</year>
          <article-title>Centroid averaging algorithm for a clustering ensemble</article-title>
          <source>Computer Optics</source>
          <volume>41</volume>
          (
          <issue>5</issue>
          )
          <fpage>712</fpage>
          -
          <lpage>718</lpage>
          DOI: 10.18287/
          <fpage>2412</fpage>
          -6179-2017-41-5-
          <fpage>712</fpage>
          - 718
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>