<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>NEURAL NETWORK APPROACH TO THE PROBLEM OF IMAGE SEGMENTATION FOR MORPHOLOGICAL STUDIES</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>A.V. Stadnik</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>O.I. Streltsova</string-name>
          <email>strel@jinr.ru</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>D.V. Podgainy</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>I.A. Kolesnikоva</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yu.A. Butenko</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>A.V. Nechaevskiy</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>A.I. Anikina</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>T.V. Gudiev</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>A.I. Streltsov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Alexey Stadnik</institution>
          ,
          <addr-line>Oxana Streltsova, Dmitry Podgainy, Inna Kolesnikоva, Yuri Butenko, Andrey Nechaevskiy, Anastasia Anikina, Timur Gudiev, Alexey Streltsov</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Laboratory of Radiation Biology, JINR</institution>
          ,
          <addr-line>6 Joliot-Curie St., Dubna, 141980</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Meshcheryakov Laboratory of Information Technologies, JINR</institution>
          ,
          <addr-line>6 Joliot-Curie St., Dubna, 141980</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Regional Scientific and Educational Mathematical Center, North Ossetian State University after K. L. Khetagurov</institution>
          ,
          <addr-line>44-46 Vatutina, Vladikavkaz, 362025, North Ossetia - Alania</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>SAP SE</institution>
          ,
          <addr-line>Walldorf, 69190</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2021</year>
      </pub-date>
      <volume>2743</volume>
      <fpage>5</fpage>
      <lpage>9</lpage>
      <abstract>
        <p>The report presents the results on the development of the algorithmic block of the Information System (IS) for radiobiological studies, created within a joint project of MLIT and LRB JINR, in terms of solving the segmentation problem for morphological research to study the effect of ionizing radiation on biological objects. The problem of automating the morphological analysis of histological preparations is solved in the frames of the project by implementing algorithms based on a neural network approach and computer vision methods. The results of the investigations will be used in the development of the algorithmic block of the BIOHLIT information system.</p>
      </abstract>
      <kwd-group>
        <kwd>morphological analysis</kwd>
        <kwd>image segmentation</kwd>
        <kwd>neural networks</kwd>
        <kwd>automated image processing</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Morphological data are images representing photographs of histological preparations of serial
sections of the brain of laboratory animals. The task of the morphological analysis of nervous tissue
embraces image segmentation, the classification of cells by the type of disorders, the assessment of
structural changes in tissue. Different types of cells in this context are different degrees of damage to a
neuron in the brain. As part of an experiment, scientists usually receive data on the behavioral
reactions of laboratory animals in test installations, as well as on the results of a histological
examination of the taken biological material. The global task of such a complex analysis is to study
morphofunctional changes in the central nervous system.</p>
      <p>The automation of the process of analyzing photographic images of histological preparations
is important, since it allows one to speed up the acquisition of data, to increase their volume and
reduce the likelihood of classification errors arising from the human factor. For this purpose, an
information system for radiobiological studies, BIOHLIT [1], is being developed.</p>
      <p>The main goals of the work are to extract information, to automate the process of analyzing
and labeling images, extracting the maximum amount of data for research groups. Thus, a web-based
service with a client-server architecture on the HybriLIT platform [2] has been implemented [3].</p>
    </sec>
    <sec id="sec-2">
      <title>2. Training sample</title>
      <p>Data for the task of training and building a working neural network model are the original
image and the corresponding image markup, which can be implemented in various ways. Figure 1
illustrates a variant of the markup by an expert with markers applied to the image and a variant of the
markup where objects are specified by a mask of different colors. This is equivalent to the set of
binary masks required in a multiclass segmentation task.</p>
      <p>The formulation of the segmentation problem for a neural network is to form a set of images
with masks. Separate masks are formed for each type of violation.</p>
      <sec id="sec-2-1">
        <title>Options for creating a training sample are:</title>
        <p> expert markup,
 (semi-) automated marking, using the methods of adaptive threshold filtering,
segmentation, search for contours.</p>
        <p>Expert markup is the most correct, but the most time-consuming way of obtaining a training
sample. However, using classical image processing techniques, such as edge search, we can extract
additional information about the data and perform preliminary analysis. In some cases, this can be
used to create markup. Figure 2 shows the results of an analysis of some of the images, where it can be
seen that statistics on cell size can be extracted in a similar way.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Segmentation using a neural network approach</title>
      <p>
        As a solution, the U-net neural network architecture [
        <xref ref-type="bibr" rid="ref1">4</xref>
        ] was chosen, it is well suited for image
segmentation problems.
      </p>
      <p>The key features of the architecture are:
 Path of information passage, similar to an autoencoder (compressing and expanding path),
which leads to the creation of a "bottleneck" for information passage, giving a high-quality
generalization of data;
 Transfer of information through the layers of the network (similar to Resnet [5]), which gives
an increase in the spatial resolution, less attenuation of the gradient in the process of training
the model.</p>
      <p>For the loss function, the well-proven Dice function (dice loss) was chosen for the segmentation
problem, it has the meaning of the fraction of overlap between the received and target areas.
dice_loss = 2* (A ᑎ В) / (А + В)
(1)</p>
      <p>The training of neural network models and work with data were carried out on the HybriLIT
heterogeneous cluster, which is part of the Multifunctional Information and Computing Complex
(MICC) of the Meshcheryakov Laboratory of Information Technologies of JINR, Dubna. The platform
consists of the “Govorun” supercomputer and the HybriLIT training and testing polygon. We studied
models of neural networks with different capacities, but they all achieved overfitting at 80-90 epochs.
At the same time, less-capacious networks settled on the best result compared to large-volume
networks. A graph illustrating retraining at best is shown in [6].</p>
      <p>In general, the result was unsatisfactory. The reasons for retraining can be both separately and
several of the following factors:
 Insufficient amount of data,
 Systematic error in data,
 Unsuccessful network architecture,
 Insufficient data completeness,
 Unbalanced data (by class).</p>
      <p>For one example from the test set, the result of classification by the trained model is
demonstrated in Figure 4, it is obvious that the main direction of optimization of the neural network
architecture in this task turned out to be the direction of semantic segmentation, i.e. trying to separate
objects from the background of the image.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Semantic segmentation</title>
      <p>The analysis showed that U-net tried to select all segments in the image. Therefore, it seems
reasonable to divide the original problem into two corresponding stages:
 segmentation;
 type classification.</p>
      <p>At the same time, the stage of classification of the type of neuron can be carried out both using
the U-net architecture with the involvement of additional information, taking into account the already
obtained at the first stage of segmentation, and making decisions by segments for each of the objects.</p>
      <p>To form a training sample for the semantic segmentation problem, the opportunity to
automatically perform markup using the classical approach with the search for contours in the image
with the previously applied adaptive threshold transformation was used. As part of solving this
problem, a study on the stability of the solution to the segmentation problem to the presence of
systematic errors in data markup was also carried out.</p>
      <p>For this, two samples were formed:
 mask of contours, which gives the search for contours in the OpenCV library;
 convex contour mask.</p>
      <p>Small-capacity networks were chosen as neural network architectures for segmentation:
 simple convolutional network, without pooling layers, i.e. without reducing the spatial
resolution,
 autoencoder, i.e. compressive mapping, which is essentially U-net without concatenations
(forwarding information through levels).</p>
      <p>An autoencoder enables to achieve a greater generality of the model, to get rid of background
remnants, to better select segments, but lose in the spatial resolution of the image, i.e. small details
(such as the nuclei of neurons, for example).</p>
      <p>The result of training both models on different sets was an almost identical classi fication
result, which allows us to conclude that errors in the markup with this scheme will have an
insignificant effect. In general, the autoencoder showed the best result (Fig. 6).</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>A number of interesting results have been acquired, a model of an autoencoder that allows for
the semantic segmentation of an image has been obtained, and the presence of the stability of the
obtained solution in relation to systematic errors in marking has been clarified.</p>
      <p>Further work is aimed at solving the problem of classifying brain neurons by type and
increasing the sample, including using data from open sources [7].
[2] HybriLIT heterogeneous cluster, http://hlit.jinr.ru/
[3] Yu. Butenko, D. Marov, A. Nechaevskiy, D. Podgainy Development of a Service for Conducting
Radiobiological Studies on the HybriLIT Platform // Proceedings of the Workshop on Information
Systems for the Radiation Biology Tasks, Dubna, Russia, June 18, 2020, pp. 26-33.
http://ceurws.org/Vol-2743/26-33-paper-4.pdf
[6] A. Stadnik, A. Streltsov, O. Streltsova Algorithms Based on Neural Network Approach for Image
Segmentation in Research Morphofunctional Changes in the Central Nervous System // Proceedings of
the Workshop on Information Systems for the Radiation Biology Tasks, Dubna, Russia, June 18,
2020, pp. 39-47 http://ceur-ws.org/Vol-2743/39-47-paper-6.pdf
2018.</p>
      <sec id="sec-5-1">
        <title>Available</title>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Olaf</given-names>
            <surname>Ronneberger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Philipp</given-names>
            <surname>Fischer</surname>
          </string-name>
          , and Thomas Brox, U-Net:
          <article-title>Convolutional Networks for BiomedicalImage Segmentation, Computer Science Department and BIOSS Centre for Biological Signalling Studies</article-title>
          , University of Freiburg, Germany,
          <source>arXiv:1505.04597v1 [cs.CV] 18 May</source>
          <year>2015</year>
          [5]
          <string-name>
            <surname>He</surname>
          </string-name>
          , Kaiming; Zhang, Xiangyu; Ren, Shaoqing; Sun,
          <string-name>
            <surname>Jian</surname>
          </string-name>
          (
          <year>2015</year>
          -12-10).
          <article-title>"Deep Residual Learning for Image Recognition"</article-title>
          .
          <source>arXiv:1512.0338</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>