<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>ORCID:</journal-title>
      </journal-title-group>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>MLOps Approach for Automatic Segmentation of Biomedical Images</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Berezkyi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Oleh Berezsky</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Oleh Pitsun</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Grygoriy Melnyk</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yuriy Batko</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Petro Liashchynskyi</string-name>
          <email>p.liashchynskyi@st.wunu.edu.ua</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mykola</string-name>
          <email>mykolaberezkyy@gmail.com</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Workshop</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>West Ukrainian National University</institution>
          ,
          <addr-line>11 Lvivska st., Ternopil, 46001</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0001</lpage>
      <abstract>
        <p>When using artificial intelligence systems for processing medical images, a large amount of software libraries, data and cloud computing is required. Implementing deep learning elements in CAD is a complex process and applying DevOps can help speed up this process. The implementation of DevOps approaches in the field of machine learning differs from the operations with standard programs; therefore the development of MLOps approaches to the implementation of deep learning elements for the analysis of biomedical images is an actual task. The developed pipeline allows scientists and specialists to use the findings in this article to launch projects based on machine learning and focus on model development rather than the process of setting up the environment. This paper provides examples of improved MLOps pipelines that can be used for solving problems of automatic image segmentation and evaluating the quantitative characteristics of microobjects.</p>
      </abstract>
      <kwd-group>
        <kwd>Machine learning</kwd>
        <kwd>MLOps</kwd>
        <kwd>biomedical images</kwd>
        <kwd>programming</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Every year, software systems increasingly use machine learning elements. Despite the great demand
for neural networks, there is still a need for programmers with specialized knowledge including the
knowledge of development and system administrators. Special MLOps approaches are applied to speed
up the software development process and increase its reliability and ease of software support. The
purpose of this work is to develop MLOps approaches for automatic segmentation of histological and
immunohistochemical images and evaluate quantitative characteristics of cell nuclei.</p>
      <p>MLOps approaches are developed to efficiently and reliably deploy infrastructure for running
machine learning elements and provide convenient and continuous delivery and deployment of program
code on cloud systems. This is a relatively new industry that requires the development of solutions for
specific subject area. Thus, in this paper, we consider the processing of biomedical images.</p>
      <p>Usually, machine learning models are developed at the local level, which does not allow one to
quickly run the code on any other computer system for data processing on the basis of machine learning.
In most cases, such developments are used at the level of specialized laboratories and do not become
widely used. However, modern hardware and cloud computing
make it possible to use local
developments on an industrial scale. The development of specific pipelines allows automating the
process of deploying software code and increasing the efficiency of this process. Applying MLOps
approaches for software development can help to get the following advantages:
less time for preparing and launching machine learning models;
scalability;
reduction of the number of errors and elimination of contradictory situations;
EMAIL:</p>
      <p>1);
o.pitsun@wunu.edu.ua
(A.</p>
      <p>2);</p>
      <p>2023 Copyright for this paper by its authors.
CEUR</p>
      <p>ceur-ws.org
program automation;
reduction of possible risks.</p>
      <p>Currently, there are already a large number of tools that allow you to deploy infrastructure, such as
terraform. Mechanisms for continuous code delivery and deployment are also available. However, most
of these mechanisms are used in DevOps tasks.</p>
      <p>The scientific novelty of this work lies in the development of MLOps workflow for automatic
segmentation of biomedical images using the elements of deep machine learning.</p>
      <p>The purpose of our research is to improve the existing mechanisms for machine learning tasks.</p>
      <p>The object of our research is the processes of automatic creation of infrastructure for microscopic
image processing.</p>
      <p>The subject of the research is DevOps practices for the creation of CI/CD pipelines.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Literature review</title>
      <p>
        In [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], the authors emphasized on the lack of regulatory documents for MLOps and offered their
own analysis and classification of the existing documents. Based on the conducted analysis, they
proposed a 10-step pipeline.
      </p>
      <p>
        Sajid Nazir et al. conducted a detailed analysis of artificial intelligence tools for biomedical image
processing, using deep machine learning in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. In particular, the authors carried out an in-depth analysis
of artificial intelligence tools when investigating breast cancer.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], unsolved problems in machine learning relating to the analysis of health preserving means
were highlighted. The authors paid considerable attention to the problem of generating datasets for
machine learning process.
      </p>
      <p>
        In work [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], the authors analyzed the problem of organizing interaction between specialists in IT
field to solve problems based on machine learning. Therefore, the development of unified pipelines for
software deployment is currently one of the most relevant problems in the field of machine learning.
      </p>
      <p>
        Adrien Bennetot et al. in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] considered artificial intelligence tools applied on biomedical use case
applications. The authors analyzed both standard models of neural networks and the latest ones such as
transformers. Due to the analysis of trends in machine learning, modern diagnostic tools are defined.
However, there is a need to reduce the complexity of the software configuration process and configure
the interaction between different technologies.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref6 ref7 ref8 ref9">6-9</xref>
        ], approaches for the implementation of DevOps as tools for processing biomedical images
were highlighted, which made it possible to create the main elements of the pipeline.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], the authors presented a pipeline for the classification of biomedical images and the structure
of a convolutional neural network for the classification of immunohistochemical images. In work [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ],
an approach for evaluating the quantitative characteristics of microobjects for diagnosis was proposed.
The structure of the u-net neural network for automatic segmentation of biomedical images was
presented in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
      </p>
      <p>The analysis of the above publications has shown that the development of a pipeline, which can be
used in deep machine learning tools and algorithms for processing biomedical images, is an urgent task.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Problem statement</title>
      <p>To develop a unified approach for automatic segmentation and evaluation of quantitative
characteristics of microobjects, it is necessary to:
- analyze the existing tools for implementing MLOps pipeline;
- select the main components of the pipeline;
- develop a pipeline for processing images with elements of artificial intelligence.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Analysis of MLOps tools and platforms</title>
    </sec>
    <sec id="sec-5">
      <title>5. MLOps workflow for biomedical image segmentation</title>
      <p>This section presents a pipeline for automatic segmentation of images using Unet. The key stage in
this process is generation and preparation of data (images). This is one of the key differences compared
to analogues.</p>
      <p>MLOps workflow consists of 3 main components:
1. Build.
2. Deploy.
3. Monitor.</p>
      <p>The stage of data preparation includes the following steps:
- creating the directory structure for training and test samples (for example, "original", "masks");
- creating the internal directory structure for storing image masks;
- Data Labelling. The stage includes the rules for creating file names (for example, the suffix
"_mask" is added to the mask);
- changes in file size and other parameters.</p>
      <p>MLOps-workflow for biomedical images segmentation is shown in Figure 1.</p>
      <p>Image processing is an important stage because we are developing an image processing pipeline.
This stage also includes the process of histogram alignment and changes in image parameters depending
on the settings.</p>
      <p>U-net is used for image segmentation. This is a modern approach that makes it possible to use deep
learning. At the same time, it is necessary to create the architecture and select hyperparameters for
unet.</p>
      <p>After creating the architecture, it is necessary to conduct training and validate the model.</p>
      <p>The monitoring stage is one of the key stages in DevOps approaches and is aimed at analyzing the
system performance.</p>
      <p>Deployment is necessary for software release and for the use in real conditions.
6. CI/CD pipeline
microobjects
for
evaluating
the
quantitative
characteristics
of</p>
      <p>The module for evaluating the quantitative characteristics of microobjects is an important
component of the software system. Unlike other modules, this module can frequently change the code.
This is due to the need to set parameters for different types of images. To automate the process of
transferring parameter settings, it is proposed to use a separate deploy.yml file (Figure 2).</p>
      <p>In addition to the necessary entries for connection to the cloud server, this example shows the path
to the repository with the software code for launching the project of evaluating the quantitative
characteristics of microobjects.</p>
    </sec>
    <sec id="sec-6">
      <title>7. Peculiarities of using the Infrastructure as Code approach</title>
      <p>Infrastructure as Code is a modern approach for the development and implementation of software,
which makes it possible to write all the necessary elements of the server environment as software code.
This is especially convenient for solving problems with elements of deep learning. As a tool, terraform
is chosen, which allows one to use a large number of providers to deploy the project on various cloud
services, such as AWS, digitalocean, Azure, etc.</p>
      <p>An example of biomedical images is shown in Figure 3.
Configuration file structure for infrastructure deployment used in a project is shown in Figure 4.</p>
      <p>Digitalocean is chosen as the provider for conducting experiments. To connect to the cloud storage,
a token and ssh-keys are used as standard. Ubuntu server is chosen as the operating system. After
deploying the main environment, you need to install the necessary software and download the dataset
for further processing.</p>
      <p>Software code is updated using the CI/CD mechanism and taking github actions.</p>
      <p>The minimum requirements for the developed system are as follows:
RAM – 4GB
Disk Space – 25GB
1000 GB transfer</p>
      <p>OS – Ubnntu 18.X
8. Conclusions</p>
      <p>1. According to the comparative analysis, the advantages and disadvantages of the existing
systems with pipelines for automatic segmentation are highlighted. It is found that not all the systems
have the necessary functionality.</p>
      <p>2. The MLOps workflow is developed for the segmentation of biomedical images based on deep
learning with Unet elements.</p>
      <p>3. CI/CD pipeline is developed for software code delivery and deployment for evaluating the
quantitative characteristics of microobjects.</p>
      <p>4. The developed workflow can be a prototype not only for image segmentation programs, but
also for solving other problems in another subject area.</p>
    </sec>
    <sec id="sec-7">
      <title>9. References</title>
      <p>International Conference on Robotics and Automation (ICRA), pp. 5537-5544. IEEE,
2015. https://doi.org/10.1109/ICRA.2015.7139973
[18] Ciaglia, Floriana, Francesco Saverio Zuppichini, Paul Guerrie, Mark McQuade, and
Jacob Solawetz. "Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark."
arXiv preprint arXiv:2211.13523 (2022). https://doi.org/10.48550/arXiv.2211.13523
[19] Moreschini, S., Lomio, F., Hästbacka, D., &amp; Taibi, D. (2022, March). MLOps for
evolvable AI intensive software systems. In 2022 IEEE International Conference on
Software Analysis, Evolution and Reengineering (SANER) (pp. 1293-1294). IEEE.
https://doi.org/10.1109/SANER53432.2022.00155
[20] Kreuzberger, Dominik, Niklas Kühl, and Sebastian Hirschl. "Machine learning
operations (mlops): Overview, definition, and architecture." IEEE Access (2023).
https://doi.org/10.1109/ACCESS.2023.3262138
[21] Kumara, Indika, Rowan Arts, Dario Di Nucci, Willem Jan Van Den Heuvel, and
Damian Andrew Tamburri. "Requirements and Reference Architecture for MLOps:
Insights from Industry." (2022).
[22] Recupito, Gilberto, Fabiano Pecorelli, Gemma Catolino, Sergio Moreschini, Dario Di
Nucci, Fabio Palomba, and Damian A. Tamburri. "A multivocal literature review of
mlops tools and features." In 2022 48th Euromicro Conference on Software
Engineering and Advanced Applications (SEAA), pp. 84-91. IEEE, 2022.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>Testi</surname>
          </string-name>
          et al.,
          <article-title>"MLOps: A Taxonomy and a Methodology,"</article-title>
          <source>in IEEE Access</source>
          , vol.
          <volume>10</volume>
          , pp.
          <fpage>63606</fpage>
          -
          <lpage>63618</lpage>
          ,
          <year>2022</year>
          , doi: 10.1109/ACCESS.
          <year>2022</year>
          .
          <volume>3181730</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Nazir</surname>
            , Sajid,
            <given-names>Diane M.</given-names>
          </string-name>
          <string-name>
            <surname>Dickson</surname>
          </string-name>
          , and Muhammad Usman Akram.
          <article-title>"Survey of explainable artificial intelligence techniques for biomedical imaging with deep neural networks." Computers in Biology and Medicine (</article-title>
          <year>2023</year>
          ):
          <fpage>106668</fpage>
          . https://doi.org/10.1016/j.compbiomed.
          <year>2023</year>
          .106668
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Dhar</surname>
            , Tribikram, Nilanjan Dey, Surekha Borra, and
            <given-names>R. Simon</given-names>
          </string-name>
          <string-name>
            <surname>Sherratt</surname>
          </string-name>
          .
          <article-title>"Challenges of Deep Learning in Medical Image Analysis-Improving Explainability and Trust."</article-title>
          <source>IEEE Transactions on Technology and Society</source>
          <volume>4</volume>
          , no.
          <issue>1</issue>
          (
          <year>2023</year>
          ):
          <fpage>68</fpage>
          -
          <lpage>75</lpage>
          . https://doi.org/10.1109/TTS.
          <year>2023</year>
          .3234203
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Vega</surname>
            , Carlos, Miroslav Kratochvil, Venkata Satagopam, and
            <given-names>Reinhard</given-names>
          </string-name>
          <string-name>
            <surname>Schneider</surname>
          </string-name>
          .
          <article-title>"Translational challenges of biomedical machine learning solutions in clinical and laboratory settings."</article-title>
          <source>In International Work-Conference on Bioinformatics and Biomedical Engineering</source>
          , pp.
          <fpage>353</fpage>
          -
          <lpage>358</lpage>
          . Cham: Springer International Publishing,
          <year>2022</year>
          . https://doi.org/10.1007/978-3-
          <fpage>031</fpage>
          -07802-6_
          <fpage>30</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Bennetot</surname>
          </string-name>
          , Adrien, Ivan Donadello, Ayoub El Qadi, Mauro Dragoni, Thomas Frossard, Benedikt Wagner, Anna Saranti et al.
          <article-title>"A Practical guide on</article-title>
          <source>Explainable AI Techniques applied on Biomedical use case applications." arXiv preprint arXiv:2111.14260</source>
          (
          <year>2021</year>
          ). http://dx.doi.org/10.2139/ssrn.4229624
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Granlund</surname>
            , Tuomas,
            <given-names>Vlad</given-names>
          </string-name>
          <string-name>
            <surname>Stirbu</surname>
            , and
            <given-names>Tommi</given-names>
          </string-name>
          <string-name>
            <surname>Mikkonen</surname>
          </string-name>
          .
          <article-title>"Towards regulatory-compliant MLOps: Oravizio's journey from a machine learning experiment to a deployed certified medical product." SN computer Science 2</article-title>
          , no.
          <issue>5</issue>
          (
          <year>2021</year>
          ):
          <fpage>342</fpage>
          . https://doi.org/10.1007/s42979- 021-00726-1
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Reddy</surname>
            , Manjunatha, Brahmanand Dattaprakash, Sandesh Kammath, Subramanya Kn, Sumathra Manokaran, and
            <given-names>Rangaswamy</given-names>
          </string-name>
          <string-name>
            <surname>Be</surname>
          </string-name>
          .
          <article-title>"Application of mlops in prediction of lifestyle diseases." ECS Transactions 107</article-title>
          , no.
          <issue>1</issue>
          (
          <year>2022</year>
          ):
          <fpage>1191</fpage>
          . DOI 10.1149/10701.1191ecst
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Jain</surname>
            , Archit, Adarsh Malviya, Disha Bajaj, Revati Bhavsar, and
            <given-names>Amit</given-names>
          </string-name>
          <string-name>
            <surname>Savyanavar</surname>
          </string-name>
          .
          <article-title>"Brain Tumor Detection using MLops and Hybrid Multi-Cloud."</article-title>
          <source>In 2022 IEEE International Conference on Blockchain and Distributed Systems Security (ICBDS)</source>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>6</lpage>
          . IEEE,
          <year>2022</year>
          . https://doi.org/10.1109/ICBDS53701.
          <year>2022</year>
          .9936020
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Stirbu</surname>
            , Vlad,
            <given-names>Tuomas</given-names>
          </string-name>
          <string-name>
            <surname>Granlund</surname>
            , and
            <given-names>Tommi</given-names>
          </string-name>
          <string-name>
            <surname>Mikkonen</surname>
          </string-name>
          .
          <article-title>"Continuous design control for machine learning in certified medical systems</article-title>
          .
          <source>" Software Quality Journal</source>
          <volume>31</volume>
          , no.
          <issue>2</issue>
          (
          <year>2023</year>
          ):
          <fpage>307</fpage>
          -
          <lpage>333</lpage>
          . https://doi.org/10.1007/s11219-022-09601-5
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Berezsky</surname>
            , Oleh, Oleh Pitsun, Grygory Melnyk, Yuriy Batko, Bohdan Derysh, and
            <given-names>Petro</given-names>
          </string-name>
          <string-name>
            <surname>Liashchynskyi</surname>
          </string-name>
          .
          <article-title>"Application Of MLOps Practices For Biomedical Image Classification."</article-title>
          <source>In IDDM</source>
          , pp.
          <fpage>69</fpage>
          -
          <lpage>77</lpage>
          .
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Berezsky</surname>
            <given-names>Oleh</given-names>
          </string-name>
          , Pitsun Oleh, Grygoriy Melnyk, Tamara Datsko, Ivan Izonin, and
          <string-name>
            <given-names>Bohdan</given-names>
            <surname>Derysh</surname>
          </string-name>
          .
          <article-title>"An Approach toward Automatic Specifics Diagnosis of Breast Cancer Based on an Immunohistochemical Image."</article-title>
          <source>Journal of Imaging</source>
          <volume>9</volume>
          , no.
          <issue>1</issue>
          (
          <year>2023</year>
          ):
          <fpage>12</fpage>
          . https://doi.org/10.3390%2Fjimaging9010012
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Berezsky</surname>
            , Oleh, Pitsun Oleh, Bohdan Derysh, Ihor Pazdriy, Grygory Melnyk, and
            <given-names>Yuriy</given-names>
          </string-name>
          <string-name>
            <surname>Batko</surname>
          </string-name>
          .
          <article-title>"Automatic segmentation of immunohistochemical images based on U-net architecture."</article-title>
          <source>In 2021 IEEE 16th International Conference on Computer Sciences and Information Technologies (CSIT)</source>
          ,
          <source>vol. 1</source>
          , pp.
          <fpage>29</fpage>
          -
          <lpage>32</lpage>
          . IEEE,
          <year>2021</year>
          . https://doi.org/10.1109/CSIT52700.
          <year>2021</year>
          .9648669
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Nyckel</surname>
            <given-names>URL</given-names>
          </string-name>
          : https://www.nyckel.com/
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>ximilar</surname>
            <given-names>URL</given-names>
          </string-name>
          : https://www.ximilar.com/
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>hasty</surname>
            <given-names>URL</given-names>
          </string-name>
          : https://hasty.cloudfactory.com/
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>levity</surname>
            <given-names>URL</given-names>
          </string-name>
          : https://levity.ai/
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Alexandrova</surname>
            , Sonya,
            <given-names>Zachary</given-names>
          </string-name>
          <string-name>
            <surname>Tatlock</surname>
            , and
            <given-names>Maya</given-names>
          </string-name>
          <string-name>
            <surname>Cakmak</surname>
          </string-name>
          .
          <article-title>"RoboFlow: A flow-based visual programming language for mobile manipulation tasks." In 2015 IEEE</article-title>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>