<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Analysis of Business Structures Regarding the Level of Digital Maturity Using Data Mining Methods</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Iryna Strutynska</string-name>
          <email>strutynskairy@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Halyna Kozbur</string-name>
          <email>kozbur.galina@gmail.com</email>
          <email>kozbur.igor@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Olena Sorokivska</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lesia Dmytrotsa</string-name>
          <email>dmytrotsa.lesya@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Ternopil Ivan Puluj National Technical University</institution>
          ,
          <addr-line>Ruska 56 46001 Ternopil</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>The Netherlands Loughborough University London</institution>
          ,
          <addr-line>3 Lesney Avenue, Queen Elizabeth Olympic Park, London, E20 3BS</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Cluster analysis is proposed as an unsupervised machine learning method to divide small and medium-sized businesses in Ukraine into groups based on their level and types of digital maturity. The input data used is a dataset formed by expert assessments of the state of digital technology usage in regional small and medium-sized businesses. The Digital Transformation Index "HIT" is used to numerically measure the level of digital maturity of domestic enterprises. Various approaches to building clustering models are implemented using built-in methods in the scikit-learn library for Data Mining problems. The quality of the constructed models is evaluated using three indicators. Groups of companies are identified based on similarity in understanding digital development, and a comparative analysis is performed. Performing clustering for a representative sample of domestic small and medium-sized businesses will allow understanding the current state of their use of digital technologies and developing a well-reasoned system of actions to effectively digitize entrepreneurship in Ukraine.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Data Mining Algorithms</kwd>
        <kwd>Digital Transformation</kwd>
        <kwd>ICT for Data Analysis</kwd>
        <kwd>Scikit-learn</kwd>
        <kwd>Clustering1</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>the problems is the lack of necessary knowledge among entrepreneurs regarding the
application of innovative digital technologies, as well as the insufficient number of tools
(platforms, services, or applications) that would allow them to assess the current level of
digital maturity of individual enterprises and at the same time provide a roadmap of digital
opportunities for business transformation. Clustering SMEs by the level and types of digital
maturity will allow to understand the current state of digitalization, identify problem groups
of enterprises and bottlenecks in the process of digital transformation, as well as recommend a
reasoned systemic program of actions for effective digital growth.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related works</title>
      <p>
        The process of digitalization of business and the use of digital technologies in activities is the
subject of many scientific studies. Thus, in the work of J. Cenamor, V. Parida, and J. Vincent,
the relationship between the use of digital platforms and small business performance
indicators is analyzed [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Features of the use of digital business models are highlighted in the
works of N. Ivanchenko, Zh. Kudrytska, K. Rekachynska [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], N. Kraus, O. Holoborodka,
K. Kraus [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Digital transformation is proposed to be considered as "processes that aim to
improve an economic entity by triggering significant changes in its properties through a
combination of information, computing, communications and connectivity" [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Digital
transformation affects business processes, operational procedures, and organizational
capabilities [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], requiring enterprises to update workforce skills, achieve a certain level of
digital maturity, and improve productivity and efficiency.
      </p>
      <p>
        R. Ochoa in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] summarizes and forms the semantic core of literature reviews of various
scientists regarding the definition of the digital maturity models. Domestic scientists pay
attention to factors specific to Ukraine (in particular, the low level of digital literacy of society
and cyber security, insufficient regulatory and legal regulation of digitalization), which reduce
the interest of small businesses in the digitalization of business processes [8, p. 231; 9, p. 58].
In connection with this, an important direction of scientific research in the field of digitization
is the study of the peculiarities of the formation of the digital space in Ukraine, as well as the
participation of the state in the institutional and legal regulation of this process (O.
Pishchulina [8], H. Zhekalo [9], H. Karcheva, D. Ohorodnia, and V. Open'ko [10]).
      </p>
      <p>Investigating the use of digital tools by business organizations [11, 12], the authors
developed methodologies for applying mathematical and computer modeling methods to
measure the level of digital transformations [13, 14]. The main methodological tool of this
study is cluster analysis. General problems of clustering are fully covered in the sources [15,
16]. Authors of scientific studies use diversified methods of cluster analysis, depending on the
problem to be solved. Thus, in the scientific works of C. Iyigun, M. Türkeş, I. Batmaz, C.
Yozgatligil, V. Purutçuoğlu, E. Kartal, M. Öztürk [17] and K. Sablin, E. Kagan, E. Chernova [18]
use hierarchical clustering methods, K.  Gorbatiuk, O. Mantalyuk, O. Proskurovych, O. Valkov
in [19] study fuzzy clustering methods. Cluster analysis is often used in scientific works by
both domestic and foreign authors to perform macro analysis, namely the differentiation of
socio-economic development of regions. Works [20, 23-25] are devoted to various directions
of building clusters among the regions of Ukraine. As for tasks at the micro level, many
scientific works are focused on the study of financial transactions in banking institutions and
trade organizations. The work of foreign authors, M. R. Pinto, P. K. Salume, M. W. Barbosa, P.
R. de Sousa [26], is quite interesting and informative, in which the clustering of retail trade
enterprises in relation to the levels of digital maturity according to five dimensions – strategy,
market, operations, culture and technology. It is proposed to consider culture as a driver of
digital transformation.</p>
      <p>The importance of digital education, awareness, and skills for entrepreneurship, as well as
the use of data analysis techniques in digital business transformation processes, has been
discussed in the works of domestic and foreign scientists [27-31]. However, the question of
clustering business structures by the level of digital maturity in order to develop practical
recommendations for digital transformation currently requires further study.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology for assessing the level of digital maturity of</title>
    </sec>
    <sec id="sec-4">
      <title>Ukrainian enterprises</title>
      <p>Many countries have their own methodologies, frameworks, and tools for measuring digital
maturity and digital transformation of business structures. For example, the UK uses diverse
tools (Digital Acceleration Index (DAI) (Boston Consulting Group (BCG) and Google), The
Digital Scorecard (Lloyds Bank), Digital Maturity Assessment (Department for Digital,
Culture, Media &amp; Sport (DCMS)), Digital Capability Assessment Tool (Department for
Business, Energy and Industrial Strategy (BEIS)), Digital Business Academy Assessment (Tech
Nation, a UK-based network for entrepreneurs)) based on different methodologies to
understand the situation of digital business development. Collecting and processing relevant
data provides an understanding of the development and implementation of various digital
technologies and enables the formation of digital transformation "roadmaps".</p>
      <p>The current state of digital technologies in domestic businesses sharply differs from the
world. The use of international methodologies to determine the level of digital maturity in
business using relevant indicators is not acceptable for domestic realities due to the low
overall level of the use of digital technologies in the economic space. The low level of
awareness of small and medium-sized enterprises about the opportunities for integrating
technologies into their business processes hinders the development of companies and creates
difficulties in the entry of domestic businesses into the international arena. Therefore,
research on the development of digital transformation indicators for businesses, regular
assessments of digital development, and the implementation of regular, systematic statistical
observations [11, 12] deserve special attention.</p>
      <p>It is necessary to develop our own methodology for determining the digital transformation
index of businesses with corresponding indicators that reflect the current state of affairs,
provide a deep analysis of the digital maturity indicators of business structures and take into
account their dynamics, while remaining flexible to quickly respond to new economic
processes and phenomena and ensure further alignment with international methodologies for
comparing Ukraine with the most developed countries in the world.</p>
      <p>A methodology for determining the Digital Transformation Index “HIT” of domestic SMEs
was proposed in [14]. It allows not only to evaluate the level of digital maturity of a business
structure but also obtain a vector of digital development strategy. The main indicators of the
HIT index are:


</p>
      <p>Humans (H): digital literacy (competence) of human capital, which is defined as the
ability of an employee to perform complex tasks and requirements that involve both
professional and personal digital skills.</p>
      <p>Instruments (I): use of digital tools, which includes components such as social media
management, website functioning and search engine optimization, work with
specialized business process automation systems, etc.</p>
      <p>Technologies (T): use of digital technologies, that is, the level of enterprise
infrastructure provision with necessary equipment (personal computers, laptops,
smartphones) and broadband Internet.</p>
      <p>The value of the Digital Transformation Index is calculated as a weighted sum of the
values of the three corresponding indicators:</p>
      <p>HIT =ωH ∙ ∑ ¿H + ωI ∙ ∑ ¿I + ωT ∙ ∑ ¿T , ¿ ¿ ¿
HIT ∈ [ 0 ; 1] ;
where ∑ ¿H ¿ – the aggregated indicator of the digital literacy level of the organization's
human capital; ∑ ¿I ¿ – the aggregated indicator of the functioning of digital tools integrated
into the organization's business processes; ∑ ¿T ¿ – the aggregated indicator of the
functioning of the organization's digital infrastructure; ωH, ωI, ωT – the respective weight
factors of the indicators, where ωH + ωI + ωT =1.</p>
      <p>The weight factors were obtained by expert evaluation: ωH =0.3, ωI =0.5 , ωT =0.2.</p>
      <p>The aggregated indicators ∑ ¿X ¿ for each of the indicators H, I, T are calculated using
formula:</p>
      <p>mX
∑ ¿X =∑ n(i X ) ∙ k(i X ) , ¿</p>
      <p>i=1
where ∑ ¿X ¿ – the aggregated value of indicator X (H, I, or T);
mX – the number of components of indicator X;
n(i X ) – the functioning level of the ith component of indicator X;
k(i X ) – the weight factor of the ith component of indicator X.</p>
      <p>Depending on the obtained value of the HIT index, such gradations for the levels of digital
maturity of domestic SMEs were determined: [0; 0.2) is considered very low; [0.2; 0.4) – low;
[0.4; 0.6) – medium; [0.6; 0.8) – high; and [0.8; 1] – very high.</p>
    </sec>
    <sec id="sec-5">
      <title>4. Dataset description</title>
      <p>The dataset represents the results of a survey conducted through Google Forms among
Ukrainian entrepreneurs. Thirty four representatives of various small and medium-sized
businesses registered in the Ternopil region participated in the survey. Participants were
asked to answer 29 questions related to the level of digitization of business activity based on
(1)
(2)
the components of the HIT index. The set of responses was defined as an experimental
dataset.</p>
      <p>The answers of N respondents to M questions formed a matrix of dimension ( N × M ). It is
assumed that each participant u⃗i answered each of the questions qk. Thus, each surveyed
participant is represented in the form of the vector: u⃗i={ui1 , ui2 , … , uik , … , uℑ}, where uik is
the answer of the ith participant to the kth question. Each specific vector below in the work is
considered a point.</p>
      <p>Encoding was used to transform categorical data into numeric data (Figure 2).</p>
      <p>All procedures related to data processing were performed in a specially developed software
application using Python. Python libraries used at various stages of the research:




scikit-learn – for using clustering algorithms and computing quality metrics;
scipy – for computing distance matrices based on a dataset;
matplotlib – for visualizing obtained data in the form of graphs;
pandas – for storing and manipulating a dataset in a special structure, a dataframe.</p>
    </sec>
    <sec id="sec-6">
      <title>5. Choice of Clustering Specifications</title>
      <p>After obtaining the values of the three components of the HIT index for each SME, the data
set consisted of 34 items with 3 numerical attributes. Clustering of preprocessed data using
the defined method and distance measure was performed sequentially using the number of
clusters from 2 to 8. For each obtained clustering model, quality metrics (Silhouette,
CalinskiHarabasz, and Davies-Bouldin indices) were calculated. Based on visual analysis of the
dependencies, the optimal number of clusters was selected. The Figure 3 shows the quality
index dependence plots on the number of clusters obtained for agglomerative clustering using
cosine distance and Ward linkage.</p>
      <p>As it is shown in the Figure 3, local maxima of the Silhouette index and Calinski-Harabasz
index are achieved at 3 and 8 clusters. At the same points, local minima are observed for the
Davies-Bouldin index. Considering the features of the given problem, the value of 8 clusters
seemed too large for the dataset with 34 points, so 3 clusters were chosen.</p>
      <p>Since the concept of distance metric is used only for two clustering methods:
agglomerative and OPTICS, the selection of criteria set: distance, number of clusters,
neighbors was carried out only for them. For each distance metric, the optimal number of
clusters was determined. Then, among all the used distance metrics, the one that showed the
best results for the current method was selected. The tabular result of such comparison for the
agglomerative method is shown in Table 2.</p>
      <p>A similar evaluation was conducted for each used method and distance measure. For each
of the methods used, a summary analytical table was compiled with the main characteristics
of the formed clusters (Tables 3-7). The figures also show a scatter plot of the dependence of
the HIT index on the level of use of digital instruments (on the left) and a bar chart of clusters
by HIT index value (on the right). The elements that belong to one cluster are highlighted in
the same color.</p>
      <p>1. The dataset was divided into 3 clusters using the K-means clustering algorithm. As seen
in the scatter plot in the Figure 4, the clusters almost do not intersect with each other and
contain sufficiently similar elements inside. Cluster #2 (blue dots) is clearly highlighted and is
located at the bottom of the graph in terms of the value of the HIT index to the use of digital
tools. Cluster #1 contains most of the points that are located within the intervals of both the
HIT index value and the use of digital tools. Cluster #3 is characterized by the highest index
values.</p>
      <p>Members of Cluster #1 are partially effective in using social networks but do not use their
own websites, advertising or analytics tools, while having sufficient technical equipment. The
literacy of the human capital is at an elementary level (Table 3).</p>
      <p>Cluster #2 shows similar indicators to Cluster #1, except that they do not use social
networks or use them inefficiently, and the companies lack sufficient technical equipment. In
contrast, Cluster #3 includes respondents who more effectively use the necessary digital tools:
websites, social networks, advertising, and have sufficient human capital literacy.</p>
      <p>2. Using the agglomerative method, the Euclidean distance measure and Ward linkage
allowed for a fairly good result in dividing into 3 clusters (Figure 5). It can be noted that there
is a fairly good separation of Cluster #2 (blue dots), which contains respondents with the
lowest HIT index values. Additionally, Clusters #1 and #3 are fairly spread out in space,
although they do overlap in a few points. Comparison of the main characteristics of the
formed clusters is presented in the Table 4.</p>
      <p>HIT index value by participants</p>
      <p>Cluster #1 members, who belong to the area with the highest indicator values, effectively
use the website and social media, and also have a level of digital literacy that is at or above the
average for most respondents. In contrast, Cluster #2 is characterized by ineffective use of
digital tools for most members, as well as low digital literacy and unsatisfactory technical
equipment for more than half of the surveyed. Cluster #3 has a certain intensity of social
media use, but low indicators in other areas, such as elementary level of digital literacy among
employees.
100.0%</p>
      <p>Cluster # 3 (17)
Ranges of Indicator values:</p>
      <p>H є [0; 0,364]
I є [0,097; 0,614]</p>
      <p>T є [0,75; 1]
Weighted Sum (HIT) є [0,28; 0,56]</p>
      <p>Status</p>
      <p>Percentage
of cases
68.9%
70.3%
Use of specialized
management
systems
3. Using OPTICS with Chebyshev distance metric and a minimum of 7 points for cluster
formation. Despite obtaining an optimal value for quality metrics, the clustering itself was not
successful from a practical standpoint. As can be seen in the visualization in the Figure 6, the
clusters contain almost the same number of members. Additionally, the clusters were
distributed as internal and external, making it impossible to establish fundamental differences
between them, as seen in the analytical Table 5.</p>
      <p>HIT index value by indicator “I”</p>
      <p>HIT index value by participants</p>
      <p>The reason for this result is that OPTICS belongs to density-based algorithms, and the
basic data set does not contain dense areas. Therefore, the internal cluster (green) turned out
to be an artificial area with dense values, while the external one was marked as outliers,
meaning values that do not carry any value.</p>
      <p>4. The Affinity Propagation method doesn’t depend on the number of clusters and distance
measures, so its results represent the inherent data structure without any user influence. As
seen in the Figure 7 and Table 6, the data was divided into 6 clusters. Some of the clusters
(such as #1, #5 and #6) are quite distinct from the others. At the same time, clusters #2, #3 and
#4 overlap somewhat with other clusters. The distribution of respondents based on the value
of the HIT index clearly highlights the cluster leader (#5), as well as the clusters with the
lowest values (#2 and #4). Clusters #1, #3 and #6 consist of respondents with average and
above-average values of the index.</p>
      <p>Clusters #1, #3 and #5 are quite similar to each other, as can be seen from the table.
However, it is interesting that about 2/3 of the participants in cluster #1 are successfully using
the website and social media, although they rate the level of human capital literacy as
elementary.</p>
      <p>OPTICS
Website availability, optimization and
effectiveness
Social media
effectiveness
availability</p>
      <p>and
Use of online advertising and analytics
Use of specialized management systems
Use of specialized technical systems
Level of technical support</p>
      <p>Cluster # 1 (18)
Ranges of Indicator values:</p>
      <p>H є [0; 0,364]
I є [0,128; 0,614]</p>
      <p>T є [0,7; 1]
Weighted Sum (HIT) є [0,23; 0,56]</p>
      <p>Cluster # 2 (16)</p>
      <p>Percentage
of cases
61.1%
50.0%
74.1%</p>
      <p>79.2%
0.327
HIT index value by indicator “I”</p>
      <p>In contrast, cluster #4 has a high value of digital literacy, but only slightly more than half
of the participants are successfully using digital technologies (given the size of the cluster, this
may be within the margin of error). Cluster #5 is the smallest, but consists of respondents with
the highest level of digital tool usage and transformation index value. Clusters #2 and #4 are
characterized by inefficient use of digital resources. The difference between them lies in the
value of the digital literacy indicator. Cluster #6 is also interesting, as it showed the
effectiveness of social media use at low levels of other indicators.</p>
      <p>5. The Gaussian Mixture Expectation-Maximization soft clustering algorithm divided the
dataset into 3 clusters; visualization is shown in the Figure 8. Cluster #2 (blue dots) is dense,
with its HIT index values falling in the interval with the mean values, indicating the use of
digital tools. Slightly higher values can be observed in cluster #3, which is also well grouped.</p>
      <p>In contrast, the largest cluster #1 is very dispersed and contains points with both the
lowest and highest values of HIT index components. The points in this cluster, shown in
green, are located around the perimeter of the scatter plot. Such dividing is likely due to the
initial dataset being far from a normal distribution.</p>
      <p>In Cluster #1, half of the respondents do not use digital tools, although almost 70% of those
surveyed claim to have an average or high level of digital literacy. In Cluster #2, the majority
do not use modern capabilities, despite that all respondents have a basic level of technical
means.
100.0%
100.0%
60.0%
86.7%</p>
      <p>The Cluster #3 shows moderate success in using simple tools, such as a website and social
networks, provided that 80% of respondents consider the digital competencies of their
employees to be basic. Another observation is that half of the respondents use, for example,
analytics and half do not, making it impossible to identify precise distinguishing features
between the clusters.</p>
      <p>HIT index value by indicator “I”</p>
      <p>HIT index value by participants</p>
      <p>Analytical data with the main characteristics of the formed clusters are presented in the
Table 7.
Main characteristics of the formed clusters by the Gaussian Mixture (EM-method)
Cluster # 1 (17)</p>
      <p>Cluster # 2 (9)</p>
      <p>It is worth noting that the level of digital literacy of employees has a significant impact on
the overall state of digitalization of the enterprise. If the level of digital literacy of employees
is defined as elementary, then such an enterprise lacks websites, social networks and other
used tools. As the digital literacy of employees increases, the percentage of use of tools and
technologies increases, so investing in people is seen as an important contribution to the
success of digitalization. It is interesting that the level of technical equipment does not have a
significant impact on the overall digital level of the enterprises.</p>
    </sec>
    <sec id="sec-7">
      <title>6. Conclusions</title>
      <p>The paper presents 5 data clustering models for understanding the current state of
digitalization of business processes among small and medium-sized enterprises in the Ternopil
region of Ukraine. The Digital Transformation Index "HIT" was used for numerical
measurement of the current level of digital maturity of domestic enterprises. Clustering of
enterprises was based on numerical values of three indicators – components of the Digital
Transformation Index. A special software application was developed in Python programming
language for solving the task. Various approaches to clustering model construction were
implemented using built-in methods of the scikit-learn library for Data Mining problems. Four
hard clustering methods (K-Means, Affinity Propagation, Hierarchical clustering, OPTICS) and
one soft clustering method using the EM algorithm (Gaussian Mixture) were used. The
Silhouette Index was used as the main quality metric. From the perspective of similarity
between elements within groups and differences between different clusters, the best results on
the dataset were demonstrated by Affinity Propagation, Ward's hierarchical clustering with 3
clusters, and K-Means with a division into 3 clusters. Analysis of the constructed models
showed that high values of quality metrics do not always indicate an optimal and effective
division into groups that can be successfully interpreted. New valuable ideas were obtained
regarding the importance of individual components of the Digital Transformation Index.
Common features of the obtained groups of enterprises, their strengths and weaknesses in the
use of digital tools and digital literacy of human capital were identified. In the future, stable
formed clusters can be used for classifying new surveyed enterprises and identifying
significant attributes with the greatest impact on the value of digital maturity of the subject or
for developing a methodology for providing recommendations to improve the level of digital
maturity of the enterprise.
7. References
https://www.researchgate.net/publication/351975241_Digital_Maturity_Models_A_Syste
matic_Literature_Review.
[8] O. Pischulina, Digital economy: trends, risks and social determinants: report, Tsentr
Razumkova, 2020, 271 p. URL:
https://razumkov.org.ua/uploads/article/2020_digitalization.pdf.
[9] H. Zhekalo, Digital economy of Ukraine: problems and prospects of development,
Naukovyj visnyk Uzhhorods'koho natsional'noho universytetu, Seriia: Mizhnarodni
ekonomichni vidnosyny ta svitove hospodarstvo, vol. 26, no. 1, (2019): 56–60.
[10] H. Karcheva, D. Ohorodnia, and V. Open'ko, Digital economy and its impact on the
development of national and international economy, Finansovyj prostir, vol. 3, no. 1,
(2017): 13–21.
[11] I. Strutynska, L. Dmytrotsa, H. Kozbur, O. Hlado, P. Dudkin and O. Dudkina,
Development of Digital Platform to Identify and Monitor the Digital Business
Transformation Index, in: Proceedings of the 15th International Conference on Computer
Sciences and Information Technologies (CSIT), Zbarazh, Ukraine, September 23, 2020, pp.
171-175, doi: 10.1109/CSIT49958.2020.9322016.
[12] I. Strutynska, L. Dmytrotsa, H. Kozbur, L. Melnyk, O. Hlado. Developing Practical
Recommendations for Increasing the Level of Digital Business Transformation Index, in:
Proceedings of the 16th International Conference on ICT in Education, Research and
Industrial Applications. Integration, Harmonization and Knowledge Transfer, volume II:
Workshops of ICTERI, Part III: 8th International Workshop Information Technology in
Economic Research (ITER 2020), Kharkiv, Ukraine, October 06-10, 2020, pp. 351-362. URL:
https://ceur-ws.org/Vol-2732/20200351.pdf.
[13] I. Strutynska, L. Dmytrotsa, H. Kozbur, L. Melnyk, System-Integrated Methodological
Approach Development to Calculating the Digital Transformation Index of Businesses,
in: Proceedings of the 16th International Conference on ICT in Education, Research and
Industrial Applications. Integration, Harmonization and Knowledge Transfer, volume I:
Main Conference (ICTERI 2020), Kharkiv, Ukraine, October 06-10, 2020, pp. 373-379. URL:
http://ceur-ws.org/Vol-2740/20200373.pdf.
[14] I. Strutynska, L. Dmytrotsa, H. Kozbur, L. Melnyk, The Digital Business Transformation
Index Determining and Monitoring: Development of a National Online Platform, in:
Proceedings of the 1st International Workshop on Information Technologies: Theoretical
and Applied Problems, ITTAP 2021, Ternopil, Ukraine, 2021, pp. 327-334.
[15] H. Cuesta, S. Kumar, Practical Data Analysis. Birmingham, Packt Publishing Ltd, 2016.
[16] Data Science &amp; Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting</p>
      <p>Data, EMC Education Services. Indianapolis, John Wiley &amp; Sons, Inc, 2015.
[17] C. Iyigun, M. Türkeş, I. Batmaz, C. Yozgatligil, V. Purutçuoğlu, E. Kartal, M. Öztürk,
Clustering current climate regions of Turkey by using a multivariate statistical method.</p>
      <p>Theoretical and Applied Climatology, 114 (2013): 95-106.
[18] K. Sablyn, E. Kahan, E. Chernova, Clustering of coal mining regions of Russia: investment
and innovation activity. Journal of New Economy, 21 (1) (2020): 89-106.
[19] K. Gorbatiuk, O. Mantalyuk, O. Proskurovych, O.Valkov, Application of Fuzzy Clustering
to Shaping Regional Development Strategies in Ukraine, Proceedings of the 6th
International Conference on Strategies, Models and Technologies of Economic Systems
Management (SMTESM 2019), 2019, pp. 271-276.
[20] T. Paianok, Y. Vazhaliuk, Cluster analysis of labor potential of Ukraine. Economy and</p>
      <p>State, 12 (2019): 109-114.
[21] S. Behun, Application of cluster analysis to study the demographic situation in the
region. Economic Journal of Lesya Ukrainka East European National University, 2 (2016):
122-128.
[22] S. Synytsia, O. Vakun, Clustering of regions by level of economic potential. Economy and
society Mukachevo State University, 12 (2017): 776-784.
[23] L. Zomchak, Y. Dobrotii, Clustering of regions of Ukraine by competitiveness.</p>
      <p>Proceedings of the International scientific-practical conference Administrative-territorial
vs economic spatial borders of regions, KNEU, 2020, pp. 328-332.
[24] V. Aulin, O. Lyashuk, O. Pavlenko, D. Velykodnyi, A. Hrynkiv, S. Lysenko, et al.,
"Realization of the Logistic Approach in the International Cargo Delivery System",
COMMUNICATIONS, vol. 21, no. 2, pp. 3-12, 2019.
[25] Petraška, A.; Čižiuniene, K.; Jarašuniene, A.; Maruschak, P.; Prentkovskis, O. Algorithm
for the assessment of heavyweight and oversize cargo transportation routes. J. Bus. Econ.</p>
      <p>Manag. 2017, 18, 1098–1114
[26] The path to digital maturity: A cluster analysis of the retail industry in an emerging
economy Marcelo Rezende Pinto, Paula Karina Salume, Marcelo Werneck Barbosa, Paulo
Renato de Sousa https://doi.org/10.1016/j.techsoc.2022.102191.
[27] M. Halkidi, Y. Batistakis, M. Vazirgiannis, Clustering algorithms and validity measures,
in: Proceedings of the Thirteenth International Conference on Scientific and Statistical
Database Management. SSDBM, Fairfax, VA, USA SourceIEEE Xplore, July 18, 2001.
doi:10.1109/SSDM.2001.938534.
[28] Clustering, 2022. URL: https://scikit-learn.org/stable/modules/clustering.
[29] E. Zuccarelli, Performance Metrics in Machine Learning — Part 3: Clustering, 2021. URL:
https://towardsdatascience.com/performance-metrics-in-machine-learning-part-3clustering-d69550662dc6.
[30] N. Bolshakova, F. Azuaje, Cluster validation techniques for genome expression data,
volume 83 of Signal Processing, Issue 4, April 2003, pp. 825-833. doi:
10.1016/S01651684(02)00475-9.
[31] Yavorskyi, A.V.; Karpash, M.O.; Zhovtulia, L.Y.; Poberezhny, L.Y.; Maruschak, P.O. Safe
operation of engineering structures in the oil and gas industry. J. Nat. Gas Sci. Eng. 2017,
46, 289–295.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>OECD</given-names>
            <surname>Policy</surname>
          </string-name>
          <article-title>Responses on the Impacts of the War in Ukraine “Digitalisation for recovery in Ukraine“</article-title>
          ,
          <year>2022</year>
          . URL: https://www.oecd.org/ukraine-hub/
          <article-title>policy-responses/digitalisation-for-recovery-inukraine-c5477864/.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J.</given-names>
            <surname>Cenamor</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Parida</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Wincent</surname>
          </string-name>
          ,
          <article-title>How entrepreneurial SMEs compete through digital platforms: The roles of digital platform capability, network capability and ambidexterity</article-title>
          ,
          <source>Journal of Business Research</source>
          , Julay, vol.
          <volume>100</volume>
          , (
          <year>2019</year>
          ), pp.
          <fpage>196</fpage>
          -
          <lpage>216</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>N.</given-names>
            <surname>Ivanchenko</surname>
          </string-name>
          , Zh.
          <article-title>Kudryts'ka. and K. Rekachyns'ka, Business models in the conditions of digital transformations, Vcheni zapysky TNU imeni V. I. Vernads'koho, Seriia: Ekonomika i upravlinnia</article-title>
          , vol.
          <volume>3</volume>
          , no.
          <volume>31</volume>
          (
          <year>2020</year>
          ), pp.
          <fpage>185</fpage>
          -
          <lpage>190</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>N. M.</given-names>
            <surname>Kraus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O. P.</given-names>
            <surname>Holoborod</surname>
          </string-name>
          <article-title>'ko, and</article-title>
          <string-name>
            <surname>K. M. Kraus</surname>
          </string-name>
          ,
          <article-title>Digital economy: trends and perspectives of the abangard change of development, Efektyvna ekonomika</article-title>
          , vol.
          <volume>1</volume>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A.</given-names>
            <surname>Annarelli</surname>
          </string-name>
          et al.
          <article-title>Literature review on digitalization capabilities: Co-citation analysis of antecedents, conceptualization and consequences Technol</article-title>
          .
          <source>Forecast. Soc. Change</source>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J.</given-names>
            <surname>Mero</surname>
          </string-name>
          et al.
          <article-title>An effectual approach to executing dynamic capabilities under unexpected uncertainty Ind</article-title>
          .
          <source>Market. Manag</source>
          ,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Digital</given-names>
            <surname>Maturity Models: A Systematic Literature Review</surname>
          </string-name>
          May
          <year>2021</year>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>030</fpage>
          -69380-
          <issue>0</issue>
          _5 URL:
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>