<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Experimental and Exploratory Analysis of Languages Popularity According to the PYPL Index Programming</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Oleksandr Mediakov</string-name>
          <email>Oleksandr.Mediakov.sa.2019@lpnu.ua</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bohdan Korostynskyi</string-name>
          <email>bohdan.korostynskyi.sa.2019@lpnu.ua</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Victoria Vysotska</string-name>
          <email>victoria.a.vysotska@lpnu.ua</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Oksana Markiv</string-name>
          <email>oksana.o.markiv@lpnu.ua</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sofia Chyrun</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Lviv Polytechnic National University</institution>
          ,
          <addr-line>S. Bandera Street, 12, Lviv, 79013</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Osnabrück University</institution>
          ,
          <addr-line>Friedrich-Janssen-Str. 1, Osnabrück, 49076</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This article dwells upon procedure for primary and exploratory analysis of data concerning programming languages popularity according to the PYPL index. Classical, but flexible methods of cluster and correlation analysis, and mathematical statistics have been used in data analysis. Examples of various researches in the field of popularity and development of programming languages have been provided, and based on open questions in this area, set of graphic results has been constructed by data analysis, which logically has resulted in conclusions regarding the selection of models and methods for further data forecasting. Integral part of the report information content consists of executable code examples in the R programming language, which can be used to conduct similar studies.</p>
      </abstract>
      <kwd-group>
        <kwd>1 Cluster analysis</kwd>
        <kwd>correlation analysis</kwd>
        <kwd>programming languages</kwd>
        <kwd>PYPL index</kwd>
        <kwd>data forecasting</kwd>
        <kwd>data analysis</kwd>
        <kwd>mathematical statistics</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Achievements and development of modern technologies are the result of long-term people
intellectual activity with aim of solving any problems that cause discomfort or impose restrictions on
the desires, interests or the process of society and/or an individual flourishing. Computer systems
capable of performing a huge number of arithmetic, logical and derivative operations in very short
periods are vivid examples of this development achievements.</p>
      <p>Logically, in order to use the capabilities of these systems, it is necessary to have a tool for managing
them, which is programming language. The spread, diversity, and popularity of information
technologies have led to the existence of a wide class of people who use programming languages.
Because of this, the question of specific programming languages popularity in different periods is
widely discussed in the media and among the community of IT representatives.</p>
      <p>However, the question of popularity can be considered from different angles, the most logical of
which is the statistical one. Primary statistical analysis of data on the popularity of various "living"
languages will allow to see patterns and trends among developers in general. Also, mathematical
processing of data will allow to build relative indicators of growth or decline in the popularity of
languages.</p>
      <p>
        To achieve correct results, it is necessary to define clearly the tasks that will be considered in the
work. Using the usual mathematical approach, it can be argued that it is necessary to conduct data
analysis with: construction of statistics of various programming languages and in general for all,
research of regularities in the distribution of time data, detection of the presence or absence of linear
relationships between the popularity of various programming languages, and also to distinguish groups
programming languages depending on the dynamics of changes in their popularity [
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1-3</xref>
        ]. For a
highquality presentation of the results or the correct choice of the method, it is necessary to visualize the
intermediate and final results.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Related works</title>
      <p>Since the topic of programming and programming languages is broad, popular and interesting for
society, analysis of the chosen topic is carried out in various forms - scientific articles and studies,
journalistic articles, own investigations of corporations, communities, etc. Question of language
popularity is raised to trace trends in language paradigms, settings, even syntax.</p>
      <p>
        First of all, it is necessary to mention the availability of ratings that publicize current state of the
"market" of programming languages. For example, one of the most famous indices as ratings, TIOBE
programming community index [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. This rating is made exclusively for programming languages
(Turing complete), based on the number of mentions in various search engines. It is obvious that this
approach has significant drawbacks, because it takes into account unpopular articles and literature.
Thus, this rating counts only the "noise" of the programming language in the network [
        <xref ref-type="bibr" rid="ref1 ref2 ref3 ref4 ref5">1-5</xref>
        ].
      </p>
      <p>
        In contrast to this rating, more qualitative one has emerged - PYPL Popularity of Programming
Language. This rating is formed by analyzing the number of searches for programming language
courses in Google [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Data from this rating has more respect and authority among users. It is from this
rating that the data that will be analyzed in the following parts of the work was formed.
      </p>
      <p>It is logical that such platforms as GitHub or StackOverflow can keep certain statistics on
programming languages.</p>
      <p>Based on data from GitHub, the company creates an annual Github report, where it displays data on
the number of repositories, pull requests, etc., by programming language. With the help of this
information, it is possible to see a real picture of live projects and the technologies they use.</p>
      <p>
        In turn, several important ratings are formed from the data of StackOverflow, among them:
popularity of programming languages, most favorite programming languages by developers, most
willing languages. Based on these ratings, media representatives, specialists and users of programming
languages create articles and blogs. For example, a rather popular magazine in IT – Medium published
a general collection of analysis of ratings of programming languages [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. This work reveals a lot of
interesting information about the use and development of trends among developers and IT specialists,
their professional preparation, formation of requirements for job vacancies or participation in project
teams, recruiting and formation of requirements for IT projects [
        <xref ref-type="bibr" rid="ref10 ref11 ref12 ref13 ref14 ref15 ref16 ref17 ref18 ref19 ref20 ref21 ref22 ref23 ref24 ref25 ref26 ref27 ref6 ref7 ref8 ref9">6-27</xref>
        ]. However, many popular articles
and works describe general aspects, so it is possible to highlight the place of this work among others –
the work will offer a qualitative statistical analysis of the temporal evolution the popularity of
programming languages for data forecasting based on machine learning [
        <xref ref-type="bibr" rid="ref28 ref29 ref30 ref31 ref32 ref33 ref34 ref35">28-72</xref>
        ], which will allow
assessing the growth and decline of each language separately in a relative and absolute projection.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Methods</title>
      <p>For the correct solution of the given problems, it is necessary correctly to select or create algorithms
of methods. For primary statistical analysis, it is possible to build several different approaches to each
problem. It is necessary to consider the algorithms of methods that will be used in the work in the future.</p>
      <p>The simplest methods used in the work are the calculation of time series statistics. For their
calculation, the usual formulas and methods of mathematical statistics are used, which do not require
special improvements for specific data.</p>
      <p>However, there is a cluster of algorithms and methods that need to be modernized or refined for
specific data, or the methods have several options, and it is necessary to choose one of them.</p>
      <p>Several such methods belong to correlation analysis. For example, there is a need to present the
algorithm for forming a correlation matrix with its visualization in the form of a heat map.</p>
      <p>S- algorithm for filling the correlation matrix and its visualization
S1. Initialize unit matrix of the order of the number of attributes
S2. Cycle through the cells of the matrix with incremental parameters i, j.</p>
      <p>
        S2.1. If i≠j belongs to the matrix - in cell i, j assigns the value of the Pearson correlation coefficient
[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] between attributes i and j.
      </p>
      <p>S2.2. If i=j skip step.</p>
      <p>S3. Find the maximum and minimum of the matrix.</p>
      <p>S4. Create linear color change function from minimum to maximum.</p>
      <p>S5. Display table with colors according to the matrix cell values.</p>
      <p>However, work with matrices is not only found in correlation analysis, the use of matrix theory is
also necessary in cluster analysis. One of the main tasks when performing clustering is to determine the
next union node, which mathematically means finding the minimum among the elements of the lower
(or upper) part of the matrix without taking into account the main diagonal. The algorithm that performs
this task is shown in Fig. 1.</p>
      <p>start
mat, n
index =
індекс_мін(m</p>
      <p>_l)
i = n-1
index &gt; 0
index = index - i
i = i-1</p>
      <p>Output
n+index,
n-i1
end</p>
      <p>The method of determining the
position of the minimal element of a
symmetric matrix without double
cycles
m_l =
elements_under_the_ma
in_diagonal(mat)</p>
      <p>In this method, the use of double loops with conditions is enabled, and the overall estimate of the
asymptotics of the algorithm is O(n), where n is the number of diagonal elements. However, for the
general case, let m be the order of the matrix, then the complexity of the algorithm is estimated at
 (⌊ 2− ⌋). More general picture of the clustering method with the formation of a hierarchical structure
2
involves the creation of a dendogram, but there is a problem of remembering the results of the clustering
steps in the form of a certain data structure. For this purpose, we present a diagram of activities of the
clustering algorithm with memorization of intermediate results (Fig. 2).</p>
      <p>Define a set of
statistics functions</p>
      <p>Create a rectangular
matrix of properties</p>
      <p>Create a square
matrix of distances</p>
      <p>Choose a metric
Fill in the properties</p>
      <p>matrix
Calculate the distance
from the merged</p>
      <p>cluster
Delete two rows and</p>
      <p>columns at the
intersection of the
mini element</p>
      <p>Fill in the distance</p>
      <p>matrix
Find the minimum
element of the matrix
among the elements
below the main</p>
      <p>diagonal
Write the name of the
new cluster and the
name of the variable
column in the file,
leave a space
between them
Read a dataframe</p>
      <p>from a file
Normalize the
property matrix
Create a new column
and row of the
distance matrix
Write the name of the
new cluster and the
name of the variable
line in the file, space</p>
      <p>between them
Repeat the process
until the dimension
of the matrix is not</p>
      <p>equal to 1
Read data from the
file into the graph</p>
      <p>data structure
Graphically display
the graph in the form</p>
      <p>of a dendogram</p>
    </sec>
    <sec id="sec-4">
      <title>4. Experiments</title>
      <p>
        Experimental data for researching the popularity of programming languages is the time sequence of
the PYPL index value for 28 programming languages. PYPL [
        <xref ref-type="bibr" rid="ref1 ref2 ref3 ref4 ref5">1-5</xref>
        ] index was mentioned in previous
parts of the work, but the data itself was collected by Kaggle users, the corresponding access mode is
https://www.kaggle.com/muhammadkhalid/most-popular-programming-languages-since-2004.
      </p>
      <p>The initial view of data in csv format is shown in Fig. 3.</p>
      <p>Unfortunately, the data readability is minimal in this form, but it can be improved by changing the
view to the appropriate Fig. 4. It can be seen that the variables, that is, the columns, are each
programming language and the date column, respectively, each line is a record of the date and the value
of the PYPL index for that date. Accordingly, the size of the data is 207 by 29. In general, the dataset
includes data from July 2004 to September 2021. And the programming languages considered in the
data are: Abap, Ada, C/C++, C#, Cobol, Dart, Delphi.Pascal, Go , Groovy , Haskell, Java, JavaScript,
Julia, Kotlin, Lua, Matlab, Objective.C, Perl, PHP, Python, R, Ruby, Rust, Scala, Swift, TypeScript,
VBA, Visual.Basic. Such a data structure also corresponds to certain properties, for example, the sum
of all values in one row equals 100, this is because the PYPL index itself displays data in 100% language
breakdown.</p>
      <p>R language was used for data experiments and graphing, as well as such packages as dplyr, ggplot2
(in fact, most of the libraries from tidyverse), reshape2, zoo, etc. (Fig. 6-7). To begin with, the data
must be read correctly, and the problem arises that the format of the date column does not correspond
to the one available in R, so it must be converted to a new one, for example, October 2020 will be
converted into 10 2020, which in turn will be converted into a Date data type with a value of 01.10.
2020. The general function of reading data from a file and primary transformation:
}
progs.df &lt;- get_data_frame()</p>
      <p>The first task set before the work involves the construction of statistics for each of the programming
languages. It is necessary to go through the data columns and create a new dataframe with statistical
parameters.
library(moments)
source("libs_config.r")
# returns a set of statistical characteristics for the vector x
statistical_characteristics &lt;- function(x) {
c("mean" = mean(x), "median" = median(x),
"std" = sd(x), "var" = var(x), "min" = min(x),
"max" = max(x), "kurt" = kurtosis(x), "skew" = skewness(x)) %&gt;% return
}
# use of function for all columns except date
apply(progs.df[, 2:29], MARGIN = 2, FUN = statistical_characteristics) %&gt;%
t() %&gt;% data.frame() %&gt;% mutate(range = (max - min), variation = std/mean*100) %&gt;%
arrange(desc(mean)) -&gt; stats.df</p>
      <p>Although the R language has many built-in functions, it is possible to build your own more flexible
functions, for example a simple histogram construction would look like this:</p>
      <p>ggplot(progs.df$R, aes(x = R)) + geom_hist()</p>
      <p>However, in order to achieve flexibility, it is necessary to correctly determine the number of columns
(orthe step of the column), as well as experiment with the visual display, therefore, the following
functions were created to build the histogram and the probability distribution function:
}
pdf[length(counts)+1] &lt;- 1
(ggplot(data.frame(x = breaks, y = pdf), aes(x = x, y = y)) +
geom_line(size = 1.5, alpha = 0.7, col = "#72007c") +
geom_point(size = 2.5, alpha = 0.8, col = "#470068") +
theme_light() + labs(x = "R popularity", y = "Probability")) %&gt;% return
}
progs.df$R %&gt;% create_stat -&gt; stat
stat %&gt;% create_histogram -&gt; hist.plot
stat %&gt;% create_distr_plot -&gt; pdf.plot</p>
      <p>With this approach, it is possible to adjust easy the number and colors of the columns, for example
as in Fig. 7.</p>
      <p>In addition, it is possible to adjust the graphs of the time series themselves, their combination and
presentation, a script has been developed for this:
progs.df[, c("Date", row.names(stats.df))] -&gt; progs.df
# returns time series chart for languages indexed a through b
gen_plot &lt;- function(a,b, polar = F) {
progs.df[, c(1, a:b)] %&gt;% melt(id = "Date") -&gt; data.df
if(!polar) {
(ggplot(data = data.df, aes(x = Date, y = value, fill = variable, col = variable))+
geom_line()) %&gt;% return
}
else {
(ggplot(data = data.df, aes(x = Date, y = value, fill = variable, col = variable))+
geom_line(size= 1.5) + coord_polar()) %&gt;% return</p>
      <p>
        When studying the dynamics of time series, the smoothing mechanism can be used, for example, for
exponential and median filtering, the functions will look simple:
# exponential smoothing
expsmooth &lt;- function(data, alpha){
# smoothing factor 0.1 &lt;= a &lt;= 0.3
l = length(data)
y0 = data[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
res &lt;- c(alpha*data[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]+(1-alpha)*y0)
for(n in 2:l){
      </p>
      <p>
        res &lt;- append(res, alpha*data[n]+(1-alpha)*res[n-1])
}
return(res)
}
# median filtering
medianfiltering &lt;- function(data){
l = length(data)
res &lt;- c(data[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ])
for(n in 2:(l-1)){
      </p>
      <p>res &lt;- append(res, max(c(min(c(data[n-1],
data[n]))),c(min(c(data[n],data[n+1]))),c(min(c(data[n-1],data[n+1])))))
}
res &lt;- append(res, data[l])
return(res)
}</p>
      <p>
        But, for linear Kendel smoothing, there is a problem of redundancy in the construction of the
smoothing function, for example, using all the parameters, the function will look like this:
test_w7 &lt;- function(data){
if(length(data)&lt;7){
print("The array is too small")
return(NA)
}
l = length(data)
#y(1,2,3)
y1 = (13*data[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]+10*data[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]+7*data[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]+4*data[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]+data[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]-2*data[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]-5*data[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ])/28
      </p>
      <p>However, as can be seen, the matrix of coefficients is a mirror image. It can be simply “reflected”
and a function can be constructed for different w, with the first half of the matrix and the division
coefficients known:
# w = 7
wequal7 &lt;- function(data){
p = 7
l = length(data)
if(l&lt;p){
print("The array is too small")
return(NA)
}
res &lt;- c()
# values
values &lt;- c(
13,10,7,4,1,-2,-5,
5,4,3,2,1,0,-1,
7,6,5,4,3,2,1
)
addvalues &lt;- c(28,14,28,7,28,14,28)
# values
mvalues &lt;- matrix(values, nrow = p, byrow = F)
mvalues &lt;- valuemirror(mvalues)
for(n in 1:((p-1)/2)){
vres = 0
for(v in 1:p){</p>
      <p>vres &lt;- vres + mvalues[v,n] * data[v]
}
res &lt;- append(res, vres/addvalues[n])
}
for(n in ((p+1)/2):(l-(p-1)/2)){
vres = 0
for(v in 1:p){vres &lt;- vres + mvalues[v,(p+1)/2] * data[n-1-(p-1)/2+v]}
res &lt;- append(res, vres/addvalues[(p+1)/2])
}
for(n in (l+1-(p-1)/2):l){
vres = 0
for(v in 1:p){vres &lt;- vres + mvalues[v,p-l+n] * data[l-p+v]}
res &lt;- append(res, vres/addvalues[(p-l+n)])
}
}
return(res)</p>
      <p>From Fig. 8, it can be seen that the results are exactly the same, but the second option allows
parameterizing the coefficients of Kendel's formula for any known w.</p>
      <p>When conducting correlation analysis of time series, it is often necessary to build correlation
matrices and display them in the form of heat maps, etc. However, the primary task is to construct a
correlation field or dot plot, for example with ggplot2 it can be constructed as follows:
source("libs_config.r")
# кор. поле з лінією регресії для мов R та Julia
gplot_cor_1 &lt;- ggplot(progs.df, aes(x = R, y = Julia)) +
geom_point(aes(x = R, y = Julia), color="black", fill="#49ddd6",
shape=21, alpha=0.5, size=4, stroke = 1) +
geom_smooth(method=lm , color="#686868", fill="#0011f81f", se=F)</p>
      <p>This option allows to add an automatically calculated regression line to the graph using the lm()
function. To build heat maps, it is necessary first to determine the correlation matrix, and then set the
graphic display. It is important to choose the right color scheme in order quickly to determine high
correlation coefficients. It is also possible to adjust the display of the text. Examples of the correlation
matrix are shown in Fig. 9.
progs.df[, c("Python", "Java", "C", "Dart", "Julia", "R")] %&gt;% cor() -&gt; cor.matrix_mini
cor.matrix %&gt;% melt() -&gt; heat.df
cor.matrix_mini %&gt;% melt() -&gt; heat.df_mini
heat.map_mini &lt;- ggplot(data = heat.df_mini, aes(x = Var1, y = Var2, fill = value)) +
geom_tile(color = "white") #+
scale_fill_gradient2(low = "#ca0033", high = "#0043fa", mid = "#ececec",
midpoint = 0, limit = c(-1,1), space = "Lab",
name="Cor coef") + labs(x = "", y = "") + geom_text(aes(label = round(value, 3)))</p>
      <p>The last element that must be completed according to the tasks set in the work is hierarchical
clustering. To build a tree-like clustering, it is possible to use the built-in capabilities of the language:
source("task_1_stat\\create_statistics.r")
stats.df %&gt;% apply(MARGIN = 2, FUN = function(x){return((x-min(x))/(max(x)-min(x)))}) -&gt;
stats.df
stats.df %&gt;% dist(method = "minkowski", p = 2) -&gt; dist.m
hclust(dist.m) -&gt; hc
plot(hc)</p>
      <p>As a result, it is possible to get a dendogram from Fig. 10.</p>
      <p>However, there is a certain number of different methods of combining two clusters, and to achieve
flexibility in the construction of clusters, it is necessary to define new clustering functions. So, the
following functions have been developed for this work:
# conversion of the minimum number into matrix indices
convert_index &lt;- function(index, n) {
i &lt;- n-1
while(index &gt; 0) {
index &lt;- index - i
i &lt;- i-1
}
c(n+index, n-i-1) %&gt;% return
}
# property matrix normalization
normalize &lt;- function(data.df) {</p>
      <p>data.df %&gt;% apply(MARGIN = 2, FUN = function(x){return((x-min(x))/(max(x)-min(x)))})
%&gt;% return
}
# returns the distance matrix according to the Minkowski metricgenerate_dist
&lt;function(data.df, p = 2) {</p>
      <p>
        data.df %&gt;% dist(method = "minkowski", p = p) %&gt;% as.matrix() %&gt;% return
}
# the function of combining a new cluster with the deletion of two previous ones
syntez_cluster &lt;- function(dist.matrix, ij, alpha_i, alpha_j, beta, gamma, path) {
dist.matrix &lt;- as.matrix(dist.matrix)
n &lt;- dim(dist.matrix)[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
i &lt;- ij[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
j &lt;- ij[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]
cluster.name &lt;- paste(rownames(dist.matrix)[i], colnames(dist.matrix)[j], sep = "|")
#paste(paste(rownames(dist.matrix)[i], colnames(dist.matrix)[j], sep=" + "),
cluster.name, sep = " = ", dist.matrix[i,j]) %&gt;% print
new.names &lt;- c(colnames(dist.matrix), cluster.name)
dist.matrix %&gt;% cbind(0) %&gt;% rbind(0) -&gt; dist.matrix
colnames(dist.matrix)[n+1] &lt;- cluster.name
rownames(dist.matrix)[n+1] &lt;- cluster.name
first &lt;- strsplit(cluster.name, "[|]")[[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]][
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
write.table(data.frame(from = cluster.name, to = rownames(dist.matrix)[i], value =
dist.matrix[i,j], origin = first),
file = path, col.names = F, row.names = F, append = T, sep = ",")
write.table(data.frame(from = cluster.name, to = rownames(dist.matrix)[j], value =
dist.matrix[i,j], origin = first),
file = path, col.names = F, row.names = F, append = T, sep = ",")
dist.matrix %&gt;%
calc_cluster_dist(i = i, j = j, n = n+1, alpha_i = alpha_i, alpha_j = alpha_j, beta =
beta, gamma = gamma) %&gt;%
      </p>
      <p>return
}
# distance calculation function for a new cluster
calc_cluster_dist &lt;- function(dist.matrix, i, j, n, alpha_i, alpha_j, beta, gamma) {
# in general case, it is necessary that i &gt; j, however, we work with a lower triangular
matrix, where the condition is always fulfilled
#if (i &lt; j) i &lt;- (j - i) + (j &lt;- i);
for (h in seq(n)[-i][-j]) {
dist.matrix[n, h]
&lt;dist.matrix[h, n]
&lt;sum(c(alpha_i, alpha_j, beta, gamma) * c(dist.matrix[h,i], dist.matrix[h,j],
dist.matrix[i,j], abs(dist.matrix[h,i] - dist.matrix[h,j])))
}
dist.matrix[-i,-i][-j,-j] %&gt;% return
}</p>
      <p>
        With this approach, it is possible to combine capabilities of libraries for working with graphs and
get more informative dendograms. In addition, circular dendograms can be constructed. For example,
the results of one clustering can be presented in two graphs from Fig. 11, while the construction code
looks like this:
library(igraph)
library(ggraph)
library(dplyr)
# construction of dendogram with clustering
create_dendogram &lt;- function(dendo_df_file, dendo_name) {
dendo.df &lt;- read.csv(dendo_df_file)
label.df &lt;- data.frame(name = unique(c(dendo.df$from, dendo.df$to)))
label.df %&gt;% mutate(origin = NA, dist = NA, programming = NA) -&gt; label.df
for(i in seq_len(dim(label.df)[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ])) {
if (label.df$name[i] %in% dendo.df$from) {
      </p>
      <p>
        label.df$dist[i] = dendo.df$value[dendo.df$from == label.df$name[i]][
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
}
if (label.df$name[i] %in% dendo.df$to) {
label.df$origin[i] = dendo.df$origin[dendo.df$to == label.df$name[i]][
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
if(length(strsplit(label.df$name[i], "[|]")[[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]]) == 1) {
      </p>
      <p>label.df$programming[i] = label.df$name[i]
}</p>
      <p>}
}
dendo.g &lt;- graph_from_data_frame(dendo.df, vertices = label.df)
dendogram.plot &lt;- ggraph(dendo.g, layout = 'dendrogram', circular = F) +
geom_edge_elbow(aes(colour = origin), alpha=0.8, edge_width = 2) +
theme_graph(background = "white", plot_margin = margin(5, 5, 5, 5), base_size = 16) +
geom_node_point(aes(size = dist)) +
geom_node_point(aes(filter = leaf), alpha = 0.7, col = "black", size = 2) +
ylim(-1.6, NA) +
geom_node_text(aes(label = programming, filter = leaf, color = origin), angle=90 ,
hjust=1, nudge_y = -0.04)
dendogram.circ &lt;- ggraph(dendo.g, layout = 'dendrogram', circular = T) +
geom_edge_elbow(aes(colour = origin), alpha=0.8, edge_width = 2) +
theme_graph(background = "white", plot_margin = margin(5, 5, 5, 5), base_size = 15) +
geom_node_point(aes(size = 1/dist), color = "#050022") +
geom_node_point(aes(filter = leaf, ), alpha = 0.7, col = "black", size = 3) +
ylim(-1, NA) +
geom_node_text(aes(label = programming, filter = leaf, filter = leaf, angle = angle),
angle = 0, hjust = 1.1, vjust = 0.5)</p>
    </sec>
    <sec id="sec-5">
      <title>5. Results</title>
      <p>For the correct use of data, its filtering, sorting or clustering, it is necessary to conduct a classical
intelligence analysis. If to build a graph with the data of all available programming languages, then the
obtained result will not have a meaningful load, but in order to obtain a certain evolution of intelligence
analysis, such a graph is shown in Fig. 12.</p>
      <p>To improve the appearance and meaning of graphical display of data, it is necessary to divide
languages into certain groups. It is best to do this according to statistical parameters. The following
features are selected as the main statistical characteristics for these series: mean value, dispersion,
standard deviation, median, minimum, maximum, range, coefficient of variation, skewness, and
kurtosis. After sorting the data window by the average value, there is a possibility already to plot a set
of graphs (Fig. 13-15) and get certain conclusions:
 The most popular language today is Python, the value of the index for September 2021 is 29.48,
that is, almost a third of all queries refer to this language, in addition, the average popularity level is
12.26, and the maximum of 32.11 was reached in July 2020.
 On average, the most popular language during the observation period is Java - the average value
is 25.76, and the value of the index for today is 17.18.</p>
      <p>The average value of the top 10 programming languages, as well as the top ten in terms of the
maximum, are shown as slices of the data frame in Fig. 13. In addition, the dynamics of changes in the
popularity of the language can be considered in the polar coordinate system, as in Fig. 14.</p>
      <p>For example, this graph shows the uniformity of the distribution of the popularity of the JavaScript
language. However, for correct presentation of the hypothesis about the distribution, it is necessary to
use the mechanism of construction of histograms and the distribution function. To conduct the
experiment, consider the programming language R. From Fig. 4 shows that the popularity of the
language grew up to a certain time, and then stabilized, and it is impossible to determine whether the
time series is subject to a certain distribution. As a result of using the constructed functions, a histogram
in Fig. 16 is represented. This histogram shows that the popularity of R language does not lend itself to
standard density functions, however, it is possible to construct an empirical probability distribution
function to get a higherlevel picture of the distribution of the data. For this, function has been created
that uses the same parameters as the histogram. It can be seen that the distribution function does not
have a clear picture to correspond to the known distributions, perhaps this is explained by the instability
of the parameters that determine the popularity of a particular programming language. When examining
the behavior of some time series, there is a problem of a large range of data in local intervals, for
example, for the Dart programming language, the popularity in the period after 2011 (ie, from the first
non-zero values) does not have a clear or convenient trend picture.</p>
      <p>Smoothing mechanism was used to highlight the trend line close to the initial data. The following
smoothing methods were used in the work (Fig. 17-19):
 Moving average: by 3, 7, 11, 13, 15 points and 7 points of the neo-oil formula;
 Exponential: at alpha 0.2 and 0.3;
 Median filtering.</p>
      <p>The results of applying smoothing methods are shown in Fig. 17 – linear moving average, and Fig.
19 - non-linear and median. As was shown in the Experiments section, smoothing functions can be of
different levels of complexity. From the graph in Fig. 17, it can be seen that when smoothing at points
11, 13, 15, a strong distortion is obtained, and when three points are used, the difference is minimal. To
investigate the effectiveness of the moving average method, we calculate the correlation coefficient for
real data and smoothing at w=5 and w=13. The corresponding calculations are shown in Fig. 18.</p>
      <p>Nonlinear smoothing is shown in Fig. and has the best-fitting form, and it rejects most outliers. The
worst result can be considered when using the exponential method with alpha equal to 0.2, and as can
be seen from Fig. 19, the correlation coefficient is 0.83. If considering the obtained results, there is
general cyclical one might say sinusoidal dynamics of development of language popularity. Two main
peaks may have occurred during the release of the popular framework in the Dart language - Flatter, in
general, the cyclical dynamics has a tendency to grow.</p>
      <p>In order to consider the development of the popularity of all languages at the same time, it is first
necessary to simplify the task and consider the relationship between the development of some languages
(Fig. 20). To do this, first consider certain logical assumptions:</p>
      <p>1. New programming languages (Go, Dart, Julia, Nim, etc.) usually do not affect the popularity of
old languages, because new languages solve modern problems, and established languages have their
permanent niche. From this it can be assumed that there is no definite linear relationship between Dart
and C#.</p>
      <p>2. Programming languages Java and Python are considered the biggest "enemies", completely
different style in syntax, speed, community, etc. However, it is quite logical to assume that the growth
of the popularity of the Python language has a strong influence on the popularity of Java, in particular,
it should decrease.</p>
      <p>3. Since data analysis, data science, artificial intelligence and neural networks are currently the most
popular for discussion and development in IT, languages such as R or Julia are constantly increasing in
popularity, and therefore there should be a direct linear relationship between the popularity of these
languages, and both should grow.</p>
      <p>If we interpret the logical assumptions in the language of correlation analysis, then the first
assumption speaks about the absence of a linear relationship between the data and the correlation
coefficient approaching zero, the second - there is a negative linear relationship with a correlation
coefficient equal to -1, and the last one about a positive linear relationship connection with a coefficient
of +1. In order to confirm these hypotheses, correlation fields were first constructed. To finally confirm
the hypothesis, it is necessary to calculate the correlation coefficient.</p>
      <p>From the graphical representation of the dependence, it can be already seen that the initial
assumptions are correct, however, to achieve accuracy, a heat map was constructed based on the
correlation matrix for the languages described above. From the map in Fig. 20 it is visible that the
correlation coefficient between Dart and C languages is close to zero, namely 0.082, and between Julia
and R the coefficient is equal to 0.906. The most interesting thing is the behavior of the correlation
coefficient for Java, it is less than zero in almost all languages, and therefore there is a tendency to
decrease the popularity of the language in comparison with all other languages. However, Java and
Python have the highest coefficient by module, which confirms the second hypothesis.</p>
      <p>However, there is a certain problem in the presence of correlation relations between each language,
and not all of them have a certain logic. To study the general behavior of languages with each other, it
is possible to build a general heat map, which is shown in Fig. 21-22.</p>
      <p>After analyzing the correlation coefficients of all languages with each other, there is an assumption
that, in fact, the dependencies between languages should be considered at a higher level, for example,
taking into account the connections between the purposes of languages, trends in the modern IT market,
etc. But it is possible individually to study the popularity dynamics of certain programming language.
The growing popularity of the Python language has been mentione, and graphically it is also visible.
However, there is another statistical method for checking the presence of trend in a time series
autocorrelation. Using the acf function of the R language, autocorrelation was calculated at a lag of
023. The result is presented in the form of a graph in Fig. 21-22. It can be seen from the correlogram that
even at the last lag the coefficient is above 0.5, and it is quite obvious that the time series has a linear
trend.</p>
      <p>The logical conclusion of the intelligence analysis of data on the popularity of programming
languages is to conduct a simple cluster analysis. In the course of the task, clustering was carried out
with different parameters, namely with different metrics and unification strategies. In all cases, the
Minkowski metric was used, with a variable parameter p. Two graphical approaches were used to
display the results - conventional dendograms and circular ones. The vertices of the union are marked
on ordinary dendograms proportional to the distance to the union, that is, the greater the distance, the
larger the vertex, and in circular dendograms it is inversely proportional. In addition, the dendograms
have a colored grouping, which is tied to the programming languages that are the initial nodes for a
particular cluster.</p>
      <p>1. Clustering at p = 2, the nearest neighbor joining strategy is a classic approach to clustering. As
can be seen from the dendogram in Fig. 23, a different number of clusters can be distinguished at
different levels, but those programming languages that were added at the very end - Ada, PHP, Haskell,
Java and Python - are distinguished. These languages are unique enough to be clustered by the residual
principle, that is, with this metric and clustering strategy, these languages have no relatives. On the
other hand, some languages had a sufficiently small distance between them, for example, Go and
TypeScript have a metric of about 0.09, which is why they form the first cluster. According to the
colors, Go is the starting vertex for the final cluster.</p>
      <p>2. Clustering at p = 2, the unification strategy is flexible at αi = αj = 0.3535, β= 0.293, γ = 0. When
using another unification strategy, we have a slightly different clustering, which in general does not
have a fundamental difference from the previous one. The main difference is the appearance of PHP
and Java language integration node before entering the common terminal cluster. Another difference is
the new initial vertices, if in the previous configuration it was the Go language, then in this case Groovy
(for less popular languages) and PHP (for the most popular) dominate.</p>
      <p>3. Clustering at p = 3, pooling strategy of the far-neighbor. With such parameters, different picture
was obtained as a result of clustering. First, cluster of PHP, Java and Python languages was formed,
which is the top 3 in terms of average value. Kotlin was added to the cluster with Dart, Rust, TypeScript,
Go, which was not presented in this cluster with other parameters. In addition, Lua is now the main root
for clusters. In addition, with such a configuration, the tendency to form clusters at the middle level
remains, for example, languages C/C++, C# and JavaScript stick to one cluster almost always.</p>
      <p>4. Clustering at p = 0.5, pooling strategy of the nearest neighbor. This clustering is interesting
because of the parameter p, because it is less than 1.</p>
      <p>From the diagram it can be seen that separate cluster of C, C++ and JavaScript languages, a large
cluster of Scala, Julia, Cobol and Lua languages can be distinguished, and again a cluster of languages
that are far from all others - Python, Java and PHP, which will be added at the end</p>
    </sec>
    <sec id="sec-6">
      <title>6. Discussions</title>
      <p>The first important note that characterizes all data in general is the complete diversity of data for
each language. The dynamics of the development of the popularity of each language during one period
are maximally diversified, but at the same time interconnected in a certain way. When examining the
results, it was shown that it is possible to obtain the appearance of linear relationships between the
popularity of different languages that have logical connections. However, some languages are not
related to each other, their development and use are quite different, but there is a correlation between
them. If we consider the heat map from Fig. 18, it can be seen that the popularity of Swift and R and
Scala languages have a very high correlation, but in fact the languages are not related to each other,
they operate in different niches. Correlation of languages speaks of simultaneous growth in popularity,
and therefore can indicate general trends in the IT market, but not the specifics of each language. The
question arises of building such a model that can take into account the dynamics of all languages at the
same time, but at the same time the input data is only one variable - the date. Such a problem with the
one-dimensionality of the input data gives rise to the need for a large amount of training data.</p>
      <p>To begin with, conducting exploratory analysis, it was proven that the data is complete, qualitative,
suitable for some statistical tests. When constructing statistics, it can be seen that the data have various
asymmetries and excesses, and do not lend themselves to normal distributions, in addition, most graphs
of time series have obvious non-constancy. Therefore, it is impossible to select statistical parameters
for the data that will fully describe them. In addition, it is necessary to maximally correspond to the
limitations of the data, for example, the sum of all values by rows is equal to the constant 100. This
means that approaches such as networks with marginalization or Bayesian neural networks are not
suitable. In addition to the quality, but at the same time chaotic data, there is another problem - a small
number of records. This problem rejects the possibility of using multilayer neural networks.</p>
      <p>However, it is possible to distinguish certain approaches to describe the data, but for this it is
necessary to find a mathematical counterpart of the function that will describe the current state of the
data. The first and main option is a vector function. For example, a parametric vector function with
three coordinates v(t) = [sin t , t2, cos t] For our data, such a function will be 28-dimensional. The date
will act as the parameter t in the data.</p>
      <p>Such a function will be able to accurately describe the current state of the data if we choose a function
depending on t for each language, such as an approximation or interpolation polynomial. Smoothing
functions can be used to reduce the number of input points.</p>
      <p>When constructing such a function, it is possible to conduct experiments with the search for
dependencies, derivatives of different orders to study the function. However, if there is a problem of
extrapolation, it is immediately possible to assert a large absolute error.</p>
      <p>The first way to reduce the number of calculations and the error is to use the clustering results. Some
clustering languages have such small distances that they can be treated as a single variable. Then it is
possible to get a function with fewer dimensions. A reduction in error will occur when clustering is
used to generate interpolation nodes.</p>
      <p>However, it is still not possible to achieve a high predicted accuracy. Therefore, the next option is
neuron-by-neuron of the network. Such neural networks differ from most in that they do not have a
defined architecture at the beginning, but form it during the learning process, by adding a neuron or a
layer. Such networks require many times less input data, but are not suitable for multidimensional
results. To build a network that can produce vector of index values for the future with a small amount
of input data, it is possible to use the approach of graph neural networks together with clustering.
Dendogram is good initial network graph, and the distances between clusters can be used as coefficients
when passing messages. When combining methods of graph and neuron-by-neuron networks, it is
possible to get the smallest error when predicted. So, as a result of exploratory and cluster analysis, it
was concluded that the data on the popularity of programming languages are suitable for research, but
the methods of their analysis should be the most modern.</p>
    </sec>
    <sec id="sec-7">
      <title>7. Conclusions</title>
      <p>As a result of experimental and exploratory analysis of data concerning the programming languages
popularity according to the PYPL index since 2004, the presence of various trends in the dynamics of
the popularity of programming languages was revealed and the relationship between two different
languages was established. In the case of a complex consideration of time series, a hypothesis was
created regarding the complexity and absolute nonlinearity of the general trend using programming
languages, which affects the choice of models for forecasting. Some programming languages have a
sufficiently unique popularity that distinguishes them when constructing hierarchical cluster structures.
Such languages are highly popular, have the most diverse fields of application and their own niches.
An equally important result is the fact of conducting data research as such, because it indicates the
correctness, integrity and informativeness of the data, due to which such data can be used for
educational purposes.</p>
    </sec>
    <sec id="sec-8">
      <title>8. References</title>
      <p>[36] M. Bublyk, V. Feshchyn, L. Bekirova, O. Khomuliak, Sustainable Development by a Statistical
Analysis of Country Rankings by the Population Happiness Level, CEUR Workshop Proceedings,
Vol-3171(2022) 817-837.
[37] I. Khomytska, V. Teslyuk, The Method of Statistical Analysis of the Scientific, Colloquial,
BellesLettres and Newspaper Styles on the Phonological Level, Advances in Intelligent Systems and
Computing 512 (2016) 149–163.
[38] I. Khomytska, V. Teslyuk, Authorship and Style Attribution by Statistical Methods of Style
Differentiation on the Phonological Level, Advances in Intelligent Systems and Computing 871
(2019) 105–118. doi: 10.1007/978-3-030-01069-0_8.
[39] A. Vasyliuk, Y. Matseliukh, T. Batiuk, M. Luchkevych, I. Shakleina, H. Harbuzynska, S.</p>
      <p>Kondratiuk, K. Zelenska, Intelligent Analysis of Best-Selling Books Statistics on Amazon, CEUR
Workshop Proceedings Vol-3171 (2022) 1432-1462.
[40] T. Shestakevych, Modeling the Process of Analysis of Statistical Characteristics of Student Digital</p>
      <p>Text, CEUR Workshop Proceedings Vol-2870 (2021) 657-669.
[41] M. Bublyk, Y. Matseliukh, Small-Batteries Utilization Analysis Based on Mathematical Statistics
Methods in Challenges of Circular Economy, CEUR Workshop Proceedings Vol-2870 (2021)
1594-1603.
[42] V. Vysotska, O. Markiv, S. Teslia, Y. Romanova, I. Pihulechko, Correlation Analysis of Text
Author Identification Results Based on N-Grams Frequency Distribution in Ukrainian Scientific
and Technical Articles, CEUR Workshop Proceedings Vol-3171 (2022) 277-314.
[43] B.O. Kapustiy, B.P. Rusyn, V.A. Tayanov, A new approach to determination of correct recognition
probability of set objects, Upravlyaushchie Sistemy i Mashiny 2 (2005) 8-12.
[44] A.B. Lozynskyy, I.M. Romanyshyn, B.P. Rusyn, Intensity Estimation of Noise-Like Signal in
Presence of Uncorrelated Pulse Interferences, Radioelectronics and Communications Systems
62(5) (2019) 214-222.
[45] N. Romanyshyn, Algorithm for Disclosing Artistic Concepts in the Correlation of Explicitness and</p>
      <p>Implicitness of Their Textual Manifestation, CEUR Workshop Proceedings 2870 (2021) 719-730.
[46] V. Pasichnyk, T. Shestakevych, N. Kunanets, V. Andrunyk, Analysis of completeness, diversity
and ergonomics of information online resources of diagnostic and correction facilities in Ukraine,
CEUR Workshop Proceedings Vol-2105 (2019) 193-208.
[47] H. Zhengbing, V. Yatskiv, A. Sachenko, Increasing the Data Transmission Robustness in WSN
Using the Modified Error Correction Codes on Residue Number System, Elektronika ir
Elektrotechnika 21(1) (2015) 76-81.
[48] V. Yatskiv, N. Yatskiv, S. Jun, A. Sachenko, H. Zhengbing, The use of modified correction code
based on residue number system in WSN, in: Proceedings of the International Conference on
Intelligent Data Acquisition and Advanced Computing Systems, IDAACS. 2013, pp. 513-516.
[49] O. Veres, Y. Matseliukh, T. Batiuk, S. Teslia, A. Shakhno, T. Kopach, Y. Romanova, I.</p>
      <p>Pihulechko, Cluster Analysis of Exclamations and Comments on E-Commerce Products, CEUR
Workshop Proceedings Vol-3171 (2022) 1403-1431.
[50] I. Lurie, V. Lytvynenko, S. Olszewski, M. Voronenko, A. Kornelyuk, U. Zhunissova, О. Boskin,
The Use of Inductive Methods to Identify Subtypes of Glioblastomas in Gene Clustering, CEUR
Workshop Proceedings Vol-2631 (2020) 406-418.
[51] Y. Bodyanskiy, A. Shafronenko, I. Klymova, Adaptive Recovery of Distorted Data Based on</p>
      <p>Credibilistic Fuzzy Clustering Approach, CEUR Workshop Proceedings Vol-2870 (2021) 6-15.
[52] S. Babichev, B. Durnyak, I. Pikh, V. Senkivskyy, An Evaluation of the Objective Clustering
Inductive Technology Effectiveness Implemented Using Density-Based and Agglomerative
Hierarchical Clustering Algorithms, Advances in Intelligent Systems and Computing 1020 (2020)
532–553. Cham. https://doi.org/10.1007/978-3-030-26474-1_37.
[53] P. Bidyuk, A. Gozhyj, I. Kalinina, V. Vysotska, Methods for Forecasting Nonlinear
NonStationary Processes in Machine Learning, Communications in Computer and Information Science
1158 (2020) 470-485. DOI: 10.1007/978-3-030-61656-4_32.
[54] P. Bidyuk, A. Gozhyj, I. Kalinina, V. Vysotska, M. Vasilev, R. Malets, Forecasting Nonlinear
Nonstationary Processes in Machine Learning Task, in: Proceedings of the IEEE 3rd International
Conference on Data Stream Mining and Processing, DSMP, 2020, pp. 28-32. DOI:
10.1109/DSMP47368.2020.9204077.
[55] O. Soprun, M. Bublyk, Y. Matseliukh, V. Andrunyk, L. Chyrun, I. Dyyak, A. Yakovlev, M.</p>
      <p>Emmerich, O. Osolinsky, A. Sachenko, Forecasting Temperatures of a Synchronous Motor with
Permanent Magnets Using Machine Learning, CEUR workshop proceedings 2631 (2020) 95–120.
[56] N. Koval, А. Tryhuba, I. Kondysiuk, I. Тryhuba, O. Boiarchuk, M. Rudynets, V. Grabovets, V.</p>
      <p>Onyshchuk, Forecasting the Fund of Time for Performance of Works in Hybrid Projects Using
Machine Training Technologies, CEUR Workshop Proceedings Vol-2917 (2021) 196-206.
[57] R. Yurynets, Z. Yurynets, O. Budіakova, L. Gnylianska, M. Kokhan, Innovation and Investment
Factors in the State Strategic Management of Social and Economic Development of the Country:
Modeling and Forecasting, CEUR Workshop Proceedings Vol-2917 (2021) 357-372.
[58] Y. Tverdokhlib, V. Andrunyk, L. Chyrun, L. Chyrun, N. Antonyuk, I. Dyyak, O. Naum, D. Uhryn,
V. Basto-Fernandes, Analysis and Estimation of Popular Places in Online Tourism Based on
Machine Learning Technology, CEUR Workshop Proceedings Vol-2631 (2020) 457-470.
[59] I. Bodnar, M. Bublyk, O. Veres, O. Lozynska, I. Karpov, Y. Burov, P. Kravets, I. Peleshchak, O.</p>
      <p>Vovk, O. Maslak, Forecasting the Risk of Cervical Cancer in Women in the Human Capital
Development Context Using Machine Learning, CEUR Workshop Proceedings Vol-2631 (2020)
491-501.
[60] L. Podlesna, M. Bublyk, I. Grybyk, Y. Matseliukh, Y. Burov, P. Kravets, O. Lozynska, I. Karpov,
I. Peleshchak, R. Peleshchak, Optimization Model of the Buses Number on the Route Based on
Queueing Theory in a Smart City, CEUR Workshop Proceedings Vol-2631 (2020) 502-515.
[61] V. Lytvynenko, et. al., Hybrid Methods of GMDH-Neural Networks Synthesis and Training for
Solving Problems of Time Series Forecasting, Advances in Intelligent Systems and Computing
1020 (2020) 513–531.
[62] P. Bidyuk, A. Gozhyj, I. Kalinina, V. Gozhyj, Analysis of uncertainty types for model building
and forecasting dynamic processes, Advances in Intelligent Systems and Computing 689 (2018)
66-78.
[63] P. Bidyuk, A. Gozhyj, Y. Matsuki, N. Kuznetsova, I. Kalinina, Modeling and forecasting economic
and financial processes using combined adaptive models, Advances in Intelligent Systems and
Computing 1246 (2021) 395–408.
[64] V. Hnatushenko, V. Hnatushenko, N. Dorosh, N. Solodka, O. Liashenko, Non-relational approach
to developing knowledge bases of expert system prototype, Naukovyi Visnyk Natsionalnoho
Hirnychoho Universytetu 2 (2022) 112-117. https://doi.org/10.33271/nvngu/2022-2/112.
[65] R. Sytnyk, V. Hnatushenko, V. Hnatushenko, Decentralized Information System for Supply Chain
Management Using Blockchain, in: International Workshop on Intelligent Information
Technologies and Systems of Information Security, CEUR Workshop Proceedings Vol-3156
(2022) 587-598.
[66] T. Basyuk, A. Vasyliuk, Approach to a subject area ontology visualization system creating, CEUR</p>
      <p>Workshop Proceedings Vol-2870 (2021) 528–540.
[67] T. Basyuk, A. Vasyliuk, Design and Construction of a Popularizing Internet Resources System</p>
      <p>Using External Ranking Factors, CEUR Workshop Proceedings Vol-3171 (2022) 437–450.
[68] A. Vasyliuk, T. Basyuk, Construction features of the industrial environment control system, CEUR</p>
      <p>Workshop Proceedings Vol-2870 (2021) 1011–1025.
[69] O. Mediakov, T. Basyuk, Specifics of Designing and Construction of the System for Deep Neural</p>
      <p>Networks Generation, CEUR Workshop Proceedings Vol-3171 (2022) 1282–1296.
[70] V. Davydov, D. Hrebeniuk, Development the resources load variation forecasting method within
cloud computing systems, Advanced Information Systems 4(4) (2020) 128–135.
https://doi.org/10.20998/2522-9052.2020.4.18.
[71] S. Krepych, I. Spivak, Forecasting system of utilities service costs based on neural network,
Advanced Information Systems 4(4), (2020) 102–108.
https://doi.org/10.20998/25229052.2020.4.14.
[72] S. Semenov, C. Weilin, L. Zhang, S. Bulba, Automated penetration testing method using deep
machine learning technology, Advanced Information Systems 5(3) (2021) 119–127.
https://doi.org/10.20998/2522-9052.2021.3.16.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Devathon</surname>
          </string-name>
          .
          <article-title>Top 10 Best Programming Language Rankings</article-title>
          . URL: https://medium.com/@devathon_/top-10
          <string-name>
            <surname>-</surname>
          </string-name>
          best
          <article-title>-programming-language-rankings-e49a0def796c</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>PYPL</surname>
          </string-name>
          <article-title>PopularitY of Programming Language</article-title>
          . URL: https://pypl.github.io/PYPL.html.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <article-title>[3] TIOBE Index for October 2021</article-title>
          . URL: https://www.tiobe.com/tiobe-index/.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Victor</surname>
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Bloomfield</surname>
          </string-name>
          .
          <article-title>Using R for Numerical Analysis in Science and Engineering</article-title>
          . Minneapolis, USA: University of Minnesota,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Most</given-names>
            <surname>Popular Programming Languages</surname>
          </string-name>
          Since
          <year>2004</year>
          . URL: https://www.kaggle.com/muhammadkhalid/most
          <article-title>-popular-programming-languages-since-2004.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>I.</given-names>
            <surname>Rishnyak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Matseliukh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Batiuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Chyrun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Strembitska</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Mlynko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Liashenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lema</surname>
          </string-name>
          ,
          <article-title>Statistical Analysis of the Popularity of Programming Language Libraries Based on StackOverflow Queries</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          Vol-
          <volume>3171</volume>
          (
          <year>2022</year>
          )
          <fpage>1351</fpage>
          -
          <lpage>1379</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M.</given-names>
            <surname>Fedorov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Berko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Matseliukh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Schuchmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Budz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Garbich-Moshora</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mamchyn</surname>
          </string-name>
          ,
          <article-title>Decision Support System for Formation and Implementing Orders Based on Cross Programming and Cloud Computing</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          Vol-
          <volume>2917</volume>
          (
          <year>2021</year>
          )
          <fpage>714</fpage>
          -
          <lpage>748</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>D.</given-names>
            <surname>Koshtura</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Andrunyk</surname>
          </string-name>
          , T. Shestakevych,
          <article-title>Development of a Speech-to-Text Program for People with Haring Impairments</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          Vol-
          <volume>2917</volume>
          (
          <year>2021</year>
          )
          <fpage>565</fpage>
          -
          <lpage>583</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>S.</given-names>
            <surname>Tetiana</surname>
          </string-name>
          ,
          <article-title>The method of education format ascertaining in program system of inclusive education support</article-title>
          ,
          <source>in: International Scientific and Technical Conference on Computer Sciences and Information Technologies</source>
          ,
          <string-name>
            <surname>CSIT</surname>
          </string-name>
          ,
          <year>2017</year>
          , pp.
          <fpage>279</fpage>
          -
          <lpage>283</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>M.O.</given-names>
            <surname>Medykovskyi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.M.</given-names>
            <surname>Teslyuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.B.</given-names>
            <surname>Shunevych</surname>
          </string-name>
          ,
          <article-title>The use of dynamic programming for the problem of the uniform use of wind plants</article-title>
          ,
          <source>Technical Electrodynamics 4</source>
          (
          <year>2014</year>
          )
          <fpage>135</fpage>
          -
          <lpage>137</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>M.</given-names>
            <surname>Medykovskyy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Teslyuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Shunevych</surname>
          </string-name>
          ,
          <article-title>Optimization of wind power stations structure by the dynamic programming method</article-title>
          ,
          <source>Actual Problems of Economics</source>
          ,
          <volume>152</volume>
          (
          <issue>2</issue>
          ) (
          <year>2014</year>
          )
          <fpage>508</fpage>
          -
          <lpage>515</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>V.</given-names>
            <surname>Litvinenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bidyuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bardachov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Sherstjuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Fefelov</surname>
          </string-name>
          ,
          <article-title>Combining clonal selection algorithm and gene expression programming for time series prediction</article-title>
          ,
          <source>in: Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications</source>
          ,
          <year>2005</year>
          , pp.
          <fpage>133</fpage>
          -
          <lpage>138</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>V.</given-names>
            <surname>Lytvyn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Vysotska</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rzheuskyi</surname>
          </string-name>
          ,
          <article-title>Technology for the Psychological Portraits Formation of Social Networks Users for the IT Specialists Recruitment Based on Big Five, NLP and Big Data Analysis</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          Vol-
          <volume>2392</volume>
          (
          <year>2019</year>
          )
          <fpage>147</fpage>
          -
          <lpage>171</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>I.</given-names>
            <surname>Rishnyak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Veres</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Lytvyn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Bublyk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Karpov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Vysotska</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Panasyuk</surname>
          </string-name>
          ,
          <article-title>Implementation models application for IT project risk management</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          Vol-
          <volume>2805</volume>
          (
          <year>2020</year>
          )
          <fpage>102</fpage>
          -
          <lpage>117</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>A.</given-names>
            <surname>Rzheuskyi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Kutyuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Vysotska</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Burov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Lytvyn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Chyrun</surname>
          </string-name>
          ,
          <article-title>The Architecture of Distant Competencies Analyzing System for IT Recruitment</article-title>
          , in
          <source>: Proceedings of the International Conference on Computer Sciences and Information Technologies</source>
          ,
          <string-name>
            <surname>CSIT</surname>
          </string-name>
          ,
          <year>2019</year>
          , pp.
          <fpage>254</fpage>
          -
          <lpage>261</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>M.</given-names>
            <surname>Bublyk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Vysotska</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Chyrun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Panasyuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Brodyak</surname>
          </string-name>
          ,
          <article-title>Assessing Security Risks Method in E-Commerce System for IT Portfolio Management</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          Vol-
          <volume>2853</volume>
          (
          <year>2021</year>
          )
          <fpage>462</fpage>
          -
          <lpage>479</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>A. O.</given-names>
            <surname>Karpyak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O. M.</given-names>
            <surname>Rybytska</surname>
          </string-name>
          ,
          <article-title>Cluster Analysis of Motivational Management of Personnel Support of IT Companies</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          Vol-
          <volume>3171</volume>
          (
          <year>2022</year>
          )
          <fpage>1684</fpage>
          -
          <lpage>1693</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>V.</given-names>
            <surname>Yunchyk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Fedonuyk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Khomyak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Yatsyuk</surname>
          </string-name>
          ,
          <article-title>Cognitive Modeling of the Learning Process of Training IT Specialists</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          Vol-
          <volume>2917</volume>
          (
          <year>2021</year>
          )
          <fpage>141</fpage>
          -
          <lpage>150</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>A.</given-names>
            <surname>Rzheuskyi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Kutyuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Voloshyn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kowalska-Styczen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Voloshyn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Chyrun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Chyrun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Peleshko</surname>
          </string-name>
          , T. Rak,
          <article-title>The intellectual system development of distant competencies analyzing for IT recruitment</article-title>
          ,
          <source>Advances in Intelligent Systems and Computing</source>
          <volume>1080</volume>
          (
          <year>2020</year>
          )
          <fpage>696</fpage>
          -
          <lpage>720</lpage>
          . DOI:
          <volume>10</volume>
          .1007/978-3-
          <fpage>030</fpage>
          -33695-0_
          <fpage>47</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>L.</given-names>
            <surname>Chyrun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Kravets</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Garasym</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gozhyj</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Kalinina</surname>
          </string-name>
          ,
          <article-title>Cryptographic information protection algorithm selection optimization for electronic governance IT project management by the analytic hierarchy process based on nonlinear conclusion criteria</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          Vol-
          <volume>2565</volume>
          (
          <year>2020</year>
          )
          <fpage>205</fpage>
          -
          <lpage>220</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>V.</given-names>
            <surname>Sokol</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Bilova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Druzhynin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Cherkun</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Perepelytsia</surname>
          </string-name>
          ,
          <article-title>An Adaptive Algorithm for Effective Selection of Training Content for IT Professionals</article-title>
          , CEUR Workshop Proceedings, Vol2870 (
          <year>2021</year>
          )
          <fpage>1173</fpage>
          -
          <lpage>1183</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>L.</given-names>
            <surname>Halkiv</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Karyy</surname>
          </string-name>
          , I. Kulyniak,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Kis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Tsapulych</surname>
          </string-name>
          ,
          <article-title>The National System of Higher Education and Government Procurement for Its Services as Activators of the Development of IT Entrepreneurship</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          Vol-
          <volume>2870</volume>
          (
          <year>2021</year>
          )
          <fpage>1338</fpage>
          -
          <lpage>1349</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>V.</given-names>
            <surname>Andrunyk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Pasichnyk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Antonyuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Shestakevych</surname>
          </string-name>
          ,
          <article-title>A Complex System for Teaching Students with Autism: The Concept of Analysis. Formation of IT Teaching Complex</article-title>
          ,
          <source>Advances in Intelligent Systems and Computing</source>
          <volume>1080</volume>
          (
          <year>2020</year>
          )
          <fpage>721</fpage>
          -
          <lpage>733</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>O.</given-names>
            <surname>Duda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Kunanets</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Matsiuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Pasichnyk</surname>
          </string-name>
          ,
          <article-title>Cloud-based IT infrastructure for "Smart City" projects, in: Dependable IoT for Human</article-title>
          and Industry: Modeling, Architecting, Implementation,
          <year>2018</year>
          , pp.
          <fpage>389</fpage>
          -
          <lpage>410</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>M.</given-names>
            <surname>Bublyk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Rybytska</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Karpiak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Matseliukh</surname>
          </string-name>
          ,
          <article-title>Structuring the fuzzy knowledge base of the IT industry impact factors</article-title>
          ,
          <source>in: Computer sciences and information technologies</source>
          ,
          <year>2018</year>
          , pp.
          <fpage>21</fpage>
          -
          <lpage>24</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>O.</given-names>
            <surname>Veres</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Oborska</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Vasyliuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Brezmen</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Rishnyak</surname>
          </string-name>
          ,
          <article-title>Problems and peculiarities of the IT project managment of ontological engineering for person psychological state diagnosing</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          Vol-
          <volume>2565</volume>
          (
          <year>2020</year>
          )
          <fpage>162</fpage>
          -
          <lpage>177</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>O.</given-names>
            <surname>Veres</surname>
          </string-name>
          , et. al.,
          <article-title>Development and Operations - The Modern Paradigm of the Work of IT Project Teams</article-title>
          , in:
          <source>International Scientific and Technical Conference on Computer Sciences and Information Technologies</source>
          ,
          <string-name>
            <surname>CSIT</surname>
          </string-name>
          ,
          <year>2019</year>
          , 3, pp.
          <fpage>103</fpage>
          -
          <lpage>106</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>B.E.</given-names>
            <surname>Kapustiy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.P.</given-names>
            <surname>Rusyn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.A.</given-names>
            <surname>Tayanov</surname>
          </string-name>
          ,
          <source>Peculiarities of Application of Statistical Detection Criteria for Problem of Pattern Recognition/Journal of Automation and Information Science</source>
          <volume>37</volume>
          (
          <issue>2</issue>
          ) (
          <year>2005</year>
          )
          <fpage>30</fpage>
          -
          <lpage>36</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>S.</given-names>
            <surname>Babichev</surname>
          </string-name>
          , et. al.,
          <article-title>A fuzzy model for gene expression profiles reducing based on the complex use of statistical criteria and Shannon entropy</article-title>
          ,
          <source>Advances in Intelligent Systems and Computing</source>
          <volume>754</volume>
          (
          <year>2018</year>
          )
          <fpage>545</fpage>
          -
          <lpage>554</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>M.</given-names>
            <surname>Voronenko</surname>
          </string-name>
          , et. al.,
          <article-title>Analysis of the Effectiveness of an Investment Project Using Statistical Bayesian Networks</article-title>
          , in: International Conference on Advanced Computer Information Technologies,
          <string-name>
            <surname>ACIT</surname>
          </string-name>
          ,
          <year>2020</year>
          , pp.
          <fpage>408</fpage>
          -
          <lpage>411</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>S.</given-names>
            <surname>Babichev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Khamula</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Durnyak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Škvor</surname>
          </string-name>
          ,
          <article-title>Technique of gene expression profiles selection based on sota clustering algorithm using statistical criteria and Shannon entropy</article-title>
          ,
          <source>Advances in Intelligent Systems and Computing</source>
          <volume>1246</volume>
          (
          <year>2021</year>
          )
          <fpage>23</fpage>
          -
          <lpage>38</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>P.</given-names>
            <surname>Bidyuk</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Kalinina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gozhyj</surname>
          </string-name>
          ,
          <article-title>Methodology of constructing statistical models for nonlinear non-stationary processes in medical diagnostic systems</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          Vol-
          <volume>2753</volume>
          (
          <year>2020</year>
          )
          <fpage>36</fpage>
          -
          <lpage>45</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>I.</given-names>
            <surname>Gorbenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kuznetsov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Gorbenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Vdovenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Tymchenko</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Lutsenko, Studies on Statistical Analysis and Performance Evaluation for Some Stream Ciphers</article-title>
          ,
          <source>International Journal of Computing</source>
          <volume>18</volume>
          (
          <issue>1</issue>
          ) (
          <year>2019</year>
          )
          <fpage>82</fpage>
          -
          <lpage>88</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>R.</given-names>
            <surname>Kaminskyi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Kunanets</surname>
          </string-name>
          ,
          <string-name>
            <surname>A</surname>
          </string-name>
          . Rzheuskyi,
          <article-title>Mathematical support for statistical research based on informational technologies</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          Vol-
          <volume>2105</volume>
          (
          <year>2018</year>
          )
          <fpage>449</fpage>
          -
          <lpage>452</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>I.</given-names>
            <surname>Sokolovskyy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Shakhovska</surname>
          </string-name>
          ,
          <article-title>Statistical modeling of diffusion processes with a fractal structure</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          Vol-
          <volume>2488</volume>
          (
          <year>2019</year>
          )
          <fpage>145</fpage>
          -
          <lpage>154</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>