=Paper=
{{Paper
|id=Vol-3241/paper2
|storemode=property
|title=Relaxation Time as Unique Characteristic for Networks Clustering
|pdfUrl=https://ceur-ws.org/Vol-3241/paper2.pdf
|volume=Vol-3241
|authors=Andrii Snarskii,Dmytro Lande,Oleh Dmytrenko
|dblpUrl=https://dblp.org/rec/conf/its2/SnarskiiLD21
}}
==Relaxation Time as Unique Characteristic for Networks Clustering==
<pdf width="1500px">https://ceur-ws.org/Vol-3241/paper2.pdf</pdf>
<pre>
Relaxation Time as Unique Characteristic for Networks
Clustering
Andrii Snarskii 1,2, Dmytro Lande 1,2 and Oleh Dmytrenko 1,2
1
  National Technical University «Igor Sikorsky Kyiv Polytechnic Institute», 37, Prosp. Peremohy, Kyiv, 03056,
Ukraine
2
  Institute for Information Recording of National Academy of Sciences of Ukraine, 2, Mykoly Shpaka Street, Kyiv,
03113, Ukraine

                 Abstract
                 This paper researches new unique characteristics of networks – a network relaxation time and
                 an individual node relaxation time, which characterize the stability of a complex network and,
                 accordingly, each node separately to external perturbations. While researching of the complex
                 networks, it is assumed that relaxation time is the number of iterative steps of the corresponding
                 algorithm required to achieve the initial equilibrium numerical values of a certain characteristic
                 after some external perturbation. In other words, the network relaxation time for each node
                 characterize the resistance of a complex network and the individual node relaxation time
                 characterize the resistance of each node to external perturbations, accordingly. In this work, to
                 compute the relaxation time, the decelerated iterative HITS algorithm is used. It is shown, that
                 these characteristics are unique numerical characteristic of network nodes, and they can be
                 used to find the centroids of clusters and combine nodes into groups according to these
                 characteristics – for complex networks clustering. The approbation of the presented
                 characteristics of the relaxation time and the individual relaxation time was carried out on the
                 example of clustering of random networks with clearly expressed clusters. In particular, a
                 randomly generated matrix with dimension 30×30 and 3 clusters and a matrix with dimension
                 100×100 and 4 clusters were researched.

                 Keywords 1
                 Complex Network, Network Relaxation Time, Individual Node Relaxation Time, HITS,
                 PageRank, Clustering, Centroids of Clusters

1. Introduction

Complex networks are widespread in nature and technology. For example, networks such as the World
Wide Web, peer-to-peer networks [1] and others are complex [2, 3]. They have non-trivial topological
properties and, therefore, are of great interest to research.
    Despite the fact that various networks, such as electrical, transport, and information networks, fall
within the scope of the consideration of the theory of complex networks, the greatest contribution to
the development of this theory is made by research of social networks [4], where binary relationships
between people in a group can be represented in the form of a network, so these networks also play an
important role. In these networks, each object is a node, and connections between nodes are an edge or
link [5, 6].
    An analysis of complex networks that includes the study of statistical and dynamical structural
properties changes that characterise their behaviour, as well as the prediction of the evolution of such
networks, are important directions [7].
    Division of a set of network nodes (objects) into subsets (clusters), within which these objects are
strongly connected, and the connection between individual clusters is relatively small, i.e. partitioning
into clusters and, for example, detecting communities in social networks [8, 9] is an urgent task.

XXI International Scientific and Practical Conference "Information Technologies and Security" (ITS-2021), December 9, 2021, Kyiv, Ukraine
EMAIL: asnarskii@gmail.com (A. Snarskii); dwlande@gmail.com (D. Lande); dmytrenko.o@gmail.com (O. Dmytrenko)
ORCID: 0000-0002-4468-4542 (A. Snarskii); 0000-0003-3945-1178 (D. Lande); 0000-0001-8501-5313 (O. Dmytrenko)
              © 2021 Copyright for this paper by its authors.
              Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
              CEUR Workshop Proceedings (CEUR-WS.org)


                                                                                                                                     13
    There are many methods of clustering graphs of complex networks [10, 11], which differ in the idea
of combining similar nodes. Since different models use different algorithms, different clustering models
are distinguished, in particular, such as connectivity models, centroid, statistical, group, neural models
and others. For example, there are models based on connectivity. In these models, objects that are closer
in space are more similar (related). And also, there are models based on finding the centroid. In these
models, clusters are represented in the form of a central vector.
    In this work, we propose to use a new characteristic for clustering complex networks – the relaxation
time [12, 13].

2. Cluster network analysis
Cluster network analysis, as a rule, solves the problem of two-criteria optimization, namely:
  1. Within each cluster 𝐾, elements (nodes) should be connected as much as possible, that is

                                                  𝑎 → 𝑚𝑎𝑥.
                                                                                                      (1)
                                          ∈
                                          ∈
   Hereinafter, 𝑎 is some estimate of the relationship between the elements with indices 𝑖 and 𝑗, which
are included in the cluster 𝐾. 𝑎 can take non-negative values. This estimate can be calculated in various
ways, for example, as the shortest path between nodes 𝑖 and 𝑗 [14].
   2. The connectivity between any separate different clusters, for example, 𝐾 and 𝐾 should be
minimal, that is

                                                  𝑎 → 𝑚𝑎𝑥 .
                                          ∈
                                          ∈
                                                                                                      (2)

   Often, in the general case, the sums for all indices are estimated (𝑁 is a number of clusters):

                                  𝑎 → 𝑚𝑎𝑥 ,                   𝑎 → 𝑚𝑖𝑛 .
                              ∈                           ∈                                           (3)
                              ∈                           ∈

    By association, in our case, the first factor in the formula for determining the weight 𝑤 corresponds
to the first requirement, if it is of great importance, then it is connected by strong ties with a certain
number of nodes (including from its own group). If the parameter 𝑥 is of great importance, then the
bonds are concentrated within their own group (cluster) – this corresponds to requirement 2.
    We can assume that the individual nodes with the highest discriminant weight (the number chosen
in advance) will constitute the centroids for clustering, or the basis for formation of clusters that will
form the basis of the clusters.
    There are numerous clustering algorithms such as K-means [15], LSA/LSI [4], most of which are
used for networks. Recently, the algorithm based on the modularity property has been widely used due
to the fact that it is built into the Gephi system [16].
    By definition, modularity is equal to the proportion of the total number of edges that fall into a given
cluster minus the predicted numbers of edges that would fall into the same groups if they were randomly
distributed. The value of modularity lies in the interval [0,1]. For a given partitioning of network nodes
into clusters, modularity shows how many links in clusters in comparison to the links that are randomly
distributed between all nodes without paying attention to clusters. There are various methods for
calculating modularity. In the most commonly accepted version of the concept, edges are randomized
in such a way that the degree of each node is preserved. The difference between the real number of
edges between nodes 𝑖 and 𝑗 and the predicted number of edges between them is
                                             𝑘𝑘
                                              𝑎  ,                                             (4)
                                             2𝑚
where 𝑚 is a number of edges in network, 𝑘 and 𝑘 are degrees of the nodes 𝑖 and 𝑗, respectively.


                                                                                                         14
   The modularity is defined as the sum of all pairs
                                       1            𝑘𝑘
                                  𝑄            𝑎       𝛿 𝑐 ,𝑐 ,                                   (5)
                                      2𝑚            2𝑚
                                           ,

where 𝛿 is Kronecker delta (shows whether nodes 𝑖 and 𝑗 are in the same cluster).
   The matrix formulation of modularity is as follows. By definition, if the node belongs to the cluster
then 𝑆 is equal to 1and else it is equal to 0. Then

                                      𝛿 𝑐 ,𝑐           𝑆 𝑆 ,                                      (6)

                         1                     𝑘𝑘               1
                   𝑄                  𝑎           𝛿 𝑐 ,𝑐          𝑇𝑟 𝑆 𝐵𝑆 ,                       (7)
                        2𝑚                     2𝑚              2𝑚
                              ,

where 𝑆 is a matrix, having elements 𝑆 , and 𝐵 is the so-called modularity matrix, which has elements
(4).
    The modularity of a network without clusters is equal 0 cause the sum of all columns and rows of
the matrix that corresponds modularity is equal to 0.
    Figure 1 shows a fragment of the interface of the Gephi system, which implements the calculation
of the modularity function.


Figure 1: Fragment of the interface for calculating the modularity in the Gephi system

3. Physical meaning of relaxation time

In physics, relaxation is understood as the process of establishing thermodynamic (including stochastic)
equilibrium in a physical system. It is assumed that the system consists of a large number of particles.
    The system is characterized by a large number of parameters, and different sets of parameters move
towards equilibrium at different speeds. The parameters characterizing the microstate of the system are
the first to relax. And ultimately, we can talk about local equilibrium when the thermodynamic
parameters relax, such as example, temperature, the concentration of particles or charges, etc.
    For such phenomenological parameters responsible for transfer processes in physical systems, the
Le Chatelier-Brown principle was formed [17]. If the physical quantity 𝑓 characterizing the system

                                                                                                     15
(temperature, charge density, etc.) has the value 𝑓 і |𝑓                     𝑓 |/𝑓 ≪ 1, then in the first approximation
𝑓 that go to equilibrium is given by the kinetic equation
                                         𝑑𝑓                                                                       (8)
                                                          𝜆 𝑓          𝑓 ,
                                         𝑑𝑡
   The solution of this equation is
                                                                                                                  (9)
                                   𝑓 𝑡        𝑓           𝑓 0           𝑓 𝑒
where 𝑓 0      𝑓 is an initial perturbation, and
                                                            1                                                    (10)
                                                      𝜏
                                                            𝜆
is a relaxation time.
    So, for example, for an environment with conductivity 𝜎 and the relative permittivity 𝜀, Maxwell's
equation and Ohm's law𝚥⃗ 𝜎𝐸⃗ are can be written as
                                                               𝜕𝜌                                                (11)
                                𝑑𝑖𝑣𝜀𝐸⃗      4𝜋𝜌,                         𝑑𝑖𝑣𝚥⃗   0
                                                               𝜕𝑡
where 𝐸⃗ is an electric field strength, 𝚥⃗ is a charge density.
  In the simplest case of a homogeneous environment
                                            𝜕𝜌          𝜎                                                        (12)
                                                      4𝜋 𝜌              0
                                            𝜕𝑡          𝜀
the next equation immediately follows
                                                                                                                 (13)
                                             𝜌 𝑡           𝜌 𝑒
where the relaxation time is
                                                     𝜀                                              (14)
                                                  𝜏
                                                   4𝜋𝜎
i.e., the higher conductivity 𝜎 and the lower relative permittivity 𝜀, the faster the heterogeneity of the
charge distribution is resolved, i.e., the equilibrium is reached faster.
The concept of relaxation is used in the methods for approximate solution of systems of linear equations,
in iterative methods, for example, in methods of coordinate relaxation, block and group relaxation, etc.
    These examples can be described in the following terms. Let us consider a system of equations 𝐿⃗𝑓
0, the solution of which is the function 𝑓 . To find solutions, some initial approximation 𝑓 of the
function 𝑓 is chosen and such an iterative procedure is found. In explicit form it is represented as
                                              𝑓                𝑇𝑓                                                (15)
or in implicit form as
                                            𝐹 𝑓           ,𝑓           0                                         (16)
that is, there is such an operator 𝑇 that
                                                 lim 𝑓             𝑓                                             (17)
                                                 →
    Here we assume an approach and terminology that allows us to consider the described methods in
the only way.
    Let us consider a system described by a set of parameters (numbers, functions) 𝑓. The equilibrium
state of the system is given by equations (11-12)
                                                  𝐿𝑓           0                                                 (18)
is a solution for which 𝑓 defines the equilibrium state of the system. Let's perturbing the system out of
equilibrium by defining some set 𝑓     𝑓 . By choosing some iterative scheme (15), we get the sequence
                                                 𝑓 ,𝑓 ,…,𝑓                                                       (19)


                                                                                                                    16
where 𝑛 is considered as the discrete time.
   In the usual case, for a descending sequence|𝑓          𝑓 |, |𝑓   𝑓 |, … let’s define a value △, such that
if
                                   𝑓          𝑓       △ 𝑓 ,𝑓 ,…,𝑓                                      (20)
that value 𝑛 , which determines (20) will be considered (called) as the relaxation time 𝜏 𝑛 .
     So, in the case of researching complex networks, the relaxation time is the number of iterative steps
𝜏 of the corresponding algorithm, which are necessary to reach the initial equilibrium numerical values
of a certain characteristic after its perturbation with some predetermined accuracy △ – the condition of
convergence (usually △=10-4).
   When the recovery of the whole network is researched, the number of iteration steps required for
the value of each node to recover (the whole network is recovered) is called the relaxation time of the
network 𝜏       after the perturbation of a certain m-th node or 𝑚𝑎𝑥 𝜏    (k = 1,..,N, where N is a number
of nodes in network). And the number of iterations which are required to recover the particular node
whose numerical value was perturbated, will be called the individual relaxation time indicator of the
node 𝜏    (or simply 𝜏 ). That is, the first characteristic characterizes the node in terms of the stability
of the whole network after an external perturbation, the last one characterizes the individual stability of
the node after its perturbation.

4. Algorithm

For example, let's consider some complex network (a directed graph), for which PageRank [18] (𝑃𝑅 of
each node) can be found. PageRank is one of the algorithms for evaluating the importance and ranking
of web pages by hyperlinks, it was created by Sergey Brin and Larry Page in 1996 at Stanford
University. The principle of calculating the PageRank of nodes is based on the model of "random
wandering" of the user according to the following algorithm: he opens a random node (web page) from
which he goes by a randomly selected link. It then moves to another web page and activates a random
link again, and so on, constantly going from page to page, never coming back. Sometimes, when, with
some probability 1 𝛿, he gets bored with this wandering, or the page does not have links to other
pages, he goes to a random web page again – not by a link, but by manually typing in some URL. It is
assumed that the probability that a user wandering the network will go to a certain web page is its rank.
Obviously, a node's PageRank is higher the more other nodes link to it, and the more popular those
nodes are.
    Let's assume, there are n nodes 𝑑 , 𝑑 , . . , 𝑑 that refer to some node (web page 𝐴), and 𝐶 𝐴 is
the total number of links from a node to other nodes. Some fixed value 𝛿 is defined as the probability
that the user, viewing any web page from the set 𝐷, will go to the node 𝐴 by a link, and not by typing
its URL explicitly.
    Within the framework of the model, the probability of this user continuing to surf the web from 𝑁
web pages without using links, by manually entering an address (URL) from a random page will be
1 𝛿 (alternative to following links). The PageRank index for a node is considered as the probability
that the user will be at this node at some random moment in time:
                                          1       𝛿        𝑃𝑅 𝑑                                        (21)
                               𝑃𝑅 𝐴                    𝛿        .
                                              𝑁             𝐶 𝑑
   According to this formula, the node rank is calculated by a simple iterative algorithm.
   In a complex network of 𝑁 nodes, the 𝑃𝑅 value of each node 𝑃𝑅 , 𝑖 1,2, … , 𝑁 determines the
equilibrium state of this network. After selecting the value of 𝑃𝑅 for each node 𝑖 that deviates from the
value 𝑃𝑅     𝑃𝑅       0, the iterative procedure for finding 𝑃𝑅 is used. The relaxation time of node 𝑗 is
the number of steps 𝑚 for which
                                         𝑃𝑅           𝑃𝑅   △                                           (22)


                                                                                                          17
    Note that there are different variants of setting initial values (deviations) for nodes. They can be the
same, they can alternately set the deviations of individual nodes, leaving others at equilibrium values,
etc.
    In this work, as a characteristic whose numerical value is perturbated, a characteristic corresponding
to the iterative HITS (Hyperlink Induced Topic Search) rank algorithm was used. This algorithm was
proposed and developed in 1998 by J. Kleinberg [19] to select the best "authors" (primary sources that
refer to other documents) and "hub" (documents that refer to these primary sources) from an array of
documents. For each document j, its authority 𝑎 𝑗 (Authority) and hub ℎ 𝑗 are calculated according
to the formulas:
                                   𝑎 𝑗         ∑→ ℎ 𝑖 , ℎ 𝑗                   ∑→ 𝑎 𝑖 ,                  (23)

    Let introduce the relaxation time of the k-th node in the complex system – 𝜏 . At the initial stage, a
set of values of nodes 𝑠 ( 𝑠 ⃑ is a vector form) define as the equilibrium state. This state is determined
in accordance with the certain rule, for example, – the HITS or PageRank iterative algorithm, or any
other.
    Calculation of 𝑠 ⃑ in accordance with the selected rule (the iterative algorithm) can always be
rewritten in iterative form:
                                  𝑠⃑ 𝑛     1            𝑠⃑ 𝑛          𝐿𝑠⃑ 𝑛 , 𝑛       0,1, …            (24)
where the numbers of nodes correspond to the vector elements numbers of 𝑠⃑; 𝐿 is the operator of one
the iterative HITS or PageRank algorithms that are considered in this work; 𝑠⃑ 0 is the initial values of
nodes.
                                     𝑠⃑ 0           𝑙𝑖𝑚 𝑠⃑ 𝑛 , 𝑛            0,1, …                      (25)
                                                 →
and, respectively
                                                 𝐿𝑠⃑ 0                0.                                (26)
   The initial values of nodes 𝑠⃑ 0 is considered as the solution – 𝑠 ⃑ (these values are equilibrium).
Next, we propose to make some deviation of the score (called perturbation) of the m-th node, as it is
shown in the next example:
                                           𝑠⃑ 0            𝑠⃑         𝛼𝛿 𝑠⃑ ,                           (27)
where 𝛼 defines the deviation (perturbation) score of the m-th vector element; 𝛿 is the Kronecker
symbol.
   The described above means vector forms of the deviation from the equilibrium state one of the
elements of the vector 𝑠 ⃑. The perturbation of vector 𝑠 ⃑ due to the perturbation of one of its elements
leads a non-equilibrium state of the system.
   According to the equation (24), for n=0 it can be rewritten:
                                     𝑠         𝑠 0             ∑ 𝐿 𝑠 0 ,                                (28)
taking into account (27) it looks as follows:
               𝑠 1          𝑠 0      ∑ 𝐿 𝑠                     𝛼∑ 𝐿 𝛿             𝑠      𝑠     𝛼𝑞   ,   (29)

where the vector 𝑞           ∑ 𝐿 𝛿             𝑠 can be defined as
                       𝐿      𝐿          ...    𝐿               ...    𝐿       0               𝐿
                       𝐿      ...        ...    𝐿               ...    𝐿                       𝐿
                     ⎛                                                     ⎞⎛ 0 ⎞            ⎛   ⎞
           𝑞         ⎜ :                            :                      ⎟⎜ : ⎟            ⎜
                                                                                               :
                                                                                                 ⎟𝑠 .   (30)
                     ⎜𝐿                             :                      ⎟ ⎜𝑠 ⎟            ⎜ : ⎟
                        :                           :                          :               :
                     ⎝𝐿                         𝐿                      𝐿   ⎠⎝ 0 ⎠            ⎝𝐿 ⎠

    Taking into account the initial condition 𝑠 1 , 𝑠 𝑛 come back to the initial equilibrium state 𝒔 ⃑
after increasing a number of iterations n in accordance to (25).

                                                                                                           18
   The initial scores 𝑠 0     𝑠 , 𝑘 𝑚. After 𝑛           10 (more than 10 step) the values 𝑠 𝑛           10
decrease and convergence to the value μ.


Figure 2: Shema of values convergence the of the k‐th node

The value 𝜏      is the k-th node relaxation time after the perturbation of the m-th node, when the
following condition is satisfied
                                         𝑠 𝑛    𝜏         𝜇.                                          (31)

   In the other words, the maximum value of 𝜏        for ∀ 𝑚𝑎𝑥 𝜏           after the perturbation of the m-th
node is the network relaxation time [12] for the m-th node.
   And respectively, the relaxation time of perturbed node called the individual relaxation time of this
node.

5. Deceleration of algorithm

When calculating the relaxation time for many small networks, a problem arises, which is that the
relaxation time (a discrete number of algorithm steps) can be very small, that is, the effect of "small
distributive force" occurs. To solve this problem, it is proposed to apply an artificial deceleration of the
iterative algorithm. That is, the so-called decelerated iterative algorithms for HITS and PageRank can
be used to calculate the relaxation time. Cause the number of connections between the nodes is large,
the process of iterative recalculation of the values of the nodes after perturbation of some node and
achieving their initial values is fast. It means that we need to make only a few iterative steps to achieve
the initial equilibrium state of nodes after perturbation. In other words, a network relaxation is fast. In
order to decelerate the relaxation time, we propose to decelerate the HITS or PageRank algorithm,
respectively. After the deceleration, the process of convergence to the equilibrium initial values of nodes
after their perturbation will be slowed. So, the corresponding decelerated HITS or PageRank algorithm
is applied
                     𝑃𝑅 𝑗 → 𝑃𝑅 𝑗 → 𝑃𝑅 𝑗 →. . . . → 𝑃𝑅                           𝑖 ,                   (32)
                         𝐴𝑃𝑅 𝑖       𝑃𝑅 𝑖 ,         𝐴𝑃𝑅 𝑖            𝑃𝑅   𝑖 ,                         (33)

                                    𝑃𝑅      𝑖   ⎯ 𝛽𝑃𝑅          𝑖 ,                                    (34)
where 0    𝛽     1 is a deceleration factor, 𝐴 is the HITS or PageRank algorithm operator.


                                                                                                          19
6. Examples of network analysis

Research of complex networks shows that the values of the network characteristics of nodes after
external perturbation and the next recalculation of these characteristics, recover their initial equilibrium
values during some individual time for each node. Comparing the relaxation time with the network
characteristic, the value of which was perturbated, a vague dependence exists, which shows that the
relaxation time is a unique and incomparable numerical characteristic of network nodes [12].
    Within this work, the individual relaxation time 𝜏 is researched, i.e., the relaxation time of the
node, which was deviated from the equilibrium state. In this case, we will further use the notation 𝜏 ,
omitting the upper symbol in 𝜏 .
    Also in the work, the general relaxation time of the network for the 𝑚-th node is researched –
𝑚𝑎𝑥 𝜏        the highest value of 𝜏    among ∀ when the 𝑚-th node was disturbed.
   The approbation of the presented characteristics of the relaxation time and the individual relaxation
time was carried out on the example of clustering of random networks with clearly expressed clusters.
In particular, a randomly generated matrix with dimension 30×30 and 3 clusters and a matrix with
dimension 100×100 and 4 clusters were researched.
   For each network described by the corresponding matrix, after perturbing the value of the network
characteristic ℎ 𝑗 using decelerated HITS algorithm, the network relaxation time and the individual
node relaxation time were calculated. For each network, the deceleration factor 𝛽 of the HITS algorithm
and the convergence condition 𝜇 of the numerical values ℎ 𝑗 and 𝑎 𝑗 to the preperturbed values were
chosen individually.
   Also, in order to avoid large numerical values that sometimes arise after the using of decelerated
algorithms, all the resulting values of the relaxation time indicators were normalized in the interval
[0,1].
   Figures 3 and 4 show a visual representation of networks. In each of the figures, the nodes are colored
according to the modularity class (figures 3(a) and 4(a)) and by the network relaxation time and the
individual node relaxation time (for the presented examples, the normalized numerical values of these
characteristics coincide and are presented in the form of node labels – Figures 3(b) and 4(b)).
   The relaxation time of the network, which presented in Figure 1, was obtained as a result of
deceleration the HITS algorithm with a deceleration factor 𝛽 0.9 and convergence condition 𝜇
10 . For the network presented in figure 2 – 𝛽 0.9 та 𝜇 10 .


                        (a)                                                     (b)

Figure 3: An example of a network corresponding to a randomly generated 30x30 matrix with 3
clusters where groups of nodes are united (colored) by a) modularity class and b) relaxation time
(numeric values are represented as node labels)


                                                                                                         20
                        (a)                                                     (b)

Figure 4: An example of a network corresponding to a randomly generated 100x100 matrix with 4
clusters where groups of nodes are united (colored) by a) modularity class and b) relaxation time
(numeric values are represented as node labels)

It can be noticed that the groups of nodes united by the modularity class differ from the groups obtained
by the union of nodes by the relaxation time. This means that the network relaxation time and the
individual node relaxation time are unique network node characteristics and can be used to find cluster
centroids and cluster nodes based on these characteristics.

7. Conclusion

This paper researches the proposed network node characteristics – the network relaxation time and the
individual node relaxation time. The approbation of the presented characteristics of the relaxation time
and the individual relaxation time was carried out on the example of clustering of random networks
with clearly expressed clusters. It is shown, that these characteristics are unique numerical characteristic
of network nodes, and they can be used to find the centroids of clusters and combine nodes into groups
according to these characteristics, in other words, it can be used for complex networks clustering.
    The proposed and researched in this work numerical network and node characteristics can be used
during research and analysis of the network structure, making it possible to identify the most important
structural elements. Also, results of the research can be used when building personal search interfaces
for users of information and search systems. In turn, it will simplify the process of finding the necessary
information.
    The given algorithm, like the well-known LSA algorithm, is a cluster analysis algorithm that uses a
matrix representation of data. The complexity of the algorithm is determined by the complexity of the
PageRank, HITS or any other iterative algorithm, which is used to recalculate node weights.
    The novelty of the algorithm is defined by the approach for calculating the weight of nodes (the
values of relaxation time of network or relaxation time of nodes), the most significant of which can be
used as centres for determining clusters (centroids). If it is necessary to improve the quality of the
proposed approach, certain centroids can be passed as input to other well-known algorithms such as K-
means.


                                                                                                         21
8. References

[1] A. L. Barabâsi, H. Jeong, Z. Néda, E. Ravasz, A. Schubert, T. Vicsek, "Evolution of the social
     network of scientific collaborations." Physica A: Statistical mechanics and its applications 311.3-
     4 (2002): 590-614. doi: 10.1016/S0378-4371(02)00736-7
[2] M. E. J. Newman, "The structure and function of complex networks." SIAM review 45.2 (2003):
     167-256. doi:10.1137/S003614450342480
[3] S. H. Strogatz: "Exploring complex networks." nature 410.6825 (2001): 268-276. doi:
     10.1038/35065725
[4] A. A. Snarskii, D. V. Lande, Modeling of complex networks: tutorial, K.: Engineering, 2015.
     ISBN 978-966-2344-44-8
[5] R. Cohen, S. Havlin, Complex networks: structure, robustness and function. Cambridge university
     press, 2010. doi: 10.1017/CBO9780511780356
[6] S. Boccaletti, V. Latora, Y. Moreno, M. Chavez, D. U. Hwang, Complex networks: Structure and
     dynamics, Physics reports 424.4-5 (2006): 175–308. doi: 10.1016/j.physrep.2005.10.009
[7] R. Albert, A. L. Barabási, "Statistical mechanics of complex networks." Reviews of modern
     physics 74.1 (2002): 47. doi: 10.1103/RevModPhys.74.47
[8] T. Murata (Ed.), Handbook of social network technologies and applications. Springer Science &
     Business Media, 2010. doi: 10.1007/978-1-4419-7142-5_12
[9] A. Lancichinetti, S. Fortunato, "Community detection algorithms: a comparative analysis."
     Physical review E 80.5 (2009): 056117. doi: 0.1103/PhysRevE.80.056117
[10] V. Estivill-Castro, "Why so many clustering algorithms: a position paper." ACM SIGKDD
     explorations newsletter 4.1 (2002): 65-75.
[11] S. White, P. Smyth, A spectral clustering approach to finding communities in graphs, in:
     Proceedings of the 2005 SIAM international conference on data mining, 2005, pp. 274-285. doi:
     10.1137/1.9781611972757.25
[12] D. Lande, A. Snarskii, O. Dmytrenko, I. Subach, Relaxation time in complex network, in:
     Proceedings of the 15th International Conference on Availability, Reliability and Security, ARES
     '20, 99, 2020, pp. 1–6. doi: 10.1145/3407023.3409231
[13] D. V. Lande, O. O. Dmytrenko, A. A. Snarskii, "Research of network`s relaxation coefficient as
     characteristics of network`s nodes." Data Recording, Storage & Processing 21.1 (2019): 83-94.
     doi: 10.35681/1560-9189.2019.1.1.179714
[14] D. Lande, I. Subach, O. Puchkov, A. Soboliev, "A Clustering Method for Information
     Summarization and Modelling a Subject Domain." Information & Security 50.1 (2021): 79-86.
     doi: 10.11610/isij.5013
[15] A. Likas, N. Vlassis, J. J. Verbeek, "The global k-means clustering algorithm." Pattern recognition
     36.2 (2003): 451-461. doi: 10.1016/S0031-3203(02)00060-2
[16] Gephi: makes graph handly, 2022. URL: http://gephi.org
[17] K. Knox, "Le Châtelier's Principle." Journal of Chemical Education 62.10 (1985): 863.
[18] A.N. Langville, C.D. Meyer, "Google's PageRank and beyond." Google's PageRank and Beyond.
     Princeton university press, 2011. doi: 10.1515/9781400830329
[19] J. M. Kleinberg, "Authoritative sources in a hyperlinked environment." Journal of the ACM
     (JACM) 46.5 (1999): 604-632.


                                                                                                     22

</pre>