<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Optimization of the Database Structure of a Distributed Corporate Information System Node Using the Analytic Hierarchy Process</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Mykhailo</forename><surname>Dvoretskyi</surname></persName>
							<email>m.dvoretskiy@gmail.com</email>
							<affiliation key="aff0">
								<orgName type="institution">Petro Mohyla Black Sea National University</orgName>
								<address>
									<addrLine>68-Desantnykiv St 10</addrLine>
									<postCode>54003</postCode>
									<settlement>Mykolaiv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Svitlana</forename><surname>Dvoretska</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Petro Mohyla Black Sea National University</orgName>
								<address>
									<addrLine>68-Desantnykiv St 10</addrLine>
									<postCode>54003</postCode>
									<settlement>Mykolaiv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Hlib</forename><surname>Horban</surname></persName>
							<email>gleb.gorban@gmail.com</email>
							<affiliation key="aff0">
								<orgName type="institution">Petro Mohyla Black Sea National University</orgName>
								<address>
									<addrLine>68-Desantnykiv St 10</addrLine>
									<postCode>54003</postCode>
									<settlement>Mykolaiv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Yuriy</forename><surname>Nezdoliy</surname></persName>
							<email>nezdoliy.yura@gmail.com</email>
							<affiliation key="aff0">
								<orgName type="institution">Petro Mohyla Black Sea National University</orgName>
								<address>
									<addrLine>68-Desantnykiv St 10</addrLine>
									<postCode>54003</postCode>
									<settlement>Mykolaiv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff1">
								<orgName type="department">IT&amp;I-2020 Information Technology and Interactions</orgName>
								<orgName type="institution">KNU Taras Shevchenko</orgName>
								<address>
									<addrLine>December 02-03</addrLine>
									<postCode>2020</postCode>
									<settlement>Kyiv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Optimization of the Database Structure of a Distributed Corporate Information System Node Using the Analytic Hierarchy Process</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">46DFB97E50386826A5526DC11F4B5EDD</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T09:06+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Corporate information system, database management system, distributed database, SQLquery, data replication, multidimensional analysis, multicriteria problem, analytic hierarchy process 0000-0001-5913-6859 (M. Dvoretskyi)</term>
					<term>0000-0001-5199-9430 (S. Dvoretska)</term>
					<term>0000-0002-6512-3576 (H. Horban)</term>
					<term>0000-0002-6003-5585 (Y. Nezdoliy)</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The relevance of the problem of optimizing the database structure of a node in corporate information systems (CIS) is due to the widespread use of information technologies of multilevel, geographically dispersed computer systems, including those with distributed databases. One of the research aims is to determine and build a mathematical model of the optimality criteria for the structure of a remote node of the distributed corporate information system database. The statistics of user SQL-queries activity is taken into account and presented in the form of a multidimensional database. Criteria of the model effectiveness are formulated, which are independence from the central node of the database, the size of the local database, and an indicator of the level of need for data synchronization. The problem of multicriteria optimization is solved by using of hierarchy analysis method. Among the using method's features can be mentioned: different sets of optimality criteria for the evolving individuals; quantifying of the data representation marker value into 5 alternatives and automatically presetting the matrices of pairwise comparisons on the last level of the hierarchy. Solving the problem of multicriteria analysis and choosing the best alternative makes possible to determine the optimal level of the data representation marker. It makes possible to classify the attributes and tuples of DB relations according to their representation on the node of distributed CIS.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>In information systems development, there is a trend of transition from local to distributed databases (DDB). There are many database management systems (DBMS) that allow you to host, maintain and process data on various nodes of computer information systems (CIS). The main task of distributed database management systems is to provide access control to the data of many users and ensure the integrity and consistency of data <ref type="bibr" target="#b0">[1]</ref>. Within one company there is a need to automate different types of accounting <ref type="bibr" target="#b1">[2,</ref><ref type="bibr" target="#b2">3]</ref>. The attempt to automate all types of accounting leads to so-called "universal" corporate information systems <ref type="bibr" target="#b2">[3]</ref>, which create a single accounting environment and provide access to all necessary data for analysis and decision support. This approach has many disadvantages <ref type="bibr" target="#b1">[2,</ref><ref type="bibr" target="#b3">4]</ref>, which can be eliminated by using separate specialized solutions <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b4">5]</ref>. But this path leads to use of several databases (and perhaps DBMS) that require their synchronization <ref type="bibr" target="#b5">[6]</ref>. So, in addition to the main functions of the distributed DBMS: input, storage, processing and sharing data a specific important function is to ensure the collaboration of many users with distributed information <ref type="bibr" target="#b6">[7,</ref><ref type="bibr" target="#b7">8]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Topicality</head><p>The database structure optimizing is considered in <ref type="bibr" target="#b8">[9]</ref><ref type="bibr" target="#b9">[10]</ref><ref type="bibr" target="#b10">[11]</ref><ref type="bibr" target="#b11">[12]</ref><ref type="bibr" target="#b12">[13]</ref><ref type="bibr" target="#b13">[14]</ref><ref type="bibr" target="#b14">[15]</ref>, but insufficient attention is paid to improving the automated systems performance by optimizing the structure of the CIS distributed database on the basis of statistics of SQL-queries. Also, in <ref type="bibr" target="#b8">[9]</ref><ref type="bibr" target="#b9">[10]</ref><ref type="bibr" target="#b10">[11]</ref> while considering the design of automated control and data processing systems, building of data warehouses and multidimensional models, the use of a combined strategy of distributed data representation in CIS is not considered. In <ref type="bibr" target="#b11">[12]</ref><ref type="bibr" target="#b12">[13]</ref><ref type="bibr" target="#b13">[14]</ref><ref type="bibr" target="#b14">[15]</ref> the authors consider the issue of increasing the productivity of automated systems through the use of materialized views, database restructuring and relations denormalization. However, the optimality of the structure of a single distributed CIS node is ignored. A key factor influencing the reliability and accessibility of the database is the so-called localization of links <ref type="bibr" target="#b4">[5]</ref>. If the database is distributed so that the data hosted in a node is called exclusively by its user, it indicates a high level of link localization. If such data distribution is not possible and to execute the user's requests you need to access the information of other nodes, it indicates a low level of links localization.</p><p>A combined data distribution strategy is the best in terms of combining the benefits of strategies with and without duplication. But when using it, in addition to the task of synchronizing duplicate information, the task of designing the structure of the database is actual, depending which node data belonging to. In addition, the performance of the system will directly depend on the decision on the need for partial or complete duplication of data. Some tables of a relational database can be duplicated completely, and someafter projection and selection. That is, for optimized data representation on a remote node, it is necessary to use vertical and horizontal data fragmentation procedures.</p><p>Therefore, the issue of data distribution between nodes of distributed and territorially dispersed CIS is quite important. Therefore, the task of optimizing the structure of the database of a geographically remote node in corporate information systems is relevant.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Purpose of publication</head><p>The purpose of the research is to create a mathematical optimization model and subsequent choosing the best alternative to the marker of data representation of the remote node of distributed CIS. The research is related to only to relational databases. The relational data model is based on a simple and at the same time powerful mathematical apparatus, based mainly on theory of sets and mathematical logic <ref type="bibr" target="#b9">[10,</ref><ref type="bibr" target="#b15">16]</ref>. So, when building a mathematical model, it is considered appropriate to use the basic concepts of set theory.</p><p>The developed model should take into account the statistics of user requests to local and remote data. Using filtering by selected dimensions, the appropriate subsets of data can be obtained <ref type="bibr" target="#b16">[17]</ref>. For dimension elements, the term "data representation marker" was proposed, which determines the level of their need at the node of the distributed corporate information system (DCIS). From the value of this marker, aggregated on the database subset, corresponding to the remote node, will depend on the values of the criteria of model efficiency. It is independence from the central node of the database, the size of the local database and the level of data synchronization <ref type="bibr" target="#b17">[18,</ref><ref type="bibr" target="#b18">19]</ref>. Therefore, one of the tasks is the mathematical representation of the optimality criteria dependence on the value of the data representation marker.</p><p>The obtained multicriteria problem must be solved to determine the optimal level of data representation marker. It should be noted that the optimality criteria, the models of which were defined, are independent, monotonic and are represented on the set of real numbers in the interval [0; 1]. The classical methods of Pareto and Slater <ref type="bibr" target="#b19">[20,</ref><ref type="bibr" target="#b20">21]</ref> can give results only at the first stage of modeling. But when calculating the optimal level of the data representation marker they are ineffective due to the decrease in the level of one criteria while increasing others. The solution of the problem is also complicated by the fact that the solution space is defined on a set of real numbers, and therefore the set of solutions contains a large number of alternatives.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">The main part</head><p>Among the well-known relational algebra operations <ref type="bibr" target="#b9">[10]</ref>, due to the horizontal and vertical fragmentation of data on the distributed CIS node, the operations "projection" (hereinafter P) and "selecting" (hereinafter S) are considered here. Let tupbe a tuple of the relation R, tup[P] be a part of this tuple containing only the values of the attributes that are included in the subset P of the relation scheme Rshema (P ⊂ Rshema). Then the projection of R on P will be the relation, consisting of tuples of all values from the set P, which exists in the relation R, i.e. R[P] = {tup[P] | tup ∈ Rdata}. The scheme of the resulting set can be defined by the following set of attributes: R[P]shema = {A1 …, Am}, where Ai ∈ Rshema. The selection displays tuples, and the result is a relation containing a subset of all unique tuples of the relation R, for which a certain logical condition is true R[S] = {tup | tup ∈ Rdata ˄ F(tup, S) = true}, where S is a logical condition of SQL-query, and F(tup, S) is a function that reflects its fulfillment for the corresponding tuple. The scheme of the resulting set will equal to the scheme of the basic relation, i.e. R[S]shema = R shema. Within the SQL-query for data selecting, a number of relations can be involved, all of which are the result of sequential execution of select and projection operations to the base relation (database table). R'' = R' <ref type="bibr">[P]</ref>, where R' = R[S], i.e. To avoid the need for further replication some data that required on the DDB node can only be presented on the central node of the database and participate the query through the use of distributed queries. So the resulting relation R remote will only be a subset of R''union. Due to the fact that to represent the data on the remote node it is necessary to use elements of both vertical and horizontal data fragmentation (both projection and selecting), a subset of the base relation R that will describe the relation of the remote node can be represented as follows:</p><formula xml:id="formula_0">𝑅 𝑠ℎ𝑒𝑚𝑎 𝑟𝑒𝑚𝑜𝑡𝑒 = {A | A ∈ Rshema , Rprimary ⊂ 𝑅 𝑠ℎ𝑒𝑚𝑎 𝑟𝑒𝑚𝑜𝑡𝑒 , A ∈ Rprimary ˅ Fa(Node, A) = true} (2)</formula><p>To make a decision on the attribute representation on a node, the function Fa(Node, A) will be used. Besides, the set of attributes of the relation primary key in any case must be represented on the remote node. The set of tuples, in turn, will be determined by the formula:</p><formula xml:id="formula_1">𝑅 𝑑𝑎𝑡𝑎 𝑟𝑒𝑚𝑜𝑡𝑒 = { tup | tup ∈ Rdata , tupprimary ∈ R remote-dep data ˅ Ftup(Node, tup) = true} (3)</formula><p>As we can see, the tuple must be represented in the case of entering its primary key to the set of these relations, depending on the current. Otherwise, the need for data is solved using the evaluation function Ftup(Node, tup).</p><p>The model of presenting user queries should support the possibility of their further classification according to belonging to a particular workplace, location, user role and other criteria that can be added to the model. That is, the user query is defined as</p><formula xml:id="formula_2">Q = &lt;Workplace, User, Application, Rset'', Qset inner &gt;,<label>(4)</label></formula><p>where workstation = &lt;Type, Location&gt;; User = &lt;Role, Name&gt;; Rset'' = { R'' | {tup[P] | tup[P] ∈ R[P]data ˄ F(tup, S) = true} }the set of resulting relations obtained from the basic relations (tables) of the database by the corresponding queries; 𝑄 𝑠𝑒𝑡 𝑖𝑛𝑛𝑒𝑟is a set of nested queries of the main query Q. When planning the structure of the database of the remote node of distributed CIS, several factors will be involved -availability and speed of data obtaining, independence from the central DB node, the DB size, the level of data reliability, the need for further synchronization.</p><p>In the first step, the simulation begins with the presentation in the remote node the complete copy of the central node DB. In this case, the data availability and independence from the central node of the database has a maximum level. The speed of data obtaining compared to the central node is usually lower due to less powerful computing resources, but can be increased by performing selecting and projection operations and decreasing the number of data locks. The local database is large, therefore this criteria is not optimal. Also, all data requires synchronization with the central node, which is quite a resource-intensive operation.</p><p>The second step is to exclude all unnecessary data from the remote node. To solve this problem, on the basis of a relational model of user SQL-queries (4) was created a multidimensional database <ref type="bibr" target="#b21">[22]</ref> with following set of dimensions: &lt;DateTime, WorkplaceType, Location, UserRole, Application, R, A, tup&gt;. For the dimensions elements the term of data representation marker is proposed. It reflects the level of data representation necessity at the node of distributed CIS. For each element value of marker is taken from the following set: {"necessary", neutral", "not required"}. To dimension the "Location", the marking is performed automatically with the value "necessary" for the corresponding remote node and "not required" for all others.</p><p>When determining the value of the representation marker for the row of the fact table <ref type="bibr" target="#b21">[22]</ref>, the max function is used, which reflects the principle of absorption. Determining the value of the marker when performing the consolidation of rows of the fact table on the values of &lt;R, A, tup&gt; (for the table cell) can be performed by moving average method. But the question of the specific influence of each dimension remains unresolved. In addition, it should be taken into attention, that for some subsets of dimensions pessimistic scenario should work (data is needed, no matter what), and for someoptimistic (data should not be duplicated in any case).</p><p>So, we have a model where each dimension attribute has a value, a marker and a weight Adim ={Val, Mrk, vol}, where Mrk = {"obligatorily", "necessary", " neutral", "not required" , "forbidden"}, and volweight (ignored for the values of the marker "obligatorily" and "forbidden"). By converting a non-numeric linguistic variable of markers into a numeric value ("obligatorily" -"2", "necessary" -"1", "neutral" -"0", "not required" -"-1", "forbidden" -"-2"), the aggregation function was defined:</p><formula xml:id="formula_3">𝐴𝑔𝑔𝑟𝑒𝑔𝑎𝑡𝑒 𝑖=1 𝑛 𝑀𝑟𝑘 𝑖 = { 2, 𝑖𝑓 ∃ 𝑀𝑟𝑘 𝑖 = 2 −2, 𝑖𝑓 ∃ 𝑀𝑟𝑘 𝑖 = − 2 ˄ ∄ 𝑀𝑟𝑘 𝑖 = 2 ∑ (𝑀𝑟𝑘 𝑖 * 𝑛 𝑖=1 𝑉𝑜𝑙 𝑖 ∑ 𝑉𝑜𝑙 𝑖 𝑛 𝑖=1 )<label>(5)</label></formula><p>When deciding on the data representation on a remote node, we consolidate the rows of the fact table by the tuple &lt;R, A, tup&gt; and calculate the value of the marker for each of its elements by formula <ref type="bibr" target="#b4">(5)</ref>. And based on following the decision about data representation is made:</p><formula xml:id="formula_4">Repr (Node,R,A,tup)=(𝐴𝑔𝑔𝑟𝑒𝑔𝑎𝑡𝑒(𝑅, 𝐴, 𝑡𝑢𝑝) 𝑖=1 𝑛 𝑀𝑟𝑘 𝑖 &gt; 𝑘𝑜𝑒𝑓 𝑟𝑒𝑝𝑟 𝑛𝑜𝑑𝑒 ),<label>(6)</label></formula><p>where 𝑘𝑜𝑒𝑓 𝑟𝑒𝑝𝑟 𝑛𝑜𝑑𝑒the threshold coefficient of data representation in a certain node Node, that is defined at the range of [-1, 1]. The third step is to completely abandon the local database and place all the data on the central node (or, in some cases, in other nodes) of distributed CIS. In this case, we have the maximization of optimality for criteria of the need for data synchronization. That is because there is no duplication of data. The level of reliability is also maximum, and the size of the local database has a minimum value (no local database). But, at the same time, the availability of data and the access speed are minimized, and the work of CIS is highly dependent on the central node availability.</p><p>The value of some criteria improved compared to the second step, but at the same time the value of the others got worse. It is logical to assume that the optimal values of all DCIS DB structure criteria acquire between the 2nd and 3rd steps. To be able to perform the analysis and find the optimal distribution of data between the remote and central nodes, it is necessary to formalize the database structure quality criteria.</p><p>Criterion of independence from the central database node, and, accordingly, the availability and access speed directly depend on the representation of user SQL-query data on the node of distributed CIS. Using the model of the user SQL-query (4) and the resulting relation of the remote node (1, 2), we can determine the function of the request data availability:</p><formula xml:id="formula_5">Favailab(Node, Q) = { 1, 𝑖𝑓 ∀ 𝑅'' ∃ 𝑅 𝑠ℎ𝑒𝑚𝑎 𝑟𝑒𝑚𝑜𝑡𝑒 , 𝑅'' ∈ 𝑅 𝑠ℎ𝑒𝑚𝑎 𝑟𝑒𝑚𝑜𝑡𝑒 ˄ ∀ 𝑄 𝑖𝑛𝑛𝑒𝑟 𝐹 𝑎𝑣𝑎𝑖𝑙𝑎𝑏 (𝑄 𝑖𝑛𝑛𝑒𝑟 ) = 1 0, 𝑖𝑓 ∃ 𝑅'' ∄ 𝑅 𝑠ℎ𝑒𝑚𝑎 𝑟𝑒𝑚𝑜𝑡𝑒 , 𝑅'' ∈ 𝑅 𝑠ℎ𝑒𝑚𝑎 𝑟𝑒𝑚𝑜𝑡𝑒 ˅ ∃ 𝑄 𝑖𝑛𝑛𝑒𝑟 𝐹 𝑎𝑣𝑎𝑖𝑙𝑎𝑏 (𝑄 𝑖𝑛𝑛𝑒𝑟 ) = 0 . (<label>7</label></formula><formula xml:id="formula_6">)</formula><p>The aggregate value of the data availability level and independence from the central DB is defined as the average value </p><p>The set of user SQL-queries Qnode is a subset of all user queries Qall (Qnode ⊂ Qall), where for each element the function of belonging to a remote node is equal to one. Next, we consider the criterion of the local database size. This criterion affects both the performance of queries to the local database and the power of computing resources required to perform database and CIS administration operations. The database under the relational DBMS control (including distributed) is presented on disk space as a file or group of files <ref type="bibr" target="#b6">[7,</ref><ref type="bibr" target="#b7">8]</ref>. At the same time, any modern relational DBMS has mechanisms for obtaining information about how much disk space is used by each relation. In the vast majority of cases, the total value of the relations size equals the total value of the database files sizes.</p><p>But the information about size of R does not make it possible to determine the size of R'', which is the result of a sequence of selecting and projection operations, and is part of the set R remote . On the other hand, each DBMS provides information about the amount of disk space required to store the value of the attribute defined on a particular domain <ref type="bibr" target="#b6">[7,</ref><ref type="bibr" target="#b7">8]</ref>. The size of the tuple can be determined as</p><formula xml:id="formula_8">SizeR = 𝑆𝑖𝑧𝑒𝑅0 𝑖 𝐷𝐵𝑀𝑆 + p х ∑ 𝑆𝑖𝑧𝑒(𝑇𝑦𝑝𝑒 𝑖 ) 𝑛 𝑖=1 , (<label>9</label></formula><formula xml:id="formula_9">)</formula><p>where Ai ∈Di ∈Typei , pis the relation power, and SizeR0 i DBMSis the size of the i-th relation if it is empty.</p><p>However, the values obtained by ( <ref type="formula" target="#formula_8">9</ref>) cannot be used in calculations, because SizeR almost never equals to SizeR dbms . This may be due to the presence of additional data structures (indexes) related to the table, as well as other properties of data representation on the disk. Therefore, for each relation we determine the correction factor Next, when determining the size of R '' (subset of R) we use the following formula SizeR'' = KoefsizeR × p' × ∑ 𝑆𝑖𝑧𝑒(𝑇𝑦𝑝𝑒 𝑖 ) 𝑛′ 𝑖=1 <ref type="bibr" target="#b10">(11)</ref> where p is the power of R'', n'is the number of elements of the set R remote shema (number of attributes), and each attribute Ai ∈ Di ∈Typei.</p><p>But for each individual case of the subject area, the size (11) will take different values, and therefore its absolute value has no sense. Therefore, it was decided to present the final value of the criterion of the local database size in proportion to the size of the database in the CIS central node. The last of the above criteria is the need for data synchronization. First, we define a subset of the remote node data for which the data change operations are performed. To do this, define the model of the SQL-query that modify data Q modif = &lt;Виміри, R'' modif , type&gt;, where R'' modifis a subset on the relation R, which changes due to data modification operations, type = {insert, update, delete}operation type. R'' modif is defined as</p><formula xml:id="formula_10">R'' modif = {tup[P modif ] | tup[P modif ] ∈ R[P modif ]data ˄ F(tup, S) = true} (13)</formula><p>where Sis a logical condition, defined in SQL query, F(tup, S)is a function that reflects its fulfillment for the corresponding tuple, and P modifis a set of attributes that are modified.</p><p>Considering the set of queries to the database, the resulting subset 𝑅′′ 𝑛𝑜𝑑𝑒 𝑚𝑜𝑑𝑖𝑓 of the base relation R can be defined as the union of subsets R'' modif of all queries (13) received by the database from the remote node 𝑅′′ 𝑛𝑜𝑑𝑒 𝑚𝑜𝑑𝑖𝑓 = ⋃ 𝑅 𝑖 ′′ 𝑚𝑜𝑑𝑖𝑓 𝑛 𝑖=1</p><p>. Similarly, we define the set 𝑅′′ 𝑚𝑎𝑖𝑛 𝑚𝑜𝑑𝑖𝑓 , which will be modified on the central node or other nodes with future synchronization with the central node. The intersection of the sets 𝑅′′ 𝑛𝑜𝑑𝑒 𝑚𝑜𝑑𝑖𝑓 та 𝑅′′ 𝑚𝑎𝑖𝑛 𝑚𝑜𝑑𝑖𝑓 will determine the subset of the basic relation on which data conflitct can take place. This data require the use of more resource-intensive synchronization algorithms <ref type="bibr" target="#b5">[6]</ref>.</p><formula xml:id="formula_11">𝑅′′′ 𝑛𝑜𝑑𝑒 𝑚𝑜𝑑𝑖𝑓 = 𝑅′′ 𝑛𝑜𝑑𝑒 𝑚𝑜𝑑𝑖𝑓 ∩ 𝑅′′ 𝑚𝑎𝑖𝑛 𝑚𝑜𝑑𝑖𝑓 (<label>14</label></formula><formula xml:id="formula_12">)</formula><p>Based on <ref type="bibr" target="#b13">(14)</ref>, we add to the multidimensional DB (5) the dimension SyncroFlg = {true, false}, which will be determined on the tuple &lt;R, A, tup&gt;. Next, based on the aggregate value of the representation marker 𝐴𝑔𝑔𝑟𝑒𝑔𝑎𝑡𝑒 𝑖=1 𝑛 𝑀𝑟𝑘 𝑖 and the representation coefficient 𝑘𝑜𝑒𝑓 𝑟𝑒𝑝𝑟 𝑛𝑜𝑑𝑒 perform filtering of the multidimensional DB according to the decision on representation <ref type="bibr" target="#b5">(6)</ref> and SyncroFlg = true. Aggregate the results by &lt;R, A, tup&gt; and count the number of queries. The ratio of the obtained value to the total number of queries according to (6) will be an indicator of the level of data synchronization need</p><formula xml:id="formula_13">Fsynchro = 𝑝 𝑛𝑜𝑑𝑒 𝑚𝑜𝑑𝑖𝑓 𝑝 𝑛𝑜𝑑𝑒 , (<label>15</label></formula><formula xml:id="formula_14">)</formula><p>where 𝑝 𝑛𝑜𝑑𝑒 𝑚𝑜𝑑𝑖𝑓relation power, including queries of the remote node (according to the decision on representation), which includes the values of the tuples attributes (cells), which are also included in the set R'' modif node, and 𝑝 𝑛𝑜𝑑𝑒the cardinality of all queries, attributes and tuples of which are represented in the remote node.</p><p>A multicriteria problem, that was obtained, must be solved to determine the optimal level of data representation marker. Classical Pareto and Slater methods <ref type="bibr" target="#b19">[20,</ref><ref type="bibr" target="#b20">21]</ref> can give results only at the first stage. But when calculating the optimal level of data representation marker are ineffective due to the decrease in the level of some criteria of optimality while increasing others. The solution of the problem is also complicated by the fact that the solution space is determined on a set of real numbers, and therefore the set of solutions contains many alternatives. The analytic hierarchy process (AHP), which is a general methodology for solving a wide class of decision-making problems, allows to combine a relatively simple mathematical apparatus with knowledge and experience of the decision maker. The basis of this method is the representation of the decision process in the form of a multilevel hierarchy. This hierarchy should reflect all the components of the problem to be solved. The method is based on the principles of decomposition, pairwise comparisons and hierarchical composition. The main stages of the method are building a hierarchy, estimating the importance and priorities, checking the consistency of priorities and synthesis of the solution.</p><p>When compiling the hierarchy, following relationship between the levels elements was used: goal -stakeholders -criteria -alternatives. The value of the data representation marker (alternative) is a real number in the interval [-1, 1]. It leads to potential large number of alternatives at the 4th level of the hierarchy and therefore the matrices of pairwise comparisons by criteria can become very big. This complicates estimation process for the decision makers. It is proposed to simplify the task by reducing the number of alternatives to 5: "low" (L) -"-1", "lower them medium" (LM) -"-0.5", "medium" (M) -"0", "higher then medium" (HM) -"0.5", and "high" (H) -"1". The level of "decision makers" is represented by the elements "Owner", "Database Administrator", "Database Developer" and "CIS Operator". The obtained hierarchical model is shown in Fig. <ref type="figure" target="#fig_0">1</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Figure 1: Hierarchical model of the distributed CIS node structure optimization problem</head><p>Note that the list of criteria differs for the decision makers. Thus, all three criteria are important for the owner (the database size, the need for synchronization and independence from the central database), because they have influence on both the quality of CIS and the cost of equipment. For the database administrator, the criteria of database size and the need to organize data synchronization are important. In turn, for the database developer and CIS operator, the criterion of database size is not critical. It is clear that the relative weight of each of the criteria for different decision makers will also differ. Using the scale of relative importance of the criteria [23] and with the involvement of the decision maker (which at this stage is the owner) we build a matrix of pairwise comparisons for decision makers (Table <ref type="table" target="#tab_0">1</ref>). At the third level of the hierarchy, the corresponding matrices of pairwise comparisons are formed according to the criteria of optimality for each decision maker. Thus, for the decision maker "owner" we have the following matrix of pairwise comparisons of optimality criteria (Table <ref type="table" target="#tab_1">2</ref>).  To check the conflicts existence between matrix elements, the consistency index (CI) is calculated. For the data in Table 2 CI = 3.2%, which indicates the allowable level of consistency (in case the value is higher 10% there is a need to adjust the values of the matrix).</p><p>The next step in the classical analytic hierarchy process is to fill in the matrices of pairwise comparisons of alternatives separately for each criterion of optimality, similar to Table <ref type="table" target="#tab_1">1 and Table 2</ref>. In our case, the presence of mathematical models for calculating the values of the optimality criteria formulated in <ref type="bibr" target="#b7">(8,</ref><ref type="bibr" target="#b11">12,</ref><ref type="bibr" target="#b14">15)</ref> allows to perform the initial calculation of matrix data based on numerical values of the data representation marker for each alternative. Next, the matrix is submitted to the decision maker for approval. For example, the size of the local node database depending on one of the five alternatives can change as follows (Table <ref type="table" target="#tab_2">3</ref>). Based on the above data, the size of the database at low (min) and high (max) level of the data representation marker differs by 0.75 / 0.02 = 37.5 times. According to principles of pairwise comparisons and the axiom of homogeneity, we perform normalization of the values given in Table <ref type="table" target="#tab_2">3</ref>, using a slightly modified formula of natural normalization: </p><p>where 𝑊 𝑖 is the value of the optimality criterion for the i-th alternative, and k = 9. Normalized according to <ref type="bibr" target="#b16">(17)</ref> the database size values (Table <ref type="table" target="#tab_2">3</ref>) are presented in Table <ref type="table" target="#tab_3">4</ref>. After rounding to the integer according to mathematical rules, we build a matrix of pairwise comparisons of alternatives for the criterion of the local database size (Table <ref type="table" target="#tab_5">5</ref>).</p><p>According to (4) we perform the calculation of the matrix of alternatives relative weight by the criterion of the local database size. Also, we similarly calculate the priority vectors of alternatives according to the criteria of independence from the central node and the need for data synchronization. As a result, we obtain following vectors.  According to ( <ref type="formula">16</ref>) and ( <ref type="formula">18</ref>) we calculate the global vector of priorities for the decision maker "owner" <ref type="bibr" target="#b18">(19)</ref>. </p><p>By performing the appropriate calculations, we obtain global priority vectors for other decision makers </p><p>Using the obtained results of the global priorities vectors for decision makers <ref type="bibr" target="#b18">(19)</ref>, <ref type="bibr" target="#b19">(20)</ref> and the matrix of preferences of decision makers (Table <ref type="table" target="#tab_0">1</ref>), we calculate the vector of global priorities of alternatives (Table <ref type="table" target="#tab_6">6</ref>). The performed calculations allow to organize decision support when choosing the optimal level of the data representation marker among the proposed alternatives.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Summary and conclusion</head><p>Based on the relational data model, the concept of data slices of the set of database relations is formalized. Using the definition of selecting and projection operations, as well as taking into account the hierarchical structure of user queries, the model that describes their structure was built. This model includes analytical characteristics and allows to define for each base relation a subset (node relation), which will consist of elements that are part of the resulting sets of SQL-queries sequence.</p><p>The term of data representation marker for elements of analytical dimensions was proposed. Using the offered aggregation function the level of representation marker for each attribute and tuple of relation is calculated. To determine the optimal value of the representation marker, several optimality criteria are introduced and mathematical models are built for each of them. This allow to calculate their values depending on the limit level of the data representation marker at the node of distributed CIS. Solving a multi-criteria problem and finding the optimal level of data representation at a remote node can increase the level of data availability and efficiency of distributed CIS. Efficiency is defined as the ratio of result and resources, so taking into account the vector of relative weight of the optimality criteria of the model ( <ref type="formula">16</ref>), we calculate the efficiency as The comparison of the obtained results for the database of the KIS node of the subject area is given in Table <ref type="table" target="#tab_7">7</ref>. Thus, the results of the research allow to increase the efficiency of using the distributed CIS node of the subject area by 25% compared to the presentation of only critical data, and by 11% compared to the presentation of all necessary data of the central database, respectively. The research can be followed by presenting the obtained vector of global priorities in the form of fuzzy sets of one variable. Dephasing the obtained results can make numerical value of the optimal level of data representation at the RKIS node more accurate.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>1 )</head><label>1</label><figDesc>R'' = {tup[P] | tup[P] ∈ R[P]data ˄ F(tup, S) = true} (Considering the set of queries to the database, the resulting subset R''union of the base relation R can be defined as the union of subsets R' of all queries received by the database from a remote node R''union = ⋃ 𝑅 𝑖 ′′ 𝑛 𝑖=1 , or R''union = {tup[Punion] | tup[Punion] ∈ R[Punion]data ˄ F(tup, Sunion) = true}, where tup[Punion] = ⋃ 𝑡𝑢𝑝[𝑃 𝑖 ] 𝑛 𝑖=1 , and Sunion = ⋁ 𝑆 𝑖 𝑛 𝑖=1</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head></head><label></label><figDesc>Qnode = {Q | Favailab(Node, Q) = 1}, where Favailab(Node, Q) = { 1, 𝑖𝑓 ( ∃ 𝑅 ′′ ∈ 𝑅 𝑠𝑒𝑡 ′′ → 𝐴𝑔𝑔𝑟𝑒𝑔𝑎𝑡𝑒(𝑅 ′′ ) 𝑖=1 𝑛 𝑀𝑟𝑘 𝑖 &gt; −1) ˅ ( ∃ 𝑄 𝑖𝑛𝑛𝑒𝑟 ∈ 𝑄 𝑠𝑒𝑡 𝑖𝑛𝑛𝑒𝑟 → 𝐹 availab (Node, 𝑄 𝑖𝑛𝑛𝑒𝑟 ) = 1) 0, 𝑖𝑓 ( ∀ 𝑅 ′′ ∈ 𝑅 𝑠𝑒𝑡 ′′ → 𝐴𝑔𝑔𝑟𝑒𝑔𝑎𝑡𝑒(𝑅 ′′ ) 𝑖=1 𝑛 𝑀𝑟𝑘 𝑖 ≤ −1) ˄ ( ∀ 𝑄 𝑖𝑛𝑛𝑒𝑟 ∈ 𝑄 𝑠𝑒𝑡 𝑖𝑛𝑛𝑒𝑟 → 𝐹 availab (Node, 𝑄 𝑖𝑛𝑛𝑒𝑟 ) = 0)</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head></head><label></label><figDesc>− 𝑚𝑖𝑛 𝑖 𝑊 𝑖 ) (𝑚𝑎𝑥 𝑖 𝑊 𝑖 − 𝑚𝑖𝑛 𝑖 𝑊 𝑖 ) * (𝑘 − 1) + 1,</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0"><head></head><label></label><figDesc></figDesc><graphic coords="7,72.00,211.14,451.00,253.55" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>The matrix of pairwise comparisons for decision makers</figDesc><table><row><cell></cell><cell>Owner</cell><cell>DB Admin</cell><cell>DB developer</cell><cell>CIS operator</cell></row><row><cell>Owner</cell><cell>1</cell><cell>3</cell><cell>5</cell><cell>7</cell></row><row><cell>DB Admin</cell><cell>1/3</cell><cell>1</cell><cell>3</cell><cell>5</cell></row><row><cell>DB developer</cell><cell>1/5</cell><cell>1/3</cell><cell>1</cell><cell>3</cell></row><row><cell>CIS operator</cell><cell>1/7</cell><cell>1/5</cell><cell>1/3</cell><cell>1</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2</head><label>2</label><figDesc>The matrix of pairwise comparisons of optimality criteria for the decision maker "owner"</figDesc><table><row><cell></cell><cell>BD size</cell><cell>Independence level</cell><cell>Need for synchronization</cell></row><row><cell>BD size</cell><cell>1</cell><cell>1/7</cell><cell>1/3</cell></row><row><cell>Independence level</cell><cell>7</cell><cell>1</cell><cell>5</cell></row><row><cell>Need for synchronization</cell><cell>3</cell><cell>1/5</cell><cell>1</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3</head><label>3</label><figDesc>Dependence of the database size on the selected alternative</figDesc><table><row><cell>Marker level</cell><cell>Low</cell><cell>Lower then medium</cell><cell>Medium</cell><cell>Higher then medium</cell><cell>High</cell></row><row><cell>DB size</cell><cell>0,02</cell><cell>0,24</cell><cell>0,47</cell><cell>0,55</cell><cell>0,75</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 4</head><label>4</label><figDesc>Normalized criteria values of the database size</figDesc><table><row><cell>Marker level</cell><cell>Low</cell><cell>Lower then medium</cell><cell>Medium</cell><cell>Higher then medium</cell><cell>High</cell></row><row><cell>DB size (Normalized)</cell><cell>1</cell><cell>3,41</cell><cell>5,93</cell><cell>6,81</cell><cell>9</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_5"><head>Table 5</head><label>5</label><figDesc>Dependence of the database size on the selected alternative</figDesc><table><row><cell>DB size</cell><cell>Low</cell><cell cols="2">Lower then medium Medium</cell><cell>Higher then medium</cell><cell>High</cell></row><row><cell>Low</cell><cell>1</cell><cell>3</cell><cell>6</cell><cell>7</cell><cell>9</cell></row><row><cell>Lower then medium</cell><cell>0,33333</cell><cell>1</cell><cell>2</cell><cell>2</cell><cell>3</cell></row><row><cell>Medium</cell><cell>0,16667</cell><cell>0,5</cell><cell>1</cell><cell>1</cell><cell>2</cell></row><row><cell>Higher then medium</cell><cell>0,14286</cell><cell>0,5</cell><cell>1</cell><cell>1</cell><cell>1</cell></row><row><cell>High</cell><cell>0,11111</cell><cell>0,33333</cell><cell>0,5</cell><cell>1</cell><cell>1</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_6"><head>Table 6</head><label>6</label><figDesc>Calculation of global priorities of alternatives</figDesc><table><row><cell></cell><cell>Owner</cell><cell>CIS operator</cell><cell>DB Admin</cell><cell>DB developer</cell><cell>Global</cell></row><row><cell></cell><cell>0,563</cell><cell>0,055</cell><cell>0,263</cell><cell>0,117</cell><cell>priorities</cell></row><row><cell>Low</cell><cell>0,09</cell><cell>0,05</cell><cell>0,41</cell><cell>0,04</cell><cell>0,17</cell></row><row><cell>Lower then medium</cell><cell>0,09</cell><cell>0,08</cell><cell>0,15</cell><cell>0,08</cell><cell>0,1</cell></row><row><cell>Medium</cell><cell>0,15</cell><cell>0,16</cell><cell>0,1</cell><cell>0,17</cell><cell>0,14</cell></row><row><cell>Higher then medium</cell><cell>0,3</cell><cell>0,31</cell><cell>0,12</cell><cell>0,33</cell><cell>0,26</cell></row><row><cell>High</cell><cell>0,37</cell><cell>0,4</cell><cell>0,23</cell><cell>0,38</cell><cell>0,34</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_7"><head>Table 7</head><label>7</label><figDesc>Comparison of the database structure effectiveness in different strategies and levels of data representation at the node of the distributed CIS</figDesc><table><row><cell></cell><cell></cell><cell>Using central node DB</cell><cell>Presenting only critical data</cell><cell>Presenting of all necessary data</cell><cell>Full data duplication</cell><cell>Optimal level of data representation marker</cell></row><row><cell>Independence</cell><cell>0,73</cell><cell>0</cell><cell>0,35</cell><cell>0,97</cell><cell>1</cell><cell>0,97</cell></row><row><cell>DB size</cell><cell>0,081</cell><cell>0</cell><cell>0,02</cell><cell>0,75</cell><cell>1</cell><cell>0,63</cell></row><row><cell>Synchro need</cell><cell>0,188</cell><cell>0</cell><cell>0,15</cell><cell>0,07</cell><cell>1</cell><cell>0,08</cell></row><row><cell cols="2">DB node efficiency</cell><cell>-</cell><cell>8,5589</cell><cell>9,5892</cell><cell>2,7126</cell><cell>10,7257</cell></row><row><cell cols="2">Efficiency increase,%</cell><cell>-</cell><cell>25,32%</cell><cell>11,85%</cell><cell>295,41%</cell><cell>-</cell></row></table></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Acknowledgements</head><p>This research was partially supported by the state research projects: "Development of information and communication decision support technologies for strategic decision-making with multiple criteria and uncertainty for military-civilian use" (research project no. 0117U007144, financed by the Government of Ukraine); "Development of information-analytical system for military-civil application as a information protection factor in the conditions of multi-criteria, uncertainty and risk" (research project no. 0120U101222, financed by the Government of Ukraine).</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">Principles of Distributed Database Systems 3rd</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">Tamer</forename><surname>Özsu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Patrick</forename><surname>Valduriez</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2011">2011</date>
			<publisher>Springler</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Web-application of warehouse accounting in non-automated points of sale, Science works</title>
		<author>
			<persName><forename type="first">M</forename><surname>Dvoretskyi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Borovlova</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Series: Computer technologies</title>
				<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="volume">308</biblScope>
			<biblScope unit="page" from="45" to="52" />
		</imprint>
		<respStmt>
			<orgName>Petro Mohyla Black Sea National University</orgName>
		</respStmt>
	</monogr>
	<note>Rel.. in Ukrainian</note>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<ptr target="http://rarus.com.ua/torgovyy-i-skladskoy-uchet/1S_Predpriyatie_8_Upravlenie_torgovym_predpriyatiem_dlya_Ukrainy_/(inRussian" />
		<title level="m">Management of a trade enterprise for Ukraine</title>
				<imprint>
			<date>1C</date>
		</imprint>
	</monogr>
	<note>Enterprise 8</note>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<author>
			<persName><forename type="first">Natalia</forename><surname>Kozliuk</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Svetlana</forename><surname>Uhrymova</surname></persName>
		</author>
		<title level="m">Warehouse accounting at trade enterprises</title>
				<meeting><address><addrLine>Phenix</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
	<note>in Russian</note>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<author>
			<persName><forename type="first">H</forename><surname>Garcia-Molina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">D</forename><surname>Ullman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Widom</surname></persName>
		</author>
		<title level="m">Database Systems: The Complete Book 2nd Edition</title>
				<imprint>
			<publisher>Pearson</publisher>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<ptr target="http://stimul.kiev.ua/materialy.htm?a=avtomaticheskaya_sinkhronizatsiya_raspredelennykh_baz_dannykh_v_razdelennom_rezh" />
		<title level="m">Automatic synchronization of distributed databases in split mode</title>
				<imprint/>
	</monogr>
	<note>in Russian</note>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<author>
			<persName><forename type="first">Maksym</forename><surname>Kuznetsov</surname></persName>
		</author>
		<title level="m">Yhor Symdianov</title>
				<imprint>
			<publisher>BHV-Piterburg</publisher>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
	<note>MySQL 5. in Russian</note>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Microsoft SQL Server 2019: A Beginner&apos;s Guide</title>
		<author>
			<persName><forename type="first">Dusan</forename><surname>Petkovich</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Seventh Edition 7th Edition</title>
				<imprint>
			<publisher>Mc-Graw-Hill Education</publisher>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
	<note>Kindle Edition</note>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<title level="m" type="main">Organization of databases and knowledge bases</title>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">V</forename><surname>Pasichnyk</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">A</forename><surname>Reznichenko</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2006">2006</date>
			<publisher>Publishing group BHV</publisher>
		</imprint>
	</monogr>
	<note>in Ukrainian</note>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<title level="m" type="main">Fundamentals of database design: Textbook for students of higher education institutions</title>
		<author>
			<persName><forename type="middle">V</forename><surname>Ye</surname></persName>
		</author>
		<author>
			<persName><surname>Malakhov</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2006">2006</date>
			<pubPlace>Odesa</pubPlace>
		</imprint>
		<respStmt>
			<orgName>Science and technology</orgName>
		</respStmt>
	</monogr>
	<note>in Ukrainian</note>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">V</forename><surname>Pasichnyk</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">B</forename><surname>Shakhovska</surname></persName>
		</author>
		<title level="m">Data warehouses: a textbook</title>
				<meeting><address><addrLine>Lviv</addrLine></address></meeting>
		<imprint>
			<publisher>Magnoliya</publisher>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
	<note>in Ukrainian</note>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Finding Patterns in Query Distribution to Manage Materialized Views</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">B</forename><surname>Kunhurtsev</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yu</forename><forename type="middle">N</forename><surname>Vozovykov</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Odessa Polytechnic University</title>
				<meeting>the Odessa Polytechnic University<address><addrLine>Odesa</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2008">2008</date>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="135" to="140" />
		</imprint>
		<respStmt>
			<orgName>Publishing house Odessa National Polytechnic University</orgName>
		</respStmt>
	</monogr>
	<note>in Russian</note>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Relational database restructuring model by denormalizing the schema of relations</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">B</forename><surname>Kunhurtsev</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">L</forename><surname>Zynovatnaia</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Odessa Polytechnic University</title>
				<meeting>the Odessa Polytechnic University<address><addrLine>Odessa</addrLine></address></meeting>
		<imprint>
			<publisher>Publishing house</publisher>
			<date type="published" when="2006">2006</date>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="105" to="111" />
		</imprint>
		<respStmt>
			<orgName>Odessa National Polytechnic University</orgName>
		</respStmt>
	</monogr>
	<note>in Russian</note>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Methods and tools for designing information systems and distributed databases</title>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">A</forename><surname>Filatov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">V</forename><surname>Semenets</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Bulletin of Kherson National Technical University</title>
		<imprint>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="203" to="207" />
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
	<note>in Russian</note>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Zemlianskaia, Optimization of distributed corporate information networks using genetic algorithms and object modeling</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">V</forename><surname>Lazdyn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Yu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Scientific works DonNTU</title>
		<imprint>
			<biblScope unit="volume">147</biblScope>
			<biblScope unit="page" from="83" to="95" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
	<note>in Ukrainian</note>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<title level="m" type="main">Designing Relational Database Systems (Dv-Mps Designing)</title>
		<author>
			<persName><forename type="first">Rebecca</forename><forename type="middle">M</forename><surname>Riordan</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2001">2001</date>
			<publisher>Microsoft Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Information technology for determining useful data while optimizing the structure and minimizing the volume of the distributed database node</title>
		<author>
			<persName><forename type="first">M</forename><surname>Dvoretskyi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Dvoretska</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="s">Bulletin of Cherkasy</title>
		<imprint>
			<biblScope unit="page" from="26" to="35" />
			<date type="published" when="2019">4/2019. 2019</date>
		</imprint>
		<respStmt>
			<orgName>State Technological University</orgName>
		</respStmt>
	</monogr>
	<note>in Ukrainian</note>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Data Utility Assessment while Optimizing the Structure and Minimizing the Volume of a Distributed Database Node</title>
		<author>
			<persName><forename type="first">M</forename><surname>Dvoretskyi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Dvoretska</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Nezdoliy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Borovlova</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 1st International Workshop on Information-Communication Technologies &amp; Embedded Systems (ICTES 2019)</title>
				<meeting>the 1st International Workshop on Information-Communication Technologies &amp; Embedded Systems (ICTES 2019)<address><addrLine>Mykolaiv</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="128" to="137" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Query parsing in order to optimize distributed DB structure</title>
		<author>
			<persName><forename type="first">M</forename><surname>Fisun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Dvoretskyi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Shved</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Davydenko</surname></persName>
		</author>
		<idno type="DOI">10.1109/IDAACS.2017.8095071</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 9th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS)</title>
				<meeting>the 9th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS)<address><addrLine>Bucharest</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="172" to="178" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Searching for Pareto-Optimal Solutions</title>
		<author>
			<persName><forename type="first">I</forename><surname>Kovalenko</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Davydenko</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Shved</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-030-33695-0_10</idno>
	</analytic>
	<monogr>
		<title level="m">Advances in Intelligent Systems and Computing IV. CCSIT 2019. Advances in Intelligent Systems and Computing</title>
				<editor>
			<persName><forename type="first">N</forename><surname>Shakhovska</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Medykovskyy</surname></persName>
		</editor>
		<meeting><address><addrLine>Cham</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2020">2020</date>
			<biblScope unit="volume">1080</biblScope>
			<biblScope unit="page" from="121" to="138" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Method of Detection the Consistent Subgroups of Expert Assessments in a Group Based on Measures of Dissimilarity in Evidence Theory</title>
		<author>
			<persName><forename type="first">A</forename><surname>Shved</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Kovalenko</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Davydenko</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-030-33695-0_4</idno>
	</analytic>
	<monogr>
		<title level="m">Advances in Intelligent Systems and Computing IV. CCSIT 2019. Advances in Intelligent Systems and Computing</title>
				<editor>
			<persName><forename type="first">N</forename><surname>Shakhovska</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Medykovskyy</surname></persName>
		</editor>
		<meeting><address><addrLine>Cham</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2020">2020</date>
			<biblScope unit="volume">1080</biblScope>
			<biblScope unit="page" from="36" to="53" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><surname>Barsegyan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">S</forename><surname>Kupriyanov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">V</forename><surname>Stepanenko</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">I</forename><surname>Holod</surname></persName>
		</author>
		<title level="m">Methods and models of data analysis: OLAP and Data Mining</title>
				<meeting><address><addrLine>BHV-Petersburg</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
	<note>in Russian</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
