<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Development of Algorithm for Improving Accuracy of Probability Coefficient of Threat Implementation in Personal Data Information</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Sergey</forename><surname>Verevkin</surname></persName>
							<email>vrjovkin@rambler.ru</email>
							<affiliation key="aff0">
								<orgName type="institution">Russian State Hydrometeorological University</orgName>
								<address>
									<addrLine>Voronezhskaya st. 79, St. Petersburg</addrLine>
									<postCode>192007</postCode>
									<country key="RU">Russia</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Ksenia</forename><surname>Naumova</surname></persName>
							<email>ksenia.naumovaks@gmail.com</email>
							<affiliation key="aff0">
								<orgName type="institution">Russian State Hydrometeorological University</orgName>
								<address>
									<addrLine>Voronezhskaya st. 79, St. Petersburg</addrLine>
									<postCode>192007</postCode>
									<country key="RU">Russia</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Tatiana</forename><surname>Tatarnikova</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Russian State Hydrometeorological University</orgName>
								<address>
									<addrLine>Voronezhskaya st. 79, St. Petersburg</addrLine>
									<postCode>192007</postCode>
									<country key="RU">Russia</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Pavel</forename><surname>Bogdanov</surname></persName>
							<email>45bogdanov@gmail.com</email>
							<affiliation key="aff0">
								<orgName type="institution">Russian State Hydrometeorological University</orgName>
								<address>
									<addrLine>Voronezhskaya st. 79, St. Petersburg</addrLine>
									<postCode>192007</postCode>
									<country key="RU">Russia</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Ekaterina</forename><surname>Kraeva</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Russian State Hydrometeorological University</orgName>
								<address>
									<addrLine>Voronezhskaya st. 79, St. Petersburg</addrLine>
									<postCode>192007</postCode>
									<country key="RU">Russia</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Development of Algorithm for Improving Accuracy of Probability Coefficient of Threat Implementation in Personal Data Information</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">BCB9A977005E0BF46AC4668715BC5829</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T00:35+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>OSINT, corporate networks, security analysis, information security 0000-0002-5255-940X (A. 1)</term>
					<term>0000-0001-6972-5390 (A. 2)</term>
					<term>0000-0002-6419-0072 (A. 3)</term>
					<term>0000-0002-7533-7316 (A. 4)</term>
					<term>0000-0002-6938-1775 (A. 5)</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The continuing increase of the number of information systems inevitably entails the need to ensure cyber security of the information contained in them, in view of the need to provide both information containing commercial secrets and various types of information processed, including by State information systems. Considering the process of ensuring cyber security of the information, in the context of the need to comply with the requirements of legislative and regulatory acts, we should take note of the inevitability of creating a model of an illegal intruder and model of threats to the security of the protected information system, to determine the relevance of the vulnerabilities indicated in them. This article review the process of creating an algorithm that determines the existing methodology for determining actual threats to data security during their processing in information data systems, which is used at the step of building a model of security threats. The developed algorithm is relevant in view of its application to the current methodology, which serves as the main document in determining the requirements for the information security system. It is proposed to use a four-stage algorithm for collecting reconnaissance information from public sources (OSINT) for assessing risks and determining the state of security of an information system. The algorithm contains the steps of collecting information from freely distributed databases of supervisory authorities, external network resources of the organization, identifying potential an illegal intruderamong the employees of the organization, as well as checking the organization's internal network resources. The developed algorithm is recurrent and allows organizing a recursive update of the input data collected as a result of its first execution, thereby providing data for a more detailed analysis when performing subsequent cycles. The information obtained as a result of OSINT analyze and provide to the managerial staff of the organization or the owner of the information system for further use in determining the appropriate coefficients of the current methodology.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Today, a matter of necessity of the need to ensure the information security of the organization is increasingly arose not only by large corporations and government entities, but also by small private organizations. The main reason for it is the increase in the cost of processed information in the networks of organizations that has become the most desirable resource of cybercriminals.</p><p>With the need to protect the information being processed, it is necessary to properly assess the current state of security of the information system in accordance with the requirements of current federal laws and other governing documents of supervisory bodies.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Justification of the existing problem</head><p>In carrying out the task of building an information protection system in organizations closely related to the processing of client databases that include personal data, an important point is the need to determine the current personal data threats when processing them in ISPD in accordance with the current FSTEC methodology <ref type="bibr" target="#b0">[1]</ref>.</p><p>As a result of the actions described in this methodology, employees of the organization are faced with the task of determining numerical coefficients 𝑌 1 and 𝑌 2 , which indicate the state of the initial security and the probability of the threat implementation.</p><p>Unlike the first coefficient determined by the table in the methodology, the value of the 𝑌 2 coefficient should be determined by using the proposed verbal estimates corresponding to small, medium, high and unlikely.</p><p>It is worth noting the difficulty of conducting such assessments in the absence of any actual data on the current state of the organization's information systems and not to mention a further similar process for assessing the feasibility of a threat, which requires an impartial assessment of the possibility of implementing security incidents, including by the organization's staff.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Algorithm development</head><p>As a way to solve the problem of correctly determining the values of the 𝑌 2 , coefficient, we will build an algorithm that allows using open source software used for intelligence based on open information sources (OSINT) to search for existing threats.</p><p>Among the methods for conducting OSINT, the four-stage cyclic method for conducting data collection has gained the greatest popularity:</p><p>1. Definition of information search criteria 2. Retrieving searched data from open sources 3. Analysis of the received information 4. Structuring the obtained information in order to use it for further data search.</p><p>Therefore, the accuracy of the research conducted depends on the number of OSINT cycles, which allows you to determine the depth of analysis of the collected information depending on its type, secrecy and the wishes of the organization's management <ref type="bibr" target="#b1">[2]</ref>.</p><p>An important feature of OSINT is the full analysis of the organization's information and personnel resources. For this reason, we highlight three main steps of the algorithm being developed and consider the most successful methods of their implementation: 1) Analysis of public pages of the organization It includes the collection and analysis of information about the organization posted in such sources of information, advertisements, organization websites, resources, tax information and other sources of information that allow you to obtain initial data on the activities of the organization: organizational structure, position, etc.</p><p>There are many software solutions, but as an example, we will consider the Maltego, which provides a convenient interface for visualizing data found and connections between it. Despite the fact that Maltego has a free version, the most effective are paid versions of the program that allow expanding its capabilities by connecting additional third-party libraries, the work of which is implemented by connecting using API keys. An example of analysis and construction of connections of collected data of the Russian State Hydrometeorological University (RSHU) website (rshu.ru) is shown in Figure <ref type="figure" target="#fig_0">1</ref>. As a result of the analysis, it becomes possible to obtain the following information: contact information of the owners of network resources, hosting on the basis of which the organization's website is located, personal data of employees whose numbers are indicated on the website, information about the current and completed judicial proceedings of the organization and information about the dates of important events, such as: company management's birthdays, dates of corporate events and many other information that will further facilitate the receipt of additional information <ref type="bibr" target="#b2">[3]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>2) Analysis of employee information</head><p>In this step, you search for existing employees in your organization using the data you have received in the previous step. The main goal is to collect information about the largest number of employees in the organization using the previously obtained data. As a result of the analysis, it becomes possible to determine most of the employees of the organization with high accuracy through the analysis of social networks of these employees, their personal e-mails, phone numbers, home addresses and relationships between the employees.</p><p>We will use the OSINT Framework, which combines a huge number of solutions in the field of searching for information from open sources. The Maltego that was discussed earlier can also be used for these purposes, but most of its functionality for analyzing social networks used in Russia requires purchase of paid packages. The main advantage of the OSINT Framework is the ability to get the user the access to the maximum number of information from free sources, with additional indication of paid resources. Figure <ref type="figure">2</ref> shows the OSINT Framework options for Social Network and Mail Address Analysis.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Figure 2: OSINT Framework solutions for finding information on popular social networks and e-mail services</head><p>An important task of this step is to identify dissatisfied employees who openly express dissatisfaction with colleagues and the organization as a whole. Often, it is a dissatisfied employee who is a potential victim of social engineers who provoke the employee to help achieve their own goals.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>3) Analysis of the organization's network</head><p>The last but no least important step is to analyze current state of security of corporate networks of the organization. In this step, it is important to analyze the network infrastructure used by the system and application software, the security tools used, protocols and other information that allows the abuser to plan attacks for specific network components.</p><p>The task of analyzing data about an organization's network can be solved in many different ways, the application of which depends on the type of network and the devices used in it. One of the most famous tools is Nmap. Using Nmap to the address found using Maltego IP, we can get information about the system software used, which is used on the hosting network resource. Figure <ref type="figure" target="#fig_1">3</ref> shows the result of the website rshu.ru hosting operating system definition. The main criterion for choosing an implementation tool is to locate an attacker in relation to the network of the organization. If located in a segment of the corporate network, the use of sniffers to analyze network traffic for the use of vulnerable network protocols is needed. At the same time, for the purpose of further penetration, it is necessary to use vulnerability scanners and Nmap analogues to search for vulnerabilities of border nodes of the network or to obtain information about the protection used in case of remote scanning of devices at the border of the investigated network in case of firewalls <ref type="bibr" target="#b3">[4]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Conclusion</head><p>The result of the work is an algorithm, contributory factor to the process of determining the verbal coefficients of the probability of the implementation of security threats for information systems, through the use of the final report generated from the results of external OSINT and analysis of the organization's network. It should be pointed out the possibility of obtaining new data on threats existing in the information system, the identification of which in the case of multiple cyclical repetition of the algorithm contributes to the addition of the model of security threats and information created at the previous stages. Also should be pointed out that the developed algorithm can also be used when reevaluating the security of an information system to identify new sources of threats and determine their relevance.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Result of data collection from RSHU website (rshu.ru)</figDesc><graphic coords="3,123.20,122.60,348.49,154.75" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Definition of the website rshu.ru hosting operating system</figDesc><graphic coords="4,143.75,554.28,307.23,151.50" type="bitmap" /></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Methodology for determining current threats to personal data security during their processing in personal data information systems</title>
	</analytic>
	<monogr>
		<title level="j">FSTEC</title>
		<imprint>
			<biblScope unit="volume">14</biblScope>
			<biblScope unit="page">2008</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<ptr target="http://www.pentest-standard.org/index.php/Main_Page" />
		<title level="m">Penetration Testing Execution Standard (PTES)</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<ptr target="https://docs.maltego.com/support/solutions/articles/15000008703-client-requirements#network-requirements-0-3" />
		<title level="m">Maltego Desktop Application Guide</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">Estimation of probabilistic-temporal characteristics of network nodes with traffic differentiation//Informatsionno-Upravliaiushchie Sistemy</title>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">M</forename><surname>Tatarnikova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">V</forename><surname>Volskiy</surname></persName>
		</author>
		<idno type="DOI">10.15217/issn1684-8853.2018.3.5</idno>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="volume">94</biblScope>
			<biblScope unit="page" from="54" to="60" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">Statistical methods for studying network traffic //Informatsionno-Upravliaiushchie Sistemy</title>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">M</forename><surname>Tatarnikova</surname></persName>
		</author>
		<idno type="DOI">10.31799/1684-8853-2018-5-35-43</idno>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="volume">96</biblScope>
			<biblScope unit="page" from="35" to="43" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Fault Tolerance of Clusters Configurations with Direct Connection of Storage Devices // Automatic Control and</title>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">A</forename><surname>Bogatyrev</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Computer Sciences</title>
		<imprint>
			<biblScope unit="volume">45</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="330" to="337" />
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Multipath Redundant Transmission with Packet Segmentation</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">V</forename><surname>Bogatyrev</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">A</forename><surname>Bogatyrev</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">V</forename><surname>Bogatyrev</surname></persName>
		</author>
		<idno type="DOI">10.1109/WECONF.2019.8840643</idno>
	</analytic>
	<monogr>
		<title level="m">Wave Electronics and its Application in Information and Telecommunication Systems (WECONF)</title>
				<imprint>
			<date type="published" when="2019">2019. 2019</date>
			<biblScope unit="page">8840647</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Evaluation of a Cyber-Physical Computing System with Migration of Virtual Machines during Continuous Computing</title>
		<author>
			<persName><forename type="first">V</forename><surname>Bogatyrev</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Derkach</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Computers</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="page">42</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<title level="m" type="main">IoT system for detecting dangerous substances by smell// Informatsionno-Upravliaiushchie Sistemy</title>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">M</forename><surname>Tatarnikova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">N</forename><surname>Dzubenko</surname></persName>
		</author>
		<idno type="DOI">10.15217/issn1684-8853.2018.2.84</idno>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="volume">93</biblScope>
			<biblScope unit="page" from="84" to="90" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
