=Paper=
{{Paper
|id=Vol-2533/paper14
|storemode=property
|title=The Testing of Pseudorandom Sequences using Multidimensional Statistics
|pdfUrl=https://ceur-ws.org/Vol-2533/paper14.pdf
|volume=Vol-2533
|authors=Svitlana Popereshnyak,Georgi P. Dimitrov
|dblpUrl=https://dblp.org/rec/conf/dcsmart/PopereshnyakD19
}}
==The Testing of Pseudorandom Sequences using Multidimensional Statistics==
The Testing of Pseudorandom Sequences using Multidimensional Statistics Svitlana Popereshnyak 1[0000-0002-0531-9809] and Georgi P. Dimitrov 2[0000-0001-5064-3168] 1 Taras Shevchenko National University of Kyiv, 24, Bohdana Havrylyshyna str., Kyiv, 04116, Ukraine spopereshnyak@gmail.com 2 University of Library Studies and Information Technologies, 119, Tsarigradsko Shose, Sofia, Bulgaria geo.p.dimitrov@gmail.com Abstract. The available approaches to testing pseudorandom sequences show low flexibility and versatility in the means of finding hidden patterns in the da- ta. To solve this problem, it is suggested to use algorithms based on multidi- mensional statistics. The paper proposed a new approach for testing pseudoran- dom sequences, obtained an explicit form of the joint distribution of numbers of 2-chains and numbers of 3-chains of various options random bit sequence of a given small length. Examples, tables, diagrams that can be used to test for ran- domness of the location of zeros and ones in the bit section are presented. In fu- ture as a result an information system will be created that will allow analyzing the pseudorandom sequence of a small length and choosing a quality pseu- dorandom sequence for use in a particular subject area. Keywords: Algorithms, multidimensional Statistics, Random Sequence, s- chains, Cryptography, Pseudorandom Sequence, Statistical Testing. 1 Introduction Random sequences have found the widest application from the gaming computer industry to mathematical modeling and cryptology. We list some areas of their usage: modeling, cryptography and information securi- ty, decision making in automated expert systems, optimization of functional depend- encies, fun and games. There are various approaches to the formal definition of the term “randomness” based on the concepts of computability and algorithmic complexity [1-2]. By implementing some algorithm, software generators produce numbers (although not obvious) depending on the set of previous values, so the received numerical se- quences are not truly random and are called pseudo-random sequences (PRS). At the moment, more than a thousand software PRS generators are known, which differ in algorithms and values of parameters. Statistical properties are significantly different from the number sequences that are generated by them. Copyright © 2019 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0) 2019 DCSMart Workshop. The presented and not presented results allow us to characterize the state of modern technologies of designing the PRS (focusing on the most progressive of them by the following basic provisions [3-6]. 2 Problem Statement Before responsible using in mathematical modeling and cryptology, PRS should be tested. Unfortunately, for many PRS tests, there are some limitations: • checked out only one of the probable ones properties that are characterize PRS; • not fix family alternatives; • do not have theoretical ones ratings power. • do not give a correct an estimate of chance sequences provided a little sam- ple. Problems small and large samples refer to the main problems that arise in practical application methods analysis data. Let's be use the next classification samples by number [2], based on requirements presented in the program criteria: • very small sample - from 5 to 12, • small sample - from 13 to 40, • medium sample - from 41 to 100, • large sample - from 101 and more. The minimum size of the sample limits not so much the algorithm of calculating the criterion, but the distribution of its statistics. For a row algorithms with too much small ones numbers sample normal approximation distribution of statistics criterion will be under question. During the research, the localization of the local sections of the bit sequence was conducted to detect the dependencies in the location of its elements by using the exact distributions of the corresponding statistics. In the work an explicit form of the joint distribution of the numbers of 2-chains and numbers of 3-chains of various variants in a random sequence was obtained. This joint distribution allows more accurate com- parison of the use of one-dimensional statistics, to analyze the bit sequence small length by chance. 3 Joint Distribution of number of 2-chains and number of 3- chains of a provided type in binary sequence Consider a sequence of random variables 𝛾1 , 𝛾2 , . . . , 𝛾𝑛 , (1) where 𝛾𝑖 = {0, 1}, i= 1, 2, . . . , 𝑛, 𝑛 > 0. Subsequences 𝛾𝑗 , 𝛾𝑗+1 , . . . , 𝛾𝑗+𝑠−1 , sequences (1) are called s-chains, 𝑗 = 1, 2, . . . , 𝑛 − 𝑠 + 1, 𝑠 = 1, 2, . . . , 𝑛. Denote 𝜂(𝑡1 𝑡2 . .. 𝑡𝑠 ) the number of s-chains in the sequence (1) that coincide with 𝑡1 , 𝑡2 , . . . , 𝑡𝑠 , where 𝑡𝑖 = {0, 1}, 𝑖 = 1, 2, . . . , 𝑠. Theorem. Let sequence (1) consist of n, 𝑛 > 0 independent identically distributed random variables; Ρ{𝛾𝑖 = 1} = 𝑝, Ρ{𝛾𝑖 = 0} = 𝑞, p + q = 1, i = 1, 2, . . . , n and 𝑘1 , 𝑘2 , 𝑘3 , 𝑡, – integer numbers such that 𝑘1 ≥ 0, 𝑘2 ≥ 0, 𝑘3 ≥ 0, 𝑚1 + 𝑚0 = 𝑛 ≥ 3, 𝑡 𝜖 {0, 1}, 𝑡 ∗ = 1 − 𝑡. Then Ρ{𝜂(𝑡 𝑡) = 𝑘1 , 𝜂( 𝑡 ∗ 𝑡 ∗ 𝑡 ∗ ) = 𝑘2 , 𝜂( 𝑡 ∗ 𝑡 𝑡 ∗ ) = 𝑘3 } = ∑𝑛𝑚1 =0 𝑝𝑚1 𝑞 𝑚0 × 𝑘 −𝑖 𝛿 𝑚 −𝑘 −𝑘3 −1 {𝐶𝑚3𝑡−𝑘1−2 𝐶𝑚1𝑡−𝑘1−1 𝐶𝑘1𝑡+1 1 Ζ(𝑚𝑡 ∗ − 𝑚𝑡 + 𝑘1 + 1; 𝑚𝑡 − 𝑘1 − 𝛿1 − 1) + 𝑘3 𝛿2 𝐶𝑚𝑡−𝑘1 𝐶𝑚𝑡−𝑘1+1 Ζ(𝑘1 ; 𝑚𝑡 − 𝑘1 − 𝑘3 )Ζ(𝑚𝑡 ∗ − 𝑚𝑡 + 𝑘1 − 1; 𝑚𝑡 − 𝑘1 − 𝛿2 + 1) + 𝑘 𝛿 𝑚 −𝑘 −𝑘 −1 2𝐶𝑚3𝑡 −𝑘1−1 𝐶𝑚3𝑡−𝑘1 𝐶𝑘1𝑡 1 3 Ζ(𝑚𝑡 ∗ − 𝑚𝑡 + 𝑘1 ; 𝑚𝑡 − 𝑘1 − 𝛿3 ) + 𝜒(𝑚𝑡 − 𝑘1 − 1, 𝑘2 , 𝑘3 , 𝑚𝑡 ∗ )}, (2) 1, if 𝑎1 = 𝑎2 = 𝑎3 = 𝑎4 = 0, where 𝑚𝑡 + 𝑚𝑡 ∗ = 𝑛, 𝜒(𝑎1 , 𝑎2 , 𝑎3 , 𝑎4 ) = { , 𝛿𝑖 = 𝑘2 − 0, elsewhere 𝑚𝑡 ∗ + 2(𝑚𝑡 − 𝑘1 + 𝛼𝑖 ), 𝑖 = 1,3, 𝛼1 = −1, 𝛼2 = 1, 𝛼3 = 0; 𝑏−1 𝐶𝑎−1 , if 𝑎 ≥ 𝑏 ≥ 1; Ζ(𝑎, 𝑏) ≝ { 1, if 𝑎 = 𝑏 = 0; 0, elsewhere. Ρ{𝜂(𝑡 𝑡) = 𝑘1 , 𝜂( 𝑡 ∗ 𝑡 ∗ 𝑡 ∗ ) = 𝑘2 } = ∑𝑛𝑚1=0 𝑝𝑚1 𝑞 𝑚0 × 𝛿 {𝐶𝑚1𝑡 −𝑘1−1 Ζ(𝑚𝑡 ∗ − 𝑚𝑡 + 𝑘1 + 1; 𝑚𝑡 − 𝑘1 − 𝛿1 − 1) + 𝛿 𝐶𝑚2𝑡−𝑘1+1 Ζ(𝑚𝑡 ∗ − 𝑚𝑡 + 𝑘1 − 1; 𝑚𝑡 − 𝑘1 − 𝛿2 + 1) + 𝛿 2𝐶𝑚3𝑡−𝑘1 Ζ(𝑚𝑡 ∗ − 𝑚𝑡 + 𝑘1 ; 𝑚𝑡 − 𝑘1 − 𝛿3 ) + 𝜒(𝑚𝑡 − 𝑘1 − 1, 𝑘2 , 𝑚𝑡 ∗ )}Ζ(𝑚𝑡 ; 𝑚𝑡 − 𝑘1 ), (3) Ρ{𝜂(𝑡 𝑡) = 𝑘1 , 𝜂( 𝑡 ∗ 𝑡 𝑡 ∗ ) = 𝑘2 } = ∑𝑛𝑚1 =0 𝑝𝑚1 𝑞 𝑚0 × 𝑘 𝑚 −𝑘 −𝑘2 −1 Ζ(𝑚𝑡 ∗ ; 𝑚𝑡 − 𝑘1 − 1)𝜒1 (𝑚𝑡 − 𝑘1 − 2) + {𝐶𝑚2𝑡 −𝑘1 −2 𝐶𝑘1𝑡+1 1 𝑘2 𝑚t −𝑘1 𝑘 𝑚 −𝑘 −𝑘 −1 𝑚 −𝑘 −𝑘 −1 𝐶𝑚𝑡−𝑘1 𝐶𝑚 ∗ −1 Ζ(𝑘1 ; 𝑚𝑡 − 𝑘1 − 𝑘2 ) + 2𝐶𝑚2𝑡−𝑘1−1 𝐶𝑚 𝑡∗ −11 2 𝐶𝑘1𝑡 1 2 + 𝑡 𝑡 𝜒2 (𝑚𝑡 − 𝑘1 − 1, 𝑘2 , 𝑚𝑡 ∗ )}, (4) 1, if 𝑎1 ≥ 1, 1, if 𝑎1 = 𝑎2 = 𝑎3 = 0, where 𝜒1 (𝑎1 ) = { , 𝜒2 (𝑎1 , 𝑎2 , 𝑎3 ) = { 0, elsewhere 0, elsewhere Ρ{𝜂(𝑡 𝑡) = 𝑘1 , 𝜂( 𝑡 ∗ 𝑡 ∗ 𝑡 ∗ ) + 𝜂( 𝑡 ∗ 𝑡 𝑡 ∗ ) = 𝑘2 } = ∑𝑛𝑚1 =0 𝑝𝑚1 𝑞 𝑚0 × {(∑𝛿+𝛿 ∗ =𝑎1 1 × 𝛿 ∗ 𝑚 −𝑘 −δ−1 𝐶𝑚 𝐶𝛿 𝐶 𝑡 1 𝑡 −𝑘1 −2 𝑚𝑡 −𝑘1 −1 𝑘1 +1 Ζ(𝑚𝑡 ∗ − 𝑚𝑡 + 𝑘1 + 1; 𝑚𝑡 − 𝑘1 − 𝛿 ∗ − 1) + δ 𝛿∗ (∑𝛿+𝛿∗=𝑎2 𝐶𝑚𝑡 −𝑘1 𝐶𝑚𝑡−𝑘1+1 Ζ(𝑘1 ; 𝑚𝑡 − 𝑘1 − δ)Ζ(𝑚𝑡 ∗ − 𝑚𝑡 + 𝑘1 − 1; 𝑚𝑡 − 𝑘1 − 𝛿 ∗ 𝑚 −𝑘 −δ−1 𝛿 ∗ + 1))+(∑𝛿+𝛿∗ =𝑎3 𝐶𝑚 𝐶𝛿 𝐶 𝑡 1 𝑡 −𝑘1 −1 𝑚𝑡 −𝑘1 𝑘1 Ζ(𝑚𝑡 ∗ − 𝑚𝑡 + 𝑘1 ; 𝑚𝑡 − 𝑘1 − 𝛿 ∗ )) +𝜒(𝑚𝑡 − 𝑘1 − 1, 𝑘2 , 𝑚𝑡 ∗ )}, (5) is the symbol ∑ denotes addition over all non-negative integers 𝛿𝑡 and 𝛿𝑡 ∗ such that 𝑎1 = 𝑘2 − 𝑚𝑡 ∗ + 2(𝑚𝑡 − 𝑘1 − 1), 𝑎2 = 𝑘2 − 𝑚𝑡 ∗ + 2(𝑚𝑡 − 𝑘1 + 1), 𝑎1 = 𝑘2 − 𝑚𝑡 ∗ + 2(𝑚𝑡 − 𝑘1 ). 4 Experiment As a result of applying this technique for testing pseudo-random sequences for two- dimensional statistics, you can build tables (relations (2) - (5)) and bubble diagrams (relations (3) - (5)) with which you can get the probability of the distribution of zeros and ones in a given sequences. As practice shows, the use of ready-made tables for analyzing the sequence of randomness allows you to get the answer as quickly as possible, in contrast to the classical testing method. Consider an example of tables and bubble diagrams for a bit-sequence of small length. For example, let the length of the bit sequence n, n = 32 for relations (3) - (5) and n = 24 for relations (2). 4.1 Illustration of the Use of Equality (2) In Table 1 and in Fig. 1 shows the use of the relation (2) for a small sample 𝑛, 𝑛 = 32, and some values 𝑘1 and 𝑘2 . Table 1. Using relation (3) for a small sample of length 32 𝒌𝟏 𝒌𝟐 𝑷 𝑷𝒄 𝒌𝟏 𝒌𝟐 𝑷 𝑷𝒄 4 5 0,0102 0,44366 9 4 0,01595 0,67931 6 1 0,01037 0,45403 6 2 0,01596 0,69527 12 1 0,0108 0,46483 10 1 0,01623 0,7115 5 2 0,01106 0,4759 8 1 0,01642 0,72791 9 5 0,01121 0,48711 6 5 0,01655 0,74446 11 3 0,01157 0,49868 7 5 0,01655 0,76102 5 6 0,01187 0,51055 9 1 0,01721 0,77823 10 4 0,01189 0,52244 10 2 0,0181 0,79633 7 6 0,01203 0,53447 6 4 0,01898 0,81531 6 6 0,01289 0,54736 6 3 0,01901 0,83432 11 1 0,01387 0,56123 8 4 0,01915 0,85346 7 1 0,01393 0,57516 7 2 0,01981 0,87328 5 3 0,01417 0,58933 9 3 0,01985 0,89313 5 5 0,0142 0,60353 7 4 0,02039 0,91351 11 2 0,01421 0,61774 9 2 0,02085 0,93437 8 5 0,01449 0,63222 8 2 0,02156 0,95593 5 4 0,01519 0,64741 7 3 0,02192 0,97785 10 3 0,01595 0,66336 8 3 0,02215 1 In Table 1 the first column contains all possible values 𝑘1 and 𝑘2 , for which proba- bility is Ρ{𝜂(𝑡 𝑡) = 𝑘1 , 𝜂( 𝑡 ∗ 𝑡 ∗ 𝑡 ∗ ) = 𝑘2 } ≥ 0,01. The second column of Table 1 gives the probabilities (in non-decreasing order) 𝑃{𝜂(𝑡 𝑡) = 𝑘1 , 𝜂( 𝑡 ∗ 𝑡 ∗ 𝑡 ∗ ) = 𝑘2 } for pairs of numbers (𝑘1 , 𝑘2 ) listed in the first column. Each row of the fourth column contains the sum of the accumulated probabilities before the event is implemented {𝜂(𝑡 𝑡) = 𝑘1 , 𝜂( 𝑡 ∗ 𝑡 ∗ 𝑡 ∗ ) = 𝑘2 } inclusive where 𝑘1 and 𝑘2 indicated in the same line in the first column. 4.2 Illustration of the Use of Equality (4) In Table 2 and in Fig. 2. shows the use of the relation (4) for a small sample of n, n = 32, and some values of 𝑘1 and 𝑘2 . Table 2. Using relation (4) for a small sample of length 32 𝑘1 𝑘2 𝑃 𝑃𝑐 𝑘1 𝑘2 𝑃 𝑃𝑐 12 1 0,010309 0,25025 5 6 0,019461 0,524834 4 7 0,010346 0,260596 11 2 0,020707 0,545541 13 2 0,010566 0,271162 8 2 0,020939 0,56648 10 1 0,010906 0,282067 6 3 0,022517 0,588997 11 1 0,011296 0,293363 5 4 0,023782 0,61278 3 6 0,011426 0,304789 10 2 0,024014 0,636794 7 6 0,01148 0,316269 9 2 0,024221 0,661015 9 5 0,011732 0,328001 9 4 0,025878 0,686893 12 3 0,011875 0,339876 7 5 0,026396 0,713288 5 3 0,013051 0,352927 10 3 0,027086 0,740375 4 4 0,013083 0,36601 5 5 0,027095 0,76747 7 2 0,015224 0,381234 6 5 0,029893 0,797363 12 2 0,015705 0,396939 7 3 0,030948 0,828311 6 6 0,016693 0,413631 6 4 0,033093 0,861404 10 4 0,017033 0,430665 9 3 0,033247 0,894651 4 6 0,017494 0,448159 8 4 0,033621 0,928272 4 5 0,018859 0,467018 8 3 0,034964 0,963236 11 3 0,019157 0,486174 7 4 0,036764 1 8 5 0,019199 0,505373 Table 2 is formed of columns whose contents are similar to the contents of the Table 1 columns. 4.3 Illustration of the Use of Equality (5) In Table 3 and in Fig. 3 shows the use of the relation (5) for a small sample 𝑛, 𝑛 = 32, and some values 𝑘1 and 𝑘2 . Table 3. Using relation (5) for a small sample of length 32 𝑘1 𝑘2 𝑃 𝑃𝑐 𝑘1 𝑘2 𝑃 𝑃𝑐 6 11 0,01018 0,35129 6 10 0,01737 0,61156 4 12 0,01025 0,36154 10 6 0,019 0,63056 6 6 0,01028 0,37182 5 10 0,01963 0,65019 12 4 0,01165 0,38347 7 9 0,01986 0,67005 7 10 0,01178 0,39525 7 6 0,02017 0,69022 11 6 0,01179 0,40704 6 7 0,02026 0,71048 9 4 0,01209 0,41913 10 5 0,02064 0,73112 4 9 0,01229 0,43143 9 7 0,02083 0,75195 8 9 0,01285 0,44428 8 8 0,0211 0,77305 10 7 0,0129 0,45718 9 5 0,02156 0,79461 9 8 0,01325 0,47043 5 9 0,02159 0,8162 5 11 0,014 0,48444 6 9 0,0242 0,8404 4 11 0,01416 0,4986 9 6 0,02513 0,86552 11 4 0,01481 0,51341 6 8 0,02612 0,89165 10 4 0,01521 0,52862 8 6 0,02619 0,91783 4 10 0,01543 0,54406 7 8 0,02698 0,94481 11 5 0,01578 0,55984 8 7 0,02735 0,97217 8 5 0,01706 0,57691 7 7 0,02783 1 5 8 0,01729 0,5942 Table 3 is formed of columns whose contents are similar to the contents of col- umns from Table 1. 4.4 Illustration of the Use of Equality (2) In Table 4 shows the use of the relation (2) for a small sample 𝑛, 𝑛 = 24, and some values 𝑘1 , 𝑘2 and 𝑘3 . Table 4. Using relation (2) for a small sample of length 24 𝑘1 𝑘2 𝑘3 𝑃 𝑃𝑐 5 1 3 0,009096 0,851162 4 4 3 0,009398 0,86056 5 1 4 0,009748 0,870309 8 1 2 0,009901 0,88021 7 1 3 0,009946 0,890155 4 3 3 0,009999 0,900154 6 3 2 0,010374 0,910529 7 1 2 0,010382 0,920911 4 2 4 0,010422 0,931332 6 2 2 0,010553 0,941885 7 2 2 0,011017 0,952902 5 3 3 0,011284 0,964186 6 2 3 0,011495 0,975681 6 1 3 0,011903 0,987584 5 2 3 0,012416 1 In Table 4 in the first, second and third columns are all possible values 𝑘1 , 𝑘2 and 𝑘3 , for which probability Ρ{𝜂(𝑡 𝑡) = 𝑘1 , 𝜂( 𝑡 ∗ 𝑡 ∗ 𝑡 ∗ ) = 𝑘2 , 𝜂( 𝑡 ∗ 𝑡 𝑡 ∗ ) = 𝑘3 } ≥ 0,009 , and the contents of the fourth and fifth columns are similar to the contents of the third and fourth columns of the Table 1. 5 Results and Discussion As a result of applying this technique for testing pseudo-random sequences for two- dimensional statistics (relations (3) - (5)), you can build a bubble diagram with which you can get the probability of the distribution of zeros and ones in a given sequence. Consider examples of bubble diagrams for a bit sequence of small length n, n = 32. 5.1 Graphic Illustration of the Use of Equality (3) Fig. 1 gives a bubble chart in which the first parameter (horizontal axis) is the value 𝑘1 , the second parameter (vertical axis) is the value 𝑘2 , and the third parameter (the bubble size) is the probability of the event occurring {𝜂(𝑡 𝑡) = 𝑘1 , 𝜂( 𝑡 ∗ 𝑡 ∗ 𝑡 ∗ ) = 𝑘2 }, presented in percent. 6 5 1,42% 1,66% 1,66% 1,45% 4 1,52% 1,90% 2,04% 1,91% 1,60% Value k2 3 1,42% 1,90% 2,19% 2,21% 1,98% 1,59% 2 1,60% 1,98% 2,16% 2,09% 1,81% 1,42% 1 1,64% 1,72% 1,62% 0 4 5 6 7 8 9 10 11 12 Value k1 Fig. 1. Bubble chart of sequence with the length 32 for (3) After analyzing Fig. 1 it can be concluded that for the analysis of the sequence of chains of small and medium length (from 13 to 100 elements), one-dimensional statis- tics do not always give the correct result. For example, if we consider the sequence where the parameter 𝑘1 = 8, then we can draw a conclusion with a degree of probabil- ity about 10% of randomness of the sequence with these characteristics, however, if we pay attention when 𝑘1 = 8 and 𝑘2 = 5 it can be argued that this sequence is non- random, therefore as shown in Fig. 1 we have Ρ{𝜂(𝑡 𝑡) = 𝑘1 , 𝜂( 𝑡 ∗ 𝑡 ∗ 𝑡 ∗ ) = 𝑘2 } = 1,45%. What also shows the lack of use of one-dimensional statistics for the analysis of small and medium bit sequences. An approach to testing using n-dimensional statistics allows us to rely on a deeper justification of the randomness of generated sequences. 5.2 Graphic Illustration of the Use of Equality (4) In Fig. 2 shows the use of the relation (4) for a small sample 𝑛, 𝑛 = 32 , and some values 𝑘1 and 𝑘2 . 6 5 2,71% 2,99% 2,64% 4 2,38% 3,31% 3,68% 3,36% 2,59% Value k2 3 2,25% 3,09% 3,50% 3,32% 2,71% 2 2,09% 2,42% 2,40% 2,07% 1 4 5 6 7 8 9 10 11 12 Value k1 Fig. 2. Bubble chart of sequence with the length 32 for formula (4). Fig. 2 gives a bubble chart in which the first parameter (horizontal axis) is the val- ue 𝑘1 , the second parameter ( vertical axis) is the value 𝑘2 , and the third parameter (bubble size) is the probability of the event occurring {𝜂(𝑡 𝑡) = 𝑘1 , 𝜂( 𝑡 ∗ 𝑡 𝑡 ∗ ) = 𝑘2 }, which is represented as a percentage. 5.3 Graphic Illustration of the Use of Equality (5) In Fig. 3 shows the use of relation (4) for a small sample 𝑛, 𝑛 = 32, and some values 𝑘1 and 𝑘2 . Fig. 3 gives a bubble chart in which the first parameter (horizontal axis) is the val- ue 𝑘1 , the second parameter (vertical axis) is the value 𝑘2 , and the third parameter (bubble size) is the probability of the event occurring {𝜂(𝑡 𝑡) = 𝑘1 , 𝜂( 𝑡 ∗ 𝑡 ∗ 𝑡 ∗ ) + 𝜂( 𝑡 ∗ 𝑡 𝑡 ∗ ) = 𝑘2 }, which is represented as a percentage. 10 9 2,16% 2,42% 8 2,61% 2,70% 2,11% Value k2 7 2,03% 2,78% 2,74% 2,08% 6 2,02% 2,62% 2,51% 5 2,16% 2,06% 4 4 5 6 7 8 9 10 11 Value k1 Fig. 3. Bubble chart of sequence with the length 32 for formula (5). In this paper, the exact compatible distributions of some statistics (0, 1) -sequences of length 1 < 𝑛 < ∞ are given. For a bit sequence of small length n, n = 32, the tables containing the numerical values of the corresponding distribution are given. These tables, as well as the proposed graphic representations, can be used to test the hypoth- esis of the randomness of the arrangement of zeros and units. 6 The Results of the Comparison the NIST Statistical Test Suite and Test of PRS of Small Length using Multidimensional Statistics Consider the well-known examples that are given in [7, 8]. Let us analyze the submit- ted sequences for the corresponding tests, where: • P is the probability of sequence randomness according to the selected criterion from the first column, • P1 is the probability obtained using relation (2), • P2 is the probability obtained using relation (3), • P3 is this is the probability obtained using relation (4), • P4 is this is the probability obtained using relation (5). Table 5. The results of the comparison Input Size Recom- Test mendation, length Sequences Р P1 P2 P3 P4 n more than Frequency (Monobit) 100 10 1011010101 0,527 0,007 0,027 0,007 0,057 Test Frequency Test within a 100 10 0110011010 0,801 0,01 0,075 0,102 0,01 Block Runs test 100 10 1001101011 0,147 0,052 0,075 0,087 0,09 Binary N=20 M = 01011001001 Matrix Rank 38000 0,741 0,004 0,008 0,014 0,017 Q=3 010101101 Test Discrete Fourier Transform 1000 N=10 0001010011 0,109 0,063 0,109 0,084 0,092 (Spectral) Test Non- overlapping N=20, 2 10100100101 Template 200 blocks of 0,344 0,01 0,026 0,051 0,025 110010110 Matching length 10 Test Maurer’s “Universal 01011010011 380000 N=20 0.767 0,001 0,03 0,009 0,023 Statistical” 101010111 Test Serial test 100 N=10 0011011101 0,907 0,029 0,064 0,087 0,088 Approximate 100 N=10 0100110101 0,261 0,052 0,075 0,087 0,09 Entropy test Cumulative Sums 100 N=10 1011010111 0,411 0,02 0,031 0,043 0,057 (Cusum) Test Random Excursions 106 N=10 0110110101 0,502 0,02 0,027 0,043 0,031 Test Random Excursions 106 N=10 0110110101 0,683 0,02 0,027 0,043 0,031 Variant Test As can be seen from the table, the use of two-dimensional statics gives a more ac- curate result for short sequences. And also, according to [8], the recommended mini- mum sequence length n is greater than 100 bits. 7 Conclusions The available approaches to testing pseudorandom sequences show low flexibility and versatility in the means of finding hidden patterns in the data. To solve this problem, it is suggested to use algorithms based on multidimensional statistics. The approach to testing using multidimensional statistics allows you to rely on a deeper justification of the randomness of the generated sequences. This area is prom- ising for scientific research. The paper proposed a methodology for testing a sequence and obtained a correct view of the joint distribution of the numbers of 2-chains and the numbers of 3-chains of various variants in a random bit sequence of a given small length. These algorithms and scheme of work for verification statistical tests of random- ness sequences (proposed in chapter II) combine all the advantages of statistical methods and are the only alternative for the analysis of sequences of small and medi- um length. To implement the proposed approach, a PRS software test package is being devel- oped, which will include tests using multidimensional statistics, which are well rec- ommended for testing a small length PRS. As a result of the implementation of this technique, an information system will be created that will allow analyzing the PRS of a small length and choosing a quality PRS for use in a particular subject area. References 1. Маsоl V., Pоpereshnyаk S. Stаtіstісаl аnаlysіs оf lосаl plоts оf bіts sequenсes. Prоblemy uprаvlenіyа і іnfоrmаtіkі, 5, 92-105 (2019). 2. Popereshnyak S. Analysis of pseudorandom small sequences using multidimensional sta- tistics. In: The 3rd IEEE International Conference on Advanced Information and Commu- nication Technologies (AICT), pp. 5.4.1-5.4.4. IEEE Press, Ukraine (2019) 3. Nejad F. H., Sabah S., Jam A. J. Analysis of avalanche effect on advance encryption standard by using dynamic S-Box depends on rounds keys. In: The 2014 International Conference on Computational Science and Technology (ICCST), pp. 1-5. IEEE Press, Ko- ta Kinabalu (2014) 4. Bhaskar C. U., Rupa C. An advanced symmetric block cipher based on chaotic systems. In: The 2017 Innovations in Power and Advanced Computing Technologies (i-PACT), pp. 1-4. IEEE Press, Vellore (2017) 5. Busireddygari P.; Kak S. Pseudorandom tableau sequences, In: 51st Asilomar Conference on Signals, Systems, and Computers, pp. 1733 – 1736. IEEE Press (2017) 6. Gurugopinath S., Samudhyatha B., Multi-dimensional Anderson-Darling statistic based goodness-of-fit test for spectrum sensing. In: Seventh International Workshop on Signal Design and its Applications in Communications (IWSDA). pp. 165-169. Bengaluru, India. (2015). 7. Moody D. Post-quantum cryptography: NIST’s plan for the future. In: Proceedings of the Seventh International Conference on Post Quantum Cryptography. IEEE Press, Japan, (2016). https://pqcrypto2016.jp 8. Special Publication 800-22. A Statistical Test Suite for Random and Pseudorandom Num- ber Generators for Cryptographic Applications. http://csrc. nist.gov