<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Development and analysis of a parallel method for detecting chromosomal translocations and inversions in DNA sequences ⋆</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Lesia</forename><surname>Mochurad</surname></persName>
							<email>lesia.i.mochurad@lpnu.ua</email>
							<affiliation key="aff0">
								<orgName type="institution">Lviv Polytechnic National University</orgName>
								<address>
									<addrLine>12 Bandera street</addrLine>
									<postCode>79013</postCode>
									<settlement>Lviv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Renata</forename><surname>Vladyka</surname></persName>
							<email>renata.vladyka.shi.2022@lpnu.ua</email>
							<affiliation key="aff0">
								<orgName type="institution">Lviv Polytechnic National University</orgName>
								<address>
									<addrLine>12 Bandera street</addrLine>
									<postCode>79013</postCode>
									<settlement>Lviv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Development and analysis of a parallel method for detecting chromosomal translocations and inversions in DNA sequences ⋆</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">C2192809CE7ABFE7E6D6214DC510F87D</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T17:10+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Translocation</term>
					<term>inversion</term>
					<term>DNA sequence</term>
					<term>Needleman-Wunsch algorithm</term>
					<term>nucleotides</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In the current context of the growing volume of genetic research and the need for the fast and accurate detection of chromosomal abnormalities, developing efficient algorithms that provide high performance, scalability, and accuracy in comparing DNA sequences is a critical task. In this study, the aim was to create and analyze a parallel algorithm for detecting chromosomal translocations and inversions in DNA sequences. For this purpose, we implemented an algorithm capable of comparing DNA sequences to detect genetic abnormalities, including translocations and inversions. The use of parallel computing made it possible to significantly improve the efficiency of the analysis, reducing the algorithm's execution time and increasing scalability. Particular attention was paid to how the algorithm uses multithreading and how it efficiently distributes the load between threads. To detect inversions, an algorithm was developed that compares two DNA sequences and detects possible changes in nucleotide sequences. To find translocations, the Needleman-Wunsch algorithm was applied, which uses parallel computing to optimally align genetic fragments. The results of the algorithms have shown high efficiency in detecting chromosomal mutations, which is important for genetic research and can be used for medical diagnosis.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>This paper is devoted to the development of an algorithm for preventing chromosomal translocations and inversions. Chromosomal translocations and inversions are complex structural changes in genetic materials that occur as a result of disruptions in the organization of chromosomes. Translocations involve moving a part of a chromosome to another, causing genetic sequences to be interrupted and reconnected, while inversions occur when a chromosome fragment rotates around its axis, changing the order of genes and other genetic elements. These structural variations can occur as a result of various errors during cell division. Incorrect joining or rotation of chromosome parts can lead to important changes in genetic information, which is often associated with the development of various genetic diseases and developmental disorders <ref type="bibr" target="#b0">[1]</ref>.</p><p>The main reasons why it is important to avoid chromosomal translocations and inversions: 1. A translocation between chromosomes 21 and 14 can lead to "familial" Down syndrome, where one of the parents may be a carrier of the translocation. This, in turn, can affect the likelihood of having a child with Down syndrome. 2. Gametes carrying defective chromosomes with inversion often develop into non-viable or miscarried organisms in the early stages of embryogenesis.</p><p>3. Robertson translocations can cause differences in the number of chromosomes between closely related species, which affects their fertility and genetic stability. 4. If gametes with an inverted chromosome develop into organisms, 50% of the gametes of these organisms may not be viable. However, the other 50% may be viable and maintain the mutation in the population.</p><p>Thus, the diagnosis and prediction of chromosomal inversions and translocations are critical in genetic research and medical practice. Detection of these changes contributes to early diagnosis, which allows timely treatment and monitoring of patients. Predicting the risks of developing certain diseases based on the identified mutations also helps patients to realize their genetic risks and opportunities for prevention. To solve this problem, the Needleman-Wunsch algorithm is used <ref type="bibr" target="#b1">[2]</ref>. However, one of the main problems with this algorithm is its high computational complexity; this algorithm has quadratic complexity. The latter makes it difficult to process large DNA sequences. The development of a parallel algorithm can reduce the execution time by simultaneously processing parts of the data <ref type="bibr" target="#b2">[3]</ref>, <ref type="bibr" target="#b3">[4]</ref>, <ref type="bibr" target="#b4">[5]</ref>. Ensuring efficient parallelization requires balanced load distribution among threads to prevent idle time, which may involve optimizing data structures and algorithms. Proper synchronization is also essential to avoid conflicts when accessing shared resources, especially in algorithms with interdependent results. Maintaining result accuracy during parallel processing may require extra validation. Lastly, scalability is a key factor. It should be able to scale across different hardware platforms, from personal computers to supercomputer clusters, to achieve optimal results in different environments <ref type="bibr" target="#b5">[6]</ref>, <ref type="bibr" target="#b6">[7]</ref>.</p><p>Solving all the problems described above is an important step to improve the speed and accuracy of chromosomal abnormalities diagnosis, which, in turn, will increase the efficiency of genetic research and clinical practice.</p><p>In the field of genetic research, there are many important aspects that relate to the study of genetic mutations, their diagnosis, and their impact on human health. Various articles offer valuable insights that emphasize the importance of such mutations in medicine, evolution, and biology. For example, paper <ref type="bibr" target="#b7">[8]</ref> discusses the work of the Human Gene Mutation Database (HGMD), which collects and analyzes information about human genetic mutations. The authors describe the functions of the HGMD, its use in clinical diagnostics and research, as well as plans to expand the database, which include the integration of GTEx project data and the introduction of automated tools for mutation prediction. The advantages of the article are a detailed overview of the HGMD functions and an emphasis on its role in clinical practice, but the disadvantages are the complexity of automatic identification of new mutations and the lack of examples of real-world use of the HGMD in clinical cases.</p><p>In <ref type="bibr" target="#b8">[9]</ref>, the authors analyze different types of genetic mutations and their impact on human health, including the occurrence of diseases due to mutations in genes or chromosomes. The advantages of this article are a clear description of the causes and consequences of genetic diseases, as well as a detailed overview of the types of mutations. However, there are also disadvantages, such as general statements that are not always supported by specific data, as well as a lack of details about research methods. Nevertheless, the article is useful for familiarizing oneself with genetic mutations and their impact on the human body.</p><p>Paper <ref type="bibr" target="#b9">[10]</ref> investigates the use of biodosimetry to assess radiation doses in US military personnel who participated in nuclear tests after World War II. The main goal of this study is to assess chromosomal aberrations, such as inversions and translocations, as a method of retrospective biodosimetry. The results show that inversions can be an effective way to establish radiation doses received decades ago and that their combination with translocations improves the accuracy of the estimate. The study also indicates the influence of age and smoking status on the frequency of aberrations, which is higher in older individuals and smokers.</p><p>The PETI method is discussed in <ref type="bibr" target="#b10">[11]</ref>. It effectively induces precise DNA recombinations in human cells. PETI successfully generates recombination and inversion mutations in endogenous genomes and can correct disease-related inversions and translocations, making it promising for disease modeling and therapeutic approaches. This method is characterized by high accuracy and flexibility compared to other genome editing methods such as Prime-Del and twinPE, but it is still limited in its use for episomal mutations and requires further research to confirm its results.</p><p>The authors of <ref type="bibr" target="#b11">[12]</ref> investigate the effectiveness of the third generation of sequencing analysis for detecting breakpoints in unbalanced chromosomal translocations. The use of Nanopore long reads allows for the detection of microdeletions, microinsertions, and other structural changes that complement translocations. This approach provides accurate genetic information necessary for diagnosis and treatment. However, the high cost and possible experimental failures point to the need for further research and optimization of methods.</p><p>The paper <ref type="bibr" target="#b12">[13]</ref> discusses the role of chromosomal rearrangements in the genetic differentiation and evolution of populations, in particular, using the example of the spiny frog with a chromosomal translocation polymorphism. The authors use whole-chromosome staining (WCP) and genetic analysis to confirm the common origin of the translocations. They found that translocated chromosomes have a higher level of genetic differentiation due to recombination suppression, which contributes to genetic diversity and population differentiation. The article also discusses the mechanisms of recombination suppression, such as the accumulation of repetitive sequences and the capture of adapted alleles.</p><p>This review is rounded off by article <ref type="bibr" target="#b13">[14]</ref>, which analyzes the importance of chromosomal inversions in the plant kingdom and their role in plant evolution. The authors note that inversions are common in many groups of plants and are often associated with locally advantageous traits that promote adaptation and speciation. The article also discusses methods for detecting inversions, such as karyotyping, genetic mapping, and high-fidelity sequencing. The authors call for further research to better understand the origin, evolutionary role, and molecular mechanisms of inversions in plants.</p><p>Thus, the analysis of scientific studies shows that existing approaches to diagnosing and predicting chromosomal inversions and translocations have their advantages, but also significant disadvantages. For example, the HGMD database, although providing useful information for clinical practice, faces problems with the automatic identification of new mutations. The use of biodosimetry to estimate radiation doses, in particular by analyzing chromosomal aberrations, has proven effective, but is dependent on factors such as age and smoking status. The PETI method demonstrates high accuracy in genetic editing, but requires additional research to confirm its widespread use. Chromosomal rearrangements confirm their role in the genetic differentiation of populations, but require a deeper study of the mechanisms of recombination suppression. The study of inversions in plants also emphasizes the need for further research into the evolutionary mechanisms of these changes. This confirms the importance of continuing research in this area. In particular, the need to develop a parallel algorithm for the diagnosis and prediction of chromosomal inversions and translocations is driven by the need for faster and more accurate detection of genetic abnormalities, which is critical for effective treatment and management of diseases.</p><p>The main contribution of this paper:  A new parallel method for detecting chromosomal translocations and inversions using multithreading and parallel computing (OpenMP) is proposed. This method ensures optimal load distribution between threads, reducing processing time and increasing scalability.  The Needleman-Wunsch alignment method has been parallelized to compare sequences, which has increased the accuracy of translocation detection. This approach allows for more accurate identification of chromosomal abnormalities, outperforming traditional sequential alignment methods.  When using eight threads, the proposed method achieves speedups of up to 3.8 for translocations and 3.4 for inversions. This confirms its suitability for processing large amounts of data and the possibility of further optimization on multi-core or GPUs.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Methodic and materials 2.1. General scheme of the method for detecting chromosomal mutations</head><p>The task is to develop a method for detecting mutations in DNA sequences. There are two sets of DNA for inversion and four for translocation: half of them are taken as a reference, i.e. without mutations, and the rest with mutations. This method should be able to detect two types of mutations: inversions and translocations.</p><p>Let � and � be DNA sequences of length � and �, respectively, �=(� 1 , � 2 , ..., ��) and �=(� 1 , � 2 , ..., ��), where � i and � i take the values {A, C, T, G}. Sequence � is the same sequence �, but at a certain interval from i to i + k the sequence is inverted, i.e. �[i:i+k] ! �[i:i+k] and �[i:i+k] = reverse(�[i:i+k]). To search for inversions, the algorithm comparing two DNA sequences is parallelized. This algorithm detects mismatches in nucleotide sequences and determines the gaps where nucleotides in DNA molecules do not match. After that, the found gaps are checked for inversions. The result of this check is the determination of the gaps where inversions are detected.</p><p>To search for translocations, we need to parallelize the Needleman-Wunsch algorithm, which consists in constructing an optimal alignment, using the optimal alignments of the initial fragments of the original sequences obtained in the previous steps. For two sequences � and � with elements � I (0&lt;i&lt;n ) and � j (0&lt;j&lt;m), we construct the matrix F. The element of this matrix contains the weight (score) of the best alignment of elements � 1 , � 2 , ..., � i and � 1 , � 2 , ..., � j of the sequences � and �, respectively. We build the matrix F recursively. We assign the starting point zero weight F(0,0)=0. Then we fill in the matrix in ascending order of both indices, i.e. from the upper left corner to the lower right. If we have already defined F(i-1 ,j-1), F(i-1, j), F(i, j-1), then we can define F(i, j), as</p><formula xml:id="formula_0">� �, � = 푚� � � − 1, � − 1 + � � , � � � � − 1, � + � � �, � − 1 + �</formula><p>, where:</p><p> � �, � -the value in the alignment matrix in the position �, � ;</p><p> �( � , � � ) -the value of the penalty (or reward) for matching (match/mismatch) characters � and � � ;</p><p> � -the value of the gap penalty, which can be negative or positive. This formula recursively calculates the values in each cell of the alignment matrix � �, � , considering the three possible ways to reach the current cell: diagonally (matching), vertically (insertion/deletion in the first sequence), and horizontally (insertion/deletion in the second sequence). Figure <ref type="figure" target="#fig_0">1</ref> shows a general flowchart of the chromosomal mutation detection algorithm </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Parallelizing the detection of chromosomal inversions and translocations</head><p>This paper uses OpenMP technology to detect chromosomal inversions and translocations in the genome. OpenMP allows parallelization of the code, reducing computation time and increasing program efficiency. Testing on a different number of threads showed a significant speedup of the algorithm. By working with shared memory, OpenMP simplifies the exchange of data between threads, which helps to synchronize them.</p><p>The main work of the algorithm for finding chromosomal inversions can be divided into two steps:</p><p>1. Detection of chromosomal inversion gaps.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Analysis and verification of the results.</head><p>The detection of chromosomal inversion gaps is performed by comparing the reference with the target in order to detect mismatches. Accordingly, in a sequential algorithm, the search is performed from the beginning of the array to its end (see Figure <ref type="figure" target="#fig_1">2</ref>). This process was parallelized. The array is divided into N intervals of equal length, where N is the number of threads. Accordingly, having a gap length of 2000 characters and N = 4, each thread will search for a gap of 500 characters. Each thread will store the found gaps in its own array and, upon completion of execution, add it to the general array for further processing.</p><p>To parallelize the algorithm, we used the schedule(static, chunkSize) directive, which statically distributes tasks between threads. chunkSize -the size of the block that will be processed by a particular thread. The use of the static parameter ensures that the intervals will be consistent. Visualization of the search for inversion gaps in the parallel algorithm is shown in Figure <ref type="figure" target="#fig_2">3</ref>. Another resource-intensive process is the analysis and verification of the found inversion gaps. To speed up this process, we perform an initial check to see if the detected nucleotide sequence in the target is an inversion of the corresponding nucleotides in the reference. If the condition is met, we save the gap in the format (start, end, inversion), where the start is the beginning, the end is the end of the gap, and inversion indicates that this nucleotide sequence is an inversion and no further processing is necessary. Otherwise, additional analysis is required.</p><p>Additional analysis includes merging gaps and checking the newly formed sequences for inversion. This process cannot be parallelized and is identical to that of the sequential algorithm <ref type="bibr" target="#b14">[15]</ref>, <ref type="bibr" target="#b15">[16]</ref>.</p><p>The sequential algorithm for finding translocations involves the use of the Needleman-Wunsch method, which consists of sequentially calculating the elements of a matrix based on previously found matrix values. The idea of parallelizing this algorithm is to calculate a specific row or column of the matrix in a separate thread (see Figure <ref type="figure" target="#fig_3">4</ref>). With this implementation, the problem of a resource race arises. It is necessary to assume that all three elements that are used to calculate a particular (i; j) element of the matrix are calculated. For this logic, the matrix is used to identify whether the (i; j) element is calculated or not. Waiting for the necessary elements is realized by means of a spin lock, the essence of which is to place the thread in a loop that constantly checks the necessary condition for calculating the element.</p><p>To parallelize the algorithm, we used the schedule(static, 1) directive, which statically distributes tasks between threads. Using the static parameter ensures that the work for the threads is distributed evenly, and the second parameter ensures that each subsequent row or column is calculated under a different thread. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Results</head><p>The data for testing were taken from the DNA sequence dataset <ref type="bibr" target="#b17">[17]</ref>. This dataset is a dataset that contains DNA sequences and contains information about nucleotide sequences consisting of four possible bases: adenine (A), cytosine (C), guanine (G), and thymine (T).</p><p>First, we present the results of the parallel algorithm using OpenMP technology to find inversions and translocations (see Table <ref type="table" target="#tab_0">1</ref> and Table <ref type="table">2</ref>, respectively). The program execution time will depend on the number of threads and the number of sequences in the file.  <ref type="table" target="#tab_0">1</ref>, it can be concluded that parallelization is not advisable for small values of n, and the calculation speed improves with increasing data volume. Similarly, Table <ref type="table">2</ref> shows that a small matrix dimension is not optimal for a large number of threads. Similarly to the algorithm for detecting inversions for translocations, the parallelization efficiency improves with increasing data size.</p><p>Further, based on the results obtained, we calculate the speedup</p><formula xml:id="formula_1">� � = � � (�) � � (�)</formula><p>and efficiency</p><formula xml:id="formula_2">� � = � � (�)</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>�</head><p>of the proposed parallel method, where � � (�) -the execution time of the sequential algorithm, � � (�) -the execution time of the parallel algorithm, and � is the number of threads. The results are shown in Table <ref type="table" target="#tab_1">3 and Table 4</ref>  0.48 From the results, we can see that the proposed parallel method yields good speedup and efficiency with increasing number of threads and input data dimensionality, which indicates that the method is well scalable. It should be noted that all experiments were performed on a computer with eight logical processors. Therefore, they can be significantly improved with a more powerful computing system. The paper also investigates the accuracy achieved by the proposed approach for finding inversions and translocations. To do this, the program was run 1000 times with different lengths of DNA sets with pre-introduced mutations randomly, and the final result was calculated as an average. Table <ref type="table" target="#tab_2">5</ref> presents the accuracy results of the proposed method on different data sizes, and Table <ref type="table" target="#tab_3">6</ref> compares the accuracy of the Smith-Waterman algorithm <ref type="bibr" target="#b18">[18]</ref> and the Needleman-Wunsch algorithm. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Conclusions and future research</head><p>In this paper, we investigated the effectiveness of parallel algorithms for detecting chromosomal mutations, such as inversions and translocations, using OpenMP technology. The algorithms were developed to compare DNA sequences and detect deviations in nucleotide sequences, which allowed for high accuracy in identifying inversions. For translocations, the parallel Needleman-Wunsch algorithm was used to ensure optimal alignment of DNA fragments. Numerical experiments demonstrated a significant performance improvement on different datasets: the best performance was 3.4 for inversions and 3.8 for translocations, achieved using eight threads. The results indicate an improvement in performance with increasing amounts of input data and the possibility of further optimization of the results obtained through the development of multi-core computing systems. To further improve the efficiency of the developed algorithm, we can consider its implementation on a graphics processing unit using CUDA technology, which will potentially provide even higher data processing speed.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Flowchart of the chromosomal mutation detection algorithm</figDesc><graphic coords="4,208.20,400.20,178.20,186.96" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Visualization of searching for inversion gaps in a sequential algorithmThis process was parallelized. The array is divided into N intervals of equal length, where N is the number of threads. Accordingly, having a gap length of 2000 characters and N = 4, each thread will search for a gap of 500 characters. Each thread will store the found gaps in its own array and, upon completion of execution, add it to the general array for further processing.To parallelize the algorithm, we used the schedule(static, chunkSize) directive, which statically distributes tasks between threads. chunkSize -the size of the block that will be processed by a particular thread. The use of the static parameter ensures that the intervals will be consistent. Visualization of the search for inversion gaps in the parallel algorithm is shown in Figure3.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Visualization of the search for inversion gaps in the parallel algorithmAnother resource-intensive process is the analysis and verification of the found inversion gaps. To speed up this process, we perform an initial check to see if the detected nucleotide sequence in the target is an inversion of the corresponding nucleotides in the reference. If the condition is met, we save the gap in the format (start, end, inversion), where the start is the beginning, the end is the end of the gap, and inversion indicates that this nucleotide sequence is an inversion and no further processing is necessary. Otherwise, additional analysis is required.Additional analysis includes merging gaps and checking the newly formed sequences for inversion. This process cannot be parallelized and is identical to that of the sequential algorithm<ref type="bibr" target="#b14">[15]</ref>,<ref type="bibr" target="#b15">[16]</ref>.The sequential algorithm for finding translocations involves the use of the Needleman-Wunsch method, which consists of sequentially calculating the elements of a matrix based on previously found matrix values. The idea of parallelizing this algorithm is to calculate a specific row or column of the matrix in a separate thread (see Figure4). With this implementation, the problem of a resource race arises. It is necessary to assume that all three elements that are used to calculate a particular (i; j) element of the matrix are calculated. For this logic, the matrix is used to identify whether the (i; j) element is calculated or not. Waiting for the necessary elements is realized by means of a spin lock, the essence of which is to place the thread in a loop that constantly checks the necessary condition for calculating the element.To parallelize the algorithm, we used the schedule(static, 1) directive, which statically distributes tasks between threads. Using the static parameter ensures that the work for the threads is distributed evenly, and the second parameter ensures that each subsequent row or column is calculated under a different thread.</figDesc><graphic coords="5,147.48,276.12,300.00,51.36" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 4 :</head><label>4</label><figDesc>Figure 4: Visualization of the matrix calculation process Thus, the main elements of the novelty of the proposed method include:  Parallelization of the algorithm for detecting chromosomal inversions;  Using OpenMP for static parallelization;  Parallelization of the Needleman-Wunsch algorithm for translocation detection;  Implementation of thread synchronization via spin lock;  Development of a mechanism for checking inversions without additional processing in case of successful inversion detection.</figDesc><graphic coords="6,176.04,72.00,243.00,196.80" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Program execution time (milliseconds) for searching for inversions in parallel calculations</figDesc><table><row><cell>n</cell><cell>1</cell><cell>Number of threads 3</cell><cell>4</cell><cell>8</cell></row><row><cell>4021</cell><cell>8</cell><cell>7</cell><cell>8</cell><cell>10</cell></row><row><cell>8042</cell><cell>27</cell><cell>19</cell><cell>16</cell><cell>15</cell></row><row><cell>32161</cell><cell>97</cell><cell>61</cell><cell>45</cell><cell>34</cell></row><row><cell>192261</cell><cell>611</cell><cell>313</cell><cell>222</cell><cell>180</cell></row><row><cell>Table 2</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="4">Program execution time (seconds) for searching for translocations in parallel calculations</cell><cell></cell></row><row><cell>n</cell><cell>1</cell><cell>Number of threads 3</cell><cell>4</cell><cell>8</cell></row><row><cell>349</cell><cell>0.3</cell><cell>0.17</cell><cell>0.14</cell><cell>0.4</cell></row><row><cell>1045</cell><cell>1.8</cell><cell>1.2</cell><cell>0.9</cell><cell>0.8</cell></row><row><cell>4177</cell><cell>25</cell><cell>14.5</cell><cell>10</cell><cell>9</cell></row><row><cell>5801</cell><cell>57</cell><cell>31</cell><cell>21</cell><cell>15</cell></row><row><cell cols="5">Numerical experiments were performed on four different samples of nucleotide sequences.</cell></row><row><cell>According to Table</cell><cell></cell><cell></cell><cell></cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 3</head><label>3</label><figDesc>Speedup and efficiency indicators of the parallel algorithm for detecting inversions</figDesc><table><row><cell>n</cell><cell>speedup</cell><cell cols="5">2 efficiency speedup efficiency speedup efficiency 4 8</cell></row><row><cell>4021</cell><cell>1.14</cell><cell>0.14</cell><cell>1.00</cell><cell>0.13</cell><cell>0.80</cell><cell>0.10</cell></row><row><cell>8042</cell><cell>1.42</cell><cell>0.18</cell><cell>1.69</cell><cell>0.21</cell><cell>1.80</cell><cell>0.23</cell></row><row><cell>32161</cell><cell>1.59</cell><cell>0.20</cell><cell>2.16</cell><cell>0.27</cell><cell>2.85</cell><cell>0.36</cell></row><row><cell>192261</cell><cell>1.95</cell><cell>0.24</cell><cell>2.75</cell><cell>0.34</cell><cell>3.39</cell><cell>0.42</cell></row><row><cell>Table 4</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="6">Speedup and efficiency indicators of a parallel algorithm for translocation detection</cell><cell></cell></row><row><cell>n</cell><cell>speedup</cell><cell cols="5">2 efficiency speedup efficiency speedup efficiency 4 8</cell></row><row><cell>349</cell><cell>1.76</cell><cell>0.22</cell><cell>2.14</cell><cell>0.27</cell><cell>0.75</cell><cell>0.09</cell></row><row><cell>1045</cell><cell>1.50</cell><cell>0.19</cell><cell>2.00</cell><cell>0.25</cell><cell>2.25</cell><cell>0.28</cell></row><row><cell>4177</cell><cell>1.72</cell><cell>0.22</cell><cell>2.50</cell><cell>0.31</cell><cell>2.78</cell><cell>0.35</cell></row><row><cell>5801</cell><cell>1.84</cell><cell>0.23</cell><cell>2.71</cell><cell>0.34</cell><cell>3.80</cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 5</head><label>5</label><figDesc>Accuracy of the proposed parallel algorithms on different data sizes, %</figDesc><table><row><cell>Sequence length</cell><cell>100</cell><cell>200</cell><cell>500</cell><cell>1000</cell></row><row><cell>inversion</cell><cell>98.5</cell><cell>98.6</cell><cell>99.0</cell><cell>99.2</cell></row><row><cell>translocation</cell><cell>96.3</cell><cell>97.0</cell><cell>97.5</cell><cell>97.6</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 6</head><label>6</label><figDesc>Comparison of the accuracy of the Smith-Waterman and Needleman-Wunsch algorithms, %</figDesc><table><row><cell>Sequence length</cell><cell>Smith-Waterman algorithm</cell><cell>Needleman-Wunsch algorithm</cell></row></table></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Acknowledgements</head><p>The authors would like to thank the Armed Forces of Ukraine for providing security to perform this work. This work has become possible only because of the resilience and courage of the Ukrainian Army.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Declaration on Generative AI</head><p>During the preparation of this paper, the authors utilized Grammarly to verify spelling and grammar accuracy.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Translocations and inversions: major chromosomal rearrangements during Vigna (Leguminosae) evolution</title>
		<author>
			<persName><forename type="first">S</forename><surname>Dias</surname></persName>
		</author>
		<idno type="DOI">10.1007/s00122-024-04546-8</idno>
	</analytic>
	<monogr>
		<title level="j">Theor. Appl. Genet</title>
		<imprint>
			<biblScope unit="volume">137</biblScope>
			<biblScope unit="issue">1</biblScope>
			<date type="published" when="2024-01">Jan. 2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Sequence Alignment Using Machine Learning-Based Needleman-Wunsch Algorithm</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">E E</forename><surname>-D. Rashed</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">M</forename><surname>Amer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>El-Seddek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">E</forename></persName>
		</author>
		<author>
			<persName><forename type="first">-D</forename><surname>Moustafa</surname></persName>
		</author>
		<idno type="DOI">10.1109/ACCESS.2021.3100408</idno>
	</analytic>
	<monogr>
		<title level="j">IEEE Access</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="page" from="109522" to="109535" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">A fast parallelized DBSCAN algorithm based on OpenMp for detection of criminals on streaming services</title>
		<author>
			<persName><forename type="first">L</forename><surname>Mochurad</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Sydor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Ratinskiy</surname></persName>
		</author>
		<idno type="DOI">10.3389/fdata.2023.1292923</idno>
	</analytic>
	<monogr>
		<title level="j">Front. Big Data</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="page">1292923</biblScope>
			<date type="published" when="2023-10">Oct. 2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Implementation and analysis of a parallel kalman filter algorithm for lidar localization based on CUDA technology</title>
		<author>
			<persName><forename type="first">L</forename><surname>Mochurad</surname></persName>
		</author>
		<idno type="DOI">10.3389/frobt.2024.1341689</idno>
	</analytic>
	<monogr>
		<title level="j">Front. Robot. AI</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="page">1341689</biblScope>
			<date type="published" when="2024-02">Feb. 2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Simple statistical tests selection based parallel computating method ensures the guaranteed global extremum identification</title>
		<author>
			<persName><forename type="first">V</forename><surname>Kovtun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Altameem</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Al-Maitah</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Kempa</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.jksus.2024.103165</idno>
	</analytic>
	<monogr>
		<title level="j">J. King Saud Univ. -Sci</title>
		<imprint>
			<biblScope unit="volume">36</biblScope>
			<biblScope unit="issue">5</biblScope>
			<biblScope unit="page">103165</biblScope>
			<date type="published" when="2024-05">May 2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">A secure data transmission framework for IoT enabled healthcare</title>
		<author>
			<persName><forename type="first">S</forename><surname>Saif</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Das</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Biswas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Khan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Haq</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Kovtun</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.heliyon.2024.e36269</idno>
	</analytic>
	<monogr>
		<title level="j">Heliyon</title>
		<imprint>
			<biblScope unit="volume">10</biblScope>
			<biblScope unit="issue">16</biblScope>
			<biblScope unit="page">e36269</biblScope>
			<date type="published" when="2024-08">Aug. 2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">On Applicability of Model Checking Technique in Power Systems and Electric Power Industry</title>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">V</forename><surname>Shkarupylo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">V</forename><surname>Blinov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">A</forename><surname>Chemeris</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">V</forename><surname>Dusheba</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">A J</forename><surname>Alsayaydeh</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-030-87675-3_1</idno>
	</analytic>
	<monogr>
		<title level="m">Systems, Decision and Control in Energy III</title>
				<editor>
			<persName><forename type="first">A</forename><surname>Zaporozhets</surname></persName>
		</editor>
		<meeting><address><addrLine>Cham</addrLine></address></meeting>
		<imprint>
			<publisher>Springer International Publishing</publisher>
			<date type="published" when="2022">2022</date>
			<biblScope unit="volume">399</biblScope>
			<biblScope unit="page" from="3" to="21" />
		</imprint>
	</monogr>
	<note>Studies in Systems, Decision and Control</note>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">The Human Gene Mutation Database (HGMD®): optimizing its use in a clinical diagnostic or research setting</title>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">D</forename><surname>Stenson</surname></persName>
		</author>
		<idno type="DOI">10.1007/s00439-020-02199-3</idno>
	</analytic>
	<monogr>
		<title level="j">Hum. Genet</title>
		<imprint>
			<biblScope unit="volume">139</biblScope>
			<biblScope unit="issue">10</biblScope>
			<biblScope unit="page" from="1197" to="1207" />
			<date type="published" when="2020-10">Oct. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Genetic Mutations and Major Human Disorders: A Review</title>
		<author>
			<persName><forename type="first">S</forename><surname>Banoon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Salih</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Ghasemian</surname></persName>
		</author>
		<idno type="DOI">10.21608/ejchem.2021.98178.4575</idno>
	</analytic>
	<monogr>
		<title level="j">Egypt. J. Chem</title>
		<imprint>
			<biblScope unit="volume">0</biblScope>
			<biblScope unit="issue">0</biblScope>
			<biblScope unit="page" from="0" to="0" />
			<date type="published" when="2021-10">Oct. 2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Chromosome Translocations, Inversions and Telomere Length for Retrospective Biodosimetry on Exposed U.S. Atomic Veterans</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>Mckenna</surname></persName>
		</author>
		<idno type="DOI">10.1667/RR15240.1</idno>
	</analytic>
	<monogr>
		<title level="j">Radiat. Res</title>
		<imprint>
			<biblScope unit="volume">191</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page">311</biblScope>
			<date type="published" when="2019-02">Feb. 2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Targeted genomic translocations and inversions generated using a paired prime editing strategy</title>
		<author>
			<persName><forename type="first">J</forename><surname>Kweon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H.-Y</forename><surname>Hwang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Ryu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A.-H</forename><surname>Jang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Kim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Kim</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.ymthe.2022.09.008</idno>
	</analytic>
	<monogr>
		<title level="j">Mol. Ther</title>
		<imprint>
			<biblScope unit="volume">31</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="249" to="259" />
			<date type="published" when="2023-01">Jan. 2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Gene sequencing and result analysis of balanced translocation carriers by third-generation gene sequencing technology</title>
		<author>
			<persName><forename type="first">X</forename><surname>Zeng</surname></persName>
		</author>
		<idno type="DOI">10.1038/s41598-022-20356-8</idno>
	</analytic>
	<monogr>
		<title level="j">Sci. Rep</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page">7004</biblScope>
			<date type="published" when="2023-04">Apr. 2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">The Origin and Evolution of Chromosomal Reciprocal Translocation in Quasipaa boulengeri (Anura, Dicroglossidae)</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Xia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Yuan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Luo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Yuan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Zeng</surname></persName>
		</author>
		<idno type="DOI">10.3389/fgene.2019.01364</idno>
	</analytic>
	<monogr>
		<title level="j">Front. Genet</title>
		<imprint>
			<biblScope unit="volume">10</biblScope>
			<biblScope unit="page">1364</biblScope>
			<date type="published" when="2020-01">Jan. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Frequency, Origins, and Evolutionary Role of Chromosomal Inversions in Plants</title>
		<author>
			<persName><forename type="first">K</forename><surname>Huang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">H</forename><surname>Rieseberg</surname></persName>
		</author>
		<idno type="DOI">10.3389/fpls.2020.00296</idno>
	</analytic>
	<monogr>
		<title level="j">Front. Plant Sci</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="page">296</biblScope>
			<date type="published" when="2020-03">Mar. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Channel Congestion Control in VANET for Safety and Non-Safety Communication: A Review</title>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">F</forename><surname>Binti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Abdul</forename><surname>Rahim</surname></persName>
		</author>
		<idno type="DOI">10.1109/ICRAIE52900.2021.9704017</idno>
	</analytic>
	<monogr>
		<title level="m">6th IEEE International Conference on Recent Advances and Innovations in Engineering (ICRAIE)</title>
				<meeting><address><addrLine>Kedah, Malaysia</addrLine></address></meeting>
		<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2021-12">2021. Dec. 2021</date>
			<biblScope unit="page" from="1" to="6" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<title/>
		<author>
			<persName><forename type="first">I</forename><surname>Fedorchenko</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Oliinyk</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">A J</forename><surname>Alsayaydeh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Kharchenko</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Stepanenko</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename></persName>
		</author>
		<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<author>
			<persName><surname>Shkarupylo</surname></persName>
		</author>
		<idno type="DOI">10.5281/ZENODO.5163692</idno>
		<title level="m">MODIFIED GENETIC ALGORITHM TO DETERMINE THE LOCATION OF THE DISTRIBUTION POWER SUPPLY NETWORKS IN THE CITY</title>
				<imprint>
			<date type="published" when="2020-12">Dec. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<monogr>
		<title level="m" type="main">DNA sequence dataset</title>
		<author>
			<persName><forename type="first">Nagesh</forename><surname>Singh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Chauhan</forename></persName>
		</author>
		<ptr target="https://www.kaggle.com/datasets/nageshsingh/dna-sequence-dataset" />
		<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<title level="m" type="main">A Review of Parallel Implementations for the Smith-Waterman Algorithm</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Xia</surname></persName>
		</author>
		<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title/>
		<idno type="DOI">10.1007/s12539-021-00473-0</idno>
	</analytic>
	<monogr>
		<title level="j">Interdiscip. Sci. Comput. Life Sci</title>
		<imprint>
			<biblScope unit="volume">14</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="1" to="14" />
			<date type="published" when="2022-03">Mar. 2022</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
