<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Variation in the Structure of Cyanobacteria Genomes with the Length of the Sliding Window *</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Vladimir</forename><surname>Gustov</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Siberian Federal University</orgName>
								<address>
									<addrLine>79 Svobodny st</addrLine>
									<postCode>660041</postCode>
									<settlement>Krasnoyarsk</settlement>
									<country key="RU">Russia</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="department" key="dep1">Institute of Computational Modelling</orgName>
								<orgName type="department" key="dep2">Siberian Branch</orgName>
								<orgName type="institution">Russian Academy of Sciences</orgName>
								<address>
									<addrLine>50/44 Akademgorodok</addrLine>
									<postCode>660036</postCode>
									<settlement>Krasnoyarsk</settlement>
									<country key="RU">Russia</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Variation in the Structure of Cyanobacteria Genomes with the Length of the Sliding Window *</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">ED7A2B28BC4EDB92F40879385233D465</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-23T19:56+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Order</term>
					<term>Distribution</term>
					<term>Clustering</term>
					<term>Evolution</term>
					<term>Symmetry</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>A seven-cluster pattern for bacterial genomes has been reported. The pattern is revealed through the distribution of formally identified short fragments of a genome converted into triplet frequency dictionaries. The pattern is found to be dependent on the parameters of the genome fragmentation, namely the length of a formally identified fragment within the genome, and the move step to identify the fragments to obtain an ensemble to reveal the clustering pattern. Here, we present the results of the studying the impact of the fragment length on the type and shape of the pattern.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>The diversity of structures found in biological macromolecules grows permanently. Their analysis is a key issue for researchers studying both classical biology and mathematics or computer science. Previously, some new structures were reported for bacteria <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b1">2]</ref>, transcriptomes <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b7">8,</ref><ref type="bibr" target="#b8">9]</ref> and chloroplasts <ref type="bibr" target="#b9">[10,</ref><ref type="bibr" target="#b4">5,</ref><ref type="bibr" target="#b6">7]</ref>. The structure manifests in the mutual distribution of the formally identified fragments of a genome; to reveal the structure, one must convert the fragments into the triplet frequency dictionary W ( , ) and trace their distribution in the 64-dimensional metric space. Various fragments tend to gather into clusters, thus representing the structuredness. The choice of the biological matter for such studies makes matter. Keeping aside of the sequencing, assembling, annotation etc., one faces an extremely high complexity of the objects under consideration. Prokaryotic organisms seem to be more convenient for such studies than eukaryotic ones. Prokaryotic genomes are significantly shorter and (almost always) consist of a single circular chromosome. Organelle genomes are even more advantageous when compared to prokaryotic ones in such capacity, since they encode the same function. Thus, there is no functional impact on the study of the interplay mentioned above, if a researcher studies the organelle genomes of the same group.</p><p>The papers mentioned above present the study of the inner structuredness for various genetic entities. The point is that the studies were carried out with the same parameters of tiling: the latter is the specific coverage of a genome with a set of (may be, overlapping) windows to identify the fragments for further clustering and revealing the inner structuredness.</p><p>Here we present some preliminary studies of the impact of variation in the tiling parameters on the structure revealed in the genome. Essentially, we check what happens with the structures reported previously, if the length of a tile increases. This growth means that one takes into consideration longer and longer correlations in the features found in a genome. Here, both features and correlations should be understood in broader sense.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Material and Methods</head><p>Tiling is a set of (overlapping) subsequences covering the genome under consideration. To do it, consider a genetic sequence of the length L from the fourletter alphabet ℵ = {A, C, G, T}. No other symbols are stipulated to occur in a sequence. To reveal the structuredness, tiling is developed. A window of the length Δ identifying a fragment within a sequence moves with the step t; we take Δ = 603 and t = 202. Moving the window of the length Δ along the sequence to right with the given step t, one obtains the tiling. The latter is unambiguously determined by two parameters: Δ and t.</p><p>As soon as the tiling is developed, each tile is converted into the triplet frequency dictionary W ( , ) (j); here j denotes the tiles location along the sequence. Frequency dictionary is the list of all the triplets ω = ν ν ν ranging from ω = AAA to ω = TTT provided with their frequency value f . First, a sliding reading frame moving along the tile, captures the triplets ω = ν ν ν counting the number of copies n of each triplet. Next, the frequency of the triplet f is derived by dividing the number n by the total number of all the triplets:</p><formula xml:id="formula_0">𝑓 = ,<label>(1)</label></formula><p>where 𝑀 stands for the total number of the triplets. They are counted along the fragment, with the reading frame shift equal to 3, thus providing neither gaps, nor overlaps. Further, we omit the subscript in the dictionary notation, unless it makes a confusion. The conversion of the sequence into an ensemble of the frequency dictionaries transforms it into a set of points in the 64-dimensional metric space.</p><p>For the purpose of the study, each point was labeled with the location coordinate j which is the number of central nucleotides of the relevant tile, and the relative phase index. The latter represents the location of the tile against the coding and non-coding regions found in the genome. To begin with, we neglected the exon-intron structure of the genes, and consider them as a rigid coding region. To identify it we followed the annotation of the genome.</p><p>Seven labels of the index labeling are introduced: F , F , F , B , B , B and J. The tile is indexed as F , 0 ≤ k ≤ 2, if the central nucleotide falls inside the coding region and the distance from the starting nucleotide of the coding region to the central one has the remainder k when divided by 3. This labeling holds for the genes located in the leading strand. Reciprocally, the tile is labeled with B , 0 ≤ k ≤ 2, if the gene is located at the ladder strand; the distance here is determined from the end of the coding region, and it is counted in the opposite direction, since the genetic sequence is presented in the genetic bank with the leading strand only. Finally, the tile is indexed as 𝐽, if its central nucleotide falls out of the coding region.</p><p>Everywhere below, the following coloring label system for the relative phase indices is applied:</p><p> 𝐽 phase tiles (corresponding to the non-coding regions) are colored in brown;  the tiles indexed as 𝐹 and 𝐵 are colored in rose and violet, correspondingly;  the tiles indexed as 𝐹 and 𝐵 are colored in green and cyan, correspondingly;  the tiles indexed as 𝐹 and 𝐵 are colored in orange and yellow, correspondingly.</p><p>There is a point concerning the coloring scheme: it loses sense as the length of the fragment Δ grows. Indeed, the longer is the fragment, the greater is the number of the coding regions falling within the fragment. There is no reason to expect that all the newly coming coding regions observed within the increased fragment yield the same relative phase index. Thus, the fragment becomes a multi-phase entity, and the interfusion of the coding regions eliminates the unambiguity of the relative phase index. Nonetheless, we used the coloring system to relate the relative phase index to the very first coding region found in the fragment. Actually, it leads to mixing of the differently colored fragments within a cluster. We used the freely distributed software VidaExpert<ref type="foot" target="#foot_0">1</ref> to visualize the distribution of the tiles converted into the triplet frequency dictionaries in the metric space. We studied the distribution of the points corresponding to the tiling in the principal component space. To do this, we used the Euclidean metrics.</p><p>Any triplet frequency dictionary maps a tile into a point in the 64-dimensional metric space. The problem is that the sum of all the frequencies is one:</p><formula xml:id="formula_1">∑ 𝑓 ТТТ = 1<label>(2)</label></formula><p>making the frequencies linearly dependent. This linear constraint may cause a false signal in clustering, so a triplet must be excluded from the analysis. Formally speaking, any triplet may be excluded; actually, we excluded the triplets yielding the least standard deviation determined over the set of tiles for each genome. We studied 38 genomes of cyanobacteria downloaded from the EMBL-bank<ref type="foot" target="#foot_1">2</ref> . The studied species belong to the divisions Synechococcales, Nostocales, Chroococcales, Pleurocapsales and Oscillatoriales. Cyanobacteria are supposed to have a common ancestor with the existent chloroplasts, thus we tried to figure out relations between the structures observed in the latter, and those reported in chloroplasts <ref type="bibr" target="#b4">[5,</ref><ref type="bibr" target="#b6">7,</ref><ref type="bibr" target="#b9">10]</ref>. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Results and Discussion</head><p>To begin with, let us concentrate on the structures observed for the "standard" parameters of tiling: that is Δ = 603 and move step is t = 202. We analyzed the structures in the bacterial genomes mentioned above for Δ varying from 603 to 60 000 nucleotides; the move step t was equal to 202 or 201. The comparison of the structures observed with the extreme lengths of the fragment were of primary interest for us.</p><p>For Δ = 603 and t = 202 we observed four seven-cluster patterns, as described in <ref type="bibr" target="#b1">[2,</ref><ref type="bibr" target="#b2">3]</ref>; namely, these are:</p><p> "parallel triangles" for AT-reach genomes (with GC-content close to 0.25), see Fig. <ref type="figure" target="#fig_0">1</ref>;  "orthogonal triangles" for the genomes with GC-content close to 0.35;  "coinciding triangles" pattern characteristic of the genomes with GC-content close to 0.50, and finally  a pattern degenerated into a plane with the six-beam cluster structure, for GCcontent close to 0.60 (see Fig. <ref type="figure" target="#fig_0">1</ref>).</p><p>Also, we investigated the variation of the pattern for the fragment lengths equal to 1206, 1809, 2412, 3015, 9015, 15015, 21015, 27015, 33015 and 45000 nucleotides. Fig. <ref type="figure" target="#fig_1">2</ref> illustrates the changes occurring in the patterns, as the fragment length grows. Evidently, the structure changes rather smoothly, as the window length increases. This change of the pattern seems to be universal for any initial structure. We believe there exist 2 essentially different structures, and another substructure for Δ = 60 000. These two essentially different structures are called ball (Fig. <ref type="figure" target="#fig_3">3(a)</ref>) and Two clusters (Fig. <ref type="figure" target="#fig_3">3(c</ref>)); the substructure mentioned above is called Snitch, see Fig. <ref type="figure" target="#fig_3">3(b)</ref>. The Ball structure resembles a ball, regardless of the initial type of the structure; this structure is characteristic of quite a chaotic distribution of the points corresponding to all six coding phases, and junk, over the ball. Careful examination shows that the structure looks like a set of threads; they can be well seen in the periphery of the ball. The Snitch substructure resembles, more or less, the ball structure. Three clearly identified threads resembling a protuberance characterize this substructure; they are extremely extended in size.</p><p>Table <ref type="table">1</ref>. Correspondence between the structure types for Δ = 603 and Δ = 60 000; 2C stands for two clusters, S stands for the snitch and B stands for the ball. The Two clusters structure consists of long threads which comprise not a ball, but two extended clusters connected with a bridge. The clusters differ in length, in all the cases. The clusters comprise the points belonging to two different non-overlapping parts of the genome: one may identify the points from each cluster (coloring, e. g.) and trace their location over the genome. Remarkably, the points of the same color occupy the sites over the genome separately, never mixing with each other. Also, all these structures exhibit three (relatively independent) threads, in the principal component space; the thread composition depends on the divisibility of the window step t by 3. If the step is divisible, then the ball comprises a single thread; otherwise, three threads are observed (see Fig. <ref type="figure" target="#fig_5">4</ref>). The points in the threads are located sequentially, if t is divisible by 3. Otherwise, the points in three threads observed for t indivisible by 3 are separated for each thread: the threads comprise the fragments with the numbers giving the same remainder of the division of the location number of the fragment by 3, and the fragments within the thread are located sequentially, again.</p><p>Neither relation between the structure type of the genome of cyanobacteria observed for Δ = 60 000 and for Δ = 603, nor taxonomy was found. Similarly, no relation of GC-content to these structures was revealed. Table <ref type="table" target="#tab_1">3</ref> shows the GC-content values obtained at Δ = 60 000. It should be noted that the average GC-content for the two clusters structure exceeds that observed for other structures. However, it should be said that some genomes exhibiting these two structures other than two clusters have the GC-content values up to 60 %. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Conclusion</head><p>The majority of studies <ref type="bibr" target="#b11">[12,</ref><ref type="bibr" target="#b3">4,</ref><ref type="bibr" target="#b10">11,</ref><ref type="bibr" target="#b0">1]</ref> consider structures in terms of their functional role, or chemical properties. We focus exclusively on the statistical properties of biological macromolecules, regardless of their physical issues. The data presented here unambiguously prove that the inner structuredness in cyanobacteria genomes is observed at the window length Δ = 603. Similar structuredness is observed for longer windows (up to Δ = 60 000), as well. The Ball, snitch and two clusters structures start to manifest themselves at Δ ≈ 30 000 (see Fig. <ref type="figure" target="#fig_1">2</ref>); further growth of the window length does not significantly change the structures. Another important issue is the amazing impact of the triplet composition manifested at any length Δ of the window. Indeed, if the step t of the window move is divisible by 3, then a single thread pattern is observed; otherwise, one can see three threads.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. Cyanobacteria genome structures observed for Δ = 603. The projections on 𝑃𝐶 , 𝑃𝐶 are shown in subfigs. 1(a), 1(c), 1(e) and 1(g); the projections on 𝑃𝐶 , 𝑃𝐶 are shown in subfigs. 1(b), 1(d), 1(f) and 1(h). (See the genomes in the text).</figDesc><graphic coords="3,143.28,470.52,310.08,153.48" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 2 .</head><label>2</label><figDesc>Fig. 2. Transformation of the structure, depending on Δ, for Synechococcus sp. WH 8102. Each subfigure presents two projections: the former is (𝑃𝐶 , 𝑃𝐶 ) (left), and the latter is (𝑃𝐶 , 𝑃𝐶 ) (right).</figDesc><graphic coords="4,126.24,373.80,345.96,235.08" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Fig. 1</head><label>1</label><figDesc>shows these patterns; subfigs. 1(a), 1(b) show Prochlorococcus sp. MIT 0604 (AC CP007753), subfigs. 1(c), 1(d) show Nostoc sp. PCC 7107 (AC CP003548), subfigs. 1(e), 1(f) show Synechococcus elongatus PCC 7942 (AC CP000100), and subfigs. 1(g), 1(h) show Synechococcus sp. WH 8109 (AC CP006882).</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Fig. 3 .</head><label>3</label><figDesc>Fig. 3. Transformation Structures identified for Δ = 60 000 for cyanobacteria; subfig. 3(a) shows Synechococcus elongatus PCC 6301; subfig. 3(b) shows Anabaena variabilis ATCC 29413 and subfig. 3(c) shows Synechococcus sp. CC9902.</figDesc><graphic coords="5,126.24,501.96,118.44,98.64" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>Fig. 4 .</head><label>4</label><figDesc>Fig. 4. Acetobacter pasteurianus genome structure observed for 𝑡 divisible by 3 (Fig. 4(a)) and 𝑡 indivisible by 3 (Fig. 4(b)). Here, the starting thousand of points are shown, to simplify visualization.</figDesc><graphic coords="6,126.24,470.04,186.00,140.52" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 2 .</head><label>2</label><figDesc>Correspondence between the structure types for Δ = 603 and Δ = 60 000.</figDesc><table><row><cell></cell><cell>Ball</cell><cell>Snitch</cell><cell>Two clusters</cell></row><row><cell>Chroococcales</cell><cell>1</cell><cell>1</cell><cell>0</cell></row><row><cell>Pleurocapsales</cell><cell>1</cell><cell>0</cell><cell>0</cell></row><row><cell>Nostocales</cell><cell>4</cell><cell>3</cell><cell>0</cell></row><row><cell>Oscillatoriales</cell><cell>5</cell><cell>0</cell><cell>0</cell></row><row><cell>Synechococcales</cell><cell>13</cell><cell>1</cell><cell>10</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 3 .</head><label>3</label><figDesc>Correspondence between the structure types of GC-content; σ is the standard deviation.</figDesc><table><row><cell></cell><cell>GC , %</cell><cell>GC , %</cell><cell>GC</cell><cell>, %</cell><cell>𝜎, %</cell></row><row><cell>Snitch</cell><cell>41.35</cell><cell>42.22</cell><cell cols="2">43.87</cell><cell>1.13</cell></row><row><cell>Ball</cell><cell>31.17</cell><cell>45.60</cell><cell cols="2">60.24</cell><cell>6.99</cell></row><row><cell>Two clusters</cell><cell>52.45</cell><cell>58.44</cell><cell cols="2">61.37</cell><cell>2.93</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">http://bioinfo-out.curie.fr/projects/vidaexpert/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">https://www.ebi.ac.uk/genomes/bacteria.html</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Natural product biosynthetic diversity and comparative genomics of the cyanobacteria</title>
		<author>
			<persName><forename type="first">E</forename><surname>Dittmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Gugger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Sivonen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">P</forename><surname>Fewer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Trends in microbiology</title>
		<imprint>
			<biblScope unit="volume">23</biblScope>
			<biblScope unit="issue">10</biblScope>
			<biblScope unit="page" from="642" to="652" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Seven clusters in genomic triplet distributions</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">N</forename><surname>Gorban</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">G</forename><surname>Popova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">Y</forename><surname>Zinovyev</surname></persName>
		</author>
		<ptr target="http://content.iospress.com/articles/in-silico-biology/isb00110" />
	</analytic>
	<monogr>
		<title level="j">Silico Biology</title>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="471" to="482" />
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Four basic symmetry types in the universal 7-cluster structure of microbial genomic sequences</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">N</forename><surname>Gorban</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">G</forename><surname>Popova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">Y</forename><surname>Zinovyev</surname></persName>
		</author>
		<ptr target="http://content.iospress.com/articles/in-silico-biology/isb00185" />
	</analytic>
	<monogr>
		<title level="j">Silico Biology</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="265" to="282" />
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Complete genome structure of the unicellular cyanobacterium synechocystis sp. pcc6803</title>
		<author>
			<persName><forename type="first">T</forename><surname>Kaneko</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Tabata</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Plant and Cell Physiology</title>
		<imprint>
			<biblScope unit="volume">38</biblScope>
			<biblScope unit="issue">11</biblScope>
			<biblScope unit="page" from="1171" to="1176" />
			<date type="published" when="1997">1997</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">Eight-cluster structure of chloroplast genomes differs from similar one observed for bacteria</title>
		<author>
			<persName><forename type="first">M</forename><surname>Sadovsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Senashova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Malyshev</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2018-02">Feb 2018</date>
		</imprint>
	</monogr>
	<note type="report_type">ArXiv e-prints</note>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">De Novo assembly and cluster analysis of siberian larch transcriptome and genome</title>
		<author>
			<persName><forename type="first">M</forename><surname>Sadovsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Putintseva</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Birukov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Novikova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Krutovsky</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Bioinformatics and Biomedical Engineering</title>
				<editor>
			<persName><forename type="first">F</forename><surname>Ortuño</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">I</forename><surname>Rojas</surname></persName>
		</editor>
		<meeting><address><addrLine>Cham</addrLine></address></meeting>
		<imprint>
			<publisher>Springer International Publishing</publisher>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="455" to="464" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Chloroplast genomes exhibit eight cluster structuredness and mirror symmetry</title>
		<author>
			<persName><forename type="first">M</forename><surname>Sadovsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Senashova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Malyshev</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Bioinformatics and Biomedical Engineering</title>
				<editor>
			<persName><forename type="first">I</forename><surname>Rojas</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">F</forename><surname>Ortuño</surname></persName>
		</editor>
		<meeting><address><addrLine>Cham</addrLine></address></meeting>
		<imprint>
			<publisher>Springer International Publishing</publisher>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="186" to="196" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Symmetry of siberian larch transcriptome</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">G</forename><surname>Sadovsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">V</forename><surname>Birukov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><forename type="middle">A</forename><surname>Putintseva</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">V</forename><surname>Oreshkova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">A</forename><surname>Vaganov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">V</forename><surname>Krutovsky</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Siberian federal university</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="278" to="286" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Seven-cluster structure of larch chloroplast genome</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">G</forename><surname>Sadovsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">I</forename><surname>Bondar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><forename type="middle">A</forename><surname>Putintseva</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">V</forename><surname>Oreshkova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">A</forename><surname>Vaganov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">V</forename><surname>Krutovsky</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Siberian federal university</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="268" to="277" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">G</forename><surname>Sadovsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">Y</forename><surname>Senashova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><forename type="middle">A</forename><surname>Putintseva</surname></persName>
		</author>
		<title level="m">Chloroplasts and Cytoplasm: Structure and Functions</title>
				<imprint>
			<publisher>Nova Science Publishers, Inc</publisher>
			<date type="published" when="2018">2018</date>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="25" to="95" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Nostocyclamide: a new macrocyclic, thiazole-containing allelochemical from nostoc sp. 31 (cyanobacteria)</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">K</forename><surname>Todorova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Juettner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Linden</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Pluess</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Von Philipsborn</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">The Journal of Organic Chemistry</title>
		<imprint>
			<biblScope unit="volume">60</biblScope>
			<biblScope unit="issue">24</biblScope>
			<biblScope unit="page" from="7891" to="7895" />
			<date type="published" when="1995">1995</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Fragmentation mass spectra dataset of linear cyanopeptides-microginins</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">K</forename><surname>Zervou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Kaloudis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Hiskia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Mazur-Marzec</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Data in Brief</title>
		<imprint>
			<biblScope unit="page">105825</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
