<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Annotating Protein Structures for Understanding SARS-CoV-2 Interactome</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Barbara</forename><surname>Puccio</surname></persName>
							<email>barbara.puccio@unicz.it</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Surgical and Medical Sciences</orgName>
								<orgName type="institution">Magna Graecia University</orgName>
								<address>
									<settlement>Catanzaro</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Ugo</forename><surname>Lomoio</surname></persName>
							<email>ugo.lomoio@studenti.unicz.it</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Surgical and Medical Sciences</orgName>
								<orgName type="institution">Magna Graecia University</orgName>
								<address>
									<settlement>Catanzaro</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Luisa</forename><surname>Di</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Department of Engineering</orgName>
								<orgName type="laboratory">Unit of Chemical-Physics Fundamentals in Chemical Engineering</orgName>
								<orgName type="institution">Università Campus Bio-Medico di Roma</orgName>
								<address>
									<addrLine>via Álvaro del Portillo 21</addrLine>
									<postCode>00128</postCode>
									<settlement>Rome</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Pietro</forename><forename type="middle">Hiram</forename><surname>Guzzi</surname></persName>
							<email>hguzzi@unicz.it</email>
						</author>
						<author>
							<persName><forename type="first">Pierangelo</forename><surname>Veltri</surname></persName>
							<email>veltri@unicz.it</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Surgical and Medical Sciences</orgName>
								<orgName type="institution">Magna Graecia University</orgName>
								<address>
									<settlement>Catanzaro</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Annotating Protein Structures for Understanding SARS-CoV-2 Interactome</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">51BAAD0524B6F2381740E401947327EC</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-23T20:31+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Protein Structure</term>
					<term>Protein Contact Network</term>
					<term>Data Annotations</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Protein Contact Network (PCN) is an emerging paradigm for modelling protein structure. A common approach to interpreting such data is through network-based analyses. It has been shown that clustering analysis may discover allostery in PCN. Nevertheless Network Embedding has shown good performances in discovering hidden communities and structures in network. SARS-CoV-2 proteins, and in particular S protein, have a modular structure that need to be annotated to understand complex mechanism of infections. Such annotations, and in particular the highlighting of regions participating in the binding of human ACE2 and TMPRSS, may help the design of tailored strategy for preventing and blocking infection. In this work, we compare some approaches for graph embedding with respect to some classical clustering approaches for annotating protein structures. Results shows that embedding may reveal interesting structure that constitute the starting point for further analysis.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Proteins are polymers made of twenty different amino acids organised to assemble a linear chain. The linear sequence of the amino acid determines the spatial conformation of proteins <ref type="bibr" target="#b0">[1]</ref>. The spatial structure of proteins is characterized by the presence of a central carbon atom (called carbon-C), a carboxyl group, an amino group and a lateral chain, different for each amino acid (this chain can be hydrophobic, no polar or charged). They are linked to each other by covalent bonds (that are called peptide bonds) between molecules <ref type="bibr" target="#b1">[2,</ref><ref type="bibr" target="#b2">3]</ref>.</p><p>The amino acids sequence is known as primary structure. The secondary structure indicates the folding of peptides, i.e. protein subsquences, chain resulting from the interaction between each amino acid and neighboring amino acids. The main types of secondary structure are 𝛼-helix and 𝛽-sheet. The tertiary structure is a combination of secondary structures, that makes a complex molecular shape (3D-shape). A protein in its 3D-conformation is called 'native' and this is closely connected with its biological function. Finally, the quaternary structure is only present in proteins with multiple subunits (peptide chains) that can be the same or different.</p><p>In such a scenario Protein Contact Networks (PCNs) emerged as a relevant paradigm for the analysis of protein molecular structures <ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b4">5]</ref>. A PCN is a graph built from protein structure. A node in a PCN represents an Carbon-𝛼 atom of the backbone, while an edge represents a spatial distance of the atoms in the range 4 and 8 Å.</p><p>PCN descriptors are useful to model and analyse protein functions <ref type="bibr" target="#b3">[4]</ref>. PCNs allow to identify modules in protein molecules through network spectral clustering <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b6">7,</ref><ref type="bibr" target="#b7">8,</ref><ref type="bibr" target="#b8">9,</ref><ref type="bibr" target="#b9">10]</ref>, with relevant application in different biological contexts <ref type="bibr" target="#b10">[11,</ref><ref type="bibr" target="#b11">12]</ref>.</p><p>For instance, analysis of PCNs allows to detect such as allosteric regulation. 𝐴𝑙𝑙𝑜𝑠𝑡𝑒𝑟𝑦 is the ability of proteins to trasmit a signal from one site to another in response to environmental stimuli and this is related to the trasmission of information across the protein from a sensor site (or allosteric site) to the effector site (or binding site) <ref type="bibr" target="#b12">[13,</ref><ref type="bibr" target="#b13">14]</ref>. Allostery may also be studied using wet lab methods such as X-ray or NMR structures correspondent to different activation states or molecular dynamics simulation of allosteric agent binding. Such methods are usually time and resource consuming, therefore there is the need to introduce computational method to detect allostery and then to annotate allosteric regions.</p><p>Starting from PCNs it is possibile to detect modules in protein structure using clustering algorithm approach. In a graph a cluster is a group of nodes that are characterized by a strong intra-cluster connection (in terms of number of contacts) and a weaker inter-cluster connection. Clustering allows to detect community (cluster) in a graph and this is perfectly comparable to the modules detection in protein structure. Two methods have been devised to partition PCNs into clusters: a geometrical method, based on the k-means algorithm and spectral clustering, and the clustering of the embedding of the network.</p><p>PCNs allows to simplify protein analysis to detect modules, essential in allosteric regulation. On the other hand graphs consist of high number of nodes and links (particularly in protein world) thus it is challenging to apply different mathematical and statistical operations. In this situation, embeddings appear as a reasonable solution. Based on potential of graph embeddings, we propose a PCNs analysis using clustering approaches on embeddings, in order to discover allostery in PCN and annotate protein structures.</p><p>We use PCN-Miner, a software tool implemented in the Python programming language able to import protein in the Protein Data Bank format and generate the corresponding protein contact network. Also it offers a set of algorithms for the PCN analysis <ref type="bibr" target="#b14">[15,</ref><ref type="bibr" target="#b15">16,</ref><ref type="bibr" target="#b16">17,</ref><ref type="bibr" target="#b17">18]</ref>. As previously reported, in this work we focus on application of clustering algorithm on the embeddings with the aim of evaluating network embedding approaches in PCN analysis. Our analysis based on SARS-CoV-2 spike glycoprotein, in its closed form, and some of its Variants of Concern <ref type="bibr" target="#b10">[11,</ref><ref type="bibr" target="#b14">15,</ref><ref type="bibr" target="#b18">19,</ref><ref type="bibr" target="#b19">20]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Protein Contact Networks</head><p>A protein structure can be represented as a complex three-dimensional object, formally defined by the coordinates in 3D space of its atoms. Despite the large availability of protein molecular structures data, there are yet many problems regarding the relationship between protein structures and their functions. For this reason it is necessary to define simple descriptors that can describe protein structures with few numerical variables. Structure and function are based on the complex network of inter-residue interactions, where residues are identified by amino acids sequences <ref type="bibr" target="#b20">[21]</ref>. Therefore, the residues interactions are used to define protein interaction networks. Protein interaction networks are thus used to study protein functions. The most simple choice to define networks, is to represent the protein structure by means of 𝛼-carbon location. The spatial position of 𝐶 𝛼 is still reminiscent of the protein backbone and this allows to highlight also the most important characteristics of the three-dimensional structure. Starting from the 𝐶 𝛼 spatial distribution, a distance matrix d is evaluated where each 𝑑 𝑖,𝑗 represents the Euclidean distance in the 3D space between the 𝑖-th and 𝑗-th residues, defined as</p><formula xml:id="formula_0">𝑑 𝑖,𝑗 = √︁ ((𝑥 𝑖 − 𝑥 𝑗 ) 2 ) + ((𝑦 𝑖 − 𝑦 𝑗 ) 2 ) + (𝑧 𝑖 − 𝑧 𝑗 ) 2 )<label>(1)</label></formula><p>where (𝑥 𝑖 , 𝑦 𝑖 , 𝑧 𝑖 ) and (𝑥 𝑗 , 𝑦 𝑗 , 𝑧 𝑗 ) respectively are the cartesian coordinates of residue 𝑖 and 𝑗.</p><p>Matrix 𝑑 is used to define a Protein Contact Network concept, that is an alternative and different representation of using graph-based models to represent protein structures. A graph is the most natural structure to represent proteins, where nodes (or vertices) are the protein residues and links (or edges) between the 𝑖-th and the 𝑗-th nodes (residues) represent residue contacts. In the graph representation there exists a link between two residues 𝑖 and 𝑗 if the distance between two residues (i.e., 𝑑 𝑖,𝑗 ) is higher than 4 and lower than 8 Å. The lower end excludes all covalent bonds, which are not sensible to environment change (so to protein functionality), while the upper end gets rid of weaker non-covalent bonds (so not significant for protein functionality).</p><p>At this point, it is possible to build up adjacency matrix A, whose generic element is defined as:</p><formula xml:id="formula_1">𝐴 𝑖𝑗 = {︂ 1 if 4 ≤ 𝑑 𝑖𝑗 ≤ 8 0 otherwise<label>(2)</label></formula><p>The adjacency matrix of a graph is unique in regard to the ordering nodes. In the case of proteins, in which the order of nodes (residues) corresponds to the residues sequence (primary structure) it can be said that its corresponding network is unique: this establishes a one-to-one correspondence between protein and its network.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Protein Contact Network Analysis</head><p>𝐺𝑟𝑎𝑝ℎ 𝑒𝑚𝑏𝑒𝑑𝑑𝑖𝑛𝑔𝑠 are the transformation of property graphs to a vector or a set of vectors. Embedding should capture the graph topology, vertex-to-vertex relationship, and other relevant information about graphs, subgraphs, and vertices. There are a few reasons why graph embeddings are needed:</p><p>1. Graphs consist of edges and nodes. Those network relationships can only use a specific subset of mathematics, statistics, and machine learning, while vector spaces have a richer toolset of approaches. Embeddings are more practical than the adjacency matrix since they pack node properties in a vector with a smaller dimension. 3. Vector operations are simpler and faster than comparable operations on graphs.</p><p>The development of novel methods fro encoding structural information of graph to be used for subsequent analysis is a recent area in research. These methods are usually referred to as graph representation learning or graph-embedding <ref type="bibr" target="#b21">[22,</ref><ref type="bibr" target="#b22">23]</ref>. The goal of these approaches is learning a mapping for graph substructures (i.e. nodes or sub graphs) into points of a low-dimensional vector space R 𝑑 , having 𝑑 &lt; 𝑛, n is the dimension of the adjacency matrix <ref type="bibr" target="#b21">[22]</ref>. We here focus on node-based embeddings, thus all these methods realise a mapping among nodes and point of the embedding space so that geometric relationships among embedded objects reflect the structure of the original graph. Since embeddings are points of an euclidean space, they may be used in other machine learning tasks (e.g. node classification) or in other graph analysis algorithms.</p><p>Currently, there exists many algorithms and many classification attempts that are categorised and described in some previous surveys <ref type="bibr" target="#b23">[24,</ref><ref type="bibr" target="#b24">25,</ref><ref type="bibr" target="#b25">26,</ref><ref type="bibr" target="#b26">27,</ref><ref type="bibr" target="#b21">22,</ref><ref type="bibr" target="#b27">28]</ref>.</p><p>The input of representation learning algorithms is a undirected and unweighted graph 𝐺 = (𝑉, 𝐸) with its associated adjacency matrix 𝐴 and a real-valued matrix 𝑋 containing node attributes 𝑋 ∈ 𝑅 𝑚𝑥|𝑉 | . The goal of each algorithm is to map each node into a vector z ∈ R 𝑑 where 𝑑 &lt; |𝑉 |.</p><p>Shallow embedding methods encode each node (𝑣 𝑖 ∈ 𝐺) into a single vector through the use of a simple encoding function defined as:</p><formula xml:id="formula_2">𝐸𝑁 𝐶(𝑣 𝑖 ) = 𝑀 𝑣 𝑖<label>(3)</label></formula><p>where M is a matrix containing the embedding vectors and 𝑣 𝑖 is a vector used for selecting the column. The matrix M contains all the embeddings. Each column of M encodes a node of the original graph and the number of rows 𝑑 is lower than the number of nodes 𝑛. These embeddings were initially inspired by matrix factorization approaches. The differences among these methods are in the use of different loss function and similarity measures.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Workflow of the Experiment</head><p>In this work we focused on the comparation between a direct analysis of PCNs using network clustering and network embedding approach followed by clustering on the embeddings. We used PCN-Miner, implemented in Python 3.8 programming. It uses scypy and numpy libraries for managing matrices; management of PDB files is provided by ProDy package; the network embedding is realised by wrapping the GEM library and clustering algorithms bu CdLib; visualisation of protein structures is made by wrapping the community edition of PyMol. Analysis started from protein data. The reference database for protein structures is the Protein Data Bank (PDB https://www.rcsb.org/), which also defines the PDB format, a standard for recording atom files. Using PDB files we obtain protein contact networks. First step consist of import protein structure in PDB format from which it is possible to obtain the PCN (alternatively we can directly import a PCN previously determined). After we access analysis fuctionalities. On one hand we work with network clustering, on the other we work with network embeddings followed by clustering on the embeddings. Therefore we compared the results by comparison of centrality measures and participation coefficient, computed for each residue.</p><p>We focused on four structures of Spike protein, in its closed form: the wild type and three variants of concern (alpha, delta, omicron).</p><p>Figure <ref type="figure" target="#fig_0">2</ref> shows the Structure of the SARS-CoV-2 spike glycoprotein pdb code 6VXX. We first built the protein contact network using PCN-Miner. Then we embedded the resulting graph by using the HOPE algorithm. Each node was embedded into a vector having 64 dimension. Finally we applied the spectral clustering algorithm. We found some interesting community that could be annotated as allosteric regions after verification. Figure <ref type="figure" target="#fig_3">3</ref> shows the Structure of the SARS-CoV-2 spike glycoprotein (closed state)(pdb code 6VXX). This structure is the result of the clustering analysis, with soft clustering using a normalised lapacian. Figure evidences the found communities.</p><p>Figure <ref type="figure" target="#fig_4">4</ref> shows the structure of Closed state of pre-fusion SARS-CoV-2 Delta variant spike protein (pdb code 7SBK). The structure is the result of the embeddings+clustering analysis, with HOPE as embdeddings algorithm.</p><p>Figure <ref type="figure" target="#fig_5">5</ref> shows the Structure of the SARS-CoV-2 spike glycoprotein (closed state)(pdb code 6VXX). Closed state of pre-fusion SARS-CoV-2 Delta variant spike protein (pdb code 7SBK). This structure is the result of the clustering analysis, with soft clustering using a normalised lapacian. ). This structure is the result of the embeddings+clustering analysis, using the HOPE for node embedding</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Conclusion</head><p>Protein Contact Network (PCN) is an emerging paradigm for modelling protein structure. A common approach to interpreting such data is through network-based analyses. It has been shown that clustering analysis may discover allostery in PCN. Nevertheless Network Embedding has shown good performances in discovering hidden communities and structures in network. We demonstrated how an embedding based approach is able to discover modular substructures of the Spike SARS-CoV-2 protein. Such regions need to be annotated and then further analysed to understand complex mechanism of infections. This may help the design of tailored strategy for preventing and blocking infection.   </p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>2 .</head><label>2</label><figDesc>Adjacency matrix describes connections between nodes in the graph. It is a | 𝑉 | x | 𝑉 | matrix, where | 𝑉 | is a number of nodes in the graph. Each column and each row in the matrix present a node. Non-zero values in the matrix indicate that two nodes are connected. Using an adjacency matrix as a feature space for large graphs is almost impossible. Imagine a graph with 1M nodes and an adjacency matrix of 1M x 1M.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Workflow</figDesc><graphic coords="5,89.29,84.19,416.69,155.26" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Structure of the SARS-CoV-2 spike glycoprotein (closed state)(pdb code 6VXX). This structure is the result of the embeddings+clustering analysis, using the HOPE for node embedding</figDesc><graphic coords="6,150.55,84.19,291.71,384.09" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Structure of the SARS-CoV-2 spike glycoprotein (closed state)(pdb code 6VXX). This structure is the result of the clustering analysis, with soft clustering using a normalised lapacian.</figDesc><graphic coords="7,181.80,84.19,229.18,285.28" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 4 :</head><label>4</label><figDesc>Figure 4: Structure of Closed state of pre-fusion SARS-CoV-2 Delta variant spike protein (pdb code 7SBK). This structure is the result of the embeddings+clustering analysis, with HOPE as embdeddings algorithm.</figDesc><graphic coords="7,233.89,414.86,125.01,159.10" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>Figure 5 :</head><label>5</label><figDesc>Figure 5: Structure of Closed state of pre-fusion SARS-CoV-2 Delta variant spike protein (pdb code 7SBK). This structure is the result of the clustering analysis, with soft clustering using a normalised lapacian.</figDesc><graphic coords="8,233.89,84.19,125.01,167.67" type="bitmap" /></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Toward an accurate prediction of inter-residue distances in proteins using 2d recursive neural networks</title>
		<author>
			<persName><forename type="first">P</forename><surname>Kukic</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Mirabello</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Tradigo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Walsh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Veltri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Pollastri</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">BMC bioinformatics</title>
		<imprint>
			<biblScope unit="volume">15</biblScope>
			<biblScope unit="page" from="1" to="15" />
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Non-coding rnas in cancer: platforms and strategies for investigating the genomic &quot;dark matter</title>
		<author>
			<persName><forename type="first">K</forename><surname>Grillone</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Riillo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Scionti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Rocca</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Tradigo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">H</forename><surname>Guzzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Alcaro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">T</forename><surname>Di Martino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Tagliaferri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Tassone</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Experimental &amp; Clinical Cancer Research</title>
		<imprint>
			<biblScope unit="volume">39</biblScope>
			<biblScope unit="page" from="1" to="19" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Predicting the response of the dental pulp to sars-cov2 infection: a transcriptome-wide effect cross-analysis</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">C</forename><surname>Galicia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">H</forename><surname>Guzzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">M</forename><surname>Giorgi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">A</forename><surname>Khan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Genes &amp; Immunity</title>
		<imprint>
			<biblScope unit="volume">21</biblScope>
			<biblScope unit="page" from="360" to="363" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Modularity in protein structures: study on all-alpha proteins</title>
		<author>
			<persName><forename type="first">T</forename><surname>Khan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Ghosh</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Biomolecular Structure and Dynamics</title>
		<imprint>
			<biblScope unit="volume">33</biblScope>
			<biblScope unit="page" from="2667" to="2681" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">On the use of networks in biomedicine</title>
		<author>
			<persName><forename type="first">E</forename><surname>Vocaturo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Veltri</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Procedia Computer Science</title>
		<imprint>
			<biblScope unit="volume">110</biblScope>
			<biblScope unit="page" from="498" to="503" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Modules identification in protein structures: the topological and geometrical solutions</title>
		<author>
			<persName><forename type="first">S</forename><surname>Tasdighian</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Di Paola</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>De Ruvo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Paci</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Santoni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Palumbo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Mei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Di</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Venere</surname></persName>
		</author>
		<author>
			<persName><surname>Giuliani</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of chemical information and modeling</title>
		<imprint>
			<biblScope unit="volume">54</biblScope>
			<biblScope unit="page" from="159" to="168" />
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Modeling multi-scale data via a network of networks</title>
		<author>
			<persName><forename type="first">S</forename><surname>Gu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Jiang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">H</forename><surname>Guzzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Milenković</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Bioinformatics</title>
		<imprint>
			<biblScope unit="volume">38</biblScope>
			<biblScope unit="page" from="2544" to="2553" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">The discovery of a putative allosteric site in the sars-cov-2 spike protein using an integrated structural/dynamic approach</title>
		<author>
			<persName><forename type="first">L</forename><surname>Di Paola</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Hadi-Alijanvand</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Song</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Hu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Giuliani</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of proteome research</title>
		<imprint>
			<biblScope unit="volume">19</biblScope>
			<biblScope unit="page" from="4576" to="4586" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">A framework for the atrial fibrillation prediction in electrophysiological studies</title>
		<author>
			<persName><forename type="first">P</forename><surname>Vizza</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Curcio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Tradigo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Indolfi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Veltri</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Computer methods and programs in biomedicine</title>
		<imprint>
			<biblScope unit="volume">120</biblScope>
			<biblScope unit="page" from="65" to="76" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Methodologies of speech analysis for neurodegenerative diseases evaluation</title>
		<author>
			<persName><forename type="first">P</forename><surname>Vizza</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Tradigo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Mirarchi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">B</forename><surname>Bossio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Lombardo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Arabia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Quattrone</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Veltri</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International journal of medical informatics</title>
		<imprint>
			<biblScope unit="volume">122</biblScope>
			<biblScope unit="page" from="45" to="54" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Structural genetics of circulating variants affecting the sars-cov-2 spike/human ace2 complex</title>
		<author>
			<persName><forename type="first">F</forename><surname>Ortuso</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Mercatelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">H</forename><surname>Guzzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">M</forename><surname>Giorgi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Biomolecular Structure and Dynamics</title>
		<imprint>
			<biblScope unit="page" from="1" to="11" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">From single level analysis to multi-omics integrative approaches: a powerful strategy towards the precision oncology</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">E</forename><surname>Gallo Cantafio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Grillone</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Caracciolo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Scionti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Arbitrio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Barbieri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Pensabene</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">H</forename><surname>Guzzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">T</forename><surname>Di Martino</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">High-throughput</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="page">33</biblScope>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<title level="m" type="main">Disclosing allostery through protein contact networks</title>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">D</forename><surname>Paola</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Mei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">D</forename><surname>Venere</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Giuliani</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2021">2021</date>
			<publisher>Springer</publisher>
			<biblScope unit="page" from="7" to="20" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Pattern discovery in multilayer networks</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Ren</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Sarkar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Veltri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Ay</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Dobra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Kahveci</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE/ACM Transactions on Computational Biology and Bioinformatics</title>
		<imprint>
			<biblScope unit="volume">19</biblScope>
			<biblScope unit="page" from="741" to="752" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">H</forename><surname>Guzzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Di Paola</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Giuliani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Veltri</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2201.05434</idno>
		<title level="m">Design and development of pcn-miner: A tool for the analysis of protein contact networks</title>
				<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">On the analysis of diseases and their related geographical data</title>
		<author>
			<persName><forename type="first">G</forename><surname>Canino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">H</forename><surname>Guzzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Tradigo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Veltri</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE journal of biomedical and health informatics</title>
		<imprint>
			<biblScope unit="volume">21</biblScope>
			<biblScope unit="page" from="228" to="237" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Using dual-network-analyser for communities detecting in dual networks</title>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">H</forename><surname>Guzzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Tradigo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Veltri</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">BMC Bioinformatics</title>
		<imprint>
			<biblScope unit="volume">22</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Data science in unveiling covid-19 pathogenesis and diagnosis: Evolutionary origin to drug repurposing</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">Kumar</forename><surname>Das</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Tradigo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Veltri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Guzzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Roy</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Briefings in Bioinformatics</title>
		<imprint>
			<biblScope unit="volume">22</biblScope>
			<biblScope unit="page" from="855" to="872" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Exploiting the molecular basis of age and gender differences in outcomes of sars-cov-2 infections</title>
		<author>
			<persName><forename type="first">D</forename><surname>Mercatelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Pedace</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Veltri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">M</forename><surname>Giorgi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">H</forename><surname>Guzzi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Computational and Structural Biotechnology Journal</title>
		<imprint>
			<biblScope unit="volume">19</biblScope>
			<biblScope unit="page" from="4092" to="4100" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Viral pneumonia images classification by multiple instance learning: preliminary results</title>
		<author>
			<persName><forename type="first">E</forename><surname>Zumpano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Fuduli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Vocaturo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Avolio</surname></persName>
		</author>
		<idno type="DOI">10.1145/3472163.3472170</idno>
		<idno>doi:10.1145/ 3472163.3472170</idno>
		<ptr target="https://doi.org/10.1145/3472163.3472170" />
	</analytic>
	<monogr>
		<title level="m">25th International Database Engineering &amp; Applications Symposium</title>
				<meeting><address><addrLine>Montreal, QC, Canada</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2021">July 14-16, 2021. 2021</date>
			<biblScope unit="page" from="292" to="296" />
		</imprint>
	</monogr>
	<note>IDEAS 2021</note>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Protein contact networks: an emerging paradigm in chemistry</title>
		<author>
			<persName><forename type="first">L</forename><surname>Di Paola</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>De Ruvo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Paci</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Santoni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Giuliani</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Chemical reviews</title>
		<imprint>
			<biblScope unit="volume">113</biblScope>
			<biblScope unit="page" from="1598" to="1613" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<monogr>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">L</forename><surname>Hamilton</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Ying</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Leskovec</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1709.05584</idno>
		<title level="m">Representation learning on graphs: Methods and applications</title>
				<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Convolutional neural network techniques on x-ray images for covid-19 classification</title>
		<author>
			<persName><forename type="first">E</forename><surname>Vocaturo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Zumpano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Caroprese</surname></persName>
		</author>
		<idno type="DOI">10.1109/BIBM52615.2021.9669784</idno>
		<ptr target="https://doi.org/10.1109/BIBM52615.2021.9669784.doi:10.1109/BIBM52615.2021.9669784" />
	</analytic>
	<monogr>
		<title level="m">IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2021</title>
				<editor>
			<persName><forename type="first">Y</forename><surname>Huang</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><forename type="middle">A</forename><surname>Kurgan</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">F</forename><surname>Luo</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">X</forename><surname>Hu</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Y</forename><surname>Chen</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">E</forename><forename type="middle">R</forename><surname>Dougherty</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Kloczkowski</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Y</forename><surname>Li</surname></persName>
		</editor>
		<meeting><address><addrLine>Houston, TX, USA</addrLine></address></meeting>
		<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2021">December 9-12, 2021. 2021</date>
			<biblScope unit="page" from="3113" to="3115" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">A survey on network embedding</title>
		<author>
			<persName><forename type="first">P</forename><surname>Cui</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Pei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Zhu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Transactions on Knowledge and Data Engineering</title>
		<imprint>
			<biblScope unit="volume">31</biblScope>
			<biblScope unit="page" from="833" to="852" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Network embedding in biomedical data science</title>
		<author>
			<persName><forename type="first">C</forename><surname>Su</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Tong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Zhu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Cui</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Wang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Briefings in bioinformatics</title>
		<imprint>
			<biblScope unit="volume">21</biblScope>
			<biblScope unit="page" from="182" to="197" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">To embed or not: network embedding as a paradigm in computational biology</title>
		<author>
			<persName><forename type="first">W</forename><surname>Nelson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Zitnik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Leskovec</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Goldenberg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Sharan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Frontiers in genetics</title>
		<imprint>
			<biblScope unit="volume">10</biblScope>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">Graph embedding techniques, applications, and performance: A survey</title>
		<author>
			<persName><forename type="first">P</forename><surname>Goyal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Ferrara</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Knowledge-Based Systems</title>
		<imprint>
			<biblScope unit="volume">151</biblScope>
			<biblScope unit="page" from="78" to="94" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<analytic>
		<title level="a" type="main">Data mining and life sciences applications on the grid</title>
		<author>
			<persName><forename type="first">M</forename><surname>Cannataro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">H</forename><surname>Guzzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Sarica</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery</title>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="page" from="216" to="238" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
