<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Building Executable Biological Pathway Models Automatically from BioPAX</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Timo</forename><surname>Willemsen</surname></persName>
							<email>timo.willemsen@gmail.com</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Computer Science</orgName>
								<orgName type="institution">VU University Amsterdam</orgName>
								<address>
									<country key="NL">The Netherlands</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Anton</forename><surname>Feenstra</surname></persName>
							<email>k.a.feenstra@vu.nl</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Computer Science</orgName>
								<orgName type="institution">VU University Amsterdam</orgName>
								<address>
									<country key="NL">The Netherlands</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Paul</forename><surname>Groth</surname></persName>
							<email>p.t.groth@vu.nl</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Computer Science</orgName>
								<orgName type="institution">VU University Amsterdam</orgName>
								<address>
									<country key="NL">The Netherlands</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Building Executable Biological Pathway Models Automatically from BioPAX</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">A727905159BB1668C22DEB74C6C25DA4</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T07:28+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>biological pathways</term>
					<term>biological networks</term>
					<term>BioPax</term>
					<term>executable models</term>
					<term>Petri nets</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The amount of biological data exposed in semantic formats is steadily increasing. In particular, pathway information (a model of how molecules interact within a cell) from databases such as KEGG and WikiPathways are available in a standard RDF-based format BioPAX. However, these models are descriptive and not executable in nature. Being able to simulate or execute a pathway is one key mechanism for understanding the operation of a cell. The creation of executable models can take a significant amount of time and only relatively few such models currently exist. In this paper, we leverage the availability of semantically represented pathways, to bootstrap the creation of executable pathway models. We present an approach to automate the creation of executable models in the form of Petri-Nets from BioPAX represented pathways. This approach is encapsulated in an online tool, BioPax2PNML.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>A biological pathway, simply said, is a sequence of interactions among molecules of a cell. There are many different types of pathways; gene regulation pathways, signaling pathways and protein interaction pathways are among the most commonly used ones. <ref type="bibr" target="#b0">[1]</ref> Originally, pathways were hand-drawn and presented in papers. Pathways are now made available in online databases in computer parsable formats (e.g. BioPAX). For example, the WikiPathways has over 1700 available pathways <ref type="foot" target="#foot_0">1</ref> . While these pathway descriptions are highly useful, they contain mostly static information about interacting molecules and do not describe how pathways actually work or give insight into the dynamics of these interactions <ref type="bibr" target="#b1">[2]</ref>.</p><p>To address this lack of information, work has been undertaken to create computational models of these pathways <ref type="bibr" target="#b2">[3]</ref>. Two types of models can be distinguished: executable and mathematical <ref type="bibr" target="#b3">[4]</ref>. The mathematical models give insight into quantities and how they change over time, and are frequently created by systems biologists. Executable models are valuable to biologists because they have a large variety of uses <ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b4">5]</ref>. They can be used to summarize available knowledge of interactions and mechanisms in a system, and to investigate how components cooperate to produce global system behaviour. Creating an executable model is still a tedious manual process, mostly because they contain parameters that need to be collected manually. On the other hand, mathematical models typically require detailed knowledge of (kinetic and rate) parameters, which are often not available and can be very hard to obtain from experiments. From our experience, for executable models, the process of model construction and parameter calibration usually takes several months <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b5">6,</ref><ref type="bibr" target="#b6">7]</ref>, even for a modestly sized network. This is currently one of the major bottlenecks in computational life sciences research <ref type="bibr" target="#b7">[8]</ref>.</p><p>This paper begins to address this bottleneck by leveraging the availability of semantic representations of pathways and converting them to an executable model. Concretely, the contributions of this paper are: i) to present a method to automate validation of pathway data; ii) a mapping of the BioPAX format to an executable model (Petri nets, represented in the Petri Net Markup Language; PNML); and, iii) a method to automatically create these executable models. We have developed a webservice that encapsulates the described method and can be accessed at www.few.vu.nl/∼twn370/BioPax2PNML/. Additionally, all code is available online at: https://github.com/TimoWillemsen/Biopax2PNML.</p><p>The rest of this paper is organized as follows. We begin in Section 2 with background information on biological pathways and common formats for both descriptive (BioPAX) and executable (PNML) representations of them. We then describe our approach for mapping between these two formats (Section 3). To ensure that a BioPAX pathway has the appropriate information to be converted to PNML, we present a validation approach in Section 4. This is followed, in Section 5, by a description of the implementation of our method. Finally, we conclude with some thoughts on future work in Section 6.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Biological Pathways</head><p>There are different types of biological pathways, corresponding to different levels of abstraction. For example, a pathway may describe interactions between different cells, or between genes, or between proteins, or it may describe biochemical reactions (or combinations thereof). Many databases exist that collect this information in a variety of forms, and some are very specialized on particular types of data. It is beyond the scope of this work to provide a comprehensive overiew. Some of the most well-known are WikiPathways <ref type="bibr" target="#b8">[9]</ref>, focused on signal transduction; the KEGG Pathway database <ref type="bibr" target="#b9">[10,</ref><ref type="bibr" target="#b10">11]</ref>, with a focus on metabolic pathways; and Reactome <ref type="bibr" target="#b11">[12]</ref> which has a broader scope.</p><p>The examples provided in this paper will focus on signal transduction pathways, as these tend to be well-studied and therefore well-defined. Such pathways typically include protein-protein interactions, protein-gene interactions and biochemical reactions.</p><p>We have based our research on the pathways provided by the WikiPathways database <ref type="bibr" target="#b8">[9]</ref>. This is a community-driven service where biological pathways are extensively manually curated. The context of the pathways included in Wiki-Pathways can vary considerably, depending on their intended use. For example, simply representing known interactions in a shareable way is considered useful, but such pathways likely will not include details that are crucial for computational analysis, even as simple as explicit notation of interactions among proteins and genes. As a result of this, only certain pathways are suitable for computational analysis.</p><p>One such example is the C. elegans Programmed Cell Death pathway from the WikiPathways Database, as shown in Fig. <ref type="figure" target="#fig_0">1</ref>. For the purpose of this paper, we have taken a subset of this pathway, as shown in Fig. <ref type="figure" target="#fig_0">1</ref>. This pathway consists of 5 genes. When ced-3 is activated, it will trigger the cell's programmed death.</p><p>We now discuss the computational representation of pathways used by Wiki-Pathways. After which, we briefly describe the use of Petri-nets to as a language for executable models of pathways.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">BioPax</head><p>In 2010 Demir et al <ref type="bibr" target="#b12">[13]</ref> created the Web Ontology Language (OWL) based standard for modeling pathways: BioPax. A key aspect of this standard is that it allows for referring to external databases for information (e.g. linking to UniProt protein descriptions.) This standard has been used in many different biological databases; all the three mentioned above, Reactome, KEGG and WikiPathways expose BioPax through an RDF interface <ref type="bibr" target="#b8">[9,</ref><ref type="bibr" target="#b10">11,</ref><ref type="bibr" target="#b11">12]</ref>.</p><p>BioPax can be used to model different types of pathway components. An example of how genes are modelled in BioPax, is shown below; the ced-3 and ced-4 genes of the C. elegans Programmed Cell Death pathway, as shown in Fig. <ref type="figure" target="#fig_0">1</ref>.</p><p>Two genes, ced-3 and ced-4, from the C. elegans Programmed Cell Death Pathway from the WikiPathways Database ID:WP367 &lt; bp:Protein rdf:about = " eef1e " &gt; &lt; bp:displayName &gt;ced -3 &lt;/ bp:displayName &gt; &lt; bp:entityReference rdf:resource = " id3 " / &gt; &lt;/ bp:Protein &gt; &lt; bp:Protein rdf:about = " c0b3e " &gt; &lt; bp:displayName &gt;ced -4 &lt;/ bp:displayName &gt; &lt; bp:entityReference rdf:resource = " id4 " / &gt; &lt;/ bp:Protein &gt; An example of interactions in a pathway modelled in BioPax is shown below; we see a reaction 'id40' that connects a right-hand-side element (eef1e; ced-3) with a left-hand-side (c0b3e; ced-4) element. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Petri nets</head><p>Petri nets are a formalism geared towards modelling and analysis of concurrent systems. A Place-Transition (PT) Petri net is a quadruple (P, T, A, m), where P is a set of places and T a set of transitions. A describes arcs which connect places with transitions or vice versa. Each place holds zero or more tokens, which represent flow of control through this place. The number of tokens in each place all together are called a marking m of the network.</p><p>Fig. <ref type="figure" target="#fig_2">2</ref> shows a graphical representation of such a Petri Net, again for our small example part of the C. elegans Programmed Cell Death pathway. Squares are transitions, representing interactions, and circles are places, representing genes. Arcs are represented by arrows, and the marking is empty. Firing of a transition depends on the availability of resources (tokens) in the input places, and represents the execution of a reaction: consuming substrates and creating products. <ref type="bibr" target="#b13">[14,</ref><ref type="bibr" target="#b14">15]</ref> For computational purposes we have chosen to represent Petri nets in the Petri Net Markup Language (PNML) format. This is a straightforward XML standard that a number of systems support.[16] Fig. <ref type="figure" target="#fig_2">2</ref>.2 shows the Petri net of Fig. <ref type="figure" target="#fig_2">2</ref> in an XML representation. Petri nets are recognized as a powerful tool to model biological pathways <ref type="bibr" target="#b13">[14,</ref><ref type="bibr" target="#b14">15]</ref>, as the formalism readily allows to capture both the complexity and the highly concurrent nature of biological systems, while optimally leveraging the large amounts of qualitative data available. <ref type="bibr" target="#b14">[15,</ref><ref type="bibr" target="#b2">3,</ref><ref type="bibr" target="#b5">6]</ref>  &lt; transition id = " t11 " &gt; &lt;/ transition &gt; &lt; place id = " eef1e " &gt; &lt; name &gt; &lt; text &gt;ced -3 &lt;/ text &gt; &lt;/ name &gt; &lt;/ place &gt; &lt; place id = " c0b3e " &gt; &lt; name &gt; &lt; text &gt;ced -4 &lt;/ text &gt; &lt;/ name &gt; &lt;/ place &gt; &lt; arc id = " a2 " source = " c0b3e " target = " t11 " / &gt; &lt; arc id = " a3 " source = " t11 " target = " eef1e " / &gt;</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">BioPax to PNML mapping</head><p>To transform static BioPax data into an executable Petri net, we have developed a mapping between the two formats. BioPax is an RDF format, while PNML is an XML format. It should be taken into account that the semantic linking is lost when a BioPax pathway is converted to PNML Petri-net. For example, genes or proteins have different identifiers in different databases. BioPax gives a way to link multiple identifiers to a gene or protein, but PNML does not support this feature.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Genes or Proteins</head><p>Each gene or protein is modelled as a place in the Petri net. Because the creation of the Pathways in WikiPathways has been done manually, often they are not consistent and may, for example, contain multiple instances of one gene or protein. The mapping does not take into consideration the fact that duplicate genes or proteins may represent the same entity and are modelled twice simply for readability, or rather that they are modelled twice because they represent a different entity of the same gene/protein (for example in a different location, or in a different state). However we address this issue with the validation rules introduced in Section 4.</p><p>The first stage in mapping is shown in Algorithm 1, which transforms BioPax proteins/genes to PNML. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Interactions</head><p>Interactions are also mapped to PNML. Each &lt;bp:BiochemicalInteraction&gt; is mapped to a transition. Then for each &lt;bp:Left&gt; an arc is added pointing into the transition and out from the corresponding place; for each &lt;bp:Right&gt; an arc is created pointing out of the transition and into the corresponding place. Algorithm 2 shows the straightforward way to do this.</p><p>Once both algorithms 1 and 2 are executed a Petri net is created. Formally, the Petri net can be described as P N = P, T, A, ∅ where P are the places, T the transitions, A the arcs and markings m = ∅ since there are no tokens in the system yet. In terms of modelling the biological system, the places represent biological entities, like genes, proteins or complexes, the transitions represent biochemical reactions and interactions, and the arcs represent the associations between these two. Tokens represent the availability of the resources of the corresponding place in the Petri net.</p><p>Algorithm 2 Gene/protein interaction BioPax to PNML If we then execute both Algorithm 1 and Algorithm 2 on Fig. <ref type="figure" target="#fig_0">1</ref>, a petri net is generated. Part of the output is shown in Fig. <ref type="figure" target="#fig_2">2</ref>.2</p><formula xml:id="formula_0">T = ∅ A = ∅</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">BioPax Validation</head><p>The mapping described in Section 3 is based on several assumptions about the contents of the input BioPax file. The basic assumptions are that genes, proteins and complexes (bound combinations of proteins, possibly including a gene) are entities, and that these entities can change state or identity only through biochemical interactions.</p><p>However, because of the manual nature of pathway construction, these assumptions may not hold for a given pathway instance in the database. To make sure the data is presented as it should be, we have developed a set of validation rules and a validator available online.</p><p>We have developed two types of validation rules; semantic and syntactic. The syntactic validation consists of basic RDF-validation. This is necessarily because from our preliminary survey, a large fraction of pathways are not modelled correctly for translation.</p><p>More interesting is the semantic validation. These rules ensure that the information contained in the model is consistent and complete enough to create an executable Petri net. Table <ref type="table" target="#tab_2">1</ref> shows these validation rules.</p><p>These rules ensure that the provided BioPax file contains everything needed. We have categorized the validation rules by severity:</p><p>-Category error rules are minimal requirements for mapping.</p><p>-Category warning rules that mean the mapping can proceed but may lead to an unconnected or incomplete Petri net.</p><p>This framework is set up in a modular fashion, so that extension is easy. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Implementation</head><p>We have implemented the methods described above as a webservice. The service consists of 4 components: a validation rule database, a validator, a BioPax to PNML converter and a pathway retriever, as is shown schematically in Fig. <ref type="figure" target="#fig_4">4</ref>. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1">Pathway retriever</head><p>The webservice provides an interface to query different datasources. At the time of writing only an interface to WikiPathways is provided, using the available webservices <ref type="bibr" target="#b16">[17]</ref>. However, support for other generic BioPax could be a future extension.</p><p>The retriever queries WikiPathways and downloads the pathway in the Bio-Pax format, so validation and conversion can be done.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2">Validation rule database</head><p>The validation rule database is a set of SPARQL queries. Each query returns a set of RDF triples that violate the rule (this set may be empty). This way feedback can be given about where the rule violation takes place in the BioPax File.</p><p>The way the database is set up allows easy addition of rules. This modularity makes it possible to improve on the current validation rules, but also allows validation rule sets for different types of pathways (for example signalling pathways vs. gene regulatory networks). Fig. <ref type="figure" target="#fig_5">5</ref> shows as an example the implementation of rule 1 of Table <ref type="table" target="#tab_2">1</ref>.  This query returns every bp:BiochemicalReaction that does not have a bp:left child element associated to it.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.3">BioPax Validator</head><p>The biopax validator is software that can analyze BioPax files according to the validation rules provided by the rule database. It is essentially a graphical user interface around the SPARQL queries. It annotates the place where errors or warnings have occurred and provides an easy to use interface to solve them.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.4">BioPax to PNML Converter</head><p>Once a BioPax file has been validated, the BioPax to PNML converter can be used to generate an executable Petri net. This converter works according to the mapping described in Section 3. This is implemented as an online tool, named BioPax2PNML, and can be accessed on www.few.vu.nl/∼twn370/BioPax2PNML/. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.5">Executing the PNML file</head><p>Although the proof of concept of the current work stops with the generation of a valid Petri net model in the form of a PNML file, it is nevertheless instructive to consider what subsequent steps should be. Execution of a Petri net can be performed under different execution semantics, however the most relevant for biological systems is commonly thought to be the so-called 'bounded asynchronous' execution <ref type="bibr" target="#b17">[18,</ref><ref type="bibr" target="#b2">3,</ref><ref type="bibr" target="#b14">15]</ref>. Under this semantics, as many transitions as possible are executed simultaneously in each execution step. This represents the inherent concurrency of biological systems, where molecules typically act independently, certainly if they reside in different locations. This is also known as the 'token game', because execution of transitions has the effect of shifting tokens around the Petri net. Fig. <ref type="figure">7</ref> shows an example network and the change in state due to execution of enabled transitions.</p><p>Execution leads to a trajectory of markings, that represent the progression of states of the system in response to the intial marking, which corresponds to Fig. <ref type="figure">7</ref>. Example of execution of a slightly non-trivial example network, taken from <ref type="bibr" target="#b14">[15]</ref>. Enabled transitions (with input requirements satisfied; marked in red) will execute each step, execution of enabled transitions in the left panel will lead to the state shown in the right panel.</p><p>a particular state or condition of the biological system. Typically, token levels are collected from a few places of interest and compared to experimental data of the corresponding biological molecule, or used to predict the behaviour of that particular molecule under the conditions modeled. Examples of these for signalling pathways can be found in <ref type="bibr" target="#b14">[15,</ref><ref type="bibr" target="#b5">6]</ref>, and for gene regulatory networks in <ref type="bibr" target="#b6">[7]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Conclusion</head><p>Automatic Petri net creation of biological pathways is still a tedious process. The manual labor involved makes it so that even a modestly sized model can take several months to develop. In this paper we have provided a method to bootstrap this process. By using a mapping between the commonly used BioPax format and the PNML format, we have developed a way to automate the construction of Petri net models. Because biological information online may be inconsistent or incomplete, we have developed a set of validation rules to make sure that the data is suitable for automatic conversion.</p><p>To facilitate this, and as a proof of concept, an online tool BioPax2PNML that executes this and provide an easy interface for Petri net modelers to bootstrap the process of model creation.</p><p>The approach outlined here is an initial start to making fully developed executable models. In particular, deriving the weights on edges of the Petri nets is a challenging task. In terms of future work, we believe that by leveraging the links to other databases (e.g. Uniprot) we may be able to find additional information to infer such edge weights. Moreover, we may be able to connect additional parts of the resulting Petri-nets based on background knowledge about interactions contained in other databases or even use knowledge of chemistry provided by other data sources to create more precise models. A key foundation for work going forward is that Linked Data and Semantic Web standards facilitate the merging and acquisition of this information.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. C. elegans Programmed Cell Death Pathway from the WikiPathways Database ID:WP367. The left panel shows the complete pathway, the right panel shows the subset of 5-genes used.</figDesc><graphic coords="3,147.34,310.31,178.90,190.18" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head></head><label></label><figDesc>Gene interaction of the C. elegans Programmed Cell Death Pathway from the Wiki-Pathways Database ID:WP367 &lt; b p : B i o c h e m i c a l R e a c t i o n rdf:about = " id40 " &gt; &lt; bp:right rdf:resource = " eef1e " / &gt; &lt; bp:left rdf:resource = " c0b3e " / &gt; &lt;/ b p : B i o c h e m i c a l R e a c t i o n &gt;</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Fig. 2 .</head><label>2</label><figDesc>Fig. 2. An example Petri net of a small part of the C. elegans Programmed Cell Death Pathway (WikiPathways:WP367)</figDesc><graphic coords="5,246.37,146.69,119.50,139.11" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Fig. 3 .</head><label>3</label><figDesc>Fig. 3. PNML representation of the C. elegans Programmed Cell Death pathway (Wiki-Pathways:WP367) Petri net as shown in Fig. 2.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Fig. 4 .</head><label>4</label><figDesc>Fig. 4. BioPax to PNML webservice architecture</figDesc><graphic coords="8,134.77,545.44,346.47,81.71" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>Fig. 5 .</head><label>5</label><figDesc>Fig. 5. SPARQL implementation of rule 1 of Table1</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_6"><head></head><label></label><figDesc>PREFIX xsd: &lt; http: // www . w3 . org /2001/ XMLSchema # &gt; PREFIX owl: &lt; http: // www . w3 . org /2002/07/ owl # &gt; PREFIX rdf: &lt; http: // www . w3 . org /1999/02/22 -rdf -syntax -ns # &gt; PREFIX bp: &lt; http: // www . biopax . org / release / biopax -level3 . owl # &gt; SELECT ? reaction WHERE { ? reaction rdf:type b p : B i o c h e m i c a l R e a c t i o n . OPTIONAL { ? reaction bp:left ? left . } FILTER (! BOUND (? left )) }</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_7"><head>Fig. 6 .</head><label>6</label><figDesc>Fig. 6. Screen shot of the user interface of the BioPax2PNML tool at www.few.vu.nl/∼twn370/BioPax2PNML/.</figDesc><graphic coords="10,134.77,243.98,346.64,184.23" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head></head><label></label><figDesc>Algorithm 1 Genes/Proteins BioPax to PNML</figDesc><table><row><cell>P = ∅</cell></row><row><cell>for all &lt;bp:Protein&gt; p in BioPax do</cell></row><row><cell>if p / ∈ P and p is other entity then</cell></row><row><cell>add p to P</cell></row><row><cell>end if</cell></row><row><cell>end for</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 1 .</head><label>1</label><figDesc>Semantic Validation Rules Warning Each BiochemicalReaction Right child is the actant of the interaction. 6 Warning Each unique entity of a protein/gene is modelled as a different Protein. 7 Warning Each Protein should have a corresponding RelationshipXref. 8 Warning Whenever a BiochemicalReaction has multiple Left or Right tags, it means that it has effect on multiple genes/proteins. 8 Warning Protein complexes are modelled as a Complex tag.</figDesc><table><row><cell cols="2">Id Severity Rule</cell></row><row><cell>1 Error</cell><cell>Each BioChemicalReaction should have a Left child element.</cell></row><row><cell>2 Error</cell><cell>Each BioChemicalReaction should have a Right child element.</cell></row><row><cell>3 Error</cell><cell>Each Pathway should have one or more PathwayComponents of</cell></row><row><cell></cell><cell>type BiochemicalReactions.</cell></row><row><cell cols="2">4 Warning Each BiochemicalReaction Left child is the actor of the interac-</cell></row><row><cell></cell><cell>tion.</cell></row><row><cell>5</cell><cell></cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">See http://WikiPathways.org/index.php/WikiPathways:Statistics for statistics on WikiPathways</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Getting Started in Biological Pathway Construction and Analysis</title>
		<author>
			<persName><forename type="first">Jeremy</forename><surname>Ganesh A Viswanathan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sonali</forename><surname>Seto</surname></persName>
		</author>
		<author>
			<persName><forename type="first">German</forename><surname>Patil</surname></persName>
		</author>
		<author>
			<persName><surname>Nudelman</surname></persName>
		</author>
		<author>
			<persName><surname>Stuart C Sealfon</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">PLoS Comput Biol</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page">e16</biblScope>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Using Petri Net tools to study properties and dynamics of biological systems</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">W</forename><surname>Pinney</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">R</forename><surname>Westhead</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">A</forename><surname>Mcconkey</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J Am Med Inform Assoc</title>
		<imprint>
			<biblScope unit="volume">12</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="181" to="199" />
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Executing multicellular differentiation: Quantitative predictive modelling of C. elegans vulval development</title>
		<author>
			<persName><forename type="first">Nicola</forename><surname>Bonzanni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Elzbieta</forename><surname>Krepska</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">Anton</forename><surname>Feenstra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Wan</forename><surname>Fokkink</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Thilo</forename><surname>Kielmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Henri</forename><surname>Bal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jaap</forename><surname>Heringa</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Bioinformatics</title>
		<imprint>
			<biblScope unit="volume">25</biblScope>
			<biblScope unit="page" from="2049" to="2056" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Executable cell biology</title>
		<author>
			<persName><forename type="first">Jasmin</forename><surname>Fisher</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Tom</forename><surname>Henzinger</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Nature Biotechnology</title>
		<imprint>
			<biblScope unit="volume">25</biblScope>
			<biblScope unit="issue">11</biblScope>
			<biblScope unit="page" from="1239" to="1249" />
			<date type="published" when="2007-11">November 2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Cellular abstractions: Cells as computation</title>
		<author>
			<persName><forename type="first">Aviv</forename><surname>Regev</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ehud</forename><surname>Shapiro</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Nature</title>
		<imprint>
			<biblScope unit="volume">419</biblScope>
			<biblScope unit="page">343</biblScope>
			<date type="published" when="2002-09">September 2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">The role of proteasomemediated proteolysis in modulating activity of potentially harmful transcription factor activity in Saccharomyces cerevisiae</title>
		<author>
			<persName><forename type="first">N</forename><surname>Bonzanni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">G</forename><surname>Oliver</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Fisher</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Bioinformatics</title>
		<imprint>
			<biblScope unit="volume">27</biblScope>
			<biblScope unit="page" from="282" to="287" />
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<author>
			<persName><forename type="first">Nicola</forename><surname>Bonzanni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Abhishek</forename><surname>Garg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">Anton</forename><surname>Feenstra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sarah</forename><surname>Kinston</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Diego</forename><surname>Miranda-Saavedra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Judith</forename><surname>Schutte</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jaap</forename><surname>Heringa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ioannis</forename><surname>Xenarios</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Berthold</forename><surname>Göttgens</surname></persName>
		</author>
		<title level="m">Hard-wired heterogeneity in blood stem cells revealed using a dynamic regulatory network model Bioinformatics in press</title>
				<imprint>
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Toward interoperable bioscience data</title>
		<author>
			<persName><forename type="first">Susanna-Assunta</forename><surname>Sansone</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Philippe</forename><surname>Rocca-Serra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dawn</forename><surname>Field</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Eamonn</forename><surname>Maguire</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Nat Genet</title>
		<imprint>
			<biblScope unit="volume">44</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="121" to="126" />
			<date type="published" when="2012-01">January 2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">WikiPathways: building research communities on biological pathways</title>
		<author>
			<persName><forename type="first">Thomas</forename><surname>Kelder</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Martijn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kristina</forename><surname>Van Iersel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Martina</forename><surname>Hanspers</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bruce</forename><forename type="middle">R</forename><surname>Kutmon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Chris</forename><forename type="middle">T</forename><surname>Conklin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Alexander</forename><forename type="middle">R</forename><surname>Evelo</surname></persName>
		</author>
		<author>
			<persName><surname>Pico</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Nucleic Acids Res</title>
		<imprint>
			<biblScope unit="volume">40</biblScope>
			<biblScope unit="page" from="D1301" to="D1307" />
			<date type="published" when="2012-01">January 2012</date>
		</imprint>
	</monogr>
	<note>Database issue</note>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">KEGG: Kyoto Encyclopedia of Genes and Genomes</title>
		<author>
			<persName><forename type="first">M</forename><surname>Kanehisa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Goto</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Nucleic Acids Res</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="page" from="27" to="30" />
			<date type="published" when="2000">2000</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">From genomics to chemical genomics: new developments in KEGG</title>
		<author>
			<persName><forename type="first">M</forename><surname>Kanehisa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Goto</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Hattori</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">F</forename><surname>Aoki-Kinoshita</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Itoh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Kawashima</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Katayama</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Araki</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Hirakawa</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Nucleic Acids Res</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="page" from="D354" to="D357" />
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
	<note>Database issue</note>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<title level="m" type="main">The Reactome BioMart In Database</title>
		<author>
			<persName><forename type="first">Robin</forename><forename type="middle">A</forename><surname>Haw</surname></persName>
		</author>
		<author>
			<persName><forename type="first">David</forename><surname>Croft</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Christina</forename><forename type="middle">K</forename><surname>Yung</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Nelson</forename><surname>Ndegwa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D'</forename><surname>Peter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Henning</forename><surname>Eustachio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Lincoln</forename><forename type="middle">D</forename><surname>Hermjakob</surname></persName>
		</author>
		<author>
			<persName><surname>Stein</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2011-10">October 2011</date>
			<pubPlace>Oxford</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">BioPAX -A community standard for pathway data sharing</title>
		<author>
			<persName><forename type="first">Emek</forename><surname>Demir</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Michael</forename><forename type="middle">P</forename><surname>Cary</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Suzanne</forename><surname>Paley</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ken</forename><surname>Fukuda</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Nat Biotechnol</title>
		<imprint>
			<biblScope unit="volume">28</biblScope>
			<biblScope unit="page" from="935" to="942" />
			<date type="published" when="2010-09">September 2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Design issues for qualitative modelling of biological cells with Petri nets</title>
		<author>
			<persName><forename type="first">Elzbieta</forename><surname>Krepska</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Nicola</forename><surname>Bonzanni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">Anton</forename><surname>Feenstra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Wan</forename><surname>Fokkink</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Thilo</forename><surname>Kielmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Henri</forename><surname>Bal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jaap</forename><surname>Heringa</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. FMSB&apos;</title>
				<meeting>FMSB&apos;</meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2008-06">June 2008</date>
			<biblScope unit="volume">08</biblScope>
			<biblScope unit="page" from="48" to="62" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">What can Formal Methods bring to Systems Biology?</title>
		<author>
			<persName><forename type="first">Nicola</forename><surname>Bonzanni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Anton Feenstra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Wan</forename><surname>Fokkink</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Elzbieta</forename><surname>Krepska</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. FM&apos;09, 5850 LNCS</title>
				<meeting>FM&apos;09, 5850 LNCS</meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="16" to="22" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<title level="m" type="main">Using Cell Illustrator and Pathway Databases</title>
		<author>
			<persName><forename type="first">Masao</forename><surname>Nagasaki</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ayumu</forename><surname>Saito</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Atsushi</forename><surname>Doi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Hiroshi</forename><surname>Matsuno</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Satoru</forename><surname>Miyano</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2009-04">April 2009</date>
			<publisher>Springer</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Mining Biological Pathways Using WikiPathways Web Services</title>
		<author>
			<persName><forename type="first">T</forename><surname>Kelder</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">R</forename><surname>Pico</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Hanspers</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">P</forename><surname>Van Iersel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Evelo</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">PLoS ONE</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="issue">7</biblScope>
			<biblScope unit="page">e6447</biblScope>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Bounded asynchrony: A biologically-inspired notion of concurrency</title>
		<author>
			<persName><forename type="first">Jasmin</forename><surname>Fisher</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Tom</forename><surname>Henzinger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Maria</forename><surname>Mateescu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Nir</forename><surname>Piterman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. FMSB&apos;08</title>
				<meeting>FMSB&apos;08<address><addrLine>Cambridge</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2008-06">5054. June 2008</date>
			<biblScope unit="page" from="17" to="32" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
