<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Parameter Estimation of Biological Pathways Using Data Assimilation and Model Checking</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Chen Li</string-name>
          <email>chenli@hgc.jp</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Keisuke Kuroyanagi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Masao Nagasaki</string-name>
          <email>masao@hgc.jp</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Satoru Miyano</string-name>
          <email>miyano@hgc.jp</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Graduate School of Information Science and Technology, University of Tokyo</institution>
          ,
          <addr-line>4-6-1 Shirokanedai, Minato-ku, Tokyo 108-8639</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Human Genome Center, Institute of Medical Science</institution>
        </aff>
      </contrib-group>
      <volume>724</volume>
      <fpage>53</fpage>
      <lpage>70</lpage>
      <abstract>
        <p>This paper presents a novel method to estimate kinetic parameter of biological pathways by using observed time-series data and other knowledge that cannot be formulated in the form of time-series data. Our method utilizes data assimilation (DA) framework and model checking (MC) technique, with a quantitative modeling and simulation architecture named hybrid functional Petri net with extension (HFPNe). Proposed method is applied to an HFPNe model underlying circadian rhythm in mouse. We first translate 23 rules of biological knowledge with temporal logic for the model checking, which are not described in the time-series data. Next, we employ particle filter often applied to DA for our estimation procedure. Each particle checks whether its simulation result satisfies the rules or not, and the result of the checking is used for its resampling step. Our simulation results show that proposed method is faster and more accurate than previous method.</p>
      </abstract>
      <kwd-group>
        <kwd>Hybrid functional Petri net with extension</kwd>
        <kwd>parameter estimation</kwd>
        <kwd>data assimilation</kwd>
        <kwd>particle filter</kwd>
        <kwd>model checking</kwd>
        <kwd>temporal logic</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Modeling and simulating large-scale biological pathways have played an
important role in systems biology. Owing to their importance, many formal description
methods of biological pathway models have been made so far [
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1–3</xref>
        ]. Petri net and
its related concepts are one of the succeeded ways of describing biological models
[
        <xref ref-type="bibr" rid="ref4 ref5 ref6">4–6</xref>
        ], which have been used for modeling a wide variety of biological pathways
and succeeded in reproducing consistent time-series profiles of biological
elements such as the concentrations of mRNAs and proteins by means of computer
simulations.
      </p>
      <p>Simulation studies on biological pathways promises a deep understanding of
complex cellular mechanisms by investigating the dynamic feature.
Simulationbased models are commonly governed by a series of parameters, e.g. initial values,
reaction speeds and threshold values of activities. Before the model can be
simulated, all parameters must be assigned in advance. However, most parameters are
often unknown or not obvious. In general, such parameters are carefully tuned
by experts to fit the simulated elements with observed in vivo/vitro experiment
results. Due to the nonlinearity of the model, parameter estimation is difficult
and requires a lot of trial and errors. Small differences of the parameters make
large gap between reality and simulation results. Therefore, conventional hand
tuning method severely limits the size and complexity of simulation models built
as more output data (e.g., microarray gene expression data) are being measured.</p>
      <p>The aim of this paper is to develop a novel method to automatically and
efficiently estimate kinetic parameters of a given model or a model starting
from scratch by combining data assimilation (DA) and model checking (MC)
approaches, coupled with observed experiment data. Observed experiment data
includes well-defined time-series data and other knowledge that cannot be
formulated in the form of time-series data.</p>
      <p>
        DA was originally established in the field of geophysical simulation science.
Nagasaki et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] have proposed a so-called genomic data assimilation approach.
Their DA framework enables users to handle both the model construction and
parameter tuning in the context of statistical inferences, and establishes a link
between the HFPNe simulation model and observed data, e.g., microarray gene
expression data [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] or time series proteomic data [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Current DA approach has
some issues because it depends on providing successive time points (at least
1020 time points) of time-series data by biological experiments. That is, for a small
time-series data set including for example two or three time points, DA will cost
massive computational resource, in some cases, it will be completely impossible
to estimate parameters. The response to this difficulty of dealing with sparse
and/or not well-defined time-series data is the use of model checking.
      </p>
      <p>
        MC is a method for automatic verification of system requirements [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], which
firstly used in hardware field because of its determined behavior and limited value
space. Today, this technique has been applied to more complex biological models.
There are several studies that use model checking for parameter estimation, e.g.
Donaldson and Gilbert developed a computational system named MC2(GA) [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]
that couples model checking with genetic algorithm for parameter estimation. Li
et al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] proposed online model checking approach based parameter estimation
framework applied to the HFPNe class. In order to check the model, one need
to write biological rules of the knowledge with temporal logic. For example, “A
biological phenomena F oo keeps decreasing until Bar rises.” can be written as
“d([Foo])≤0 U d([Bar])&gt;0” in a kind of temporal logic. Various knowledge
can be described in this way. Nevertheless, in order to improve the estimation
efficiency and accuracy, it is expected to find a general methodology to determine
parameters by combining DA and MC dealing with both time-series experimental
data and biological queries.
      </p>
      <p>The paper is organized as follows. In Methods, we briefly explain how to (i)
construct biological models with HFPNe, (ii) estimate unknown parameters in a
nonlinear state space model, (iii) translate biological rules with a temporal logic
for querying system properties, and (iv) combine MC with DA for parameter
estimation. In Results, we compare our novel method with previous method by
applying them to mouse circadian rhythm model represented by HFPNe. We
show that our method is potentially faster and more accurate than previous one
that excludes MC technique.
2
2.1</p>
    </sec>
    <sec id="sec-2">
      <title>Methods</title>
      <sec id="sec-2-1">
        <title>Hybrid Functional Petri Net with Extension (HFPNe)</title>
        <p>
          HFPNe is developed as a biosimulation tool for pathway modeling and simulation
extended from original Petri net [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. HFPNe can deal with three types of data:
discrete, continuous and generic, whereas the original Petri net deal with only
discrete data. HFPNe consists of three types of elements: entity, process and
connector (see Figure 1 (a)). Figure 1 (b) shows connection rules in HFPNe. For
more definitions and usages of HFPNe, see Nagasaki et al. [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ].
        </p>
        <p>
          Figure 2 shows circadian rhythm model of mouse represented by HFPNe [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ].
This model is composed of of 12 entities, 28 processes and 45 connectors. Due
to the space restriction, Appendix gives the details of these elements. Initial
value of entities mi(0) (i=1, · · ·, 12), reaction speed parameters ki (i=1, 2 and ki
is a common parameter to control speeds of similar biological processes: k1 for
protein binding; k2 for translation), and threshold parameters si (i=1, · · ·, 3) are
unknown parameters.
2.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Data Assimilation for Parameter Estimation</title>
        <p>We here explain how to estimate parameters in a simulation model from
timeseries data with the use of DA.</p>
        <p>
          Data Assimilation with Nonlinear State Space Model DA is an approach
to improve the accuracy of the models by combining with data, which can deal
with models formulated by nonlinear state space model (SSM) given by two
equations [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]:
mt=f (mt−1, wt, θsys) ,
(1)
yt=Hmt + t . (2)
Equation (1) is called “system model” and Equation (2) is called “observed
model”. In the system model, mt≡(m1t, · · ·, mpt)T is a state vector consisting
of p state variables mit (i=1, · · ·, p) at discrete time point t. wt denoting system
noise is an l-dimensional white noise at t with a density q(w). f is a vector-valued
function, f : Rp+l → Rp, and θsys is a vector of model parameters. In the
observed model, yt∈Rd is an observation vector at t, H∈Rd × Rp is an observation
matrix; Hij takes value one if observed value of jth entity corresponds to the
ith element of yt, otherwise zero. t called observational noise is a d-dimensional
white noise at t with a density r( ). The likelihood of the parameter is given by
        </p>
        <p>L(θsys)=p(y1, · · ·, yT |θsys)= tT=1 p(yt|Y t−1, θsys)= tT=1 p(yt|mt, θsys)p(mt|
Y t−1, θsys)dmt, where Y t={y1, · · ·, yt}. The maximum likelihood estimator
(MLE) for θsys is given by
argmax log L(θsys).</p>
        <p>θsys
We can estimate parameters with MLE, however, this maximum likelihood method
has an important issue that the value of log L(θsys) computed by Monte Carlo
filter includes an approximation error. For accurate estimation, huge
computation resources are thus required.</p>
      </sec>
      <sec id="sec-2-3">
        <title>Self-Organizing State Space Model To deal with the difficulty of maximum</title>
        <p>
          likelihood method, we use self-organizing state space model (SOSSM) [
          <xref ref-type="bibr" rid="ref16 ref17 ref18">16–18</xref>
          ].
We can estimate parameters based on Bayesian inference with SOSSM by
weaving parameters in the state vector as
The SSM for this vector is given by zt=F ∗(zt−1, wt) and yt=H∗zt+ t, where
zt=
mt
θsys
        </p>
        <p>.</p>
        <p>F ∗(zt−1, wt) =
f (mt−1, wt, θsys)</p>
        <p>θsys
H∗zt = Hmt.
We can obtain marginal posterior densities without obtaining MLE of θ as
p(mT | Y T ) =</p>
        <p>p(zT | Y T )dθsys
p(θsys | Y T ) =</p>
        <p>p(zT | Y T )dmT .</p>
        <p>In this way, we can avoid the issue of maximum likelihood method during the
estimation.</p>
        <p>
          Particle Filter As mentioned above, we have to calculate distribution p(zT |Y T )
for estimating parameters. However, it generally becomes a non-gaussian
distribution in SSM. Therefore, it is needed to represent this distribution with some
method. We here use a sequential Monte Carlo method called particle filter (PF)
[
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]. Figure 3 shows the overview of particle filter’s algorithm. In particle filter,
predictive distribution p(zt|Y t−1) and filter distribution p(zt|Y t) are
approximated by m in which each realization is called particle as follows:
pt|t−1 ≡ {pt(|1t)−1, · · · , pt(|mt−) 1} ∼ p(zt | Y t−1)
        </p>
        <p>pt|t ≡ {pt(|1t), . . . , pt(|mt )} ∼ p(zt | Y t),
where pt(|jt)−1 (j=1, · · ·, m) and pt(jt) (j=1, · · ·, m) are (p+|θsys|)-dimensional
num|
bers. This algorithm is processed with the following steps.
Step 1 Generate the (p+|θsys|)-dimensional random number p(0j0) for j=1, · · ·, m.
|
Step 2 Repeat the following three steps for the observed time points t=1, · · ·, T .</p>
        <p>Step 2.1 (Prediction step) Generate m system noises wt(j) independently
and identically from q(wt), and compute particles by inputting filtered
states into simulation model, pt(|jt)−1=F ∗(pt(−j)1|t−1, wt(j)) for j=1, · · ·, m.</p>
      </sec>
      <sec id="sec-2-4">
        <title>Step 2.2 (Weight calculation step) Compute the weights of importance</title>
        <p>for the particles according to</p>
        <p>αt(j)=p(yt|pt(|jt)−1)=r(yt − H∗pt(|jt)−1)
for j=1, · · ·, m. These weights are then normalized as α¯t(j)=αt(j)
Step 2.3 (Resampling step) Generate filtered state pt(jt) for j=1, · · ·, m
|
by resampling {pt(|1t)−1, · · · , pt(|mt−) 1} with the probabilities {α¯t(1), · · · , α¯t(m)}.
(3)
m
j=1αt(j).
2.3</p>
      </sec>
      <sec id="sec-2-5">
        <title>Model Checking</title>
        <p>
          We explain how to represent requirements of biological pathway models for model
checking. Model checking is a method to verify whether models satisfy the
requirements or not. Temporal logic formulae are often used to describe the system
requirements. In this study, we selected PLTL (Probabilistic Linear-time
Temporal Logic) for querying dynamic models of cellular networks [
          <xref ref-type="bibr" rid="ref20 ref21">20, 21</xref>
          ] which
extends original LTL to a stochastic setting with a probability operator and a
filter criterion defining the starting state where the property is satisfied.
[PLTL Syntax] Table 1 shows definition of PLTL syntax, which is used to ask
for the probability of user’s query via a PLTL formulae ψ. In the LTL expression
φ{AP }, φ will be checked from the state that AP is satisfied rather than from
the default initial state, where AP is called atomic proposition and takes boolean
domain. PLTL allows (i) LTL expression to contain temporal operators, i.e., X,
F, G, U, R. Five temporal operators are used to describe the sequencing of the
states along the execution; and (ii) the usage of ψ without probabilistic operators
(i.e. simply in the form of LTL), which is useful when the model is deterministic.
[PLTL Semantics] The semantics of PLTL is defined over the finite sets of
finite paths through system’s state space, obtained by repeated simulation runs
of HFPNe models. The PLTL formula is built upon two components:
probabilistic operator and property LTL. For each simulation run, the LTL expression is
evaluated to a boolean truth value, and the probability of the LTL statement
holding true is calculated based on the whole set of simulation results.
        </p>
        <p>For the probability operator components, there are two distinct operators:
(i) P x(LT L) is any inequality comparison of the probability of the property
LTL holding true, for example P≥0.5(LT L); and (ii) P=?(LT L) returns the value
of the probability of the property holding true. The semantics of the temporal
logic operators are described in Table 2. Concentrations of biochemical species
in the model are denoted by [variableName]. A special variable, [time], stands
for simulation time.</p>
        <p>Due to the ability of PLTL, it is possible to define functions of two
different natures: functions that return a real number and functions that return a
boolean value. An example of the real number function is d([variableN ame])
which returns the subtracted value of [variableName] between time i and i−1.
Note that, d([variableN ame]) equals zero at time point zero. One example of
a boolean function is similarAbsolute(value a, value b, value ), which returns
true if |a−b|&lt; or else it returns false. Table 3 shows the rules written in PLTL
for circadian rhythm model of mouse.
As stated in Introduction, we have applied either DA or MC to pathway model
in the previous researches, but used time scales are different. We here employ
common time scale called Petri net time for combining DA with MC, which is the
virtual time unit of the HFPNe model denoted by [pt]. We define: (i) simInt∈R
as simulation interval; (ii) mcInt∈R which is a multiple of simInt as a model
checking interval; and (iii) M apOttoP t : N→R as a mapping from observed time
point to Petri net time for combining. From the MC’s viewpoint, Xφ means that
φ must be true at the state after mcInt Petri net time. Meanwhile, a simulation
from time M apOttoP t(t−1) to M apOttoP t(t) is a run in f from DA’s viewpoint.</p>
        <p>
          Hu¨rseler and Ku¨nsch mentioned that it is difficult to generate a good
initial distribution of parameters with SOSSM [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ]. This is because parameters
in resampled particles are the subset of parameters in the initial particles and
a model with randomly generated parameters rarely satisfy all rules. To
generate good initial distribution of parameters, two considerations are designed:
        </p>
        <p>Rule
LTL translation</p>
        <sec id="sec-2-5-1">
          <title>Rule 1 GC(o[npceernmtrRaNtAi]on&lt; o3f.p8er&amp;&amp;m[RpNerAmRisNAb]et&gt;we0e.n2)0.2 and 3.8.</title>
          <p>Rule 2 Concentration of Rev-Erv mRNA is between 0.2 and 3.4.</p>
          <p>G([Rev-Erv mRNA] &lt; 3.4 &amp;&amp; [Rev-Erv mRNA] &gt; 0.2)
Rule 3 Concentration of Bmal mRNA is between 0.2 and 4.3.</p>
          <p>G([Bmal mRNA] &lt; 4.3 &amp;&amp; [Bmal mRNA] &gt; 0.2)
Rule 4 Concentration of Bmal mRNA is between 2.4 and 2.6 after time becomes 20.</p>
          <p>G(similarAbsolute([Clock mRNA],2.5,0,1)){[time] &gt; 20}
Rule 5 Concentration of Cry mRNA is between 3.8 and 0.2.</p>
          <p>G([Cry mRNA] &lt; 3.8 &amp;&amp; [Cry mRNA] &gt; 0.2)
Rule 6 GC(o[nPcEeRn]tr&lt;at2i.o6n &amp;o&amp;f P[EPERR]is&gt;b0e.tw2)een 2.6 and 0.2.</p>
          <p>Concentration of CRY is between 2.6 and 0.2.</p>
          <p>Rule 7 G([CRY] &lt; 2.2 &amp;&amp; [CRY] &gt; 0.2)</p>
          <p>Concentration of PER/CRY is between 2.7 and 0.2.</p>
          <p>Rule 8 G([PER/CRY] &lt; 2.7 &amp;&amp; [PER/CRY] &gt; 0.2)
Rule 9 Concentration of REV ERB is between 1.5 and 0.2.</p>
          <p>G([REV ERB] &lt; 1.5 &amp;&amp; [REV ERB] &gt; 0.2)
Rule 10 GL(odc(a[lpmeraxmRimNAu]m) ≥con0ce&amp;n&amp;trXa(tdio(n[poefr pmeRrNAm])R&lt;N0)A ⇒is g[rpeeartemrRNtAh]an&gt; 22.0..0)
Rule 11 GL(odc(a[lpmerinmiRmNuA]m)≤co0n&amp;c&amp;enXt(radt(i[opnerofmRpNeAr]m)&gt;R0)N⇒A [ispelerssmRtNhAa]n &lt;1.10..0)
Rule 12 GL(odc(a[lRmeva-xEirmvummRNAc]o)n≥ce0nt&amp;r&amp;atXio(nd(o[fReRve-vE-rEvrmvRNmAR])N&lt;0A) is⇒greater than 1.5.
[Rev-Erv mRNA] &gt; 1.5)
Rule 13 GL(odc(a[lRmevi-nEimrvummRNcAo]n)c≤e0nt&amp;ra&amp;tiXo(nd(o[fRReve-vE-rEvrvmRmNAR])N&gt;A0)is less than 1.0.
⇒ [Rev-Erv mRNA] &lt; 1.0)
Rule 14 GL(odc(a[lBmmaalxmimRNuAm])c≥on0ce&amp;n&amp;trXa(tdio(n[BomfalBmmRaNlA]m)&lt;R0N)A⇒is[gBrmeaaltemrRNthA]an&gt;11.5..5)
Rule 15 GL(odc(a[lBmmailnimmRuNAm])c≤on0c&amp;e&amp;ntXra(tdi(o[nBmoaflBmmRNaAl]m)&gt;R0)NA⇒is[BlemsasltmhRaNnA]1.&lt;0.1.0)
Rule 16 GL(odc(a[lCmryaxmRimNAu]m)≥c0on&amp;c&amp;enXt(rda(t[ioCnryomfRCNAry])m&lt;0R)N⇒A i[sCrgyremaRtNeAr]th&gt;an2.20.)0.</p>
          <p>Rule 17 GL(odc(a[lCmryinmiRmNuA]m)≤co0n&amp;c&amp;enXt(radt(i[oCnryofmRCNrAy])m&gt;0R)N⇒A i[sClreyssmtRhNaA]n 1&lt;.01..0)
Rule 18 tWhahnencocnocnecnetnrtartaitoinonofopfeBrmmaRlNmAR.NA takes local minimum, concentration of Bmal mRNA is less</p>
          <p>G(d([Bmal mRNA])≤0 &amp;&amp; X(d([Bmal mRNA])&gt;0) ⇒ [Bmal mRNA] &lt; [per mRNA])
Rule 19 When concentration of Bmal mRNA takes local maximum, concentration of Bmal mRNA is grater
than concentration of per mRNA</p>
          <p>G(d([Bmal mRNA])≥0 &amp;&amp; X(d([Bmal mRNA])&lt;0) ⇒ [Bmal mRNA] &gt; [per mRNA])
Rule 20 cWonhceenntcroantcieonntroaftpioenr mofRpNerAm.RNA takes local minimum, concentration of per mRNA is less than</p>
          <p>G(d([per mRNA])≤0 &amp;&amp; X(d([per mRNA])&gt;0) ⇒ [per mRNA] &lt; [Bmal mRNA])
Rule 21 tWhahnencocnocnecnetnrtartaitoinonofoBfmpearl mmRRNNAA takes local maximum, concentration of per mRNA is grater</p>
          <p>G(d([per mRNA])≥0 &amp;&amp; X(d([per mRNA])&lt;0) ⇒ [per mRNA] &gt; [Bmal mRNA])
Rule 22 GC(o[npceernmtrRaNtAi]on&gt; o[fRpeevr-EmrvRmNRANA]is) greater than concentration of Rev-Erv mRNA.</p>
        </sec>
        <sec id="sec-2-5-2">
          <title>Rule 23 GC(o[nCcLeOnCtKr]at&gt;io[nPEoRf]C)LOCK is greater than concentration of PER.</title>
          <p>Firstly, we repeat particle filters many times by regenerating p0|0 from pT |T like
a crossover in genetic algorithm since the length of time courses for biological
use is generally shorter than that for other fields; Secondly, a threshold value T h
is used to discard particle whose unsatisfied number by the checking is greater
than T h. T h is changed for each run of particle filter. In this study, system noise
is not taken into account in order to accelerate the estimation. Nevertheless, a
generic process can be mapped to Java object supporting HFPNe model in a
nondeterministic settings.</p>
          <p>Algorithm 1 shows pseudocode of our parameter estimation method. In
step 9, P redictandM C returns predicated particle and the result whether it is
worthless or not. The detail of this function is displayed in Algorithm 2. In
steps 10–15, if the simulation results unsatisfy more than T h rules, the weight of
the particle will become zero, or else the weight is calculated by Equation (3)
via r. To calculate r, we assume that observed noise is a gaussian white noise
and its mean is zero. The variance of observed noise is thus needed to be
estimated. We generate multiple candidate values and use the value that has the
maximum likelihood as the variance. Steps 17–19 are designed to break the run
of particle filter if all the particles unsatisfied more than T h rules; Otherwise T h
is incremented or decremented in steps 23–27. p0|0 for the next run of particle
filter is generated from pT |T in step 28.</p>
          <p>Algorithm 1. Pseudecode of our parameter estimation method.</p>
          <p>In Algorithm 2, steps 2–3 convert time scale from observed time to Petri
net time. Step 4 separates particle p into m(start) and θ, where m(t) is a
vector consisting of the values of p entities at Petri net time t and θ is a vector
of parameters to be estimated. m(pt+simInt) is predicted by simulation in
step 6. Step 7 returns the number of unsatisfied rules via M odelChecking(). In
steps 9–11, if simulation result violates more than T h rules, simulation will stop
immediately.</p>
          <p>Algorithm 2 Prediction step of particle filter and model checking.
1: function P redictandMC(p, t, Rules, T h)
2: start ← MapOttoP t(t − 1)
3: end ← MapOttoP t(t) − simInt
4: (m(start), θ) ← p
5: for pt ← start to end step simInt do
6: m(pt + simInt) ← Simulation(m(pt), θ)
7: if (pt + simInt)%mcInt = 0 then
8: N ← ModelChecking(m, Rules)
9: if T h &lt; N then
10: return ([m(pt + simInt)T , θT ]T , false)
11: end if
12: end if
13: end for
14: return ([m(MapOttoP t(t))T , θT ]T , true)
3</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Results and Discussions</title>
      <p>3.1</p>
      <sec id="sec-3-1">
        <title>Estimation environment and evaluation criteria</title>
        <p>We estimate 17 parameters of circadian rhythm model of mouse (i.e., 12
initial values of entities, two reaction speeds and three threshold values). We
use synthesized data set coupled with simulation and observed noise t ( t ∼
N (0, 0.052I12)) as observed data. It contains 312 data of 26 time points (see
Figure 4) for each 12 biological entities – five mRNAs, five proteins and two
complex proteins –. One observed time point is mapped to five Petri net times.
Table 4 summarizes details of our estimation environment. Parameter search
range is set from zero to 15.</p>
        <p>To evaluate the results of estimation, following Scores are defined and
calculated before the step RegenerateP articles step given in Algorithm 1.</p>
        <p>Scoremean = tT=1 ep=1 |SimResult(P ar|YamTsm|ean,e,t)−yt,p|2
Scoremode = tT=1 ep=1 |SimResult(P ar| YamTs|mode,e,t)−yt,p|2
Scoremedian = tT=1 ep=1 |SimResult(P ara|YmsTm|edian,e,t)−yt,p|2
Scorebest = tT=1 ep=1 |SimResult(P a|rYamTs|best,e,t)−yt,p|2</p>
        <p>Score = min{Scoremean, Scoremode, Scoremedian, Scorebest, Scorecurrent},
where yt,p is an observed value of entity p at observed time point t. P aramsbest
denotes parameters of a particle which is made of particles. P aramsmean, P aramsmode
and P aramsmedian represent mean, mode and median value of all particles’
parameters respectively. SimResult(P arams, e, t) returns the value of entity e at
time t which is calculated by simulation with P aram. Scorecurrent is ∞ at first
time of the calculation, otherwise it is the Score of previous calculation.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Experimental Result</title>
        <p>Comparison between PFMC and PF We compare the performance
between our new method Particle Filter with Model Checking (PFMC) and
previous method Particle Filter (PF). The estimation experiments are carried out
on workstation of Intel Xeon E5450 (3.0GHz) with 32G bytes of memory. We
performed estimation 100 times for both methods. Figure 5 shows the result
with respect to the distribution of Score with elapsed time. Mean of Score is
also analyzed by Welch’s t-test (See Table 5). Nearly all Score of PFMC and
PF are good on and after 600 seconds. Both medians are also good on and after
600 seconds, but there are many bad cases before 1,000 seconds for PF. That
is, roughly speaking, if there are enough amounts of observed data, there is no
much difference by using either PMFC or PF for the estimation. However, in
almost all cases, we can finish the estimation within a short time incorporating
model checking.</p>
      </sec>
      <sec id="sec-3-3">
        <title>Estimation with small amount of observed data To investigate the per</title>
        <p>formance in the case of small amount of observed data, we use only first ten time
points of five biological entities’ data for the estimation: per mRN A, Cry mRN A,
Rev Erv mRN A, Clock mRN A and Bmal mRN A. More detailedly, we use all
default parameters for Score calculation and just overwrite p to five and T to
ten. The estimation results are exhibited in Figure 6 and Table 6.</p>
        <p>The results clearly show that on and after 1,200 seconds, in contrast to the
previous experiment, there is difference not only between bad cases, but also
between medians of PFMC and PF. This is because it is difficult to estimate
parameters which makes certain rhythms with only two cycles of observed data.
Therefore, median Scores of PF are not good before 3,000 seconds. Moreover,
convergence of PFMC is worse than previous experiment.</p>
        <p>Effects of the rules We also investigate the effect of rules by checking which
rule is unsatisfied which results in the cutting of a particle. We run estimation
100 times with default parameters and all of observed data. All the runs finished
after 20 minutes and the results are shown in Figure 7.</p>
        <p>Two kinds of effects used in model checking approach can be considered.
First, it cuts useless particles,which enables us to estimate with more particles
or more runs of particle filters. Second, it cuts bad results which facilitates the
estimation only with observed data. From the first effect’s viewpoint, the rule
that cut more particles is a good rule. This is due to the fact that rule is able
to cut many particles from early time because the number of unique particle
decreases with the time elapse in our method. Rule 1 to 3, 5 to 9, 22 and
23 are such rules. From the second effect’s viewpoints, good rules are different
depending on behaviors of target models. For the circadian rhythm model, it is
important to reproduce the oscillations within a certain range. Rule 10 to 21
are specified for verifying the behavior of oscillation. Generally, it is not easy
to prepare this kind of rules before trial. Nevertheless, unlike observed data, it
will be a great help in improving the efficiency and accuracy of conventional
parameter estimation process and eventually leading to better understanding of
biological pathways.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusions</title>
      <p>We propose a novel parameter estimation method for biological pathways. By
combining model checking with DA framework, our method enables us to use
various knowledge in addition to observed time series data. We extract 23
biological rules with temporal logic which cannot be formulated in the form of
time-series data. Proposed method and previous method are applied to mouse
circadian rhythm model of HFPNe by means of performance evaluation. Results
shows that (i) if estimations execute with enough amounts of observed data, our
method can practically give good parameters in a short time; and (ii) if
estimations execute with small amount of observed data, new method is much faster
than the method without model checking.</p>
    </sec>
    <sec id="sec-5">
      <title>Appendix:</title>
      <p>Corresponding
connector
c45
c40
c39</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Yan</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , Zhang,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <surname>Q.</surname>
          </string-name>
          :
          <article-title>A Formal Model for Analyzing Drug Combination Effects and its Application in TNF-alpha-induced NFkappaB Pathway</article-title>
          .
          <source>BMC Syst Biol</source>
          .,
          <volume>4</volume>
          (
          <issue>50</issue>
          ) (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Antoniotti</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Policriti</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ugel</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mishra</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Model Building and Model Checking for Biochemical Processes</article-title>
          .
          <source>Cell Biochem</source>
          . Biophys.,
          <volume>38</volume>
          (
          <issue>3</issue>
          ),
          <fpage>271</fpage>
          -
          <lpage>286</lpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Regev</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Silverman</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shapiro</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          :
          <article-title>Representation and Simulation of Biochemical Processes Using the Pi-calculus Process Algebra</article-title>
          . Pac. Symp. Biocomput.,
          <fpage>459</fpage>
          -
          <lpage>470</lpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Chaouiya</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Petri Net Modelling of Biological Networks</article-title>
          . Brief Bioinform.,
          <volume>8</volume>
          (
          <issue>4</issue>
          ),
          <fpage>210</fpage>
          -
          <lpage>219</lpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Koch</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heiner</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Petri nets</article-title>
          .
          <source>In Analysis of Biological Networks</source>
          . Edited by Junker,
          <string-name>
            <given-names>B.H.</given-names>
            ,
            <surname>Schreiber</surname>
          </string-name>
          , F. A Wiley Interscience Publication,
          <fpage>139</fpage>
          -
          <lpage>180</lpage>
          (
          <year>2008</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Matsuno</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miyano</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Petri Net Based Descriptions for Systematic Understanding of Biological Pathways</article-title>
          .
          <source>IEICE Trans. Fundamentals, E89-A(11)</source>
          ,
          <fpage>3166</fpage>
          -
          <lpage>3174</lpage>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Nagasaki</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yamaguchi</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yoshida</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Seiya</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Doi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tamada</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Matsuno</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miyano</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Genomic Data Assimilation for Estimating Hybrid Functional Petri Net from Time-course Gene Expression Data</article-title>
          . Genome Inform.,
          <volume>17</volume>
          (
          <issue>1</issue>
          ),
          <fpage>46</fpage>
          -
          <lpage>61</lpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Tasaki</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nagasaki</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kozuka-Hata</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Semba</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gotoh</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hattori</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Inoue</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yamamoto</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miyano</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sugano</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oyama</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Phosphoproteomics-based Modeling Defines the Regulatory Mechanism Underlying Aberrant EGFR Signaling</article-title>
          .
          <source>PLoS One</source>
          ,
          <volume>5</volume>
          (
          <issue>11</issue>
          ),
          <year>e13926</year>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9. Be´rard,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Bidoit</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Finkel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Laroussinie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Petit</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Petrucci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Schnoebelen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>McKenzie</surname>
          </string-name>
          ,
          <string-name>
            <surname>P.</surname>
          </string-name>
          :
          <source>Systems and Software Verification: Model-Checking Techniques and Tools</source>
          , Springer (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Donaldson</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gilbert</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <string-name>
            <given-names>A Model</given-names>
            <surname>Checking</surname>
          </string-name>
          <article-title>Approach to the Parameter Estimation of Biochemical Pathways</article-title>
          .
          <source>In: Proceedings of the 6th International Conference on Computational Methods in Systems Biology (CMSB '08)</source>
          , pp.
          <fpage>269</fpage>
          -
          <lpage>287</lpage>
          . SpringerVerlag, Berlin, Heidelberg (
          <year>2008</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nagasaki</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Hock</given-names>
            <surname>Koh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Miyano</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.</surname>
          </string-name>
          :
          <article-title>Online model checking approach based parameter estimation to a neuronal fate decision simulation model in C. elegans with hybrid functional Petri net with extension</article-title>
          ,
          <source>Mol. Biosyst</source>
          .,
          <volume>7</volume>
          (
          <issue>5</issue>
          ),
          <fpage>1576</fpage>
          -
          <lpage>1592</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Nagasaki</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Saito</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jeong</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kojima</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ikeda</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miyano</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <source>Cell Illustrator</source>
          <volume>4</volume>
          .0:
          <string-name>
            <given-names>A</given-names>
            <surname>Computational</surname>
          </string-name>
          <article-title>Platform for Systems Biology</article-title>
          , In Silico Biol.,
          <volume>10</volume>
          ,
          <issue>0002</issue>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Nagasaki</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Doi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Matsuno</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miyano</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>A Versatile Petri Net Based Architecture for Modeling and Simulation of Complex Biological Processes</article-title>
          . Genome Inform.,
          <volume>15</volume>
          (
          <issue>1</issue>
          ),
          <fpage>180</fpage>
          -
          <lpage>197</lpage>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <article-title>Circadian rhythms in Mus musculus</article-title>
          , http://www.csml.org/models/ csml-models/
          <article-title>circadian-rhythms-in-mouse/</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Kitagawa</surname>
          </string-name>
          , G.:
          <article-title>Non-Gaussian State-space Modeling of Nonstationary Time Series</article-title>
          .
          <source>Journal of the American Statistical Association</source>
          ,
          <volume>82</volume>
          (
          <issue>400</issue>
          ),
          <fpage>1032</fpage>
          -
          <lpage>1063</lpage>
          ,
          <year>1987</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Kitagawa</surname>
          </string-name>
          , G.:
          <article-title>Self-organizing State Space Model</article-title>
          .
          <source>J. of the American Statistical Association</source>
          ,
          <volume>93</volume>
          ,
          <fpage>1203</fpage>
          -
          <lpage>1215</lpage>
          (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Higuchi</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Self-organizing Time Series Model</article-title>
          .
          <source>In Sequential Monte Carlo Methods in Practice</source>
          , Springer-Verlag New York,
          <fpage>429</fpage>
          -
          <lpage>444</lpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Kitagawa</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Sato</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Monte Carlo Smoothing and Self-Organising Statespace Model</article-title>
          .
          <source>In Sequential Monte Carlo Methods in Practice</source>
          , Springer-Verlag New York,
          <fpage>177</fpage>
          -
          <lpage>195</lpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Kitagawa</surname>
          </string-name>
          , G.:
          <article-title>Monte Carlo Filter and Smoother for Non-Gaussian Nonlinear State Space Models</article-title>
          .
          <source>Journal of Computational and Graphical Statistics</source>
          ,
          <volume>5</volume>
          (
          <issue>1</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>25</lpage>
          (
          <year>1996</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Heiner</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehrack</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gilbert</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marwan</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          :
          <article-title>Extended Stochastic Petri Nets for Model-based Design of Wetlab Experiments</article-title>
          . Edited by Priami,
          <string-name>
            <surname>C.</surname>
          </string-name>
          , et al.,
          <source>Trans. on Comput. Syst. Biol. XI, LNBI</source>
          ,
          <volume>5750</volume>
          ,
          <fpage>138</fpage>
          -
          <lpage>163</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Donaldson</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gilbert</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <string-name>
            <given-names>A Monte</given-names>
            <surname>Carlo</surname>
          </string-name>
          <article-title>Model Checker for Probabilistic LTL with Numerical Constraints</article-title>
          ,
          <source>Technical report</source>
          , University of Glasgow, Department of Computing Science (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Hu</surname>
          </string-name>
          <article-title>¨rzeler and</article-title>
          <string-name>
            <surname>Hans</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , Ku¨nsch, R.:
          <article-title>Approximating and maximising the likelihood for a general state-space model</article-title>
          .
          <source>In: Sequential Monte Carlo Methods in Practice</source>
          , Springer-Verlag New York,
          <fpage>159</fpage>
          -
          <lpage>175</lpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>