<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Completing signaling networks by abductive reasoning with perturbation experiments</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Adrien Rougny</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yoshitaka Yamamoto</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hidetomo Nabeshima</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gauvain Bourgne</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anne Poupon</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Katsumi Inoue</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Christine Froidevaux</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>BIOS group, INRA</institution>
          ,
          <addr-line>CNRS</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>LIP6, CNRS, Universit Pierre et Marie Curie</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Laboratoire de Recherche en Informatique, CNRS, Universite Paris-Sud</institution>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>National Institute of Informatics</institution>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>University of Yamanashi</institution>
        </aff>
      </contrib-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Signaling networks model the ow of information occurring in cells after they
have been stimulated by an extracellular signal (for instance, a hormone).
Together with the rise of available high-throughput data, such networks become
always larger and more complex. Consequently, automatic methods have
become necessary for their analysis. Methods relying on discrete formalisms, such
as logical ones, seem well suited as numerical parameters are often di cult to
obtain.</p>
      <p>One fundamental task of cell signaling biology is to test whether available
experimental data can be explained by a given signaling network, and else, to
modify the network (by adding or removing an edge) or clean the data so that it
can be explained. Methods to accomplish this task take as input a representation
of signaling networks called interaction graph (IG). In IGs, nodes are molecules
(or activities that originate from molecules) and arcs are positive or negative
in uences that molecules have on each other.</p>
      <p>These various works mainly di er on four aspects. (i) The semantics used to
interpret IGs: In [11] and [14, 12], authors interpret the arcs of an input IG under
the path semantics, introduced in [13], or under the Sign Consistency Model [15],
respectively, whereas authors of [4, 10] interpret IGs as causality networks, which
implies the use of a more constrained semantics. Authors of [14] also consider the
problem within the boolean network semantics. (ii) The experimental data they
take into account: Methods of [11, 4] take as input steady-state shift experiments,
whereas methods of [14, 12, 10] take into account perturbation experiments. (iii)
The modi cations of the network or the cleaning of the data they propose to
explain unexplained experimental results: The methods in [11, 12] provide
possible modi cation of the input network or the data, whereas the method in [4, 10]
allows only the completion of the input network by addition of edges. (iv) The
formalism they use: graph theory [14], integer linear programming [12], answer
set programming [11] and rst-order logic [4, 10].</p>
      <p>In this work, we propose a method to check whether a set of perturbation
experiments can be explained by a signaling network represented in SBGN-AF
[16], and else, to complete the network by adding edges to the network.</p>
      <p>SBGN-AF is a standard to represent signaling and gene regulation networks.
It extends the classical IG representation by including logical operators (the
AND and the OR operator) that permit to specify logical functions within the
graph. Taking as input SBGN-AF maps, we extend the path semantics of [13]
by considering such operators in the de nition of the paths, that we formalize
in rst-order logic, based on the translation of SBGN-AF maps into predicates
introduced in [8]. We also interpret perturbation experiments making a stronger
assumption than in [14, 12, 10], resulting in a more constrained setting (cf.
section 3). We perform both the explanation and the completion tasks within the
same abductive framework by using the consequence nding method from
SOLAR [7].
2</p>
    </sec>
    <sec id="sec-2">
      <title>Paths semantics with logical operators</title>
      <p>
        Positive and negative paths of an SBGN-AF map are built by transitive closure
of the elementary arcs.. We interpret a positive path from an activity A to
an activity B as a possibility to explain an increase (resp. decrease) of B by
an increase (resp. decrease) of A. Analogously, we interpret a negative path
from A to B (denoted by inhibits (A; B)) as a possibility to explain a decrease
(resp. increase) of B by an increase (resp. decrease) of A. Positive and negative
paths are denoted by stimulates (A; B) and inhibits (A; B), respectively. The
following axioms allow building positive and negative paths using the in uences
and the logical operators of an SBGN-AF map.
Axioms (
        <xref ref-type="bibr" rid="ref1 ref2 ref3 ref4 ref5 ref6">1-6</xref>
        ) are the main transitivity axioms, while axioms (
        <xref ref-type="bibr" rid="ref7 ref8 ref9">7-9</xref>
        ) and (
        <xref ref-type="bibr" rid="ref10 ref11 ref12">10-12</xref>
        )
express the semantics of the AND and the OR logical operators, respectively.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Formalization of experimental observations</title>
      <p>We consider experimental observations that originate from perturbation
experiments. Such experiments consist in comparing the rate of an activity aT between
two batches of cells each having received a particular treatment. In the control
batch, cells are stimulated by a set of molecules, whose corresponding set of
activities are denoted by S. In the experimental batch, cells are rst treated with
a number of inhibitors that suppress a set of activities denoted by KO. The cells
are then stimulated as in the control batch. We introduce a variable e that takes
the value # (resp. ") if and only if (i ) the rate of aT is lower (resp. higher) in the
experimental batch than in the control batch. We denote such an experimental
observation by the tuple (S; KO; aT ; e).</p>
      <p>For a given experimental observation E = (S; KO; aT ; e), if e =#, then aT is
more inhibited or less stimulated by the activities of S in the experimental batch
than in the control batch due to the suppression of at least one activity of KO.
In the cells of the experimental batch, as all activities of KO are suppressed,
they can no longer be performed by the cells. Consequently, the lower overall
stimulation of aT can only be caused by suppressing at least one positive path
from an activity of S to aT . Thus, e =# i there exists at least one positive path
outgoing from an activity of S, incoming to aT , and passing through an activity
of KO. Analogous reasoning is made for e =", hence e =" i there exists at least
one negative path from an activity of S to aT and passing through an activity
of KO.</p>
      <p>Here, we make the hypothesis that suppressing the activities of KO has
an e ect on the pathways that link the activities of S to aT . That is not the
case in [14, 12, 10], where the authors make the assumption that suppressing the
activities of KO only a ects the pathway between activities of KO and aT , thus
not taking into account the activities of S. As a result, our interpretation is
more constrained. Therefore, experiments that could be explained by a network
with the interpretation of experiments made in [14, 12, 10] could no longer be
explained within our setting, resulting in the discovery of new possible arcs.</p>
      <p>To explicitly describe the role of S, we add one virtual activity node aS to
the prior network so that for each activity a 2 S, we add a stimulation arc
from aS to a. According to our interpretation of perturbation experiments and
our transitivity axioms, each experimental observation E = (S; KO; aT ; e) is
formalized as the following disjunction OE :</p>
      <p>OE =
OE =</p>
      <p>_
aKO2KO</p>
      <p>
        _
aKO2KO
(stimulates (aS ; aKO) ^ inhibits (aKO; aT ))_
(inhibits (aS ; aKO) ^ stimulates (aKO; aT ))
if e =";
(stimulates (aS ; aKO) ^ stimulates (aKO; aT ))_
(inhibits (aS ; aKO) ^ inhibits (aKO; aT ))
if e =# :
(
        <xref ref-type="bibr" rid="ref13">13</xref>
        )
      </p>
      <p>Given an SBGN-AF map N and an experimental observation E = (S; KO; aT ; e),
we want to check if E can be explained by N . If not, we want to nd a minimal
set of arcs that complete N in order to explain E . Both tasks can be realized
within the same abductive setting, presented hereafter.</p>
    </sec>
    <sec id="sec-4">
      <title>Abductive setting for the completion task</title>
      <p>
        Let N be an SBGN-AF map and E = (S; KO; aT ; e) be an experimental
observation. Let B be the background theory formed of the translation of N into
predicates and axioms (
        <xref ref-type="bibr" rid="ref1 ref10 ref11 ref12 ref2 ref3 ref4 ref5 ref6 ref7 ref8 ref9">1-12</xref>
        ), and OE be the observation formalized from E.
Then, solving both the explanation and the completion task consists in
searching for all minimal hypotheses H such that B [ H j= OE and B [ H 6j= . If
B j= OE , then clearly N explains E.
      </p>
      <p>For the computation of H, we can use the consequence nding system
SOLAR [7], that allows to de ne a set of abducibles by means of the language
bias P describing the negations of desirable hypotheses, and seek for all the
subsumption-minimal hypotheses belonging to P. In the completion task, every
added in uence is either a stimulation or an inhibition. Besides, we restrict the
number of added in uences to at most two for each observation in order to get
more realistic hypotheses that could be tested experimentally. Then, P is given
under the form hf:stimulates( ; ); :inhibits( ; )g; Length 2i, where Length
is the number of literals (i.e., instances of :stimulates( ; ) and :inhibits( ; ))
allowed in the hypothesis.</p>
      <p>In general, SOLAR can produce a large amount of hypotheses. To reduce
it, we perform a rst selection that operates directly at the generation step or
during a post- ltering step. We do not consider hypotheses that generate a loop
in the prior map and those that contain constants mapped to logical operators
or the constant aS . We then use a greedy algorithm to select hypotheses based
on the decreasing number of experimental observations they can explain.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Application: the FSHR-induced network</title>
      <p>We applied our method to two pathways of the FSHR-induced signaling network,
namely the G protein pathway and the PI3K pathway taken from [1] (See Fig.
1). We built a dataset of 29 experimental observations by gathering and
formalizing reliable experimental results from the literature related to the FSHR. For
each experiment, only one activity suppressor was used. Consequently, for each
experimental observation, the set KO is merely a singleton.</p>
      <p>Among the 29 di erent experimental observations, 17 observations could be
explained by the network, and the 12 remaining ones were used to complete the
network. For each of them we computed minimal hypotheses su cient to explain
it when added to the network. We ran SOLAR (ver. 2) with 12 machines (Intel
Xeon E-1230 V2 (3.3GHz) and 8GB RAM) in parallel, with a limited executing
time of 4 hours.</p>
      <p>
        Each of the 12 observations could be explained by hypotheses containing a
unique in uence, although more complex hypotheses were also generated.
Consequently, we chose to focus on the hypotheses containing a unique in uence.
Using our greedy algorithm, we ranked more than 250 hypotheses containing a
unique in uence generated during the abduction phase, and selected 28 among
them. Results are shown in Table 1.
1 (fcamp; epacg; pi3k; akt; #)
2 (fcamp; epacg; pi3k; rps6; #)
3 (fcamp; epacg; p38mapk; akt; #)
4 (fcamp; epacg; pi3k; p70s6k; #)
5 (fcamp; epacg; pi3k; rps6; #)
6 (ffsh fshr; epacg; pka; akt; ")
7 (ffsh fshr; epacg; p38mapk; akt; #)
8 (fcamp; epacg; pka; p70s6k; ")
9 (ffsh fshr; epacg; p38mapk; erk12; #)
10 (ffsh fshr; epacg; camp epac; erk12; #)
11 (ffsh fshr; epacg; mek; p38mapk; #)
12 (fcamp; epacg; pka; akt; ")
Table 1: Application to the
FSHRinduced network. Lines correspond
to experimental observations, columns
to selected hypotheses. A cell is green Figure 1: The FSHR-induced
netif the hypothesis explains the observa- work, represented in SBGN-AF. The
tion. Experimental observations that are G protein pathway is represented in red
explained by the network, as well as and the PI3K pathway in blue.
hypotheses (
        <xref ref-type="bibr" rid="ref10 ref11 ref12 ref13 ref14 ref15 ref16 ref8 ref9">8-28</xref>
        ) are omitted.
      </p>
      <p>
        Hypothesis (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) proposes that p38MAPK could activate PI3K. In [3], the
authors make the hypothesis of such a crosstalk in Granulosa Cells.
Moreover, activation of Akt in Zn2+-treated cells has been shown to pass through
PI3K downstream of p38MAPK [9]. This result shows that p38MAPK is able
to trigger the PI3K pathway in Zn2+ treated cells, and thus this reinforces our
hypothesis for FSH stimulated cell. Hypotheses (
        <xref ref-type="bibr" rid="ref2 ref3 ref4">2-4</xref>
        ) all suggest an inhibitory
crosstalk between p38MAPK and the RAF/MEK/ERK pathway. In [5], the
authors clearly state that p38MAPK inhibits the RAF/MEK/ERK pathway during
muscle di erentiation, thus suggesting a potential in uence of p38MAPK on the
RAF/MEK/ERK pathway. Hypotheses (
        <xref ref-type="bibr" rid="ref10 ref11 ref12 ref13 ref14 ref15 ref16 ref5 ref6 ref7 ref8 ref9">5-28</xref>
        ) all suggest a crosstalk between
the pathway downstream of MEK and the cAMP-EPAC pathway. A crosstalk
between ERK and cAMP has indeed been evidenced in [6], even if it involves a
feedback loop (excluded in our work).
      </p>
      <p>According to our literature review, top ranked hypotheses are more promising
than low ranked ones, indicating that selecting hypotheses based on the number
of observations they can explain seems to be appropriate.</p>
      <p>
        Interestingly, experimental results (
        <xref ref-type="bibr" rid="ref1 ref2 ref4 ref5 ref8">1,2,4,5,8</xref>
        ) would have been explained by
the network considering the less constrained interpretation of experimental
results given in [14, 12, 10], and would not have allowed to generate any hypothesis.
6
      </p>
    </sec>
    <sec id="sec-6">
      <title>Concluding remarks</title>
      <p>We have proposed a logical formalization of SBGN-AF maps and transitivity
axioms that allow to check, given an SBGN-AF map, whether some experimental
observations can be explained by the map, and else to generate hypotheses that
complete the map. Application to the FSHR-induced signaling network shows
that the method leads to plausible hypotheses, some of which having already
been demonstrated in other signaling systems, and thus that it is promising.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Gloaguen</surname>
          </string-name>
          et al.:
          <article-title>Mapping the follicle-stimulating hormone-induced signaling networks</article-title>
          .
          <source>Frontiers in endocrinology 2</source>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Choi</surname>
          </string-name>
          et al.:
          <article-title>Gonadotropin-stimulated epidermal growth factor receptor expression in human ovarian surface epithelial cells: involvement of cyclic amp-dependent exchange protein activated by camp pathway</article-title>
          .
          <source>Endocrine-related cancer 16(1)</source>
          , pp.
          <volume>179</volume>
          {
          <issue>188</issue>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Gonzalez-Robayna</surname>
          </string-name>
          et al.:
          <article-title>Follicle-stimulating hormone (fsh) stimulates phosphorylation and activation of protein kinase b (pkb/akt) and serum and glucocorticoidinduced kinase (sgk): evidence for a kinase-independent signaling by fsh in granulosa cells</article-title>
          .
          <source>Molecular Endocrinology</source>
          <volume>14</volume>
          (
          <issue>8</issue>
          ), pp.
          <volume>1283</volume>
          {
          <issue>1300</issue>
          (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Inoue</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Doncescu</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nabeshima</surname>
          </string-name>
          , H.:
          <article-title>Completing causal networks by meta-level abduction</article-title>
          .
          <source>Machine learning 91(2)</source>
          , pp.
          <volume>239</volume>
          {
          <issue>277</issue>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Lee</surname>
          </string-name>
          et al.:
          <article-title>Activation of p38 mapk induces cell cycle arrest via inhibition of raf/erk pathway during muscle di erentiation</article-title>
          .
          <source>Biochemical and biophysical research communications 298(5)</source>
          , pp.
          <volume>765</volume>
          {
          <issue>771</issue>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Baillie</surname>
          </string-name>
          et al.
          <article-title>Phorbol 12-myristate 13-acetate triggers the protein kinase Amediated phosphorylation and activation of the PDE4D5 cAMP phosphodiesterase in human aortic smooth muscle cells through a route involving extracellular signal regulated kinase (ERK)</article-title>
          .
          <source>Molecular Pharmacology</source>
          <volume>60</volume>
          (
          <issue>5</issue>
          ), pp.
          <volume>1100</volume>
          {
          <issue>1111</issue>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Nabeshima</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Iwanuma</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Inoue</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Solar: a consequence nding system for advanced reasoning</article-title>
          .
          <source>In: Automated Reasoning with Analytic Tableaux and Related Methods</source>
          , pp.
          <volume>257</volume>
          {
          <fpage>263</fpage>
          . Springer (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Rougny</surname>
          </string-name>
          et al.:
          <article-title>Analyzing sbgn-af networks using normal logic programs</article-title>
          .
          <source>Logical Modeling of Biological Systems</source>
          , pp.
          <volume>325</volume>
          {
          <issue>361</issue>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Wu</surname>
          </string-name>
          et al.:
          <article-title>p38 and egf receptor kinase-mediated activation of the phosphatidylinositol 3-kinase/akt pathway is required for zn2+-induced cyclooxygenase-2 expression</article-title>
          .
          <source>AJP-Lung Cellular and Molecular Physiology</source>
          <volume>289</volume>
          (
          <issue>5</issue>
          ),
          <source>L883{L889</source>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Yamamoto</surname>
          </string-name>
          et al.:
          <article-title>Completing sbgn-af networks by logic-based hypothesis nding</article-title>
          .
          <source>In: Formal Methods in Macro-Biology</source>
          , pp.
          <volume>165</volume>
          {
          <fpage>179</fpage>
          . Springer (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Gebser</surname>
          </string-name>
          et al.:
          <article-title>Repair and Prediction (under Inconsistency) in Large Biological Networks with Answer Set Programming</article-title>
          .
          <source>In: KR</source>
          (
          <year>2010</year>
          , April)
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Melas</surname>
          </string-name>
          et al.:
          <article-title>Detecting and removing inconsistencies between experimental data and signaling network topologies using integer linear programming on interaction graphs</article-title>
          .
          <source>PLoS computational biology 9</source>
          (
          <issue>9</issue>
          ), p.
          <source>e1003204</source>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Klamt</surname>
          </string-name>
          et al.
          <article-title>A methodology for the structural and functional analysis of signaling and regulatory networks</article-title>
          .
          <source>BMC bioinformatics 7</source>
          (
          <issue>1</issue>
          ), p.
          <volume>56</volume>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Samaga</surname>
          </string-name>
          et al.
          <article-title>The logic of EGFR/ErbB signaling: theoretical properties and analysis of high-throughput data</article-title>
          .
          <source>PLoS Comput Biol</source>
          <volume>5</volume>
          (
          <issue>8</issue>
          ), p.
          <source>e1000438</source>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Siegel</surname>
          </string-name>
          et al.
          <article-title>Qualitative analysis of the relation between DNA microarray data and behavioral models of regulation networks</article-title>
          .
          <source>Biosystems</source>
          <volume>84</volume>
          (
          <issue>2</issue>
          ), p.
          <fpage>153</fpage>
          -
          <lpage>174</lpage>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Mi</surname>
          </string-name>
          et al.
          <article-title>Systems biology graphical notation: activity ow language level 1</article-title>
          .
          <string-name>
            <given-names>Nature</given-names>
            <surname>Precedings</surname>
          </string-name>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>