=Paper= {{Paper |id=None |storemode=property |title=Dynamical models of genetic regulatory networks |pdfUrl=https://ceur-ws.org/Vol-741/INV03_Serra.pdf |volume=Vol-741 |dblpUrl=https://dblp.org/rec/conf/woa/Serra11 }} ==Dynamical models of genetic regulatory networks== https://ceur-ws.org/Vol-741/INV03_Serra.pdf
          On the dynamics of gene regulatory networks
                                                          Roberto Serra
                                           Modena and Reggio Emilia University
                                       European Centre for Living Technology, Venice
                                                 roberto.serra@unimore.it


Abstract— It is shown that a simplified model of genetic            shown that it can actually describe some quantitative features
regulatory networks, aimed at the study of their generic            of real genetic regulatory networks. The possibility to verify
properties, can shed light on some important biological             the appropriateness of such a model has been opened by the
phenomena. Two cases are analyzed, namely perturbations in          availability of DNA microarrays, which allow genome-wide
gene expression induced in an organism by the knock-out of          monitoring of the changes in gene expression levels. In this
selected genes, and cell differentiation. The role of simplified    paper, after reviewing the RBN model in section 2, I will
models in biology is discussed.                                     briefly discuss in section 3 its application to the study of
                                                                    perturbations induced by single gene knock-out in the yeast S.
                      I.   INTRODUCTION                             Cerevisiae, which was the first application of RBNs to the
    The wealth of data nowadays available in genomics and           simulation of quantitative properties of real genes in a cell.
other -omics has superseded our understanding of the key            Interestingly enough, the same study opens a way to test one of
biological processes which they refer to, so there is a strong      the strongest claims which have been put forth in complex
need for theories and models in order to make sense of the data     systems biology, i.e. that evolution has led organisms to critical
themselves.                                                         dynamical states, intermediate between order and chaos [3],
                                                                    which are sometimes referred to as "the edge of chaos". This
    The movement of systems biology has contributed to shift
                                                                    aspect will also be discussed in section 3.
the focus of research from a naive genocentric viewpoint to a
more sophisticated approach, which takes properly into account          In section 4 I will describe another application of RBNs to
system-level interactions. In this field, most models are quite     real biological systems. In this case the quantitative data which
specific, as they refer to the behaviour of a particular organ      can validate or disprove the model are not yet available,
(like e.g. the heart in mammals) or to a particular genetic-        however the model is interesting in principle, as it shows that it
metabolic subsystem.                                                is possible to describe several different phenomena involved in
                                                                    cell differentiation in a unified way by supposing that
    A useful complementary approach (which has been called
                                                                    differentiation is an emergent phenomenon of a genetic
complex systems biology by Kaneko [1]) is centered instead on
                                                                    regulatory network, without postulating particular gene
the search for generic properties, common to many different
                                                                    circuits. It is also possible to devise experiments to test the
biological systems. An example of this kind of properties is the
                                                                    hypotheses which lie at the basis of the model, as well as some
widespread fat-tailed distributions of several biological
                                                                    of their consequences.
variables, and an even more striking example is provided by
the scaling law which relates power consumption to body mass           A final remark concerns the reference list, which might
of different living species, irrespective of the differences        have become very long, had all the relevant original papers
among their anatomical and physiological properties [2]. Data       been mentioned. I prefer to keep the list short, quoting only a
indicate that the power consumption grows as mass3/4. It seems      few papers where all the other relevant references can easily be
indeed that the scaling law applies to very different sizes, from   found.
blue whales to small mammals and birds to mithocondria, thus
spanning an impressive range of 20 orders of magnitude.                             II. RANDOM BOOLEAN NETWORKS
Interestingly enough, the smartest explanation of this regularity        The model is fairly well-known and the reader is referred to
and of the value of the exponent is related to the number of        [4,5] for a more detailed description. A RBN is a dynamical
(relevant) spatial dimensions, i.e. 3, and to the (generic)         system whose N variables take values {1,0} which can change
hypothesis that biological evolution has tuned the features of      in time according to a well-defined function (called transition
living organisms in order to optimize power efficiency (under       function) of their inputs. It is convenient to think of it as a
suitable constraints).                                              directed graph, with nodes associated to variables. If variable i
   The search for generic properties of living beings had been      depends upon variable j then there is a link from node j to node
pioneered by Stuart Kauffman, who introduced his model of           i. In the case considered here, all the nodes have exactly k input
random boolean networks (RBNs) more than 40 years ago in            links, and the updating is synchronous. The network is random
an attempt at exploring the properties of genetic regulatory        in that both the connections are drawn at random (choosing the
networks. The model later became popular, in particular in the      k inputs to a node with uniform probability among the other N-
complex systems community, but it has only recently been            1) and the Boolean function associated to a node is chosen at
                                                                    random. Usually one either chooses the Boolean function with
uniform probability from a predefined set, or generates each          set concerns 6325 genes and 227 experiments and the
Boolean function by associating with a certain probability the        comparison with the experimental distribution of avalanches
output 1 or 0 to each of the 2k input possible input vectors.         turned out to be good.
     The system is deterministic and synchronous, therefore if N          The reason why such a simple model worked so well has
is finite its asymptotic states are cycles (a fixed point being a     been uncovered by analytical methods which have proven that
cycle with period one). Depending upon its structural features        the distribution of avalanches depends only upon the outdegree
(i.e. topology and boolean functions), a family of networks           distribution, while the indegree distribution plays no role [7].
shows a typical time behaviour, although single network               Moreover, in the case of classical random Boolean networks,
realizations can behave in a way different from that typical of       where the distribution of outgoing connections is Poissonian, it
their family. Indeed, some combinations of structural                 can be also proven that the distribution of small avalanches
parameters give rise to behaviours which have been termed             depends only upon a single parameter, the so-called Derrida
ordered, other combinations lead to disordered or "chaotic"           exponent which is given by the equation:
states. In the case of ordered systems the typical length of the
asymptotic cycles increases slowly with the system size;
moreover, it often happens that two nearby initial conditions
                                                                      where A is the average connectivity of the network and q is the
evolve to the same final attractor. In the case of disordered
                                                                      probability that a chosen node does not change its value when
networks the length of the cycles increases sharply with N, and
                                                                      one (and only one) of its inputs has changed (note that q
it often happens that close initial conditions lead to different
attractors. Critical networks have been defined as those whose        depends on the choice of the set of Boolean functions). had
structural parameters take values which are intermediate              been introduced in the past in order to distinguish between
between those of ordered and those of disordered networks.            ordered and disordered dynamical regimes (1 being the critical
                                                                      value), and it turns out that it also rules the distribution of
    A bold theoretical ansatz which has been proposed [3] is          avalanches. Therefore it is possible to estimate its value from
that critical RBNs are endowed with features which make them          the distribution of avalanches, so these analyses provide a
particularly well-suited to perform complex tasks in a changing       general way to test the criticality hypothesis and, within the
environment; therefore it has been argued that biological             limitations of the data set presently available, they also provide
evolution should have driven biological organisms in, or close        support to it.
to this region in parameter space.
                                                                                         IV. CELL DIFFERENTIATION
    III. PERTURBATIONS IN GENE REGULATORY NETWORKS
                                                                          One of the major challenges in complex systems biology is
    It has recently been possible to study the expression levels      that of providing a general theoretical framework to describe
of all the genes of an organism, and to compare their global          the phenomena involved in cell differentiation, i.e. the process
properties with those of genetic network models. A more               whereby stem cells, which can develop into different types,
detailed description of the results summarized below can be           become progressively more specialized. The model described
found in [6-8] and further references quoted therein.                 below (for more details see 9-10 and further references quoted
    In an important series of experiments a single gene of S.         therein) is an abstract one (it does not refer to a specific
cerevisiae was knocked-out, and the expression levels of all          organism or cell type) and it aims at reproducing the most
the genes, in cells with a knocked-out gene, was compared with        relevant features of the process: (i) the existence of different
those in normal, wild type cells. In order to make precise            degrees of differentiation, that span from totipotent stem cells
statements about the number of genes perturbed in a given             to fully differentiated cells; (ii) stochastic differentiation, where
experiment, and to compare them with Boolean models, it is            populations of identical multipotent cells stochastically
required that a threshold be defined, such that the difference is     generate different cell types; (iii) deterministic differentiation,
regarded as "meaningful" if the ratio of the expression of gene i     where signals trigger the progress of multipotent cells into
in experiment j to the expression of gene i in the wild type cell     more differentiated types, in well defined lineages; (iv) limited
is greater than the threshold (or smaller than its reciprocal). In    reversibility: differentiation is almost always irreversible, but
order to describe the global features of these experiments it is      there are limited exceptions under the action of appropriate
convenient to introduce the notion of avalanche, which is the         signals; (v) induced pluripotency: fully differentiated cells can
number of genes affected by the perturbation induced by a             come back to a pluripotent state by modifying the expression of
particular knock-out experiment.                                      some genes and (vi) induced change of cell type: modification
                                                                      of the expression of few genes can directly convert one
    The knock-out experiment can be simulated in silico by            differentiated cell type into another.
comparing the evolution of two identical RBNs which start
from the same state of an attractor, the only difference being            The key hypotheses are that the differentiation process is an
that one gene is clamped permanently to the value 0 in the            emerging property due to the interactions of very many genes
network which simulates knock-out. A gene belongs to the              (so its main features should be shared by a variety of different
avalanche associated to a particular knock-out if it differs in the   organisms) and that cellular noise plays a crucial role. To check
final states of the two networks at least once in the attractor       these hypotheses a noisy version of the RBN model can be
cycle. The initial simulations were performed using a classical       used (briefly refereed to as NRBN).
RBN with 2 input connections per node, restricting the set of            Noise is modelled as a transient flip of a single node,
Boolean functions to the so-called canalyzing ones. The data          chosen at random. Attractors of deterministic RBNs are
unstable with respect to noise even at these low levels, and if a    most stringent outcome of the model, and could be amenable to
node is flipped for a single time step in an attactor state one      experimental test.
sometimes observes transitions from that attractor to another
one. Therefore, by flipping all the states belonging to the              This hypothesis explains in a straightforward way the fact
                                                                     that there are different degrees of differentiation (i.e. property
attractors of a RBN, it is possible to create a complete map of
the transitions among the attractors. In these conditions single     i), corresponding to different threshold values. It is also
                                                                     straightforward to describe stochastic differentiation (i.e.
attractors can no longer be associated to cell types, as it is
usually assumed [4]. Ribeiro and Kauffman [11] observed that         property ii): in this vision the fate of a cell depends on the
                                                                     particular attractor where the system is found when the noise
it is possible to identify in the attractors’ landscape subsets of
attractors, which they called Ergodic Sets, which entrap the         level changes. The new cell type will be that corresponding to
                                                                     the new TES to which the attractor belongs at the new
system in the long time limit, so the system continues to jump
between attractors which belong to the set. Unfortunately it         threshold level.
turns out that most NRBNs have just one such set: this                   There exist also several processes, e.g. during the
observation rules out the possibility to associate them to cell      embryogenesis, in which cell differentiation is not stochastic
types.                                                               but it is driven towards precise, repeatable types by specific
    A possible solution to this problem is based on the              chemical signals, which activate or silence some genes. These
                                                                     signals can be simulated by permanently fixing to 1 or 0 the
observation that flips are a kind of noise fairly intense, as they
amount to silencing an expressed gene or to express a gene           state of some nodes. However this single action doesn’t
                                                                     influence the level of noise, and therefore doesn’t enable
which would otherwise be inactive: a particular transition may
well be an event too rare to happen with significant probability     differentiation: in order to have deterministic differentiation it
                                                                     is necessary that so-called "switch" nodes exist, whose
in the cell lifetime, if it can happen only by perturbing a
specific gene, or very few ones. It is possible therefore to         permanent perturbation coupled with a change in noise level
                                                                     always leads the system to the same TES. The existence of
introduce a threshold θ, and to neglect all the transitions having
                                                                     switch nodes has actually been verified to be a common
an occurrence probability lower than that. In such a way, the
                                                                     property (found in about 1/3 of the nets), thereby proving the
notion of Ergodic Set has to be modified in that of Threshold
                                                                     effectiveness of the model (i.e. property iii).
Ergodic Set (briefly, TES), a set of attractors linked only by
jumps having a probability higher than θ, that entrap the                Moreover, by simulating the overexpression of a few genes,
system in the long time limit. A TES is therefore a subset of        it has been possible to simulate also the other properties
attractors which are directly or indirectly θ-reachable              summarized above, and in particular the important processes of
(reachable by means of transition whose probability exceeds          induced pluripotency and transitions among different cell
the threshold θ) from at least another member of the set, and        types.
from which no transition can allow escaping. The threshold
clearly is related to the level of noise in the cell, and scales                             V. CONCLUSIONS
with the reciprocal of the frequency of flips [9].                       The above examples show that relatively simple generic
                                                                     models of gene regulatory networks are able to describe the
    An ergodic set can be described therefore as a TES with          quantitative features of the perturbations induced by gene
θ=0; by increasing the threshold, one usually observes the birth     knock-out, and to form the basis of an interesting model of cell
of more and more TESs until, above a certain level, all the          differentiation. It goes without saying that more sophisticated
attractors of the deterministic model are also independent           models might be necessary to describe other important
TESs. It is therefore possible to associate cell types to TESs,      properties.
that represent coherent stable ways of functioning of the same
genome even in the presence of noise. Several authors, on                But the point which is worth stressing is that even models
theoretical and experimental bases, associate different levels of    which are based on crude approximations may well provide
noise to different levels of differentiation, the noise being        insights on complex phenomena. This is well known in
higher the less differentiated the cell is. So the degree of         physics, where the aim is often that of finding very general
differentiation appears to be related to the possibility for an      properties, and simple models which display these properties
undifferentiated cell to wander in a portion of phase space          are considered very useful. On the contrary, researchers in
greater than the corresponding portions covered by more              biology and social sciences often overstate the need for detailed
differentiated cells. In the NRBN model a convenient proxy for       models, which entrap all the features and the interactions which
the available portion of phase space could be the number of          they suppose might be important - a requirement which, if
different attractors belonging to the TES associated to that cell.   taken too seriously, might even prevent the development of
A 0-threshold TES could therefore be associated to a totipotent      dynamical medelling in those fields. What might be envisaged
cell, while as the threshold is increased smaller TESs appear,       is a hierarchy of models, where the simpler ones, which
corresponding to more differentiated biological forms, until at      however capture some key properties, are used to understand
high enough threshold values all the attractors are TESs, thus       some of the most relevant aspects, and to suggest further
representing the fully differentiated cells. The increase of the     experiments. They may well be complemented by more
threshold would correspond to a decrease of noise level, that        detailed models able to provide more accurate quantitative (and
could be related to an improvement in the mechanisms                 sometimes also qualitative) behaviours.
whereby fluctuations are kept under control. This association
of differentiation to changes in the noise level represents the
                    ACKNOWLEDGMENT                                              [5]     Aldana M., Coppersmith S., Kadanoff L.P. (2003) Boolean dynamics
                                                                                with random couplings. In: Kaplan, E., Marsden, J.E., Sreenivasan, K. R.
   Useful discussions with Stuart Kauffman, Annamaria                           (Eds), Perspectives and Problems in Nonlinear Science, 23–89. Springer
Colacci, Marco Villani, Alex Graudenzi, Alessia Barbieri and                    Applied Mathematical Sciences Series
Chiara Damiani are gratefully acknowledged. This work has                       [6]     Serra, R. Villani, M., Semeria, A. (2004) Genetic network models and
been supported by the Italian MIUR-FISRproject nr.                              statistical properties of gene expression data in knock-out experiments. J.
                                                                                Theor. Biol. 227: 149-157, 2004
2982/Ric(Mitica) and by the Dice project of the Fondazione di                   [7]     Serra, R. Villani, M., Graudenzi, A., Kauffman, S.A. (2007) Why a
Venezia.                                                                        simple model of genetic regulatory networks describes the distribution of
                                                                                avalanches in gene expression data. J. Theor. Biol. 249: 449-460
                              REFERENCES                                        [8]     Serra, R., Villani, M., Graudenzi, A., Colacci, A., Kauffman, S.A.
[1]    Kaneko, K. (2006) Life: An Introduction to Complex System Biology.       (2008) The simulation of gene knock-out in scale-free random boolean models
Springer-Verlag, Berlin, New York                                               of genetic networks. Networks and heterogeneous media 3 (2), 333-343
[2]    West, G.B., Brown, J.H.(2005) The origin of allometric scaling laws in   [9]     Serra, R. Villani, M., Barbieri, A., Kauffman, S.A., Colacci, A. (2010)
biology from genomes to ecosystems: towards a quantitative unifying theory      On the dynamics of random boolean networks subject to noise: attractors,
of biological structure and organization. J. Exp. Biol. 208: 1575-1592          ergodic sets and cell types. J. Theor. Biol. 265: 185-193
[3]    Kauffman, S.A. (1995) At Home in the Universe. New York, Oxford          [10] Villani, M., Barbieri, A., Serra, R. (2011) A Dynamical Model of
University Press                                                                Genetic Networks for Cell Differentiation. PLoS ONE 6 (3): e17703
[4]    Kauffman S.A. (1993) The Origins of Order. New York, Oxford              [11] Ribeiro, A.S., Kauffman, S.A. (2007) Noisy attractors and ergodic sets
University Press                                                                in models of gene regulatory networks. J. Theor. Biol. 247: 743-755