<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Overcoming Dynamicity with Plasticity Neuromodulation for Lifelike Systems</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Chloe M. Barnes</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anikó Ekárt</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kai Olav Ellefsen</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kyrre Glette</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Peter R. Lewis</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jim Tørresen</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Aston University</institution>
          ,
          <addr-line>Birmingham, B4 7ET</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Informatics, University of Oslo</institution>
          ,
          <addr-line>Oslo, NO-0316</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Ontario Tech University</institution>
          ,
          <addr-line>Oshawa, ON L1G 0C5</addr-line>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>RITMO, University of Oslo</institution>
          ,
          <addr-line>Oslo, NO-0316</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>Natural beings are often situated in dynamic and unpredictable environments, and have evolved to use mechanisms such as neuromodulation - the ability to change behaviour via changes to synaptic activity in the brain - to adapt their behaviour over time to survive. The ability to change behaviour in this way is referred to as 'behavioural plasticity'. In this extended abstract, we summarise the findings from an exploration of how plasticity can afect how artificial agents evolve when solving tasks of diferent complexity [ 1, 2], and when evolving in dynamic and unpredictable environments [3].</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
    </sec>
    <sec id="sec-2">
      <title>2. Plasticity in Natural and Artificial Life</title>
      <p>
        Behavioural plasticity – specifically activational plasticity [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] – is the ability for an individual
to change its behaviour immediately in response to new environments or stimuli, by making
temporary phenotypic changes. Neuromodulation is a biological process found in animal
brains that can facilitate this type of behavioural plasticity [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], where synaptic activity between
neurons is modified or regulated temporarily to produce reversible behavioural changes that do
not afect learnt behaviour [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>
        This type of plasticity has inspired research into designing adaptive, lifelike, artificial
systems – especially those underpinned by neural networks since they themselves are inspired by
connectionist models of the brain. In artificial systems, activational plasticity can be achieved
by regulating or modulating connection or ‘synaptic’ activity locally in a neural network, or in
a separate modulatory network [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Operationalising Neuromodulation</title>
      <p>
        In the studies discussed in this extended abstract [
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1, 2, 3</xref>
        ], the efects of plasticity were explored
using the River Crossing Dilemma (RCD) testbed, first proposed by Barnes et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] – used to
explore how artificial agents evolve to solve tasks in shared 2D grid-world environment. The
RCD is characterised by a lethal, vertical river of water in the centre of a 19 × 19 grid, which
agents must learn to cross to achieve their goal. In doing so, they are presented with a social
dilemma, as a bridge for safe passage requires two stones – each with an increasing personal
cost to place. Using the RCD testbed, we explored how artificial agents with neural controllers
learnt to achieve goals when alone and when situated in an environment with another, unknown
agent; the presence and actions of this other agent makes the environment unpredictable for all,
to study how neuromodulation and behavioural plasticity afects goal-achievement in dynamic
environments. These agents learnt using neuroevolution, whereby the weights of a population
of neural networks are evolved or modified over time using an evolutionary algorithm.
      </p>
      <p>Specifically, neuromodulation is operationalised in these studies within a single neural
network, by temporarily gating/regulating the outgoing signals of neurons depending on the
incoming signal. Hidden neurons in the network could evolve to be non-modulatory (standard)
or modulatory (will gate or ‘turn of’ outgoing signals depending on the input). Efectively,
modulatory neurons ‘fire’ when the incoming signal is negative, changing the outgoing signal
from the neuron (i.e. the weights of the connections to the next layer of neurons) to be 0. In this
way, an agent can temporarily change its behaviour to respond to its environmental stimuli
without changing learnt or encoded knowledge, since neural network activity is regulated
locally without modifying connection weights permanently. This is intended to help agents
overcome dynamicity in their environments.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Task Complexity and Plasticity</title>
      <p>
        Both natural and artificial agents are often presented with environments that change over time,
are shared with others, and involve tasks that require multiple steps to complete [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. This
environmental dynamicity and uncertainty can make it challenging to learn to complete tasks
when the full state-space is not known – which is often the case. Consequently, the efect
of behavioural plasticity via neuromodulation was explored in artificial agents to ascertain
whether plastic behaviour is beneficial when learning to solve tasks with multiple stages [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ].
      </p>
      <p>The results show that the activity-gating neuromodulation described above has a significant
efect on an agent’s ability to solve tasks, when evolving in both single- and multi-agent
(paired with one other, unknown agent) environments and when agents are presented with
either a single- or multi-stage task. The expected fitness of agents evolving to solve these
tasks of varying dificulties in variable conditions was also seen to increase when agents are
capable of neuromodulation; this shows that behavioural plasticity can be beneficial for creating
adaptive agent controllers that are able to overcome the dynamicity and uncertainty that often
characterises realistic environments. Despite the significant benefit that behavioural plasticity
has on these agents for goal-achievement and fitness, this does come at the cost of evolutionary
volatility – that is, agent fitness is observed to fluctuate more often during evolution compared to
agents without neuromodulation. This creates a trade-of between fitness and goal-achievement,
and evolutionary volatility.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Dynamicity and Plasticity</title>
      <p>
        A further study explored the efect of neuromodulation and behavioural plasticity on agents that
are situated in environments with increasing variability [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]; in the natural world, environmental
dynamicity can arise from the unpredictable actions of others, which is becoming increasingly
common in artificial systems as components may interact unintentionally [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. To explore the
efect of plasticity on agents in variable environments, this study observed agents evolving in an
increasing number of environments with another, unknown agent that either stays consistent
throughout evolution (less variable), or is random at each generation (more variable); the actions
of a consistent partner would theoretically make the environment more predictable over time
than a randomised partner, which would be inherently unpredictable.
      </p>
      <p>The study found that modulatory agents achieved a significantly higher fitness than
nonmodulatory agents in all areas of the study. Further, a correlation was found between the
variability in the environment and the strength of the efect that neuromodulation has on
agent fitness, where neuromodulation has a stronger benefit on agents as variability increases.
Evolving artificial agents to achieve goals in highly variable environments is challenging, but
this study shows that a biologically-inspired mechanism like neuromodulation can increase
agent fitness by enabling them to temporarily change their behaviour and phenotype – even
when there are unknown entities in the environment.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Outlook</title>
      <p>
        Artificial systems are growing in size and it is increasingly likely that the components they
are comprised of will interact in unintended ways [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. By designing artificial and technical
systems to be more lifelike in their behaviour – such as employing them with the ability to
express behavioural plasticity – one would hope that these systems could combat the dynamicity
and uncertainty that characterises the realistic environments inhabited by the natural beings
Artificial Life researchers are inspired by. Neuromodulation is thus shown in these studies
to be a viable option for such plastic behaviour, by enabling agents to adapt their behaviour
temporarily in response to environmental changes, without afecting learnt knowledge, and
without requiring knowledge of others in the environment.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>C. M.</given-names>
            <surname>Barnes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ekárt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. O.</given-names>
            <surname>Ellefsen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Glette</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. R.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Tørresen</surname>
          </string-name>
          ,
          <article-title>Coevolutionary learning of neuromodulated controllers for multi-stage and gamified tasks</article-title>
          ,
          <source>in: Proceedings of the IEEE 1st International Conference on Autonomic Computing and Self-Organizing Systems (ACSOS)</source>
          , IEEE,
          <year>2020</year>
          , pp.
          <fpage>129</fpage>
          -
          <lpage>138</lpage>
          . URL: https://ieeexplore.ieee.org/document/ 9196458. doi:
          <volume>10</volume>
          .1109/ACSOS49614.
          <year>2020</year>
          .
          <volume>00034</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>C. M.</given-names>
            <surname>Barnes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ekárt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. O.</given-names>
            <surname>Ellefsen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Glette</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. R.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Tørresen</surname>
          </string-name>
          ,
          <article-title>Behavioural plasticity can help evolving agents in dynamic environments but at the cost of volatility</article-title>
          ,
          <source>ACM Transactions on Autonomous Adaptive Systems</source>
          <volume>15</volume>
          (
          <year>2021</year>
          ). doi:
          <volume>10</volume>
          .1145/3487918.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>C. M.</given-names>
            <surname>Barnes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ekárt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. O.</given-names>
            <surname>Ellefsen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Glette</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. R.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Tørresen</surname>
          </string-name>
          ,
          <article-title>Evolving neuromodulated controllers in variable environments</article-title>
          ,
          <source>in: Proceedings of the IEEE 2nd International Conference on Autonomic Computing and Self-Organizing Systems (ACSOS)</source>
          , IEEE,
          <year>2021</year>
          , pp.
          <fpage>164</fpage>
          -
          <lpage>169</lpage>
          . doi:
          <volume>10</volume>
          .1109/ACSOS52086.
          <year>2021</year>
          .
          <volume>00037</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>E. C.</given-names>
            <surname>Snell-Rood</surname>
          </string-name>
          ,
          <article-title>An overview of the evolutionary causes and consequences of behavioural plasticity</article-title>
          ,
          <source>Animal Behaviour</source>
          (
          <year>2013</year>
          ). doi:
          <volume>10</volume>
          .1016/j.anbehav.
          <year>2012</year>
          .
          <volume>12</volume>
          .031.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A. W.</given-names>
            <surname>Hamood</surname>
          </string-name>
          , E. Marder,
          <article-title>Animal-to-animal variability in neuromodulation and circuit function</article-title>
          ,
          <source>in: Cold Spring Harbor Symposia on Quantitative Biology</source>
          , volume
          <volume>79</volume>
          , Cold Spring Harbor Laboratory Press,
          <year>2014</year>
          , pp.
          <fpage>21</fpage>
          -
          <lpage>28</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>L. F.</given-names>
            <surname>Abbott</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. B.</given-names>
            <surname>Nelson</surname>
          </string-name>
          ,
          <article-title>Synaptic plasticity: taming the beast</article-title>
          ,
          <source>Nature Neuroscience</source>
          <volume>3</volume>
          (
          <year>2000</year>
          )
          <fpage>1178</fpage>
          -
          <lpage>1183</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Beaulieu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Frati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Miconi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lehman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. O.</given-names>
            <surname>Stanley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Clune</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Cheney</surname>
          </string-name>
          , Learning to continually learn,
          <source>in: Proceedings of the 24th European Conference on Artificial Intelligence (ECAI)</source>
          , IOS Press,
          <year>2020</year>
          , pp.
          <fpage>992</fpage>
          -
          <lpage>1001</lpage>
          . doi:
          <volume>10</volume>
          .3233/FAIA200193.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>C. M.</given-names>
            <surname>Barnes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ekárt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. R.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <article-title>Social action in socially situated agents</article-title>
          ,
          <source>in: Proceedings of the IEEE 13th International Conference on Self-Adaptive and Self-Organizing Systems</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>97</fpage>
          -
          <lpage>106</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A.</given-names>
            <surname>Dezfouli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. W.</given-names>
            <surname>Balleine</surname>
          </string-name>
          ,
          <article-title>Learning the structure of the world: The adaptive nature of statespace and action representations in multi-stage decision-making</article-title>
          ,
          <source>PLOS Computational Biology</source>
          <volume>15</volume>
          (
          <year>2019</year>
          )
          <fpage>1</fpage>
          -
          <lpage>22</lpage>
          . doi:
          <volume>10</volume>
          .1371/journal.pcbi.
          <volume>1007334</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J.</given-names>
            <surname>Hähner</surname>
          </string-name>
          , U. Brinkschulte,
          <string-name>
            <given-names>P.</given-names>
            <surname>Lukowicz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mostaghim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Sick</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Tomforde</surname>
          </string-name>
          ,
          <article-title>Runtime self-integration as key challenge for mastering interwoven systems</article-title>
          ,
          <source>in: Proc. of the 28th Intl. Conf. on Architecture of Computing Systems (ARCS)</source>
          ,
          <source>VDE</source>
          ,
          <year>2015</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>