<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Migratable AI Systems for Tailoring User Experience through Multimodal Afective-Cognitive State Analysis</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ludovica La Monica</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute of Cognitive Science and Technology, National Research Council</institution>
          ,
          <addr-line>00185 Rome</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Unit of Computer Systems and Bioinformatics, Department of Engineering, Università Campus Bio-Medico di Roma</institution>
          ,
          <addr-line>00128 Rome</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In the era of digital technologies permeating numerous aspects of society, the development of intelligent artificial agents endowed with autonomy, social capabilities, reactivity, and proactivity has become a pivotal innovation. Traditionally, these agents were confined to singular forms, known as embodiments, which dictated their capabilities and the environments in which they operated. To transcend these limitations and enable agents to expand their capabilities, the concept of agent migration has emerged, allowing them to seamlessly transition between diferent embodiments. My research program delves into the realm of agent migration, uncovering its challenges and potential opportunities. The central objective is to leverage this technology to deepen our understanding of human afective and cognitive states across various contexts. Specifically, my aim is to develop a migratory agent architecture capable of recognizing and characterizing individuals' emotional experiences using diferent sets of capabilities tied to distinct embodiments. In summary, the intent of this research is to harness migratable AI to promote the technical advantages brought about by agent migration and gain insights into how to utilize the acquisition of additional capabilities resulting from migration to craft a more comprehensive and potentially enduring architecture, to be applied in the realm of afective computing.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Intelligent Agent</kwd>
        <kwd>Embodied Agent</kwd>
        <kwd>Embodied Cognition</kwd>
        <kwd>Migratable AI</kwd>
        <kwd>Afective Computing</kwd>
        <kwd>Emotion recognition</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Nowadays, an increasing number of human activities rely on digital technologies in the current
society [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. This phenomenon has driven innovation in the field of artificial agents, i.e.,
sophisticated software entities exhibiting specific characteristics such as autonomy, social abilities,
reactivity, and proactivity [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Traditionally, such agents were constrained to a single form,
known as embodiment, representing their physical or digital manifestation. Embodiment plays
a key role in the development of an intelligent agent as it determines the sense of presence that
allows users to recognize it and establish a relationship with it [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Simultaneously, embodiment
dictates the intrinsic capabilities of the agent and the environment in which it can operate. This
concept implies that the agent’s capabilities are inherently tied to and constrained by its form.
To overcome this limitation and enable agents to extend their abilities, the concept of agent
migration has been introduced. This approach allows agents to transfer their "being" across
diferent embodiments, enabling them to acquire the capabilities associated with each of them
and possibly expand their interactions with the surrounding environment and users. Several
studies explored aspects related to agent migration, investigating the advantages and challenges
associated with this practice. Agent migration, by and large, is defined as the process where
intelligent agent moves through diferent embodiments, remaining active in one at a time [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        The concept of an agent capable of migration allows for the separation of the agent’s mind
from the body it occupies [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The mind defines its essence, being an intelligent system that
makes decisions and plans actions to be executed by the body. The body is seen as a shell
that carries out the actions prescribed by the mind and determines the capabilities it can
leverage. The detachment of the mind from a specific body not only facilitates the acquisition
of extended capabilities but eliminates the limitation of environmental confinement, facilitating
uninterrupted interaction with the agent, as the mind can accompany the user across diverse
contexts by merely changing its physical form.
      </p>
      <p>The objective of my Ph.D. research program (started this year) is to examine the concept of
agent migration, exploring its challenges and opportunities, with the aim of harnessing this
technology to enhance the comprehension of the afective and cognitive states of individuals
across various contexts. Specifically, the goal is to develop a migratory agent architecture that
enables the recognition and comprehensive characterization of subject’s emotional experiences,
utilizing diferent sets of capabilities based on the embodiment it assumes.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Review of Literature</title>
      <p>
        Focus on Migratable AI. Migrant agents and the migration process have been studied in
the context of social interactions and human-machine interactions, i.e., HRI (Human-Robot
Interaction) and HCI (Human-Computer Interaction). Pioneering this field, Imai et al. (1999)
addressed agent migration with the goal of enhancing human-robot interaction [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. In their
work, they introduced the migratable system ITAKO, which allowed an intelligent agent to
move between a mobile PC and an autonomous robot.
      </p>
      <p>
        In the 2000s, the UCD (University College Dublin) team introduced the Agent Chameleons
architecture with the aim of enabling seamless transitions between physical and virtual
information environments, emphasizing the crucial role of migration as a means for expanding the
capabilities of autonomous systems [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        Subsequently, research began to consider the efects of migration on users. Indeed, while
migration enhances agent capabilities and frees them from environmental constraints, the
critical issue of user recognition of the agent’s identity emerged. Studies, such as that of Gomes
et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], analyzed how users perceive the migration process and the number of perceived agent
identities. Concurrently, Holz et al. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] evaluated diferent forms of embodiment (i.e., physical
or virtual) to optimize agent capabilities, particularly in relational terms. Koay et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] looked
at the use of visual representation of migration as a method of retaining the agent’s identity
and improving its identification, so facilitating the user’s interaction. In alignment with these
approaches, Tejwani et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] proposed a study aimed at assessing the efect on user-agent
interaction quality resulting from maintaining specific parameters during the migration process,
namely agent identity and acquired information.
      </p>
      <p>It is evident, therefore, that the agent’s identity constitutes a key feature in the migration
process, fostering the establishment of stable and enduring relationships between the user and
the agent.</p>
      <p>
        Focus on Emotion Recognition. Numerous attempts have been made to classify emotions
in order to understand human behavior [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. The number of categories in which emotions
should be classified has always been a contentious topic. The two main recognized theories
are Ekman’s discrete theory of emotions [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and Russell’s dimensional theory of emotions
[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Systems for recognizing emotional states leverage these theories to characterize acquired
information and establish connections between emotions, afect, and behavior.
      </p>
      <p>One of the channels used to gather emotional information is speech. In this context various
approaches are available depending on the audio signal format. These data can be processed
as raw audio waveforms or in 2D format, such as spectrograms [12]. When analyzing raw
audio waveform data, a widely used approach involves the use of the WaveNet architecture
[13, 14]. On the other hand, deep learning technologies, such as Long Short-Term Memory
(LSTM) artificial neural networks integrated with Convolutional Neural Networks (CNN), have
been employed for the analysis of 2D audio signals [15, 16, 17]. Another relevant methodology
involves the use of transformers [18].</p>
      <p>Facial expressions, similar to natural language, are a manifestation of human emotions. The
Facial Action Coding System (FACS) [19], developed by Ekman and Friesen in 1978, is the
most well-known and widely used system for facial activity analysis. More recently, with
the introduction of Graphics Processing Units (GPUs) and Convolutional Neural Networks,
real-time applications have been developed, such as Microsoft service Emotion API [20] and
the MIT Media Lab Afectiva technology Afedex [21].</p>
      <p>From a physiological perspective, emotions leave traces that can be studied. Signals such as
electrodermal activity, also known as Galvanic Skin Response (GSR), heart and respiratory rate,
brain activity, and many others are typically considered when attempting emotion classification
through a dimensional approach [22].</p>
      <p>In conclusion, it is evident that emotion detection and, more broadly, Afective Computing
are fields of great interest, despite the ongoing challenges.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Problem Statement</title>
      <p>The literature review illustrates that agent migration is not devoid of challenges and
complications. One of the primary challenges concerns the continuous recognition of the agent by the
user, as the various embodiments may not be immediately attributed to a singular entity. This
has prompted fundamental inquiries, which have undergone comprehensive scrutiny, regarding
the nature of the agent’s identity and the mechanisms facilitating recognition when its form
may undergo changes. Indeed, the quest for optimal interaction between humans and agents has
constituted the central focus of recent research in the domain of migratable AI. Simultaneously,
it becomes imperative to understand how to leverage migratable AI, what technical benefits
agent migration can bring and how to use the acquisition of additional capabilities brought
about by migration to produce a more comprehensive and potentially enduring architecture.
These fundamental questions have served as the driving impetus behind my research endeavors
in this field.</p>
      <p>Another key aspect in the designing of the research project was the recognition of the
significance of analyzing individuals’ emotional states. The conducted assessment revealed that
emotions represent a fundamental form of non-verbal interaction, and the ability to express
emotional states prompted by everyday situations is a crucial aspect of human life.
Simultaneously, the capacity to recognize emotional states expressed by others plays a pivotal role
in both interpersonal relationships and the psychological well-being of individuals. Afective
computing constitutes the main research area dedicated to the analysis of these emotional
states: it leverages information acquired through the analysis of specific signals, such as
facial expressions, natural language, body gestures, and physiological cues, to infer individuals’
afective-cognitive states. However, research has underscored that information such as natural
language and facial expressions, when considered in isolation, may lead to erroneous
interpretations, as they can be consciously controlled by humans who may choose to conceal their
emotional states. Conversely, physiological signals represent a more objective and reliable type
of information, albeit more challenging to capture and, most importantly, interpret. These
considerations have prompted the idea of applying migratable artificial intelligence to the field
of afective computing.</p>
    </sec>
    <sec id="sec-4">
      <title>4. My work</title>
      <p>The main objective of this research is to develop an intelligent agent capable of migrating
between diferent embodiments in order to leverage a broader range of capabilities and assist
the user across various types of tasks. The capabilities of this intelligent system to migrate will
enable the consistent and accurate integration of various type of information gathered during
user interaction.</p>
      <p>The proposed project aims to harness these capabilities to enhance the recognition of emotions
expressed by individuals. Specifically, the agent’s ability to migrate across diferent
embodiments will enable it to acquire, analyze, and combine various types of user-related information.
Consequently, the recognized emotional state will no longer rely on a single information channel
but will result from a comprehensive analysis involving diverse data sources.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>This work delves into the concept of agent migration, shedding light on both its advantages
and the remaining challenges. Specifically, it identifies migration as the means to enhance the
versatility and adaptability of intelligent agents across various contexts.</p>
      <p>The main objective of this research is to expand the range of capabilities within the intelligent
system, enabling it to assist users in diverse tasks. The focal point of this endeavor lies in
the analysis of emotions and afective computing, recognizing the paramount importance of
emotional state analysis in human interactions. Consequently, the aim is to utilize migratory
capabilities to integrate diverse user-related information, ultimately enhancing the capacity to
recognize individuals’ emotional states.</p>
      <p>In conclusion, this research seeks to harness migratory artificial intelligence, identify the
technical advantages stemming from agent migration, and acquire insights on how to leverage the
acquisition of additional capabilities brought about by migration to develop more comprehensive
and potentially enduring architectures in the field of afective computing.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This research is carried out as PhD student enrolled in the National PhD in Artificial Intelligence
for Health and Life Sciences, XXXVIII Bis cycle, Università Campus Bio-Medico di Roma.
[12] M. Zielonka, A. Piastowski, A. Czyżewski, P. Nadachowski, M. Operlejn, K. Kaczor,
Recognition of emotions in speech using convolutional neural networks on diferent datasets,
Electronics 11 (2022).
[13] A. v. d. Oord, S. Dieleman, H. Zen, K. Simonyan, O. Vinyals, A. Graves, N. Kalchbrenner,
A. Senior, K. Kavukcuoglu, Wavenet: A generative model for raw audio, arXiv preprint
arXiv:1609.03499 (2016).
[14] S. K. Pandey, H. S. Shekhawat, S. Prasanna, Emotion recognition from raw speech using
wavenet, in: TENCON 2019-2019 IEEE Region 10 Conference (TENCON), IEEE, 2019, pp.
1292–1297.
[15] J. Wang, M. Xue, R. Culhane, E. Diao, J. Ding, V. Tarokh, Speech emotion recognition with
dual-sequence lstm architecture, in: ICASSP 2020-2020 IEEE International Conference on
Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2020, pp. 6474–6478.
[16] C.-W. Huang, S. S. Narayanan, Deep convolutional recurrent neural network with attention
mechanism for robust speech emotion recognition, in: 2017 IEEE international conference
on multimedia and expo (ICME), IEEE, 2017.
[17] G. Trigeorgis, F. Ringeval, R. Brueckner, E. Marchi, M. A. Nicolaou, B. Schuller, S. Zafeiriou,
Adieu features? end-to-end speech emotion recognition using a deep convolutional
recurrent network, in: 2016 IEEE international conference on acoustics, speech and signal
processing (ICASSP), IEEE, 2016, pp. 5200–5204.
[18] G. Tamulevičius, G. Korvel, A. B. Yayak, P. Treigys, J. Bernatavičienė, B. Kostek, A study
of cross-linguistic speech emotion recognition based on 2d feature spaces, Electronics 9
(2020) 1725.
[19] P. Ekman, W. V. Friesen, Facial action coding system, Environmental Psychology &amp;</p>
      <p>Nonverbal Behavior (1978).
[20] A. Verma, D. Malla, A. K. Choudhary, V. Arora, A detailed study of azure platform &amp; its
cognitive services, in: 2019 International conference on machine learning, big data, cloud
and parallel computing (COMITCon), IEEE, 2019, pp. 129–134.
[21] D. McDuf, A. Mahmoud, M. Mavadati, M. Amr, J. Turcot, R. e. Kaliouby, Afdex sdk: a
cross-platform real-time multi-face expression recognition toolkit, in: Proceedings of the
2016 CHI conference extended abstracts on human factors in computing systems, 2016, pp.
3723–3726.
[22] L. Shu, J. Xie, M. Yang, Z. Li, Z. Li, D. Liao, X. Xu, X. Yang, A review of emotion recognition
using physiological signals, Sensors 18 (2018) 2074.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>T.</given-names>
            <surname>Holz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dragone</surname>
          </string-name>
          ,
          <string-name>
            <surname>G. M. O'Hare</surname>
          </string-name>
          ,
          <article-title>Where robots and virtual agents meet: a survey of social interaction research across milgram's reality-virtuality continuum</article-title>
          ,
          <source>International Journal of Social Robotics</source>
          <volume>1</volume>
          (
          <year>2009</year>
          )
          <fpage>83</fpage>
          -
          <lpage>93</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M.</given-names>
            <surname>Wooldridge</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. R.</given-names>
            <surname>Jennings</surname>
          </string-name>
          ,
          <article-title>Intelligent agents: Theory and practice, The knowledge engineering review 10 (</article-title>
          <year>1995</year>
          )
          <fpage>115</fpage>
          -
          <lpage>152</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>G. M. O'Hare</surname>
            ,
            <given-names>B. R.</given-names>
          </string-name>
          <string-name>
            <surname>Dufy</surname>
          </string-name>
          ,
          <article-title>Agent chameleons: Migration and mutation within and between real and virtual spaces</article-title>
          ,
          <source>The Society for the Study of Artificial Intelligence and the Simulation of Behaviour AISB</source>
          (
          <year>2002</year>
          )
          <fpage>3</fpage>
          -
          <lpage>5</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>P. F.</given-names>
            <surname>Gomes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sardinha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. Márquez</given-names>
            <surname>Segura</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Cramer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Paiva</surname>
          </string-name>
          ,
          <article-title>Migration between two embodiments of an artificial pet</article-title>
          ,
          <source>International Journal of Humanoid Robotics</source>
          <volume>11</volume>
          (
          <year>2014</year>
          )
          <fpage>1450001</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>K.</given-names>
            <surname>Hassani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.-S.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <article-title>On designing migrating agents: from autonomous virtual agents to intelligent robotic systems</article-title>
          ,
          <source>in: SIGGRAPH Asia 2014 Autonomous Virtual Humans and Social Robot for Telepresence</source>
          ,
          <year>2014</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>10</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Imai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Ono</surname>
          </string-name>
          , T. Etani,
          <article-title>Agent migration: communications between a human and robot</article-title>
          ,
          <source>in: IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No. 99CH37028)</source>
          , volume
          <volume>4</volume>
          , IEEE,
          <year>1999</year>
          , pp.
          <fpage>1044</fpage>
          -
          <lpage>1048</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>K. L.</given-names>
            <surname>Koay</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. S.</given-names>
            <surname>Syrdal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. L.</given-names>
            <surname>Walters</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Dautenhahn</surname>
          </string-name>
          ,
          <article-title>A user study on visualization of agent migration between two companion robots (</article-title>
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>R.</given-names>
            <surname>Tejwani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Moreno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Jeong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. W.</given-names>
            <surname>Park</surname>
          </string-name>
          , C. Breazeal,
          <article-title>Migratable ai: Efect of identity and information migration on users' perception of conversational ai agents</article-title>
          ,
          <source>in: 2020 29th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN)</source>
          , IEEE,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A.</given-names>
            <surname>Dzedzickis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kaklauskas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Bucinskas</surname>
          </string-name>
          ,
          <article-title>Human emotion recognition: Review of sensors and methods</article-title>
          ,
          <source>Sensors</source>
          <volume>20</volume>
          (
          <year>2020</year>
          )
          <fpage>592</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>P.</given-names>
            <surname>Ekman</surname>
          </string-name>
          , et al.,
          <string-name>
            <surname>Basic</surname>
            <given-names>emotions</given-names>
          </string-name>
          ,
          <source>Handbook of cognition and emotion 98</source>
          (
          <year>1999</year>
          )
          <fpage>16</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Russell</surname>
          </string-name>
          ,
          <article-title>A circumplex model of afect</article-title>
          .,
          <source>Journal of personality and social psychology 39</source>
          (
          <year>1980</year>
          )
          <fpage>1161</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>