<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Workshop on sociAL roboTs for peRsonalized, continUous and adaptIve aSsisTance,
Workshop on Behavior Adaptation and Learning for Assistive Robotics, Workshop on Trust, Acceptance and Social Cues in
Human-Robot Interaction, and Workshop on Weighing the benefits of Autonomous Robot persoNalisation. August</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Use of Irony and Sarcasm for Uncertainty in HRI</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mario Barbato</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alessandra Rossi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Silvia Rossi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Electrical Engineering and Information Technologies, University of Naples "Federico II"</institution>
          ,
          <addr-line>Piazzale Tecchio 80, 80125, Naples</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2024</year>
      </pub-date>
      <volume>26</volume>
      <issue>2024</issue>
      <fpage>0000</fpage>
      <lpage>0003</lpage>
      <abstract>
        <p>Autonomous robots are being used in human-centred environments, such as ofices, restaurants, hospitals and private homes, for carrying out collaborative and cooperative tasks. These activities require that robots engage people in socially acceptable ways, even when they make errors. It is very common that robots make communication failures due to technical or environmental limitations, such as mismatch of multimodal observations. While these errors cannot be entirely avoided, it is still necessary to minimize them. In this paper, we want to use sarcasm by using contrasting multiple cues, both verbal and non-verbal, for allowing a robot to hide its uncertainty of the interaction signals. The results indicate some diferences between the two attitudes, such as in the robot's independence and assertiveness.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;HRI</kwd>
        <kwd>uncertainty</kwd>
        <kwd>humour</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Social robotics is a rapidly developing field. Thanks to advancements in hardware and software,
encountering a robot is becoming an increasingly common event: from hospitals, where they interact
with both children and older people, to museums as guides, and even in restaurants to serve customers
[
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]. Once placed in these unsupervised scenarios, the likelihood that a robot may make errors
increases, in particular when robots need to interpret the interactions with humans via multiple signals.
Disruptions such as ambient noise, poor lighting, or the higher dynamism of a real-world context
often lead to contrasting or uncertain signals while, as a consequence, produce social failures during a
human-robot interaction (HRI). Social failures are errors that violate social norms and can degrade the
perception of the robot’s social and afective abilities [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]: not listening to the interlocutor, interrupting
while they are speaking, or changing the subject without reason are just a few examples of social failures.
In the field of HRI, it has been necessary to study behavioural techniques to mitigate the problem.
One of these techniques is the use of humour by the robot, inspired by human-human interactions.
Humour is pervasive in social relationships, being one of the most common ways to produce a positive
influence on others: it has been shown that the use of spontaneous humour makes individuals more
likeable and attractive in the eyes of others [
        <xref ref-type="bibr" rid="ref4 ref5">4, 5</xref>
        ], making them more friendly and improving the trust
conveyed [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. In situations free from specific tasks [ 7], such as making icebreaker jokes, telling puns
and then apologizing, contrasting serious topics with jokes to ease tension, and being self-deprecating,
humour generate laughs and empathy towards the agent. Even in more structured scenarios, such as
vaccination in a hospital or reception in a hotel, there have been advantages noted in personalities
endowed with humour compared to neutral ones, specifically in terms of engagement, likeability, ease
of interaction and empathy [8, 9]. Exploring the vast field of humour, we particularly focus on irony
and sarcasm. Some HRI researchers [10] have suggested that these can bring benefits to interaction, but
it is not easy to understand how to efectively incorporate them into the robot’s personality, since not
everyone has the same humour or sarcasm. In this work we decided to adopt the Incongruity Theory
[11], according to which humour is described as a process related to the experience of inconsistency,
focusing on unexpectedness and inappropriateness. The approach proposed in this study is based on
handling a delicate management of episodes where multimodal user feedback is deemed unreliable. In
these cases, the robot reacts sarcastically by contrasting verbal cue polarity (i.e., the spoken phrase)
with non-verbal cues, which include voice pitch and speed, facial expression (i.e., colour of LEDs),
and gestures. The goal is to elicit a positive, particularly amused, reaction from the user, avoiding the
unpleasant scenario where the interlocutor realizes that the robot did not actually understand, leading
to a poorer perception of the robot’s social and afective abilities.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. The Scenario</title>
      <p>The incongruity-based behaviour approach has been integrated into BRILLO (Bartending Robot for
Interactive Long Lasting Operations), a three-year national project aimed at creating an autonomous
robotic system capable of performing bartender tasks and interacting naturally with customers. The
typical BRILLO scenario involves a user and three interaction systems: a kiosk where the user
authenticates/registers to order a cocktail, the bartender robot that prepares the drinks, and optionally a waiter
robot tasked with serving the customer if they are at a table. Focusing on the bartender robot, which is
part of the system equipped with the proposed approach, it consists physically of a head, represented
by Furhat, a torso, and two robotic arms for drink preparation. From an interaction standpoint, a key
element is personalization and recommendation of both the drinks and the interaction. The robot adapts
its behaviour based on the context, classifying the current customer into one of several profiles (e.g.,
with a worker on a lunch break, the bartender will converse in a way that relaxes them, while with a
curious person, it will try to discuss various topics).</p>
    </sec>
    <sec id="sec-3">
      <title>3. The Use Case</title>
      <p>In this section, we present the acquisition and processing of user input, the decision algorithm for the
behaviour to be adopted, and non-verbal signals configurations, and the results of use case scenario
will be presented.</p>
      <sec id="sec-3-1">
        <title>3.1. Input Acquisition and Processing</title>
        <p>During the dialogue, two types of input are taken from the robot’s sensors: voice and face. The first
input is immediately processed by the robot’s speech-to-text module, which transmits the phrase to
the cloud service LUIS 1(Language Understanding Intelligent Service) to perform intent recognition,
necessary to understand the user’s will, highlighting the involved entities, and sentiment analysis to
calculate the sentence emotional polarity (positive, neutral or negative). The second input, in the form
of video, is sent to the Afectiva tool2 to recognize, by processing frame by frame on a cloud-based
Docker container, the facial expression (positive: happiness, surprise; neutral; negative: sadness, anger,
contempt, disgust and fear).</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Incoherent Behaviour Decision</title>
        <p>Once the inputs are processed, the core of the process begins: the decision on which behaviour the
robot should adopt, based on the user feedback polarity. Given the user’s facial expression and speech
polarities:
• If the user’s facial expression polarity is neutral, the robot will interact coherently, using verbal
and non-verbal signals with the same polarity as the user’s speech.
1LUIS https://www.luis.ai
2Afectiva SDK:
https://www.afectiva.com/science-resource/afdex-sdk-a-cross-platform-realtime-multi-face-expression-recognition-toolkit/
• If the user’s facial expression polarity is not neutral, it is compared with the user’s speech polarity:
– If the two user polarities are the same, a coherent behaviour will be chosen.
– If the two user polarities are diferent, an incoherent behaviour will be chosen: the robot’s
non-verbal signals will be opposite to its speech, simulating sarcasm.</p>
        <p>We observed that the LUIS output was more robust compared to that of Afectiva SDK . Example
of Pepper non-verbal cues are shown in Figure 1, while the full detailed non-verbal cues defined by
polarity are reported in Table 1.</p>
        <p>(a) Pepper in positive pose
(b) Pepper in negative pose</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Experimental Design</title>
        <p>To evaluate the proposed approach, an online study was conducted as a within-subject counterbalanced,
repeated measures study. The study was organised in three phases. Firstly, we collected demographic
information (i.e., age, gender, nationality, and previous experiences with robots), then we asked them to
complete the Italian adaptation of the Humor Styles Questionnaire [12], with the aim of finding the
participant humour style closest to the among:
• Afiliative : focuses on everyday life events, creating a sense of bonding with the listener.
• Self-enhancing: involves laughing at oneself and one’s abilities, often being perceived as humble.
• Aggressive: includes insults and anything aimed at putting someone else down, typical of
bullying.</p>
        <p>• Self-defeating: involves putting oneself down aggressively, often ending up ridiculing oneself.
Participants were tested either with Coherent or Incoherent robot’s behaviours. We asked participants
to watch two videos of Pepper welcoming a customer and entertaining them in small talk, such as
asking how they were doing. In both videos, the robot adopts the user sentence polarity for its verbal
reply; in the first video it is positive, in the second one it is negative. The distinction between the two
groups of participants occurs in the non-verbal signals: in the two videos shown, half of the participants
witness the robot’s coherent behaviour (non-verbal polarity matches verbal polarity), and the other half
see an incoherent behaviour (non-verbal polarity opposite to verbal polarity), hence sarcastic.</p>
        <p>At the end of each video, participants are asked to rate the robot using the Short Form Bem Sex
Role Inventory questionnaire [13], to evaluate its character traits, such as kindness, understanding,
aggressiveness. We used a 5-point Likert scale (from 1 - totally disagree, to 5 - totally agree).
(a) Incoherent vs. Coherent (Positive Response)
(b) Incoherent vs. Coherent (Negative Response)</p>
      </sec>
      <sec id="sec-3-4">
        <title>3.4. Results</title>
        <p>We collected responses of 63 respondents (50 male, 10 female, no non-binary), average age was 24.3
years. We analysed only 60 questionnaires, with 3 discarded due to incorrect completion. Only 23%
had previous experiences with robots, mostly as observers. Regarding humour style: 83.3% fell into the
Afiliative category, 10% into Self-enhancing, and 6.7% into Self-defeating.</p>
        <p>The two behaviours, coherent and incoherent, were compared given a certain polarity of the customer’s
response, positive or negative. Full means are reported in Figure 2. Regarding the positive incoherent
behaviours, we observed higher results in terms of Defence of one’s beliefs (2.70 vs. 2.20), Independence
(3.23 vs. 2.76), and Dominance (2.30 vs. 1.90), while the coherent mode was better in Warmth (3.30 vs.
3.00), Sympathetic (3.26 vs. 2.96), and Understanding (3.20 vs. 2.93). Regarding the negative incoherent
behaviours, we observed higher averages in Strong personality (2.83 vs. 2.13), Dominance (2.40 vs. 1.93),
and Assertiveness (2.90 vs. 2.40), whereas the coherent behaviour stood out in Sensitivity to the needs of
others (3.46 vs. 2.96) and Compassion (3.43 vs. 3.03). In both polarities, there was a tendency to perceive
the incoherent robot as more self-confident, probably because participants felt more surprised and
"threatened" by the sarcastic reaction. For the incoherent configuration group alone, the evaluations
between the two polarities were also compared. No significant diferences were noted. A T-Test was
conducted to check for statistically significant diferences between the two behaviours for each item of
the Short Form BSRI with relevant variance. We did not find any statistically significant diference.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusion</title>
      <p>In this study, we investigated people’s perceptions of a robot’s sarcastic behaviour, which have been
created by contrasting incoherent behaviours. The incoherent behaviours have been presented with
verbal and non-verbal cues communicating positive and negative afective expressions. Our results
showed that the robot’s incoherent behaviour was perceived as more self-confident and assertive
compared to the coherent modality, which was rated as more warm and gentle. The diferences were
probably determined by the contrasts between the two attitudes. However, these results were not
supported by statistical significance. Future works will test this incoherent approach in a real-bar
interaction, by developing incongruency through facial emotions, and by exploring diferent humour
styles.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Acknowledgments</title>
      <p>This work has been supported by Italian PON R&amp;I 2014-2020 - REACT-EU Azione IV.4 (CUP
E65F21002920003, and Italian PON I&amp;C 2014-2020 within the BRILLO research project “Bartending
Robot for Interactive Long-Lasting Operations”, no. F/190066/01-02/X44
[7] Peter H. Kahn, Jolina H. Ruckert, Takayuki Kanda, Hiroshi Ishiguro, Heather E. Gary, and Solace
Shen. No joking aside: using humor to establish sociality in hri. In Proceedings of the 2014 ACM/IEEE
International Conference on Human-Robot Interaction, HRI ’14, page 188–189, New York, NY, USA,
2014. Association for Computing Machinery.
[8] Deborah L. Johanson, Ho Seok Ahn, JongYoon Lim, Christopher Lee, Gabrielle Sebaratnam, Bruce A.</p>
      <p>MacDonald, and Elizabeth Broadbent. Use of Humor by a Healthcare Robot Positively Afects
User Perceptions and Behavior. Technology, Mind, and Behavior, 1(2), 2020.
[9] Andreea Niculescu, Betsy Dijk, Anton Nijholt, Haizhou Li, and Sl See. Making social robots more
attractive: The efects of voice pitch, humor and empathy. International Journal of Social Robotics,
5:171–191, 2013.
[10] Tony Veale. A massive sarcastic robot: What a great idea! two approaches to the computational
generation of irony. In Proceedings of the 9th International Conference on Computational Creativity,
ICCC 2018, pages 120–127, 2018.
[11] Elizabeth E. Graham. The involvement of sense of humor in the development of social relationships.</p>
      <p>Communication Reports, 8(2):158–169, 1995.
[12] Ilaria Penzo, Enrichetta Giannetti, Cristina Stefanile, and Saulo Sirigatti. Stili umoristici e possibili
relazioni con il benessere psicologico secondo una versione italiana dello humor styles
questionnaire (hsq) [humor styles and possible relationship with psychological well-being according to an
italian version of the humor styles questionnaire (hsq)]. Psicologia della Salute, 2:49–68, 2011.
[13] Namok Choi, Dale R. Fuqua, and Jody L. Newman. Exploratory and confirmatory studies of the
structure of the bem sex role inventory short form with two divergent samples. Educational and
Psychological Measurement, 69(4):696–705, 2009.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Claudia</given-names>
            <surname>Di</surname>
          </string-name>
          <string-name>
            <surname>Napoli</surname>
          </string-name>
          , Giovanni Ercolano, and
          <string-name>
            <given-names>Silvia</given-names>
            <surname>Rossi</surname>
          </string-name>
          .
          <article-title>Personalized home-care support for the elderly: a field experience with a social robot at home. User Modeling</article-title>
          and
          <string-name>
            <surname>User-Adapted</surname>
            <given-names>Interaction</given-names>
          </string-name>
          ,
          <volume>33</volume>
          (
          <issue>2</issue>
          ):
          <fpage>405</fpage>
          -
          <lpage>440</lpage>
          ,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Deirdre</surname>
            <given-names>E Logan</given-names>
          </string-name>
          , Cynthia Breazeal, Matthew S Goodwin, Sooyeon Jeong,
          <string-name>
            <surname>Brianna O'Connell</surname>
            ,
            <given-names>Duncan</given-names>
          </string-name>
          <string-name>
            <surname>Smith-Freedman</surname>
            ,
            <given-names>James</given-names>
          </string-name>
          <string-name>
            <surname>Heathers</surname>
            , and
            <given-names>Peter</given-names>
          </string-name>
          <string-name>
            <surname>Weinstock</surname>
          </string-name>
          .
          <article-title>Social robots for hospitalized children</article-title>
          .
          <source>Pediatrics</source>
          ,
          <volume>144</volume>
          (
          <issue>1</issue>
          ),
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Leimin</given-names>
            <surname>Tian</surname>
          </string-name>
          and
          <string-name>
            <given-names>Sharon</given-names>
            <surname>Oviatt</surname>
          </string-name>
          .
          <article-title>A taxonomy of social errors in human-robot interaction</article-title>
          . J.
          <string-name>
            <surname>Hum</surname>
          </string-name>
          .-Robot
          <string-name>
            <surname>Interact</surname>
          </string-name>
          .,
          <volume>10</volume>
          (
          <issue>2</issue>
          ),
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Charles</surname>
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Wilson</surname>
          </string-name>
          . Jokes: Form, Content, Use, and
          <article-title>Function. European monographs in social psychology</article-title>
          . European Association of Experimental Social Psychology by Academic Press,
          <year>1979</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Arnie</given-names>
            <surname>Cann</surname>
          </string-name>
          , Lawrence Calhoun, and
          <string-name>
            <given-names>Janet</given-names>
            <surname>Banks</surname>
          </string-name>
          .
          <article-title>On the role of humor appreciation in interpersonal attraction: It's no joking matter</article-title>
          .
          <source>Humor-international Journal of Humor Research - HUMOR</source>
          ,
          <volume>10</volume>
          :
          <fpage>77</fpage>
          -
          <lpage>90</lpage>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>WIlliam P.</given-names>
            <surname>Hampes</surname>
          </string-name>
          .
          <article-title>The relationship between humor and trust</article-title>
          .
          <source>HUMOR</source>
          ,
          <volume>12</volume>
          (
          <issue>3</issue>
          ):
          <fpage>253</fpage>
          -
          <lpage>260</lpage>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>