<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Determination of Reflective User Engagement in Argumentative Dialogue Systems</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Annalena Aicher</string-name>
          <email>annalena.aicher@uni-ulm.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Wolfgang Minker</string-name>
          <email>wolfgang.minker@uni-ulm.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stefan Ultes</string-name>
          <email>stefan.ultes@daimler.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>CMNA'21: Workshop on Computational Models of Natural Argument</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Mercedes Benz AG</institution>
          ,
          <addr-line>Stuttgart</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Ulm University, Institute of Communications Engineering</institution>
          ,
          <addr-line>Albert-Einstein-Allee 43, 89081 Ulm</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this work we propose to our knowledge the first approach to determine the reflective user engagement (RUE) during an argumentative dialogue. Therefore, we review state-of-the-art literature definitions for reflective engagement (RE) and approaches to measure the latter. Given some basic characteristics the argumentative dialogue system has to provide, we derive a formula to determine the RE taking into account the argument structure and the respective current position at each state of the dialogue. Reflective User Engagement, Argumentative Dialogue Systems, Bipolar Argumentation Structures A natural way of resolving diferent points of view or forming an opinion for humans is through conversation, i.e., through the exchange of arguments. Due to the vast amount of diferent available information people tend to focus on a biased subset of sources that repeat or strengthen an already established or convenient opinion which is furthermore reinforced by iflter algorithms [ 20]. In order to avoid the (often unconscious) process of intellectual isolation, we suggested an approach to explore large amounts of diverging information in a natural and intuitive way[1]. On this basis we aim for a system that provides an engaging form of interaction via natural language and encourages users to address diverging points of view and to scrutinize information. In order to foster a dialogue conveying a balanced discussion of topics, we will extract reward signals required for reinforcement learning from properties of the argumentative dialogue between the user and the system. In particular one property is the RUE, denoting the criticalthinking and open-mindedness demonstrated by the user in the interaction with the system. In their study [16] Masrek et al. showed that user engagement is a strong predictor of user satisfaction and thus, crucial to keep the users motivated to talk to the system and confront themselves with diverging arguments. Therefore, we derive an in-dialogue calculation for RUE taking into account the argument structure and user behavior during the dialogue. The remainder of this paper is as follows: the overview over the related work in Section 2</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>(S. Ultes)
is followed by Section 3 describing our proposed derivation of the RUE after explaining the
dialogue model upon which the former is based. In Section 4 we conclude by summarizing the
presented ideas and give a short outlook.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>In general, O’Brien et al. [18] define engagement as the ‘quality of user experiences with
technology that is characterized by challenge, aesthetic and sensory appeal, feedback, novelty,
interactivity, perceived control and time, awareness, motivation, interest, and afect’. Lalmas et
al. [13] specify user engagement to be the quality of the user experience that emphasizes the
positive aspects of interacting with an online application and the desire to use it longer and
repeatedly.</p>
      <p>As user engagement is a very complex phenomenon there exist numerous of (potential)
measurement approaches. Common ways to evaluate user engagement include using self-report
measures like questionnaires [7, 21]; observational methods such as facial expression analysis [6];
neuro-physiological signal processing methods for example cardiovascular accelerations [13].
Oh et al. [19] suggest a measurement and structural model for empirically capturing the meaning
and process of user engagement in the context of interactive media. They chose four attributes,
i.e. physical interaction, interface assessment, absorption, and digital outreach.
Other studies ([2, 4]) examined the variety of time measures, cursor movement, and eye tracking
data, in addition to self-reported items and click data. Lalmas et al. [13] give an overview
on techniques based on physiological measurement, such as bodily and brain response and
function [17] and eye tracking [3]. Correlations between gaze tracking and cursor tracking
are discussed by [10]. Measures based on web analytics include online behavioral metrics e.g.
click-through rates [22], number of page views [11], time spent on a site (i.e., dwelltime [8, 26])
and frequency of return visits [14].</p>
      <p>Still, as stated by Arapakis et al. [2] it is important to move beyond the ‘legacy of the click’
and consider cognitive and afective factors of engagement. Silpasuwanchai [ 25] et al. relate
cognitive engagement to the sense of involvement, focused attention, and deep reflection.
Prado-Romero et al. [23] propose to use anomaly detection for finding ’influential’ and ’open
minded’ individuals in the Twitter network. Their approach is based on the InterScore anomaly
detection algorithm, identifying users with an anomalous number of out- and in-edges.
According to Haim et al. [9] open-mindedness correlates with linguistic style accommodation 1
and relates to the assumed speaker role in diferent contexts. In contrast to our work, these
approaches are not (preliminary) concerned with determining content-related open-mindedness.
According to [5, 15, 24] reflective engagement (RE) refers to learners’ continual and active
participation in their problem inquiry with a continuous and critical judgment of inquiry process
and inquiry outcomes for possible improvement. Most approaches that describe RE are strongly
connected to teaching-learning processes [12, 25]. Instead we consider a more general definition,
which refers to the user’s motivation in scrutinizing arguments and exploring diverging views.
In extension to existing literature we propose a calculation approach extracted from the user
1Linguistic style accommodation denotes the ’unconscious process in which a speaker accommodates their
communicative behavior with respect to the communication partner’ [9]
behavior (actions) instead of solely relying on self-report measures.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Reflective User Engagement in BEA</title>
      <p>In the following we shortly point out the main characteristics of the dialogue model used in the
argumentative dialogue system BEA [1]. Based on this model we then calculate the RUE.</p>
      <sec id="sec-3-1">
        <title>3.1. Dialogue model</title>
        <p>The interaction between the ADS and the user is separated in turns, consisting of a user action
and corresponding natural language answer of the system. The possible actions (moves) the
user is able to choose from, depend on the position of the current argument (root / parent node /
’leaf’ node). Due to limited space we will focus only on the moves which are relevant to derive
the RUE.</p>
        <p>To prevent the user from being overwhelmed by the amount of information, the user is able to
navigate incrementally through the argument structure resembling the one of a tree based on
bipolar argument structures. These structures depict support or attack relations between the
arguments (nodes) in a graph. We choose a non-cyclic tree structure, where each node (’parent’)
is supported or attacked by its ’children’. If no children exist, the node is a leaf and marks the
end of a branch. Usually a single major claim formulates the overall topic, representing the root
node in the graph.</p>
        <p>The user is able to specify if he enquires for a supporting (pro) or attacking (con) argument on
the current argument. For a better understanding, we will consider the following example. Let
the topic of the discussion be concerned with the question whether to stay in a certain hotel
or not. One aspect of the discussion might be the service of the hotel. Thus, the user can e.g.
request more information by stating: ’I would like to hear a supporting/contradicting argument
for the claim, that the service of the hotel is very good.’
At any time during the conversation the user is able to ascend the argument branch (level up to
the ‘parent’ node) and descend on another unknown branch (targeting the parent node) again.
But in doing one will not be able to return to the previous branch, especially if one has not heard
all arguments, these arguments will be ‘dropped’. In this case we assumed that either the user
lost interest in the current argument or received in his/her perception suficient information.
This is important to keep in mind for the following derivation.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Derivation of the Reflective User Engagement</title>
        <p>We propose an approach based on Yi et al. [26], who correlate rather short website content and
long browsing time with great user interest. In analogy to this a user who inquires for more
information is more engaged. Recalling our previous definition of reflective engagement as the
user’s interest scrutinizing arguments and exploring diverging views. This can be mapped to
the two actions of the user asking for more information, either pro or con sides of the current
argument2. Thus, the more arguments of both sides are heard, the higher is the RUE. The
2BEA visualizes all subtrees of the current argument, such that the user knows exactly how many arguments
are available. This is crucial as we assume that unvisited arguments are intended and not just missed by mistake
highest RUE is given if the same number of pro and con arguments are heard. To take a potential,
data-related bias (#pro ≠ #con) into account, we introduce the characteristic function 1. It
considers if at least one pro/con pair has been heard and if so, makes it possible to consider
single additional arguments, which have been heard. Thus, we define:</p>
        <sec id="sec-3-2-1">
          <title>1, if ∃ visited pro/con pairs</title>
          <p>p visited = { 1, if no pro/con pairs exist .</p>
          <p>
            0, if ∄ visited pro/con pairs
(
            <xref ref-type="bibr" rid="ref1">1</xref>
            )
          </p>
          <p>For example, if we consider the simple argument subtree structure shown in 1, on each level
both arguments (pro and con) have to be heard such that the characteristic function  p visited = 1.
If only one side is heard, e.g. solely C2 or only C3 it follows  p visited = 0 for the respective
Level 2. Likewise this follows for Level 3, in case just C4 or C5 are heard.</p>
          <p>As the RUE reflects critical thinking and openmindedness of the user, we weight a balanced
relation of pro and con pairs higher than the exploration of solely the pro or con side of an
argument. We choose to weight all visited pro/con pairs with a factor   &gt; 0.5 and all single
arguments with (1 −   ) &lt; 0.5. Without loss of generality, if no pro/con pairs exist for level
 + 1 it follows:
and vice versa, if no single arguments exist
  ∶= 0;</p>
          <p>The  is recommended to be chosen depending on the relation between pro/con pairs and single
pro or con arguments.</p>
          <p>, =</p>
          <p>,(−)
∑=1
 max−</p>
          <p>,
 ,1 =</p>
          <p>1
∑3=−11 
= .</p>
          <p>
            1
3
(
            <xref ref-type="bibr" rid="ref2">2</xref>
            )
(
            <xref ref-type="bibr" rid="ref3">3</xref>
            )
(
            <xref ref-type="bibr" rid="ref4">4</xref>
            )
(
            <xref ref-type="bibr" rid="ref5">5</xref>
            )
If we look e.g. at C3 with  = 1 and  = 2 and assume all C1-C5 have been heard, we get
where  ,(−)
          </p>
          <p>denotes the depth of the level  with respect to the level of parent node at level  .
To avoid an over-representation of levels with only few arguments while levels with many
arguments will be under-represented, we define a weight   which takes the diferent sizes of
levels into account. Thus, we relate the number of descendants of the respective level  to all
descendants such that
 , =</p>
          <p>#pro + #con
 max
∑=+1
#pro + #con</p>
        </sec>
        <sec id="sec-3-2-2">
          <title>RUE at the parent node at  it follows:</title>
          <p>where #pro , #con denotes the number of all pro, con arguments. Again assuming that all
claims C1-C5 have been heard, it follows for the level  = 1 that  ,1 = 24 = 0.5. For the overall
The resulting RUE of a parent node   for the single level  can therefore be determined by:
  =  
,
engagement measure.
the root node.
follows, that  1 =   1
can be derived completely analogously.
where # (+1) denotes the number of child pro/con pairs at level  + 1 and  (+1) denotes the
number of single children at level  + 1 . Regarding the given example in Figure 1, for  = 1 it
1 = 1 if both C2 and C3 are heard. If only C2 or C3 are heard  1 = 0. For  2
When considering hierarchical argumentation structures, arguments at the beginning of a
branch are more general than ones at deeper levels. Due to this we introduce a hierarchical
weight   in order to incorporate the diferent levels of argument depth into our reflective
Therefore, a balanced exploring of lower levels will be assigned larger weight values than near
 
 =
∑=
 max−1  ,+1
∑=
 max−1  ,+1
 ,+1  
 ,+1
,  
 ∈ [0, 1],
which denotes the normalized sum over the weighted reflective user engagement values 
for each descending level  + 1,  + 2, ...,  max−13. Regarding our example the total RUE can be

derived by calculating all single values as shown above and afterwards taking the sum over the
respective products which is not shown in detail due to the limited scope of this paper.</p>
          <p>3Leaf nodes are not succeeded by arguments and RUE can only be determined for their parents.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusions and Outlook</title>
      <p>The purpose of this work is to present to our knowledge the first approach to calculate reflective
user engagement in an Argumentative Dialogue System. Given a bipolar argumentation graph
and fitting dialogue model, we propose a derivation which takes the depth, balance and number
of inquiries into account.</p>
      <p>In future work, we want to test the calculated RUE with simulated and real user data and explore
its suitability for RL. Our aim is to cooperatively provide as much balanced information as
possible, while adapting the system’s strategy to the RUE.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>This work has been funded by the DFG within the project “How to Win Arguments –
Empowering Virtual Agents to Improve their Persuasiveness”, Grant no. 376696351, as part of the Priority
Program “Robust Argumentation Machines (RATIO)” (SPP-1999).
[9] A. Haim and Oren Tsur. Open-mindedness and style coordination in argumentative
discussions. In EACL, 2021.
[10] Jef Huang, Ryen White, and Georg Buscher. User See, User Point: Gaze and Cursor
Alignment in Web Search, page 1341–1350. Association for Computing Machinery, New
York, NY, USA, 2012.
[11] Steve Jackson. Cult of Analytics: Driving online marketing strategies using web analytics.</p>
      <p>Routledge, 2009.
[12] Siu Cheung Kong and Yanjie Song. An experience of personalized learning hub initiative
embedding byod for reflective engagement in higher education. Computers &amp; Education,
88:227–240, 2015.
[13] M. Lalmas, H. O’Brien, and Elad Yom-Tov. Measuring user engagement. In Measuring User</p>
      <p>
        Engagement, 2014.
[14] Janette Lehmann, Mounia Lalmas, Georges Dupret, and Ricardo Baeza-Yates. Online
multitasking and user engagement. In Proceedings of the 22nd ACM International Conference
on Information and Knowledge Management, CIKM ’13, page 519–528, New York, NY, USA,
2013.
[15] Nona Lyons. Reflective engagement as professional development in the lives of university
teachers. Teachers and teaching, 12(
        <xref ref-type="bibr" rid="ref2">2</xref>
        ):151–168, 2006.
[16] Mohamad Noorman Masrek, Mohammad Hudzari Razali, Ishak Ramli, and Trias
Andromeda. User engagement and satisfaction: The case of web digital library. International
Journal of Engineering and Technology (UAE), 7(
        <xref ref-type="bibr" rid="ref4">4</xref>
        ):19–24, 2018.
[17] Maurizio Mauri, Pietro Cipresso, Anna Balgera, Marco Villamira, and Giuseppe Riva. Why
is facebook so successful? psychophysiological measures describe a core flow state while
using facebook. Cyberpsychology, Behavior, and Social Networking, 14(12):723–731, 2011.
[18] Heather L. O’Brien and Elaine G. Toms. What is user engagement? a conceptual framework
for defining user engagement with technology. JASIST, 59(
        <xref ref-type="bibr" rid="ref6">6</xref>
        ):938–955, 2008.
[19] Jeeyun Oh, Saraswathi Bellur, and S. Shyam Sundar. Clicking, assessing, immersing, and
sharing: An empirical model of user engagement with interactive media. Communication
Research, 45(
        <xref ref-type="bibr" rid="ref5">5</xref>
        ):737–763, 2018.
[20] Eli Pariser. The filter bubble: How the new personalized web is changing what we read and
how we think. Penguin, 2011.
[21] Olga Perski, Ann Blandford, Claire Garnett, David Crane, Robert West, and Susan Michie.
      </p>
      <p>
        A self-report measure of engagement with digital behavior change interventions (DBCIs):
development and psychometric evaluation of the “DBCI Engagement Scale”. Translational
Behavioral Medicine, 10(
        <xref ref-type="bibr" rid="ref1">1</xref>
        ):267–277, 03 2019.
[22] Ashok Kumar Ponnuswami, Kumaresh Pattabiraman, Qiang Wu, Ran Gilad-Bachrach, and
Tapas Kanungo. On composition of a federated web search result page: Using online users
to provide pairwise preference for heterogeneous verticals. In Proceedings of the Fourth
ACM International Conference on Web Search and Data Mining, WSDM ’11, page 715–724,
New York, NY, USA, 2011. Association for Computing Machinery.
[23] Mario Alfonso Prado-Romero, Alberto Fernández Oliva, and Lucina García Hernández.
      </p>
      <p>
        Identifying twitter users influence and open mindedness using anomaly detection. In Yanio
Hernández Heredia, Vladimir Milián Núñez, and José Ruiz Shulcloper, editors, Progress
in Artificial Intelligence and Pattern Recognition , pages 166–173, Cham, 2018. Springer
International Publishing.
[24] Gloria Jean Rodman. Facilitating the teaching-learning process through the reflective
engagement of pre-service teachers. Australian Journal of Teacher Education, 35(
        <xref ref-type="bibr" rid="ref2">2</xref>
        ):20–34,
2010.
[25] Chaklam Silpasuwanchai, Xiaojuan Ma, Hiroaki Shigemasu, and Xiangshi Ren.
Developing a comprehensive engagement framework of gamification for reflective learning. In
Proceedings of the 2016 ACM Conference on Designing Interactive Systems, pages 459–472,
2016.
[26] Xing Yi, Liangjie Hong, Erheng Zhong, Nanthan Nan Liu, and Suju Rajan. Beyond clicks:
Dwell time for personalization. In Proceedings of the 8th ACM Conference on Recommender
Systems, RecSys ’14, page 113–120, New York, NY, USA, 2014. Association for Computing
Machinery.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Annalena</given-names>
            <surname>Aicher</surname>
          </string-name>
          , Niklas Rach, Wolfgang Minker, and
          <string-name>
            <given-names>Stefan</given-names>
            <surname>Ultes</surname>
          </string-name>
          .
          <article-title>Opinion building based on the argumentative dialogue system bea</article-title>
          .
          <source>Increasing Naturalness and Flexibility in Spoken Dialogue Interaction: 10th IWSDS</source>
          , pages
          <fpage>307</fpage>
          -
          <lpage>318</lpage>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Ioannis</given-names>
            <surname>Arapakis</surname>
          </string-name>
          , Mounia Lalmas, and
          <string-name>
            <given-names>George</given-names>
            <surname>Valkanas</surname>
          </string-name>
          .
          <article-title>Understanding within-content engagement through pattern analysis of mouse gestures</article-title>
          .
          <source>In Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management</source>
          , New York, NY, USA,
          <year>2014</year>
          .
          <article-title>Association for Computing Machinery</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Georg</given-names>
            <surname>Buscher</surname>
          </string-name>
          , Andreas Dengel, and Ludger van Elst.
          <article-title>Query expansion using gaze-based feedback on the subdocument level</article-title>
          .
          <source>In Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, page 387-394</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Georges</given-names>
            <surname>Dupret</surname>
          </string-name>
          and
          <string-name>
            <given-names>Mounia</given-names>
            <surname>Lalmas</surname>
          </string-name>
          .
          <article-title>Absence time and user engagement: Evaluating ranking functions</article-title>
          .
          <source>In Proceedings of the Sixth ACM International Conference on Web Search and Data Mining</source>
          , New York, NY, USA,
          <year>2013</year>
          .
          <article-title>Association for Computing Machinery</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Fiona</given-names>
            <surname>Farr</surname>
          </string-name>
          and
          <string-name>
            <given-names>Elaine</given-names>
            <surname>Riordan</surname>
          </string-name>
          .
          <article-title>Students' engagement in reflective tasks: an investigation of interactive and non-interactive discourse corpora</article-title>
          .
          <source>Classroom Discourse</source>
          ,
          <volume>3</volume>
          (
          <issue>2</issue>
          ):
          <fpage>129</fpage>
          -
          <lpage>146</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Joseph</given-names>
            <surname>Grafsgaard</surname>
          </string-name>
          , Joseph B Wiggins, Kristy Elizabeth Boyer,
          <string-name>
            <surname>Eric N Wiebe</surname>
            , and
            <given-names>James</given-names>
          </string-name>
          <string-name>
            <surname>Lester</surname>
          </string-name>
          .
          <article-title>Automatically recognizing facial expression: Predicting engagement and frustration</article-title>
          .
          <source>In Educational Data Mining</source>
          <year>2013</year>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Barbara</surname>
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Greene</surname>
          </string-name>
          .
          <article-title>Measuring cognitive engagement with self-report scales: Reflections from over 20 years of research</article-title>
          .
          <source>Educational Psychologist</source>
          ,
          <volume>50</volume>
          (
          <issue>1</issue>
          ):
          <fpage>14</fpage>
          -
          <lpage>30</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Qi</given-names>
            <surname>Guo</surname>
          </string-name>
          and
          <string-name>
            <given-names>Eugene</given-names>
            <surname>Agichtein</surname>
          </string-name>
          .
          <article-title>Beyond dwell time: Estimating document relevance from cursor movements and other post-click searcher behavior</article-title>
          .
          <source>In Proceedings of the 21st International Conference on World Wide Web, WWW '12, page 569-578</source>
          , New York, NY, USA,
          <year>2012</year>
          .
          <article-title>Association for Computing Machinery</article-title>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>