<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>For Interpretable Response Selection In Conversational Modelling</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Malavika Suresh</string-name>
          <email>m.suresh@rgu.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Case Based Reasoning</institution>
          ,
          <addr-line>Conversational Modelling, Motivational Interviewing, Abstract Argumentation</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Robert Gordon University</institution>
          ,
          <addr-line>Aberdeen</addr-line>
          ,
          <country country="UK">United Kingdom</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2022</year>
      </pub-date>
      <abstract>
        <p>Current state-of-the-art dialogue systems are increasingly complex. When used in applications such as motivational interviewing, the lack of interpretability is a concern. CBR ofers to bridge this gap by using the most similar past cases to decide the outcome for a new problem, which then serves as a natural as well as accurate explanation of the outcome. This research proposes to extend the Abstract Argumentation CBR (AA-CBR) framework for predicting the next response type in an ongoing conversation by reusing the knowledge of previous conversations to achieve a desirable outcome for a new conversation context.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        due to the efectiveness of MI and lack of trained MI interviewers [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. While some work has
shown that large-scale pre-trained language models can be useful to train MI interviewers by
predicting responses [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], the model predictions are not explained. In such case, the decision
to accept or reject a model’s proposed response falls on the trainee. For example, without an
explanation of possible outcomes, it is possible that the trainee may reject a model’s suitable
proposal for a less suited response that they prefer. This is a concern as it is already a dificulty
for human MI practitioners to suppress the instinct to respond with premature advice [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
Equally possible is that an unsuitable model prediction may be accepted by the trainee, which
can be avoided when an explanation is available. Additionally, the type of response (i.e dialogue
strategy) needs to be adapted based on the individual as the same strategy may result in diferent
outcomes with diferent individuals. Thus, the interpretable CBR approach of using past cases
to decide the outcome for a new case could be better suited for this problem.
      </p>
      <p>
        CBR approaches to dialogue management have been studied in prior work [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], where
each utterance in a dialogue is considered as a case. In contrast, in this work we consider the
whole dialogue as a case and propose the use of Abstract Argumentation for CBR (AA-CBR) for
selecting next response type. As the framework represents past cases as a tree of arguments
attacking and defending each other (i.e an argument graph, as in Fig 1), it can provide natural
interactive explanations of the predicted outcomes [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
CEUR
Workshop
Proceedings
      </p>
      <p>The expected research contributions are - (i) case representation of an MI conversation,
(ii) adaptation and extension of AA-CBR for response type selection in MI conversations.
These contributions can be extended to any task-oriented conversational model that requires
interpretable response generation based on personalized context.</p>
      <sec id="sec-1-1">
        <title>1.1. Background</title>
        <p>
          This section illustrates an example to briefly summarize the AA-CBR framework originally
proposed by [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. AA-CBR is based on the argumentation framework [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] which is basically a set
of arguments and a binary attack relationship between them which defines whether ArgumentA
attacks ArgumentB. AA-CBR represents each case as a set of factors. When adding a new
factor to the case changes the case outcome, it is considered an attack. Argumentation rules
define the attack relationship between cases to then represent the case base as an argument
graph. For instance in Fig 1, {A,B,C} which is a more specific case (i.e more factors) attacks {A,B}
which is a less specific case with a diferent outcome. By inference, {A,B,C} defends {A} and
attacks the default case. Note that {A,B,C} does not directly attack the default case because {A}
already attacks the default case and {A} is more concise than {A,B,C}. The default case defines
the assumed outcome for any new case unless the default case is suficiently attacked by other
cases in the case base. A suficient attack against the default case occurs if all the unattacked
nodes of the graph attack the default case (i.e., none of the unattacked nodes defend the default
case).
        </p>
        <p>For a new case, the outcome is decided by first determining which, if any, of the historical
cases the new case attacks and subsequently inferring from the argument graph whether the
default outcome is attacked or defended. If the default outcome is defended, then the outcome
for the new case is the default outcome (in this example, negative). Argumentation rules define
that a new case attacks a past case if the past case factors are not contained in the new case 3.
Here, the new case {A,B,C,D} does not attack any case because all historical cases are a subset
of the factors of the new case. Since {A,B,C} is the closest unattacked case in the graph and by
inference it attacks the default case, the outcome for the new case is positive.</p>
        <p>Argumentation rules are also used to decide the outcome when multiple similar cases with
difering outcomes exist in the case base. For instance, in the above example, if the case base
3This ensures that factors which are not present in the new case (and thus deemed irrelevant) do not contribute to
the outcome
included a historical case such as ({A,D},-), both ({A,D},-) and ({A,B,C},+) would be similar cases.
By inference, the default outcome is now defended by at least one of the unattacked nodes
(since {A,D} attacks {A}) and is chosen as the outcome for the new case.</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>2. Research Plan</title>
      <p>This section lists the research objectives, defines associated terminologies and describes the
approaches considered. Fig 2 depicts an overview of the components in the research.</p>
      <p>The research aims to build an interpretable conversational model for MI. An AA-CBR based
approach is proposed to introduce interpretability when deciding the next response type. The
following research questions will be investigated through the listed objectives:
1. How can an MI conversation be represented as a case of factors?
• Identify a set of dialogue factors (i.e case attributes) to represent a given conversation
history, which forms the problem component of the case.
• Label each conversation as successful (good) or unsuccessful (bad), which forms the
case outcome.</p>
      <p>• Identify a set of counsellor response types, which forms the solution for the case.
2. How can AA-CBR be applied for case retrieval?
• Identify challenges in applying AA-CBR for MI conversations
• Extend AA-CBR for MI conversations
• Apply the extended AA-CBR framework and evaluate using a sample dataset
Definitions:</p>
      <p>Case: A case comprises the entire available conversation history, represented as a set of
dialogue factors and depicted as a node in the argument graph.</p>
      <p>Dialogue factors: Dialogue factors can capture both relevant content such as the topic of
the conversation as well as contextual features such as speaker sentiment and resistance or
willingness to change (called MI talk-type). These will be annotated against each utterance.</p>
      <p>Outcome: For MI, a good conversation outcome is either an explicit user expression of
satisfaction at the end of a conversation or an implicit change in user perspective.</p>
      <sec id="sec-2-1">
        <title>2.1. Approach / Methodology</title>
        <p>
          Construct case base: First, the right set of dialogue factors that capture the separation between
good and bad outcomes of an MI conversation need to be identified. Broadly, there are a few
types of dialogue factors that may be considered - (i) frame of mind factors (eg. sentiment, LIWC
markers [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ], MI talk-type [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]) which are indicative of psychological state (ii) conversation
topic (eg. addiction, weight-loss) (iii) linguistic factors (eg. utterance length, use of questions).
AA-CBR assumes factors to be independent of each other while here, some of them such as
sentiment and MI talk-type labels may be related and presence of one factor may entail the
other. Such relationships will also need to be investigated. The final set of dialogue factors
chosen will form the vocabulary knowledge container of the case base.
        </p>
        <p>
          The quality of the AA-CBR framework will depend on the quality of the extracted factors and
outcome labels. While public datasets such as [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] provide annotations for some factors, other
factors may need to be either freshly annotated or predicted. Given the dificulty in obtaining
expert annotations, this work will potentially adopt domain-transfer of well-studied models for
classifying non-MI factors such as sentiment [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] since words indicating sentiment polarity are
reasonably generalizable. Transparency of predictions will be enabled with explanation methods
such as feature relevance scores [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. The models will be trained for use in the continual learning
setting [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] so that new data can be used to improve the model throughout its life. The overall
approach towards case base construction is summarized as follows:
• Identify and define the dialogue factors to be used
• Annotate cases with factors (manually where expertise is available, or with existing
models in literature, or by training a classifier on a few annotated cases)
        </p>
        <p>Extend AA-CBR for response type selection: This work proposes to consider dialogue
factors from available conversation history at each turn of a conversation as factors in the
AA-CBR framework. Thus for a new case, the case representation would evolve with the
unfolding of the conversation over time. It is worth noting that in reality not all possible factors
may become available and new factors previously unseen in the case base may be added, which
are both supported by the AA-CBR framework, making it suitable for this use case.</p>
        <p>
          When retrieving a solution for a new case, some possible solutions (i.e, response types) are
reflective, neutral or advice [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] and which of these is best suited will depend on the client’s
current frame of mind and other personality traits. For each of these possible solutions, similar
cases can be retrieved and argumentation used to decide the outcome of choosing that solution.
Accordingly, the response type(s) that leads to a positive predicted outcome can then be chosen
for the next response from the counsellor. Thus, the AA-CBR framework argues why a particular
response type should be chosen over other response types for a new client, based on the previous
outcomes seen in the case base. Fig 3) depicts an example, where the default outcome for the
response type of giving advice is taken to be positive. However, giving advice when the speaker
sentiment is anger results in a negative outcome, and this node attacks the default outcome
node. For the new conversation, the argument graph will result in proposing a positive outcome
for choosing to advice, using the same logic as described in section 1.1.
        </p>
        <p>It is likely that the AA-CBR framework may not be directly suitable for conversations. For
instance, a crucial assumption in AA-CBR is that a particular combination of factors always
leads to the same outcome. However for conversations, this cannot be guaranteed in reality
and outcomes may be probabilistic. Also, considering the temporal aspect, the same factors
may appear at diferent times in the conversation and their ordering may result in diferent
outcomes (eg. anger at the beginning vs end of the conversation are diferent). Therefore, the
research will explore an extension of the AA-CBR framework to address such challenges.</p>
        <p>
          Generate responses and evaluate: For a given conversation context, the next response-type
as determined using the AA-CBR framework will be used as conditioning input to a suitable
natural language generation model. The generated response will then be evaluated for:
• How well the response aligns to the given input response-type: by comparing the semantic
similarity between outputs with and without the input conditioning.
• Whether the response-type conditioning can match the baseline performance using
non-interpretable generative models as in [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ], while providing interpretability.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Conclusion</title>
      <p>This research proposes the use of AA-CBR for an interpretable modelling of motivational
interviewing. The idea is to determine the response type at each counsellor turn in a new
conversation by comparing it to similar conversations in the case base. Specifically, the case
structure, i.e the representation of each conversation as a set of dialogue factors and the case
structure evolution as the conversation progresses will be investigated. Further, approaches for
extending the AA-CBR framework to allow for probabilistic attack relationships between cases
and multi-outcome case representations will be explored and will form the major contribution of
the research. The proposed work is currently in the initial stages and other research directions
such as case adaptation in AA-CBR may also be explored in the future.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>L.</given-names>
            <surname>Tavabi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Tran</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Stefanov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Borsari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Woolley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Scherer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Soleymani</surname>
          </string-name>
          ,
          <article-title>Analysis of behavior classification in motivational interviewing</article-title>
          ,
          <source>in: Proceedings of the Seventh Workshop on Computational Linguistics and Clinical Psychology: Improving Access</source>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>S.</given-names>
            <surname>Shen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Welch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Mihalcea</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Pérez-Rosas</surname>
          </string-name>
          ,
          <article-title>Counseling-style reflection generation using generative pretrained transformers with augmented context</article-title>
          ,
          <source>in: Proceedings of the 21th Annual Meeting of the Special Interest Group on Discourse and Dialogue</source>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>K.</given-names>
            <surname>Resnicow</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>McMaster</surname>
          </string-name>
          ,
          <article-title>Motivational interviewing: Moving from why to how with autonomy support, The international journal of behavioral nutrition and physical activity (</article-title>
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>N.</given-names>
            <surname>Inui</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Ebe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Indurkhya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Kotani</surname>
          </string-name>
          ,
          <article-title>A case-based natural language dialogue system using dialogue act</article-title>
          ,
          <source>in: IEEE International Conference on Systems, Man and Cybernetics</source>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>K.</given-names>
            <surname>Eliasson</surname>
          </string-name>
          ,
          <article-title>An integrated discourse model for a case-based reasoning dialogue system</article-title>
          ,
          <source>SAIS-SSL event on Artificial Intelligence and Learning Systems</source>
          (
          <year>2005</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>K.</given-names>
            <surname>Čyras</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Satoh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Toni</surname>
          </string-name>
          ,
          <article-title>Explanation for case-based reasoning via abstract argumentation</article-title>
          ,
          <source>in: Computational Models of Argument</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>K.</given-names>
            <surname>Cyras</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Satoh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Toni</surname>
          </string-name>
          ,
          <article-title>Abstract argumentation for case-based reasoning</article-title>
          ,
          <source>in: Fifteenth international conference on the principles of knowledge representation and reasoning</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>P. M.</given-names>
            <surname>Dung</surname>
          </string-name>
          ,
          <article-title>On the acceptability of arguments and its fundamental role in nonmonotonic reasoning, logic programming and n-person games</article-title>
          ,
          <source>Artificial Intelligence</source>
          (
          <year>1995</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>T.</given-names>
            <surname>Althof</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Clark</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Leskovec</surname>
          </string-name>
          ,
          <article-title>Large-scale analysis of counseling conversations: An application of natural language processing to mental health, Transactions of the Association for Computational Linguistics (</article-title>
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Balloccu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Helaoui</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Reiter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. R.</given-names>
            <surname>Recupero</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Riboni</surname>
          </string-name>
          , Anno-mi:
          <article-title>A dataset of expert-annotated counselling dialogues</article-title>
          , in: IEEE International Conference on Acoustics,
          <source>Speech and Signal Processing (ICASSP)</source>
          ,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>N.</given-names>
            <surname>Majumder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Poria</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Hazarika</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Mihalcea</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gelbukh</surname>
          </string-name>
          , E. Cambria,
          <string-name>
            <surname>Dialoguernn:</surname>
          </string-name>
          <article-title>An attentive rnn for emotion detection in conversations</article-title>
          ,
          <source>Proceedings of the AAAI Conference on Artificial Intelligence</source>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M.</given-names>
            <surname>Danilevsky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Qian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Aharonov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Katsis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Kawas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Sen</surname>
          </string-name>
          ,
          <article-title>A survey of the state of explainable AI for natural language processing</article-title>
          ,
          <source>in: Proceedings of the 10th International Joint Conference on Natural Language Processing</source>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>M.</given-names>
            <surname>Biesialska</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Biesialska</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. R.</surname>
          </string-name>
          <article-title>Costa-jussà, Continual lifelong learning in natural language processing: A survey</article-title>
          ,
          <source>in: Proceedings of the 28th International Conference on Computational Linguistics</source>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>