<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>On The Role of Dialogue Models in the Age of Large Language Models</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Simon Wells</string-name>
          <email>s.wells@napier.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mark Snaith</string-name>
          <email>m.snaith@rgu.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>CMNA'23: International Workshop on Computational Models of Natural Argument</institution>
          ,
          <addr-line>2023, London</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Edinburgh Napier University</institution>
          ,
          <addr-line>10 Colinton Road, Edinburgh, EH10 5DT, Scotland</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Robert Gordon University</institution>
          ,
          <addr-line>Garthdee House, Aberdeen, AB10 7QB, Scotland</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Workshop Proceedings We argue that Machine learning, in particular the currently prevalent generation of Large Language Models (LLMs) [1], can work constructively with existing normative models of dialogue as exemplified by dialogue games [ 2], specifically their computational applications within, for example, inter-agent communication [3] and automated dialogue management [4]. Furthermore we argue that this relationship is bi-directional, that some uses of dialogue games benefit from increased functionality due to the specific capabilities of LLMs, whilst LLMs benefit from externalised models of, variously, problematic, normative, or idealised behaviour. Machine Learning (ML) approaches, especially LLMs , appear to be making great advances against long-standing Artificial Intelligence challenges. In particular, LLMs are increasingly achieving successes in areas both adjacent to, and overlapping with, those of interest to the Computational Models of Natural Argument community. A prevalent opinion, not without some basis, within the ML research community is that many, if not all, AI challenges, will eventually be solved by ML models of increasing power and utility, negating the need for alternative or traditional approaches. An exemplar of this position, is the study of distinct models of dialogue for inter-agent communication when LLM based chatbots are increasingly able to surpass their performance in specific contexts. The trajectory of increased LLM capabilities suggests no reason that this trend will not continue, at least for some time. However, it is not the case that only the one, or the other approach, is necessary. Despite a tendency for LLMs to feature creep, and to appear to subsume additional areas of study, there are very good reasons to consider three modes of study of dialogue. Firstly, LLMs as their own individual field within ML, secondly, dialogue both in terms of actual human behaviour, which can exhibit wide quality standards, but also in terms of normative and idealised models, and thirdly, the fertile area in which the two overlap and can operate collaboratively. It is this third aspect with which this paper is concerned, for the first will occur anyway as researchers seek to map out the boundaries of what LLMs, as AI models, can actually achieve, and the second will continue, because the study of how people interact naturally through argument and dialogue will remain both fascinating and of objective value regardless of advances made in LLMs. However, where LLMs, Dialogue Models, and, for completion, people, come together, there is fertile ground for the development of principled models of interaction that are well-founded, well-regulated, and supportive of htp:/ceur-ws.org CEUR Workshop Proceedings (CEUR-WS.org) ISN1613-073</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>CEUR</p>
      <p>ceur-ws.org
https://www.simonwells.org/ (S. Wells); https://www3.rgu.ac.uk/dmstaff/snaith-mark/ (M. Snaith)</p>
      <p>
        © 2023 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
CEUR
Workshop
Proceedings
mixed-initiative interactions between humans and intelligent software agents [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>Our research has focused upon an investigation of the various activities and responsibilities
associated with the actors and systems that can engage in dialogue, identifying the strengths
and weaknesses of each. To this end we have constructed a characterisation of dialogue systems
that focuses upon the roles, responsibilities, and necessary abilities of the participating actors or
systems that comprise those actors. We attempted to characterise dialogues in three contexts;
where the actors within the dialogue are people, where the actors are software agents that
incorporate dialogue games, and where the actors are software agents that incorporate LLMs.
The aim was to delineate the kinds of roles, responsibilities, and capabilities that a dialogue
system needs and to determine how the responsibility for fulfilling these factors is spread across
agents within these three contexts. We then posed a series of “wh-Questions” (who, what , how,
which, when, why, where). The aim of this approach was to provide a new analytical tools for
considering what a dialogue systems needs to do, and, in the case of software agents, which
capabilities are delegated to, or fulfilled by which sub-systems.</p>
      <p>
        We then investigated examples of where LLMs are currently demonstrating utility in order to
benchmark actual LLM performance against other approaches. Dialogue systems, comprising
multiple components, achieve a variety of levels of capability in dialogue. Using humans as
an exemplar of agents who are generally capable of choosing what to say and when to say it
in a strategically useful way, we compare them firstly to software agents comprising various
combinations of non-LLM modules for dialogue, sentence generation, and strategic reasoning,
and subsequently to LLM behaviour. We then show how a dialogue game, utilising an existing
dialogue game execution platform [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], together with an LLM, can work together to achieve more
in aggregate using current technologies. Throughout we argue that LLMs, at least at present,
do not currently subsume traditional dialogue game research, but have an ancillary role, due to
their complimentary strengths, that can lead to great improvements in the ability of intelligent
agents to eventually engage in principled, well-structured, well-regulated, constructive, and
purposeful dialogue.
      </p>
      <p>Finally, we address the question of why, if LLMs are increasingly able to subsume the
functionality of other approaches, should research continue into other approaches, such as
dialogue games. We argue that dialogue games have been studied for a long time as a way to
understand dialogue dynamics and to yield models that capture and explain both normative and
ideal expectations for how dialogues should progress. Even if LLMs are trained to engage in
increasingly realistic dialogue, dialogue games will still have an important regulatory role to play.
This regulatory role utilises the dialogue game variously as ideal or normative model, depending
upon the circumstances, against which dialogue participants, including both humans and LLMs,
can self-evaluate, testing their own generated responses against the kind of ideal response
that a dialogue model would propose. In this way, we can still aspire towards higher quality,
computer-supported, argumentative dialogue as well as rich and naturalistic, human-machine
interaction.</p>
      <p>
        Next steps will involve further study of the useful interactions between LLMs and dialogue
models as well as automated benchmarking of the abilities of the resulting systems. One
approach might build upon the idea of the Arguing Agents Competition [
        <xref ref-type="bibr" rid="ref6 ref7">6, 7</xref>
        ].
      </p>
      <p>In summary, despite advances in ML based approaches to dialogue, traditional approaches to
dialogue modelling have a more important role to play than ever before.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>T.</given-names>
            <surname>Brown</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Mann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ryder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Subbiah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. D.</given-names>
            <surname>Kaplan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Dhariwal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Neelakantan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Shyam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Sastry</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Askell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Agarwal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Herbert-Voss</surname>
          </string-name>
          , G. Krueger,
          <string-name>
            <given-names>T.</given-names>
            <surname>Henighan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Child</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ramesh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Ziegler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Winter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Hesse</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Chen</surname>
          </string-name>
          , E. Sigler,
          <string-name>
            <given-names>M.</given-names>
            <surname>Litwin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Gray</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Chess</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Clark</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Berner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>McCandlish</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Radford</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Sutskever</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Amodei</surname>
          </string-name>
          ,
          <article-title>Language models are few-shot learners</article-title>
          , in: H.
          <string-name>
            <surname>Larochelle</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Ranzato</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Hadsell</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Balcan</surname>
          </string-name>
          , H. Lin (Eds.),
          <source>Advances in Neural Information Processing Systems</source>
          , volume
          <volume>33</volume>
          ,
          <string-name>
            <surname>Curran</surname>
            <given-names>Associates</given-names>
          </string-name>
          , Inc.,
          <year>2020</year>
          , pp.
          <fpage>1877</fpage>
          -
          <lpage>1901</lpage>
          . URL: https://proceedings.neurips.cc/paper_files/paper/ 2020/file/1457c0d6bfcb4967418bfb8ac142f64a-Paper.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>S.</given-names>
            <surname>Wells</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Reed</surname>
          </string-name>
          ,
          <article-title>A domain specific language for describing diverse systems of dialogue</article-title>
          ,
          <source>Journal of Applied Logic</source>
          <volume>10</volume>
          (
          <year>2012</year>
          )
          <fpage>309</fpage>
          -
          <lpage>329</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Wells</surname>
          </string-name>
          , Formal Dialectical Games in Multiagent Argumentation,
          <source>Ph.D. thesis</source>
          , School of Computing, University of Dundee,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>F.</given-names>
            <surname>Bex</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lawrence</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Reed</surname>
          </string-name>
          ,
          <article-title>Generalising argument dialogue with the dialogue game execution platform</article-title>
          .,
          <source>in: COMMA</source>
          ,
          <year>2014</year>
          , pp.
          <fpage>141</fpage>
          -
          <lpage>152</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.</given-names>
            <surname>Snaith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lawrence</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Reed</surname>
          </string-name>
          ,
          <article-title>Mixed initiative argument in public deliberation</article-title>
          ,
          <source>Online Deliberation</source>
          <volume>2</volume>
          (
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>S.</given-names>
            <surname>Wells</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Lozinski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. N.</given-names>
            <surname>Pham</surname>
          </string-name>
          ,
          <article-title>Towards an arguing agents competition: Architectural considerations</article-title>
          ,
          <source>in: Proceedings of the 8th International Workshop on Computational Models of Natural Argument (CMNA8)</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>T.</given-names>
            <surname>Yuan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Schulze</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Reed</surname>
          </string-name>
          ,
          <article-title>Towards an arguing agents competition: Building on argumento</article-title>
          ,
          <source>in: Proceedings of the 8th International Workshop on Computational Models of Natural Argument (CMNA8)</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>