<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Road map for the implementation of a conversational agent chatbot consistent with the guidelines of the Design System Italy (DSI)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Davide Bruno</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Regione Toscana</institution>
          ,
          <addr-line>via di Novoli,27, Firenze, 50127</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The project idea intends to set some cornerstones for the design and subsequent implementation of a conversational agent based on intent and artificial intelligence generative with the aim of exploiting public service semantics (CPSV-AP_IT Core Public Service Vocabulary) to improve the interaction between citizens and the public sector (PS). 1Ital-IA 2024: 4th National Conference on Artificial Intelligence, organized by CINI, May 29-30, 2024, Naples, Italy davide.bruno@regione.toscana.it ;</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;generative artificial intelligence</kwd>
        <kwd>public sector (PS)</kwd>
        <kwd>citizen</kwd>
        <kwd>CPSV-AP_IT</kwd>
        <kwd>CEUR-WS 1</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Artificial Intelligence Generative will
certainly have a significant impact in the
Public sector (PS). This technology offers
opportunities to improve the efficiency,
transparency and quality of public services.
By implementing AI Generative-based
systems, PS can automate repetitive
processes, optimise data management and
improve interaction with citizens. aims to
improve productivity, accessibility and
efficiency in service delivery. It leverages
technologies such as machine learning,
natural language processing (NLP) and
natural language processing.
These tools, known as 'conversational
applications' or more commonly 'chatbots',
exploit the ability of machines to understand
natural language to handle requests and
interactions with citizens in a timely and
contextual manner. These applications will
make it possible to improve the relationship
with citizens, providing personalised and
rapid responses, thus helping to optimise
the delivery of public services.
These highly innovative solutions will have
to take into account the regulatory
framework of reference for the PA and, in
any case, avoid excessive dependence on
suppliers that could quickly become
technological lock-in.</p>
      <p>Another risk that should not be
underestimated at this time of feverish
excitement on the subject is 'overpromising'
and the illusion of perfection that comes
© 2024 Copyright for this paper by its authors. Use permitted under
Creative Commons License Attribution 4.0 International (CC BY 4.0).
from controlled demos where everything
seems perfect and everything is solved
simply by installing Artificial Intelligence
Generative, at the moment there are not
many solutions that are field-tested in real
situations and are not known.</p>
      <p>It is therefore crucial to choose the paradigm
for the development of the target
architecture and to identify generative AI
models.</p>
      <p>It will be possible to exploit the great
potential of generative AI if we take a
cautious and forward-looking responsible
approach
favouring open-source solutions or those
based on shared open standards so as to
guarantee PA flexibility and increase the
possibility of changing suppliers in the
future.</p>
      <p>PAs will have to develop in-house skills very
quickly by investing in staff training to make
them at least aware and capable of
expressing functional requirements, and not
in the acquisition phase, but as mentioned,
we must also be very vigilant in the
operation and pre-operation phase,
remembering that we are talking about new
technological solutions that will certainly
need to mature.
It will not be easy in this very dynamic and
exponentially exploding field of LLM
solutions, techniques and models.
However, lasting collaboration with other
public bodies, including research bodies,
must be encouraged and institutionalised in
order to share experiences and best
practices.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Reference Models and Goals</title>
      <p>The 'Core Public Service Vocabulary
Application Profile' (CPSV-AP) is a data
model designed to harmonise the
description of public services on
eGovernment portals. It provides a common
vocabulary to describe public services,
ensuring interoperability and
standardisation between different
implementations.2
The CPSV-AP aims at structuring public
service information, making it user-centred
and machine-readable, facilitating the
creation of a public service catalogue that is
interoperable and efficient. This vocabulary
is used by several EU countries, including
Italy, to describe public services and
associated life events in a standardised way.
In 2017, the Agency for Digital Italy (AgID)
published, in the OntoPiA controlled set of
ontologies and vocabularies3 (Ontologies for
Public Administration), a special vocabulary
for the definition of public services
CPSVAP_IT Core Public Service Vocabulary.4
The CPSV-AP_IT is the Italian version of this
vocabulary, offering a framework to
describe public services in Italy, in line with
the European vocabulary of core public
services.</p>
      <p>Designing and implementing a
conversational agent natively integrated
with CPSV-AP_IT will help improve the
accessibility and effectiveness of public
services, enabling users to interact in a more
intuitive and personalised way with the
information and services offered5.
Moreover, the artificial intelligence of the
conversational agent could be enhanced by
the structure and semantics provided by the
CPSV-AP_IT, facilitating a better
understanding of user requests and offering
more precise and contextualised answers
The realised artefact would be reusable at
different administrative levels of the central
and local Public Administration thanks to
the work carried out over the years by AGID
and by virtue of the fact that the Core
Vocabulary of Public Services-Italian Profile
(CPSV-AP_IT) has been defined. Moreover,
the aspects of accessibility and usability
would be guaranteed by including as a
requirement by design the respect of the
principles of accessibility and usability
2https://ec.europa.eu/isa2/solutions/core-public-servicevocabulary-application-profile-cpsv-ap_en/
3OntoPiA: GitHub:
https://github.com/italia/daf-ontologievocabolari-controllati e GitHub wiki:
https://github.com/italia/dafontologie-vocabolari-controllati/wiki (ita)
4Avaiable at
link:&lt;https://github.com/italia/daf-ontologievocabolari-controllati/blob/master/Ontologie/CPSV/v1.1/CPSVAP_IT.rdf.&gt;
5https://www.readygoone.it/approfondimenti/10-funzionidellassistente-conversazionale/</p>
    </sec>
    <sec id="sec-3">
      <title>3. Main steps for the design and implementation</title>
      <p>already present in the Design System Italia6
(DSI).</p>
      <p>The project idea therefore intends to set
some cornerstones for the design and
subsequent implementation of a
conversational agent based on intent and
generative artificial intelligence with the aim
of exploiting the semantics of public services
to improve the interaction between citizens
and public administration.
The selection phase of a chatbot
development paradigm/platform is crucial
as already mentioned from a technological
point of view it will be a mix of intent and
generative AI.
For the reasons already expressed for the
PA, open source solutions for generative AI
should be favoured, combining it with
retrieval-augmented generation (RAG)
techniques7.</p>
      <p>Retrieval-augmented generation (RAG) has
emerged as a promising solution that
incorporates knowledge from external
databases.</p>
      <p>For knowledge systems, RAG has several
advantages over the use of LLM alone:
Accuracy: RAG reduces and mitigates the
risk of 'hallucinations', where LLMs might
provide plausible but incorrect information.
It does this by 'rooting' LLM answers in
accurate data retrieved from your team's
data sources to generate reliable answers.
Transparency: good RAG systems can
provide references that allow users to verify
where information comes from, adding a
level of trust and accountability to the
answers provided by RAG models.</p>
      <p>Customisation: RAG systems can use data
specific to your company or industry (e.g.
naming conventions), making them
adaptable and ensuring that answers are
relevant to your specific context.
6https://designers.italia.it/design-system/
Other crucial elements in the PA's selection
of the open source artificial intelligence
architecture are:
Agnostic with respect to language models
(it must therefore work with customised
OpenAI, Cohere, HuggingFace models)
The solution must possess long-term
memory, have the possibility of using
external tools (API, other models), be able to
ingest documents of different formats (at
least pdf, txt, json) and be developed with
technologies that natively implement the
possibility of scaling horizontally and
vertically.</p>
      <sec id="sec-3-1">
        <title>Non-functional requirements:</title>
        <p>Accessibility by design: the chatbot should
be implemented according to the
accessibility by design paradigm while also
taking into account the needs of user groups
with disabilities as suggested by
accessibility best practices1, such as the
provision of text alternatives for any images
and audio transcripts.</p>
        <p>Multilingualism: implementation of
multilingual functionality.</p>
        <sec id="sec-3-1-1">
          <title>3.1. Acquisition and preparation of data</title>
          <p>1.
2.</p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>Retrieval of CPSV-AP_IT data: Data in</title>
        <p>CPSV-AP_IT format (e.g. eligibility
requirements, tariffs) and identification of
relations with other services.</p>
        <p>Pre-processing and organisation: In the
case of data not conforming to CPSV-AP_IT, a
mapping phase is still necessary to clean and
harmonise the data in a format suitable for
the chosen development architecture. It may
be necessary to convert them into a
computer-readable format (e.g. JSON, CSV)
and to structure and optimise them for
efficient queries.</p>
        <p>Identification of intentions and entities: in
this phase, the potential questions and
intentions of the user (e.g. "how do I apply for
a passport renewal?") and the entities they
might mention (e.g. "passport", "renewal")
should be defined. This could help the
chatbot understand the user's needs.
7https://blogs.nvidia.com/blog/what-is-retrieval-augmentedgeneration/</p>
        <sec id="sec-3-2-1">
          <title>3.3. Test and distribution</title>
        </sec>
      </sec>
      <sec id="sec-3-3">
        <title>Design the flow of the conversation:</title>
        <p>Create natural language dialogues that guide
the user in the discovery of services.
Consider common user questions and create
branching paths based on their answers.
Prioritise clarity and conciseness and
nonbureaucratic language.</p>
      </sec>
      <sec id="sec-3-4">
        <title>Integration of data with the CPSV-AP_IT</title>
        <p>model: Linking the chatbot via ingestion
pipelines to the previously processed
CPSVAP_IT data.</p>
      </sec>
      <sec id="sec-3-5">
        <title>Implementation Natural Language</title>
        <p>Processing (NLP): Using NLP techniques
(intent recognition, entity extraction) to
understand user queries and map them to
relevant data points in the CPSV-AP_IT
knowledge base. In order to enable the
chatbot to retrieve accurate information
about services.</p>
        <p>Error handling and fallback: Implement
mechanisms to handle user input that does
not match defined intent or entity. Provide
helpful hints or propose to rephrase the
question. Consider offering a fallback option
such as connecting with a human agent for
complex questions by retrieving what was
typed.</p>
        <p>Extensive testing: Rigorous testing of the
chatbot's functionality with various
scenarios and user queries should be
envisaged. Ensure that it accurately retrieves
and presents information on public services,
understands intent and provides clear
guidance. Usability tests should also be
envisaged.</p>
      </sec>
      <sec id="sec-3-6">
        <title>Monitoring and improvement: Constant</title>
        <p>monitoring of the chatbot's performance.
One should plan to collect feedback from
users especially transactions that did not go
well and use it to refine conversation flow,
NLP accuracy and the overall user
experience.</p>
        <sec id="sec-3-6-1">
          <title>3.2. Chatbot technological development</title>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusions and on-going activities</title>
      <p>This presentation briefly describes a possible road
map for the implementation of a conversational agent
based on intent and generative artificial intelligence
(AIG) with the aim of exploiting public service
semantics (CPSV-AP_IT Core Public Service
Vocabulary) to improve the interaction between
citizens and the public administration (PA).
Retrieval Augmented Generation (RAG) in general
offers several advantages, in particular to improve the
capabilities of artificial intelligence systems.
In short, it is an approach that combines large
language models (LLM) with information retrieval
(IR) to improve the accuracy and relevance of
LLMgenerated text.</p>
      <p>In a nutshell, the aims of this proposal:
1. Improving the discovery of online and
onsite services of the public administration
2. Providing personalised and relevant
answers to users
3. Reduce first level help desk calls to various
services
An indicative road map of development:
•
•</p>
      <p>A prototype will be realised and validated by
2024.</p>
      <p>By first half of 2025 go live in production.</p>
      <p>Possible future developments also automate the
delivery of some simple services, integration at least
as UX in the Design System Italy.</p>
      <p>We may conclude by saying that the PA should not
make the mistake of building an architecture that is
bound to a single LLM model or specific solutions
because depending on the specific use case, expected
performance and costs, the configuration of the
generative AI solution will be different.</p>
      <p>Just to give an example of the variety and speed with
which this sector is evolving, Anthropic alone released
three LLM models between 2023 and 2024: #Claude1,
#Claude2 and #Claude3.</p>
      <p>OpenAI appears to be close to launching new versions
of #GPT, which it claims will represent a further leap
forward. The galaxy of generative artificial
intelligence is still evolving strongly.</p>
      <p>And it also has an impact on the open source world in
fact the difference between an open-source LLM and
close so binary is showing its limits or rather this new
pradigma1 is establishing itself with respect to LLM so
we can have the following types of models:</p>
      <p>Openly Trained Models (OLMo, Pythia, etc.) - are
those models with training data, training code and
weights available without restrictions on use.
Permissible Usage Models (Llama**, Mistral,
Gemma, etc.) - are those models with base model
weights and inference code available for easy set-up
and distribution.</p>
      <p>Closed LLMs - everything from GPT4 to a random
set of tuned weights without much information.
From the point of view of the Public Administration
this is desirable a cautious and far-sighted responsible
approach confirming the main requirement already
expressed the framework selected must be agnostic
with respect to the LLM model and in any case as PA
we should prefer Openly Trained Models (OLMo,
Pythia, etc.) or Permissible Usage Models (Llama**,
Mistral, Gemma, etc.).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <article-title>[1] Retrieval-Augmented Generation for Natural Language Understanding" di Patrick Lewis et al</article-title>
          . (
          <year>2020</year>
          ): https://arxiv.org/abs/
          <year>2005</year>
          .11401
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>RAG</given-names>
            <surname>:</surname>
          </string-name>
          <article-title>A Simple but Effective Approach to Neural Conversational Modeling" di Alexander Rush et al</article-title>
          . (
          <year>2020</year>
          ): https://medium.com/dropoutanalytics/what-is
          <article-title>-rag-in-generative-aif5b8c13575f8</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>"</surname>
            <given-names>RAG-BERT</given-names>
          </string-name>
          :
          <article-title>Retrieval-Augmented Generation with BERT" di Honglei Zhuang et al</article-title>
          . (
          <year>2020</year>
          ): https://www.analyticsvidhya.com/blog/2023/ 10/rags-innovative
          <article-title>-approach-to-unifyingretrieval-and-generation-in-nlp/</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <article-title>[4] "Towards Controllable and Consistent Generation with Retrieval-Augmented Generation" di Yilun Wang et al</article-title>
          . (
          <year>2021</year>
          ): https://aclanthology.org/
          <year>2020</year>
          .coling-main.
          <volume>207</volume>
          [5]
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>