<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>SEBD</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Simona Fioretto</string-name>
          <email>simona.fioretto@unina.it</email>
        </contrib>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <volume>31</volume>
      <fpage>02</fpage>
      <lpage>05</lpage>
      <abstract>
        <p>Keeping pace with technological advancements is a challenging task, particularly for Public Administration. The knowledge of working methods in Public Administration actually can speed up the digital transformation process and improve the integration of Information Technology; however, due to the coexistence of several diferent ofices and working methods, discovering this information without the continuous help of experts in the field is far from trivial. The main goal of this paper is to explore and show one of the driving factors for Public Administration digitalization. Since extracting business processes in the public sector is time-consuming, the approach is inverted by using oficial documents as the source of information, by the consciousness that workflows are dictated by policies, guidelines, regulations, and norms. In order to accomplish this goal, literature proposes many techniques from the use of traditional Natural Language Processing to more advanced Neural Network solutions. This paper aims to give an overview of the problem by exploring the current state of the art, and to show potential future work directions.</p>
      </abstract>
      <kwd-group>
        <kwd>Business Process</kwd>
        <kwd>Natural Language Processing</kwd>
        <kwd>Digital transformation</kwd>
        <kwd>Public Administration</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Today, public administrations are at the heart of digital transformation. The economic growth
of a country depends to a large extent on the services provided by public administrations,
which can be the facilitators or the blockers of economic development. With the advent of new
technologies and the ability of existing companies, large and small, to be part of a global market,
it is also necessary for the PA to adapt to the new economic context in which it finds itself. The
digitalisation of the PA is one of the key factors for improving the services ofered, both in terms
of time, quality, transparency and agility. Known IT solutions have not yet been applied to PAs,
which are much slower to follow this digitalisation path. Some authors propose the application
of artificial intelligence techniques to the PA, for services such as virtual agents, the automation
of some activities, the ability to optimise resources or even the possibility of finding the right
task-user match. In order to use new IT systems or apply artificial intelligence techniques,
it is necessary to know the working context of public administration ofices. Identifying and
analysing business processes is a prerequisite for digitalisation. Business Process Management
proposes a series of steps that can lead to the improvement of processes, and in particular
the first two consist of the discovery and modelling of a process, in order to identify all the
activities that must be carried out necessarily and sequentially for the realisation of a process
and therefore for the completion of a result. The realization of an IT system to support the
improvement of the services ofered must take into account the working methods of the public
administration ofices, which is why the identification phase of the business processes of each
public administration ofice is of fundamental importance. Even within the same sector, diferent
services are ofered by a State to its citizens, requiring diferent ways of working and interacting.
From the above, it is clear that business process identification is not an immediate activity in a
PA, but must be tailored for each ofice under consideration. With regard to the digitalization
of the PAs of a country, it would be ideal to have a clear idea of the business processes of all
PAs, but this is not possible. In fact, identifying these processes is a complex task from many
points of view:
• requires the help of experts of the domain;
• it is a process that requires a lot of time for each ofice analyzed;
• it is subject to and influenced by the perception of the users involved;
• it is intrinsic to errors.</p>
      <p>
        To overcome this problem, it is necessary to perform the extraction of processes in an objective
and faster way. The solution could lie in an extraction approach that is no longer based on
a bottom-up approach, which foresees the reconstruction of the process starting from its
execution and, therefore from the executing user, but is based on the recognition that public
administrations, just like companies (although with less rigidity), base their working methods
on the procedures imposed by public documents which cannot be changed. In this way, it will be
possible to rely on the oficial information in the procedures, regulations, standards, guidelines,
etc., not only to overcome the problems related to obtaining the process on the ground, but also
to update the work procedures more quickly when there are new regulatory orders that need
to be taken into account. Based on the activities identified in this way, artificial intelligence
systems can then be used to enable innovative, efective and eficient management of activities
and, with the support of process mining [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], it could be possible to perform a conformance
check between the process extracted from documents and the executed process extracted from
event logs. The problem, which is the information extraction from documents (unstructured
text) is not new and was addressed in the literature by various methodologies. It belongs to
natural language processing field. Even if the application of NLP methodologies is growing
rapidly, the same cannot be said about the analyzed problem, which is in fact still in an initial
exploration phase. The aim of this research is to explore the methods used to perform this
task through a literature review, identify possible opportunities among the new methods and
perform the application of some methods for a set of documents, with the aim of completely
extracting a process starting from a document and performing this task in less time.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Background and Problem statement</title>
      <p>
        The topic of digital transformation in public administration was already discussed several
times. While on the one hand information technologies and artificial intelligence are becoming
increasingly innovative, the same cannot be said for IT applications in public administration.
There is a mismatch between the integration of IT services in the private sector and in the public
sector; as highlighted by [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] the challenges presented to public organizations are connected
to other challenges such as conservative views, risk aversion to IT and innovation, increased
citizen’s expectations, lack of digital literacy and leadership in public organizations.
      </p>
      <sec id="sec-2-1">
        <title>2.1. AI applications in public administration</title>
        <p>
          There are several heterogeneous AI applications that support some of the features required
by the PAs, and some of them have been highlighted and compared by [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. It can be seen
that it is a young research field that lacks the description of the related applications and
challenges, and in the attempt to systematize it, 5 research categories are found, namely AI
in public service dealing with service-oriented applications and new challenges, applications
focusing on workflow improvement, AI predictive models, data management, decision and
knowledge management, AI-influenced work and social environment, public policy and law
related to AI, AI ethics, AI policy in public service. Among these areas, the one that has a more
concrete impact as a driving factor is the one related to AI - public services, and in particular
the applications of virtual agents and automation of workflows that allow to improve the
efectiveness, eficiency and quality of services provided by PAs. [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] also ofers papers that
implement these applications, and the most important are [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] and [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] manages the workflow
of processing immigration forms on digitized documents using form processing software that
performs several key assessment and decision support functions with automatic data extraction
and integration of an AI modules, including workflow case assignment, automatic assessment,
follow-up action generation, precedent case retrieval, and learning of current practices. [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]
which recently deployed AI engines in the social insurance field, starting from standardized
workflows, to help in optimization of the Chinese government services, select civil servants for
the next processing step in the workflow using a task and civil servant optimization approach.
        </p>
        <p>Despite showing important results, both proposed services use already standardized workflow
for applying AI applications; the considered workflows are related only to the case study taken
into consideration, and not to the PA in general. It is clear that a crucial part of ensuring the
introduction of new information technologies is the identification of working procedures, an
activity that can not be carried out individually if the goal is to create applications that are able
to support the whole PA sector.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Business process extraction</title>
        <p>
          The integration of software for public administration, as well as resources optimization,
derive first of all from the identification of working procedures: once the tasks and the actors
are identified, it will be much easier to implement the right software function, and to apply
algorithms that solve optimization problems. The identification of working procedures could be
approached using process mining highlighted by [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] as an instrument to discover, monitor and
improve real processes (i.e., not assumed processes) by extracting knowledge from event logs
readily available in today’s (information) systems. Process mining is thus a kind of bottom-up
discovery of processes from event logs; however in public administration it could be challenging
to obtain event logs in order to identify business processes due to both:
• non-integration of IT tools through all the phases of the process;
• non-availability of public operational data coming from information systems;
It can be said, in a certain way, that the extraction of business processes can be done in two
ways: one goes from the execution of the process to its recognition following a bottom-up path,
and one starts from rules that define the necessary actions to be performed in order to obtain a
result and follows a top-down path. Since the first way is the most uncertain due to human
interaction and an alternative objective way such as process mining requires research for the
application in the chosen domain due to the previous highlighted challenges, in this context the
extraction of business processes using documents is explored. Additionally, the application of
business process extraction could also be matched with process mining to check whether the
execution of the process is responding to the requirements expressed in oficial documents.
        </p>
        <p>
          [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] proposes a new kind of extraction, moving from event logs to policies, calling it
Policybased Process Mining (PBPM), to automatically extract process information from policy
documents in text; the approach uses several text mining techniques applied to business policy
texts in order to discover process-related policies and extract process components. This field of
research is still quite new. An important role and step forward was done by [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] who extracts
the process from a text without making assumptions on text structure, furthermore makes a
proper anaphora resolution to identify concepts that are referenced using pronouns and articles.
Since NLP techniques are constantly and rapidly evolving, it is not an easy task to deal with
this topic, because the contributions and approaches are also changing. Most recent approaches
use AI methods such as neural networks. [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] use in-context learning giving a prompt to GPT-3
model obtaining good results on participants’ information and still unpromising results on
follows relationship. [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] extract business process models from texts on emergency plans and
instructions, using four layers of neural networks to extract emergency elements and introduce
an Adversarial Training Layer for nested entities.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Discussion and Conclusion</title>
      <p>In this section, based on the previous background analysis, a possible future work is examined
and a conclusion is discussed.
3.0.1. Future work
Achieving the solution to the problem of digital transformation for public administration a first
attempt can be seen in the identification of working procedures by information extraction from
texts. In this topic many approaches are used, which can be distinguished by two variables:
1. techniques: most of the recent work uses neural networks, whereas the earlier work
used more traditional NLP techniques; this is obviously because the current research can
benefit from the power of current neural networks in solving NLP tasks;
2. word model: in some studies, the process is derived directly from the text, while in other
studies a word model is created and then the process is derived from it;
Even though this challenge has already been faced several times, to the best of my currently
knowledge, these are some of the points still open which can afect the future work direction:
• model notation: which model notation for process representation best fit the needs of
public administration;
• text structure: the structure of the text is clearly the most important factor influencing
the results;
• documents relations: identifying relationships between multiple documents;
• through text relations: dificulty in identifying the relationships between tasks and
relationships between task and user;
• gateway identification : dificulty in identifying the kind of relationships between tasks
(which gateway construct they follow? under what condition?);
In order to find possible answers to the open questions, the aim of this research is to break the
problem down into several smaller problems. A possible approach to deal with the problem can
be structured as follow:
• Given an unstructured text containing both process-related and non-process-related
sentences, solve a classification problem to extract only process-related sentences;
• Starting from process-related sentences, solving a new classification problem to identify
process constructs;
• Investigate new models and principles for control-flow based models;
• Find a rule to identify the relationships between tasks and between task and user;
• Dealing with the process described in several documents;
• Integration or modification of current processes with new policy requirements.</p>
      <sec id="sec-3-1">
        <title>3.1. Conclusion</title>
        <p>In conclusion, this paper highlights the importance of discovering or extracting process models
from documents in order to support the digital transformation of public administrations.
Discovering or extracting process models from policies is a dificult task, but could lead to huge
improvements in supporting the digital transformation of public administrations. The support
provided by a simpler and more immediate identification of the activities and roles of a process
is linked to many stages of public administration management: simpler design of information
and IT systems, analysis of current procedures and comparison and integration with new ones.
Furthermore, this technique could be seen in a more general way as a support for process
mining, both to check whether the actual execution of the process complies with the policy,
and to evaluate the eficiency of the actual execution of the process, i.e. to check which of the
processes is the more eficient. As seen, although this subject has been studied and analysed,
given the number of disruptive new technologies in the field of Artificial Intelligence and, in
particular, Natural Language Processing, there is still much work to be done to improve the
phases of process extraction and reconstruction to improve some of the phases of extraction and
reconstruction of the process. To conclude it is possible to state that the cohesive application
of both process mining and policy based process mining could help in the identification and
consequent check of business process execution in public administration domain. Further
research in this area can help to improve the integration of Information Technology and support
the ongoing digital transformation of Public Administration.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgments</title>
      <p>This publication article was produced while attending the PhD programme in Information and
Communication Technology at the University of Naples, Cycle XXXVIII, with the support of
a scholarship financed by the NRRP Ministerial Decree no. 351 of 9th April 2022 for Public
Administration
Work supported from the project ”MOD‐UPP” ‐ Macroarea 4 - project
PON_MDG_1.4.1_17PON GOV grant.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>W.</given-names>
            <surname>Van Der Aalst</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Adriansyah</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. K. A. De Medeiros</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Arcieri</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Baier</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Blickle</surname>
            ,
            <given-names>J. C.</given-names>
          </string-name>
          <string-name>
            <surname>Bose</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Van Den</surname>
            <given-names>Brand</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Brandtjen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Buijs</surname>
          </string-name>
          , et al.,
          <article-title>Process mining manifesto</article-title>
          , in: Business Process Management Workshops:
          <article-title>BPM 2011 International Workshops</article-title>
          , Clermont-Ferrand, France,
          <year>August 29</year>
          ,
          <year>2011</year>
          ,
          <string-name>
            <given-names>Revised</given-names>
            <surname>Selected</surname>
          </string-name>
          <string-name>
            <surname>Papers</surname>
          </string-name>
          ,
          <source>Part I 9</source>
          , Springer,
          <year>2012</year>
          , pp.
          <fpage>169</fpage>
          -
          <lpage>194</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Goh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. E.</given-names>
            <surname>Arenas</surname>
          </string-name>
          ,
          <article-title>It value creation in public sector: how it-enabled capabilities mitigate tradeofs in public organisations</article-title>
          ,
          <source>European Journal of Information Systems</source>
          <volume>29</volume>
          (
          <year>2020</year>
          )
          <fpage>25</fpage>
          -
          <lpage>43</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>B. W.</given-names>
            <surname>Wirtz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. C.</given-names>
            <surname>Weyerer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Geyer</surname>
          </string-name>
          ,
          <article-title>Artificial intelligence and the public sector-applications and challenges</article-title>
          ,
          <source>International Journal of Public Administration</source>
          <volume>42</volume>
          (
          <year>2019</year>
          )
          <fpage>596</fpage>
          -
          <lpage>615</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A. H. W.</given-names>
            <surname>Chun</surname>
          </string-name>
          ,
          <article-title>An ai framework for the automatic assessment of e-government forms</article-title>
          ,
          <source>AI</source>
          Magazine
          <volume>29</volume>
          (
          <year>2008</year>
          )
          <fpage>52</fpage>
          -
          <lpage>52</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Cui</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Miao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Leung</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <surname>Smarths:</surname>
          </string-name>
          <article-title>An ai platform for improving government service provision</article-title>
          ,
          <source>in: Proceedings of the AAAI Conference on Artificial Intelligence</source>
          , volume
          <volume>32</volume>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. J.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <article-title>A policy-based process mining framework: mining business policy texts for discovering process models</article-title>
          ,
          <source>Information Systems and E-Business Management</source>
          <volume>8</volume>
          (
          <year>2010</year>
          )
          <fpage>169</fpage>
          -
          <lpage>188</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>F.</given-names>
            <surname>Friedrich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mendling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Puhlmann</surname>
          </string-name>
          ,
          <article-title>Process model generation from natural language text</article-title>
          , in: Advanced Information Systems Engineering: 23rd International Conference, CAiSE
          <year>2011</year>
          , London, UK, June 20-24,
          <year>2011</year>
          . Proceedings 23, Springer,
          <year>2011</year>
          , pp.
          <fpage>482</fpage>
          -
          <lpage>496</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>P.</given-names>
            <surname>Bellan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dragoni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Ghidini</surname>
          </string-name>
          ,
          <article-title>Extracting business process entities and relations from text using pre-trained language models and in-context learning</article-title>
          , in: Enterprise Design, Operations, and Computing: 26th International Conference, EDOC 2022,
          <article-title>Bozen-</article-title>
          <string-name>
            <surname>Bolzano</surname>
          </string-name>
          ,
          <source>Italy, October 3-7</source>
          ,
          <year>2022</year>
          , Proceedings, Springer,
          <year>2022</year>
          , pp.
          <fpage>182</fpage>
          -
          <lpage>199</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>G.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. Q.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Law</surname>
          </string-name>
          ,
          <article-title>Extraction of emergency elements and business process model of urban rail transit plans</article-title>
          ,
          <source>IEEE Transactions on Computational Social Systems</source>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>