<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Process Mining of Public Administration Operations from Big Data</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Dmitry</forename><surname>Mingazov</surname></persName>
							<email>dmitry.mingazov@maggioli.it</email>
							<affiliation key="aff0">
								<orgName type="institution">University of Camerino</orgName>
								<address>
									<addrLine>via Madonna delle Carceri 7</addrLine>
									<postCode>62032</postCode>
									<settlement>Camerino</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="department">R&amp;D Gruppo Maggioli</orgName>
								<address>
									<addrLine>via Bornaccino 101, Santarcangelo di Romagna</addrLine>
									<postCode>47822</postCode>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Fabio</forename><surname>Celli</surname></persName>
							<email>fabio.celli@maggioli.it</email>
							<affiliation key="aff1">
								<orgName type="department">R&amp;D Gruppo Maggioli</orgName>
								<address>
									<addrLine>via Bornaccino 101, Santarcangelo di Romagna</addrLine>
									<postCode>47822</postCode>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Process Mining of Public Administration Operations from Big Data</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">C370CC8CC47E54969C60DB9A50DBB0C4</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T16:57+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Process Mining</term>
					<term>Public Administration</term>
					<term>Knowledge Graphs</term>
					<term>Big Data</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In this paper we use Process Mining and unsupervised learning to extract Graphs from Big Data produced by Public Administration software logs. Starting from millions raw logs of a software used in many Italian municipalities, we group functions related to specific Public Administration operations -such as management of reversals, tax collection seizures, budget change -by means of clustering techniques. Then we apply Inductive Miner on clusters to extract process models and we visualize them in Business Process Models Notation, that represent generalized ways to perform specific operations and can be exploited for detailed process modeling, communication, and analysis of the workflows in the Public Administration. We argue that this work paves the way towards modeling Public Administration operations into Knowledge Graphs in a transparent way, suitable for the integration into ethical AI systems.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction and Background</head><p>Public Administration (PA) increasingly relies on effective process management to ensure the successful execution of both administrative and front-end services to the citizens. The application of Artificial Intelligence (AI) to the PA is crucial for improving the efficiency and transparency of process management in the public sector. However, AI applications within the PA remain underdeveloped <ref type="bibr" target="#b0">[1]</ref> for different reasons. These include data sparsity, lack of data interoperability <ref type="bibr" target="#b1">[2]</ref>, a general risk aversion in the public sector <ref type="bibr" target="#b2">[3]</ref> and the legacy of outdated Information Technology systems that are hard to integrate with AI tools. Nevertheless, there is a huge effort of the scientific community to make advances and improvements into the PA sector. On the one hand there are top-down approaches with Knowledge Graphs (KGs). These represent entities, process steps and the relations between them in a machine-readable form. KGs can include complex knowledge about a domain and facilitate PAs to adopt a data-centic orientation and operation analytics <ref type="bibr" target="#b3">[4]</ref>. On the other hand there are bottom-up approaches that try to extract patterns, rules and relations directly from data. Among these techniques, Process Mining <ref type="bibr" target="#b4">[5]</ref>, transparent Machine Learning and Association Rule Learning <ref type="bibr" target="#b5">[6]</ref> are powerful tools for discovering, modeling and managing business processes in the PA. Many organizations are currently utilizing Process Mining to discover patterns in data, applying research and innovation actions to the business <ref type="bibr" target="#b6">[7]</ref>. An analysis of 144 research papers in the business applications of Process Mining <ref type="bibr" target="#b7">[8]</ref> revealed that most of the existing research focuses on extracting models within a single organization to improve a single business process. Research on using Process Mining across different systems or between organizations is still underdeveloped. Additionally, the current literature rarely explores how Process Mining can be applied to analyze physical services, like municipal operators working at the counter. Process Mining has the potential to offer valuable insights into customer processes, but to achieve this, researchers need to explore more complex use cases, and there is need for collaboration between academics and practitioners to obtain good results. Machine Learning in the public sector instead is mainly used for the automation of routine operations that have complicated elements, such as triaging phonecalls or correspondence to the right points of contact <ref type="bibr" target="#b8">[9]</ref>. These algorithms are mainly supervised and trained for specific tasks but the advent of more powerful techniques with less transparent models, such as Deep Learning and Generative AI, increased the risk of bias and discrimination in using algorithms for taking decisions <ref type="bibr" target="#b9">[10]</ref> and this is especially true in the PA <ref type="bibr" target="#b10">[11]</ref>. Nevertheless, there are promising applications of transparent Process Mining <ref type="bibr" target="#b11">[12]</ref> in the medical domain. This study utilizes Process Mining to extract Petri Nets and graphs in Business Process Model Notation (BPMN) from big data of many municipalities encompassing PA operations. BPMN excels at depicting the flow of activities within a process, it and has been ratified as ISO 19510 standard and also extended to cover some PA use cases <ref type="bibr" target="#b12">[13]</ref>. It visually depicts the sequence of activities, decision points, and potential outcomes within a process and facilitates the collaboration between business analysts, process designers, and developers. Moreover, BPMN also provides a mapping with execution languages, particularly Business Process Execution Language (BPEL), thus it is possible to run automations and even build KGs from BPMN <ref type="bibr" target="#b13">[14]</ref>. The paper is structured as follows: after a brief review of related works we introduce the data and the experiments, discuss the results, draw our conclusions and finally we trace our direction for future work.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.1.">Related Work</head><p>Recent attempts to apply Process Mining to big databases of logs from PA software revealed that this kind of data is very hard to process with existing techniques. Previous work of this kind counts 104 operators and 227.000 logs <ref type="bibr" target="#b14">[15]</ref>. In particular these softwares are usually made of many different forms that allow the execution of nested operations or sub-parts of operations. In this scenario a form closure does not necessarily imply a parent relationship with the other open forms. Moreover, sometimes it happens that even if two forms are dependent, the closing date is incoherent, with a parent form closing before a child form. In fact, forms may remain opened for long even if they are not being used. The difficulties in the application of Process Mining to logs of PA data can be summarized in four problems <ref type="bibr" target="#b14">[15]</ref>: 1. the impossibility to reduce multiple levels of interweaving to a simpler structure due to the need of the software to allow multiple nested operations in parallel; 2. the difficulty of making structural assumptions based on temporal relations; 3. the presence of loops and redundant activities, such as technical automated functions mixed with the actual operations; 4. the difficulty of labelling operations on the fly due to the potential incoherence between parent and child forms.</p><p>The presence of loops can be solved with correlation process mining <ref type="bibr" target="#b15">[16]</ref>, that is designed for logs in which events that belong to the same case are related to each other. Similar functions and similar control flows can be detected and grouped by coupling Process Mining with parametric dissimilarity measures and clustering algorithms like K-medoids <ref type="bibr" target="#b16">[17]</ref>. However, it remains difficult to label operations and evaluate the quality of the labels, because clustering is an unsupervised Machine Learning technique. All these problems are current open challenges. The research in Knowledge Graphs is relatively less problematic. For example using data governance techniques on open data it is possible to build large semantic Knowledge Graphs that represent distributed data spaces for public e-procurement <ref type="bibr" target="#b17">[18]</ref>. However, data heterogeneity within the PA presents a challenge for stakeholders, such as PA employees, developers and decision makers, in identifying relevant data standards, formats, and APIs for digitizing specific public services, especially those with few open data available. However, there are attempts to solve this issue with semantic modeling and linked open data principles <ref type="bibr" target="#b18">[19]</ref>, and link them to existing KGs. For example it is possible to enable the automated creation of human-and machine-readable descriptions of processes from data into ontologies, and link them to existing process descriptions of public services <ref type="bibr" target="#b19">[20]</ref>, such as legal ontologies. The gap between top-down and bottom-up approaches is still large. The main challenge in the bottom-up approach is the lack of semantics. In other words it is not possible to exactly know from software logs the semantics of the operation performed and its relation to the other operations. The main challenge with the Top-down approach instead is the heterogeneity of data. Ontologies and KGs encode the semantic relations between processes but lack the ability to link them to real processes of the PA. Bridging this gap would allow us to spot the inefficiencies in the PA and to have much more control on the entire administrative system.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Data Description</head><p>We collected logs generated by Sicraweb Evo, a software designed to perform many operations in Italian municipalities. This software is divided in a client side, i.e. a web application used by the municipality operators, and a server side, from which the logs are currently generated. The logging system was designed for debugging purposes and it does not yield direct information about the processes, as happens in similar software described in literature. Moreover, the quantity of logs is enormous, averaging at 7.7 million records per day from more than 2000 municipalities. For our experiments we random sampled 1 million logs from 15 different municipalities and more than 150 operators. To the best of our knowledge this is the first work that applies Process Mining on PA operations using such a large amount of data. The data is recorded as a sequence of REST calls to the server side of the application, where each call is a single activity, until recurrent patterns will be discovered and associated to higher level operations. Each REST call contains the following attribute fields:</p><p>• Activity: the atomic software function that is activated in the process; • Resource: anonymized municipality and operator who used the software;</p><p>• Action order: a sequential number indicating the execution order of the activities; • Relative time: progressive record of milliseconds starting at 0 with the first activity.</p><p>The presence of the Action order helps solving problem 2, making structural assumptions even when relative time is not consistent. However, case id and process id are inherently missing from data. The event logs used can be classified as ⋆ ⋆ ⋆ in the maturity level for Process Mining described in literature <ref type="bibr" target="#b20">[21]</ref>. Process Mining algorithms operate on a set of cases, i.e. instances of processes. Since our dataset was lacking of case notations, we added them to the records. We assigned a case ID to each sequence of activities not interrupted by a change of client (municipality), date, operator or the opening of a new form. This approach was proven to work in a similar scenario <ref type="bibr" target="#b14">[15]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Experiments and Discussion</head><p>Our contribution follows a bottom-up approach and presents two experiments. In the first experiment we want to understand how much the raw log data can be linked to operation labels. We assume the form titles as operation labels provided by the software designers, who are domain experts.We evaluate the relationship between operation labels and clustering by means of Homogeneity <ref type="bibr" target="#b21">[22]</ref> and Silhouette metrics <ref type="bibr" target="#b22">[23]</ref>  <ref type="bibr" target="#b23">[24]</ref>. Homogeneity measures how many clusters contain only logs which are members of a single operation, while Silhouette measures how similar are the logs in their own cluster compared to the other clusters. In the second experiment we apply Process Mining on clusters to extract Petri Nets and visualize them in BPMN. We use Replay Fitness <ref type="bibr" target="#b24">[25]</ref> to evaluate the quality of the graphs extracted.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Clustering</head><p>Before applying any Process Mining algorithm to raw data, logs must be divided into chunks of homogeneous context. Following previous literature <ref type="bibr" target="#b16">[17]</ref> we applied unsupervised clustering techniques, K-medoids and OP-TICS for instance, to achieve that. We extracted features from the logs by using the frequency of specific activities. In this way we obtained a feature table, where rows represent case ids and the columns represent the frequency of activities. In order to reduce information sparseness, we applied Singular Value Decomposition and compressed the feature space from initial 1776 columns to two trials, with 100 and 50 columns respectively.</p><p>Results, reported in Table <ref type="table" target="#tab_0">1</ref>, show that K-medoid has higher Silhouette score, meaning that is able to aggregate more similar logs under operation labels. Homogeneity score is similar between the two, indicating that both algorithms are able to subsume the logs under the operations roughly the same way. A qualitative analysis revealed that OPTICS is able to manage noisy logs better than K-medoid, obtaining clearer graphs. The lower scores of OPTICS are possibly due to the fact that it tends to create a wastebasket cluster with noisy logs among other cleaner clusters, while K-medoid tends to aggregate noisy logs with others. Moreover, a manual check revealed that only 36.8% of the operations contains vertical functions from the same area. For example the management of reversals contains just functions from the financial area. The remaining 63.2% are operations that involve different areas. For example the management of purchase invoices contains functions used in the financial area as well as in the general affairs area. This indicates that the OPTICS algorithm may better reflect the actual percentage of homogeneous operations.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Process Mining</head><p>Each cluster of logs represent a supposed operation containing several variants. With the amount of data we processed we obtained more than 200 clusters with both algorithms. Some operations are represented by more than one cluster. There are by average 5.05 clusters per operation, with about 30 clusters that contain mainly technical and automatic functions, and cannot be mapped to any specific operation and can be discarded. Aiming at a representation of the software processes with high simplicity of understanding, We applied Inductive Miner to the clusters to obtain both Petri Nets and BPMNs, and ultimately chose BPMN to visualize our data. These represent generalized ways of performing operations. In order to make the process discovery more scalable, traces which shared the same set of activities, regardless of their edges, were grouped together and used as input for the discovery of BPMN. The whole discovery process was performed using custom Python scripts which made use of the PM4Py library <ref type="bibr" target="#b25">[26]</ref>. We computed average Replay Fitness on 10 random clusters generated with both algorithms. The results with K-medoids is 0.976 and with OPTICS is 0.998, indicating that OPTICS captures information from all variants in a cleaner way, as emerged in the qualitative analysis. Figure <ref type="figure" target="#fig_0">1</ref> is a generalized BPMN graph of a purchase invoice management operation from 73 variants. The process can be represented by exclu- sive (x) and parallel (+) gateways. Despite BPMN models are not full KGs <ref type="bibr" target="#b26">[27]</ref>, they can serve as a ubiquitous visual tool across various disciplines, including software development, engineering design, and scientific experimentation. A great advantage of BPMN models is that it is possible to turn them into code and develop transparent automated processes from data with a bottom-up approach.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Conclusion and Future</head><p>We presented a method for the extraction of BPMN from big data using Process Mining and clustering techniques.</p><p>The major contribution of this work to the scientific community is to apply these algorithms to big data in a real world scenario. We plan to evolve this work in three different ways: applying new Process Mining algorithms, enhancing inductive miner to extract configurable graphs and aggregate processes at a level above operations; test the development of automations by turning BPMN into code by means of AI tools; explore the integration of BPMN and KGs. The integration of BPMN and KGs holds significant promise for enhancing business process management. By combining the structured flow representation of BPMN with the rich semantic relationships captured in KGs, organizations can gain a deeper understanding of their processes and automate the management of PA processes based on a broader knowledge base. Future research can explore specific implementation frameworks and evaluate the impact of this integration on process efficiency and knowledge utilization within organizations.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: BMPN graph of purchase invoice management operation.</figDesc><graphic coords="4,111.96,90.33,156.00,550.80" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Results of clustering experiments.</figDesc><table><row><cell cols="4">algorithm features Silhouette Homogeneity</cell></row><row><cell cols="2">K-medoid 100</cell><cell>0.498</cell><cell>0.432</cell></row><row><cell>OPTICS</cell><cell>100</cell><cell>0.339</cell><cell>0.403</cell></row><row><cell cols="2">K-medoid 50</cell><cell>0.513</cell><cell>0.435</cell></row><row><cell>OPTICS</cell><cell>50</cell><cell>0.332</cell><cell>0.401</cell></row></table></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgements</head><p>This work was supported by the European Commission grant 101120657: European Lighthouse to Manifest Trustworthy and Green AI -ENFIELD.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">G</forename><surname>Reddick</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Anthopoulos</surname></persName>
		</author>
		<title level="m">Information and communication technologies in public administration</title>
				<imprint>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Publishing official classifications in linked open data</title>
		<author>
			<persName><forename type="first">G</forename><surname>Lodi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Maccioni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Scannapieco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Scanu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Tosco</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">SemStats@ ISWC</title>
				<imprint>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page" from="1" to="12" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Performance and management in the public sector: Testing a model of relative risk aversion</title>
		<author>
			<persName><forename type="first">S</forename><surname>Nicholson-Crotty</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Nicholson-Crotty</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Fernandez</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Public Administration Review</title>
		<imprint>
			<biblScope unit="volume">77</biblScope>
			<biblScope unit="page" from="603" to="614" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">An event-centric knowledge graph approach for public administration as an enabler for data analytics</title>
		<author>
			<persName><forename type="first">D</forename><surname>Zeginis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Tarabanis</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Computers</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="page">17</biblScope>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Process mining solutions for public administration</title>
		<author>
			<persName><forename type="first">S</forename><surname>Fioretto</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">European Conference on Advances in Databases and Information Systems</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="668" to="675" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Research on public administration decision model based on big data association rules mining algorithm</title>
		<author>
			<persName><forename type="first">F</forename><surname>Guo</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">2023 International Conference on Networking, Informatics and Computing (ICNETIC), IEEE</title>
				<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="544" to="549" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Process mining techniques and applications-a systematic mapping study</title>
		<author>
			<persName><forename type="first">C</forename><surname>Dos Santos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Garcia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">R F</forename><surname>Meincheim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">R</forename><surname>Junior</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M V</forename><surname>Dallagassa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">R</forename><surname>Sato</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">A P</forename><surname>Carvalho</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">E</forename><surname>Santos</surname></persName>
		</author>
		<author>
			<persName><surname>Scalabrin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Expert Systems with Applications</title>
		<imprint>
			<biblScope unit="volume">133</biblScope>
			<biblScope unit="page" from="260" to="295" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">How is process mining technology used by organizations? a systematic literature review of empirical studies</title>
		<author>
			<persName><forename type="first">M</forename><surname>Thiede</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Fuerstenau</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">P</forename><surname>Bezerra</surname></persName>
		</author>
		<author>
			<persName><surname>Barquet</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Business Process Management Journal</title>
		<imprint>
			<biblScope unit="volume">24</biblScope>
			<biblScope unit="page" from="900" to="922" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><surname>Veale</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Brass</surname></persName>
		</author>
		<title level="m">Administration by algorithm? public management meets public sector machine learning, Public management meets public sector machine learning</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Negligent algorithmic discrimination</title>
		<author>
			<persName><forename type="first">A</forename><surname>Páez</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Law &amp; Contemp. Probs</title>
		<imprint>
			<biblScope unit="volume">84</biblScope>
			<biblScope unit="page">19</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">On machine learning and public administration</title>
		<author>
			<persName><forename type="first">L</forename><surname>Cao</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Frontiers in Management Science</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="1" to="4" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Transparency promoted by process mining: an exploratory study in a public health product management process</title>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">R</forename><surname>Neubauer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">M</forename><surname>De Araujo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Fantinato</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">M</forename><surname>Peres</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Anais do X Workshop de Computação Aplicada em Governo Eletrônico</title>
				<imprint>
			<publisher>SBC</publisher>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="37" to="48" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Adapting bpmn to public administration</title>
		<author>
			<persName><forename type="first">V</forename><surname>Torres</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Giner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Bonet</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Pelechano</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Workshop on Business Process Modeling Notation</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2010">2010</date>
			<biblScope unit="page" from="114" to="120" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Automated process knowledge graph construction from bpmn models</title>
		<author>
			<persName><forename type="first">S</forename><surname>Bachhofner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Kiesling</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Revoredo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Waibel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Polleres</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Database and Expert Systems Applications</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="32" to="47" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<author>
			<persName><forename type="first">F</forename><surname>Mouysset</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Picard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Bortolaso</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Migeon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M.-P</forename><surname>Gleizes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Maurel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Derras</surname></persName>
		</author>
		<title level="m">Investigations of process mining methods to discover process models on a large public administration software</title>
				<meeting><address><addrLine>INFORSID</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019">2019. 2019</date>
			<biblScope unit="page" from="147" to="162" />
		</imprint>
	</monogr>
	<note>Congrès Informatique des Organisations et Systèmes d&apos;Information et de Décision</note>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Correlation mining: mining process orchestrations without case identifiers</title>
		<author>
			<persName><forename type="first">S</forename><surname>Pourmirza</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Dijkman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Grefen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Service-Oriented Computing</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2015">2015</date>
			<biblScope unit="page" from="237" to="252" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Tlv-diss _ 𝛾 𝛾: A dissimilarity measure for public administration process logs</title>
		<author>
			<persName><forename type="first">F</forename><surname>Corradini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Luciani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Morichetta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Piangerelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Polini</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Electronic Government: 20th IFIP WG 8.5 International Conference, EGOV 2021</title>
				<meeting><address><addrLine>Granada, Spain</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2021">September 7-9, 2021. 2021</date>
			<biblScope unit="page" from="301" to="314" />
		</imprint>
	</monogr>
	<note>Proceedings 20</note>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Semantic knowledge graphs for distributed data spaces: The public procurement pilot experience</title>
		<author>
			<persName><forename type="first">C</forename><surname>Guasch</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Lodi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">V</forename><surname>Dooren</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Semantic Web Conference</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="753" to="769" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Knowledge graph construction with a façade: a unified method to access heterogeneous data sources on the web</title>
		<author>
			<persName><forename type="first">L</forename><surname>Asprino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Daga</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Gangemi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Mulholland</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM Transactions on Internet Technology</title>
		<imprint>
			<biblScope unit="volume">23</biblScope>
			<biblScope unit="page" from="1" to="31" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">On which legal regulations is a public service based? fostering transparency in public administration by using knowledge graphs</title>
		<author>
			<persName><forename type="first">L</forename><surname>Feddoul</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Raupach</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Löffler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Babalou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Hoyer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Mauch</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>König-Ries</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="s">Lecture Notes in Informatics</title>
		<imprint>
			<date type="published" when="2023">2023</date>
			<publisher>LNI</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<author>
			<persName><forename type="first">F</forename><surname>Daniel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Barkaoui</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Dustdar</surname></persName>
		</author>
		<title level="m">Business Process Management Workshops: BPM 2011 International Workshops</title>
				<meeting><address><addrLine>Clermont-Ferrand, France</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2011-08-29">August 29, 2011. 2012</date>
			<biblScope unit="volume">99</biblScope>
		</imprint>
	</monogr>
	<note>Revised Selected Papers. Part I</note>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">V-measure: A conditional entropy-based external cluster evaluation measure</title>
		<author>
			<persName><forename type="first">A</forename><surname>Rosenberg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Hirschberg</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2007 joint conference on empirical methods in natural language processing and computational natural language learning (EMNLP-CoNLL)</title>
				<meeting>the 2007 joint conference on empirical methods in natural language processing and computational natural language learning (EMNLP-CoNLL)</meeting>
		<imprint>
			<date type="published" when="2007">2007</date>
			<biblScope unit="page" from="410" to="420" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Clustering in an object-oriented environment</title>
		<author>
			<persName><forename type="first">A</forename><surname>Struyf</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Hubert</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Rousseeuw</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Statistical Software</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="1" to="30" />
			<date type="published" when="1997">1997</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Silhouette analysis for performance evaluation in machine learning with applications to clustering</title>
		<author>
			<persName><forename type="first">M</forename><surname>Shutaywi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">N</forename><surname>Kachouie</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Entropy</title>
		<imprint>
			<biblScope unit="volume">23</biblScope>
			<biblScope unit="page">759</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">A review on conformance checking technique for the evaluation of process mining algorithms</title>
		<author>
			<persName><forename type="first">V</forename><surname>Naderifar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Sahran</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Shukur</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">TEM Journal</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="page">1232</biblScope>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Pm4py: a process mining library for python</title>
		<author>
			<persName><forename type="first">A</forename><surname>Berti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Van Zelst</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Schuster</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Software Impacts</title>
		<imprint>
			<biblScope unit="volume">17</biblScope>
			<biblScope unit="page">100556</biblScope>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">Mining process flowcharts from business data: An evolutionary approach</title>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">J</forename><surname>Turner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Tiwari</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Mehnen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 6th CIRP-Sponsored International Conference on Digital Enterprise Technology</title>
				<meeting>the 6th CIRP-Sponsored International Conference on Digital Enterprise Technology</meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2010">2010</date>
			<biblScope unit="page" from="1069" to="1087" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
