<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Towards Optimizing Ontology-Based Data Federation: Performance Insights from Experimental Studies</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Marco</forename><forename type="middle">Di</forename><surname>Panfilo</surname></persName>
							<email>mdipanfilo@unibz.it</email>
							<affiliation key="aff0">
								<orgName type="department">Supervised by Diego Calvanese and Davide Lanti</orgName>
								<orgName type="institution">Free University of Bozen-Bolzano</orgName>
							</affiliation>
							<affiliation key="aff1">
								<address>
									<settlement>Bucharest</settlement>
									<country key="RO">Romania</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Towards Optimizing Ontology-Based Data Federation: Performance Insights from Experimental Studies</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">C5642D6A8E8C11754CD562400EC5DC04</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T18:27+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>OBDA</term>
					<term>OBDF</term>
					<term>VKG</term>
					<term>Data Federation</term>
					<term>Query Optimization</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Ontology-Based Data Federation (OBDF) is a recently introduced framework that utilizes virtual knowledge graphs for real-time, unified data querying across diverse data sources. This approach significantly enhances data integration efficiency. As part of a PhD research, we aim to further investigate and optimize the performance of OBDF. To this end, we have set up and conducted extensive experiments with OBDF using three representative data federation systems, which allow for a comprehensive assessment of the framework and the extensions developed in this thesis. This paper provides a recap of OBDF and details the experimental work conducted and its results, paving the way for further OBDF refinements to be developed during the PhD thesis.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Data is recognized as one of the most valuable resources, whether the context is industrial, academic, or private <ref type="bibr" target="#b0">[1]</ref>. In today's world, timely and accurate access to data, and the insights derived from it, are crucial for driving meaningful impact and informed decision-making. Granting such an access to data, though, becomes even more challenging when the data does not originate from a single consistent source but rather from multiple heterogeneous sources that may be inconsistent with one another. This challenge leads to the problem of Data Integration, which involves ensuring timely and accurate accessibility of multiple data sources to the end-user.</p><p>Today's rapid expansion in the volume, variety, and velocity of data <ref type="bibr" target="#b0">[1]</ref> presents significant challenges for traditional data integration solutions, particularly those based on data warehousing. These approaches are becoming increasingly costly and face difficulties to maintain data freshness. In contrast, data integration based on a virtual approach addresses these challenges by enabling the uniform and virtual management of heterogeneous data. This is achieved by unifying diverse sources, often controlled by different authorities, under a common global schema without the need to materialize a replica of the integrated data <ref type="bibr" target="#b1">[2]</ref>. Within the context of virtual data integration, a popular approach is provided by Data Federation systems. Traditionally defined as a type of metadatabase management system, these systems follow the virtual paradigm by transparently mapping multiple autonomous databases -often relational -into a single, unified federated relational database <ref type="bibr" target="#b2">[3]</ref>.</p><p>Ontology-Based Data Access (OBDA) <ref type="bibr" target="#b3">[4]</ref> also implements a virtual approach. OBDA allows exposing and querying of a (typically relational) data source as a Virtual Knowledge Graph (VKG) built around an ontology that formally represents knowledge as a set of concepts and roles within a domain. Ontologies in OBDA are expressed in lightweight conceptual languages from the DL-Lite family of description logics <ref type="bibr" target="#b4">[5]</ref>, making it possible to interpret data stored in a single relational database in a semantically rich way and with a terminology closer to the one of domain experts. This facilitates users' access to complex data, without sacrificing performance, as OBDA systems nowadays are capable of supporting efficient querying of large amounts of data.</p><p>Building on the same virtual roots, the combination of OBDA -to facilitate access to complex data -and Data Federation -to integrate multiple data sources -appears particularly appealing. While existing OBDA systems can be straightforwardly layered on top of data federation systems, this solution leads to inefficient native queries. Several research challenges remain open in order to fully realize the considered combination in real-world scenarios. The goal of the research that we intend to carry out within the PhD is to study such setting in depth and improve the currently available techniques and technologies along different directions. On the one hand, the aim is to optimize the query processing task to make data access as simple, fast, and accurate as possible, while minimizing computational resource requirements. On the other hand, we are also interested in making more convenient for users to access federated heterogeneous data through a VKG, e.g., by providing (semi-)automated means to bootstrap the VKG mappings and/or ontology.</p><p>Along these lines, we previously introduced Ontology-Based Data Federation (OBDF) <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b6">7]</ref>, a principled framework that combines Data Federation and OBDA, which we evaluated on a data federation system (Teiid), demonstrating substantial query performance improvements. Building on this contribution and to further advance this PhD research, we are now conducting an extensive experimental evaluation of the developed OBDF algorithms, extending our analysis to include two additional data federation systems (Dremio, Denodo). These experiments enable us to assess the generalization capabilities of our approaches, laying the foundation for future research. Specifically, we address the following research question: do the performance improvements observed in <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b6">7]</ref> generalize across different data federation systems? In this paper, we revisit OBDF using an illustrative example and report on the new experiments, which represent the novel contributions of this work.</p><p>The paper is structured as follows: Section 2 provides a review of the state of the art, including key concepts and previous work in Ontology-Based Data Access (OBDA), Data Federation, and attempts at their integration. Section 3 introduces the OBDF framework, detailing its formalization, the use of data hints, and the optimization techniques applied, supported by illustrative examples. Section 4 reports on our extended experiments and discusses the results obtained from evaluating the OBDF framework across different data federation systems. Finally, Section 5 concludes the paper with a summary of findings and outlines the next steps in this PhD research, as well as potential future research directions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">State of the Art</head><p>The core of this PhD research lies in the combination of two techniques: Data Federation and OBDA, with our initial focus on optimizing query answering in the resulting setting. Therefore, we report here the relevant state of the art for each of these two techniques along with previous attempts towards their combination.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Ontology-Based Data Access (OBDA)</head><p>OBDA <ref type="bibr" target="#b7">[8,</ref><ref type="bibr" target="#b8">9]</ref> is a widely used paradigm developed since the mid-2000s <ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b9">10]</ref>, aimed at facilitating access to large-scale databases through domain ontologies. The classical OBDA framework comprises three core components: the user-facing ontology 𝒯 consisting of a DL-Lite TBox, the underlying database 𝒟, and the mappings ℳ that connect the database data to the ontology. We call the database schema 𝛴 and the database instance 𝐷. In traditional OBDA query answering, a SPARQL user query 𝑄 over the ontology 𝒯 is first translated into a SQL query 𝑞 over the database, which is then evaluated over 𝐷. Given a so-called OBDA specification (𝒯 , ℳ, 𝛴), the process of producing 𝑞 out of 𝑄 is carried out in two steps: (i) query rewriting, which is the process of rewriting 𝑄 into a query 𝑄 𝒯 that takes into account the intensional knowledge in 𝒯 ; (ii) query unfolding, which is the process of translating the rewritten query 𝑄 𝒯 based on the mappings ℳ into a SQL query that can be evaluated over the database instance 𝐷.</p><p>Query optimization <ref type="bibr" target="#b10">[11]</ref> occurs during the translation process, to produce a query that can be efficiently executed over the database. To simplify the query answering procedure, the concept of 𝒯 -mappings was introduced <ref type="bibr" target="#b11">[12,</ref><ref type="bibr" target="#b12">13]</ref>. This approach involves encoding the axioms from 𝒯 into ℳ to obtain an extended mapping set ℳ 𝒯 , allowing 𝒯 to be disregarded during query answering, with only ℳ 𝒯 being considered. Therefore, we can focus our attention on the unfolding. In this work we consider the variant of the unfolding where a join of union of conjunctive queries (JUCQ) <ref type="bibr" target="#b13">[14,</ref><ref type="bibr" target="#b14">15]</ref> is produced at the intermediate step. This strategy is also the one implemented in the state-of-the-art OBDA system Ontop <ref type="bibr" target="#b15">[16]</ref>. This JUCQ query will be further optimized through algebraic transformations, often by pushing the joins at the bottom of the algebraic tree, where they can often be simplified, to obtain a query that has the shape of a union of conjunctive queries (UCQ) <ref type="bibr" target="#b16">[17]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Data Federation</head><p>Data Federation is a well-studied problem in several related fields such as Database and the Semantic Web, as reported in <ref type="bibr" target="#b2">[3]</ref>. The primary goal of data federation is federated query answering, which entails accessing multiple, potentially heterogeneous data sources through a unified schema, often referred to as a virtual database (VDB). To compute the answers for a federated query over the VDB, a data federation system can delegate operations (e.g., joins) or entire sub-queries to individual data sources, or perform them itself. We thus distinguish between: local operations, which are handled within a single source, and federated operations, which affect more than one source and whose execution, which is computationally more expensive, involves the data federation system.</p><p>On these bases, federated query answering generally involves several key steps: identifying the data sources relevant to the query components and partitioning the query into sub-queries; devising a plan for evaluating these sub-queries, including determining the order of execution and the types of joins to be used for merging sub-query results; and finally, executing the sub-queries on the identified sources and combining the results according to the established query plan <ref type="bibr" target="#b2">[3]</ref>. To carry out these steps, most data federation systems rely on the relational model <ref type="bibr" target="#b17">[18]</ref>. Query answering in the relational model typically relies on SQL and relational algebra as the underlying query formalization framework.</p><p>In recent years, there have been significant advancements in data federation systems in both academia and industry. These advancements have led to mature implementations, including open-source and/or freely available systems such as Teiid<ref type="foot" target="#foot_0">1</ref> , Denodo<ref type="foot" target="#foot_1">2</ref> , and Dremio<ref type="foot" target="#foot_2">3</ref> , which combine federation capabilities, data security, and, not least, usable interfaces.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">Combining Data Federation and OBDA</head><p>Although a combination of OBDA and Data Federation appears both reasonable -given that both approaches are inherently virtual -and practically interesting, as it would extend OBDA capabilities beyond a single source and make federated queries more accessible to domain experts, this research direction has been sparsely explored in the literature. To the best of our knowledge, few works have attempted to fully integrate these approaches.</p><p>An interesting work in this area is Obi-Wan <ref type="bibr" target="#b18">[19]</ref>, an OBDA system capable of integrating heterogeneous data sources, including relational, graph-based, and NoSQL databases. Obi-Wan follows the classical OBDA framework by rewriting queries based on the ontology and mappings and then evaluating these queries using a mediator across multiple heterogeneous data sources. However, Obi-Wan primarily serves as a proof-of-concept and does not include specific optimization techniques tailored to the federated setting, limiting its applicability in real-world, complex scenarios. Other systems like Squerall <ref type="bibr" target="#b19">[20]</ref> and PolyWeb <ref type="bibr" target="#b20">[21,</ref><ref type="bibr" target="#b21">22]</ref>, also attempt to address the integration of heterogeneous data sources by leveraging OBDA mappings. However, they lack reasoning support and therefore do not qualify as fully-fledged OBDA systems according to the definition in the literature <ref type="bibr" target="#b7">[8]</ref>. Overall, the mentioned works highlight the challenges faced and the substantial effort required when building OBDA systems over federated data sources from scratch. Our OBDF framework <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b6">7]</ref> takes a different approach, aiming to combine the strengths of state-ofthe-art OBDA and Data Federation systems. Rather than reinventing the process of federated query answering, OBDF leverages existing, highly-optimized data federation systems and enhances them through a refined query rewriting process. This process is designed to minimize the computational overhead associated with federated joins, thereby ensuring that queries can be executed more efficiently by the data federation systems. To achieve this, OBDF incorporates data hints and hint-based optimization techniques, which are discussed in detail in the following section. Furthermore, the OBDF framework has been tested on an adapted version of the Berlin SPARQL Benchmark (BSBM) <ref type="bibr" target="#b22">[23]</ref> using a single data federation system (Teiid). This evaluation will be extended to three data federation systems and elaborated upon in this paper.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">OBDF Framework</head><p>This section presents the OBDF framework <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b6">7]</ref>, whose extension and validation are part of this PhD research. We focus here on the core concepts and intuitions behind OBDF also through the discussion of an illustrative example, referring the reader to <ref type="bibr" target="#b5">[6]</ref> for further specific details about the algorithms implementing OBDF.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Overview and Formalization</head><p>The main objective of OBDF is to perform query answering using existing optimized OBDA and data federation systems. To capture the semantics of such combination, we formally introduce the notion of OBDF specification, as a triple ℱ = (𝒯 , ℳ, Σ S ) where 𝒯 is an ontology, Σ S is a VDB schema over (heterogeneous) data sources S, and ℳ is a set of mappings from Σ S to 𝒯 . This definition is derived from the established one of OBDA specification, where we replace the database schema 𝛴 with the VDB schema Σ S .</p><p>Figure <ref type="figure" target="#fig_0">1</ref> depicts the full process of query answering in OBDF and the involved systems. A SPARQL query 𝑄 is posed over the VKG exposed by the OBDA system. The OBDA system is responsible to rewrite the query with respect to the ontology 𝒯 and in a second step to unfold it according to the mappings ℳ of the OBDA system. The resulting SQL query 𝑞 is forwarded to the data federation system, like Teiid, Denodo, or Dremio, to be evaluated over the VDB data instance D through the issuing of sub-queries to the involved data sources. This way, the federation system facilitates the integration of potentially heterogeneous data sources, enabling the OBDA system to interact with it as it would with a single relational database in a standard OBDA setting.</p><p>The OBDF framework inherits challenges from both OBDA and data federation systems, along with new challenges that emerge from their interaction. The OBDA system alone is not aware of the different types of sources and naive query rewriting and unfolding of a SPARQL query can generate inefficient queries for the data federation system. For instance, an unfolding typically contains several joins, which poses a challenge in the federated setting where many of these joins could be federated, thus expensive to compute. In a data federation setting it is also likely that the data is spread over different sources and thus results are overlapping and redundant, which can further complicate query processing. If this system is not optimized, it can lead to inefficient operations and does not scale for big data analysis.</p><p>To address these challenges, we introduced a novel unfolding procedure for OBDF <ref type="bibr" target="#b5">[6]</ref>, which relies on specific statistics computed over the sources. We call these data hints.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Data Hints</head><p>The execution of federated joins in federated query answering is inherently time-intensive. A pivotal strategy in OBDF query optimization involves the preliminary identification of federated joins potentially present in SQL query translations. This step facilitates their query-time restructuring into forms more amenable to efficient evaluation by data federation systems. As articulated in the standard OBDA formalization <ref type="bibr" target="#b7">[8]</ref> and expounded in <ref type="bibr" target="#b14">[15]</ref>, all possible joins in SPARQL to SQL translations can be identified by analyzing the ontology and mappings within the OBDA specification. A central aspect is to provide an algorithm for doing so, as proposed in <ref type="bibr" target="#b5">[6]</ref> and may be the subject of future work.</p><p>Moreover, scenarios involving multiple data sources inherently involve data redundancy. Within OBDF, the top-level ontology provides a unified semantic overlay for diverse data sources. Our approach is to analyze the ontology and the mappings in order to identify redundancies across different data sources and thus optimize SQL query translations by pruning redundant subqueries.</p><p>To encapsulate the query optimization methodology in OBDF, three types of data hints were thus identified:</p><p>• Empty Federated Join ℰ refers to joins across distinct data sources that are expected to yield no results within the current data federation context. Formally, A ◁▷ D B = ∅, where A ◁▷ B denotes a federated join, and the subscript D indicates that the equivalence must hold in the current federated data instance D.</p><p>• Containment Redundancy 𝒞 refers to detectable redundancies that are across data sources. Formally, given a data instance D and two algebraic expressions A and B, we say that A is data-contained in B, denoted as A ⊆ D B, if the set of answers of A over the data instance D is contained in the set of answers B over the data instance D. We use A ≡ D B to indicate that A ⊆ D B and B ⊆ D A, meaning that A and B produce equivalent results within the data instance D.</p><p>• Materialized Views M, as discussed in <ref type="bibr" target="#b16">[17]</ref>, are known to significantly improve query processing performance and are supported by some data federation systems such as Teiid and Dremio.</p><p>In [6, §5.1], we presented an algorithm hintify on how to extract the data hints from a specific data instance D. This algorithm enumerates possible federated joins and redundancies as mentioned above, restricted to VDB relations labeled as static (i.e., whose content is not expected to change), and analyzes them on the current data instance D (e.g., checking whether a federated join is empty) to derive hints that can be expected to hold across data instances. Materialized views are introduced for selected non-empty federated joins, according to customizable heuristics and pending possible further research work to optimize such selection.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3.">Hints-Based Query Optimization</head><p>In our exploration of optimizing SQL translations of SPARQL queries within the OBDF framework, we focus on three integral components: query optimization rules, cost model, and unfolding algorithm.</p><p>Query Optimization Rules Our methodology extends standard optimization rules in OBDA, like the self join elimination rule <ref type="bibr" target="#b23">[24]</ref>, and introduces a comprehensive set of query optimization rules, shown in Figure <ref type="figure" target="#fig_1">2</ref>. These rules are determined on the principles of semantic equivalence and the utilization of pre-computed data hints as in Section 3.2.</p><p>The top part of Figure <ref type="figure" target="#fig_1">2</ref> shows the following classic optimization rules:</p><p>• The distributive rule (dlr) is typically used to "push" joins into unions, in order to transform JUCQ into UCQ unfoldings. This is the core optimization applied by state-of-the-art OBDA systems.</p><p>• The self-join elimination rule (sjr) handles the explosion of self-join operations during the unfolding, arising from the mismatch between the n-ary model of relational databases against the ternary model of RDF graphs.</p><p>The bottom part of Figure <ref type="figure" target="#fig_1">2</ref> introduces the novel hint-based rules:</p><p>• The Empty Join Elimination Rule (ejr) removes unnecessary empty joins in SQL queries. The rule is applicable if the empty federated join hint A ◁▷ D B = ∅ belongs to the set of hints.</p><p>• The Containment Elimination Rule (cr) aims to eliminate redundant unions in SQL queries using the identified set of containment redundancy hints.</p><p>• The Equivalence Elimination Rule (er) modifies or replaces join operations in SQL queries. It checks for data equivalence, i.e., two-sided containment, similar to rule cr, and also assesses a condition † to ensure that the cost of the new expression is less than the original cost. The method for cost computation is detailed below.</p><p>• Finally, the Materialization Rule (mtr) enhances the efficiency of federated joins in SQL queries by utilizing pre-computed views. This rule replaces complex join operations with these more optimized views.</p><p>Cost Model Our approach incorporates a straightforward cost model <ref type="bibr">[6, §5.3</ref>] that assigns an evaluation cost to each relational algebra expression. This model follows a set of heuristic principles:</p><p>• Preference for local joins over federated joins,</p><p>• Prioritization of efficient data sources,</p><p>• Elimination of redundant or empty sub-expressions,</p><p>• Utilization of materialized views, when available.</p><p>In addition to these federation-specific heuristics, the model integrates standard OBDA heuristics, providing a comprehensive framework for evaluating query efficiency.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Hints-Based Unfolding Algorithm</head><p>The hints-based unfolding algorithm defines how the query optimization rules and the cost model are integrated in the query processing workflow of an OBDF framework. The algorithm unfold OBDF was originally presented in <ref type="bibr">[6, §5.4</ref>] and since then it has been further streamlined based on implementation and evaluation feedback, and is currently being improved for additional optimizations.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4.">Example</head><p>In this section, we provide concrete examples demonstrating the application of the data hints discussed in Section 3.2, in conjunction with the unfold OBDF <ref type="bibr" target="#b5">[6]</ref> algorithm. For simplicity in this example, we will focus primarily on the key objective of the cost model, which is to prefer local joins over federated joins. This should suffice to illustrate the algorithm's operation and main intuitions.</p><p>Figure <ref type="figure" target="#fig_3">3</ref> depicts the entire process of query optimization. As a first step, we describe all the input variables required by the unfold OBDF algorithm:</p><formula xml:id="formula_0">OBDF Specification ℱ = (𝒯 , ℳ, Σ S ):</formula><p>The OBDF specification ℱ consists of the ontology 𝒯 the mappings ℳ and the Federated VDB schema Σ S . They are depicted with a purple background on the top right of Figure <ref type="figure" target="#fig_3">3</ref>.</p><p>-The Ontology 𝒯 consists of two axioms stating that ConvenienceGoods and ShoppingGoods are both Products.</p><p>-The Federated VDB Schema Σ S (Virtual Database Schema) comprises relation names used by the federation system, capable of integrating multiple federated sources. In our example, subscripts are used to specify the exact federated source for each relation, with the sources ranging from Source 1-4.</p><p>-The Mappings ℳ define how the data in the data sources is mapped to the concepts and roles of the virtual knowledge graph. In our example, the source component, positioned on the left side of the mapping, is represented by a query in algebraic form that operates over the data sources. Conversely, the target component on the right is described by a first-order logic atom, which includes IRI templates 𝑓, 𝑔, ℎ and details on the transformation of database values into RDF literals. The mappings show that ConvenienceGoods and their attributes are sourced from the table CG 1 , while ShoppingGoods are derived from the table SG 2 . Additionally, the role ℎ𝑎𝑠𝑁 𝑎𝑚𝑒, which specifies the names of inspectors, links to the tables PerInfo 3 and Employee 4 .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>SPARQL Query 𝒬:</head><p>The topmost box 𝑄 of Figure <ref type="figure" target="#fig_3">3</ref> contains the SPARQL query of our example. We use a SPARQL query to retrieve information about all products. This includes each product's name, the inspector associated with the product, and the inspector's name.  </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Optimization Data Hints:</head><p>In this example we assume that the pre-computation of the hints following the hintify algorithm in <ref type="bibr" target="#b5">[6]</ref> has detected the following data hints.</p><p>-Empty Federated Join Hints ℰ:</p><formula xml:id="formula_1">CG 1 ⋊ ⋉ SG 2 ≡ D ∅</formula><p>The presence of an empty federated join hint between the CG SG -Materialized Views Hints M: ∅</p><p>In this particular example, no hints related to materialized views have been identified.</p><p>The SPARQL query 𝑄 is transformed into an SQL query as follows:</p><p>• Q → q1 The initial phase of the unfold OBDF algorithm involves invoking the unfold wrap algorithm. This algorithm, as detailed in <ref type="bibr" target="#b14">[15,</ref><ref type="bibr" target="#b13">14]</ref>, is an unfolding method that results in a query in JUCQ form, as illustrated in the relational algebra expression of 𝑞1 in Figure <ref type="figure" target="#fig_3">3</ref>.</p><p>• q1 → q2 Subsequently, in the transition from 𝑞1 to 𝑞2, highlighted in green in Figure <ref type="figure" target="#fig_3">3</ref>, the distributive law rule (dlr) is applied to the initial join, effectively pushing the join operation inside the unions. This produces a union of four joins involving the relations CG 1 and SG 2 .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>q2 → q3</head><p>In several optimization opportunities become Firstly, the self-join elimination rule (sjr) can be employed to remove the join with the same relation on the primary key 𝑐𝑖𝑑. More notably, the empty join elimination rule (ejr) can be applied, given the hint that CG 1 ⋊ ⋉ SG 2 ≡ D ∅. These optimizations lead to the removal of joins in the green-highlighted section of 𝑞2, resulting in the efficient 𝑞3. Our cost model supports this optimization from 𝑞1 to 𝑞3, as it reduces the number of federated joins from three to two.</p><p>• q3 → q4 The section of 𝑞3 highlighted in red mirrors the characteristics of the green-highlighted part of 𝑞1. Thus, the same optimization techniques -dlr, sjr, and ejr -can be applied, leading to the more simplified query 𝑞4, which contains only one federated join.</p><p>• q4 → q5 In the final transition 𝑞4 to 𝑞5, the unfold OBDF algorithm checks for unions where the containment elimination rule (cr) can be applied. Given the containment redundancy hint Employee 4 ⊆ D PerInfo 3 , the query can be further optimized by removing the Employee 4 relation from union, as all records in Employee 4 are already present in PerInfo 3 .</p><p>• q5 → SQL The final query, 𝑞5, represents the optimized execution plan, which is then translated into an SQL query.</p><p>The illustrated transformation demonstrates the optimization of the original SPARQL query depicted in 𝑄. Initially, the query involved three federated joins and four federated unions across relations from four different sources. The optimized query in 𝑞5 significantly reduces complexity, featuring only one federated join and one federated union.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Experiments and Results</head><p>In this section, we report on the extended experiments we conducted using our hint-based optimization approach, and discuss and analyze the results obtained.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">Experimental Setup and Methodology</head><p>We extend the experiments in <ref type="bibr" target="#b5">[6]</ref> by expanding our tests from a single data federation system to three different systems: Teiid, Denodo, and Dremio. This broader evaluation allows us to assess the general effectiveness of our optimizations across diverse data federation systems.</p><p>As in <ref type="bibr" target="#b5">[6]</ref>, our evaluation is based on the Berlin SPARQL Benchmark (BSBM) from <ref type="bibr" target="#b22">[23]</ref>, <ref type="foot" target="#foot_3">4</ref> which we adapt here to the OBDF scenario. This well known synthetic benchmark comes in two aligned RDF and SQL versions, which together allow evaluating and comparing query answering performance of RDF triplestores, RDBMSs, and OBDA systems as well as demonstrated in <ref type="bibr" target="#b24">[25,</ref><ref type="bibr" target="#b25">26]</ref>. BSBM is designed around an e-commerce scenario. It simulates a diverse marketplace where multiple vendors offer products coming from producers and subsequently reviewed by consumers. Synthetic data for such scenario can be created through the data generator. Given a scale factor 𝑛 corresponding to the desired number of products, the generator produces a corresponding dataset.</p><p>We start from the OBDA instance ((𝒯 , ℳ, 𝛴), 𝐷) of BSBM (for a given scale factor 𝑛) and we aim at transforming it into an OBDF instance ((𝒯 , ℳ ′ , 𝛴 ′ ), D). There, D is a VDB instance obtained using partitioning and replication to reorganize 𝐷 into multiple federated database instances 𝐷 𝑖 , which we allocate to either: (i) a set of relational sources, to test performance in a homogeneous setting (hom); or (ii) a mix of relational and NoSQL sources, to test a heterogeneous setting (het) <ref type="foot" target="#foot_4">5</ref> . 𝛴 ′ is the VDB schema for D, which matches 𝛴 except for partitioned and replicated tables and is obtained by setting up multiple data federation systems. Lastly, ℳ ′ are the new mappings derived from ℳ, which produce the same virtual ABox, which we use with other specification components to configure the tested system.</p><p>Besides the aforementioned hom and het settings for the obtained OBDF instances, we also consider two reference centralized settings sc1 and sc2, where all data is stored in a single PostgreSQL database directly accessed by the OBDA system without a data federation system: sc1 employs OBDA instance ((𝒯 , ℳ, 𝛴), 𝐷) for the original BSBM, whereas sc2 employs OBDA instance ((𝒯 , ℳ ′ , 𝛴 ′ ), D) where data partitioning and replication are introduced, although in a centralized setting.</p><p>To conduct our tests, we slightly adapt and reuse the 12 queries of BSBM Explore. We evaluate query answering with and without our approach focusing on execution time, measured in milliseconds (ms), which refers to the total time required to evaluate either an individual query 𝑄 or a query mix 𝑀 , the latter comprising one execution of each of the 12 BSBM Explore queries. For a query mix, the execution time is determined by geometrically averaging the execution times of its individual queries, calculated as 𝑇 𝑀 = |𝑀 | √︁ ∏︀ 𝑄∈𝑀 𝑇 𝑄 , where 𝑇 𝑀 represents the mix execution time and 𝑇 𝑄 represents the execution time for each query. Note that using a geometric mean to evaluate a query mix, rather than a simple arithmetic mean or sum, helps to prevent slower queries with large execution times from disproportionately influencing the results, particularly when the execution times of faster queries are several orders of magnitude smaller, as observed in some BSBM queries.</p><p>We evaluate the aforementioned metrics for different configurations of our evaluation environment, and specifically:</p><p>• for different scale factors 𝑛 of the BSBM benchmark, 2k, 20k 200k.</p><p>• for each setting sc1, sc2, hom, het; • for each data federation system Teiid, Dremio, Denodo (only for federated settings hom, het); • with or without our hint-based optimization approach, considering these three conditions:</p><p>base: no hint-based optimization (our baseline, all four settings); opt: hint-based optimization without materialized views (only for settings hom, het); optm: hint-based optimization also using materialized views (only for settings hom, het).</p><p>For each configuration we run a number 𝑛 𝑤 of warm up mixes, which serve to initialize caches and whose execution time we do not measure, followed by a number 𝑛 𝑡 of test mixes, over which we compute and average our metrics. We set 𝑛 𝑡 = 20 to have sufficient mixes to conduct statistical analyses. We set 𝑛 𝑤 = 50 after having verified that measured average execution times (on a moving average on 𝑛 𝑡 = 20 mixes) do not change considerably by running further warm up mixes.</p><p>Note that we report optimizations with and without materialized views for two primary reasons. First, the application of materialized views in our experiments was limited to only 2 out of 12 queries (𝑄 2 and 𝑄 12 ). Second, materialized views come with a significant drawback: they require substantial storage space. In contrast, the only negative impact of leveraging empty federated join and containment redundancy hints, which we consider in opt, is a negligible increase of query reformulation time.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">Results</head><p>Figure <ref type="figure" target="#fig_5">4</ref> shows the average execution time of the aforementioned settings for Denodo (a), Dremio (b) and Teiid (c) at a BSBM scale of 200k, for each query 𝑄 1 -𝑄 12 and at query mix level. Furthermore, the x-axis also displays the hint-based optimization rules that have been adopted in the optimizations of the single queries. We observe that the application of hints has been very selective among the queries. The empty join elimination rule (ejr) together with the empty federated join hint has been used to optimize queries 𝑄 1 -𝑄 6 . Meanwhile the containment redundancy hint and containment elimination rule (cr) has been used to optimize 𝑄 7 -𝑄 9 . No data hints and rules have been applied to queries 𝑄 10 -𝑄 11 , whereas materialization rule (mtr) has been used in 𝑄 2 and 𝑄 12 . An analysis of Figures 4 aligns with the expectations and validates the initial findings presented in <ref type="bibr" target="#b5">[6]</ref>. The red bar, representing centralized setting sc1 without data partitioning, consistently shows the shortest execution times, particularly in contrast to centralized setting sc2 (blue bar), where data partitioning is implemented. In general, the execution times relative to setting sc2 are slightly lower than those for the baseline homogeneous setting hom, indicated by the first green bar, demonstrating that the introduction of a data federation layer has no major impact on query execution time. This is especially true at lower BSBM scales like 2k, where we observe sc2 underperforms compared to the homogeneous setting in queries 𝑄 4 , 𝑄 7 , 𝑄 8 , and 𝑄 9 . At higher BSBM scales, the blue bar relative to sc2 remains below the first green bar for the hom baseline, with 𝑄 1 being an exception due to high data variance, as indicated by the error bars, and attributable to different PostgreSQL query execution plans for different instantiations of 𝑄 1 depending on the specific choice of query template parameters.</p><p>Figure <ref type="figure" target="#fig_5">4</ref> reveals a key observation, appreciable by noting that for each federation system and query, the execution times are displayed in three stages (via different shades of color) -first without optimizations (base), then with optimizations but without materialization, and finally with materializations. In nearly every instance, the execution time with optimizations is consistently lower than when no optimizations are applied. The only exceptions are queries 𝑄 10 and 𝑄 11 , where no optimization rules were triggered, and queries 𝑄 1 , 𝑄 3 , and 𝑄 5 in Dremio, which present an opportunity for further analysis to gain insights into the unexpected behavior.  Further supporting these observations, the right-most set of bars labeled as query mix represents the geometric average of all the queries, where the reduction in execution time is clearly evident, further confirming the effectiveness of the hint-based optimizations. Table <ref type="table" target="#tab_1">1</ref> provides the specific values corresponding to the right-most set of bars for the query mix of Figure <ref type="figure" target="#fig_5">4</ref> and also includes values for lower BSBM scales of 2k and 20k. As previously stated, the aggregated results of the query mix indicate consistent improvements. Notably, when using the materialized view and at a BSBM scale of 200k, we observe an average reduction in execution time of the query mix from 1.722 ms to 168 ms in the homogeneous setting and from 27.590 ms to 5.122 ms in the heterogeneous setting. This corresponds to an approximate 90% reduction in execution time for the homogeneous setting and a 77% reduction in the heterogeneous setting. Additionally, the table shows that these values decrease as the BSBM dataset scale increases, underscoring the enhanced efficiency of the optimizations in large data contexts.</p><p>Across all federation systems and BSBM scales, the inclusion of hint optimizations consistently yielded the shortest execution times, as revealed by Table <ref type="table" target="#tab_1">1</ref>. This outcome validates the effectiveness of hint-based optimization techniques proposed by <ref type="bibr" target="#b5">[6]</ref> and examined in this study.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Conclusions and Future Work</head><p>In this paper, we presented an ongoing PhD research tackling the combination of OBDA and Data Federation, both in terms of efficient query evaluation and convenience for users in accessing federated data. We focused here on the former querying aspect, for which we previously introduced the Ontology-Based Data Federation (OBDF) framework <ref type="bibr" target="#b5">[6]</ref>, and for which we presented here a novel, extended evaluation aimed at assessing generalization of performance improvements across multiple federation systems. Notably, this was achieved by considering two additional data federation systems (Dremio and Denodo) in addition to the one (Teiid) previously considered in <ref type="bibr" target="#b5">[6]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.">Summary of Results</head><p>The results of the presented extended experiments confirm the performance improvements and the findings from our previous work <ref type="bibr" target="#b5">[6]</ref> across all the considered data federation systems. The improvements in query execution times due to OBDF optimizations are substantial, ranging from a 90% reduction (see Table <ref type="table" target="#tab_1">1</ref>) in the considered homogeneous setting anto a 77% reduction in the heterogeneous setting. Overall, the reported extended experiments indicate that the optimization techniques integrated within OBDF are effective regardless of the specific federation system employed, providing evidence for the framework's generalization and adaptation capabilities.</p><p>Based on these results, which may possibly further improve due to ongoing enhancements to the OBDF algorithms, we can conclude that efficient OBDA query evaluation over data federation systems is achievable and has been obtained within the scope of the OBDF framework, and we can turn our main focus on the other goal (convenience for users) of this PhD research.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2.">Next Activities in this PhD Research</head><p>The immediate next steps in this PhD research will deal with completing the current revision of the OBDF algorithms. This will involve refining the algorithms, particularly those discussed in the context of hint-based query optimization, and preparing a comprehensive journal publication that encapsulates all contributions and results obtained thus far.</p><p>Following that, we will shift our focus from query execution and optimization techniques to supporting OBDF/OBDA users in designing a Virtual Knowledge Base system, with a particular emphasis on defining mappings. Building on the mapping patterns proposed in recent literature, an automated data-driven approach for bootstrapping mappings directly from data sources will be explored, to avoid the need for users to fully perform this task manually. This approach aims to enhance the flexibility and efficiency of the system, thereby facilitating more seamless integration and utilization of diverse data sources.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.3.">Related Future Work Directions</head><p>Further extensions of OBDF, based on the results obtained so far, present several promising avenues for future research, to be carried out either within (if opportunities will arise) or beyond this PhD research. These extensions regard:</p><p>• Precomputation of hints and identification of materialized views. Further research may delve deeper into optimizing the precomputation of hints, particularly focusing on the identification of materialized views. Ideally, this should involve balancing additional storage usage against improvements in query evaluation times due to the use of materialized views, potentially leading to more sophisticated heuristics that are finely tuned for different data environments.</p><p>• Algorithm unfold OBDF . Additional research can be directed toward extending the unfold OBDF algorithm, enhancing its capability to handle more complex queries and data scenarios.</p><p>• Support for graph databases. A related new line of research may focus on integrating graph databases, like Neo4J, as a source within the Data Federation environment. Such source inherently provides additional query language features, such as navigational queries. To fully leverage these new capabilities, one should enrich the query language of the OBDF framework as well, allowing it to support and exploit these navigational features. This research should thus investigate how established query languages can be extended to incorporate the navigational elements typical of graph query languages. The primary challenge will be to reformulate navigational queries within the ontology context, taking full advantage of the graph database's inherent navigational capabilities. This enhancement has the potential to enable the OBDF framework to handle more complex queries, including those that return not only individual nodes or values but also entire paths or sub-graphs, thus significantly expanding the expressiveness and utility of the framework. This future research direction not only aims to enhance the theoretical underpinnings of the OBDF framework but also seeks to develop practical tools and methodologies that can be applied in real-world scenarios, making the system more robust and versatile for various applications leveraging graph data.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure1: OBDF framework and query answering procedure.<ref type="bibr" target="#b5">[6]</ref> </figDesc><graphic coords="4,335.15,80.42,123.02,127.27" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 :</head><label>2</label><figDesc>Figure2: List of standard optimization rules applied by OBDA systems (first group), and the new rules specific to OBDF (second group). The former is not complete (e.g., we omit trivial transformations like commutativity rule). The letters A, B, C denote relational algebra expressions, whereas T denotes base relations. We use standard relational algebra notation, where for conciseness we introduce the abbreviation 𝜋 r1 /a1 ,...,rn /an for the combination 𝜌 r1 /a1 ,...,rn /an 𝜋 a1 ,...,an of projection and renaming.</figDesc><graphic coords="6,162.25,65.61,270.78,279.54" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head></head><label></label><figDesc>Products.i), g(Products.n), h(Products.r), e(PerInfo3.pn) FROM ( SELECT cid AS i, cname AS n, cinsp AS r FROM CG1 UNION SELECT sid AS i, sname AS n, sinsp AS r FROM SG2 ) AS Products PerInfo3 ON Products.r = PerInfo3.piid; Apply self join elimination rule sjr Apply empty join elimination rule ejr with steps from q1 to q3 } Optimizations are directed by a cost model estimating costs of intermediate queries Local joins preferred over federated joins@ Efficient sources should be favoured over inefficient ones@ Whenever available, prefer materialized views.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Example of query optimization using the Hint-Based Unfolding Algorithm. 𝑞1 illustrates the relational algebra representation of the SPARQL query 𝑄, produced by standard unfolding techniques. Following the unfold OBDF algorithm through steps 𝑞1 to 𝑞5, combination of standard and hint-based optimization techniques is applied, effectively reducing the number of federated and unions, thereby enhancing query efficiency.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>Figure 4 :</head><label>4</label><figDesc>Figure 4: Average execution time (± standard deviation) over 20 test runs for the federation systems Denodo (a), Dremio (b), and Teiid (c) over the BSBM scale of 200k. The x-axes represent queries 𝑄 1 to 𝑄 12 and the query mix geometric averages, with the additional indication of applied hint-based optimization rules.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head></head><label></label><figDesc>This containment redundancy hint indicates that, for this specific database instance, all records from the relation Employee 4 are already included within the records of PerInfo 3 . Therefore, this implies that the records in Employee 4 are a subset of those in PerInfo 3 . Consequently, performing a union of both tables would result in no additional records beyond what is already present in PerInfo 3 , underscoring the unnecessary nature of combining Employee 4 with PerInfo 3 .</figDesc><table /><note>2 that the sets of ConvenienceGoods in CG 1 and ShoppingGoods in SG 2 are disjoint in this specific database instance, indicating no shared records between them.-Containment Redundancy Hints 𝒞: Employee 4 ⊆ D PerInfo 3</note></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 1</head><label>1</label><figDesc>Mean execution times (ms) for each Query mix 𝑀 , averaged over 20 test runs. The table organizes execution times by baselines (sc1, sc2) and data federation systems in the hom and het settings. It also categorizes the data based on optimization levels: base (not optimized), opt (optimized), and optm (optimized with materialized views), alongside the BSBM scale (2k-200k). Each Query mix 𝑀 corresponds to the geometric mean of the execution times of queries 𝑄 1 -𝑄 12 .</figDesc><table><row><cell></cell><cell></cell><cell></cell><cell></cell><cell>2k</cell><cell></cell><cell></cell><cell>20k</cell><cell></cell><cell></cell><cell>200k</cell></row><row><cell cols="2">setting system</cell><cell cols="4">base opt optm base</cell><cell cols="2">opt optm</cell><cell>base</cell><cell cols="2">opt optm</cell></row><row><cell>sc1</cell><cell></cell><cell>41</cell><cell></cell><cell></cell><cell>44</cell><cell></cell><cell></cell><cell>69</cell><cell></cell></row><row><cell>sc2</cell><cell></cell><cell>150</cell><cell></cell><cell></cell><cell>304</cell><cell></cell><cell></cell><cell>873</cell><cell></cell></row><row><cell></cell><cell>Teiid</cell><cell>107</cell><cell>60</cell><cell>59</cell><cell>282</cell><cell>72</cell><cell>71</cell><cell>1408</cell><cell>126</cell><cell>108</cell></row><row><cell>hom</cell><cell>Denodo Dremio</cell><cell cols="2">143 352 175 87</cell><cell>85 155</cell><cell>415 537</cell><cell>113 198</cell><cell>105 178</cell><cell>2098 1660</cell><cell>183 294</cell><cell>154 243</cell></row><row><cell></cell><cell>avg</cell><cell cols="2">200 107</cell><cell>100</cell><cell>411</cell><cell>128</cell><cell>118</cell><cell>1722</cell><cell>201</cell><cell>168</cell></row><row><cell></cell><cell>Teiid</cell><cell cols="2">533 299</cell><cell cols="3">217 5120 2179</cell><cell cols="3">1139 51690 17896</cell><cell>6877</cell></row><row><cell>het</cell><cell>Denodo Dremio</cell><cell cols="2">294 187 780 489</cell><cell cols="3">157 1818 418 2394 1321 926</cell><cell cols="2">629 17938 973 13141</cell><cell>7963 8440</cell><cell>3837 4652</cell></row><row><cell></cell><cell>avg</cell><cell cols="2">536 325</cell><cell cols="3">264 3111 1475</cell><cell cols="3">914 27590 11433</cell><cell>5122</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">https://teiid.io/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">https://www.denodo.com/en</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">https://www.dremio.com</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_3">http://wbsg.informatik.uni-mannheim.de/bizer/berlinsparqlbenchmark/spec/index.html</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_4">Specifically PostgresSQL, MariaDB, MySQL, MS SQL Server for setting hom and PostgresSQL, MS SQL Server, MongoDB and CSV files for setting het. See<ref type="bibr" target="#b5">[6]</ref> for further details, such as the allocations of tables to data sources.</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>This work has been carried out while Marco Di Panfilo was enrolled in the Italian National Doctorate on Artificial Intelligence run by Sapienza University of Rome in collaboration with Free University of Bozen-Bolzano.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Challenges and opportunities with big data</title>
		<author>
			<persName><forename type="first">A</forename><surname>Labrinidis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">V</forename><surname>Jagadish</surname></persName>
		</author>
		<idno type="DOI">10.14778/2367502.2367572</idno>
	</analytic>
	<monogr>
		<title level="j">Proc. VLDB Endow</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="page" from="2032" to="2033" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><surname>Doan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">Y</forename><surname>Halevy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><forename type="middle">G</forename><surname>Ives</surname></persName>
		</author>
		<title level="m">Principles of Data Integration</title>
				<imprint>
			<publisher>Morgan Kaufmann</publisher>
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">A systematic overview of data federation systems</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Gu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Corcoglioniti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Lanti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Mosca</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Xiao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Xiong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Calvanese</surname></persName>
		</author>
		<idno type="DOI">10.3233/SW-223201</idno>
	</analytic>
	<monogr>
		<title level="j">Semantic Web</title>
		<imprint>
			<biblScope unit="volume">15</biblScope>
			<biblScope unit="page" from="107" to="165" />
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Ontologybased data access: A survey</title>
		<author>
			<persName><forename type="first">G</forename><surname>Xiao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Calvanese</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Kontchakov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Lembo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Poggi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Rosati</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Zakharyaschev</surname></persName>
		</author>
		<idno type="DOI">10.24963/ijcai.2018/777</idno>
	</analytic>
	<monogr>
		<title level="m">Proc. of the 27th Int. Joint Conf. on Artificial Intelligence (IJCAI), IJCAI Org</title>
				<meeting>of the 27th Int. Joint Conf. on Artificial Intelligence (IJCAI), IJCAI Org</meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="5511" to="5519" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Dl-lite: Tractable description logics for ontologies</title>
		<author>
			<persName><forename type="first">D</forename><surname>Calvanese</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">D</forename><surname>Giacomo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Lembo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Lenzerini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Rosati</surname></persName>
		</author>
		<ptr target="http://www.aaai.org/Library/AAAI/2005/aaai05-094.php" />
	</analytic>
	<monogr>
		<title level="m">Proc. of the 20th National Conf. on Artificial Intelligence (AAAI)</title>
				<meeting>of the 20th National Conf. on Artificial Intelligence (AAAI)</meeting>
		<imprint>
			<publisher>AAAI Press / The MIT Press</publisher>
			<date type="published" when="2005">2005</date>
			<biblScope unit="page" from="602" to="607" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Ontology-based data federation</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Gu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Lanti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Mosca</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Xiao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Xiong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Calvanese</surname></persName>
		</author>
		<idno type="DOI">10.1145/3579051.3579070</idno>
	</analytic>
	<monogr>
		<title level="m">Proc. of the 11th Int. Joint Conf. on Knowledge Graphs IJCKG 2022</title>
				<meeting>of the 11th Int. Joint Conf. on Knowledge Graphs IJCKG 2022</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="10" to="19" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Ontology-based data federation -A framework proposal</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Gu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Calvanese</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">D</forename><surname>Panfilo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Lanti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Mosca</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Xiao</surname></persName>
		</author>
		<idno>CEUR-WS.org</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 31st Symposium of Advanced Database Systems</title>
				<meeting>the 31st Symposium of Advanced Database Systems<address><addrLine>Galzingano Terme, Italy</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2023">July 2nd to 5th, 2023. 2023</date>
			<biblScope unit="volume">3478</biblScope>
			<biblScope unit="page" from="210" to="219" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Linking data to ontologies</title>
		<author>
			<persName><forename type="first">A</forename><surname>Poggi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Lembo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Calvanese</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>De Giacomo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Lenzerini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Rosati</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-540-77688-8_5</idno>
	</analytic>
	<monogr>
		<title level="j">J. on Data Semantics</title>
		<imprint>
			<biblScope unit="volume">10</biblScope>
			<biblScope unit="page" from="133" to="173" />
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Tractable reasoning and efficient query answering in description logics: The DL-Lite family</title>
		<author>
			<persName><forename type="first">D</forename><surname>Calvanese</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>De Giacomo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Lembo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Lenzerini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Rosati</surname></persName>
		</author>
		<idno type="DOI">10.1007/s10817-007-9078-x</idno>
	</analytic>
	<monogr>
		<title level="j">J. of Automated Reasoning</title>
		<imprint>
			<biblScope unit="volume">39</biblScope>
			<biblScope unit="page" from="385" to="429" />
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Virtual Knowledge Graphs: An overview of systems and use cases</title>
		<author>
			<persName><forename type="first">G</forename><surname>Xiao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Ding</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Cogrel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Calvanese</surname></persName>
		</author>
		<idno type="DOI">10.1162/dint_a_00011</idno>
	</analytic>
	<monogr>
		<title level="j">Data Intelligence</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="201" to="223" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Tractable query answering and rewriting under description logic constraints</title>
		<author>
			<persName><forename type="first">H</forename><surname>Pérez-Urbina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Motik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Horrocks</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.jal.2009.09.004</idno>
	</analytic>
	<monogr>
		<title level="j">J. of Applied Logic</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="page" from="186" to="209" />
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Ontology-based data access: Ontop of databases</title>
		<author>
			<persName><forename type="first">M</forename><surname>Rodriguez-Muro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Kontchakov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Zakharyaschev</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-642-41335-3_35</idno>
	</analytic>
	<monogr>
		<title level="m">Proc. of the 12th Int. Semantic Web Conf. (ISWC)</title>
		<title level="s">Lecture Notes in Computer Science</title>
		<meeting>of the 12th Int. Semantic Web Conf. (ISWC)</meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2013">2013</date>
			<biblScope unit="volume">8218</biblScope>
			<biblScope unit="page" from="558" to="573" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">OBDA: query rewriting or materialization? in practice, both!</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">F</forename><surname>Sequeda</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Arenas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">P</forename><surname>Miranker</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-319-11964-9_34</idno>
	</analytic>
	<monogr>
		<title level="m">Proc. of the 13th Int. Semantic Web Conf. (ISWC)</title>
		<title level="s">Lecture Notes in Computer Science</title>
		<meeting>of the 13th Int. Semantic Web Conf. (ISWC)</meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2014">2014</date>
			<biblScope unit="volume">8796</biblScope>
			<biblScope unit="page" from="535" to="551" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Reformulation-based query answering in RDF: alternatives and performance</title>
		<author>
			<persName><forename type="first">D</forename><surname>Bursztyn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Goasdoué</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Manolescu</surname></persName>
		</author>
		<idno type="DOI">10.14778/2824032.2824093</idno>
	</analytic>
	<monogr>
		<title level="j">Proc. VLDB Endow</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="page" from="1888" to="1891" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Cost-driven ontology-based data access</title>
		<author>
			<persName><forename type="first">D</forename><surname>Lanti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Xiao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Calvanese</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-319-68288-4_27</idno>
	</analytic>
	<monogr>
		<title level="m">Proc. of the 16th Int. Semantic Web Conf. (ISWC)</title>
		<title level="s">Lecture Notes in Computer Science</title>
		<meeting>of the 16th Int. Semantic Web Conf. (ISWC)</meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2017">2017</date>
			<biblScope unit="volume">10587</biblScope>
			<biblScope unit="page" from="452" to="470" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">The virtual knowledge graph system Ontop</title>
		<author>
			<persName><forename type="first">G</forename><surname>Xiao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Lanti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Kontchakov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Komla-Ebri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Güzel-Kalayci</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Ding</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Corman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Cogrel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Calvanese</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Botoeva</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-030-62466-8_17</idno>
	</analytic>
	<monogr>
		<title level="m">Proc. of the 19th Int. Semantic Web Conf. (ISWC)</title>
		<title level="s">Lecture Notes in Computer Science</title>
		<meeting>of the 19th Int. Semantic Web Conf. (ISWC)</meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2020">2020</date>
			<biblScope unit="volume">12507</biblScope>
			<biblScope unit="page" from="259" to="277" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<title level="m" type="main">Foundations of Databases</title>
		<author>
			<persName><forename type="first">S</forename><surname>Abiteboul</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Hull</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Vianu</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1995">1995</date>
			<publisher>Addison Wesley Publ. Co</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">A relational model of data for large shared data banks</title>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">F</forename><surname>Codd</surname></persName>
		</author>
		<idno type="DOI">10.1145/362384.362685</idno>
	</analytic>
	<monogr>
		<title level="j">Communications of the ACM</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="page" from="377" to="387" />
			<date type="published" when="1970">1970</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Ontology-based RDF integration of heterogeneous data</title>
		<author>
			<persName><forename type="first">M</forename><surname>Buron</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Goasdoué</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Manolescu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M.-L</forename><surname>Mugnier</surname></persName>
		</author>
		<author>
			<persName><surname>Obi-Wan</surname></persName>
		</author>
		<idno type="DOI">10.14778/3415478.3415512</idno>
	</analytic>
	<monogr>
		<title level="j">Proc. VLDB Endow</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="page" from="2933" to="2936" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Squerall: Virtual ontology-based access to heterogeneous and large data sources</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">N</forename><surname>Mami</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Graux</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Scerri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Jabeen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Auer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Lehmann</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-030-30796-7_15</idno>
	</analytic>
	<monogr>
		<title level="m">Proc. of the 18th Int. Semantic Web Conf. (ISWC)</title>
				<meeting>of the 18th Int. Semantic Web Conf. (ISWC)</meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2019">2019</date>
			<biblScope unit="volume">11779</biblScope>
			<biblScope unit="page" from="229" to="245" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">One size does not fit all: Querying web polystores</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Khan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Zimmermann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Jha</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Gadepally</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>D'aquin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Sahay</surname></persName>
		</author>
		<idno type="DOI">10.1109/ACCESS.2018.2888601</idno>
	</analytic>
	<monogr>
		<title level="j">IEEE Access</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="page" from="9598" to="9617" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Querying web polystores</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Khan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Zimmermann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Jha</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Rebholz-Schuhmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Sahay</surname></persName>
		</author>
		<idno type="DOI">10.1109/BIGDATA.2017.8258299</idno>
	</analytic>
	<monogr>
		<title level="m">Proc. of IEEE Int. Conf. on Big Data (IEEE BigData)</title>
				<meeting>of IEEE Int. Conf. on Big Data (IEEE BigData)</meeting>
		<imprint>
			<publisher>IEEE Computer Society</publisher>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="3190" to="3195" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">The Berlin SPARQL benchmark</title>
		<author>
			<persName><forename type="first">C</forename><surname>Bizer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Schultz</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Int. J. on Semantic Web and Information Systems</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="page" from="1" to="24" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Ontop: Answering SPARQL queries over relational databases</title>
		<author>
			<persName><forename type="first">D</forename><surname>Calvanese</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Cogrel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Komla-Ebri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Kontchakov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Lanti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Rezk</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Rodriguez-Muro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Xiao</surname></persName>
		</author>
		<idno type="DOI">10.3233/SW-160217</idno>
	</analytic>
	<monogr>
		<title level="j">Semantic Web J</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="page" from="471" to="487" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<monogr>
		<title level="m" type="main">Efficient handling of SPARQL OPTIONAL for OBDA (extended version</title>
		<author>
			<persName><forename type="first">G</forename><surname>Xiao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Kontchakov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Cogrel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Calvanese</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Botoeva</surname></persName>
		</author>
		<idno>CoRR abs/1806.05918</idno>
		<ptr target="http://arxiv.org/abs/1806.05918.arXiv:1806.05918" />
		<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<monogr>
		<title level="m" type="main">Scaling knowledge graphs for automating AI of digital twins</title>
		<author>
			<persName><forename type="first">J</forename><surname>Ploennigs</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Semertzidis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Lorenzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Mihindukulasooriya</surname></persName>
		</author>
		<idno type="DOI">10.48550/ARXIV.2210.14596</idno>
		<idno type="arXiv">arXiv:2210.14596</idno>
		<ptr target="https://doi.org/10.48550/arXiv.2210.14596" />
		<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
