<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Doctoral Consortium, September</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Towards Techniques for Updating Virtual Knowledge Graphs</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Romuald Esdras Wandji</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mantas Šimkus</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Diego Calvanese</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computing Science, Umeå Universitet</institution>
          ,
          <addr-line>Umeå</addr-line>
          ,
          <country country="SE">Sweden</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Research Centre for Knowledge and Data, Faculty of Engineering, Free University of Bozen-Bolzano</institution>
          ,
          <addr-line>Bolzano</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <volume>1</volume>
      <fpage>8</fpage>
      <lpage>20</lpage>
      <abstract>
        <p>The field of Virtual Knowledge Graphs (VKGs) continues to grow in both academic and applied contexts. Yet, the issue of updates in VKG systems has not yet received adequate attention, although it is crucial to manage data modifications at the data source level through the lens of an ontology. In this paper, we focus on VKGs whose ontology is specified in the lightweight ontology language DL-Lite, and we propose diverse settings and research directions we intend to explore to address the challenge of translating ontology-based updates into updates at the level of data sources. We also pay attention to the important problem of automated analysis of mappings, which plays a major role when it comes to reformulating ontology-based update requests into update requests over the data sources.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Knowledge Representation</kwd>
        <kwd>Virtual Knowledge Graph (VKG)</kwd>
        <kwd>Ontology-based Data Access</kwd>
        <kwd>View Updates</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        the ontology layer can be translated into an equivalent first-order query that can be executed
over the data source layer [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        The focus of research in VKGs has been on query answering, i.e., on using the ontology layer
and the corresponding knowledge graph to extract data specified through a query from the
underlying data sources. However, ontology-based updates in VKGs remains a challenging
task, which refers to modifying its extensional content with new knowledge that may or may
not be consistent w.r.t. the original one. This task comes with some important challenges.
The first is due to the complex logic-based inferences in ontologies that lead to the need for a
non-deterministic approach to updating knowledge bases [
        <xref ref-type="bibr" rid="ref8 ref9">8, 9</xref>
        ]. Another important challenge
comes from the presence of mappings that are essentially considered views that define ontology
elements in terms of queries over the data source. Thus, each update over an ontology element
can be translated into an update over the view that combines all queries that correspond to
mappings for that ontology element and thus faces the view-update problem [
        <xref ref-type="bibr" rid="ref10 ref11 ref12">10, 11, 12</xref>
        ]. Several
methodologies have emerged within the realm of ontology-based updates, chiefly addressing
two primary classes of updates as delineated in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. The first category is the instance-level
(or ABox) updates, where an update consists of changing assertions about individual entities
in the ontology. This is the most common form of updates studied in the literature [
        <xref ref-type="bibr" rid="ref13 ref14 ref15">13, 14,
15, 16</xref>
        ], originating from the work in [17]. Concurrently, an alternative update methodology
is characterized by a formula-based approach [18, 19], which encapsulates updates as logical
formulas, representing a semantic change that needs to be reflected in the ontology and is an
example of the Set-Of-Theories approach [20]. However these two update approaches are limited
to the ontology layer, i.e., they allow users to modify the extensional content of the ontology,
but mappings and thus updates of actual data sources are not covered by these previous works.
      </p>
      <p>The main focus of this PhD work is to address the fact that little attention has been paid so
far to the problem of updating VKGs through operations that are performed at the ontology
level, where changes applied over the ontology are translated into minimally suitable updates
over the underlying database through the mappings. A solution to this problem would be of
great practical relevance since it would allow the management of the key operations that are of
interest in an information system (i.e., queries and updates) through the lens of ontology.</p>
      <p>This article initially presents our motivation behind exploring the concept of ontology-based
updates within the context of VKGs. To this end, we recall the notion of knowledge base
updates as it is investigated in the literature, which provides us key definitions that will be
useful for further investigation in our specific research context. Subsequently, we introduce the
concept of updates in VKGs, where the goal is to manipulate data source instances by means
of operations performed over the ontology layer. We outline the current research challenges
for the study of VKG updates, which include defining an inconsistency-tolerant semantics for
updates in VKGs, investigating computational properties of update and query operations under
such semantics, and finding conditions under which VKG updates can bypass ontology axioms
(via query rewriting). Furthermore, we want to study how to enrich a given VKG system with
additional information to ensure a unique translation of updates at the ontological level to
updates of the data sources.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Preliminaries</title>
      <p>Here we briefly introduce standard DL Knowledge Bases (KBs). Subsequently, we provide the
definition of DL-Lite, the specific DL variant that forms the focus of this research study. Our
exposition concludes with an explanation of the concept of VKGs and its principal components.
In the following, we assume to have three distinct, countably infinite alphabets:  for ontology
predicates,  for source predicates, and  for constants. These alphabets are pairwise disjoint.</p>
      <sec id="sec-2-1">
        <title>2.1. Description Logic knowledge bases</title>
        <p>
          Description Logics (DLs) [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] serve as the foundation of contemporary knowledge representation
systems, designed to depict the universe of discourse in a logically consistent and
computationally tractable manner. Acting as the building blocks of this domain, DLs employ two
fundamental entities: concepts and roles. Concepts denote unary relations, whereas roles denote
binary relations, and both are constructed by applying specific constructs starting from atomic
concepts and roles.
        </p>
        <p>DLs allow for modeling a domain of interest by employing concepts and roles within a
knowledge base (KB). Specifically, a KB  = ⟨ , ⟩ in a DL consists of two components: the
TBox  and the ABox . A TBox is the intensional level of the KB, capturing the terminological
knowledge (or schema) of the domain. An ABox comprises information at the instance level,
providing concrete instances of the concepts and roles defined in the TBox.</p>
        <p>
          In this paper, we center our attention on a specific family of DLs known as DL-Lite [
          <xref ref-type="bibr" rid="ref1 ref7">7, 1</xref>
          ],
which aligns with the tractable OWL 2 QL profile [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] of the Web Ontology Language OWL 2,
and, specifically, we consider DL-Lite, which is one of the most expressive variants in the
DL-Lite family for which a query-rewriting based approach to query answering can be adopted.
In DL-Lite, a TBox is defined as a finite set of intensional assertions of the form, 1 ⊑ 2
(concept inclusion), 1 ⊑ ¬2 (concept disjointness), 1 ⊑ 2 (role inclusion), 1 ⊑ ¬2 (role
disjointness),and (funct ) (functionality). Here,  (possibly sub-scripted) denotes an atomic
role  or its inverse  − . Instead,  (possibly sub-scripted) denotes a basic concept, which is
either an atomic concept , or a concept defined as ∃, representing the set of objects that
appear as the first argument of . Finally, (funct ) asserts the functionality of , i.e., that
its first argument operates as a key of the relation denoted by . An ABox in DL-Lite is
represented as a finite set of assertions of the form () or  (, ), with  and  belonging to
 .
        </p>
        <p>
          The semantics of a DL KB is given in terms of First-Order interpretations, referring to the
interpretation structures of First-Order Logic [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. An interpretation ℐ = ⟨∆ ℐ , · ℐ ⟩ consists of an
interpretation domain ∆ ℐ and an interpretation function · ℐ , which maps each atomic concept
 to a set ℐ ⊆ ∆ ℐ , each atomic role  to a set  ℐ ⊆ ∆ ℐ × ∆ ℐ , and each individual  to an
element ℐ ∈ ∆ ℐ . The role  − is interpreted as the inverse of the relation  ℐ , while (∃)ℐ =
{ | there exists ′ s.t. (, ′) ∈ ℐ }. Finally, (¬)ℐ = ∆ ℐ ∖ ℐ , and (¬)ℐ = ∆ ℐ × ∆ ℐ ∖ ℐ .
An interpretation ℐ is said to be a model of (or satisfies ) an ABox assertion () (resp.,  (, ))
if ℐ ∈ ℐ (resp., (ℐ , ℐ ) ∈  ℐ ), and it is a model of a TBox assertion 1 ⊑ 2 (where 1 and
2 are either concepts or roles) if 1ℐ ⊆ 2ℐ . An interpretation ℐ is a model of a KB  = ⟨ , ⟩
if it is a model of all assertions in  and . We denote with Mod () the set of all models of .
A KB  is consistent if it has at least one model, i.e., Mod () ̸= ∅. We say that  is consistent
with  , or  -consistent, if ⟨ , ⟩ is consistent, and  -inconsistent otherwise. Given an ABox 
and a TBox  , we denote by cl  () the closure of  w.r.t.  that is, the set of ABox assertions
over individuals in  that are logically implied by ⟨ , ⟩. Similarly, the deductive closure of a
TBox  , denoted cl ( ) is the set of inclusions and functionality assertions that follow from
 . Finally, given two ABoxes  and ′, we say that  is logically equivalent to ′ w.r.t.  ,
denoted  ≡  ′ if cl  () = cl  (′).
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Virtual Knowledge Graphs</title>
        <p>
          We recall here the main elements of the VKG framework, which is also known as Ontology-based
Data Access (OBDA) [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. A VKG specification is a tuple  = ⟨ , ℳ, ⟩, where (i)  is a TBox
(expressed in OWL 2 QL), (ii)  is a (possibly federated) data source schema, and (iii) ℳ is a set
of global-as-view (GAV) mapping assertions [21], each of which specifies how to populate a
concept or a role of the ontology in terms of a query over the data source. Specifically, each
mapping assertion has the form  (⃗) ⇝  (⃗) where  (⃗) is a first-order query (typically
written in SQL syntax) over the predicates in  of the data source , and  (⃗) is a conjunction
of atoms over  .
        </p>
        <p>A VKG instance is a pair  = ⟨, ⟩ where  = ⟨ , ℳ, ⟩ is a VKG specification , and
 is a relational database instance compliant with . Its semantics is defined in terms of FO
interpretations over  . An interpretation ℐ is a model of ⟨, ⟩ if (i) ℐ |=  and (ii) ℐ satisfies
the set ℳ() of facts retrieved through ℳ from , where ℳ() is formally defined as follows:
ℳ() = { (⃗) | ⃗ ∈ eval ( (⃗), ) and  (⃗) ⇝  (⃗) ∈ ℳ},
where eval ( (⃗), ) is the evaluation of  (⃗) over . In other words, an interpretation ℐ is
a model of ⟨, ⟩ if it is a model of ⟨ , ℳ()⟩. It is worth noting that the retrieved ABox
ℳ() is kept virtual, which means that for tasks such as query answering, we do not need to
compute it.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Motivation</title>
      <p>Let us consider a VKG instance  = ⟨, ⟩ with the specification  = ⟨ , ℳ, ⟩. We are
interested in ABox updates that consist of basic operations of two types, namely insertions and
deletions of ABox facts. Formally, an ABox update is a pair  = ⟨ ,  − ⟩, where + denotes
+
a finite set of ABox insertions and  − denotes a finite set of ABox deletions, where each insertion
and deletion is represented as an ABox assertion. We would like to understand how these ABox
updates should be reflected in updates at the level of the data sources (DSs). Similarly to ABox
updates, we also represent a data source update as a pair  = ⟨ ,  − ⟩, where + denotes a
+
ifnite set of data source insertions and  − denotes a finite set of data source deletions, each of
them represented by a data source fact.</p>
      <p>Let us consider for now the setting of a plain KB  = ⟨ , ⟩, for instance, a university’s
KB, which contains comprehensive information about students and their ongoing programs.
Suppose that several students have changed their academic majors, and the update , is a list
of these students with their new majors. This new data may conflict with some assertions in
the ABox . For example, if in  no student can be registered in two diferent majors at the
same time, and  asserts that John is a Computer Science (CS) major, but should be registered
in Physics due to , then John can no longer be a CS major. To reconcile the conflict between
the existing information in the KB and the new data in , we can update  to reflect that John
actually has switched majors, which involves removing the outdated major in CS assignment.
In most scenarios, to resolve inconsistencies that originate from changes at the instance level, it
seems to be more intuitive to update the instance level rather than the intensional level. This
means we would want the updated KB ′ to be of the form ′ = ⟨ , ′⟩, where computing an
ABox update results in a new ABox ′, that along with the original TBox  , represents the
outcome of the update process.</p>
      <p>However, in a VKG setting, a vital additional component is present – the underlying data
source, , which is mapped to the ABox through a set ℳ of declarative mappings. When
ABox updates are triggered by real-world changes (like John changing his major), it is not
only critical to maintain consistency in the ABox, but it is also crucial to ensure that these
updates propagate to , preserving a harmonized state across the VKG system. In an ideal VKG
framework featuring update propagation, when an update is performed at the ABox level, the
changes should be automatically reflected in the underlying data source via the mappings. This
notion implies a change to the structure of the VKG system from  = ⟨, ⟩ to  ′ = ⟨, ′⟩,
highlighting the necessity of an update mechanism capable of propagating changes from ABox
updates to the actual database. This kind of mechanism would considerably improve data
consistency, reliability, and the practical utility of VKG systems in dynamic environments.</p>
      <p>In this respect, we discuss in the next section three main goals to be achieved: First, we intend
to formalize the framework of updates in the context of VKGs. Second, we delve into the task
of computing updates at the ABox level of a VKG by propagating them to the underlying data
source. And last, we propose an analysis of how schema mappings and ontology axioms afect
updates in VKGs.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Updates in VKG systems</title>
      <p>
        4.1. Updates in DL-Lite KBs
The problem of instance-level updates in DL KBs has been studied extensively in the past
[
        <xref ref-type="bibr" rid="ref13 ref14 ref15">13, 14, 16, 15, 22</xref>
        ], and since we are in a setting of incomplete information, insertion operations
and their combination may generate an inconsistency in the ABox with respect to the axioms
specified in the TBox 1.
      </p>
      <p>
        Hence, the work on KB update is closely related to the work on inconsistency management
under incomplete information. Three main strategies have been identified as ways to deal with
inconsistency in such a context: (i) the first one consists of adjusting the constraints in the
TBox  so that the updated ABox remains consistent; (ii) the second one, called consistent query
answering [23], is concerned with answering queries while keeping the inconsistency in the KB;
1Note that, under the open world assumption typically holding in DL KBs, deletion operations instead do per se not
generate inconsistencies.
(iii) the third one, which is more appropriate in our context (and in which we are interested),
consists of removing facts from the original ABox to restore consistency [16, 17]. As shown
in the literature [
        <xref ref-type="bibr" rid="ref9">9, 24</xref>
        ], whenever an update is applied over a KB, it is convenient to comply
with the minimal change principle, which states that the resulting KB ′ = ⟨ , ′⟩ should be
as close as possible to the original one. In order to characterize the semantics of such a system,
it becomes therefore necessary to rely on a suitably-defined consistency-tolerant semantics,
e.g., based on the notion of repair.
      </p>
      <p>
        In order to properly formalize the notion of KB updates, we need to define what it means for
an update  to be coherent w.r.t. to a TBox  . These notions are inspired by [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
+
Definition 1 (Update coherence with  ). Given a KB  = ⟨ , ⟩, an update  = ⟨ ,  − ⟩
over  is coherent with the TBox  if (i) Mod (⟨ , + ⟩) ̸= ∅ and (ii) cl (+ ) ∩  − = ∅.
      </p>
      <p>The above definition exhibits the conditions required to be satisfied in order to validate the
coherence of an update w.r.t the ontology. The first condition states that all the facts to be added
should be consistent with the TBox  and the second condition is added to ensure that there
is no insertion and deletion of the same fact. This definition leads us to the characterization
of accomplishment of updates over a KB which is based on the notion of the minimal change
principle
+
Definition 2 (Accomplishing an update). Let  = ⟨ , ⟩ be a KB,  = ⟨ ,  − ⟩ an update
coherent with  , and ′ an ABox. Then ′ accomplishes the update of  with  if (i) ′ is
 -consistent, (ii) ⟨ , ′⟩ ̸|=  , for each  ∈  − , and (iii) ′ = ′′ ∪ + for some ′′ ⊆  .</p>
      <p>As we want to ensure that the resulting KB remains as close as possible to the original one,
a semantic for the closeness measure has to be taken into account when comparing diferent
resulting ′ accomplishing  [25, 19, 26]. That’s why the term close does not have a general
definition that will fit with all circumstances when it comes to KB evolution. However, the
notion of maximal accomplishment of an update has to be formalized, regardless of the chosen
closeness measure.
+
Definition 3 (Maximally accomplishing an update). Let  = ⟨ , ⟩ be a KB,  = ⟨ ,  − ⟩
an update that is coherent with the TBox  , and ′ an ABox that accomplishes the update of 
with . Then ′ maximally accomplishes the update of  with  if there is no ′1 ⊋ ′ that
accomplishes the update of  with .</p>
      <p>In the case of DL-Lite, it was shown in [26] that for any KB update  there is a unique
ABox ′ that maximally accomplishes it. For what follows, we denote by  ∙  the KB ⟨ , ′⟩,
where ′ is the ABox that maximally accomplishes the update of  with .</p>
      <p>
        The definitions introduced so far in this section are useful to formally define a KB update
within the context of VKG systems  = ⟨, ⟩, where  = ⟨ , ℳ, ⟩, as an update over
⟨ , ℳ()⟩. Computing such update essentially consists in computing the set of insertions
and deletions of facts over ℳ(). In [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], a Datalog-based non-recursive algorithm has been
proposed to derive the set of insertions (and deletions) of facts over ℳ() from an update .
      </p>
      <sec id="sec-4-1">
        <title>4.2. Updates over mapped data source</title>
        <p>In order to maintain consistency between the knowledge graph and the data source, whenever
a change is applied over the knowledge base we need to accurately propagate it down to the
data source. This ensures that the source mirrors the requested change through the mappings.
In this context, we have to extend the notion of KB updates to the VKG setting: we consider
that the ABox facts in  are provided through the mapping ℳ from a data source .</p>
        <p>Since the knowledge graph is kept virtual, the only way to realize a KB update  is to
ifnd an appropriate database state ′ from which it is possible to reach the updated ABox ′.
The translation of KB updates into suitable database (DB) updates is one of the key problems
we are interested in this research, and to formalize it we first need to introduce the notion of
realizability for KB updates.</p>
        <p>Definition 4 (Realizability). Let  = ⟨⟨ , ℳ, ⟩, ⟩ be a VKG instance forming a KB  =
⟨ , ⟩ with  = ℳ(), and let  be a KB update coherent with  . Then,  is realizable over
 through ℳ if there exists a DB instance ′ such that ℳ(′) =  ∙ .</p>
        <p>
          In the context of VKGs, since mappings are seen as view functions over the underlying data
source, the existence of a DB instance as specified in the above definition, is related to the
notion of translatability of view updates described in [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ].
        </p>
        <p>However, due to the ambiguities provided by the presence mappings, there might exist
multiple DB instances that realize a given KB update. Such ambiguity, which reflects the
non-injectivity of the mappings, is defined in the following:
Definition 5 (Ambiguous Mapping). A mapping ℳ is said to be ambiguous if there exist two
DB instances 1 and 2 s.t. (i) 1 ̸= 2 and (ii) ℳ(1) = ℳ(2).</p>
        <p>The above definition shows how the injectivity of the mapping is tightly connected to
updates in VKG instances. Specifically, this tells us that there might exist multiple DB instances
that accomplish a given KB update when the mapping component is not injective. However,
whenever a mapping ℳ is injective, every KB updates are uniquely translatable and is illustrated
in the following proposition:
Definition 6 (Unique realizability). Let  = ⟨⟨ , ℳ, ⟩, ⟩ be a VKG instance forming a KB
 = ⟨ , ⟩ with  = ℳ(), and let  be a KB update coherent with  . Then,  is uniquely
realizable over  through ℳ if there exists a unique DB instance ′ such that ℳ(′) =  ∙ .
Proposition 1. Let ℳ be a non-ambiguous mapping of VKG instance  = ⟨⟨ , ℳ, ⟩, ⟩, and
let  be an update coherent with  . If  is realizable, then  is also uniquely realizable.
Proof. It is clear that if two DB instances 1 and 2 realize a KB update , then we get
ℳ(1) = ℳ(2), which, by non-ambiguity of ℳ, entails 1 = 2.</p>
        <p>As stated in [27], the converse does not hold, i.e., even if ℳ is ambiguous, a KB update
 that is realizable might still be uniquely realizable. Therefore, in order to deal with the
ambiguity provided by the presence of mappings in practical VKG systems (which represents a
major concern in our research), we pose in the following sections the main research challenges
we are facing, and any answer to them would benefit us in continuing our research.
In a typical VKG instance, where the mapping component ℳ is ambiguous, some KB updates
will result in diferent DB instance candidates realizing them. As an example, if a mapping
maps both the Student and Professor relations from the DB schema to the Person atom in the
VKG, a KB update that adds a new person will result in two possible DB instances, one with an
update to the Student relation and the other with an update to the Professor relation. To achieve
automated propagation of updates to the data sources, one approach is to develop methods to
automatically select the most suitable DB instance that realizes the desired VKG update based
on some preferences initially provided by the users. We should note that all DB instances that
realize a KB update are equivalent w.r.t. the mapping and the TBox, i.e., they create ABoxes
that are equivalent with respect to the TBox. This notion of equivalence is defined as follows:
Definition 7 (ℳ-equivalence of DB instances). Given a VKG specification  = ⟨ , ℳ, ⟩ and
two DB instances 1, 2 for , we say that 1 and 2 are ℳ-equivalent w.r.t.  if ℳ(1) ≡ 
ℳ(2).</p>
        <p>In order to reduce the space of possible DB instances that realize a given KB update, we take
advantage of the notion of repair by complying with the minimal change principle w.r.t. the
original instance. Moreover, we want to base our selection of repair on a set of preferences
provided by the user, as explained in [28]. This brings us to the following research questions:
Q1: How can we formalize a preference-based selection of DB instances that realize updates over
VKGs?
Q2: Given a VKG instance  = ⟨⟨ , ℳ, ⟩, ⟩, how can we formalize a suitable notion of
interference of predicates over  that may occur when propagating any given KB update, and
how can we actually compute such interferences?
For instance, in our previous example, the relations Student and Professor are interfering since
they map to the same atom in the KB.</p>
        <p>The above research question is helpful to check whether a priority operator based on
preferences given by the user is total, in the sense that it resolves all interferences. This is because a
preference that cannot be further extended should specify how to resolve every interference
[28]. This leads to a further research question:
Q3: What language is suitable to express user preferences in the context of updates in VKGs?</p>
        <p>Finally, we also want to consider the case where KB updates are not realizable, i.e., given a
KB  updated with , there is no DB instance  such that ℳ() =  ∙ . The absence of
realizability entails the presence of DB instances that are not ℳ-equivalent and that create
diferent approximations of  ∙ . This adds a further challenge, which consists of choosing
the DB instance ′ such that the KB ′ = ⟨ , ℳ(′)⟩ is minimally distant from  ∙ ,
for a suitably defined notion of distance. Therefore, we need to provide a relaxed version of
realizability, which consists of finding a DB instance that realizes at best a given KB update .
This brings us to the following research question:
Q4: Given a non-realizable KB update  over a KB , how can we select the DB instance that
through ℳ approximates at best  ∙ ?</p>
        <sec id="sec-4-1-1">
          <title>4.2.2. KB update rewriting in VKGs</title>
          <p>
            In this section, we are interested in statically analyzing updates in the context of VKGs, i.e.,
given a VKG specification  = ⟨ , ℳ, ⟩ and a KB update , we want to compute a suitable
DB update  that is its translation and that updates any data source in such a way that via the
mapping it generates the updated KB. Following [
            <xref ref-type="bibr" rid="ref10">10</xref>
            ], which deals with view-updates, such a
translation can be formalized as follows.
          </p>
          <p>Definition 8 (Translation). Given a VKG specification  = ⟨ , ℳ, ⟩ and a KB update , a
DB update  is a translation of  if for every DB instance  satisfying  from which we have
 = ⟨ , ℳ()⟩, we have that: (1)  ∙  = ⟨ , ℳ( ∙ )⟩ and (2)  ∙  =  whenever
 ∙  = .</p>
          <p>Based on the above definition, we want to be able to statically compute the translation (or set
of translations) of a given KB update, which is composed of a set of insertions and deletions of
relations over the data source.</p>
          <p>
            Unlike the method proposed in [
            <xref ref-type="bibr" rid="ref14">14</xref>
            ], which is based on a Datalog Program that considers
a DB instance, we want to propose a static method that supports the query rewriting
algorithm PerfectRef [
            <xref ref-type="bibr" rid="ref7">7</xref>
            ] in order to entail the set of insertions/deletions over the KB. Finally, we
want to unfold each KB insertion/deletion w.r.t. to a given mapping in order to compute the
corresponding update over the data source.
          </p>
          <p>In this setting, we want to formalize the notion of static realizability, which is defined as
follows:
Definition 9 (Static realizability). Let  = ⟨ , ℳ, ⟩ be a VKG specification and  a KB
update. Then, we say that  is statically realizable if there exists a DB update  over the
signature  such that ⟨ , ℳ( ∙ )⟩ = ⟨ , ℳ()⟩ ∙ , for every DB instance  compliant
with .</p>
          <p>Given this notion of static realizability, we can now formulate a further research question:
Q5: How can KB updates be statically and efectively realized in state-of-the-art VKGs that support
query rewriting?</p>
        </sec>
        <sec id="sec-4-1-2">
          <title>4.2.3. Mapping analysis in VKGs</title>
          <p>Finally, we want to study mapping analysis in the scope of VKGs, especially its inherent
properties that are intrinsically connected to VKG updates. The task of mapping specification
within the context of VKGs can be complex, necessitating a deep understanding of both the
ontology and the associated data source. This nuanced "cognitive distance" between the data
sources and the ontology often prompts the formulation of a complex mapping, expressed via
assertions that correlate queries over the ontology with those over the data sources. A particular
interest to us is the concept of injectivity of schema mapping—a facet that bears significant
implications for the translation of KB updates into DB updates within the framework of VKG.</p>
          <p>The injectivity property of the mapping is particularly crucial for updates, as it ensures that
each change in the ontology corresponds to a unique change in the DB. If a mapping is not
injective, a single update at the ontology level could potentially correspond to multiple updates
at the DB level, creating ambiguity in the translation process. This ambiguity could lead to
inconsistent states between the ontology and the DB, undermining the overall integrity of the
system. However, the absence of injectivity in the schema mapping does not entirely prevent the
translation of updates. There may be situations where updates can be uniquely translated even
if the schema mapping is not injective. Still, such scenarios are more challenging to manage
and typically require additional mechanisms or constraints to ensure the correct translation of
updates.</p>
          <p>As a first step towards analyzing mapping in VKG, we want to determine whether a given
mapping specification is injective (or ambiguous) and in order to achieve it we are planning to
use an automated model prover.</p>
          <p>Q6: Given a VKG specification  = ⟨ , ℳ, ⟩, how can we check whether the mapping component
ℳ is injective?</p>
          <p>To address this research question, our planned approach is to harness the capabilities of
automated tools, such as the Z3 model prover-a highly proficient theorem prover developed
under the auspices of Microsoft Research [29]. The ultimate aim is to integrate this tool into
the Ontop system infrastructure for mapping analysis.</p>
          <p>Also, we express interest in non-injective mappings, particularly in deriving the necessary
information from the system to facilitate a deterministic translation of KB updates. This is
tightly connected to work on unique solutions in data exchange, as studied in [30] where we
want to ensure that for any subjective mapping, there is always a unique data source state from
which it’s possible to compute a given KB. This is formulated in the following research question:
Q7: Given a VKG specification  = ⟨ , ℳ, ⟩ where ℳ is not injective, what additional
information is necessary in order to maintain consistency between the knowledge base and the DB
via the mapping?</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusions</title>
      <p>This paper aims to address foundational and applied issues in the realm of Virtual Knowledge
Graphs (VKGs), focusing on the introduction and exploration of Ontology-based updates and
evolution within the context. We explored the multifaceted issue of translating knowledge base
updates into database updates in the setting of VKGs. The crux of our research challenges stems
from the dual nature of the problem: maintaining consistency in an incomplete information
context and the inherent ambiguity introduced by mappings between ontology and data sources.</p>
      <p>Regarding the research questions mentioned in this paper, we have already obtained some
preliminary results. However, these methods require extensive discussion and need to be further
analyzed to ascertain their validity and the potential for being extended.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This work was partially supported by the Wallenberg AI, Autonomous Systems, and Software
Program (WASP), funded by the Knut and Alice Wallenberg Foundation, and by the Province
of Bolzano and DFG through project D2G2 (DFG grant no. 500249124). Šimkus was partially
supported by the Austrian Science Fund (FWF) project P30873.
46–56.
[16] G. De Giacomo, M. Lenzerini, A. Poggi, R. Rosati, On instance-level update and erasure in
description logic ontologies, J. of Logic and Computation 19 (2009) 745–770.
[17] M. Winslett, A model-based approach to updating databases with incomplete information,</p>
      <p>ACM Trans. on Database Systems 13 (1988) 167–196.
[18] M. Lenzerini, D. F. Savo, Updating inconsistent description logic knowledge bases, in:</p>
      <p>Proc. of the 20th Eur. Conf. on Artificial Intelligence (ECAI), 2012, pp. 516–521.
[19] M. L. Ginsberg, D. E. Smith, Reasoning about action I: A possible worlds approach, Artificial</p>
      <p>Intelligence 35 (1988) 165–195.
[20] R. Fagin, J. D. Ullman, M. Y. Vardi, On the semantics of updates in databases, in: Proc. of
the 2nd ACM Symp. on Principles of Database Systems (PODS), 1983, pp. 352–365.
[21] M. Lenzerini, Data integration: A theoretical perspective., in: Proc. of the 21st ACM Symp.
on Principles of Database Systems (PODS), 2002, pp. 233–246. doi:10.1145/543613.
543644.
[22] D. Zheleznyakov, E. Kharlamov, W. Nutt, D. Calvanese, On expansion and contraction of
DL-Lite knowledge bases, J. of Web Semantics 57 (2019) 1–19. doi:10.1016/j.websem.
2018.12.002.
[23] L. Bertossi, Database Repairs and Consistent Query Answering, Morgan &amp; Claypool</p>
      <p>Publishers, 2011.
[24] G. Flouris, D. Plexousakis, Handling Ontology Change: Survey and Proposal for a Future
Research Direction, Technical Report TR-362 FORTH-ICS, Institute of Computer Science,
Forth, Greece, 2005.
[25] M. L. Ginsberg, Counterfactuals, Artificial Intelligence 30 (1986) 35–79.
[26] D. Calvanese, E. Kharlamov, W. Nutt, D. Zheleznyakov, Evolution of DL-Lite knowledge
bases, in: Proc. of the 9th Int. Semantic Web Conf. (ISWC), volume 6496 of Lecture Notes in
Computer Science, Springer, 2010, pp. 112–128. doi:10.1007/978-3-642-17746-0_8.
[27] E. Franconi, P. Guagliardo, On the translatability of view updates, in: Proc. of the 6th
Alberto Mendelzon Int. Workshop on Foundations of Data Management (AMW), volume
866 of CEUR Workshop Proceedings, CEUR-WS.org, 2012, pp. 154–167.
[28] S. Staworko, J. Chomicki, J. Marcinkowski, Prioritized repairing and consistent query
answering in relational databases, Ann. of Mathematics and Artificial Intelligence 64 (2012)
209–246. doi:10.1007/s10472-012-9288-8.
[29] L. De Moura, N. Bjørner, Z3: An eficient SMT solver, in: Proc. of the Int. Conf. on Tools
and Algorithms for the Construction and Analysis of Systems, Springer, 2008, pp. 337–340.
[30] N. Ngo, E. Franconi, Unique solutions in data exchange, in: Proc. of the 25th Int. Conf.
on Database and Expert Systems Applications (DEXA), volume 8645 of Lecture Notes in
Computer Science, Springer, 2014, pp. 281–294.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Poggi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lembo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          , G. De Giacomo,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lenzerini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Rosati</surname>
          </string-name>
          ,
          <article-title>Linking data to ontologies</article-title>
          ,
          <source>J. on Data Semantics</source>
          <volume>10</volume>
          (
          <year>2008</year>
          )
          <fpage>133</fpage>
          -
          <lpage>173</lpage>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>540</fpage>
          -77688-
          <issue>8</issue>
          _
          <fpage>5</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          , G. De Giacomo,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lembo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lenzerini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Poggi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rodriguez-Muro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Rosati</surname>
          </string-name>
          ,
          <article-title>Ontologies and databases: The DL-Lite approach</article-title>
          , in: S.
          <string-name>
            <surname>Tessaris</surname>
          </string-name>
          , E. Franconi (Eds.),
          <source>Reasoning Web: Semantic Technologies for Informations Systems - 5th Int. Summer School Tutorial Lectures (RW)</source>
          , volume
          <volume>5689</volume>
          of Lecture Notes in Computer Science, Springer,
          <year>2009</year>
          , pp.
          <fpage>255</fpage>
          -
          <lpage>356</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>G.</given-names>
            <surname>Xiao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kontchakov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lembo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Poggi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Rosati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zakharyaschev</surname>
          </string-name>
          ,
          <article-title>Ontology-based data access: A survey</article-title>
          ,
          <source>in: Proc. of the 27th Int. Joint Conf. on Artificial Intelligence (IJCAI)</source>
          ,
          <source>IJCAI Org.</source>
          ,
          <year>2018</year>
          , pp.
          <fpage>5511</fpage>
          -
          <lpage>5519</lpage>
          . doi:
          <volume>10</volume>
          .24963/ijcai.
          <year>2018</year>
          /777.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>G.</given-names>
            <surname>Xiao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Ding</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Cogrel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          ,
          <article-title>Virtual Knowledge Graphs: An overview of systems and use cases</article-title>
          ,
          <source>Data Intelligence</source>
          <volume>1</volume>
          (
          <year>2019</year>
          )
          <fpage>201</fpage>
          -
          <lpage>223</lpage>
          . doi:
          <volume>10</volume>
          .1162/dint_a_
          <fpage>00011</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.</given-names>
            <surname>Krötzsch</surname>
          </string-name>
          ,
          <article-title>OWL 2 profiles: An introduction to lightweight ontology languages, in: Reasoning Web: Semantic Technologies for Advanced Query Answering -</article-title>
          8th
          <source>Int. Summer School Tutorial Lectures (RW)</source>
          , volume
          <volume>7487</volume>
          of Lecture Notes in Computer Science, Springer,
          <year>2012</year>
          , pp.
          <fpage>112</fpage>
          -
          <lpage>183</lpage>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>642</fpage>
          -33158-
          <issue>9</issue>
          _
          <fpage>4</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>F.</given-names>
            <surname>Baader</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>McGuinness</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Nardi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. F.</given-names>
            <surname>Patel-Schneider</surname>
          </string-name>
          (Eds.),
          <source>The Description Logic Handbook: Theory, Implementation and Applications</source>
          , Cambridge University Press,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          , G. De Giacomo,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lembo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lenzerini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Rosati</surname>
          </string-name>
          ,
          <article-title>Tractable reasoning and eficient query answering in description logics: The DL-Lite family</article-title>
          ,
          <source>J. of Automated Reasoning</source>
          <volume>39</volume>
          (
          <year>2007</year>
          )
          <fpage>385</fpage>
          -
          <lpage>429</lpage>
          . doi:
          <volume>10</volume>
          .1007/s10817-007-9078-x.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>H.</given-names>
            <surname>Katsuno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mendelzon</surname>
          </string-name>
          ,
          <article-title>On the diference between updating a knowledge base and revising it</article-title>
          ,
          <source>in: Proc. of the 2nd Int. Conf. on Principles of Knowledge Representation and Reasoning (KR)</source>
          ,
          <year>1991</year>
          , pp.
          <fpage>387</fpage>
          -
          <lpage>394</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>T.</given-names>
            <surname>Eiter</surname>
          </string-name>
          , G. Gottlob,
          <article-title>On the complexity of propositional knowledge base revision, updates and counterfactuals</article-title>
          ,
          <source>Artificial Intelligence</source>
          <volume>57</volume>
          (
          <year>1992</year>
          )
          <fpage>227</fpage>
          -
          <lpage>270</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>F.</given-names>
            <surname>Bancilhon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Spyratos</surname>
          </string-name>
          ,
          <article-title>Update semantics of relational views</article-title>
          ,
          <source>ACM Trans. on Database Systems</source>
          <volume>6</volume>
          (
          <year>1981</year>
          )
          <fpage>557</fpage>
          -
          <lpage>575</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>S. S.</given-names>
            <surname>Cosmadakis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. H.</given-names>
            <surname>Papadimitriou</surname>
          </string-name>
          , Updates of relational views,
          <source>J. of the ACM</source>
          <volume>31</volume>
          (
          <year>1984</year>
          )
          <fpage>742</fpage>
          -
          <lpage>760</lpage>
          . doi:
          <volume>10</volume>
          .1145/1634.
          <year>1887</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>A. C.</given-names>
            <surname>Kakas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mancarella</surname>
          </string-name>
          ,
          <article-title>Database updates through abduction</article-title>
          ,
          <source>in: Proc. of the 16th Int. Conf. on Very Large Data Bases (VLDB)</source>
          , volume
          <volume>90</volume>
          ,
          <year>1990</year>
          , pp.
          <fpage>650</fpage>
          -
          <lpage>661</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Kharlamov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Nutt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Zheleznyakov</surname>
          </string-name>
          ,
          <article-title>Updating ABoxes in DL-Lite</article-title>
          ,
          <source>in: Proc. of the 4th Alberto Mendelzon Int. Workshop on Foundations of Data Management (AMW)</source>
          , volume
          <volume>619</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2010</year>
          , pp.
          <fpage>3</fpage>
          .
          <fpage>1</fpage>
          -
          <lpage>3</lpage>
          .
          <fpage>12</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>G. De Giacomo</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          <string-name>
            <surname>Oriol</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Rosati</surname>
            ,
            <given-names>D. F.</given-names>
          </string-name>
          <string-name>
            <surname>Savo</surname>
          </string-name>
          ,
          <article-title>Instance-level update in DL-Lite ontologies through first-order rewriting</article-title>
          ,
          <source>J. of Artificial Intelligence Research</source>
          <volume>70</volume>
          (
          <year>2021</year>
          )
          <fpage>1335</fpage>
          -
          <lpage>1371</lpage>
          . doi:
          <volume>10</volume>
          .1613/jair.1.12414.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>H.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Lutz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Milicic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Wolter</surname>
          </string-name>
          ,
          <article-title>Updating description logic ABoxes</article-title>
          ,
          <source>in: Proc. of the 10th Int. Conf. on Principles of Knowledge Representation and Reasoning (KR)</source>
          ,
          <year>2006</year>
          , pp.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>