=Paper=
{{Paper
|id=Vol-3263/abstract-14
|storemode=property
|title=A New Dimension to Generalization: Computing Temporal EL Concepts from Positive Examples (Extended Abstract)
|pdfUrl=https://ceur-ws.org/Vol-3263/abstract-14.pdf
|volume=Vol-3263
|authors=Satyadharma Tirtarasa,Anni-Yasmin Turhan
|dblpUrl=https://dblp.org/rec/conf/dlog/TirtarasaT22
}}
==A New Dimension to Generalization: Computing Temporal EL Concepts from Positive Examples (Extended Abstract)==
<pdf width="1500px">https://ceur-ws.org/Vol-3263/abstract-14.pdf</pdf>
<pre>
A New Dimension to Generalization: Computing
Temporal E L Concepts from Positive Examples
Extended Abstract

Satyadharma Tirtarasa1 , Anni-Yasmin Turhan1
1 Technische Universität Dresden, Nöthnitzer Str. 46, 01219, Germany


                                      Abstract
                                      The authoring of complex concepts or (instance) queries written in a description logic (DL) can be
                                      a difficult task. An established approach is to generate such concepts from positive examples is to
                                      employ the most specific concept (msc), which generalizes an ABox individual into a concept and the
                                      least common subsumer (lcs), which generalizes a collection of concepts into a single concept. These
                                      inferences are investigated for the EL, but so far there are no methods for the 2-dimensional case such
                                      as temporalized DLs. We report in this abstract on our computation algorithms for the lcs and the msc
                                      in a temporalized DL: EL extended by the LTL operators next and global. We sketch the computation
                                      algorithms for both inferences—with and without the use of rigid symbols.

                                      Keywords
                                      generalization, temporal reasoning, learning from examples


   Concepts written in a particular DL that occur in description logic knowledge bases or in
queries usually model a notion from the application domain. When DL KBs or complex query
concepts are built, it is necessary to formalize the intended meaning by authoring complex
concepts. These can then be used in TBox axioms or can serve as instance queries. It can be
difficult for users to come up with such a formalization. This task is even more difficult, if there
are temporal aspects to be formalized in the intended concept. Often it is easier for a user to
select example instances of a (temporalized) concept available in the data in the knowledge base.
An approach to automatically generate complex concepts from a set of selected examples is the
bottom-up approach [1] where the resulting concept is computed by applying two generalization
inferences. The first is the most specific concept (MSC), which computes from an ABox individual
a concept (in a given DL) that is the least w.r.t. subsumption. The second inference is the least
common subsumer (LCS), which computes from a set of input concepts a concept that subsumes
all input concepts and is the least concept w.r.t. subsumption. Applying the MSC to each example
selected by the user and then applying the LCS to all of the resulting concepts, yields a concept
description of the selected examples and is a first version of the intended (query) concept.
   The inferences LCS and MSC are typically studied for DLs that do not admit disjunction,
since in the presence of this concept constructor the LCS is simply the disjunction of the

   DL 2022: 35th International Workshop on Description Logics, August 7–10, 2022, Haifa, Israel
" satyadharma.tirtarasa@tu-dresden.de (S. Tirtarasa); anni-yasmin.turhan@tu-dresden.de (A. Turhan)
~ https://tu-dresden.de/ing/informatik/thi/lat/die-professur/beschaeftigte (S. Tirtarasa);
https://lat.inf.tu-dresden.de/~turhan/ (A. Turhan)
 0000-0003-1741-1458 (S. Tirtarasa); 0000-0001-6336-335X (A. Turhan)
                                    © 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
 CEUR
 Workshop
 Proceedings
               http://ceur-ws.org
               ISSN 1613-0073
                                    CEUR Workshop Proceedings (CEUR-WS.org)
input concepts and thus not informative. Both inferences, the LCS and the MSC, have been
investigated for unfoldable TBoxes. If a cyclic or even general TBox is used, the LCS need not
exist, since cyclic concepts cannot be expressed by finite concepts. For cyclic ABoxes the MSC
need also not exist, due to the same reason [2]. Conditions for the existence of the LCS (and the
MSC) in EL w.r.t. general TBoxes have been given in [3] and approximations that limit the role
depth in concepts have been studied in [2, 4, 5].
   Here we report on our paper recently presented at the SAC conference [6], where we provide
a first study on the computation of the LCS and the MSC for temporalized DLs. Temporalized
DLs have been intensively investigated in the last decade mainly motivated by ontology-based
situation recognition, where the situation to be recognized is formalized as a temporal query.
The task of situation recognition then boils down to answer a temporal query over a classical
DL TBox and a sequence of ABoxes. For recent surveys on reasoning in this kind of settings see
[7, 8]. Temporalized DLs are a kind of two dimensional DL, where two logic formalisms are
combined [9, 10]. Two dimensional DLs usually have an object dimension, which corresponds
to classical DLs and a context dimension [11] or even temporal evolution of context-sensitive
domain [12]. Temporal DLs based on LTL have a linear context dimension.
   Example-based learning for two dimensional DLs or temporalized DLs has so far not been
studied in the literature. In case of temporalized DLs automated support for authoring temporal
query concepts (or even temporal conjunctive queries) would be very helpful for situation
recognition applications.
   In our paper, we lift the LCS and MSC to the temporal setting. More precisely, we consider the
logic LTLX,G
           EL , which uses the temporal operators next (X) and global (G) from LTL “on top of”
EL constructors. Our choice of temporal operators is motivated by the fact, that the operators
until (U) and finally (F) imply a disjunction and would, if being used in the generalization
inferences, simply enumerate the variants found in the temporal data. LTLX,G     EL concepts are
defined by the following grammar:

                          C, D ::= A | C ⊓ D | ∃r.C | X C | G C | ⊤

LTLX,G
   EL can express concepts that capture temporal aspects over the domain of interest. For
example, consider LTLX,G
                     EL concept:

     Cex = Person ⊓ ∃isVaccinated.BioNTech ⊓ X(G(Vaccinated)) ⊓ G( RiskGroup),

which describes persons that are vaccinated using BioNTech and that count from the next time
point on as always vaccinated. The last conjunct expresses that these persons are always in the
risk group from the initial time point on.
   The semantics of LTLX,GEL concepts is given using two dimensions: (1) an infinite temporal
sequence of (2) classical DL interpretations. The LTL operators express the evolution of objects
in the temporal dimension, while the EL constructors express relations to other elements within
the classical interpretation (at the same time point).
   We consider in our paper the generalization inferences only for concepts (w.r.t. unfoldable
TBoxes and in case of the MSC also an ABox) as the LCS or MSC need not exist w.r.t. general
TBoxes due to cyclic axioms. This means we can rewrite the ABoxes to incorporate the
information from the TBox and then ignore the latter. Therefore, we can assume w.l.o.g.
                                        v0 : { Person, RiskGroup}
                         isVaccinated                         G
                                                  X
                  v1 : { BioNTech} v2 : {Vaccinated, RiskGroup} v4 : { RiskGroup}

                                                     G
                                    v3 : {Vaccinated, RiskGroup}

Figure 1: The LTLX,G
                 EL description tree that corresponds to normalized form of Cex .


that the input knowledge base consists of only a sequence of ABoxes. As usual for temporal
reasoning, we assume rigid domain and rigid individuals.
   The bottom-up approach in our temporal setting means that the user selects a set of example
individuals from the sequence of ABoxes (possibly from different time points). The MSC is
applied to each of them and the resulting LTLX,G
                                             EL concepts are generalized by the LCS into a
single LTLEL concept. Our computation algorithms for the LCS and the MSC in LTLX,G
           X,G
                                                                                    EL extend
the approach from [1] to the temporal case. For ease of presentation we start with the LCS
inference. The LCS computation algorithm from [1] performs three steps:
   1. transform each input concept into normal form, that makes the interaction among the
      temporal operators explicit
   2. represent each normalized concept as a tree-structure, called description tree,
   3. build cross-product of all description trees and read-off the LCS concept from it.
The normalization of LTLX,G EL concepts handles the interaction of X and G. More precisely,
it treats the temporal operators X and G as functional DL roles and therefore yields a one-
dimensional DL concept. This DL concept can then be transformed into a tree. Furthermore, in
order to obtain trees of equal height, the normalization performs tree padding. More precisely,
it extends the shorter tree by adding X successors and propagates the information from the
sub-tree rooted in the existential restriction for the G role onto the sub-tree (or rather the chain)
rooted in existential restrictions for the X role. Consider again Cex . This concept which is
not normalized, since the effect of G( RiskGroup) is not propagated to the next timepoint, i.e.
into the X-concept. Figure 1 depicts a description tree for Cex after normalization. Notice that
the G information such as RiskGroup and Vaccinated are propagated and made explicit. As it
turns out, the normalization can cause the concept to grow exponentially which is due to the
interaction of the temporal operators.
   Once the description trees are computed, their cross-product is constructed as usual and the
LTLX,G
     EL concept can be read-off from it. To be able to show correctness of the LCS computation
algorithm, we need to give a characterization of subsumption for LTLX,G      EL concepts which is
one of our main results of the paper. We show that a concept is subsumed by another concept
if there exists a homomorphism between their description trees, provided that the trees are
aligned. This alignment is crucial to make sure that the interaction between next and global are
captured. By using this characterization we show the soundness, completeness and termination
of our LCS computation algorithm for LTLX,G   EL .
  To compute the MSC in EL, the procedure from [1] performs the following steps:
   1. construct a description graph representing the relational structure in the ABox
      (incorporating the TBox information),
   2. tree-unravel the graph starting from the input individual to generate the description tree
      and read-off the MSC concept from it.
The extension of this method to the temporal setting needs to construct a description graph
from a sequence of ABoxes. The re-occurrence of an individual in the next ABox of the temporal
sequence is modeled by connecting the individual and its re-occurrence by X reflecting the
temporal information. For cyclic ABoxes, an exact solution need not exist already for EL, but the
construction can be approximated by a bound on the role-depth. In that case, the tree-unraveling
the description graphs (up to the bound) can be done as in the case of EL.
   In order to show the correctness of our MSC algorithm, we develop a characterization of the
instance relationship in LTLX,G EL . In particular, we show that an individual is an instance of a
concept (at certain time point) if there exists a homomorphism between the description tree of
the concept and the graph by mapping the root of the tree to the individual in the graph. With
our graph construction, we can treat the two temporal operators as standard roles in the graph
representation when traversing. Using this characterization, we show that our MSC algorithm is
sound, complete, and terminating, if the MSC exists w.r.t. the given ABox. While the traversing
is linear in the size of the graph, it is possible that the size of the graph is exponentially larger
than the initial one due to normalization.
The algorithms for generalization inferences in LTLX,G  EL that we have described so far handle
non-rigid symbols only. In our paper [6], we also investigate the case of rigid concepts and roles.
While rigid names often cause an exponential blow-up for or even undecidability of reasoning
in two-dimensional DLs and temporal DLs, this is not the case in our lightweight temporal
DL. In fact, it is well-known that rigid concepts can be readily expressed by using G at the
initial time point. Although incorporating rigid roles is more complex, we show that they also
can be simulated. Consider there exists r ( a, b) in the sequence of ABoxes and r is rigid. Then,
we have to introduce G ∃r.C where C is the G conjunct in the MSC of b. Since the process of
normalizing a concept might cause an exponential blow-up already, the computational cost of
simulating rigid concepts and roles is not adding to the overall complexity.


Acknowledgments
This work was partially supported by the AI competence center ScaDS.AI Dresden/Leipzig and
by the DFG through the Collaborative Research Center TRR 2481 and through the Research
Training Group RoSI (GRK 1907).


1 https://perspicuous-computing.science, project ID 389792660
References
 [1] F. Baader, R. Küsters, R. Molitor, Computing least common subsumers in description logics
     with existential restrictions, in: Proc. of the 16th Int. Joint Conf. on Artificial Intelligence
     (IJCAI’99), 1999.
 [2] R. Küsters, R. Molitor, Approximating most specific concepts in description logics with
     existential restrictions, AI Commun. 15 (2002).
 [3] B. Zarrieß, A.-Y. Turhan, Most specific generalizations w.r.t. general E L-TBoxes, in:
     Proceedings of the 23rd International Joint Conference on Artificial Intelligence (IJCAI’13),
     AAAI Press, 2013.
 [4] R. Peñaloza, A.-Y. Turhan, A practical approach for computing generalization inferences
     in E L, in: Proceedings of the 8th European Semantic Web Conference (ESWC’11), LNCS,
     Springer-Verlag, 2011.
 [5] A. Ecke, R. Peñaloza, A.-Y. Turhan, Computing role-depth bounded generalizations in the
     description logic E LOR, in: Proceedings of the 36th German Conference on Artificial
     Intelligence (KI 2013), LNAI, Springer, 2013, pp. 49–60.
 [6] S. Tirtarasa, A.-Y. Turhan, Computing Generalizations of Temporal EL Concepts with Next
     and Global, in: Proceedings of the 37th Annual ACM Symposium on Applied Computing
     (SAC ’22), Association for Computing Machinery, 2022.
 [7] A. Artale, R. Kontchakov, A. Kovtunova, V. Ryzhikov, F. Wolter, M. Zakharyaschev,
     Ontology-mediated query answering over temporal data: A survey, in: 24th Int. Sympos.
     on Temp. Representation and Reasoning, TIME, 2017.
 [8] F. Baader, S. Borgwardt, P. Koopmann, V. Thost, A.-Y. Turhan, Semantic technologies for
     situation awareness, KI – Künstliche Intelligenz (2020).
 [9] F. Baader, S. Ghilardi, C. Lutz, LTL over description logic axioms, ACM Trans. Comput.
     Log. 13 (2012).
[10] K. Schild, Combining terminological logics with tense logic, in: Progress in Artificial
     Intelligence, 6th Portuguese Conference on Artificial Intelligence, EPIA ’93, volume 727 of
     Lecture Notes in Computer Science, Springer, 1993.
[11] S. Böhme, M. Lippmann, Decidable description logics of context with rigid roles, in:
     Frontiers of Combining Systems - 10th International Symposium, FroCoS„ volume 9322 of
     Lecture Notes in Computer Science, Springer, 2015.
[12] S. Tirtarasa, Temporal properties over contextualized description logics, in: Proceedings
     of the 33rd International Workshop on Description Logics (DL 2020), volume 2663 of CEUR
     Workshop Proceedings, CEUR-WS.org, 2020.

</pre>