Multi-Frame Modal Symbolic Learning

Multi-Frame Modal Symbolic Learning GiovanniPagliarini giovanni.pagliarini@unife.it University of Ferrara

Italy

University of Parma

Italy

GuidoSciavicco guido.sciavicco@unife.it University of Ferrara

Italy

IonelEduardStan ioneleduard.stan@unife.it University of Ferrara

Italy

University of Parma

Italy

Multi-Frame Modal Symbolic Learning 1613-0073 A1FB21C74EDD0C401F4FC78BBAEBF557 GROBID - A machine learning software for extracting information from scholarly documents Dimensional Data Modal Logics Modal Symbolic Learning

Symbolic learning is the sub-field of machine learning that deals with symbolic algorithms and models, which have been known for decades and successfully applied to a variety of contexts. The main limitation of symbolic models is the fact that they are essentially based on classical propositional logic, which implies that data with an implicit dimensional component, such as temporal (e.g., time series) or spatial data (e.g., images), cannot be properly dealt with within the standard symbolic framework. Recently, modal symbolic learning models have been proposed as a natural extension of classical ones to naturally deal with dimensional data, and successfully applied to temporal and spatial data. In this paper, we discuss the possibility of further extending such learning models to deal with multi-frame dimensional data, to be able to natively learn from instances represented by more than one dimensional description.

Introduction

The most iconic and fundamental separation between sub-fields of machine learning is the one between functional and symbolic learning. Functional learning is the process of learning a function that represents the theory underlying a certain phenomenon. Symbolic learning, on the other hand, is the process of learning a logical description that represents that phenomenon. Whether one or the other approach should be preferred raised a long-standing debate among experts, rooted in the fact that functional methods tend to be more versatile and statistically accurate than symbolic ones, while symbolic methods are able to extract models that can be interpreted, explained, and enhanced using human knowledge. From a logical standpoint, classical symbolic learning schemata are all characterized by the use of propositional logic (they are, in fact, sometimes called propositional methods), and can be classified along three main directions: the structure of the models (from strongly structured ones, such as decision trees [1,2], to strongly unstructured ones, such as sets of independent rules [3]), the type of logic (crisp versus fuzzy [4,5]), and the type of learning method (from purely deterministic to purely randomized). Dimensional data, such as temporal or spatial data, cannot be dealt with in a native way using propositional methods. Examples of naturally dimensional data include temporal histories of patients, or objects described by spatial images; in all such cases, the dimensional component is usually implicit. The general to-go strategy to treat dimensional data with propositional models such as decision-trees is to flatten the dimensional component, effectively hiding it. This consists in massaging the data set in such a way that new variables are created for each dimensional variable 𝐴, which contain the values of 𝐴 at different times, spatial locations, and so on; so for example, an instance that consists of a single-variate time series 𝐴 with 𝑁 (ordered) points ends up being represented as the (unordered) collection 𝐴(1), 𝐴(2), . . . , 𝐴(𝑁 ). Such a representation is called lagged in the temporal case, and flattened in the spatial one, and it allows one to use off-the-shelf propositional methods for the phase. Recently a new line in symbolic learning has emerged, in which classical symbolic algorithms are enhanced so that dimensional variables can be dealt by leveraging more expressive reasoning capabilities. To this end, propositional logic is replaced with propositional modal logic [6] in the learning schema, allowing one to natively express the relationships that emerge among the different worlds that describe each instance (e.g., time points, time intervals, extended areas, etc.). Modal logic can be declined into more practical languages, such as temporal or spatial logics, without loosing its basic principles, and the definition of modal symbolic learning schema immediately becomes a definition of a temporal, spatial, or spatial-temporal one. Modal decision trees, for example, have been studied in the temporal case in [7], and applied to real data in [8]. Dimensional data, however, is more complex than what can be captured by a single description. The temporal histories of patients, for instance, should be paired with patients' static data; the spatial descriptions of objects, as a different example, may require images from different angles; more in general, the instances of real-world data sets are often very complex, and require learning methods that can deal with such complexity.

In this paper we pave the way to a generalization of modal symbolic methods to the multiframe case, and present some practical cases in which such a complex schema can be useful.

Multi-Frame Modal Symbolic Learning

A multi-frame dimensional data set ℐ is a finite collection of instances 𝐼 1 , . . . 𝐼 𝑚 , each of which is associated with (i.e., described by) 𝑟 Kripke models 𝑀 1 , . . . , 𝑀 𝑟 . In turn, in each model 𝑀 𝑗 = (𝑊 𝑗 , 𝑅 𝑗 1 , . . . , 𝑅 𝑗 𝑠 𝑗 , 𝑉 𝑗 ), each world is characterized by the value of 𝑛 𝑗 distinct attributes 𝐴 𝑗 1 , . . . , 𝐴 𝑗 𝑛 𝑗 ; as in the classical propositional case, the attributes define the propositional alphabet. We say that ℐ is labeled if it is partitioned in a finite number of classes 𝒞 = {𝐶 1 , . . . , 𝐶 𝑘 }. In other words, a single instance is described by more-than-one dimensional information. We assume that the number and the type of descriptions are consistent among instances; so, for example, a multi-frame dimensional data set may contain instances described by three frames each. To a multi-frame dimensional data set with 𝑟 frames we associate 𝑟 (in general, distinct) modal languages. The 𝑗-th language is a unary modal logic with 𝑠 𝑗 existential modalities ♢ 1 , . . . , ♢ 𝑠 𝑗 , and their corresponding universal versions □ 1 , . . . , □ 𝑠 𝑗 . Modalities are interpreted by the relations 𝑅 𝑗 1 , . . . , 𝑅 𝑗 𝑠 𝑗 , so that:

𝑀 𝑗 , 𝑤 ⊩ 𝑝 iff 𝑝 ∈ 𝑉 𝑗 (𝑤); 𝑀 𝑗 , 𝑤 ⊩ ¬𝜙 iff 𝑀 𝑗 , 𝑤 ̸ ⊩ 𝜙; 𝑀 𝑗 , 𝑤 ⊩ 𝜙 ∨ 𝜓 iff 𝑀 𝑗 , 𝑤 ⊩ 𝜙 or 𝑀 𝑗 , 𝑤 ⊩ 𝜓; 𝑀 𝑗 , 𝑤 ⊩ ♢ 𝑗 𝑡 𝜙 iff ∃𝑣 s.t. 𝑤𝑅 𝑗 𝑡 𝑣 and 𝑀 𝑗 , 𝑣 ⊩ 𝜙,

where ♢ 𝑗 𝑡 is the 𝑡-th modality of the 𝑗-th language. Multi-frame modal symbolic learning consists of enhancing modal symbolic learning methods with the possibility of learning from more-than-one dimensional description at the same time, and describing the learned knowledge using the correct logic. Multi-frame dimensional data sets capture dimensional situations quite naturally; but just as it happens in modal symbolic learning, in the multi-frame case too we need to concretize the learning models and the associated languages to specific modal logics to adapt them to the real-world cases. Dimensional data is generally represented in an implicit form. This means that, for example, time series and images are usually linearized, and expressed as sets of numbers; different, but equivalent, representations can be used to capture dimensional situations (by rows and by columns are just two very popular examples). A key observation is that the information in real data, in general, is not point-based (think, for example, to time series or images: the values of single time points or single pixels are not really informative). One around this problem is to employ modal logics in which worlds do not correspond to single points, but to sets of points. Halpern and Shoham's interval temporal logic (𝐻𝑆) [9] allows one to express properties of intervals in the temporal case. By generalizing 𝐻𝑆 to any number of dimensions, we obtain a family of logics that we can denote by 𝐻𝑆 𝑑 , where 𝑑 ∈ N; 𝐻𝑆 0 is just propositional logic, 𝐻𝑆 1 is the original Halpern and Shoham logic of intervals, and 𝐻𝑆 2 is the natural logical generalization of Rectangle Algebra [10]. In 𝐻𝑆 𝑑 , worlds are hyperrectagles with edges parallel to the axis, and connected by the 𝑑-dimensional generalizations of Allen's interval relations. In this way, a set of propositional letters emerges naturally. Fixed a frame, a world, and an attribute 𝐴, these are of the type:

𝑓 (𝐴) ◁▷ ∼𝛾 𝑎

where 𝑎 is a value of the domain of 𝐴, 𝑓 is a function, 𝛾 ∈ (0, 1] ⊂ R, and ◁▷ ∈ {<, ≤, =, ≥, >}, and ∼ ∈ {<, ≤, ≥, >}. Propositional letters of this type are interpreted over hyperrectangles; for example, if 𝑑 = 1 (that is, data is one-dimensional, e.g., temporal), 𝑓 is the identity function, ◁▷ is <, ∼ is >, 𝛾 is 0.8, 𝐴 is the temperature, and 𝑎 = 37.5, then 𝐴 < >0.8 𝑎 represents the proposition more than the 80% of the values of the temperature during the current interval are below 37.5. By varying the function 𝑓 , one can produce more complex assertions on hyperrectangles.

For a better understanding of how multi-frame modal symbolic learning can be useful in practical case, consider, again, the medical example, as in Fig. 1 (top). In this situation, we have a three-frame data set. The first frame is static, and it is associated to propositional logic (𝐻𝑆 0 ); the second frame is one-dimensional, and, specifically, temporal: we associate it with 𝐻𝑆 1 , so that the information concerning the temporal attributes is learned by intervals; the third frame is two-dimensional, that is, spatial: by associating it to 𝐻𝑆 2 , we can learn patterns of rectangles and colors in them. Instances are labeled, making this data set suitable for knowledge extraction by classification. One can observe how data are represented in concrete form (Fig. 1, bottom left), and how we can extract knowledge in symbolic form from them (Fig. 1, bottom, right), with modal decision trees or sets of modal rules in 𝐻𝑆 𝑑 .

Conclusions

In this paper we paved the way towards multi-frame modal symbolic learning. We defined multi-frame dimensional data sets, the learning framework, and described how to concretize it to deal with implicit dimensional data in a intuitive way.

c o n c r e t e r e p r e s e n t a t i⟨ 8 ⟨⟨𝐵⟩8𝐴 ⟩ 𝑇 < 3 8 [ 𝐴 ] 𝑇 ≥ 3 𝐵 , 𝐸 ⟩ 𝑅 ≤ 2 0 0 [ 𝐵 , 𝐸 ] 𝑅 > 2 𝑃 > 110 ∧ [𝐴,𝑂] 𝐺 < 200 ⇒ . . . ⟨𝐴, 𝐴⟩ 𝐵 < 215 ∧ ⟨𝐿⟩ 𝑇 < 39 ⇒ . . . . . . decisio n on a tempor al interva l deci sion on a spat ial rect ang le

Figure 1 :1Figure 1: Example of multi-frame dimensional data set.

Acknowledgments

We thank the INdAM GNCS 2020 project Strategic Reasoning and Automated Synthesis of Multi-Agent Systems for partial support.

Induction of decision trees JRQuinlan Machine Learning 1 1986 Simplifying decision trees JRQuinlan International Journal of Human-Computer Studies 51 1999 Rule-based Classification AK HTung 2009 Springer Similarity relations, fuzzy linear orders, and fuzzy partial orders SKundu Fuzzy Sets and Systems 109 2000 Similarity relations, fuzzy partitions, and fuzzy orderings SOvchinnikov Fuzzy Sets and Systems 40 1991 Modal Logic PM DRijke YVenema 2001 Cambridge University Press Knowledge Extraction with Interval Temporal Logic Decision Trees GSciavicco IEStan Proc. of the 27th International Symposium on Temporal Representation and Reasoning of the 27th International Symposium on Temporal Representation and Reasoning LIPIcs 2020 178 16 Interval Temporal Random Forests with an Application to COVID-19 Diagnosis FManzella GPagliarini GSciavicco IEStan Proc. of the 28th International Symposium on Temporal Representation and Reasoning of the 28th International Symposium on Temporal Representation and Reasoning 2021 206 17 A propositional modal logic of time intervals JYHalpern YShoham Journal of the ACM 38 1991 Fariñas del Cerro, A model for reasoning about bidimensional temporal relations PBalbiani JCondotta L Proc. of the 6th International Conference on Principles of Knowledge Representation and Reasoning of the 6th International Conference on Principles of Knowledge Representation and Reasoning Morgan Kaufmann 1998