<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Multi-Frame Modal Symbolic Learning</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Giovanni</forename><surname>Pagliarini</surname></persName>
							<email>giovanni.pagliarini@unife.it</email>
							<affiliation key="aff0">
								<orgName type="institution">University of Ferrara</orgName>
								<address>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="institution">University of Parma</orgName>
								<address>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Guido</forename><surname>Sciavicco</surname></persName>
							<email>guido.sciavicco@unife.it</email>
							<affiliation key="aff0">
								<orgName type="institution">University of Ferrara</orgName>
								<address>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Ionel</forename><forename type="middle">Eduard</forename><surname>Stan</surname></persName>
							<email>ioneleduard.stan@unife.it</email>
							<affiliation key="aff0">
								<orgName type="institution">University of Ferrara</orgName>
								<address>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="institution">University of Parma</orgName>
								<address>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Multi-Frame Modal Symbolic Learning</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">A1FB21C74EDD0C401F4FC78BBAEBF557</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T06:56+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Dimensional Data</term>
					<term>Modal Logics</term>
					<term>Modal Symbolic Learning</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Symbolic learning is the sub-field of machine learning that deals with symbolic algorithms and models, which have been known for decades and successfully applied in a variety of contexts. The main limitation of symbolic models is that they are essentially based on classical propositional logic, which implies that data with an implicit dimensional component, such as temporal data (e.g., time series) or spatial data (e.g., images), cannot be properly dealt with within the standard symbolic framework. Recently, modal symbolic learning models have been proposed as a natural extension of classical ones that deals natively with dimensional data, and they have been successfully applied to temporal and spatial data. In this paper, we discuss the possibility of further extending such learning models to deal with multi-frame dimensional data, so as to learn natively from instances represented by more than one dimensional description.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>The most iconic and fundamental separation between sub-fields of machine learning is the one between functional and symbolic learning. Functional learning is the process of learning a function that represents the theory underlying a certain phenomenon. Symbolic learning, on the other hand, is the process of learning a logical description that represents that phenomenon. Which of the two approaches should be preferred is the subject of a long-standing debate among experts, rooted in the fact that functional methods tend to be more versatile and statistically accurate than symbolic ones, while symbolic methods are able to extract models that can be interpreted, explained, and enhanced using human knowledge. From a logical standpoint, classical symbolic learning schemata are all characterized by the use of propositional logic (they are, in fact, sometimes called propositional methods), and can be classified along three main directions: the structure of the models (from strongly structured ones, such as decision trees <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b1">2]</ref>, to strongly unstructured ones, such as sets of independent rules <ref type="bibr" target="#b2">[3]</ref>), the type of logic (crisp versus fuzzy <ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b4">5]</ref>), and the type of learning method (from purely deterministic to purely randomized). Dimensional data, such as temporal or spatial data, cannot be dealt with natively using propositional methods. Examples of naturally dimensional data include temporal histories of patients, or objects described by spatial images; in all such cases, the dimensional component is usually implicit. The usual go-to strategy for treating dimensional data with propositional models such as decision trees is to flatten the dimensional component, effectively hiding it.
This consists in transforming the data set in such a way that new variables are created for each dimensional variable $A$, which contain the values of $A$ at different times, spatial locations, and so on; so, for example, an instance that consists of a univariate time series $A$ with $N$ (ordered) points ends up being represented as the (unordered) collection $A(1), A(2), \ldots, A(N)$. Such a representation is called lagged in the temporal case, and flattened in the spatial one, and it allows one to use off-the-shelf propositional methods for the learning phase. Recently, a new line of research in symbolic learning has emerged, in which classical symbolic algorithms are enhanced so that dimensional variables can be dealt with by leveraging more expressive reasoning capabilities. To this end, propositional logic is replaced with propositional modal logic <ref type="bibr" target="#b5">[6]</ref> in the learning schema, allowing one to natively express the relationships that emerge among the different worlds that describe each instance (e.g., time points, time intervals, extended areas, etc.). Modal logic can be specialized into more practical languages, such as temporal or spatial logics, without losing its basic principles, so that the definition of a modal symbolic learning schema immediately yields a definition of a temporal, spatial, or spatio-temporal one. Modal decision trees, for example, have been studied in the temporal case in <ref type="bibr" target="#b6">[7]</ref>, and applied to real data in <ref type="bibr" target="#b7">[8]</ref>. Dimensional data, however, is more complex than what can be captured by a single description.
The temporal histories of patients, for instance, should be paired with the patients' static data; the spatial descriptions of objects, as a different example, may require images from different angles; more generally, the instances of real-world data sets are often very complex, and require learning methods that can deal with such complexity.</p><p>In this paper we pave the way to a generalization of modal symbolic methods to the multi-frame case, and present some practical cases in which such a complex schema can be useful.</p></div>
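<div xmlns="http://www.tei-c.org/ns/1.0"><p>The lagged representation described above can be illustrated with a minimal sketch. The function name <code>lag_series</code> and the sample values are hypothetical and serve only as an illustration; the sketch assumes a univariate series stored as a Python list.</p>
<p><code lang="python">
```python
from typing import Dict, List


def lag_series(name: str, values: List[float]) -> Dict[str, float]:
    """Flatten one univariate time series into N scalar variables A(1)..A(N).

    Hypothetical helper: each time point becomes an independent column,
    so a propositional learner no longer sees the temporal order.
    """
    return {f"{name}({i})": v for i, v in enumerate(values, start=1)}


# Illustrative values only (not taken from any real data set).
temperature = [36.8, 37.2, 38.1, 37.9]
row = lag_series("A", temperature)
print(row)
```
</code></p>
<p>After this step each instance is a plain row of scalar attributes, which is what allows off-the-shelf propositional methods to be used; the price is that relationships between time points are no longer expressible in the learned model.</p></div>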
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Multi-Frame Modal Symbolic Learning</head><p>A multi-frame dimensional data set $\mathcal{I}$ is a finite collection of instances $I_1, \ldots, I_m$, each of which is associated with (i.e., described by) $r$ Kripke models $M_1, \ldots, M_r$. In turn, in each model $M_j = (W_j, R^j_1, \ldots, R^j_{s_j}, V_j)$, each world is characterized by the value of $n_j$ distinct attributes $A^j_1, \ldots, A^j_{n_j}$; as in the classical propositional case, the attributes define the propositional alphabet. We say that $\mathcal{I}$ is labeled if it is partitioned into a finite number of classes $\mathcal{C} = \{C_1, \ldots, C_k\}$. In other words, a single instance is described by more than one piece of dimensional information. We assume that the number and the type of descriptions are consistent among instances; so, for example, a multi-frame dimensional data set may contain instances described by three frames each. To a multi-frame dimensional data set with $r$ frames we associate $r$ (in general, distinct) modal languages. The $j$-th language is a unary modal logic with $s_j$ existential modalities $\Diamond_1, \ldots, \Diamond_{s_j}$, and their corresponding universal versions $\Box_1, \ldots, \Box_{s_j}$. Modalities are interpreted by the relations $R^j_1, \ldots, R^j_{s_j}$, so that:</p><formula xml:id="formula_0">$$\begin{array}{l} M_j, w \Vdash p \text{ iff } p \in V_j(w); \\ M_j, w \Vdash \neg\phi \text{ iff } M_j, w \nVdash \phi; \\ M_j, w \Vdash \phi \vee \psi \text{ iff } M_j, w \Vdash \phi \text{ or } M_j, w \Vdash \psi; \\ M_j, w \Vdash \Diamond^j_t \phi \text{ iff } \exists v \text{ s.t. } w R^j_t v \text{ and } M_j, v \Vdash \phi, \end{array}$$</formula><p>where $\Diamond^j_t$ is the $t$-th modality of the $j$-th language. Multi-frame modal symbolic learning consists of enhancing modal symbolic learning methods with the possibility of learning from more than one dimensional description at the same time, and describing the learned knowledge using the correct logic.
Multi-frame dimensional data sets capture dimensional situations quite naturally; but, just as in modal symbolic learning, in the multi-frame case we need to instantiate the learning models and the associated languages with specific modal logics in order to adapt them to real-world cases. Dimensional data is generally represented in an implicit form. This means that, for example, time series and images are usually linearized, and expressed as sets of numbers; different, but equivalent, representations can be used to capture dimensional situations (by rows and by columns are just two very popular examples). A key observation is that the information in real data, in general, is not point-based (think, for example, of time series or images: the values of single time points or single pixels are not really informative). One way around this problem is to employ modal logics in which worlds correspond not to single points, but to sets of points. Halpern and Shoham's interval temporal logic ($\mathit{HS}$) <ref type="bibr" target="#b8">[9]</ref> allows one to express properties of intervals in the temporal case. By generalizing $\mathit{HS}$ to any number of dimensions, we obtain a family of logics that we can denote by $\mathit{HS}^d$, where $d \in \mathbb{N}$; $\mathit{HS}^0$ is just propositional logic, $\mathit{HS}^1$ is the original Halpern and Shoham logic of intervals, and $\mathit{HS}^2$ is the natural logical generalization of Rectangle Algebra <ref type="bibr" target="#b9">[10]</ref>. In $\mathit{HS}^d$, worlds are hyperrectangles with edges parallel to the axes, connected by the $d$-dimensional generalizations of Allen's interval relations. In this way, a set of propositional letters emerges naturally. Having fixed a frame, a world, and an attribute $A$, these are of the type:</p><formula xml:id="formula_1">$$f(A) \bowtie_{\sim\gamma} a$$</formula><p>where $a$ is a value in the domain of $A$, $f$ is a function, $\gamma \in (0, 1] \subset \mathbb{R}$, $\bowtie \in \{&lt;, \leq, =, \geq, &gt;\}$, and $\sim \in \{&lt;, \leq, \geq, &gt;\}$.
Propositional letters of this type are interpreted over hyperrectangles; for example, if $d = 1$ (that is, data is one-dimensional, e.g., temporal), $f$ is the identity function, $\bowtie$ is $&lt;$, $\sim$ is $&gt;$, $\gamma$ is $0.8$, $A$ is the temperature, and $a = 37.5$, then $A &lt;_{&gt;0.8} a$ represents the proposition: more than 80% of the values of the temperature during the current interval are below 37.5. By varying the function $f$, one can produce more complex assertions on hyperrectangles.</p><p>For a better understanding of how multi-frame modal symbolic learning can be useful in practical cases, consider, again, the medical example, as in Fig. <ref type="figure" target="#fig_2">1</ref> (top). In this situation, we have a three-frame data set. The first frame is static, and it is associated with propositional logic ($\mathit{HS}^0$); the second frame is one-dimensional, and, specifically, temporal: we associate it with $\mathit{HS}^1$, so that the information concerning the temporal attributes is learned by intervals; the third frame is two-dimensional, that is, spatial: by associating it with $\mathit{HS}^2$, we can learn patterns of rectangles and colors in them. Instances are labeled, making this data set suitable for knowledge extraction by classification. One can observe how data are represented in concrete form (Fig. <ref type="figure" target="#fig_2">1</ref>, bottom left), and how we can extract knowledge in symbolic form from them (Fig. <ref type="figure" target="#fig_2">1</ref>, bottom right), with modal decision trees or sets of modal rules in $\mathit{HS}^d$.</p></div>
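<div xmlns="http://www.tei-c.org/ns/1.0"><p>The one-dimensional case discussed above can be made concrete with a small sketch. It evaluates a propositional letter of the form $f(A) \bowtie_{\sim\gamma} a$ on an interval of a time series, and an existential modality for Allen's relation "later". Several points are assumptions made for illustration and are not fixed by the text: intervals are pairs of inclusive point indices, $f$ is applied pointwise, comparison symbols are passed by name, and all identifiers (<code>holds</code>, <code>later_intervals</code>, <code>diamond_later</code>) and sample values are hypothetical.</p>
<p><code lang="python">
```python
import operator
from typing import Callable, List, Sequence, Tuple

Interval = Tuple[int, int]  # (x, y): inclusive point indices, y greater than x

# Comparison symbols are passed by name (a convention assumed for this sketch).
COMPARE = {
    "lt": operator.lt, "le": operator.le, "eq": operator.eq,
    "ge": operator.ge, "gt": operator.gt,
}


def holds(values: Sequence[float], interval: Interval, test: str, a: float,
          quantifier: str, gamma: float,
          f: Callable[[float], float] = lambda v: v) -> bool:
    """Truth of the letter on one interval: the fraction of points v with
    f(v) `test` a must stand in relation `quantifier` to gamma."""
    x, y = interval
    window = [f(v) for v in values[x:y + 1]]
    fraction = sum(COMPARE[test](v, a) for v in window) / len(window)
    return COMPARE[quantifier](fraction, gamma)


def later_intervals(n: int, interval: Interval) -> List[Interval]:
    """Intervals that begin strictly after the current one ends."""
    _, y = interval
    return [(z, t) for z in range(y + 1, n) for t in range(z + 1, n)]


def diamond_later(values: Sequence[float], interval: Interval, phi) -> bool:
    """Existential modality: some later interval satisfies phi."""
    return any(phi(values, v) for v in later_intervals(len(values), interval))


# Illustrative values only.
temperature = [36.9, 37.0, 37.1, 38.2, 38.4, 38.6]
mostly_below = lambda vals, iv: holds(vals, iv, "lt", 37.5, "gt", 0.8)
mostly_fever = lambda vals, iv: holds(vals, iv, "ge", 38.0, "gt", 0.8)

print(mostly_below(temperature, (0, 2)))                 # True
print(mostly_below(temperature, (1, 4)))                 # False
print(diamond_later(temperature, (0, 2), mostly_fever))  # True
```
</code></p>
<p>Here <code>mostly_below</code> corresponds to the example proposition "more than 80% of the values of the temperature during the current interval are below 37.5". The universal modality can be obtained in the same way by replacing <code>any</code> with <code>all</code>.</p></div>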
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Conclusions</head><p>In this paper we paved the way towards multi-frame modal symbolic learning. We defined multi-frame dimensional data sets and the associated learning framework, and described how to instantiate it to deal with implicit dimensional data in an intuitive way.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 1 (panel text)</head><figDesc>Text recovered from the panels of Figure 1. Panel titles: concrete representation; decision on a temporal interval; decision on a spatial rectangle. Example rules: $P &gt; 110 \wedge [A,O]\, G &lt; 200 \Rightarrow \ldots$ and $\langle A, A \rangle\, B &lt; 215 \wedge \langle L \rangle\, T &lt; 39 \Rightarrow \ldots$</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 1:</head><label>1</label><figDesc>Figure 1: Example of a multi-frame dimensional data set.</figDesc></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>We thank the INdAM GNCS 2020 project Strategic Reasoning and Automated Synthesis of Multi-Agent Systems for partial support.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Induction of decision trees</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">R</forename><surname>Quinlan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Machine Learning</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="81" to="106" />
			<date type="published" when="1986">1986</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Simplifying decision trees</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">R</forename><surname>Quinlan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International Journal of Human-Computer Studies</title>
		<imprint>
			<biblScope unit="volume">51</biblScope>
			<biblScope unit="page" from="497" to="510" />
			<date type="published" when="1999">1999</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<title level="m" type="main">Rule-based Classification</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">K H</forename><surname>Tung</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2009">2009</date>
			<publisher>Springer</publisher>
			<biblScope unit="page" from="2459" to="2462" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Similarity relations, fuzzy linear orders, and fuzzy partial orders</title>
		<author>
			<persName><forename type="first">S</forename><surname>Kundu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Fuzzy Sets and Systems</title>
		<imprint>
			<biblScope unit="volume">109</biblScope>
			<biblScope unit="page" from="419" to="428" />
			<date type="published" when="2000">2000</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Similarity relations, fuzzy partitions, and fuzzy orderings</title>
		<author>
			<persName><forename type="first">S</forename><surname>Ovchinnikov</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Fuzzy Sets and Systems</title>
		<imprint>
			<biblScope unit="volume">40</biblScope>
			<biblScope unit="page" from="107" to="126" />
			<date type="published" when="1991">1991</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<title level="m" type="main">Modal Logic</title>
		<author>
			<persName><forename type="first">P</forename><surname>Blackburn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>de Rijke</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Venema</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2001">2001</date>
			<publisher>Cambridge University Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Knowledge Extraction with Interval Temporal Logic Decision Trees</title>
		<author>
			<persName><forename type="first">G</forename><surname>Sciavicco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">E</forename><surname>Stan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of the 27th International Symposium on Temporal Representation and Reasoning</title>
				<meeting>27th International Symposium on Temporal Representation and Reasoning</meeting>
		<imprint>
			<publisher>LIPIcs</publisher>
			<date type="published" when="2020">2020</date>
			<biblScope unit="volume">178</biblScope>
			<biblScope unit="page">16</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Interval Temporal Random Forests with an Application to COVID-19 Diagnosis</title>
		<author>
			<persName><forename type="first">F</forename><surname>Manzella</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Pagliarini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Sciavicco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">E</forename><surname>Stan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of the 28th International Symposium on Temporal Representation and Reasoning</title>
				<meeting>28th International Symposium on Temporal Representation and Reasoning</meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="volume">206</biblScope>
			<biblScope unit="page">17</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">A propositional modal logic of time intervals</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">Y</forename><surname>Halpern</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Shoham</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of the ACM</title>
		<imprint>
			<biblScope unit="volume">38</biblScope>
			<biblScope unit="page" from="935" to="962" />
			<date type="published" when="1991">1991</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">A model for reasoning about bidimensional temporal relations</title>
		<author>
			<persName><forename type="first">P</forename><surname>Balbiani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Condotta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Fariñas del Cerro</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of the 6th International Conference on Principles of Knowledge Representation and Reasoning</title>
				<meeting>6th International Conference on Principles of Knowledge Representation and Reasoning</meeting>
		<imprint>
			<publisher>Morgan Kaufmann</publisher>
			<date type="published" when="1998">1998</date>
			<biblScope unit="page" from="124" to="130" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
