Mathematical Patterns and Cognitive Architectures Agnese Augello1 , Salvatore Gaglio1,2 , Gianluigi Oliveri2,3 , and Giovanni Pilato1 1 ICAR - Italian National Research Council Viale delle Scienze - Edificio 11 - 90128 Palermo, Italy 2 DICGIM- Università di Palermo Viale delle Scienze, Edificio 6 - 90128, Palermo - ITALY 3 Dipartimento di Scienze Umanistiche - Università di Palermo Viale delle Scienze, Edificio 12 - 90128, Palermo - ITALY {agnese.augello,giovanni.pilato}@cnr.it {salvatore.gaglio,gianluigi.oliveri}@unipa.it Abstract. Mathematical patterns are an important subclass of the class of patterns. The main task of this paper is examining a particular pro- posal concerning the nature of mathematical patterns and some elements of the cognitive structure an agent should have to recognize them. 1 Introduction As is well known, the main aim of pattern recognition is to determine whether, and to what extent, what we call ‘pattern recognition’ can be accounted for in terms of automatic processes. From this it follows that two of its central prob- lems are how to: (i) describe and explain the way humans, and other biological systems, produce/discover and characterize patterns; and how to (ii) develop automatic systems capable of performing pattern recognition behaviour. Having stated these important facts, we need to point out that at the foun- dations of pattern recognition there are two more basic questions which we can formulate in the following way: (a) what is a pattern? (b) how do we come to know patterns? And it is clear that, if we intend to develop a science of pattern recognition able to provide a rigorous way of achieving its main aim, and of pursuing its central objects of study, it is very important to address questions (a) and (b). What we intend to do in this paper is tackling questions (a) and (b) not in their full generality, but in the privileged context provided by mathematics, where there exists a consolidated tradition which regards it as a science of pat- terns,4 connecting the results of our enquiries to the appropriate levels of the cognitive architecture we propose for a cognitive agent. 4 See on this [Oliveri, 1997], [Shapiro, 2000], [Resnik, 2001], [Oliveri, 2007], [Oliveri, 2012], [Bombieri, 2013]. Page 134 of 171 2 A case study If we are presented with the two following objects a and b, it is very difficult to see what interesting mathematical feature they might have in common, if any, let alone that they exemplify the same mathematical pattern: Fig. 1. Di↵erent Information Processing for two di↵erent Cognitive Agents Indeed, whereas object a is a 3 ⇥ 7 matrix whose elements are the first seven letters of the Italian alphabet, object b is an equilateral triangle in which we have inscribed a circle, drawn three bisecting segments, and singled out the points of intersection of three curves. However, the situation radically changes if we introduce the following formal system T with the appropriate interpretations. Let T be a formal system such that the language of T contains a primitive binary relation ‘x belongs to a set X’ (x 2 X), and its inverse ‘X contains an element x’ (X 3 x). Furthermore, let us assume that D, the domain, is a set of countably many undefined elements a1 , a2 , . . .; call ‘m-set’ a subset X of D; and consider the following as the axioms of T: Axiom 1 If x and y are distinct elements of D there is at least one m-set containing x and y; Axiom 2 If x and y are distinct elements of D there is not more than one m-set containing x and y; Axiom 3 Any two m-sets have at least one element of D in common; Axiom 4 There exists at least one m-set. Axiom 5 Every m-set contains at least three elements of D; Axiom 6 All the elements of D do not belong to the same m-set; Axiom 7 No m-set contains more than three elements of D.5 5 The case study discussed in this section has been taken from [Oliveri, 2012], §3, pp. 410-414. These axioms have been taken, with some minor alterations, from [Tuller, 1967], §2.10, p. 30. Page 135 of 171 At this point, if we put I1 (a1 ) = A, . . . , I1 (a7 ) = G, we find that, under this interpretation, what corresponds to the m-sets are the columns of the matrix in fig. 1, and that object a is a model of T. On the other hand, if we put I2 (a1 ) = P1 , . . . , I2 (a7 ) = P7 , we find that, under this interpretation, what corresponds to the m-sets are the curves in fig. 1, and that object b is a model of T. But the surprises do not end up here, because we can now prove that the two models of T mentioned above are isomorphic to one another (see on this [Oliveri, 2012], §3, p. 413, footnote 12). Several are the things that interest us in this example. First of all, the expres- sion ‘the pattern described by T’ appears to refer to the mathematical structure which is realized/instantiated in objects a and b. What this seems to suggest is that, in the mathematical case, the concept of pattern coincides with that of mathematical structure. Secondly, in the absence of our formal system T, we cannot see the pat- tern/structure instantiated by a and b because we are in no position for making the relevant observations concerning the salient features of the pattern/structure in question as is shown by the fact that, in particular, we are unable to make a number of fundamental distinctions such as that between part and whole, etc. etc. Thirdly, the mathematical structure which becomes salient when we observe objects a and b through T depends not only on T, but also on a and b. In fact, given that we can prove in T that there exist exactly seven elements in D and seven m-sets if, for instance, the number of letters of the Italian alphabet we considered as elements of our matrix were di↵erent from seven, the matrix could not be a model of T (the same applies mutatis mutandis to the number of points of intersection of three curves in b). Taking stock of some of the main points made in this section in our study of the mathematical case, we need to say that: (i) we must distinguish between object and structure; (ii) there are strong reasons for identifying mathematical patterns with structures; (iii) necessary conditions for pattern recognition in mathematics are the existence of (1) an observer O; (2) a domain of objects D; and (3) a system of representation ⌃, i.e. (O, D, ⌃).6 With regard to the problem of how we come to know mathematical patterns, given that mathematical patterns are neither sensible objects nor properties of sensible objects, e.g., what in our example we saw as a Euclidean equilat- eral triangle is not a perfect Euclidean equilateral triangle, because its sides do not have exactly the same length, do not contain an infinite number of points, are not breadthless, etc. (see on this [Oliveri, 2012], §§3 and 4, pp. 410-417), it follows that they are not given to us as a consequence of abstraction or induc- tion/generalization carried out on pure observations. But, on the other hand, if mathematical patterns are (also) dependent on objects, as in the case of a and b, they cannot simply be in the eyes of the beholder either. They are given to 6 Actually, the system of representation ⌃ is an ordered pair ⌃ = (T, I), where T is a set containing (as a subset) a recursive set of axioms A and all the logical consequences of A, and I is an interpretation of T on to D. Page 136 of 171 us as a consequence of our activity of representing entities like a and b within a given system of representation ⌃. 3 Patterns and conceptual spaces Conceptual spaces (CS) were originally introduced by Gärdenfors as a bridge between symbolic and associationist models of information representation. This was part of an attempt to describe what he calls the ‘geometry of thought’. In [Gärdenfors, 2004] and [Gärdenfors, 2004a] we find a description of a cog- nitive architecture for modelling representations. The cognitive architecture is composed by three levels of representation: a subconceptual level, in which data coming from the environment (sensory input) are processed by means of a neu- ral network based system; a conceptual level, where data are represented and conceptualized independently of language; and, finally, a symbolic level which makes it possible to manage the information produced at the conceptual level at a higher level through symbolic computations. Gärdenfors’ proposal of a way of representing information via his concep- tual spaces exploits geometrical structures rather than symbols or connections between neurons. This geometrical representation is based on the existence/con- struction of a space endowed with a number of what Gärdenfors calls ‘quality dimensions’ whose main function is to represent di↵erent qualities of objects such as brightness, temperature, height, width, depth. Moreover, for Gärdenfors, judgments of similarity play a crucial role in cog- nitive processes and, according to him, it is possible to associate the concept of distance to many kinds of quality dimensions. This idea naturally leads to the conjecture that the smaller is the distance between the representations of two given objects in a conceptual space the more similar to each other the objects represented are. According to Gärdenfors, objects can be represented as points in a conceptual space, points which we are going to call ‘knoxels’,7 and concepts as regions within a conceptual space. These regions may have various shapes, although to some concepts — those which refer to natural kinds or natural properties — correspond regions which are characterized by convexity.8 Of course, at this point a whole host of important questions come to the forefront, questions like how could a cognitive agent: (1) learn the appropriate conceptual spaces? (2) select between di↵erent spaces that could fill the data? (3) determine the possible dimensions for representing objects? etc. etc. And although all such questions are central to our attempt to use Gärdenfors concep- tual spaces as part of the cognitive architecture of a conceptual agent — we have addressed some of them in [Augello et al., 2013a] and [Augello et al., 2013b] — 7 The term ‘knoxel’ originates from [Gaglio, 1988] by the analogy with “pixel”. A knoxel k is a point in Conceptual Space and it represents the epistemologically primitive element at the considered level of analysis. 8 A set S is convex if and only if whenever a, b 2 S and c is between a and b then c 2 S. Page 137 of 171 what we aim to do in this paper is: (↵) showing the existence of at least three di↵erent pattern recognition procedures; and ( ) individuating which of the 3 corresponding levels of the cognitive architecture of our cognitive agent is in- volved in the processing of mathematical patterns. To do this consider the case study discussed in §2 (taken from [Oliveri, 2012], §3, pp. 410-414), and imagine we have before us a cognitive agent A endowed with level 1 information processing system. In this case A (its neural network) can be trained to recognize letters A, B, . . . , G and distinguish them from one another; and do the same thing for the coloured round objects P1 , P2 , . . . , P7 . Furthermore, suppose that the letters and the coloured round objects are presented to A exactly as they are in fig. 1. Once more A, exploiting its level 2 information processing system, i.e. the conceptual spaces of letters and of colours, is able to give a correct representation of a and b, for example, by representing a and b in an appropriate finite-dimensional vector space using rigid motions and some operations which act on the spaces. However, what A cannot do, if the formal system T (see §2) is absent from its symbolic level 3 information processing system, is recognizing that a and b exemplify/realize the same pattern. Therefore, if what we have argued so far is correct, it follows that in the dawning of a mathematical pattern all the three levels of information processing systems we mentioned above are involved 4 Conclusion In this work we have revisited a three levels cognitive architecture as a foun- dational approach to pattern recognition for an agent. We have illustrated this possibility by exploiting a mathematical domain. We have also highlighted the relevance of a linguistic, symbolic, level in order to produce abstractions and see deeper mathematical patterns. Acknowledgements This work has been partially supported by the PON01 01687 - SINTESYS (Se- curity and INTElligence SYSstem) Research Project. References [Augello et al., 2013a] Augello, A., Gaglio, S., Oliveri, G., Pilato, G., ‘An Algebra for the Manipulation of Conceptual Spaces in Cognitive Agents’. Biologically Inspired Cognitive Architectures, 6, 23-29, 2013. [Augello et al., 2013b] Augello, A., Gaglio, S., Oliveri, G., Pilato, G., ‘Acting on Con- ceptual Spaces in Cognitive Agents’. in [Lieto et al., 2013], pp. 25-32, 2013. [Bombieri, 2013] Bombieri, E.: 2013, ‘The shifting aspects of truth in mathematics’, Euresis, vol. 5, pp. 249–272. [Chella, 1997] A. Chella, M. Frixione, and S. Gaglio. A cognitive architecture for arti- ficial vision. Artif. Intell., 89:73111, 1997. Page 138 of 171 [Gaglio, 1988] S. Gaglio, P. P. Puliafito, M. Paolucci, and P. P. Perotto. 1988. Some problems on uncertain knowledge acquisition for rule based systems. Decis. Sup- port Syst. 4, 3 (September 1988), 307-312. DOI=10.1016/0167-9236(88)90018-8 http://dx.doi.org/10.1016/0167-9236(88)90018-8 [Gärdenfors, 2004] Gärdenfors, P.: 2004, Conceptual Spaces: The Geometry of Thought, MIT Press, Cambridge, Massachusetts. [Gärdenfors, 2004a] Gärdenfors, P.: 2004. ‘Conceptual spaces as a framework for knowledge representation’. Mind and Matter 2 (2):9-27. [Lieto et al., 2013] Lieto A. e Cruciani M. (ed.s): Proceedings of the First International Workshop on Artificial Intelligence and Cognition (AIC 2013). An official work- shop of the 13th International Conference of the Italian Association for Artificial Intelligence (AI*IA 2013), Torino, Italy, December 3, 2013 on Conceptual spaces as a framework for knowledge representation. CEUR Workshop Proceedings, vol. 1100. [Oliveri, 1997] Oliveri, G.: 1997, ‘Mathematics. A Science of Patterns?’, Synthese, vol. 112, issue 3, pp. 379–402. [Oliveri, 2007] Oliveri, G.: 2007, A Realist Philosophy of Mathematics, College Publi- cations, London. [Oliveri, 2012] Oliveri, G.: 2012, ‘Object, Structure, and Form’, Logique & Analyse, vol. 219, pp. 401-442. [Resnik, 2001] Resnik, M.D.: 2001, Mathematics as a Science of Patterns, Clarendon Press, Oxford. [Scholkopf, 2001] Bernhard Scholkopf and Alexander J. Smola. 2001. Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. MIT Press, Cambridge, MA, USA. [Shapiro, 2000] Shapiro, S.: 2000, Philosophy of Mathematics. Structure and Ontology, Oxford University Press, Oxford. [Tuller, 1967] Tuller, A.: 1967, A Modern Introduction to Geometries, D. Van Nostrand Company, Inc., Princeton, New Jersey. Page 139 of 171