A Proposal for Primitive Decomposition of Spatial Orientation Relationships Jamie C. Macbeth1,* , Mackie Zhou1 and Zoie Zhao1 1 Department of Computer Science, Smith College, 10 Elm St, Northampton, Massachusetts, 01063, USA Abstract This short paper continues work on primitive decomposition systems for meaning representation which combine image schemas and conceptual dependency primitive systems. An important thread of this research seeks small abstract sets of conceptual primitives so that decompositions of imagery evoked by language give rise to rich sets of mappings between language and the language-free representations, reflecting the linguistic variation of human language behavior. In this brief paper, we present a proposal for novel primitive decompositions of positions, spatial relationships, and orientations of objects in space in a conceptual representation framework. As an abstract first approximation, we introduce a spatial primitive which represents that one object is positioned in between two other objects, and combine it with part-whole representations to decompose commonly referenced concepts and language expressions of the positions and orientations of objects in relation to their surroundings. Keywords Conceptual Dependency, Image Schemas, Conceptual Representations, Formal Modeling 1. Introduction Natural language understanding is a human intelligence task that continues to be a critical area for evaluating knowledge representation systems, commonsense reasoning, meaning represen- tation systems, and ontologies. This short paper continues work on primitive decomposition systems for meaning representation which combine image schemas [1, 2] and conceptual depen- dency primitive systems [3, 4, 5]. In this brief paper, we present an informal proposal for novel primitive decompositions of relationships between and among objects in space in a conceptual representation framework designed for in-depth natural language understanding systems. 2. Background Much of the focus of earlier work on conceptual dependency was on the representations of events and was dominated by decompositions of eleven primitive physical, mental, and social “acts” which changed the state of the world: PTRANS, MOVE, PROPEL, INGEST, EXPEL, CAOS VII: Cognition and Ontologies, 9th Joint Ontology Workshops (JOWO 2023), co-located with FOIS 2023, 19-20 July, 2023, Sherbrooke, Québec, Canada * Corresponding author. $ jmacbeth@smith.edu (J. C. Macbeth); mzhou@smith.edu (M. Zhou); zzhao39@smith.edu (Z. Zhao) € https://jamiemacbeth.com (J. C. Macbeth)  0000-0003-3474-203X (J. C. Macbeth) © 2023 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR Workshop Proceedings http://ceur-ws.org ISSN 1613-0073 CEUR Workshop Proceedings (CEUR-WS.org) CEUR ceur-ws.org Workshop ISSN 1613-0073 Proceedings GRASP, ATRANS, MTRANS, ATTEND, MBUILD, and SPEAK. This was largely due to the early focus of applying the representation system towards natural language understanding and story understanding tasks, for which representing dynamic and pivotal story events is key [6]. One relatively unexplored idea is to subject aspects of “static” scenes, spatial characteristics of objects, the spatial relationships between objects, and arrangements of objects in scenes to prim- itive decompositions. In the spirit of earlier work on CD and more recent work juxtaposing CD primitives with image schemas [7], we are seeking a small abstract set of conceptual primitives and decompositions that will have the same positive benefits as the primitive decompositions for events: it will allow for imagery evoked by language to be represented as unambiguously as possible through a language-free representation, allow rich sets of mappings between language and the representations, and allow for reasoning about scenes at both low levels and high levels of detail, depending on the “molecular” complexity of the primitive-decomposed structures. 3. Objects, Abstract Positions, and Orientations We build on prior work on the Mental Motion Pictures system [8], a novel CD-based conceptual analyzer of natural language which attempts an in-depth understanding of ProPara [9] para- graphs by creating sequences of frames to represent the evolution in time of the imagery evoked by the text. Here we focus specifically on how to represent positions, spatial relationships, and orientations of objects in the scenes, whether they are still or in motion. 3.1. Objects The conception of physical objects in the system builds on conceptual dependency (CD), image schemas (IS), and work to juxtapose, combine, and formalize the two [10, 11, 12, 7, 13, 14]. We treat objects in the system as Object image schemas, or as picture producers (known as PPs in conceptual dependency). The conceptual analyzer system performs primitive decomposition-based language under- standing without an underlying knowledge base. Within this context it is important to point out that the conceptual analyzer currently has no knowledge about objects other than their abstract spatial relationships. It knows nothing about the sizes and shapes of objects, or states of matter. For example it does not know that the earth is a much larger object than a drop of rain, that rain is a liquid, or even that the earth is spherical. As a result, the diagrams in this paper show objects simply as circles of equal size in two dimensions to convey the way in which the representations are agnostic of many of the commonly understood characteristics of the objects. 3.2. Part-Whole Relationships We extend previous work [8] by using part-whole relationships to specify the parts of objects that are relevant to their perceived orientations. This now allows us to, for example, identify that certain kinds of objects that are not perfectly symmetrical have tops and bottoms, left and right parts, fronts and backs, and dorsal and ventral parts. These correspond to the Part-Whole image schema. CD does retain a primitive called PART that is not one of the eleven primitive “acts” but is present in many CD conceptualizations to specify that one object (a picture producer or PP in CD) is a part or sub-object of another object (or PP). To operationalize this feature, PartOf(A,B) indicates that object A is a part of or is a sub-object of object B. In diagrams, we show parts of objects as regions of a circle representing the full object in two dimensions. 3.3. In-Between Relationships In prior work [8] the Mental Motion Pictures conceptual analyzer decomposed meanings of words such as “rise”, “fall”, “above”, and “below” in ProPara paragraphs in terms of a positional graph with edges representing the relative distance of objects with respect to the center of the earth. For example, if an object A is at a position that is greater in altitude than another object B, this relationship is represented as an edge between A and B in the position map indicating that A is further away from the center of the earth than B. Words for movement such as “rise” and “fall” are represented by instantiating a CD PTRANS (corresponding to the Source_Path_Goal image schema) and instantiating the center of the earth as an object in the “to” and “from” cases in a CD PTRANS conceptualization. The center of the earth was chosen as the reference point because many of the input texts involve events and object movements both above and under ground. In the current proposal, we expand the position map idea so that it incorporates relation- ships between any collection of objects, not only relationships between objects and the cen- ter of the earth. We introduce a new primitive, InBetween(), to represent how any three objects are posed with respect to each other in space. For three Objects or PPs, A, B, and C, InBetween(A,B,C) indicates a situation in which B is between A and C. This primi- tive could be used to represent altitude or changes in altitude relationships. For example InBetween(A,B,CenterOfEarth) could be used to indicate that Object B is closer to the center of the earth than Object A, and thus A is at a greater height or altitude, just as with the positional map in earlier work. In the spirit of a first approximation that maximizes the abstractness of the primitive definition, we assume for now that there exists a straight line which passes through all the objects in an InBetween().1 3.4. Decomposing Object Orientations Combining the more general InBetween() spatial relation primitive with Part-Whole rela- tionships allows us to propose representations not only of where objects are located in space with respect to one another, but also represent how they are oriented or how they are facing or not facing one another. Figure 1 illustrates how this works for the orientation expressions “right-side up” and “upside-down” in both pictorial form and using PartOf() and InBetween(). The representation identifies certain inherent parts of an object, its Top and Bottom, using the PartOf() primitive (for the Part-Whole image schema or PART CD primitive). PartOf(Top, Object) and PartOf(Bottom, Object) indicate that Top and Bottom are part of an Object called Object. Then, since Object parts are also Objects in their own right, InBetween(Top, 1 For ProPara paragraphs describing events such as rain falling from a cloud and landing on the ground, or rays of light traveling from the sun and being absorbed by a leaf, the generality lost due to this assumption appears to be not very consequential. Top “Right-side up” Bottom “Upside down” Bottom PartOf(Top, Object) Top PartOf(Top, Object) PartOf(Bottom, Object) PartOf(Bottom, Object) InBetween(Top, InBetween(Bottom, Bottom, Top, CenterOfEarth) CenterOfEarth) Center Center of Earth of Earth Figure 1: Left: an abstract representation of the expression “right-side up” for an object, its Top and Bottom parts, and their orientation in relation to the center of the earth in both pictorial form and using PartOf() and InBetween(). Right: an abstract representation of the expression “upside down” for an object, its Top and Bottom parts, and their orientation in relation to the center of the earth in both pictorial form and using PartOf() and InBetween(). Bottom, CenterOfEarth) is used to indicate that the Bottom part of the object is lower or closer to the center of the earth than the Top part, and thus it must be oriented right-side up (Figure 1, left). If the same object parts are identified and, instead, InBetween(Bottom, Top, CenterOfEarth) were the case, then the Top part of the object is lower or closer to the center of the earth than the Bottom part, and it must be oriented upside down (Figure 1, right). In a second pair of examples, Figure 2 illustrates how this works for the orientation expressions “facing” and “prone” in both pictorial form and using PartOf() and InBetween(). The representation identifies the Front and Back of an object as its inherent parts using the PartOf() primitive. PartOf(Front, Object) and PartOf(Back, Object) indicate that Front and Back are part of an Object called Object. Then InBetween(Back, Front, TheSun) is used to indicate that the Front part of the object is lower or closer to the sun than the Back part, and thus it must be oriented facing the sun (Figure 2, left). If the same object parts are identified and, instead, InBetween(Back, Front, CenterOfEarth) were the case, then the Front part of the object is lower or closer to the center of the earth than the Back part, and it must be oriented in what is commonly referred to as a “prone” position (Figure 2, right). 4. Related Work Related work has examined using image schemas and primitive decomposition systems for knowledge representation, commonsense reasoning systems, and knowledge bases [11, 15, 16], and using crowdsourcing to build knowledge bases of image schemas and conceptual dependency structures [13, 10, 12]. Other work has drawn comparisons between image schemas and conceptual dependency primitives [7], and has formalized CD by modeling conceptual dependency primitives with image schema logic [14]. Primitive decomposition systems have Back “Facing the sun” Back “Prone” Front PartOf(Front, Object) Front PartOf(Front, Object) PartOf(Back, Object) PartOf(Back, Object) InBetween(Back, InBetween(Back, Front, Front, TheSun) CenterOfEarth) Center The Sun of Earth Figure 2: Left: an abstract representation of the expression “facing” for an object, its Front and Back parts, and their orientation in relation to another object—in this case, the sun—in both pictorial form and using PartOf() and InBetween(). Right: an abstract representation of the expression “prone” for an object, its Front and Back parts, and their orientation in relation to the center of the earth in both pictorial form and using PartOf() and InBetween(). also been proposed for enhancing self-supervised learning [17]. Recently there have been applications of image schemas and image schema logic to autonomous and reactive robotics [18, 19]. 5. Conclusion In this brief paper, we present an informal proposal for novel primitive decompositions of positions, spatial relationships, and orientations of objects in space. We introduce a spatial primitive which represents that one object is positioned in between two other objects, and combine it with part-whole object relationships to decompose concepts corresponding to commonly used terms for orientations of objects in their environments. Having these kinds of abstract primitive decompositions will enable rich representations through the variety of ways in which they can be combined with themselves and still other primitives. We look forward to applying the idea to natural language understanding and generation systems based on componential analysis and primitive decomposition. References [1] J. M. Mandler, C. P. Cánovas, On defining image schemas, Language and Cognition 6 (2014) 510–532. [2] T. Oakley, Image schemas, in: D. Geeraerts, H. Cuyckens (Eds.), The Oxford Handbook of Cognitive Linguistics, Oxford University Press, New York, 2007. [3] R. C. Schank, Conceptual dependency: A theory of natural language understanding, Cognitive Psychology 3 (1972) 552–631. [4] R. C. Schank, Conceptual Information Processing, Elsevier, New York, NY, 1975. [5] S. L. Lytinen, Conceptual dependency and its descendants, Computers & Mathematics with Applications 23 (1992) 51–73. [6] R. C. Schank, C. K. Riesbeck, Inside Computer Understanding: Five Programs Plus Minia- tures, Lawrence Erlbaum Associates, Hillsdale, NJ, 1981. [7] J. C. Macbeth, D. Gromann, M. M. Hedblom, Image schemas and conceptual dependency primitives: A comparison, in: Proceedings of The Joint Ontology Workshops, Episode 3: The Tyrolean Autumn of Ontology, The International Association for Ontology and its Applications, Bolzano-Bozen, Italy, 2017. [8] M. Zhou, B. Duah, J. C. Macbeth, Novel primitive decompositions for real-world physical reasoning, in: K. R. Thórisson (Ed.), Proceedings of the Third International Workshop on Self-Supervised Learning, volume 192 of Proceedings of Machine Learning Research, PMLR, 2022, pp. 22–34. URL: https://proceedings.mlr.press/v192/zhou22a.html. [9] B. Dalvi, L. Huang, N. Tandon, W.-t. Yih, P. Clark, Tracking state changes in procedural text: a challenge dataset and models for process paragraph comprehension, in: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), Association for Computational Linguistics, New Orleans, Louisiana, 2018, pp. 1595–1604. [10] J. C. Macbeth, M. Barionnette, The coherence of conceptual primitives, in: Proceedings of the Fourth Annual Conference on Advances in Cognitive Systems, The Cognitive Systems Foundation, Evanston, Illinois, 2016. [11] J. C. Macbeth, Conceptual primitive decomposition for knowledge sharing via natural language, in: Proceedings of The Joint Ontology Workshops, Episode 3: The Tyrolean Autumn of Ontology, The International Association for Ontology and its Applications, Bolzano-Bozen, Italy, 2017. [12] J. C. Macbeth, S. Grandic, Crowdsourcing a parallel corpus for conceptual analysis of natural language, in: Proceedings of The Fifth AAAI Conference on Human Computation and Crowdsourcing, AAAI Press, Quebec City, Canada, 2017, pp. 128–136. [13] D. Gromann, J. C. Macbeth, Crowdsourcing image schemas, in: Proceedings of The Fourth Image Schema Day (ISD4), The International Association for Ontology and its Applications, Bolzano-Bozen, Italy, 2018. [14] J. C. Macbeth, D. Gromann, Towards modeling conceptual dependency primitives with image schema logic, in: The Fourth Workshop on Cognition And OntologieS (CAOS IV) at The Fifth Joint Ontology Workshop (JOWO’19), The International Association for Ontology and its Applications, Graz, Austria, 2019. [15] E. Cambria, Q. Liu, S. Decherchi, F. Xing, K. Kwok, SenticNet 7: A commonsense-based neurosymbolic AI framework for explainable sentiment analysis, in: Proceedings of the Thirteenth Language Resources and Evaluation Conference, European Language Resources Association, Marseille, France, 2022, pp. 3829–3839. URL: https://aclanthology.org/2022. lrec-1.408. [16] S. De Giorgis, A. Gangemi, D. Gromann, Imageschemanet: formalizing embodied common- sense knowledge providing an image-schematic layer to framester, Semant. Web J.(2022, forthcoming) (2022). [17] J. C. Macbeth, Enhancing learning with primitive-decomposed cognitive representations, in: Proceedings of The First Annual International Workshop on Self-Supervised Learning (IWSSL-2020), Cambridge, MA, 2020. [18] K. Dhanabalachandran, M. M. Hedblom, M. Beetz, A balancing act: Ordering algorithm and image-schematic action descriptors for stacking objects by household robots, in: The Eighth Joint Ontology Workshops (JOWO22), RobOntics: 3rd International Workshop on Ontologies for Autonomous Robotics Jönköping, Sweden, August 15-19, 2022, CEUR-WS, 2022. [19] M. Pomarlan, S. De Giorgis, M. M. Hedblom, M. Diab, N. Tsiogkas, Thinking in front of the box: Towards intelligent robotic action selection for navigation in complex environments using image-schematic reasoning, in: The Eighth Joint Ontology Workshops (JOWO22), RobOntics: 3rd International Workshop on Ontologies for Autonomous Robotics Jönköping, Sweden, August 15-19, 2022, CEUR-WS, 2022.