<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Some Connections between Qualitative Spatial Reasoning and Machine Learning</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Anthony</forename><forename type="middle">G</forename><surname>Cohn</surname></persName>
							<email>a.g.cohn@leeds.ac.uk</email>
							<affiliation key="aff0">
								<orgName type="department">School of Computer Science</orgName>
								<orgName type="institution">University of Leeds</orgName>
								<address>
									<postCode>LS2 9JT</postCode>
									<country key="GB">UK</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="institution">The Alan Turing Institute</orgName>
								<address>
									<country key="GB">UK</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Some Connections between Qualitative Spatial Reasoning and Machine Learning</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">BABB16CFFB353BAC2430720913E37529</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T19:07+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>As has been remarked on before, Space is Special <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b1">2]</ref>. Tobler's First Law of Geography <ref type="bibr" target="#b2">[3]</ref> captures the notion that all things are related, but close things are more related. Tversky <ref type="bibr" target="#b1">[2]</ref> eloquently argues for the special place for spatial representations, and in particular that (living) things must move and act in space to survive, that all thought begins as spatial thought and that spatial thinking comes from and is shaped by perceiving the world and acting in it, be it through learning or through evolution. Artificial Intelligence has thus naturally sought to endow artificial agents with spatial representations and ways of reasoning about space. Amongst these, I will focus on qualitative spatial representations and reasoning mechanisms (henceforth QSR, where the 'R' may stand for representation or reasoning or both, depending on the context). There have been many calculi developed for representing and reasoning about space in qualitative ways, covering aspects such as (mereo)topology, orientation/direction, size, distance and shape <ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b4">5]</ref>. Whilst QSR has primarily been concerned with deductive reasoning, there have been and there are increasingly many connections between QSR and machine learning. In this talk I will discuss a number of such connections, ranging from the use of qualitative spatial representations in an inductive logic programming system to learn event classes occurring in video data, to the question of whether large language models (LLMs) are able to make inferences reliably about qualitative spatial relations, and whether they can be supported by symbolic reasoners. Learning rules for video interpretation: Dubba et al. <ref type="bibr" target="#b5">[6]</ref> show how Inductive Logic Programming can be used to learn a set of rules which can be used to recognise event class instances where videos have been abstracted to a set of qualitative spatio-temporal relations. The method is demonstrated in two domains including one which involves recognising the events which are necessary to service an aircraft whilst it is turning around at an airport. Whilst the resulting rules are relatively simple and it might be wondered whether a hand-written set of rules could not be easily written and just as effective, it turns out that in a comparison with such a set of manually written rules, the learned model is more effective, because the latter does not take account of noise in the video data, where as the learned model was already trained on noisy data and was thus more robust in the face of noisy data at classification time. The paper also shows how the inductive process can be interleaved with abduction, using an embedded spatial theory to improve the learned model in the face of noisy training data.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Learning groundings for spatial representations:</head><p>A key question for QSR is how the relations in the calculus correspond to their use in language and their correspondence to the real world. Whilst relations are usually given plausible names in a relational calculus, there is no guarantee that these correspond to naturally occurring instances. Indeed, McDermott [7]  notes the dangers of "wishful naming". Alomari et al. [8]  present a system, named OLAV, which addresses the problem of bootstrapping knowledge in language and vision for autonomous robots. OLAV is able, for the first time, to (1) learn to form discrete concepts from sensory data; (2) ground language (n-grams) to these concepts (which include not only spatial relations, but also object attributes and actions); (3) induce a grammar for the language being used to describe the perceptual world; and moreover to do all this incrementally, without storing all previous data. The resulting grammar can then be used to parse novel commands for downstream action in a robotic system. Analysing polysemy in spatial prepositions: One challenge in assigning meanings to spatial prepositions is that they can frequently be polysemous, i.e. they can have multiple related senses (the polysemes). As the senses of polysemous terms are so closely intertwined, the theoretical and computational treatment of polysemy presents a difficult challenge for semantic models. To given an example: compare "book on a table", "balloon on the ceiling" and "picture on the wall".  discuss this problem and shows how a model can be built in which these senses can be distinguished using data from human subjects. Can Large Language Models perform qualitative spatial reasoning reliably? Many claims (e.g. <ref type="bibr" target="#b9">[10,</ref><ref type="bibr" target="#b10">11,</ref><ref type="bibr" target="#b11">12]</ref>) have been made since the emergence of Large Language Models (LLMs) as to their ability to reason. Spatial</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>reasoning is of particular interest since not only does it underlie a human's ability to operate in the physical world, but also because LLMs are not embodied; so the question arises, have they nonetheless acquired an ability to reason about situations which might occur in the real physical world? I will present the results of a number of experiments in which this ability is tested: for (cardinal)Michael Sioutis &lt;michael.sioutis@lirmm.fr&gt; directions <ref type="bibr" target="#b12">[13,</ref><ref type="bibr" target="#b13">14]</ref>, for relational composition and conceptual neighbourhood construction <ref type="bibr" target="#b14">[15]</ref> and other notions in spatial reasoning <ref type="bibr" target="#b15">[16]</ref>. One challenge for evaluating LLMs in the domain of spatial reasoning (and commonsense more generally <ref type="bibr" target="#b16">[17]</ref>) is the paucity of good benchmarks -I will discuss this issue and briefly present a new benchmark which is based on a synthetic generator, able to provide arbitrarily many examples of automatically labelled indoor virtual scenes <ref type="bibr" target="#b17">[18]</ref>. Using LLMs as a natural language interface to symbolic spatial reasoners: Given the deficiencies in the robustness of LLMs in performing qualitative spatial reasoning, it is worth asking the question whether an LLM and a more traditional symbolic reasoner in combination could be more effective than either on their own. An LLM has strengths in analysing language, but no so much in more complex reasoning, whilst an LLM on its own has no ability to comprehend natural language. The combination of the two can be particularly effective, for example as demonstrated in the StepGame benchmark <ref type="bibr" target="#b18">[19,</ref><ref type="bibr" target="#b13">14]</ref>.</p></div>		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Acknowledgements This work was supported by: the Fundamental Research priority area of The Alan Turing Institute; Microsoft Research -Accelerating Foundation Models Research program; the Economic and Social Research Council (ESRC) under grant ES/W003473/1. I also wish to give heartfelt thanks to all my co-authors in the papers <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b7">8,</ref><ref type="bibr" target="#b8">9,</ref><ref type="bibr" target="#b13">14,</ref><ref type="bibr" target="#b15">16,</ref><ref type="bibr" target="#b17">18]</ref> I will discuss in the talk, and with whom it has been such a pleasure to interact with.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Challenges in geographical information science</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">F</forename><surname>Goodchild</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences</title>
		<imprint>
			<biblScope unit="volume">467</biblScope>
			<biblScope unit="page" from="2431" to="2443" />
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<author>
			<persName><forename type="first">B</forename><surname>Tversky</surname></persName>
		</author>
		<title level="m">Mind in Motion: How Action Shapes Thought</title>
				<imprint>
			<publisher>Basic Books</publisher>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Tobler&apos;s first law and spatial analysis</title>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">J</forename><surname>Miller</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Annals of the association of American geographers</title>
		<imprint>
			<biblScope unit="volume">94</biblScope>
			<biblScope unit="page" from="284" to="289" />
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Qualitative spatial representation and reasoning</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">G</forename><surname>Cohn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Renz</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Handbook of knowledge representation</title>
				<editor>
			<persName><forename type="first">F</forename><surname>Van Harmelen</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">V</forename><surname>Lifschitz</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">B</forename><surname>Porter</surname></persName>
		</editor>
		<imprint>
			<publisher>Elsevier</publisher>
			<date type="published" when="2008">2008</date>
			<biblScope unit="page" from="551" to="596" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">A survey of qualitative spatial representations</title>
		<author>
			<persName><forename type="first">J</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">G</forename><surname>Cohn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Ouyang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Yu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">The Knowledge Engineering Review</title>
		<imprint>
			<biblScope unit="volume">30</biblScope>
			<biblScope unit="page" from="106" to="136" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Learning relational event models from video</title>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">S</forename><surname>Dubba</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">G</forename><surname>Cohn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">C</forename><surname>Hogg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Bhatt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Dylla</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Artificial Intelligence Research</title>
		<imprint>
			<biblScope unit="volume">53</biblScope>
			<biblScope unit="page" from="41" to="90" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Artificial intelligence meets natural stupidity</title>
		<author>
			<persName><forename type="first">D</forename><surname>Mcdermott</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM SIGART Bulletin</title>
		<imprint>
			<biblScope unit="page" from="4" to="9" />
			<date type="published" when="1976">1976</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Online perceptual learning and natural language acquisition for autonomous robots</title>
		<author>
			<persName><forename type="first">M</forename><surname>Alomari</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">C</forename><surname>Hogg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">G</forename><surname>Cohn</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Artificial Intelligence</title>
		<imprint>
			<biblScope unit="volume">303</biblScope>
			<biblScope unit="page">103637</biblScope>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Identifying and modelling polysemous senses of spatial prepositions in referring expressions</title>
		<author>
			<persName><forename type="first">A</forename><surname>Richard-Bollans</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">G</forename><surname>Álvarez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">G</forename><surname>Cohn</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Cognitive Systems Research</title>
		<imprint>
			<biblScope unit="volume">77</biblScope>
			<biblScope unit="page" from="45" to="61" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><surname>Creswell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Shanahan</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2208.14271</idno>
		<title level="m">Faithful reasoning using large language models</title>
				<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<author>
			<persName><forename type="first">J</forename><surname>Huang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">C</forename></persName>
		</author>
		<author>
			<persName><forename type="first">.-C</forename><surname>Chang</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2212.10403</idno>
		<title level="m">Towards reasoning in large language models: A survey</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Large language models are zero-shot reasoners</title>
		<author>
			<persName><forename type="first">T</forename><surname>Kojima</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">S</forename><surname>Gu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Reid</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Matsuo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Iwasawa</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2205.11916</idno>
	</analytic>
	<monogr>
		<title level="j">Advances in neural information processing systems</title>
		<imprint>
			<biblScope unit="volume">35</biblScope>
			<biblScope unit="page" from="22199" to="22213" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Evaluating the ability of large language models to reason about cardinal directions</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">G</forename><surname>Cohn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">E</forename><surname>Blackwell</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2406.16528</idno>
	</analytic>
	<monogr>
		<title level="m">Proc. COSIT-24 (to appear)</title>
				<meeting>COSIT-24 (to appear)</meeting>
		<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Advancing spatial reasoning in large language models: An in-depth evaluation and enhancement using the StepGame benchmark</title>
		<author>
			<persName><forename type="first">F</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">C</forename><surname>Hogg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">G</forename><surname>Cohn</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. AAAI</title>
				<meeting>AAAI</meeting>
		<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<title level="m" type="main">An evaluation of ChatGPT-4&apos;s qualitative spatial reasoning capabilities in RCC-8</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">G</forename><surname>Cohn</surname></persName>
		</author>
		<ptr target="https://arxiv.org/abs/2309.15577.arXiv:2309.15577" />
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note>appears in Working Notes of QR-23</note>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<title level="m" type="main">Dialectical language model evaluation: An initial appraisal of the commonsense spatial reasoning abilities of LLMs</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">G</forename><surname>Cohn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Hernandez-Orallo</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2304.11164</idno>
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Benchmarks for automated commonsense reasoning: A survey</title>
		<author>
			<persName><forename type="first">E</forename><surname>Davis</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM Computing Surveys</title>
		<imprint>
			<biblScope unit="volume">56</biblScope>
			<biblScope unit="page" from="1" to="41" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Reframing spatial reasoning evaluation in language models: A real-world simulation benchmark for qualitative reasoning</title>
		<author>
			<persName><forename type="first">F</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">C</forename><surname>Hogg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">G</forename><surname>Cohn</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. IJCAI</title>
				<meeting>IJCAI</meeting>
		<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">StepGame: A new benchmark for robust multi-hop spatial reasoning in texts</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Shi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Lipani</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. AAAI</title>
				<meeting>AAAI</meeting>
		<imprint>
			<date type="published" when="2022">2022</date>
			<biblScope unit="volume">36</biblScope>
			<biblScope unit="page" from="11321" to="11329" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
