=Paper=
{{Paper
|id=Vol-3322/short2
|storemode=property
|title=Narrative Objects
|pdfUrl=https://ceur-ws.org/Vol-3322/short2.pdf
|volume=Vol-3322
|authors=Mihai Pomarlan,Robert Porzel
|dblpUrl=https://dblp.org/rec/conf/ijcai/PomarlanP22
}}
==Narrative Objects==
Narrative Objects
Mihai Pomarlan1 , Robert Porzel2
1
Applied Linguistics, University of Bremen, Bremen, Germany
2
Digital Media Lab, University of Bremen, Bremen, Germany
Abstract
In this work we operate on the view that the pragmatic employment of objects by an agent is manifested via their affordances
for that specific agent. What affordances can manifest themselves in a particular situation depends, in part, on the dispositions
offered by the particular objects involved. The contribution of this work is to construct and deploy a large scale formal model
of dispositions for physical objects that can be employed to constrain and describe the roles they can play in narratives.
1. Introduction will suffice: a disposition is a quality an object needs so
that it can play a particular role in a particular action.
A simple question such as “what is a particular object” Dispositions tend to operate in, at least, pairs. Where
turns out to have some complications if its answer is to the dispositions of knife and dough meet, there can be
be relevant for human beings. Hobbs remarks, by way of cutting. A block of wood can be cut too, though it would
illustration, that a road is a line when we plan a trip, a be hard to do so with a knife. There are more kinds of
surface when we drive on it, and a volume when we hit a dispositions to being a cutting instrument, and not all
pothole ([1], pg. 820). A road is always a road, of course, of them are good fits for all the dispositions to being
but in our common way to talk we confuse between what a cutting patient. Therefore, also needed is knowledge
something is, and what it is used as. This confusion sug- about what dispositions go together, to answer questions
gests that the use is more immediately relevant than the such as “what can I cut apples with?”, “what can I contain
ontological classification, at least where agents pursuing boiling water with?” and so on.
pragmatic goals are concerned. If equipped with knowledge of object dispositions, and
The relevance of “use as” requires that ontological how these dispositions “match” and allow various af-
modelling be able to describe both levels: of being, and fordances to manifest [4], an agent can tackle several
of use – and of other ways to interact with the pragmatic problems it may encounter in its coping with the envi-
goals of an agent. One approach to achieve this works by ronment. It can select appropriate tools for tasks it needs
also modelling the roles that (world) entities play in a nar- to perform, or seek passable substitutes. It can also look
rative an agent constructs about its interactions with the at a scene and form an idea of what can happen, and how
world [2]. The entities that appear in the agent’s narra- to work with, or against, such possibilities. If there is a
tive are classifications of world entities or ground objects puddle of oil lying around, people tend to be more careful
based on the roles they play, rather than the ground ob- with lit matches.
jects themselves. A ground object such as a hammer can In our everyday coping with the world, we don’t seem
play various roles, e.g., a murder weapon, a paper weight to usually tell ourselves stories about activities we master
or a door stopper. It can even be a tool to drive nails. The and render routine ([5], chapter 1; [6]). We just follow
potential classifications of a physical object – the ways our goals, mostly avoid dangerous configurations of the
in which it can be narrativized – depend on its physical world, without thinking about our decisions for too long.
dispositions. If however we had to narrativize – perhaps to teach some-
A disposition is a quality an object may have, relevant one else, perhaps to correct a perceived flaw in our action
for questions such as “what can this do?” and “what can or learn something new – we would use knowledge of
be done to it?” A knife can cut – it has a disposition dispositions, and dispositional matching. We would as-
allowing it to be a cutting instrument. A ball of dough sert that some tool is appropriate for a task, or explain
can be cut – it has a disposition to be a patient of cut- that we did something to prevent something else from
ting. While the concept of disposition is complicated to happening. Presumably, a similar capability is useful also
model [3], for our purposes here this basic understanding for robotic agents for doing household chores; if nothing
else, they should be able to explain themselves to human
IJCAI 2022: Workshop on semantic techniques for narrative-based users/observers.
understanding, July 24, 2022, Vienna, Austria Therefore, a large scale formal model of dispositions
$ pomarlan@uni-bremen.de (M. Pomarlan); of physical objects will be useful to the domestic service
porzel@uni-bremen.de (R. Porzel)
© 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License robots of the future. The contribution of this work is to
Attribution 4.0 International (CC BY 4.0).
CEUR
Workshop
Proceedings
http://ceur-ws.org
ISSN 1613-0073
CEUR Workshop Proceedings (CEUR-WS.org)
6
construct and deploy such a model1 and show how it can 3. Limits of Open Knowledge
be employed to constrain and describe the roles objects
can play in an agent’s narratives.
Sources
Given the abundance of large knowledge graphs that
have some connection to commonsense, are more such
2. State of the Art graphs needed or useful? We argue here that they are.
We initially used OpenCyc as a foundation for repre-
2.1. Dispositions and Affordances
senting knowledge for a robot doing household activity.
Affordances were informally defined by Gibson as what DOLCE UltraLite [27] provided a more cognitively moti-
an environment offers to an agent [7]. Formal accounts vated foundation, and we then developed an ontological
have been proposed (affordances as qualities [8], as model on top of it to distinguish between object class and
events [9], as designs [10], as relations [11], [4] etc.; object use [28, 4], a distinction which OpenCyc does not
overview in [3]). Turvey proposed a theory [12] stat- make consistently. We will look again at the knowledge
ing the disposition of a “bearer” can realize an event items in Open-/ResearchCyc in the future. In this pa-
when it encounters a “trigger” disposition under some per we discuss the more recent, open-access knowledge
“background” conditions. An ontological model for dis- graphs such as CSKG.
positions based on Turvey’s insights was recently pro- Modern knowledge graph projects emphasize their
posed [13], [3]. Learning affordances from interaction broad coverage, manifested in millions of triples. Indeed,
has been investigated [14], [15], [16]. CSKG contains plenty of trivia knowledge. Knowledge
for everyday activities is sparser. E.g., there are no Used-
2.2. Commonsense Knowledge Graphs For triples for wipe/v/wn/contact – there is no knowledge
about what tools would be appropriate for an everyday
Several benchmarks are available to test an agent’s ability task such as wiping.
to answer commonsense queries, e.g. WinoGrande [17]. The large scale knowledge graphs of today would prob-
State of the art AI either sits below human performance, ably not exist without automatic techniques and cheap
or is not reliable in providing appropriate answers. crowd-sourcing. However, this is prone to introducing er-
Embodying commonsense reasoning in a computa- rors2 . We do not intend to be critical of the CSKG project,
tional model is complex for many reasons [18], such which formed the basis of our own. We however became
as the need for commonsense knowledge. To address very aware during our work of how problematic it is for
this, several commonsense knowledge graphs have inference.
been constructed: the CommonSense Knowledge Graph We position our work, relative to other knowledge
(CSKG; [19]), itself a merge of among others Concept- graphs, as specialized on everyday knowledge and rea-
Net [20], ATOMIC [21], ImageNet [22]. The most fa- soning. More human attention was spent on vetting the
mous commonsense knowledge graph was constructed items, with newer ontological modelling approaches [28]
by Cyc [23], but unfortunately its open- and research and new sources of knowledge about object use obtained
access versions have been discontinued. from games with a purpose [29].
Linguistic resources have also been used for common-
sense reasoning: WordNet [24], FrameNet [25], Verb-
Net [26]. These capture relations between word mean- 4. SOMA_DFL
ings, descriptions of scenarios, and thematic roles and
their selectional restrictions. We now describe what are the sources for our knowledge
Our work builds from existing knowledge graphs by se- graph and how it is structured and organized into an
lecting, correcting, and integrating knowledge items from ontology for the purposes of reasoning and answering
them with new sources. We obtained a rich knowledge queries.
store for answering questions about object capabilities
and uses. 4.1. Knowledge Sources
Our primary source is CSKG [19], which combines sev-
eral knowledge resources. We have focused on the part
of CSKG that is comprised of entities with an associated
2
In CSKG, we find that amputate/v/wn/medicine is a manner of audio-
1
The knowledge graph is available at https://github.com/ease-crc/ tape/n/wn/artifact, authorize/v/wn/communication, fructify/v/wn/-
ease_lexical_resources body, and several other such doubtful relations.
7
English WordNet synset3 . We selected the part of the assert that chewing is a kind of grinding and a disposition
object taxonomy that refers to tools, buildings, and food. to be chewable is a disposition to be grindable.
Some of these entities also have associated UsedFor and UsedFor triples are interpreted as asserting that an ob-
CapableOf triples, where the third member of a triple cor- ject can play an instrumental role in a particular task.
responds to an action. We have also selected MannerOf CapableOf triples are interpreted as asserting that an
triples between actions in UsedFor and CapableOf triples. object can play a passive role, usually Patient. Triples de-
We added triples linking actions to VerbNet 3.2 classes, scribing combinations of items are interpreted as general
and so to thematic roles and selectional restrictions on assertions about all items of the categories mentioned in
role fillers. We have done extensive manual corrections the triple.
on the collected triples. We added triples describing what
items can be used together during an action, Some of
this knowledge comes from WordNet synset definitions, 5. Queries
some from the work of our colleagues on games with a
Reasoning with SOMA_DFL consists in performing sub-
purpose [29].
sumption inference tasks between concepts in the ontol-
ogy and query concepts defined by the user. We query
4.2. Structure SOMA_DFL with the reasoner Konclude. The ontology is
For reasoning and query answering, the triples from somewhat large, with 22527 object classes coming from
the knowledge graph are organized into an ontology. CSKG and related resources, and 45538 subclassof/equiv-
We now use OWL-DL, but plan to add support for non- alentclasses axioms. Further, the ontology uses the full
monotonic inference as it resembles more the default- expressivity of OWL-2. Nonetheless, performing disposi-
with-exceptions pattern that human rules often follow. tion queries is fast – less than a milisecond – as long as a
All birds fly, except those that do not – and it is conve- cache is constructed first, which takes about 10 seconds
nient both to keep the default rule as well as an open- using Konclude 0.7.0 on an Intel®Core™i5-7500 CPU @
ended list of exceptions. 3.40GHz with 8GB RAM. Table 1 shows some example
The ontology we produce is built on top of DUL [27] queries.
and the SOcio-physical Model of Activity (SOMA [28]).
These provide higher-level knowledge, in particular that 6. Conclusion and Future Work
Tasks are classify events and define Roles which can be
filled by appropriate Objects. We have presented a knowledge graph that contains in-
formation about object dispositions and possible com-
4.3. Axiomatization binations of objects that can be used to achieve a task.
The graph focuses on knowledge useful for everyday ac-
Our dispositional theory asserts that to play a role, an ob- tivities, and can be employed by a computational agent
ject must have a disposition for it. From VerbNet knowl- to select appropriate tools or patients (objects to act on)
edge, we produce axioms asserting that if an object fills a for a task, or to understand a scene in terms of what is
particular role in a particular task then it must also obey possible for the objects in it to do together. Our graph
the appropriate semantic restrictions. E.g., the following differs both in its focus – everyday activity knowledge
axioms – and in its purpose – logical reasoning, as opposed to
𝑐𝑙𝑎𝑠𝑠𝑖𝑓 𝑖𝑒𝑠−1 (𝑃 𝑎𝑡𝑖𝑒𝑛𝑡 ⊓ ∃𝑑𝑒𝑓 𝑖𝑛𝑒𝑠−1 𝑐ℎ𝑒𝑤) ⊑ information retrieval – from previous projects.
∃ℎ𝑎𝑠𝑄𝑢𝑎𝑙𝑖𝑡𝑦 𝑐ℎ𝑒𝑤.𝑃 𝑎𝑡𝑖𝑒𝑛𝑡 We will add support for non-monotonic inference, be-
∃ℎ𝑎𝑠𝑄𝑢𝑎𝑙𝑖𝑡𝑦 𝑐ℎ𝑒𝑤.𝑃 𝑎𝑡𝑖𝑒𝑛𝑡 ⊑ 𝑐𝑜𝑚𝑒𝑠𝑡𝑖𝑏𝑙𝑒 ⊓ 𝑠𝑜𝑙𝑖𝑑 cause it captures better the default with open-ended ex-
assert that what plays the patient role for chewing must ception list way that human knowledge is often orga-
have the chewable disposition, and therefore comestible nized, and will integrate knowledge about causal rela-
and solid. tions between events.
MannerOf triples are interpreted as describing a tax-
onomy, and the roles a manner defines are connected to
the roles defined by the task it is a manner of. E.g., the Acknowledgments
following axioms
𝑐ℎ𝑒𝑤 ⊑ 𝑔𝑟𝑖𝑛𝑑 This work was funded by the by the FET-Open Project
𝑐ℎ𝑒𝑤.𝑃 𝑎𝑡𝑖𝑒𝑛𝑡 ⊑ 𝑔𝑟𝑖𝑛𝑑.𝑃 𝑎𝑡𝑖𝑒𝑛𝑡 #951846 “MUHAI – Meaning and Understanding for
Human-centric AI” by the EU Pathfinder and Horizon
3
2020 Program and by the German Research Foundation
In CSKG, the names of such entities can be identified by a “/c/en/”
prefix and a “/wn/” infix. An example of such an entity is /c/en/-
(DFG) as part of Collaborative Research Center (SFB)
cut/v/wn/contact. In this paper we will omit the “/c/en/” prefix.
8
Table 1 [4] D. Beßler, R. Porzel, M. Pomarlan, M. Beetz,
Disposition queries, informally stated; concept names short- R. Malaka, J. Bateman, A Formal Model of Affor-
ened for space, responses in alphabetical order dances for Flexible Robotic Task Execution, in: Eu-
What can you use (as instrument) to – ropean Conference on Artificial Intelligence (ECAI),
broiler, dutch oven, gas oven, 2020.
bake
microwave, oven, rotisserie, tandoor [5] P. Agre, Computation and Human Experience,
adz, auger, ax, axe, backsaw, Learning in doing, Cambridge University Press,
cut New York, 1997. 00550.
(100 more)
cleaning pad, handkerchief, [6] D. Kahneman, Thinking, fast and slow,
wipe paper towel, piece of cloth, sponge, Farrar, Straus and Giroux, New York,
swab, towel 2011. URL: https://www.amazon.de/
What can you use (as object acted on) to – Thinking-Fast-Slow-Daniel-Kahneman/dp/
album, anthracite, art paper, 0374275637/ref=wl_it_dp_o_pdT1_nS_nC?
burn ie=UTF8&colid=151193SNGKJT9&coliid=
autograph, card, (204 more)
acerola, ackee, acorn squash, I3OCESLZCVDFL7.
eat
adobo, aitchbone, (1491 more) [7] J. J. Gibson, The Ecological Approach to Visual Per-
spill
absynth, acetone, acidophilus milk, ception, Psychology Press Classic Editions, 1979.
aioli, alcohol, (709 more) [8] J. Ortmann, W. Kuhn, Affordances as qualities, in:
What can you do with – Proceedings of the 2010 Conference on Formal On-
break, break into, change, tology in Information Systems: Proceedings of the
bowl Sixth International Conference (FOIS 2010), IOS
contain, cover, (25 more)
carve, change, cube Press, Amsterdam, The Netherlands, The Nether-
knife
cut, dice, (14 more) lands, 2010, pp. 117–130.
paper
burn, change, cover [9] L. A. Moralez, Affordance ontology: towards a
cut, extinguish, (12 more) unified description of affordances as events, Res.
What can you use (as instrument) to – Cogitans 7 (2016) 35–45.
ax, axe, backsaw, [10] I. Awaad, G. Kraetzschmar, J. Hertzberg, Challenges
cut firewood in finding ways to get the job done, in: 2nd Plan-
battle-axe, broad hatchet, (25 more)
alembic, amphora, ampulla, ning and Robotics (PlanRob) Workshop at 24th In-
contain a liquid
aspersorium, autoclave, (162 more) ternational Conference on Automated Planning and
cover a bowl
adhesive tape, book, duct tape, Scheduling (ICAPS), 2014.
foil, newspaper, (3 more) [11] A. Chemero, An outline of a theory of affordances,
What can you – Ecological Psychology 15 (2003) 181–195.
open with scissors envelope, letter, packet [12] M. T. Turvey, Ecological foundations of cognition:
alphabet soup, applejack, aqua vitae, Invariants of perception and action., American
scoop with a ladle
aquavit, armagnac, (75 more) Psychological Association (1992).
[13] F. Toyoshima, Modeling affordances with disposi-
tions, in: Proceedings of the Joint Ontology Work-
1320 EASE – Everyday Activity Science and Engineering, shops, 2018.
University of Bremen in subprojects P01 and P05. [14] B. Moldovan, P. Moreno, M. Van Otterlo, J. Santos-
Victor, L. De Raedt, Learning relational affordance
models for robots in multi-object manipulation
References tasks, in: 2012 IEEE International Conference on
Robotics and Automation, IEEE, 2012, pp. 4373–
[1] J. R. Hobbs, Sketch of an ontology underlying the 4378.
way we talk about the world, Int. J. Hum. Com- [15] R. Detry, E. Baseski, M. Popovic, Y. Touati,
put. Stud. 43 (1995) 819–830. URL: https://doi.org/ N. Kruger, O. Kroemer, J. Peters, J. Piater, Learn-
10.1006/ijhc.1995.1076. doi:10.1006/ijhc.1995. ing object-specific grasp affordance densities, 2009.
1076.
doi:10.1109/DEVLRN.2009.5175520.
[2] R. Porzel, On formalizing narratives, in: Proceed- [16] E. Ugur, J. H. Piater, Emergent structuring of in-
ings of the JOWO – Ontology Workshops, Bolzano, terdependent affordance learning tasks using in-
Italy, 2021. trinsic motivation and empirical feature selection,
[3] F. Toyoshima, A. Barton, J.-F. Ethier, Affordances IEEE Trans. Cogn. Dev. Syst. 9 (2017) 328–340. URL:
and their ontological core, Journal of Applied On- https://doi.org/10.1109/TCDS.2016.2581307. doi:10.
tology 17 (2022) 285–320.
9
1109/TCDS.2016.2581307. tional Conference, FOIS 2021, Bozen-Bolzano, Italy,
[17] K. Sakaguchi, R. L. Bras, C. Bhagavatula, Y. Choi, September 13-16, 2021, Frontiers in Artificial Intel-
Winogrande: An adversarial winograd schema chal- ligence and Applications, IOS Press, 2021.
lenge at scale, ArXiv abs/1907.10641 (2020). [29] J. Pfau, R. Malaka, We asked 100 people: How would
[18] E. Davis, Logical formalizations of commonsense you train our robot?, in: Extended Abstracts of
reasoning: A survey, J. Artif. Intell. Res. 59 the 2020 Annual Symposium on Computer-Human
(2017) 651–723. URL: https://doi.org/10.1613/jair. Interaction in Play, 2020, pp. 335–339.
5339. doi:10.1613/jair.5339.
[19] F. Ilievski, P. A. Szekely, B. Zhang, CSKG:
the commonsense knowledge graph, CoRR
abs/2012.11490 (2020). URL: https://arxiv.org/abs/
2012.11490. arXiv:2012.11490.
[20] R. Speer, J. Chin, C. Havasi, Conceptnet 5.5: An
open multilingual graph of general knowledge, in:
AAAI Conference on Artificial Intelligence, 2016.
URL: http://arxiv.org/abs/1612.03975.
[21] M. Sap, R. L. Bras, E. Allaway, C. Bhagavatula,
N. Lourie, H. Rashkin, B. Roof, N. A. Smith, Y. Choi,
ATOMIC: an atlas of machine commonsense for
if-then reasoning, in: The Thirty-Third AAAI
Conference on Artificial Intelligence, AAAI 2019,
The Thirty-First Innovative Applications of Ar-
tificial Intelligence Conference, IAAI 2019, The
Ninth AAAI Symposium on Educational Advances
in Artificial Intelligence, EAAI 2019, Honolulu,
Hawaii, USA, January 27 - February 1, 2019, AAAI
Press, 2019, pp. 3027–3035. URL: https://doi.org/
10.1609/aaai.v33i01.33013027. doi:10.1609/aaai.
v33i01.33013027.
[22] J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, L. Fei-
Fei, Imagenet: A large-scale hierarchical image
database, in: 2009 IEEE Conference on Computer
Vision and Pattern Recognition, 2009, pp. 248–255.
doi:10.1109/CVPR.2009.5206848.
[23] D. B. Lenat, Cyc: a large-scale investment in knowl-
edge infrastructure, Commun. ACM 38 (1995) 32–
38.
[24] C. Fellbaum (Ed.), WordNet: an electronic lexical
database, MIT Press, 1998.
[25] J. Ruppenhofer, M. Ellsworth, M. Petruck, C. John-
son, J. Scheffczyk, Framenet ii: Extended theory
and practice (2010).
[26] K. K. Schuler, VerbNet: A Broad-Coverage, Com-
prehensive Verb Lexicon, Ph.D. thesis, University
of Pennsylvania, 2006. URL: http://verbs.colorado.
edu/~kipper/Papers/dissertation.pdf.
[27] V. Mascardi, V. Cordì, P. Rosso, A comparison of
upper ontologies., 2007, pp. 55–64.
[28] D. Beßler, R. Porzel, M. Pomarlan, A. Vyas,
S. Höffner, M. Beetz, R. Malaka, J. Bateman, Foun-
dations of the socio-physical model of activities
(soma) for autonomous robotic agents, in: B. Bro-
daric, F. Neuhaus (Eds.), Formal Ontology in Infor-
mation Systems - Proceedings of the 12th Interna-
10