=Paper=
{{Paper
|id=Vol-1388/poster_paper6
|storemode=property
|title=Predicting Actions Using a Probabilistic Model of Human Decision Behaviours
|pdfUrl=https://ceur-ws.org/Vol-1388/poster_paper6.pdf
|volume=Vol-1388
|dblpUrl=https://dblp.org/rec/conf/um/CruickshankRS15
}}
==Predicting Actions Using a Probabilistic Model of Human Decision Behaviours==
Predicting actions using an adaptive probabilistic model of human decision behaviours A.H.Cruickshank, R.Shillcock, S.Ramamoorthy School of Informatics, University of Edinburgh, EH8 9AB, UK A.H.Cruickshank@sms.ed.ac.uk rcs@inf.ed.ac.uk S.Ramamoorthy@ed.ac.uk Abstract. Computer interfaces provide an environment that allows for multiple objectively optimal solutions but individuals will, over time, use a smaller number of subjectively optimal solutions, developed as habits that have been formed and tuned by repetition. Designing an interface agent to provide assistance in this environment thus requires not only knowledge of the objectively optimal solutions, but also recognition that users act from habit and that adaptation to an individual’s subjectively optimal solutions is required. We present a dynamic Bayesian network model for predicting a user’s actions by inferring whether a decision is being made by deliberation or through habit. The model adapts to indi- viduals in a principled manner by incorporating observed actions using Bayesian probabilistic techniques. We demonstrate the model’s effective- ness using specific implementations of deliberation and habitual decision making, that are simple enough to transparently expose the mechanisms of our estimation procedure. We show that this implementation achieves > 90% prediction accuracy in a task with a large number of optimal solutions and a high degree of freedom in selecting actions. Keywords: User Modeling, Bayesian Methods, Action Recognition. 1 Introduction Computer interfaces provide an environment that allows for multiple ob- jectively optimal solutions but individuals will, over time, use a smaller number of subjectively optimal solutions, developed as habits that have been formed and tuned by repetition. Thus an interface agent providing assistance in this environment requires not only knowledge of the objec- tively optimal solutions, but also the ability to adapt to an individual’s subjectively optimal solutions. Utilising findings in psychology and neu- roscience we propose a general model that adapts to individuals using Bayesian probability to infer the type of decision making behaviour that will be used. We demonstrate the effectiveness of our approach using simple implementations for two decision systems, the deliberative and habitual systems. The deliberative system uses an internal model of the environment for forward planning to reach a goal and selects actions based on the calculated plan. The habitual system learns the utility for actions in a situation based on previous experience and selects the one that has proven to be most useful in the past. The existence of both these systems has been recognised from early studies in psychology and neuro- science ([6], [5]) and are known to coexist ([3]). Our approach is related to others that are derived from human decision making, such as that used in [1], which proposes a Bayesian Theory of Mind, and [7], which extended the ACT-R cognitive architecture to create ACT-R/E. Other approaches for plan and action recognition are derived from applying au- tomated planning and machine learning techniques. These generally fall into one of two categories a planner approach ([4]) or a historic approach ([2]). Of note is that both of these approaches broadly replicate one of the two types of decision system used by people when selecting actions. Plan- ner predictors replicate the deliberative decision system whereas historic predictors replicate the habitual decision system. Thus our approach can be viewed as integrating these two types of predictors using a Bayesian model combination. 2 Predictive Model Action prediction is performed using a dynamic Bayesian network model B is a decision behaviour, which we define as any process that provides a distribution over actions given a state and previously observed ac- tions. That is, a decision behaviour can be represented as a distribution p(a′ |B, s, a