=Paper=
{{Paper
|id=Vol-528/paper-2
|storemode=property
|title=Using Gaussian Spatial Processes to Model and Predict Interests in Museum Exhibits
|pdfUrl=https://ceur-ws.org/Vol-528/paper2.pdf
|volume=Vol-528
|dblpUrl=https://dblp.org/rec/conf/ijcai/BohnertZS09
}}
==Using Gaussian Spatial Processes to Model and Predict Interests in Museum Exhibits==
Using Gaussian Spatial Processes to
Model and Predict Interests in Museum Exhibits
Fabian Bohnert, Ingrid Zukerman, and Daniel F. Schmidt
Faculty of Information Technology, Monash University
Clayton, VIC 3800, Australia
{fabianb,ingrid,dschmidt}@infotech.monash.edu.au
Abstract viewing times rather than explicit ratings, and (3) predictions
differing from recommendations (we do not want to recom-
This paper adapts models from the area of spatial mend exhibits that visitors are going to see anyway). We turn
statistics to the task of predicting a user’s inter- the first challenge into an advantage by exploiting the fact
ests (i. e., implicit item ratings) within a recom- that physical distances between exhibits are meaningful, en-
mender system in the museum domain. We de- abling the use of walking distance between exhibits to calcu-
velop a model based on Gaussian spatial processes, late (content) distance. This supports the direct, interpretable
and discuss two ways of computing item-to-item application of spatial processes by using a simple paramet-
distances in the museum setting. Our model was ric Gaussian spatial process model (with the ensuing low
evaluated with a real-world dataset collected by variance in parameter estimates), compared to more complex
tracking visitors in a museum. Overall, our model non-parametric approaches, e. g., [Schwaighofer et al., 2005].
attains a higher predictive accuracy than nearest- The second challenge, which stems from the variable seman-
neighbour collaborative filters. In addition, the tics of viewing times (time t for different exhibits could mean
model variant using physical distances outperforms interest or boredom), is naturally addressed by SPM’s struc-
that using distances computed from item-to-item ture. The third challenge can be addressed by (a) using SPM
similarities. to build a model of a visitor’s interests in unseen exhibits,
(b) inferring a predictive model of a visitor’s pathway through
1 Introduction the remainder of the museum [Bohnert et al., 2008], and
Spatial processes (random fields) are a subclass of stochastic (c) combining these models to recommend exhibits of interest
processes which are applied to domains that have a geospa- that may be overlooked if the predicted pathway is followed.
tial interpretation, e. g., [Diggle et al., 1998; Banerjee et al., SPM was evaluated with a real-world dataset of time spans
2004]. They are typically used in the field of spatial statistics spent by museum visitors at exhibits (viewed as implicit rat-
to model spatial associations between a set of observations ings). We compared our model’s performance to that of (1) a
made at certain locations, and to predict values at locations baseline model which delivers a non-personalised prediction,
where no observations have been made. This paper applies and (2) a nearest-neighbour collaborative filter incorporat-
such models to the prediction of a user’s interests or item rat- ing performance-enhancing modifications, e. g., [James and
ings in recommender systems (RS). We develop our Spatial Stein, 1961; Herlocker et al., 1999]. Our results show that
Process Model (SPM) by adapting a Gaussian spatial process SPM significantly outperforms both models.
model to the RS scenario, and demonstrate our model’s ap- The paper is organised as follows. In Section 2, we discuss
plicability to the task of predicting implicit ratings in the mu- related research. Section 3 describes our domain and dataset.
seum domain. The use of spatial processes requires a measure Our spatial processes approach for modelling and predicting
of distance between items in addition to users’ ratings. This exhibit interests is developed in Section 4. In Section 5, we
measure, which is non-specific (e. g., it may be a physical or present the results of our evaluation, followed by a discussion
a conceptual distance), can be readily obtained in most cases. in Section 6 and our conclusions in Section 7.
For example, distances could be computed from feature vec-
tors representing the items (similarly to content-based RS), 2 Related Research
from item-to-item similarities (similarly to item-to-item col- Recommender systems (RS) are designed to direct users to
laborative filtering [Sarwar et al., 2001]), or from physical personally interesting items in situations where the amount
distance. In this paper, we explore the latter two measures. of available information exceeds the users’ processing capa-
Our application scenario is motivated by the need to au- bility [Resnick and Varian, 1997; Burke, 2002]. Typically,
tomatically recommend exhibits to museum visitors, based such systems (1) use information about a user (i. e., a user
on non-intrusive observations of their actions in the physi- model) to predict ratings of items that the user has not yet
cal space. Employing RS in this scenario is challenging due considered, and (2) recommend suitable items based on these
to (1) the physical nature of the domain, (2) having exhibit predictions. Collaborative modelling techniques constitute
one of the main model classes applied in RS [Albrecht and dation techniques that do not require such an explicit domain
Zukerman, 2007]. They base their predictions upon the as- knowledge representation [Albrecht and Zukerman, 2007].
sumption that users who have agreed in their behaviour in the
past will agree in the future.
3 Domain and Dataset
The greatest strength of collaborative approaches is that
they are independent of any representation of the items being The G ECKO project endeavours to develop user modelling
recommended, and work well for complex objects, for which and personalisation techniques for information-rich physical
features are not readily apparent. The two main collaborative spaces, relying on non-intrusive observations of users’ be-
approaches are memory-based and model-based. Previous haviour [Bohnert et al., 2008]. Developing such non-intrusive
research has mainly focused on memory-based approaches, user modelling and personalisation techniques for museums
such as nearest-neighbour models (classic collaborative fil- requires datasets about visitor behaviour in the physical mu-
tering), e. g., [Herlocker et al., 1999]. The main drawback of seum space (i. e., visitor pathways). Datasets that are suitable
memory-based algorithms is that they operate over the entire for the development phase can be obtained by manually track-
user database to make predictions. In contrast, model-based ing museum visitors. Such a data collection methodology is
approaches use techniques such as Bayesian networks, latent- clearly inappropriate for model deployment, but it facilitates
factor models and artificial neural networks, e. g., [Breese et model development by eschewing issues related to technol-
al., 1998; Bell et al., 2007], to first learn a statistical model ogy selection and instrumentation accuracy.
in an offline fashion, and then use it to make predictions and Museums such as Melbourne Museum (Melbourne, Aus-
generate recommendations. This decomposition can signifi- tralia) display thousands of exhibits distributed over many
cantly speed up the recommendation generation process. separate galleries and exhibitions. Normally, visitors do not
Personalised guide systems in physical domains have of- require recommendations to travel between individual, logi-
ten employed adaptable user models, which require visitors cally related exhibits in close physical proximity. Rather, they
to explicitly state their interests in some form. For example, may prefer recommendations regarding physically separated
the GUIDE project [Cheverst et al., 2002] developed a hand- areas. In order to gather data for assessing predictive models
held tourist guide for visitors to the city of Lancaster, UK. that support appropriate recommendations, we grouped Mel-
It employed a user model obtained from explicit user input bourne Museum’s individual exhibits into semantically co-
to generate a dynamic and user-adapted city tour, where the herent and spatially confined exhibit areas. This task, which
order of the visited items could be varied. In the museum do- was performed with the assistance of museum staff, yielded
main, the CHIP project [Aroyo et al., 2007] investigates how 126 exhibit areas. Figure 1 depicts the site map of Melbourne
Semantic Web techniques can be used to provide personalised Museum showing these exhibit areas, together with one of the
access to digital museum collections both online and in the visitor pathways we collected.
physical museum, based on models that require an explicit To obtain our dataset, we manually tracked visitors to Mel-
initialisation. bourne Museum from April to June 2008, using a custom-
Less attention has been paid to predicting preferences made tracking tool running on laptop computers [Bohnert and
from non-intrusive observations, and to utilising adaptive user Zukerman, 2009]. In total, we recorded over 170 visitor path-
models that do not require explicit user input. In the museum ways. We only tracked first-time adult visitors travelling on
domain, adaptive user models are usually updated from a their own, to ensure that neither prior knowledge about the
user’s interactions with the system, the focus being on adapt- museum nor other visitors’ interests influenced a visitor’s de-
ing content presentation as opposed to predicting and rec- cisions about which exhibits to view. Prior to the data col-
ommending exhibits to be viewed. For example, HyperAu- lection, we briefed our trackers on the usage of our tracking
dio [Petrelli and Not, 2005] dynamically adapted the pre- software, the layout of the museum, and its digital represen-
sented content and hyperlinks to stereotypical assumptions tation on the site map. Additionally, we clarified what should
about a user, and to what a user has already accessed and be considered a viewing event. After the data collection, the
seems interested in. The augmented audio reality system for visitor pathways were post-processed using a post-processing
museums ec(h)o [Hatala and Wakkary, 2005] treated user in- tool we developed. For instance, we removed tracking events
terests in a dynamic manner, and adapted its user model on that could not have possibly occurred, e. g., visitor transitions
the basis of a user’s interactions with the system. The col- from one end of the museum to the other and back within
lected user modelling data were used to deliver personalised a few seconds, or transitions outside the museum walls and
information associated with exhibits via audio display. The back. We also removed incomplete visitor pathways, e. g.,
PEACH project [Stock et al., 2007] developed a multimedia due to a laptop computer running out of battery, or a vis-
handheld guide which adapts its user model on the basis of itor leaving unexpectedly. The resulting dataset comprises
both explicit visitor feedback and implicit observations of a 158 complete visitor pathways in the form of time-annotated
visitor’s interactions with the device. This user model was sequences of visited exhibit areas, with a total visit length
then used to generate personalised multimedia presentations. of 291:22:37 hours, and a total viewing time of 240:00:28
These systems, like most systems in the museum domain, hours. The dataset also contains demographic information
rely on knowledge-based user models in some way, and about the visitors, which was obtained by means of post-visit
hence, require an explicit, a-priori engineered representation interviews conducted by our trackers. In total, we obtained
of the domain knowledge. In contrast, our research investi- 8327 viewing durations at the 126 exhibit areas, yielding an
gates non-intrusive statistical user modelling and recommen- average of 52.7 exhibit areas per visitor (41.8% of the exhibit
(a) Melbourne Museum – Ground level (b) Melbourne Museum – Upper level
Figure 1: Visitor pathway visualised on a site map of Melbourne Museum
Table 1: Dataset statistics as viewing time correlates positively with preference and in-
terest [Parsons et al., 2004]. Hence, viewing time can be used
Mean Stddev Min Max as an indirect measure of interest. We propose to use log
viewing time (instead of raw viewing time), due to the fol-
Visit length (hrs) 1:50:39 0:47:54 0:28:23 4:42:12
lowing reasons. When examining our dataset (Section 3), we
Viewing time (hrs) 1:31:09 0:42:05 0:14:09 4:08:27
found the distributions of viewing times at exhibits to be pos-
Exhibit areas / visitor 52.70 20.69 16 103 itively skewed (we use the terms ‘exhibit’ and ‘exhibit area’
Visitors / exhibit area 66.09 25.36 6 117 synonymously in the remainder of this paper). Thus, the usual
assumption of a Gaussian model did not seem appropriate. To
select a more appropriate family of probability distributions,
areas). Hence, on average 58.2% of the exhibit areas were we used the Bayesian Information Criterion (BIC) [Schwarz,
not viewed by a visitor. This indicates that there is potential 1978]. We tested exponential, gamma, normal, log-normal
for pointing a visitor to relevant but unvisited exhibit areas. and Weibull distributions. The log-normal family fitted best,
Table 1 summarises further statistics of the dataset. with respect to both number of best fits and average BIC score
Clearly, the deployment of an automated RS in a museum (averaged over all exhibits). Hence, we transformed all view-
requires suitable positioning technologies to non-intrusively ing times to their log-equivalent to obtain approximately nor-
track visitors, and models to infer which exhibits are being mally distributed data. This transformation fits well with the
viewed. Although our dataset was obtained manually, it pro- idea that for high viewing times, an increase in viewing time
vides information of the type that may be inferred from sens- indicates a smaller increase in the modelled interest than a
ing data (the work described in [Schmidt et al., 2009] links similar increase in the context of low viewing times.
sensory and manually obtained information). Additionally,
the results obtained from experiments with this dataset are 4.2 Spatial Statistics in the Context of Our
essential for model development, as they provide an upper Application Scenario
bound for the predictive performance of our model. Spatial statistics is concerned with the analysis and predic-
tion of geographic data [Banerjee et al., 2004]. Utilising spa-
tial processes, the field deals with tasks such as modelling the
4 Using Gaussian Spatial Processes to Model associations between observations made at certain locations,
and Predict Visitors’ Exhibit Interests and predicting values at locations where no observations have
In this section, we first describe how we use viewing time to been made. The assumption made for spatial processes, that
quantify interest in exhibits (Section 4.1), and discuss the ap- correlation between observations increases with decreasing
plicability of spatial process models [Banerjee et al., 2004] site distance, fits well with our RS scenario, where viewing
to the prediction of a visitor’s interest in exhibits in our RS times are usually more correlated the more related exhibits
scenario (Section 4.2). We then propose a model-based col- are. Hence, by introducing a notion of spatial distance be-
laborative approach based on the theory of Gaussian spa- tween exhibits to functionally specify this correlation struc-
tial processes for predicting a visitor’s (log) viewing times ture, we can use spatial process models for predicting view-
(viewed as exhibit interests) from non-intrusive observations ing times (i. e., exhibit interests). We use s1 , . . . , sn to de-
of his/her (log) viewing times at visited exhibits (Section 4.3). note the locations of exhibits i, j ∈ I = {1, . . . , n} in a
space providing such a distance measure, i. e., ksi − sj k. For
example, ksi − sj k can be computed from feature vectors
4.1 From Viewing Time to Exhibit Interest
representing the items (similarly to content-based RS), from
In an information-seeking context, people usually spend more item-to-item similarities (similarly to item-to-item collabora-
time on relevant information than on irrelevant information, tive filtering [Sarwar et al., 2001]), or from physical distance.
In this paper, we explore the two latter options: Item-to-Item viewed exhibits 2, 3, 7 and 9, then µ1 = (µ2 , µ3 , µ7 , µ9 ) and
Distance and Physical Distance. σ1 = (σ2 , σ3 , σ7 , σ9 ).
• Item-to-Item Distance (I2I). Item-to-item collaborative Similarly to spatial processes, SPM assumes a special cor-
filtering [Sarwar et al., 2001] utilises a database of rat- relation structure between the viewing times of different ex-
ings to compute item-to-item similarities, and predicts hibits. In our experiments, we use a powered exponen-
a current user’s rating of an unseen item from his/her tial [Banerjee et al., 2004]:
ratings of those items that are most similar to the item ν
ρ(ksi − sj k; φ, ν) = exp (− (φksi − sj k) ) ,
in question. Inspired by how item-to-item similarities
are computed in this process, we use the observed log where φ > 0 and 0 < ν < 2. That is, ρ(ksi − sj k; φ, ν)
viewing times to derive the I2I distance measure as fol- models the correlation between the log viewing times of ex-
lows. We first transform the log viewing times into z- hibits i and j (ρ(ksi − sj k; φ, ν) depends on the sites si
scores by normalising the values for each visitor sepa- and sj of exhibits i and j only through the distance
rately. This ensures that varying viewing behaviour does ksi − sj k). Let H(φ, ν) be a correlation matrix with com-
not affect the similarity computation.1 Secondly, we cal- ponents (H(φ, ν))ij = ρ(ksi − sj k; φ, ν) collecting all these
culate item-to-item similarities using Pearson’s correla- correlations, and let Hu (φ, ν) denote a visitor u’s correlation
tion coefficient on the normalised log viewing times of matrix (dimension nu × nu ). That is, Hu (φ, ν) corresponds
exhibits i and j (using only the normalised log viewing to H(φ, ν) without the rows and columns
for unvisited ex-
times of those visitors that have viewed both exhibits i hibits. Also, let θ = µ, σ, τ 2 , φ, ν be a vector representing
and j). The resulting similarity value from within the in- the 2n + 3 model parameters, where τ 2 denotes the variance
terval [−1, 1] is finally transformed into a distance mea- of non-spatial error terms necessary to fully specify the model
sure by mapping it onto a value in [0, 1] (a similarity (these terms model non-spatial variation in the data). Then,
value of −1 yields a distance of 1, and a similarity of 1 modelling the data using Gaussian spatial processes (a de-
yields a distance of 0). tailed derivation appears in [Bohnert P et al., 2009]), r given θ
m
• Physical Distance (PD). Museums are carefully themed is multivariate normal of dimension u=1 nu . As the view-
by curatorial staff, such that closely-related exhibits are ing times of different visitors u = 1, . . . , m are independent,
in physical proximity. Based on this observation, we the model simplifies to
hypothesise that physical walking distance between ex-
ru | θ ∼ N (µu , Σu ) for all u = 1, . . . , m, (1)
hibits is inversely proportional to their (content) simi-
larity. Thus, we use physical walking distance PD as 2
where Σu = σu 1nu Hu (φ, ν)σu 1nu + τ 1nu is a visitor u’s
a measure of distance between exhibits. Specifically, covariance matrix, and 1nu is the identity matrix of dimen-
a SVG file-based representation of Melbourne Museum sion nu × nu .
was used to calculate the walking distances by mapping We employ Bayesian inference using SPM’s likelihood
the site map (Figure 1) onto a graph structure which pre- function derived from Equation 1 to estimate θ from r (in
serves the physical layout of the museum (i. e., prevent- particular, we use slice Gibbs sampling [Neal, 2003]). This
ing paths from passing through walls or ceilings). We solution offers attractive advantages over the classic frequen-
normalised the resulting distances to the interval [0, 1]. tist approach, such as the opportunity of incorporating prior
knowledge into parameter estimation via the prior distribu-
4.3 Our Gaussian Spatial Process Model tion, and capturing the uncertainty about the parameters via
In this section, we utilise theory from the area of spatial the posterior distribution.
statistics (Section 4.2) to formulate a Gaussian spatial process Given the model parameters θ = µ, σ, τ 2 , φ, ν , our
model, called Spatial Process Model (SPM), for predicting a model is fully specified, and we can use standard multivari-
museum visitor’s interests in unseen exhibits (i. e., log view- ate normal theory to predict a current visitor a’s log viewing
ing times) from his/her viewing behaviour at visited exhibits. times of unseen exhibits, say ra,1 , from a vector of observed
Let U = {1, . . . , m} be the set of all visitors, and log viewing times ra,2 . This is because (ra,1 , ra,2 ) | θ is nor-
I = {1, . . . , n} be the set of all items. Typically, for a vis- mally distributed (similarly to Equation 1). If we use the fol-
itor u ∈ U , we have viewing times for only a subset of I, say lowing notation
for nu exhibits. Denoting a visitor’s log viewing time vec-
Σa,11 Σa,12
ra,1 µa,1
tor with ru , we collect all observed logPviewing times into a |θ ∼ N , T ,
m ra,2 µa,2 Σa,12 Σa,22
vector r = (r1 , . . . , rm ) of dimension u=1 nu . Associated
with each exhibit i ∈ I is a log viewing time mean µi and then the conditional distribution p (ra,1 |ra,2 , θ) is normal
a standard deviation σi . Let µ = (µ1 , . . . , µn ) be the vector with mean vector and covariance matrix
of mean log viewing times, and σ = (σ1 , . . . , σn ) the vector
of standard deviations. Furthermore, µu and σu are the vec- E (ra,1 |ra,2 , θ) = µa,1 + Σa,12 Σ−1
a,22 (ra,2 − µa,2 ) ,
tors of means and standard deviations respectively for only Cov (ra,1 |ra,2 , θ) = Σa,11 − Σa,12 Σ−1 T
a,22 Σa,12 ,
those exhibits viewed by a visitor u. For example, if visitor 1
where E (ra,1 |ra,2 , θ) represents a personalised prediction
1
We also tested a variant of the I2I measure without visitor-wise of the log viewing times ra,1 . Additionally, a measure
normalisation. However, this variant yielded inferior results. of confidence in this prediction can be easily derived from
Cov (ra,1 |ra,2 , θ), i. e., by using the variances on the diago- was followed to obtain samples of θ for both SPM variants,
nal of this matrix. i. e., for both distance measures I2I and PD (Section 4.2). We
Being a model-based approach, SPM offers advantages then used the posterior means estimated from these samples
over memory-based collaborative filters. For instance, the to compute predictions by conditioning a multivariate normal
model parameters θ = µ, σ, τ 2 , φ, ν have a clear inter- distribution (Section 4.3). We improved SPM-I2I’s predic-
pretation, and the confidence measure provided by the model tive performance by using the (non-personalised) mean log
supports an informed interpretation of the model’s predic- viewing time µi as a prediction whenever the conditioning
tions. Additionally, recommendation generation is sped up by would have been based on fewer than K log viewing times
decoupling the model-fitting phase from the prediction phase. (in our case, K = 19). This modification was not applied to
SPM-PD. For CF, predictions were computed from the rat-
5 Evaluation ings of the nearest neighbours; and for MM, we used µi , esti-
mated from the appropriate reduced dataset, as a prediction.
This section reports on the results of an evaluation performed We performed two types of experiments: Individual Ex-
with our dataset (Section 3), including comparison with a hibit and Progressive Visit.
nearest-neighbour collaborative filter.2
• Individual Exhibit (IE). IE evaluates predictive perfor-
5.1 Experimental Setup mance for a single exhibit. For each observed visitor-
exhibit area pair (u, i), we removed the observation rui
To evaluate the predictive performance of our Spatial Process
from the vector of visitor u’s log viewing durations, and
Model (SPM), we implemented two additional models: Mean
computed a prediction r̂ui from the other observations.
Model (MM) and Collaborative Filter (CF). MM, which we
This experiment is lenient in the sense that all available
use as a baseline, predicts the log viewing time of an ex-
observations except the observation for exhibit area i are
hibit area i to be its (non-personalised) mean log viewing
kept in a visitor’s viewing duration vector.
time µi . For CF, we implemented a nearest-neighbour col-
laborative filtering algorithm, and added modifications from • Progressive Visit (PV). PV evaluates performance as a
the literature that improve its performance, such as shrink- museum visit progresses, i. e., as the number of viewed
age to the mean [James and Stein, 1961] and significance exhibit areas increases. For each visitor, we started with
weighting [Herlocker et al., 1999]. Additionally, to ensure an empty visit, and iteratively added each viewed exhibit
that varying exhibit area complexity does not affect the simi- area to the visit history, together with its log viewing
larity computation for selecting the nearest neighbours (view- time. We then predicted the log viewing times of all yet
ing time increases with exhibit complexity), we transformed unvisited exhibit areas.
the log viewing times into z-scores by normalising the values For both experiments, we used the mean absolute error
for each of the exhibit areas separately. Visitor-to-visitor dif- (MAE) to measure predictive accuracy as follows:
ferences with respect to their mean viewing durations were
removed by transforming predictions to the current visitor’s 1 XX
MAE = P |rui − r̂ui |,
viewing-time scale [Herlocker et al., 1999]. Refer to [Bohn- u∈U |Iu | u∈U i∈Iu
ert and Zukerman, 2009] for a detailed description of CF. We
tested several thousand different parameterisations, but in this where Iu denotes a visitor u’s set of exhibit areas for which
paper, we report only on the performance of the best one. predictions were computed. For IE, we calculated the total
Due to the relatively small dataset, we used leave-one-out MAE for all valid visitor-exhibit area pairs; and for PV, we
cross validation to evaluate the performance of the different computed the MAE for the yet unvisited exhibit areas for all
models. That is, for each visitor, we trained the models with visitors at each time fraction of a visit (to account for different
a reduced dataset containing the data of 157 of the 158 visit visit lengths, we normalised all visits to a length of 1).
trajectories, and used the withheld visitor pathway for test-
ing. To train and instantiate the SPM variants (i. e., SPM-I2I 5.2 Results
and SPM-PD), we obtained a sample of θ = µ, σ, τ 2 , φ, ν Table 2 shows the results for the IE experiment, where both
from p(θ|r) by performing slice Gibbs sampling [Neal, 2003] spatial models (SPM-I2I and SPM-PD) outperform both MM
on the training data. For each of the 129 free model parame- and CF. Specifically, SPM-I2I achieves an MAE of 0.7756
ters,3 we used (uninformative) independent uniform prior dis- (stderr 0.0068), and SPM-PD attains an MAE of 0.7548
tributions. We used every 20-th sample after a burn-in phase (stderr 0.0066), outperforming SPM-I2I as well. The pair-
of 1000 iterations as a sample of θ from p(θ|r), and stopped wise performance differences are statistically significant with
the sampling procedure after 8000 iterations. Thus, in total, p 0.01 for all model pairings.
we obtained 350 samples of θ from p(θ|r). This procedure The performance of SPM-PD, SPM-I2I, CF and the base-
2
line MM for the PV experiment is depicted in Figure 2. CF
For our experiments, we ignore travel between exhibit areas, outperforms MM slightly (statistically significantly for visit
and collapse multiple
q viewing events of one area into one event. fractions 0.191 to 0.374 and for several shorter intervals later
3 2
We set σi = σr,i − τ 2 to speed up the sampling process, on, p < 0.05). More importantly, both SPM-I2I and SPM-PD
2
where σr,i denotes the sample variance of the log viewing times at perform significantly better than MM and CF. For SPM-I2I,
exhibit i, calculated from the observed log viewing times rui . This this performance increase is statistically significant for visit
reduces the number of free parameters from 255 (126×2+3) to 129. fractions 0.189 to 0.960 when comparing to MM, and except
Table 2: Model performance for the IE experiment (MAE) 6 Discussion
MAE Stderr SPM offers advantages over other model-based approaches in
that, unlike neural networks (and memory-based techniques),
Mean Model (MM) 0.8618 0.0071
it returns the confidence in a prediction, and its parameters
Collaborative Filter (CF) 0.7868 0.0068
have a clear interpretation; unlike Bayesian networks, our
Spatial Process Model using I2I
model does not require a domain-specific adaptation, such
(SPM-I2I) 0.7756 0.0068
as designing the network topology. In addition, the dis-
Spatial Process Model using PD
tance measure endows our model with capabilities of hybrid
(SPM-PD) 0.7548 0.0066
RS [Burke, 2002; Albrecht and Zukerman, 2007] by seam-
lessly supporting the incorporation of other types of models
0.92
MM
(e. g., content-based). The distance measure also alleviates
CF the cold-start problem. The new-item problem is addressed
0.90 SPM−I2I
SPM−PD by utilising the (distance-based) correlation between this item
0.88 and the other items. The new-user problem is similarly han-
MAE
dled through the correlation between items rated by a user
0.86 and the other items (when utilising Physical Distance as the
distance measure, our model can make useful personalised
0.84
predictions after only one item has been rated).
0.82 Our dataset is relatively small compared to other real-world
0.0 0.2 0.4 0.6 0.8 1.0
visit fraction RS applications. Although a high number of ratings per user
slows down the slice Gibbs sampler due to repeated inversion
Figure 2: Model performance for the PV experiment (MAE) of matrices of high dimension, employing our model with
larger datasets should not represent a problem in practice.
for a few short intervals, for visit fractions 0.375 to 0.902 This is because the number of ratings per user is usually small
when comparing to CF. In comparison, SPM-PD performs compared to the number of users and items, and the compu-
significantly better than both MM and CF for visit fractions tational complexity of evaluating the likelihood function de-
0.019 to 0.922 (statistically significantly, p < 0.05). Ad- pends only linearly on the number of users in the database.
ditionally, SPM-PD outperforms SPM-I2I until visit fraction
0.660 (statistically significantly, p < 0.05). Drawing atten- 7 Conclusions and Future Work
tion to the initial portion of the visits, SPM-PD’s MAE de-
creases rapidly, whereas the MAE for MM and CF remains In this paper, we utilised the theory of spatial processes to de-
at a higher level. Generally, the faster a model adapts to velop a model-based approach for predicting users’ interests
a visitor’s interests, the more likely it is to quickly deliver in exhibits (i. e., implicit item ratings) within a RS in the mu-
(personally) useful recommendations. Such behaviour in the seum domain. We applied our model to a real-world dataset
early stages of a museum visit is essential in order to build collected by tracking visitors in a museum, using two mea-
trust in the RS, and to guide a visitor in a phase of the visit sures of item-to-item (content) distance: (1) distances com-
where such guidance is most likely needed. A similar im- puted from item-to-item similarities (as in item-to-item col-
provement in performance cannot be observed for SPM-I2I, laborative filtering), and (2) physical walking distance. For
which suggests that a visitor’s exhibit interests observed in both distance measures, our model attains a higher predictive
close physical proximity are better predictors of interests in accuracy than nearest-neighbour collaborative filters. Addi-
unseen exhibits than interests in exhibits with positively cor- tionally, the model variant using physical distances outper-
related viewing times. As expected, MM performs at a rela- forms that using distances computed from item-to-item sim-
tively constant MAE level. For CF, SPM-I2I and SPM-PD ilarities. Under the realistic Progressive Visit setting, our
we expected to see an improvement in performance (rela- model using physical distance to measure item-to-item dis-
tive to MM) as the number of visited exhibit areas increases. tance rapidly adapts to a user’s ratings (starting from as little
However, this trend is rather subtle (it can be observed when as one rating), thus alleviating the new-user problem common
plotting the models’ performance relative to MM). Addition- to collaborative filtering. This is not the case for the model
ally, for all four models, there is a performance drop towards variant based on distance computed from item-to-item simi-
the end of a visit. We postulate that these phenomena may be larities, which suggests that a visitor’s interests observed for
explained, at least partially, by the increased influence of out- exhibits in close physical proximity are better predictors of
liers on the MAE as the number of exhibit areas remaining to interests in unseen exhibits than those interests for exhibits
be viewed is reduced with the progression of a visit. This in- with positively correlated viewing times.
fluence in turn offsets potential gains in performance obtained In the future, we intend to hybridise our model by incor-
from additional observations. Our hypothesis is supported by porating content-based item features into our distance mea-
a widening in the standard error bands for all models as a sure, and to explore hybrids of models utilising a variety of
visit progresses, in particular towards the end (not shown in item-to-item distances. We also plan to extend our model
Figure 2 for clarity of presentation). However, this behaviour to fit non-Gaussian item ratings, e. g., [Diggle et al., 1998;
requires further, more rigorous investigation. Yu et al., 2006].
Acknowledgments [Hatala and Wakkary, 2005] M. Hatala and R. Wakkary.
Ontology-based user modeling in an augmented audio re-
This research was supported in part by grant DP0770931 ality system for museums. User Modeling and User-
from the Australian Research Council. The authors thank Adapted Interaction, 15(3-4):339–380, 2005.
Carolyn Meehan and her team from Museum Victoria for
their assistance; and David Abramson, Jeff Tan and Blair [Herlocker et al., 1999] J.L. Herlocker, J.A. Konstan, A.
Bethwaite for their help with the computer cluster. Borchers, and J.T. Riedl. An algorithmic framework for
performing collaborative filtering. In Proceedings of the
22th Annual International ACM SIGIR Conference on Re-
References search and Development in Information Retrieval (SIGIR-
[Albrecht and Zukerman, 2007] D.W. Albrecht and I. Zuker- 99), pages 230–237, 1999.
man. Introduction to the special issue on statistical and [James and Stein, 1961] W. James and C.M. Stein. Estima-
probabilistic methods for user modeling. User Modeling tion with quadratic loss. In Proceedings of the Fourth
and User-Adapted Interaction, 17(1-2):1–4, 2007. Berkeley Symposium on Mathematical Statistics and Prob-
[Aroyo et al., 2007] L. Aroyo, N. Stash, Y. Wang, P. Gorgels, ability, vol. 1, pages 361–379, 1961.
and L. Rutledge. CHIP demonstrator: Semantics-driven [Neal, 2003] R.M. Neal. Slice sampling. The Annals of
recommendations and museum tour generation. In Pro- Statistics, 31(3):705–767, 2003.
ceedings of the Sixth International Conference on the Se- [Parsons et al., 2004] J. Parsons, P. Ralph, and K. Gallager.
mantic Web (ISWC-07), pages 879–886, 2007. Using viewing time to infer user preference in recom-
[Banerjee et al., 2004] S. Banerjee, B.P. Carlin, and A.E. mender systems. In Proceedings of the AAAI Workshop
Gelfand. Hierarchical Modeling and Analysis for Spatial on Semantic Web Personalization (SWP-04), pages 52–64,
Data. Chapman & Hall/CRC, 2004. 2004.
[Petrelli and Not, 2005] D. Petrelli and E. Not. User-centred
[Bell et al., 2007] R. Bell, Y. Koren, and C. Volinsky. Chas-
design of flexible hypermedia for a mobile guide: Reflec-
ing $1,000,000: How we won the Netflix progress
tions on the HyperAudio experience. User Modeling and
prize. ASA Statistical and Computing Graphics Newslet-
User-Adapted Interaction, 15(3-4):303–338, 2005.
ter, 18(2):4–12, 2007.
[Resnick and Varian, 1997] P. Resnick and H.R. Varian.
[Bohnert and Zukerman, 2009] F. Bohnert and I. Zukerman. Recommender systems. Communications of the ACM,
Non-intrusive personalisation of the museum experience. 40(3):56–58, 1997.
In Proceedings of the 1st and 17th International Confer-
ence on User Modeling, Adaptation, and Personalization [Sarwar et al., 2001] B. Sarwar, G. Karypis, J.A. Konstan,
(UMAP-09), pages 197–209, 2009. and J.T. Riedl. Item-based collaborative filtering recom-
mendation algorithms. In Proceedings of the 10th Inter-
[Bohnert et al., 2008] F. Bohnert, I. Zukerman, S. Berkov- national Conference on the World Wide Web (WWW-01),
sky, T. Baldwin, and L. Sonenberg. Using interest and pages 285–295, 2001.
transition models to predict visitor locations in museums. [Schmidt et al., 2009] D.F. Schmidt, I. Zukerman, and D.W.
AI Communications, 21(2-3):195–202, 2008.
Albrecht. Assessing the impact of measurement uncer-
[Bohnert et al., 2009] F. Bohnert, D.F. Schmidt, and I. Zuk- tainty on user models in spatial domains. In Proceedings
erman. Spatial processes for recommender systems. In of the 1st and 17th International Conference on User Mod-
Proceedings of the 21st International Joint Conference on eling, Adaptation, and Personalization (UMAP-09), pages
Artificial Intelligence (IJCAI-09), 2009. 210–222, 2009.
[Breese et al., 1998] J.S. Breese, D. Heckerman, and C. [Schwaighofer et al., 2005] A. Schwaighofer, V. Tresp, and
Kadie. Empirical analysis of predictive algorithms for col- K. Yu. Learning Gaussian process kernels via hierarchi-
laborative filtering. In Proceedings of the 14th Interna- cal Bayes. In Advances in Neural Information Processing
tional Conference on Uncertainty in Artificial Intelligence Systems 17 (NIPS-04), pages 1209–1216, 2005.
(UAI-98), pages 42–52, 1998. [Schwarz, 1978] G.E. Schwarz. Estimating the dimension of
[Burke, 2002] R. Burke. Hybrid recommender systems: Sur- a model. The Annals of Statistics, 6(2):461–464, 1978.
vey and experiments. User Modeling and User-Adapted [Stock et al., 2007] O. Stock, M. Zancanaro, P. Busetta,
Interaction, 12(4):331–370, 2002. C. Callaway, A. Krüger, M. Kruppa, T. Kuflik, E. Not, and
[Cheverst et al., 2002] K. Cheverst, K. Mitchell, and N. C. Rocchi. Adaptive, intelligent presentation of informa-
tion for the museum visitor in PEACH. User Modeling
Davies. The role of adaptive hypermedia in a context-
and User-Adapted Interaction, 18(3):257–304, 2007.
aware tourist guide. Communications of the ACM,
45(5):47–51, 2002. [Yu et al., 2006] S. Yu, K. Yu, V. Tresp, and H.-P. Kriegel.
Collaborative ordinal regression. In Proceedings of
[Diggle et al., 1998] P.J. Diggle, J.A. Tawn, and R.A. Moy- the 23rd International Conference on Machine Learning
eed. Model-based geostatistics. Applied Statistics, (ICML-06), pages 1089–1096, 2006.
47(3):299–350, 1998.