=Paper=
{{Paper
|id=None
|storemode=property
|title=Large scale evaluation of multi-mode recommender system using predicted contexts with mobile phone users
|pdfUrl=https://ceur-ws.org/Vol-791/paper6.pdf
|volume=Vol-791
}}
==Large scale evaluation of multi-mode recommender system using predicted contexts with mobile phone users==
Large Scale Evaluation of Multi-Mode Recommender
System Using Predicted Contexts with Mobile Phone Users
Takashi Shiraki Chihiro Ito Takeo Ohno
Information and Media Processing Information and Media Processing Service Platforms Research
Laboratories, NEC Corporation Laboratories, NEC Corporation Laboratories, NEC Corporation
t-shiraki@bu.jp.nec.com c-ito@az.jp.nec.com takeo@aj.jp.nec.com
1753 Shimonumabe, Nakahara-ku, Kawasaki, Kanagawa, 211-8666, Japan
ABSTRACT had a root mean square error (RMSE) 10% better than that of the
Context-aware recommender systems can improve user Netflix‟s legacy algorithm Cinematch.
satisfaction with recommended information when user preferences However, many challenges still remain in predicting various user
change depending on user contexts (e.g. location, time, and needs, which change depending on context. Cyberguide [1],
weather). However, the effect of each context on various user GUIDE [3], and COMPASS [11] are mobile tour guide systems
preferences has yet to be fully elucidated. In addition, few that provide information such as tourist resorts and exhibits using
examples address this challenge in large-scale real-world the time of day and user current locations as contexts. Magitti [2]
experiments. Therefore, relationships between mobile phone users‟ is a mobile leisure guide system that predicts a user‟s current and
contexts and their preferences have been evaluated. Our system future activity. It is based on extensive fieldwork in which an
calculates the intensity of various user preferences (e.g. favorite online survey and many interviews were conducted. Reference [9]
area and type of content) and recommends content accordingly. It also predicts a user‟s current activity from past / current / future
was applied to a restaurant recommendation service with 2,762 contexts of all users. Though all these systems take into account
mobile phone users over a two-month period. The evaluation contexts that influence users‟ interests, these contexts have rarely
results indicate quantitative evidence that user preferences depend been evaluated precisely in large-scale experiments, because of
on context information. In particular, location information the difficulty in obtaining both user contexts (e.g. location) and
strongly correlates with users‟ interest in recommending different user feedback in the real world.
types of restaurants. Finally, predicted context data is shown to be
more effective for recommendation than raw data. To overcome this, we have developed a Multi-Mode
Recommender System (MMRS), which can adapt to various
Categories and Subject Descriptors contextual conditions and evaluate relationships between contexts
and user interests automatically. We applied it to a trial
H.3.3 [Information Storage and Retrieval]: Information Search recommendation service of about 28,000 restaurants for 2,762
and Retrieval – Information filtering, Relevance feedback, mobile phone users with the largest Japanese mobile
Retrieval models, Selection process. communications operator NTT DOCOMO. As a result, we
clarified the following issues based on quantitative evaluations in
General Terms a large-scale experiment.
Algorithms, Experimentation, Human Factors
- Users‟ favorite items depend on user contexts.
Keywords - The MMRS can find effective profile/context.
Recommender System, Context-awareness, Mobile Systems
- Predicted contexts can be more effective for recommendation.
1. INTRODUCTION 2. MMRS
Due to the recent information overload, recommendation schemes
Figure 1 shows the overall architecture of the MMRS. The
that adapt information to changing user needs are becoming more
important. Recommender systems are increasingly being used for MMRS consists of four components: model learner, user-mode
services on the web and on devices (e.g. video recorders). For estimator, sub-recommender systems (SRSes), and item selector.
Model learner constructs user-mode learning models that are
example, Amazon.com, iTunes, and TiVo mainly recommend
calculated from past user behavioral logs. The user-mode
books, music, and TV programs, respectively. Recommender
systems score items using user profiles, content data, user estimator predicts the current user-mode from the user-mode
feedback such as purchase logs, and user contexts. learning models when a user requests recommendations. Each
SRS makes an item list by using each algorithm such as
Many personalized recommender systems using profiles, content content-based filtering and collaborative filtering. The item
data, and feedbacks have already been proposed. They are suitable selector retrieves results from item lists of SRSes. We describe
for services for providing users with items that match their those algorithms in detail in the subsections 2.1-2.4.
preferences. Amazon.com recommends products by using The MMRS has four types of input: profile data, context data,
item-based collaborative filtering, which finds similar items from feedback logs, and item data. Profile data are static-feature user
user feedback [6]. The Netflix Grand Prize solution [5, 10, 13] data, such as age, gender, and whether she/he smokes. Context
---- data are dynamic-feature user data, such as time, location, and
CARS-2011, October 23, 2011, Chicago, Illinois, USA. pulse rate. Feedback logs are user positive feedback data in
Copyright is held by the author(s)/owner(s). recommendation services, such as purchases, clicks, and
bookmarks. Item data are content data that should be retrieved in To eliminate zeros from the denominator in Equation (3), we use
recommendation services. Laplace smoothing [7], which simply adds one to each count
The MMRS has two types of output: an item list and user-modes. , as an
An item list is set of recommended items that the MMRS retrieves initial condition. The model learner calculates for all
as results. User-modes are defined as user interests in selecting and F for all on a regular basis.
information such as location of users‟ favorite contents, user
preference, and user‟s favorite method for retrieving items. 2.2 User-mode estimator
User-mode estimator is used for online-processing that predicts
current user-mode intensities from user-mode learning models,
profile data, and current context data. When the MMRS receives a
query from a user, the user-mode estimator obtains probabilities in
Equation (1) for a profile/context combination of the user.
Figure 2 shows a screenshot of the user-mode intensities. In the
case of user-mode “Preference,” the equalizer bars in the screen
indicate that user-mode estimator predicts “Gourmet meal,” which
here means a relatively expensive meal, as the most interesting for
the user. The user can also move the equalizer bars to control
recommendation results interactively.
Figure 1. Overall architecture of MMRS
2.1 Model learner
The model learner is used for offline-processing that produces
user-mode learning model from profile data, past context data,
feedback logs, and item data. User-mode learning model is the set
of intensities of user-modes for all profile/context combinations.
The intensity is based on the probability of category of
user-mode in each profile/context combination. It is given by
User-mode of user-mode category
Profile/context of user profile/context category
Figure 2. Screenshot showing user-modes
.
We use the Naïve Bayes algorithm to calculate Equation (1). 2.3 Sub-recommender system (SRS)
Under the Naïve Bayes Assumption between profile/context sets, Each SRS produces an item list from user data and item data.
we have Recommendation service providers can use any recommender
systems as SRSes. Recommended item lists of SRSes generally
differ from each other. The MMRS can evaluate each SRS‟s
effect in context-aware recommendation. In our previous study
[12], we evaluated the effect of five ranking methods
The Naïve Bayes algorithm has the limitation in handling the (content-based / profile-based / item-to-item collaborative filtering
dependency among profiles/contexts. However, we apply it to the / user-to-user collaborative filtering / profile-item matching) in
MMRS because it has the following merits for the conditional personalized recommendation that did not use context data. We
independence assumption between profile/context sets. found that users‟ favorite methods depended on the person. For
example, men in their twenties did not prefer ranking methods
- Space complexity can be reduced from using user profile data as much as females and older males.
to where 2.4 Item selector
- We can evaluate each profile/context set in a fair manner Item selector is used for online-processing that produces a result
- It is easy to use in distributed systems item list from the user-mode intensities predicted by user-mode
Additionally, in Equation (2) is calculated using estimator and item lists of SRSes. User-mode estimator sets the
probability in accordance with Equation (1) for each SRS. When a
frequency , which is defined as the amount of user
user requests recommendation, item selector randomly selects a
feedback in condition during the experiment period, as below, SRS in accordance with the SRS‟s probability and collects one
item at a time repeatedly from its SRSes until the item selector
obtains the requested number of items (Figure 3).
Table 1. User profile and context information (H=6)
Category Description
Age×Gender “Age”( -29 / 30s / 40s / 50- ): four patterns
“Gender” (male/female): two patterns
Drinker Whether user checks “like alcohol” in initial
input data or not. (drinker / non-drinker)
Weather (sunny/cloudy/rainy or snowy) is obtained by a
weather forecast service
Day-type× “Day-type” estimated from user behavioral logs
Time-of-day× (workday/day off): two patterns
Figure 3. Item selection method in item selector User-attribute “Time-of-day:” (0-4 / 4-8 / 8-12 / 12-16 / 16-20 /
3. EXPERIMENT 20-24): six patterns
“User-attribute” means user‟s current behavior
In this section, we explain how we applied the MMRS to a predicted from user‟s historical behavioral
restaurant recommendation service. Its conditions are as follows. pattern. (home / office / second office / staying in
- Users: 2762 mobile phone users who registered online private place/ commute to work / moving for
work / return home / moving to private place):
- Items: about 28000 restaurants in Tokyo and six prefectures
eight patterns
around Tokyo
- Recommendation requests: 65,445 times Current user The user area in Tokyo and six prefectures
area around Tokyo (e.g. Ginza, Roppongi, and Chiba).
- Experimental period: 64 days This is directly obtained using a GPS system or
Therefore, we use two types of positive feedback data. the base stations of the mobile phone network.
- Browse: users click to see detailed restaurant information
Next user Predicted area the user will go next, which is
- Bookmark: users bookmark to save restaurant information
area derived from the analysis of each user‟s
3.1 Profile and context data behavioral patterns.
In this experiment, we chose profile/context sets with different 3.2 User-mode
properties (Table 1). “Age,” “Gender,” and “Drinker” are profile The MMRS recommended restaurants after predicting
data (static-feature user data) obtained from service registration “User-modes” (Table 2) by using the profile/context information
information. The others are context data (dynamic-feature user as a result. In this experiment, we set three user-modes:
data). “Restaurant area,” “Preference,” and “Ranking method.”
“Day-type,” “User-attribute,” and “Next user area” in Table 1, are “Restaurant area” is important, especially for mobile phone users.
predicted contexts by user behavioral pattern analyses as follows. We also consider “Preference” (types of restaurants) as the main
The user behavioral patterns are composed of “stop places” and factor for selecting items. We are also interested in “Ranking
“trip routes.” Stop places are defined as the places where users method” because the MMRS blends results of SRSes.
had stayed within a 500-meters radius for more than 30 minutes.
Table 2. User-mode information (I=3)
Trip routes are defined as the paths between two stop places.
These are generated from location logs of a user‟s mobile phone. Category Description
Before predicting the contexts, we estimate each stop place Restaurant area The area the user prefers in Tokyo and six
between a user‟s home, office (office / second office), and private prefectures around Tokyo .
(except the home) by comparing hours stayed at all stop places, Preference The type of meal the user prefers.
after which we predict the contexts.
Casual meal With price under ¥4,999
“Day-type” tells us whether the day is a workday or day off for
the user. For example, Tuesday is a workday for a user if the user Gourmet meal With price over ¥5,000
has often been in the office on that day. “User-attribute” shows
Alcohol Bars and pubs
user‟s current behavior from eight patterns. The attributes of the
stop place or trip route determine where the user is. “Next user Café For soft drinks and
area” means the user‟s next destination. We obtained this from light meals
trip times, hour, and origin-destination history in the user Entertainment e.g. Karaoke, darts
behavioral patterns.
Ranking The method for ranking that provides
“Age×Gender” and “Day-type×Time-of-day×User-attribute” method recommended items.
denote multiple profile/context sets. We combined “age” and
“gender” because we considered that the difference in preferences Global ranking Determined by popularity
between women and men in their twenties is greater than that
between women and men of other generations. The predicted Personalized This scores items in
contexts “Day-type” and “User-attribute” depend on ranking accordance with how
“Time-of-day” because behavioral pattern analyses of them relate closely they match the
to time. user‟s stated preferences.
The MMRS had three steps in this experiment. First, it selected - Location data > Time data
one “Restaurant area” ( ) by the highest value of Equation (1). - Next user area (predicted context) > Current user area
Next, it estimated probability of “Preference” ( ) by We compared the value of each user-mode in Table 3. For all
context sets, the intensity of three user-modes is as follows.
- Restaurant area > Preference > Ranking method.
Finally, it estimated probability of “Ranking method” ( ) by We considered “Preference” to be the most important user-mode
because it is closest to user interests with regards to restaurants.
Figure 4 shows the drilled-down data of the relationship between
Equations (4) and (5) are easy extensions of Equation (1). “Preference” and “Next user area.” We chose this because “Next
user area” has the highest coefficient (0.354) in “Preference” in
Item selector stochastically selects items from SRSes. We can Table 3. As shown in Figure 4, the MMRS predicted that many
flexibly change user-modes and profile/context sets explicitly that users going to Shibuya/Roppongi (popular nightlife districts in
are important selection criteria based on services. Tokyo) would be interested in “Gourmet” restaurants. Figure 4
also indicates that users going to an urban area like to browse for
4. RESULTS AND DISCUSSIONS high-end restaurants (“Gourmet”) and cafés while those going to a
We elucidated the effect of each context on user-modes. As listed suburban/rural area browse for cheaper restaurants and alcohol.
in Table 3, we obtained the intensities of the relationships In the case of only profile data, we chose the relationship between
between user-modes and profile/context sets from the Cramer‟s “Preference” and “Age×Gender” because “Age×Gender” has a
coefficient association values [4]. Value 0 means no dependency higher coefficient (0.083) than “Drinker” in Table 3. Figure 5
between user-modes and profile/context data, and value 1 means shows the drilled-down data of it. As shown in Figure 5, women
that the user-mode is fully predictable from the profile/context. like to select “Cafés” and men like to select “Casual meals”
First, we compared the values of each profile/context set in (inexpensive restaurants). In addition, men become more
Table 3. We found that context data, except “Weather,” affected interested in “Casual meals” as they age, and women over 50 very
user-modes more than “Age×Gender” and “Drinker” did. are interested in “Gourmet meals” (expensive restaurants). This
Therefore, location context strongly correlates with user-modes. may be due to the fact that Japanese women in their 50‟s tend to
The results indicate that location information is more useful in have a lot of discretionary money.
predicting user-modes. The values of “Next user area” were In this way the model learner can clarify the relationship between
higher than those of “Current user area” in all user-modes. The any user-modes and any profile/context data. The user-mode
results indicate that context data computed by analyzing user‟s estimator predicts current user-mode intensities, and the item
behavioral logs can improve context-aware recommendations. selector can provide recommendation items that are automatically
- Context data (except “Weather”) > Profile data adapted to user interests using the intensities.
Table 3. Cramer's coefficient of association between user-modes and profile/context sets
Profile Context
Time Location
Age×Gender Drinker Weather Day-type × Time-of-day Current Next
×User-attribute user area user area
Day- Time- User-
Age Gender
type of-day attribute
Restaurant 0.142 0.144
0.185 0.064 0.643 0.716
area 0.158 0.188 0.077 0.053 0.144
0.083 0.129
Preference 0.069 0.048 0.209 0.354
0.062 0.103 0.031 0.026 0.086
Ranking 0.043 0.067
0.020 0.009 0.082 0.197
method 0.019 0.014 0.002 0.009 0.028
Figure 4. Relationship between user-mode “Preference” and context “Next user area” (0.354)
6. ACKNOWLEDGMENTS
The research presented in this paper is part of the Information
Grand Voyage Project funded by the Japanese Ministry of
Economy, Trade and Industry. NTT DOCOMO produced “My
Life Assist Service” [8] for the project, and we used their data.
We thank Isao Kobayashi of NTT DOCOMO, Kyota Kanno,
Kenshi Nishimura, Tsunehisa Kawamata, Nobuyuki Saji, and
Yasuhiro Miyao of NEC for their helpful comments. We also
thank all members that contributed to system implementation.
7. REFERENCES
[1] Abowd, G., Atkeson, C., Hong. J., Long, S., Kooper, R. and
Pinkerton, M. Cyberguide: a mobile context-aware tour
guide. Wirel. Netw. 3, 5 (1997), 421-433.
Figure 5. Relationship between user-mode [2] Bellotti, V., Begole, B., Chi, Ed H., Ducheneaut, N., Fang, J.,
“Preference” and profile “Age×Gender” (0.083) et al., Activity-based serendipitous recommendations with
the Magitti mobile leisure guide, In Proc. of the twenty-sixth
Finally, we discuss the relationship between the machine learning annual SIGCHI conference on Human factors in computing
mechanism of the MMRS system and user satisfaction by systems (CHI „08) , pp. 1157-1166, 2008.
observing the click rate, which is defined as the number of clicked [3] Cheverst, K., Davies, N., Mitchell, K., et al., Developing a
items divided by the number of retrieved items. Each plot in context-aware electronic tourist guide: some issues and
Figure 6 is the click rate on one day, and the line is the linear experiences. Proc. CHI‟00. ACM Press, NY, 2000, p. 17-24
regression line. It also shows that the click rate gradually
increased during the experiment. There are several possible [4] Cramer, E. M. et al., Some Symmetric, Invariant Measures of
factors for this increase. Multivariate Association, Psychometrika, 44, 43-54. (1979).
・ Machine learning makes MMRS more effective [5] Koren, Y., The BellKor Solution to the Netflix Grand Prize,
(2009).
・ The ratio of active users who give a lot of clicks increases http://www.netflixprize.com/assets/GrandPrize2009_BPC_B
because some inactive users quit using this services ellKor.pdf
[6] Linden, G., Smith, B. and York, J., Amazon.com
Recommendations: Item-to-Item Collaborative Filtering,
IEEE Internet Computing. Jan./Feb. 2003.
[7] Manning, C.D., Raghavan P. and Schütze H., Introduction to
Information Retrieval, Cambridge University Press, p. 240.
[8] METI of Japan, Information Grand Voyage Project: My Life
Assist Service, 2009-2010.
http://www.meti.go.jp/policy/it_policy/daikoukai/igvp/content
s_en/activity09/ms09/list/personal/ntt-docomo-inc-1.html
[9] Oku, K., et al., A Recommendation System Considering
Figure 6. Click Rates Users‟ Past / Current / Future Contexts, CARS2010.
5. CONCLUSIONS [10] Piotte, M. and Chabbert, M., The Pragmatic Theory Solution
We proposed a context-aware recommender system that retrieves to the Netflix Grand Prize, (2009).
items and user-modes. We elucidated the effect of each context on http://www.netflixprize.com/assets/GrandPrize2009_BPC_Pr
various user preferences. We applied it to a large-scale restaurant agmaticTheory.pdf
recommendation service with 2,762 mobile phone users over a
[11] van Setten, M., Pokraev, S., Koolwaaij J., Context-Aware
two-month period. From the results, we found that context
Recommendations in the Mobile Tourist Application
information is more effective for making recommendations than
COMPASS. In Adaptive Hypermedia 2004, vol. 3137 of
profile information. This indicates that recommender systems
LNCS, p.235-244, 2004.
using context input data can effectively improve user satisfaction.
Furthermore, location data is fairly useful for predicting user [12] Shiraki, T., Kanno, K., Nishimura, K. and Kawamata, T.,
intentions. User-modes depend on “Next user area” more than Evaluation Methods for a Multi-Mode Recommendation
“Current user area”. The results indicate that predicted context System, 71th IPSJ Conference 2009 (in Japanese).
data computed by analyzing user‟s behavioral patterns can http://www.ipsj.or.jp/01kyotsu/award/taikai_yushu/71award_
improve recommendations. Finally we discussed the relationship paper/2C_4.pdf
between the machine learning mechanism of the MMRS and user
satisfaction. We found that the click rate gradually increases. [13] Töscher, A., Jahrer M. and Bell, R., The BigChaos Solution
However, there may be several factors for this increase. We will to the Netflix Grand Prize, (2009).
clarify the relationship between the MMRS and satisfaction with http://www.netflixprize.com/assets/GrandPrize2009_BPC_Bi
comparative experiments in future work. gChaos.pdf