=Paper=
{{Paper
|id=Vol-1438/paper9
|storemode=property
|title=Interaction Design in a Mobile Food Recommender System
|pdfUrl=https://ceur-ws.org/Vol-1438/paper9.pdf
|volume=Vol-1438
|dblpUrl=https://dblp.org/rec/conf/recsys/ElahiGRFBM15
}}
==Interaction Design in a Mobile Food Recommender System==
<pdf width="1500px">https://ceur-ws.org/Vol-1438/paper9.pdf</pdf>
<pre>
Interaction Design in a Mobile Food Recommender System

                   Mehdi Elahi                           Mouzhi Ge                      Francesco Ricci
            Politecnico di Milano, Italy              Free University of                 Free University of
            mehdi.elahi@polimi.it                    Bozen-Bolzano, Italy               Bozen-Bolzano, Italy
                                                   mouzhi.ge@unibz.it                    fricci@unibz.it
                   Ignacio                         Shlomo Berkovsky                      Massimo David
              Fernández-Tobías                         CSIRO, Australia                  Free University of
           Universidad Autónoma de shlomo.berkovsky@csiro.au    Bozen-Bolzano, Italy
                Madrid, Spain                               david.massimo@stud-inf.unibz.it
         ignacio.fernandezt@uam.es

ABSTRACT                                                         18, 20, 4, 11]. There is a broad spectrum of available in-
One of the most important steps in building a recommender        formation about food, such as recipe data and cooking in-
system is the interaction design process, which defines how      structions. Thus, some applications and websites already
the recommender system interacts with a user. It also shapes     provide support functions allowing users to browse recipes
the experience the user gets, from the point she registers       and related information. However, most applications only
and provides her preferences to the system, to the point         offer generic and non-personalized recipe catalogue browsing
she receives recommendations generated by the system. A          support, without tailoring it to the tastes and preferences of
proper interaction design may improve user experience and        individual users.
hence may result in higher usability of the system, as well         User preference elicitation is a fundamental and necessary
as, in higher satisfaction.                                      step to go beyond this generic support and generate person-
   In this paper, we focus on the interaction design of a mo-    alized recipe recommendations. More importantly than in
bile food recommender system that, through a novel interac-      other application domains, such as movies or books, recipe
tion process, elicits users’ long-term and short-term prefer-    recommendations should not only be based on user’s long-
ences for recipes. User’s long-term preferences are captured     term tastes, but also fit their ephemeral preferences, such as
by asking the user to rate and tag familiar recipes, while for   the available ingredients or current cooking constraints.
collecting the short-term preferences, the user is asked to         In this paper we address this problem by proposing a pref-
select the ingredients she would like to include in the recipe   erence elicitation approach for food recommender systems
to be prepared. Based on the combined exploitation of both       that obtains user preferences through a novel and effective
types of preferences, a set of personalized recommendations      interaction design. First, it exploits an integrated Active
is generated. We conducted a user study measuring the us-        Learning algorithm [5, 6] for selecting the recipes to rate
ability of the proposed interaction. The results of the study    and tag that are estimated to be the most useful for the rec-
show that the majority of users rates the quality of the rec-    ommender. The active learning algorithm scores a recipe ac-
ommendations high and the system achieves usability scores       cording to its predicted its rating (using transformed matrix
above the standard benchmark.                                    of user-recipe) and then selects the highest scoring recipes.
                                                                 This reveals the users’ long-term preferences, i.e., what they
                                                                 usually like to eat or cook. Second, when requested to gener-
1.   INTRODUCTION                                                ate recommendations, the system acquires short-term pref-
   Recommender systems are decision support tools that proac-    erences referring to ingredients the user wants to cook or to
tively identify and suggest items, which are expected to be      include in the meal. The acquired preferences are used by a
interesting for the users. Recommendations are based on          Matrix Factorization (MF) rating prediction model designed
the users’ previous interactions with the system and the ex-     to take into account both tags and ratings [11, 13, 7].
plicitly provided users’ preferences [15]. One important and        In a real user study, we evaluated the proposed prefer-
new application domain for recommender systems is food.          ence elicitation interaction and observed that the users have
This application has recently drawn much attention in the        scored the usability of the system between “good” and “ex-
research community due to its potential to improve eating        cellent” and assessed the presented recommendations, which
behaviour of users and positively influencing their lives [8,    are generated on the basis of the elicited preferences, to be
                                                                 of high quality.
                                                                    Thus, the main contributions of our paper are: (a) a novel
                                                                 interaction design that is used to elicit long-term (general)
                                                                 and short-term (session-based) user preferences; and (b) an
                                                                 effective preference elicitation method that exploits active
                                                                 learning in the food recommendation domain.


.
     Figure 1: (a) user instructions, (b) browsing food categories, and (c) selecting eaten or cooked recipes.


2.    RELATED WORK                                                building a real-world recommender system. To address this
  Several recommender systems for the food domain have            limitation, this paper focuses on the interaction design, mainly
recently been developed [9, 18, 19, 20]. For example, Freyne      for the preference elicitation: long-term and session-based.
and Berkovsky [9] proposed a food recommender that, through
an easy-to-use interface, elicits user preferences and provides   3.   USER-RECOMMENDER INTERACTION
personalized recommendations Their system transferred the
recipe ratings collected by the system to ingredient ratings         We designed a complete human-computer interaction for
and then aggregated the ratings of the ingredients used in a      collecting user preferences, in the form of recipe ratings and
recipe to generate rating predictions.                            tags [4]. An Android-based prototype was developed, in or-
  Elahi et al. [4] proposed a food recommendation model           der to implement this interaction. The first step is a general
that combines the predicted value of a recipe along differ-       preference elicitation, aimed at collecting the long-term (sta-
ent dimensions (user food preferences, nutritional indicators,    ble) user preferences, i.e., what she generally likes to cook
and ingredients costs) to compute a single utility measure        (or eat). This step includes two stages: (1) the system asks
of a recipe. The goal is to consider factors influencing the      the user to specify the recipes she cooks at home and, (2) the
user’s food decisions in order to produce more useful and         user assigns ratings and tags to the recipes she experienced.
valuable recommendations. In a follow-up work [11], the au-          Upon logging in the system, the user can browse the full
thors conducted an offline evaluation of the rating prediction    catalogue of recipes and mark those that she has eaten be-
algorithm, which extends MF by using, in addition to rat-         fore (see Figure 1). Users can navigate through the recipe
ings, the users’ tags assigned to recipes. It was shown that      categories and sub-categories in order to find the desired
this additional source of information about the user pref-        recipe, e.g., ‘Beef’ → ‘Roasted Beef’ → ‘Roasted Beef with
erences allowed the proposed method to outperform other           Salad’. Inside each category there is a list of recipes mapped
state-of-the-art algorithms, e.g., those proposed in [10].        to this category. When the user finds one of them she can
  In general, the user’s preferences that are collected and       mark it as ‘Eaten’or ‘Cooked’ by clicking the check box.
used by a recommender can be either long-term (general               After that, a selection of the recipes that the user marked
preferences) or short-term (session-based and ephemeral).         as eaten or cooked, is presented to the user for rating and
While obtaining both preference types is crucial, many rec-       tagging. This allows the system to acquire knowledge about
ommender systems do not distinguish between the two. In           the general user preferences. However, the system also needs
fact, there are few studies that taken this consideration into    to deeper explore the user’s preferences and it presents addi-
account. Ricci and Nguyen proposed in [14] a mobile rec-          tional recipes for the user to rate and tag. These are found
ommender system in travel domain, which elicits both gen-         by predicting what the user might have eaten, but did not
eral long-term preferences (e.g., explicitly defined by users)    mark in the first step. In order to find such recipes, we use
and short-term preferences in the form of critiques express-      active learning. For this, the rating dataset is transformed
ing more detailed session-based preferences. More recently,       into a binary format indicating only whether the user rated
short term preferences were found to depend on the rec-           an item: null entries are mapped to 0, and not null entries
ommendation context and many context-aware approaches             to 1. Then, using a factor model, predictions are computed
have been proposed to better suit the needs of the users [1].     for all the values mapped to 0, and for each user the items
  It is worth noting that RSs research often focused on the       with the highest prediction are shown to the user [5, 6].
improvement of the prediction model, by assuming that the            Figure 2-a shows the rating and tagging interface. This
preference elicitation process is completed. Hence, they          interface uses the classical 5-star Likert scale. The users
ignore the complete user-system interaction, required for         are also requested to “explain” the core motivations for their
                                                                  ratings by assigning tags to recipes. Users can either tag a
Figure 2: (a) general preference elicitation, (b) session-based preference elicitation, and (c) recommendation.


recipe with the suggested tags or add their own tags. At the
recommendation time, session-specific preference are elicited
(see Figure 2-b). The user enters the core ingredient she
wants to include in the recipe. This is done by selecting
a keyword from the list of suggestions derived from food
ingredients and popular tags assigned by other users.
   Then, the recommendations leverage both types of the col-
lected user preferences, long-term and session-specific. The
long term preferences are exploited by a custom MF rating
prediction model [13], which uses the tagging information
                                                                                    Figure 3: SUS results.
[7]. Each user is associated with a vector that models her
latent features and each recipe is modeled by a vector that
contains its latent features. Then, the rating of a user for       mark value is 68, which is the average SUS score computed
an item is predicted by computing the inner product of the         over 500 usability studies [16].
user and item vectors. To exploit the short-term model,               In our experiment 20 subjects used the system and com-
the system post-filters the recommendations according to           pleted the questionnaire. They were either computer science
the current user preferences. The recipes with the high-           researchers or non-academic people. 60% of subjects were
est rating are presented to the user one by one. When the          male and 40% were female, the age range was 23 to 50, and
user selects a recommended recipe, the system presents the         the ethnical background varied across the subjects (Italy,
required ingredients and detailed cooking instructions (see        France, USA, Germany, China, and more).
Figure 2-c).                                                          We first present the perceived recommendation quality re-
                                                                   sults. The survey measures the recommendation quality us-
4.   USER STUDY                                                    ing 7 questions on a Likert scale from 0 to 4, where 4 is
                                                                   the highest score. Thus, the maximum overall quality score
   The main goal of the evaluation was to assess whether the
                                                                   is 28. The average perceived recommendation quality score
system can effectively assist users in finding recipes that suit
                                                                   across the 20 subjects was 19 and the median was 19 (see
their preferences. For the user study, we designed a usage
                                                                   [3] for more details on the calculation). We observed that
scenario a task that was formulated as follows: “You want to
                                                                   the maximal recommendation quality score was 26, and the
avoid everyday routine meals. You can use this application
                                                                   minimal was 12. Thus, we can conclude that, on average,
to discover new recipes that suit your taste”.
                                                                   the users agreed that the recommendations were well-chosen
   The users were asked to use the mobile application and
                                                                   and suited their preferences.
complete a questionnaire referring to two performance in-
                                                                      For the SUS usability score, we observed that for 75% of
dicators: perceived quality of recommendations quality and
                                                                   subjects the SUS score was higher than the 68 point bench-
system usability. The first part of the questionnaire mea-
                                                                   mark (see Figure 3). The system achieved overall average
sured the level of user satisfaction with the recommenda-
                                                                   SUS score of 75.50 and the median was 73.75, which is well
tions. We used a validated instrument based on a set of
                                                                   above the benchmark. We observed that the minimal us-
questions developed by Knijnenburg et al. [12]. The second
                                                                   ability score of 55, and the maximal was 95. According to
part of the questionnaire aimed at collecting the users’ im-
                                                                   these results we can conclude that the system usability was
pression of the usability of the system. Here, we exploited
                                                                   considered between “good” and “excellent” [2].
the System Usability Scale (SUS) questionnaire [17]. The
overall usability scores range from 0 to 100 and the bench-
   We have computed the average replies for all the SUS             [6] M. Elahi, F. Ricci, and N. Rubens. Active learning in
statements and observing the statements with the highest                collaborative filtering recommender systems. In
average values, we can report that the users have evaluated             E-Commerce and Web Technologies, pages 113–124.
the system easy to learn and easy to use. They also believe             Springer International Publishing, 2014.
that various components were well-integrated into the sys-          [7] I. Fernández-Tobı́as and I. Cantador. Exploiting social
tem. On the other hand, by observing the statements with                tags in matrix factorization models for cross-domain
the lowest values we can state that users think that they               collaborative filtering. In Proceedings of the 1st
have to learn a lot before they can use the system properly             Workshop on New Trends in Content-based
and they may need technical person for that. Our explana-               Recommender Systems, Foster City, California, USA,
tion for this result is that we need to improve further the             pages 34–41, 2014.
interface and provide more explanations, so that users can          [8] J. Freyne and S. Berkovsky. Intelligent food planning:
better learn and understand the usage of the components in              personalized recipe recommendation. In IUI, pages
the system.                                                             321–324. ACM, 2010.
                                                                    [9] J. Freyne and S. Berkovsky. Intelligent food planning:
5.   CONCLUSION AND FUTURE WORK                                         personalized recipe recommendation. In IUI, pages
   In this paper, we illustrated the preference elicitation pro-        321–324. ACM, 2010.
cess of a novel food recommender system [11]. Our system           [10] J. Freyne and S. Berkovsky. Evaluating recommender
generates recommendations by exploiting tags and ratings                systems for supportive technologies. In User Modeling
in a MF algorithm. In our study, we collected user evalu-               and Adaptation for Daily Routines, pages 195–217.
ations of the recommendation quality and system usability.              Springer, 2013.
Both measurements were found to be positive. This means            [11] M. Ge, M. Elahi, I. Fernaández-Tobı́as, F. Ricci, and
that the proposed preference elicitation process and system             D. Massimo. Using tags and latent factors in a food
interaction are liked by users.                                         recommender system. In Proceedings of the 5th
   Considering that this is a preliminary study, this paper             International Conference on Digital Health 2015,
has several limitations. First, the evaluation is performed             pages 105–112. ACM, 2015.
on the whole system rather than on preference elicitation.         [12] B. P. Knijnenburg, M. C. Willemsen, Z. Gantner,
Since the prediction model was already tested in another                H. Soncu, and C. Newell. Explaining the user
study [11], this work mostly focuses on preference elicita-             experience of recommender systems. User Modeling
tion as the main component of user interaction. Second,                 and User-Adapted Interaction, 22(4-5):441–504, 2012.
we have not compared our system with alternative prefer-           [13] Y. Koren and R. Bell. Advances in collaborative
ence elicitation processes. Our current result mostly reflects          filtering. In F. Ricci, L. Rokach, B. Shapira, and
the users’ direct perception of their interaction with the sys-         P. Kantor, editors, Recommender Systems Handbook,
tem. Third, we admit the limited number of subjects in the              pages 145–186. Springer Verlag, 2011.
user study. In the future, we plan to increase the number          [14] F. Ricci and Q. N. Nguyen. Acquiring and revising
of participants in the study. Also, we plan to extend the               preferences in a critique-based mobile recommender
recommendation model by considering nutritional factors,                system. Intelligent Systems, IEEE, 22(3):22–29, 2007.
e.g., the required calories and proteins, in order to build a      [15] F. Ricci, L. Rokach, and B. Shapira. Introduction to
health-aware recommender system.                                        recommender systems handbook. In F. Ricci,
                                                                        L. Rokach, B. Shapira, and P. Kantor, editors,
6.   REFERENCES                                                         Recommender Systems Handbook, pages 1–35.
 [1] G. Adomavicius and A. Tuzhilin. Context-aware                      Springer Verlag, 2011.
     recommender systems. In Recommender systems                   [16] J. Sauro. Measuring usability with the system
     handbook, pages 217–253. Springer, 2011.                           usability scale (sus).
 [2] A. Bangor, P. Kortum, and J. Miller. Determining                   http://www.measuringusability.com/sus.php.
     what individual sus scores mean: Adding an adjective               Accessed: 2013-01-15.
     rating scale. Journal of usability studies, 4(3), 2009.       [17] J. Swarbrooke and S. Horner. Consumer behaviour in
 [3] M. Braunhofer, M. Elahi, F. Ricci, and T. Schievenin.              tourism. Routledge, 2007.
     Context-aware points of interest suggestion with              [18] C.-Y. Teng, Y.-R. Lin, and L. A. Adamic. Recipe
     dynamic weather data management. In Information                    recommendation using ingredient networks. In
     and Communication Technologies in Tourism 2014,                    Proceedings of the 4th Annual ACM Web Science
     pages 87–100. Springer International Publishing, 2014.             Conference, pages 298–307. ACM, 2012.
 [4] M. Elahi, M. Ge, F. Ricci, D. Massimo, and                    [19] M. Trevisiol, L. Chiarandini, and R. Baeza-Yates.
     S. Berkovsky. Interactive food recommendation for                  Buon appetito: recommending personalized menus. In
     groups. In Poster Proceedings of the 8th ACM                       Proceedings of the 25th ACM conference on Hypertext
     Conference on Recommender Systems, RecSys 2014,                    and social media, pages 327–329. ACM, 2014.
     Foster City, Silicon Valley, CA, USA, October 6-10,           [20] R. West, R. W. White, and E. Horvitz. From cookies
     2014. 2014.                                                        to cooks: Insights on dietary patterns via analysis of
 [5] M. Elahi, F. Ricci, and N. Rubens. Active learning                 web usage logs. In Proceedings of the 22nd
     strategies for rating elicitation in collaborative                 international conference on World Wide Web, pages
     filtering: A system-wide perspective. ACM                          1399–1410. International World Wide Web
     Transactions on Intelligent Systems and Technology                 Conferences Steering Committee, 2013.
     (TIST), 5(1):13, 2013.

</pre>