=Paper=
{{Paper
|id=Vol-1438/paper7
|storemode=property
|title=An Adaptive Electronic Menu System for Restaurants
|pdfUrl=https://ceur-ws.org/Vol-1438/paper7.pdf
|volume=Vol-1438
|dblpUrl=https://dblp.org/rec/conf/recsys/FilhoW15
}}
==An Adaptive Electronic Menu System for Restaurants==
An Adaptive Electronic Menu System for Restaurants Paulo Henrique Azevedo Filho Wolfgang Wörndl Aberklar GbR Technische Universität München Helene-Mayer-Ring 7 Boltzmannstraße 3 80809, Munich, Germany 85748, Garching bei München, Germany paulo.azevedo@aberklar.com woerndl@informatik.tu-muenchen.de ABSTRACT restaurant to restaurant, the choice for a recommendation This work shows the early stages of the development of a technique has to be further tweaked to address these issues. collaborative-filtering-inspired adaptive system to stream- line the ordering process at restaurants that use electronic The present work is an attempt to embrace this opportu- menu systems. nity, building an adaptive recommender system for elec- tronic restaurant menus, to aid guests in their ordering pro- Among other results, the proposed system achieved a reduc- cess, based on MenuMate. tion of the session duration, while increasing feedback given by restaurant guests. MenuMate, developed by the German startup Aberklar1 , is an electronic menu that offers users a picture-centric ap- Categories and Subject Descriptors proach for use at restaurants. Through MenuMate users can H.4.m [Information Systems Applications]: Miscella- place orders, request the bill and provide feedback about neous their experience. This is the system that was extended with the adaptive system described in this paper. Figure 1 gives an idea of the general structure of the dish overview screen, General Terms and the arrows suggest the repositioning of dishes of the Human Factors, Economics proposed method, that will be described soon. Keywords The dish overview screen shows a title bar for each category, Recommender Systems, Adaptive, Collaborative-Filtering, followed by thumbnails of each dish in that category. From Electronic Menu, Target Variable that screen, the user can go to the dish details screen, with a full screen picture and detailed description, from where the 1. INTRODUCTION dish can be ordered. After orders have been placed and the There is an increasing trend in the number of tablet-based tab asked for, the user is invited to provide feedback about systems for ordering food, in order to streamline the ser- a few different variables, such as food, drinks, service and vice, increasing efficiency and profit. This is a great oppor- electronic menu system, using a star-based rating. tunity to make a change: instead of mere replacements for the paper menu and replacement for some of the roles of The rest of this document is organized as follows: Section 2 the waiters, such systems can do much more, such as be- will put this work in perspective in terms of the electronic coming feedback-gathering devices, or helping guests choose menu used for its implementation, as well as of the related their dishes better and increasing their satisfaction by offer- work. Section 3 will describe in general terms the proposed ing meaningful suggestions. algorithm and how it fits in. Section 4 describes some of the performed experiments and displays their results. Finally, Those meaningful suggestions may stem from collaborative section 5 offers a brief summary as well as points some pos- filtering. Well-established algorithms for collaborative filter- sibilities of further work to be explored. ing exist, as well as several hybridization techniques blend- ing this with other approaches. However, since users of 2. BACKGROUND AND RELATED WORK electronic menus are mostly anonymous (no log-in neces- To achieve the aforementioned goal, once the current state sary), and the offer of dishes and drinks varies greatly from of MenuMate is known, it is necessary to know what the current state of the art for this specific niche is. Wasinger et al. have proposed an electronic menu with an embedded recommender system, called Menu Mentor [9]. The authors come from a perspective of highly personalized explicit recommendations, processed on the user’s phone, and that must be scrutable, that is, allow the users to know why a given recommendation (be it positive or negative) was RecSys Vienna, Austria, 16th-20th September 2015 1 www.aberklar.com/en Four correlation coefficients were tested: Spearman’s [4], Pearson’s [5], Goodman and Kruskal’s [2] and Kendall’s [3]. In fact, the experiments tried all of them in different sessions and made a comparison between them. More coefficients could be used, the only requirement is that they must yield values between -1 and 1, which could imply a normalization step for coefficients that do not yield results in this range. In order to vary the position of dishes, there is what we called pre-optimization randomization, which will shuffle the dishes before sorting them, giving the chance to dishes that have the same coefficient (within a small delta) to change places, changing not only absolute but also relative order. A bias could, otherwise, arise from the fact that stable sort- ing algorithms were used, thus keeping items with a simi- Figure 1: Screen of MenuMate with the overview of lar correlation coefficient always in the same relative order, the dishes, as well as a suggestion of the dynamic preventing them from switching places and moving far away reordering of the menu items. from each other. Also, dishes for which there is no previous information always start with coefficient 0. given, and give them the chance to override it. It does, how- The basic principle is as follows: after each session, the ever, assume explicit user profiles, acquired through usage of tablets send the raw data gathered to the system that gen- the system on the user’s smartphone. In order to minimize erates the sequence (there is one such system per restau- user setup, restaurants should be able to offer the system rant). This system, based on the chosen coefficient and tar- and the hardware. get variable, calculates the value for that variable and the coefficients, generates the menu sequence, and at the be- There are psychological studies about how to organize a ginning of each session the tablets poll this system for the restaurant menu, with techniques such as placing items with latest sequence to be employed. This system, called Menu high prices first to ”smoothen” the effect of the lower-priced Optimizer, is configurable in respect to the target variable, items following, even if they are not actually cheap [6]. There action (maximize or minimize) and correlation coefficient, are also studies evaluating guests’ gaze and how it corre- together with other parameters relevant for the A/B testing lates to which dishes get ordered, placing dishes restaura- performed, that will be explained in the next section. teurs want to have ordered in those positions, such as shown at [1]. There are as well other possibilities that go in a sim- ilar direction, such as [8] and [10]. Going for this sort of 4. PRELIMINARY USER STUDIES psychological study, however, would require extensive trials Since the method proposed does not explicitly recommend after every change to the menu, as well as the usage of the a single item or try to predict ratings, some methods usu- same menu in a different place where cultural background ally employed to evaluate recommender systems cannot be changes. This type of approach is also sensitive to the direc- employed, such as cross-validation, recall and precision and tion of reading of the mother tongue of the restaurant guest, accuracy. such as right to left in case of Arabic speakers. There are, however, other methods that can be employed The goal of this work was to develop an electronic menu sys- directly: the test was a double-blind A/B test, in which nei- tem that adapts itself according to a certain target variable, ther the user nor the waiter knew which the target variable such as session duration, but while avoiding the need both at the time was, nor the correlation coefficient used. It was for the setup of a user profile and extensive trials after each also online (in the sense that real users were using the sys- change. This is what will be presented in the next section. tem, generally spending real money through it [7], as was the case in our tests), and measured a few variables. 3. ALGORITHM AND ARCHITECTURE Since there is no way to directly measure accuracy for this The proposed idea is rather simple. It consists in reordering system, there is employment of efficiency. In traditional rec- the menu items to optimize a quantifiable target variable, ommender systems, the goal is to try to predict what the that can be either maximized or minimized. For this work, user would like to find and show it to them, shortening the four target variables were studied: tab value, session dura- search. Assuming that users find more easily the informa- tion, revenue rate (cents per minute) and feedback rating. tion they are looking for, they will more promptly take deci- Once this definition is set, as different menu sequences are sions based on that information, namely order the dish they used the value of each target variable is recorded for each intend to. Assuming this to be true, it would derive that session, as well as the position of each dish in the menu. a reduction of session duration would imply increased ac- The optimal menu sequence is computed by calculating the curacy in a way. To assess whether this is true, it should correlation coefficient between each dish’s position and the be coupled with an increase of the feedback ratings, which performance, and sorting ascending or descending, depend- would show that the items found satisfied the user. This ing on whether the action is to maximize or minimize, re- metric, that as mentioned before, is called efficiency, and spectively. was used to assess the system. Figure 2: The evolution of feedback ratings and ses- Figure 3: How the values of the correlation coeffi- sion duration at one of the restaurants, that suggests cients evolved over time at the two restaurants. good efficiency. back at El Patio was too low to assume it was meaningful). Another adaptation made was with regards to serendipity. It suggests that indeed over time, independent of those be- Instead of measuring it, there was the measurement of cat- ing the target variables, feedback tends to rise and session alog coverage: the measurement of how much users tend duration to reduce, indicating good efficiency. to order varied dishes, compared to users of plain paper menu. For that, observations were made of tables whose Table 1 shows how the choice of a target variable influences guests did not use MenuMate, and which dishes they or- the performance of that variable, as well as of other vari- dered, and those were compared to the orders of MenuMate ables. In this table only sessions optimized with Spearman’s users. correlation coefficient are used, because it was the only co- efficient with a high enough number of observations from There were two classes of tests performed, the stress tests which to derive conclusions (21 sessions in total). The ”none” and the user tests. The stress tests, which will not be dis- line, represents results for a non-optimized menu. The sys- played in detail here, were used to assess the scalability of tem did well in optimizing for the target variable, with the the system, which was developed to be run on a Raspberry best results for both feedback and session duration, a close Pi computer, with an embedded 900MHz ARM processor. It second for tab value, while not delivering good results for suffices to say that the system could serve between 100 and revenue rate, because the higher tab value was not enough 160 tablets with unnoticeable performance losses, and that to compensate for the roughly halved session duration time the limits reached were due to the test rig employed, rather achieved when the target variable was the duration. It is than the system itself. Another interesting result on this also worth noticing that indeed, revenue rate is a direct con- front is that, on the employed hardware, menu optimization sequence of both tab value and session duration, but the av- time is increased on average by 3ms for each session stored, erage revenue rate is not necessarily the same as the revenue which allows prediction of the optimization time based on rate of the averaged tab values and durations. For clarity, the size of the history of observations. the best results for each variable are displayed in boldface. The preliminary user studies were performed at two differ- Figure 3 shows another facet of the inner workings of the ent restaurants in Munich, for about a week in each, time system: although the number of observed sessions was low, during which the system would gather session data and au- it is very fast to converge the internal coefficients, suggesting tomatically adapt itself, switching the target variable at reg- some stability after approximately 10 sessions. Another in- ular intervals. The number of observations was rather small teresting effect is that the absolute values of the coefficients due to the reduced number of days allowed for observations. tend to be higher for dishes that ended up being ordered, There were 13 sessions observed in one restaurant, called El which means the system can in fact predict which dishes Patio, and 35 in the other, called Wendlinger. The restau- will be ordered, by checking the dishes with highest abso- rants had, respectively, 201 and 290 menu items, split in 24 lute value of the coefficients. and 34 categories. The original menu sequence was devised by their respective owners. Due to operational constraints, catalog coverage was only measured at El Patio. There, 12 of the 13 sessions were Figure 2 shows the evolution over time of the efficiency at with menu optimization enabled, that will be considered for Wendlinger (the number of sessions to which users gave feed- this measurement. 50 sessions with paper menus were also Target Variable Feedback Value (e) Duration Rev. Rate None 2,0 25,17 12.865,6 20,9 Feedback 5,0 13,25 15.779,9 33,8 Tab Value 4,4 38,00 7.499,5 59,1 Session Duration 4,3 36,85 2.406,9 154,9 Revenue Rate 5,0 40,10 4.400,6 56,6 Table 1: Cross-references between target variables and the results for all interest variables. observed. In total, 97 different dishes and drinks were or- • Feature extraction, to allow dishes of different restau- dered, in different quantities. From those 97, 19 were or- rants to be matched and correlation information ex- dered both with and without MenuMate, 63 only by users change, possibly improving results; of the paper menu and 15 only by users of MenuMate with menu optimization. • Extra variable isolation, that would allow control over influential external variables such as weather, time of It is not easy to extrapolate how those proportions would be the day or season; in case an equal number of observations was available, and • If individual user profiling is done, further personaliza- discarding paper menu-based sessions could arbitrarily lead tion, such as filtering out dishes based on allergies or to any results. Assuming, however, that for both systems at taste preferences, could be done; each session there is an equal probability of adding not pre- viously ordered items, and that this probability is inherent • Stochastic exploration of dishes, which could allow for to either MenuMate or the paper menu, a proportion rule recommendation of the next course based on previ- may be followed. ously ordered dishes in a session. In the case of the paper menu, 19 + 63 = 82 different items 6. REFERENCES were ordered in 50 sessions, which yields an average of 1,64 [1] J.-G. Choi, B.-W. Lee, and J.-w. Mok. An experiment new items per session. With MenuMate in use, there were on psychological gaze motion: a re-examination of 19 + 15 = 34 different items ordered in 12 sessions, resulting item selection behavior of restaurant customers. in 2,83 new items per session. That may indicate that thus, Journal of Global Business and Technology, 6(1):68, the menu optimizations led to guests ordering a bigger vari- 2010. ety of items. Alternatively, it could be that as more sessions [2] L. A. Goodman and W. H. Kruskal. Measures of would be measured, these sessions would progressively get association for cross classifications iii: Approximate less ”innovative”, in terms of the ordered dishes, which could sampling theory. Journal of the American Statistical revert this balance. An extended evaluation, with a similar Association, 58(302):310–364, 1963. number of sessions in both conditions would help settle this [3] M. G. Kendall. A new measure of rank correlation. matter. Biometrika, pages 81–93, 1938. [4] J. L. Myers, A. Well, and R. F. Lorch. Research design 5. CONCLUSIONS AND FUTURE WORK and statistical analysis. Routledge, 2010. The proposed menu optimizer harnesses principles of rec- [5] K. Pearson. Notes on regression and inheritance in the ommender systems to improve restaurant menus according case of two parents. In Proceedings of the Royal to an arbitrary interest variable. In fact, it can be used Society of London, volume 58, pages 240–242, 1895. to facilitate user interaction for any system to which access [6] W. Poundstone. Priceless: The myth of fair value is anonymous and the number of items not overwhelming. (and how to take advantage of it). Macmillan, 2010. Another use that comes to mind is the choice of which pre- [7] G. Shani and A. Gunawardana. Evaluating sentations to attend at a conference, or main sights to visit recommendation systems. In Recommender systems in a city. handbook, pages 257–297. Springer, 2011. One of the main contributions is the strong suggestion that [8] B. Wansink, J. Painter, and K. Van Ittersum. this purely statistical method may improve the menu even if Descriptive menu labels’ effect on sales. The Cornell the underlying mechanism that drives the change is not un- Hotel and Restaurant Administration Quarterly, derstood by the system (i.e. the psychological implications). 42(6):68–72, 2001. [9] R. Wasinger, J. Wallbank, L. Pizzato, J. Kay, The developed system comprises the algorithm for menu op- B. Kummerfeld, M. Böhmer, and A. Krüger. Scrutable timization, coupled with a robust and scalable implemen- user models and personalised item recommendation in tation of it. It was followed by qualitative and quantita- mobile lifestyle applications. In User Modeling, tive tests, with real paying users, of which only very few Adaptation, and Personalization, pages 77–88. results were presented this time due to page number con- Springer, 2013. straints, but that nevertheless suggest efficacy of the pro- [10] S. S. Yang, M. M. Sessarego, et al. $ or dollars: Effects posed method. of menu-price formats on restaurant checks. Cornell Hospitality Reports, pages 6–11, 2009. There are, however, some promising improvements to the method. Among which, the highlights are: