=Paper= {{Paper |id=Vol-3294/long1 |storemode=property |title=Nudging Towards Health in a Conversational Food Recommender System Using Multi-Modal Interactions and Nutrition Labels |pdfUrl=https://ceur-ws.org/Vol-3294/long1.pdf |volume=Vol-3294 |authors=Giovanni Castiglia,Ayoub El Majjodi,Federica Calò,Yashar Deldjoo,Fedelucio Narducci,Alain Starke,Christoph Trattner |dblpUrl=https://dblp.org/rec/conf/recsys/CastigliaMCDNST22 }} ==Nudging Towards Health in a Conversational Food Recommender System Using Multi-Modal Interactions and Nutrition Labels== https://ceur-ws.org/Vol-3294/long1.pdf

Nudging Towards Health in a Conversational Food
Recommender System Using Multi-Modal Interactions and
Nutrition Labels
Giovanni Castiglia1 , Ayoub El Majjodi2 , Federica Calò1 , Yashar Deldjoo1 , Fedelucio Narducci1 ,
Alain Starke2,3 and Christoph Trattner2
1
Polytechnic University of Bari, Bari, Italy
2
Department of information science and media studies, University of Bergen, Bergen, Norway
3
Marketing and Consumer Behaviour Group, Wageningen University & Research, Wageningen, The Netherlands

Abstract
Humans engage with other humans and their surroundings through various modalities, most notably speech, sight, and
touch. In a conversation, all these inputs provide an overview of how another person is feeling. When translating these
modalities to a digital context, most of them are unfortunately lost. The majority of existing conversational recommender
systems (CRSs) rely solely on natural language or basic click-based interactions.
This work is one of the first studies to examine the influence of multi-modal interactions in a conversational food
recommender system. In particular, we examined the effect of three distinct interaction modalities: pure textual, multi-modal
(text plus visuals), and multi-modal supplemented with nutritional labeling. We conducted a user study (𝑁=195) to evaluate
the three interaction modalities in terms of how effectively they supported users in selecting healthier foods. Structural
equation modelling revealed that users engaged more extensively with the multi-modal system that was annotated with
labels, compared to the system with a single modality, and in turn evaluated it as more effective.

Keywords
Personalization, Health, Food recommendation, Digital Nudges, Nutrition labels

1. Introduction and Context tackling risk factors, such as attaining a healthy food
intake [8]. While our food decisions are driven by
Conversational recommender systems (CRSs) represent our overall preferences, the food selection process is
a hotly debated area of study in the field of information extremely contextual and influenced by a variety of
seeking [1, 2]. They combine the power of recommen- factors, such as the user’s mood and dietary constraints.
dation algorithms with conversational strategies. Using Moreover, many of the decisions are made spontaneously
multi-turn conversations, CRSs are able to collect users’ and consumers’ judgments are influenced by factors
nuanced and dynamic preferences in more depth, which unrelated to the food content, such as their perception
can enhance recommendation outcomes and user experi- of the food’s visual characteristics [9]. For instance,
ence. CRSs are utilized in a variety of domains, including the packaging of items with nutritional labels can
medical diagnosis [3], e-commerce [4], and entertain- serve to highlight the nutritious nature of the food
ment [5, 6]. Only a few studies have investigated their (cf. [10]). Moreover, people generally prefer food that
merit for food recommendation [7], and in particular for has a more visually appealing presentation, such as
encouraging users to make healthier food decisions. food that is presented in an attractive way [11]. People
Over 60% of all deaths are caused by non- are willing to pay extra for food whose ingredients
communicable diseases, which are preventable by are tastefully/attractively organized, and restaurants
strive to generate Instagram-friendly photographs by
4th Edition of Knowledge-aware and Conversational Recommender Sys-
enhancing the color composition of their plates.
tems (KaRS) Workshop @ RecSys 2022, September 18–23 2023, Seattle,
WA, USA. To surface effective and healthy food recommenda-
Envelope-Open g.castiglia@studenti.poliba.it (G. Castiglia); tions it is crucial to understand these underlying decision
ayoub.majjodiu@uib.no (A. E. Majjodi); f.calo8@studenti.poliba.it factors. Regrettably, the large majority of existing con-
(F. Calò); yashar.deldjoo@poliba.it (Y. Deldjoo); versational recommender systems [12, 13] only consider
fedelucio.narducci@poliba.it (F. Narducci); alain.starke@uib.no
a single type of interaction, such as natural language or
(A. Starke); christoph.trattner@uib.no (C. Trattner)
GLOBE https://www.christophtrattner.info/ (C. Trattner) click-based interaction, thereby neglecting a wealth of
Orcid 0000-0002-7478-5811 (A. E. Majjodi); 0000-0002-6767-358X information in the actual imaging of meals [14]. The goal
(Y. Deldjoo); 0000-0002-9255-3256 (F. Narducci); of the present work at hand is to employ a new conver-
0000-0002-9873-8016 (A. Starke); 0000-0002-1193-0508 (C. Trattner) sational model for food recommendation that permits
© 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License
Attribution 4.0 International (CC BY 4.0). more natural, multi-modal user-system interaction.
CEUR
Workshop
Proceedings
http://ceur-ws.org
ISSN 1613-0073
CEUR Workshop Proceedings (CEUR-WS.org)
To attain this goal, this paper introduces a multi- presentation time, healthiness of recipes chosen and a
modal conversational food recommender system (MM- user’s level of choice satisfaction and experienced system
CFRS). It implements different user-system interaction effectiveness.
modes, along with nutrition labelling in order to assist the
user in making dietary decisions. Our objective is to ex-
amine the effects of three distinct interaction modes: pure 2. System Design
textual, multi-modal (text plus visuals), and multi-modal
In this section we describe the features of our conversa-
supplemented with nutritional labeling. While multi-
tional food recommender system, which supports users
modal conversational information seeking (MMCIS) is
in making healthier choices.1
gaining attention by the research in the RecSys/IR/HCI
We designed a system-driven conversation in which the
communities [15, 1, 16], only a few practical studies have
system requires user feedback (response/input) to con-
been published that focus on topics other than food and
tinue. The main steps of the conversational flow are
health, such as conservational systems on tourism [17]
shown in Figure 1. Users can interact with the system us-
and fashion [18, 19]. In the field of food recommendation,
ing both buttons and textual messages2 . The main steps
Elsweiler et al. [20] provide a good frame of reference for
of the interaction are reported below:
recent advances in the field of food recommender sys-
tems in general. Specifically for conversational systems, • Food category acquisition: The user was presented
Barko-Sherif et al. [21] investigate the possibility for con- with a choice of four different food categories
versational preference elicitation in a food recommender that were considered in this work: Pasta, Salad,
environment, using a Wizard of Oz study design (see also Dessert, and Snack.
[22]). Using a between-groups approach, they compare
• User constraints acquisition: The user was then
spoken and text-input chat interfaces and reported that
prompted to indicate any potential dietary con-
such interfaces are useful for users to describe their needs
straints. Initially, the system used an interface
and preferences. In other studies, Samagaio et al. [23]
with a single checkbox for each of the most preva-
present a RASA-based chatbot that can recognize and cat-
lent intolerances and allergies: Lactose, Meat, Al-
egorize user intentions in the conversation aimed to elicit
cohol, Seafood, Reflux, Cholesterol, Diabetes. Af-
food preferences for recommendation purposes. Another
terwards, the system asked the user to disclose a
study of Samagaio et al. [24] applies more knowledge-
list of ingredients she could not consume.
based elements based on word embedding to optimize
• Preference elicitation: According to the con-
conversational ingredient retrieval. These studies, how-
straints specified by the user, the user was
ever, focus less on aspects pertaining to health, health la-
prompted to submit preferences for five of the
belling, or elicitation modalities. In a non-conversational
dishes proposed by the system. Each dish was
recommender context, El Majjodi et al. [25] recently in-
accompanied with two buttons: “Like” and “Skip”.
dicated that nutritional labels can reduce user’s choice
The skip option was provided to encourage users
difficulty in non-conversational context. The primary
to inspect an addition dish, which was retrieved
distinction between our work and previous studies is the
from the randomly sorted menu. The retrieval
lack of multiple modalities (typically only text is used),
was based on a random active learning strategy.
as well as that only a few studies (e.g., [25]) have used
This way, users were encouraged to like five
nutrition labelling.
dishes they were interested in, after which the
To summarize, the goal of this study is to compare the
user profile was built by the system.
impact of three user-system interaction and explanation
modalities (textual, multi-modal, and multi-modal with • Processing: The system constructed the user pro-
nutritional labels) on both behavioral aspects (what type file by analyzing the user’s five preferences from
of recipe is chosen? How healthy is that recipe?) and the previous stage. The cosine similarity was
evaluation aspects (how does the user evaluate the sys- computed between the user profile and each of
tem or their chosen recipe?). Using a mediation analysis the available foods in the catalog, to provide a list
(structural equation modelling), we answer the following of dishes from which recommendations would
research question: be selected. The algorithm also provided a list
of dishes ranked according to their healthiness
• RQ: To what extent do different interaction modal- (based on their FSA score; see Section 3).
ities affect a user’s recipe choices and evaluation
in a conversational food recommendation sce-
nario? 1
Code and recipe data used for implementing the chatbot are avail-
able at https://github.com/giocast/MMCFRS
To address this question, we consider different dimen- 2
A video demo of the three versions of our system is available at
sions of analysis. This includes system interaction length, https://tinyurl.com/mtzxr2sw
Figure 1: Our conversational recommender system flow.

For each food category we built a matrix con- Table 1
taining the TF-IDF representation (dish vs. ingre- Differences between three implementations of the system.
dient) of dishes in the catalog. The higher the
Interaction Mode I E
TF-IDF score, the greater the ingredient’s signifi-
Pure text (T) T T
cance to this dish (as opposed to other dishes).
Multi-modal (MM) MM T
• Recommendation and explanation: The system Multi-modal with labels (MM-Label) MM MM
provided two personalized recommendations,
based on the user’s preferences. The system con-
strained the retrieval to ensure that the two op- manner by displaying the name and image of each dish
tions differed in terms of healthiness, so that one throughout the dialogue. However, the supplied explana-
option was healthier than the other. Thus, the tion remains textual. For the first dish, the explanation
algorithm provided a description of the suggested can be like ”I recommend these dish because I know
dishes. Specifically, it explained why the second that you have diet constraints due to: meat, zucchini.
dish was healthier than the first and why the ad- The first dish I proposed contains ingredients that you
vice was made. The user would then be prompted might like: carrot, lemon, tuna, olive oil”. For the sec-
to select one or request a new recommendation. ond recommendation, the explanation further provides
The two recommended dishes were chosen using information about macro nutrients quantities of the two
the following strategy: The first dish would be recommended dishes and can be in the form of ”The
the most similar to the user profile, while the sec- second dish I proposed has less calories (54 Kcal) than
ond dish (the healthier alternative) was selected the first one (123 Kcal) and has less fats than the first
from a list of most similar dishes ranked on their one. The third version MM-Label (MM + MM) likewise
FSA scores, selecting the healthiest one (i.e. with employs a multi-modal interaction approach, but it also
the lowest FSA score). makes use of nutritional explanations in the form of a
front-of-package nutrition label with FSA’s Multiple Traf-
Three different interaction modes were implemented
fic Lights (MTL) [25]. MTL nutrition labels depicted the
by modifying the values associated with the two manipu-
intake adequacy of a dish in terms of energy and nutri-
lated variables: interaction 𝐼 and explanation 𝐸, according
tional content, along five dimensions: energy (kcal), fat,
to Table 1.
saturates, sugars, and salt. This adequacy, per serving
In the Pure text version (T + T), the system communi-
and per 100g, was depicted using the colors green, yellow
cates with the user solely through text, displaying sim-
and red, where green indicated a dish to adhere to the
ply the dish titles and offering textual explanations of
nutritional intake guideline, while red indicated that the
the food recommendations. In the Multi-modal version
content was unacceptable. These labels were generated
(MM + T), the system engages the user in a multi-modal
Figure 2: The three implementations of the system. Some details displayed on the interface, such as the chatbot’s and authors’
names are anonymized and will be added after peer review.

for each dish by following the directives of Food Standard the recommendations, we provide the user with an expla-
Agency and UK department of health [26]. nation that helps her comprehend the health benefits of
Figure 2 depicts a snapshot of the chatbot prototype, the second alternative above the first, which is the dish
visualizing the different interaction phases. that best matches her preferences. This is accomplished
In the Textual (T) version, the user received recom- either by text (T and MM variants) or a multiple traffic
mendations identified by only the names of the dishes light nutritional label (MM-Label).
(e.g., Cupcake Princess’ Vanilla Cupcakes, Floating Island The user can accept one of the two dishes proposed or
II). The recommendations were followed by textual ex- can ask for another recommendation.
planations, based on the ingredients in the dish that the
user likes. A comparative analysis of the nutritional facts
(e.g., ‘less sugars’) would also be provided. In the Multi- 3. Experimental Evaluation
modal (MM) version, the system additionally provided
To evaluate the extent to which different versions of the
images of the recommended dishes. The explanation was
chatbot affected users’ evaluations and decisions, we re-
similar to the one presented in the T version. Finally,
cruited 195 participants from Amazon MTurk to use our
the Multi-modal with labels (MM-Label) version provided
system. Participants had to have a hit rate of 95% at least
nutritional labels that were annotated to the depicted
and were compensated with 2 dollars. On average, user
images (e.g., Sugar 2.3g, Fat 10.7g, etc.) presented with
required around 15 minutes to complete the study.3 Users
red, yellow, and/or green colors according to the FSA
3
score. As stated previously, following the presentation of The research conformed to the ethical standards of the Norwegian
Centre for Research Data (NSD). The collected data is available in
Table 2
Questionnaire items used in the confirmatory factor analysis. Alpha denotes Cronbach’s Alpha, AVE denotes the Average
Variance Explained, indicating construct validity if AVE > 0.5. Items in gray and without loading were omitted from analysis.
Choice Satisfaction did not form a sensible aspect, because of a lack of construct validity.
Aspect Item Loading
I think, I would enjoy eating the dish I have chosen in the end
Choice Satisfaction I would recommend the dish I’ve chosen in the end to others
My chosen dish could become my favorite
It was easy to make my final choice on the dish 0.737
I interacted a lot with the system before getting the dish of my choice
System Effectiveness The explanation influenced my final choice of dish
I think, that I would use this system frequently
Alpha = 0.740 I found the system easy to use and understand 0.724
AVE = 0.534 I felt very confident using the system 0.661
I would imagine that most people would learn to use this system very quickly 0.722

performed the processes outlined in Section 2, interact- tion duration was significantly longer (𝑝 < 0.05) than in
ing with our chatbot for preference elicitation, evaluating the text-based condition . This indicated that the usage
recipe recommendations, selecting one recipe, and evalu- of nutrition labels affected conversation time, on top of
ating the experience. A user’s experience was evaluated the other modalities.
through choice satisfaction and system effectiveness, us- The duration of the conservation affected, in turn, the
ing questionnaire items that were evaluated on 5-point evaluation of the user. Inferred from our confirmatory
Likert scales. factor analysis (cf. Table 2), users who interacted with
Chosen recipes were evaluated according to their the chatbot for longer periods of time indicated greater
healthiness. This was evaluated using the FSA score [27]. levels of system effectiveness (𝑝 < 0.01). This indicated
Each recipe was scored between 4 and 12, where 4 indi- that an extended engagement did not frustrate users. In-
cated that all four nutrients (sugar, fat, saturated fat, salt) stead, it indicated that they were enthusiastic about using
adhered to nutritional guidelines per 100g [9, 28], while the system. Figure 3 also shows that the healthiness of
12 would indicate that a recipe was unhealthy because chosen recipes was not significantly related to any of the
of all nutritional contents being too high. other aspects or factors. Note that the MM-Label condi-
The responses to the evaluation questionnaire item tion led the healthiest recipe choices, but the differences
were submitted to a confirmatory factor analysis (CFA; with the other conditions were not significant.
see Table 2). Unfortunately, we could not infer a reli-
able construct for choice satisfaction, as the variance
explained by the questionnaire items was too low, while 4. Conclusion and Future Work
Cronbach’s Alpha was only acceptable (0.60). Other items
We have presented a novel chatbot-like recommender
were dropped from the system effectiveness aspect be-
system that introduces multi-modality in interaction with
cause of low factor loadings.
user, presentation of results and explanation of the rec-
We organized the different factors (e.g., conversation
ommendations with nutrition labels in a conversational
time, condition factors) and aspects (i.e., system effective-
scenario. We have designed and analyzed the impact
ness) into a path model using Structural Equation Mod-
of three distinct version of our chatbot: pure textual,
elling. Figure 3 depicts the resulting model, which had de-
multi-modal (use of text and images), and multi-modal
cent fit statistics: 𝜒 2 (17) = 28.064, 𝑝 < 0.05, 𝐶𝐹 𝐼 = 0.969,
supplemented with nutritional labels.
𝑇 𝐿𝐼 = 0.954, 𝑅𝑀𝑆𝐸𝐴 = 0.058, 90% − 𝐶𝐼: [0.009, 0.095].
Our experimental evaluation reveals that our chatbot
The relevant AVEs of the aspects was sufficiently high to
is the most effective when accompanied by explanatory
form a path model [29].
labels. This is indicated by the length of conversation, as
Our analysis revealed that the MM-Label condition
well as by the user’s evaluation of the system effective-
with nutrition labels (MM-label) stood out in terms of
ness.
how long users interacted with our chatbot. Figure 3
Limitations to this study could be viewed from dif-
illustrates this, while the use of multi-modal approaches
ferent viewpoints. In terms of analysis, we have been
alone had no effect on the interaction or evaluation fac-
unable to infer the choice satisfaction evaluation aspect.
tors considered. For MM-Label, our mediation analysis
Other research have demonstrated that decision satisfac-
suggested that in the MM-Label condition, the conversa-
tion is a good predictor of post-interaction engagement
the project’s GitHub repository. with selected item, such as for household energy con-
Figure 3: Structural Equation Model (SEM). Numbers on the arrows represent the 𝛽-coefficients, standard errors are denoted
between brackets. Effects between the subjective constructs are standardized and can be considered as correlations, other
effects show regression coefficients. Aspects are grouped by color: Objective system aspects are purple, behavioral indicators
are blue (note: the FSA score represents recipe unhealthiness) and experience aspects are orange. The thinner arrows are
non-significant relations, in addition: ∗∗∗ 𝑝 < 0.001, ∗∗ 𝑝 < 0.01, ∗ 𝑝 < 0.05.

servation [30]. Moreover, rather than relying solely on Adapt. Interact. 30 (2020) 251–284. URL: https://
system-driven interaction, it might be intriguing and nat- doi.org/10.1007/s11257-019-09250-7. doi:1 0 . 1 0 0 7 /
ural to investigate user-driven scenarios in which users s11257- 019- 09250- 7.
might query the system with an image and textual query. [6] A. Iovine, F. Narducci, G. Semeraro, Conversa-
The food categories considered in this work (pasta, salad, tional recommender systems and natural language:
dessert, snack) could additionally be expanded to include : A study through the converse framework, De-
more meal categories and their combinations, such as cis. Support Syst. 131 (2020) 113250. URL: https:
to create a complete meat (first dish, second dish and //doi.org/10.1016/j.dss.2020.113250. doi:1 0 . 1 0 1 6 / j .
vegetables). On top of that, the distinctions between var- dss.2020.113250.
ious label modalities are an additional intriguing topic [7] C. Trattner, D. Elsweiler, Food recommendations,
we wish to investigate more in-depth [31]. in: Collaborative recommendations: Algorithms,
practical challenges and applications, World Scien-
tific, 2019, pp. 653–685.
References [8] R. Y. Toledo, A. A. Alzahrani, L. Martinez, A food
recommender system considering nutritional infor-
[1] H. Zamani, J. R. Trippas, J. Dalton, F. Radlinski,
mation and user preferences, IEEE Access 7 (2019)
Conversational information seeking, arXiv preprint
96695–96711.
arXiv:2201.08808 (2022).
[9] A. D. Starke, M. C. Willemsen, C. Trattner, Nudg-
[2] D. Jannach, A. Manzoor, W. Cai, L. Chen, A survey
ing healthy choices in food search through visual
on conversational recommender systems, ACM
attractiveness, Frontiers in Artificial Intelligence 4
Computing Surveys 54 (2022) 1–36. doi:1 0 . 1 1 4 5 /
(2021) 621743.
3453154.
[10] E. J. Van Loo, C. Grebitus, J. Roosen, Explaining
[3] P. Cordero, M. Enciso, D. López, A. Mora, A conver-
attention and choice for origin labeled cheese by
sational recommender system for diagnosis using
means of consumer ethnocentrism, Food Quality
fuzzy rules, Expert Systems with Applications 154
and Preference 78 (2019) 103716.
(2020) 113449. doi:1 0 . 1 0 1 6 / j . e s w a . 2 0 2 0 . 1 1 3 4 4 9 .
[11] Y. Peng, J. B. Jemott III, Feast for the eyes: Effects
[4] D. Griol, J. Milina, From voicexml to multimodal
of food perceptions and computer vision features
mobile apps: development of practical conversa-
on food photo popularity., International Journal of
tional interfaces, ADCAIJ Adv. Distrib. Comput.
Communication (19328036) 12 (2018).
Artif. Intell. J. 5 (2016) 43.
[12] C. Zhou, Y. Jin, K. Zhang, J. Yuan, S. Li, X. Wang,
[5] F. Narducci, P. Basile, M. de Gemmis, P. Lops, G. Se-
Musicrobot: Towards conversational context-
meraro, An investigation on the user interac-
aware music recommender system, in: Interna-
tion modes of conversational recommender sys-
tional Conference on Database Systems for Ad-
tems for the music domain, User Model. User
vanced Applications, Springer, 2018, pp. 817–820. ingredient retrieval, in: 3rd Conference on Lan-
[13] J. Schaffer, T. Hollerer, J. O’Donovan, Hypothetical guage, Data and Knowledge (LDK 2021), Schloss
recommendation: A study of interactive profile ma- Dagstuhl-Leibniz-Zentrum für Informatik, 2021.
nipulation behavior for recommender systems, in: [25] A. El Majjodi, A. D. Starke, C. Trattner, Nudging
The Twenty-Eighth International Flairs Conference, towards health? examining the merits of nutrition
2015, pp. 507–512. labels and personalization in a recipe recommender
[14] Y. Deldjoo, M. Schedl, P. Cremonesi, G. Pasi, Rec- system, in: Proceedings of the 30th ACM Confer-
ommender systems leveraging multimedia content, ence on User Modeling, Adaptation and Personal-
ACM Computing Surveys (CSUR) 53 (2020) 1–38. ization, 2022, pp. 48–56.
[15] Y. Deldjoo, J. R. Trippas, H. Zamani, Towards multi- [26] Department of Health and Social Care UK, Front
modal conversational information seeking, in: Pro- of Pack nutrition labelling guidance, 2016. URL:
ceedings of the 44th International ACM SIGIR con- https://www.gov.uk/government/publications/
ference on research and development in Informa- front-of-pack-nutrition-labelling-guidance.
tion Retrieval, 2021, pp. 1577–1587. [27] D. of Health UK, F. S. Agency, Guide
[16] R. G. Sousa, P. M. Ferreira, P. M. Costa, P. Azevedo, to creating a front of pack (fop) nutri-
J. P. Costeira, C. Santiago, J. Magalhaes, D. Semedo, tion label for pre-packed products sold
R. Ferreira, A. I. Rudnicky, et al., ifetch: Multimodal through retail outlets (2016). URL: https:
conversational agents for the online fashion market- //assets.publishing.service.gov.uk/government/
place, in: Proceedings of the 2nd ACM Multimedia uploads/system/uploads/attachment_data/file/
Workshop on Multimodal Conversational AI, 2021, 566251/FoP_Nutrition_labelling_UK_guidance.pdf.
pp. 25–26. [28] C. Trattner, D. Elsweiler, Investigating the healthi-
[17] L. Liao, L. H. Long, Z. Zhang, M. Huang, T.-S. Chua, ness of internet-sourced recipes: implications for
Mmconv: an environment for multimodal conversa- meal planning and recommender systems, in: Pro-
tional search across multiple domains, in: Proceed- ceedings of the 26th international conference on
ings of the 44th International ACM SIGIR Confer- world wide web, ACM, New York, NY, USA, 2017,
ence on Research and Development in Information pp. 489–498.
Retrieval, 2021, pp. 675–684. [29] B. P. Knijnenburg, M. C. Willemsen, Evaluating
[18] S. Moon, S. Kottur, P. A. Crook, A. De, S. Pod- recommender systems with user experiments, in:
dar, T. Levin, D. Whitney, D. Difranco, A. Beirami, Recommender systems handbook, Springer, 2015,
E. Cho, et al., Situated and interactive multimodal pp. 309–352.
conversations, arXiv preprint arXiv:2006.01460 [30] A. Starke, M. Willemsen, C. Snijders, Effective user
(2020). interface designs to increase energy-efficient behav-
[19] Y. Yuan, W. Lam, Conversational fashion image ior in a rasch-based energy recommender system,
retrieval via multiturn natural language feedback, in: Proceedings of the eleventh ACM conference
in: Proceedings of the 44th International ACM SI- on recommender systems, 2017, pp. 65–73.
GIR Conference on Research and Development in [31] Y. Deldjoo, M. Schedl, B. Hidasi, Y. Wei, X. He, Mul-
Information Retrieval, 2021, pp. 839–848. timedia recommender systems: Algorithms and
[20] D. Elsweiler, H. Hauptmann, C. Trattner, Food challenges, in: Recommender systems handbook,
recommender systems, in: Recommender Systems Springer, 2022, pp. 973–1014.
Handbook, Springer, 2022, pp. 871–925.
[21] S. Barko-Sherif, D. Elsweiler, M. Harvey, Conversa-
tional agents for recipe recommendation, in: Pro-
ceedings of the 2020 Conference on Human Infor-
mation Interaction and Retrieval, 2020, pp. 73–82.
[22] A. Steinfeld, O. C. Jenkins, B. Scassellati, The oz
of wizard: simulating the human for interaction
research, in: Proceedings of the 4th ACM/IEEE in-
ternational conference on Human robot interaction,
2009, pp. 101–108.
[23] Á. Mendes Samagaio, H. Lopes Cardoso, D. Ribeiro,
A chatbot for recipe recommendation and prefer-
ence modeling, in: EPIA Conference on Artificial
Intelligence, Springer, 2021, pp. 389–402.
[24] Á. M. Samagaio, H. Lopes Cardoso, D. Ribeiro, En-
riching word embeddings with food knowledge for