<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>E. Karahodža);</journal-title>
      </journal-title-group>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Usability and Efectiveness of Bot-Assisted Group Decision Making</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Esma Karahodža</string-name>
          <email>esma.karahodza@etf.unsa.ba</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Amra Delić</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Francesco Ricci</string-name>
          <email>fricci@unibz.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Workshop</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Free University of Bozen-Bolzano</institution>
          ,
          <addr-line>Bolzano-Bozen</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Group Recommender Systems, System Usability Scale, Decision-making Support</institution>
          ,
          <addr-line>Chatbot</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Sarajevo</institution>
          ,
          <addr-line>Sarajevo</addr-line>
          ,
          <country country="BA">Bosnia and Herzegovina</country>
        </aff>
      </contrib-group>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>Group recommender systems (GRSs) have been designed to support collective decision-making. Conversational GRSs are claimed to provide important advantages, related to their flexibility in replying to users' questions, and their adaptability to the dynamic evolutions of users' preferences.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        process [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        Group Recommender Systems (GRSs) are designed to assist groups in reaching decisions, by delivering
recommendations and supporting the diferent stages of the decision-making process [
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1, 2, 3</xref>
        ]. The
standard approach to generating recommendations resides on balancing the group members’ individual
preferences by using a preference aggregation strategy. Individual preferences are either explicitly
stated by group members or derived from the user actions during the interactive group decision-making
      </p>
      <p>
        In fact, a classical research line in this area has focused on preference aggregation, employing various
strategies, either inspired by Social Choice Theory [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], their extensions [
        <xref ref-type="bibr" rid="ref6 ref7 ref8 ref9">6, 7, 8, 9</xref>
        ], or heuristic-based
methods [
        <xref ref-type="bibr" rid="ref10 ref11 ref12 ref13">10, 11, 12, 13</xref>
        ], and more recently even neural network-based approaches [
        <xref ref-type="bibr" rid="ref14 ref15 ref16 ref17 ref18 ref19">14, 15, 16, 17, 18, 19</xref>
        ].
A smaller body of research has proposed tools that de-emphasize the preference aggregation task and
try to support the full group decision-making process, including tasks such as eliciting preference,
negotiation and discussion facilitation, and guiding the group towards a final decision [ 20, 3, 21, 22, 23,
      </p>
      <p>Regardless of the research focus, it is important to highlight that, diferently from the widespread
difusion of single-user Recommender Systems, GRSs have failed to find traction in real-world platforms.</p>
      <p>CEUR</p>
      <p>ceur-ws.org</p>
      <p>To the best of our knowledge, there is no GRS in operation today that serves a large users’ base, in a
real-world setting, despite decades of active research in the field [ 26, 27]. We believe that this negative
outcome could be motivated by the wrong assumption that groups will be eager to use domain-specific
applications dedicated to choose either a movie, a restaurant, or an accommodation. This overlooks the
fact that, diferently from the individual recommendation scenario, when people make group decisions
online, they need to interact not only with the RS, but also with the other group members. However,
this intergroup communication functionality is ofered already by general purpose communication
platforms, such as, chat applications (e.g. WhatsApp). Hence, a GRS has a larger chance to enter
in the common practice of a group if it does not require the group to change their common group
communication and discussion tool, i.e., their chat app.</p>
      <p>To this end, we introduce and evaluate a new kind of GRS, which is not built as a stand-alone
application but is instead designed as a lightweight chatbot, embedded into an existing instant messaging
chat app (Telegram), where usually decision-making processes naturally occur. Here, we assess the
usability of this chatbot-based GRS and its impact on the group decision-making process. Our Telegram
bot ofers fundamental and necessary group decision-making functionalities, including the generation
of group recommendations, but the recommended items were not proposed by the RS (as it usually
happens), they were suggested by the group members as proposals that they independently identified.
We analyse the GRS efect on group decision-making processes across two tasks of varying complexity,
each performed with and without the assistance of the GRS chatbot, resulting in a between-subjects
experimental design with four conditions.</p>
      <p>Our experimental results indicate that the proposed GRS chatbot achieves a usability score well
above average, highlighting its sound design. However, explicit users’ comments clearly indicate some
system limitations and worth to be implemented improvements. In general, the main positive outcome
is that, by using the CHARM chatbot, groups perceived the decision-making process as significantly
less dificult (compared with groups that did not use CHARM). This efect was especially pronounced
among the participants who rated the bot easy to use, hence revealing a positive correlation between
usability and perceived decision-making complexity. Furthermore, discussions supported by CHARM
produced significantly fewer messages, suggesting that CHARM provides a meaningful level of cognitive
ofload for participants. Moreover, even among those who did not find the bot particularly useful in the
experimental setting (two rather easy decision tasks), many noted its potential value in more complex
decision-making scenarios, and called for a wider intelligent and proactive system behaviour in future
versions. Overall, more than 93% of participants either directly mentioned or clearly implied potential
use cases, relevant domains, and openness to using the chatbot in real-world decision-making contexts.</p>
      <p>The remainder of the paper is organized as follows. Section 2 reviews existing tools supporting the
decision-making process, as well as the literature on chat-based support. In section 3, we present the
functionalities of CHARM and its commands. This tool is evaluated in the user study which is described
in section 4. We present the results of this study and corresponding analysis in Section 5, and in Section
6 we outline possible future directions for the chatbot.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        While most GRSs focus on generating single-shot recommendations by aggregating individual
preferences [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], a smaller body of research has focused on the proper design of a broader set of functionalities,
supporting the full group decision-making process. Early examples include TDF [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], where agents
negotiate on behalf of unavailable users, and CATS [22], a GRS where users can critique
recommendations or peer suggestions, prompting updated recommendations. Similarly, Where2Eat [23] supports
asynchronous interaction and proposal revision during group discussion. Later, systems added more
dynamic interaction. HOOTLE [24], for instance, facilitates group discussions around desirable item
features, allowing members to articulate and revise their preferences. Another system, developed by
Nguyen et al. [
        <xref ref-type="bibr" rid="ref4">4, 28</xref>
        ], combines long-term preference modelling with short-term feedback collected
during chat-based interaction. The system supports group conversations and allows users to make
suggestions, react to others’ proposals, and receive aggregated recommendations within the discussion
interface.
      </p>
      <p>Beyond GRS-specific tools, chatbot-based support for group decision-making has also been explored.
In [29, 30], the authors showed that chatbot interventions can enhance information exchange and
decision quality, especially when introduced early in discussions. Similarly, Tilda [31] improves
chatbased collaboration by tagging message types and generating summaries to reduce conversational
overload. Chatbots have also been used as discussion facilitators. Kim et al. [32, 33] developed bots
that encourage participation, manage time, and structure group deliberation leading to greater opinion
diversity and higher perceived decision quality. SolutionChat [34] adds visualization and moderation
support to help participants track discussion progress and highlight key arguments.</p>
      <p>Together, these works show a growing trend of integrating intelligent and personalised decision
support into natural group conversations. Our work contributes to this line of research.</p>
      <p>In [35] a focus study was conducted to examine how groups make decisions across a variety of issues
and to identify areas where the process can be supported by a tool or an agent. Than, in [36] and [37],
the CHARM framework is introduced as a tool designed to support group decision-making on items
proposed by group members. A more detailed explanation on CHARM is given in the next section.</p>
    </sec>
    <sec id="sec-3">
      <title>3. CHARM: A Chat-bot based GRS</title>
      <p>In this section, we describe the functionalities of the group decision-making assistive tool evaluated in
this study. CHARM (CHAt-bot group RecoMmender) is a TelegramBot [36] designed to support groups
in collaborative decision-making tasks. In general, a TelegramBot is simply a Telegram account operated
by software, that monitors conversations and interacts with users. CHARM is domain independent,
which means that the bot is not capable to autonomously suggest relevant items. In practice CHARM can
only reason on the data collected while interacting with the group members and exploiting information
and knowledge exchanged in the group. It is up to the group members to propose options for the group
to choose.</p>
      <p>The version of CHARM, which we used in our study, provides the following six functionalities with
the goal of structuring and directing the decision-making process:
1. Start - initiating a decision-making task, invoked by “\start” command;
2. Suggestion - enables users to make annotated item suggestions to their fellow group members,
invoked by “\suggest”;
3. Feedback - enables group members’ to provide explicit feedback on the suggested items, in the
form of “Love it”, “Like”, ”Dislike”, and ”Hate it” reactions;
4. Summary - generates the list of the suggested items together with the feedback that participants
gave to them, it is invoked by “\summarize”;
5. Recommendation - invoked by “\recommend”, shows aggregated scores of the votes, where
”Dislike”, “Like“ and “Love it“ are mapped to -1, 1 and 2, respectively, and generates recommendations
based on the users’ feedback;
6. Decision - allows group members to conclude the decision-making process by using the “\decide”,
which prompts the bot to present a list of all suggestions for the group to choose from.</p>
      <p>
        The bot’s functionalities are designed to ensure that the process of making item suggestions is easily
traceable, to reduce participants’ cognitive load from remembering all the suggestions and others’
opinions, and to assist in the preselection of potentially interesting options for the group through a
simple group recommendation approach. The approach uses the average without misery [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] aggregation
strategy, which averages members’ feedback, but eliminates options with at least one “Hate it“ reaction.
      </p>
      <p>Figure 1 shows a simplified real-world scenario, where Bob, Alice, and Carol are deciding on a movie
to watch. After Bob starts the session by using the “\start“, Carol and Bob make item suggestions with
“\suggestion“. The final decision is made after CHARM displays the summary and aggregated scores of
the votes (“\summarize“ and “\recommend“). The decision-making process is concluded when Bob uses
the “\decided“, to communicate the final choice to CHARM.</p>
    </sec>
    <sec id="sec-4">
      <title>4. User Study Methodology</title>
      <p>In this section, we describe the user study procedure and the analyses of the collected data, in order to
understand the efects of CHARM on the group decision-making process.</p>
      <p>Data Collection Procedure. The study was conducted at the University of Sarajevo, obtaining the
ethical approval from the dean of the Faculty. The participants were first-year undergraduate students in
the Computer Science and Informatics program, and the study was carried out as part of the Probability
and Statistics course, as an optional activity. Prior to the study, participants were asked to create groups
consisting of 2 to 6 members and to register their group with an appointed group leader. The data
collection process consisted of two phases: 1) Group-chat discussion phase and 2) Post-questionnaire
phase.</p>
      <p>In the group discussion phase, participants were asked to invite their group members to a Telegram
group chat and to discuss and make a joint decision (choice) that would satisfy all group members as
much as possible. Each group was assigned to one of the four experimental conditions. The experimental
conditions defined the decision-making task, as well as whether or not a group will be using CHARM
assistance during the decision-making process.</p>
      <p>The first task was to select a movie theatre (two options were available), a movie (from those played
at that moment in the two theatres), and a time-slot when all group members are available. Hence, the
groups were required to make three decisions.</p>
      <p>For the second task, participants were first presented with the following scenario: “Your group
achieved the best result on Homework 1 in the course Probability and Statistics among all groups. As a
recognition of your efort and outstanding results, the Faculty has decided to reward you with a fully funded
trip to an international conference on applied statistics, which will be held in Vienna from July 14 to 17 at
TU Wien, address: Karlsplatz 13, 1040 Vienna, Austria.”. Then, the groups were instructed as follows:
“Your task is to find and propose suitable accommodation for the entire group for the period from July 13 to
18. The total accommodation budget is €250 per person for the five nights.” .</p>
      <p>Hence, in the second task the group was required to make a single-decision, in contrast with the first
task, a multiple-decision task, which is considered more complex. To this end, the four experimental
conditions are v1: multiple-decisions task without CHARM, v2: single-decision task without CHARM,
v3: multiple-decision task with CHARM, and v4: single-decision task with CHARM.</p>
      <p>After reaching a decision, the participants were instructed to export their group chat, including
multimedia content, and submit their data to the research team. To ensure anonymity and ethical research
standards, an anonymisation script was provided. The script automatically hashed all usernames and
mentions in the chat transcripts, preserving the participants’ privacy.</p>
      <p>The final dataset comprised 44 group chats involving a total of 164 participants, the distribution of
participants and groups over the four experimental conditions is shown in table 1. All participants were
aged between 19 and 20 years. The gender distribution was 61% male and 39% female.</p>
      <p>Experimental Condition
v1: Multiple-decision without CHARM
v2: Single-decision without CHARM
v3: Multiple-decision with CHARM
v4: Single-decision with CHARM
Total</p>
      <p>In the post-questionnaire phase, all participants were asked to state their level of agreement, on a
ifve-point Likert scale, from 1 - “Strongly disagree”, to 5 - “Strongly agree”, with two sets of statements
on choice satisfaction and decision making process dificulty. The assessment of the group choice
satisfaction included three statements: “I like the choice that we, as a group, have made”; “The choice we
made satisfies my preferences” ; “The choice we made was fair”. The dificulty of the group decision-making
process was evaluated with six statements: “I Eventually I was in doubt between some alternatives”; “The
task of making this decision was overwhelming”; “The decision process was frustrating”; “I changed my
mind several times before making the final decision” ; “I think we have chosen the best option from the
available options”; “To make the decision was easy”.</p>
      <p>Finally, the participants in the bot-assisted conditions were asked to complete the System Usability
Scale (SUS) questionnaire[38], and to provide feedback on their experience with CHARM. The System
Usability Scale (SUS) is a standardized questionnaire used to evaluate the perceived usability of a system
or product. It consists of ten items, which are as follows: “I think that I would like to use this system
frequently”; “I found the system unnecessarily complex”; “I thought the system was easy to use”; “I think
that I would need a support of a technical person to use this system”; “I found the various functions in the
system were well integrated”; “I thought there was too much inconsistency in this system”; “I imagine that
most people would learn to use the system quickly”; “I found the system very cumbersome to use”; “I felt
very confident using the system” ; “I needed to learn a lot of things before I could get going with this system”.</p>
      <p>The SUS score is computed by summing the participants’ responses to these items on a 5-point Likert
scale, adjusting odd- and even-numbered items separately, and then multiplying the total by 2.5 to
obtain a score ranging from 0 to 100. Higher scores indicate better usability. Scores above 85 indicate
excellent usability, 70–84 suggest good usability, 50–69 indicate marginal usability, and scores below 50
reflect poor usability.</p>
      <p>The experience with CHARM was also evaluated with the two additional questions: ‘How helpful
was the Telegram bot in making a decision?” with options: “It was not helpful at all, it just made
the decision-making process more complicated.”, “The feedback option for suggestions was helpful.”, “The
summary functionality was helpful.”, “The recommend functionality was helpful.” “The bot was in general
helpful.”, and a free-text entry; “List improvements / corrections (in bullet points), that you think would
be beneficial for bot.”, a free-text question.</p>
      <p>Conducted Analyses. We investigated the usefulness and impact of the CHARM-bot in group
decision-making by comparing conditions with and without its support. We first assessed the usability
of CHARM by using the SUS score. Then, with Brunner–Munzel test, we investigated diferences in
choice satisfaction, perceived dificulty, and users-to-users interaction patterns (number of messages
exchanged), between the four experimental conditions. The Brunner–Munzel test is a non-parametric
statistical test that examines whether there are diferences between two independent populations, that
is, whether one population tends to yield larger values than the other. Spearman correlations were
used to explore the relationships between usability, satisfaction, and dificulty. Lastly, we analysed how
participants used the bot, the perceived usefulness, and their suggestions for improvement.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Experimental Results and Analysis</title>
      <p>System Usability Score. The system usability score of the CHARM-bot was 72.02 which exceeds
the benchmark of 69 and is considered as acceptable usability with minor improvements needed.</p>
      <p>Moreover, CHARM received a slightly higher score (73.70) when used for the single-decision task,
compared to the multiple-decision task (71.03) - although this diference is not statistically significant.
Comparisons of decision-making process with and without CHARM assistance. To compare
the diferences between two independent populations, specifically, the group decision-making processes
conducted with and without CHARM, while considering choice satisfaction, perceived decision-making
dificulty, and the number of messages exchanged, as we previously noted, we used the non-parametric
Brunner–Munzel test. The results are shown in Table 2, and in sake of conciseness, we present only
statistically significant diferences at a threshold of  &lt; 0.05. The results indicate that perceived dificulty
is significantly lower for groups using CHARM, particularly in the more complex task. However,
CHARM does not lead to increased choice satisfaction; in fact, satisfaction was slightly higher in groups
that did not use CHARM for the simpler task. Conversely, the total number of messages exchanged
is significantly lower in groups that used CHARM. We justify these results by guessing that CHARM
is reducing the cognitive load of group members by helping them to keep track of the various item
suggestions and the others’ opinions.
Correlation Analysis. In order to better understand these results, we performed a Spearman
correlation analysis, measuring the relationship between the system usability on the one side and perceived
dificulty and choice satisfaction on the other side. Significant correlations are shown in Table 3. These
results strongly indicate that the decision-making process is significantly less dificult and choice
satisfaction significantly higher for participants who scored CHARM higher in usability.
CHARM Functionalities Usage Analysis. Next, to illustrate how participants used CHARM, in
Figure 2 we show the frequency, and box-plots of CHARM and mistyped commands usage. The most
frequently used functionality is “\suggest“ (a group member suggests an item), and the frequency of
the other functionalities is quite uniform. The figure shows a relatively high number of mistyped
commands, but the majority of these were actually made by 3 groups.</p>
      <p>Participants Feedback Analysis. When asked how helpful the bot was in the decision-making process,
30 subjects indicated that the bot was in general helpful, 31 pointed out the summary functionality
as helpful, 29 indicated recommendations functionality, 20 the feedback option, while 11 out of 74
(14.86%) indicated that the bot was not helpful at all. Participants who did not find the CHARM helpful
emphasized that this was either because the task was too simple to require such support, or because
of broken and unreliable core commands, like summary and recommendations. Notably, 14 out of 74
participants explicitly reported these command failures, which directly contributed to their frustration
and reduced perceived usefulness.</p>
      <p>Despite these issues, the overall sentiment leans toward optimism about the bot’s potential:
participants consistently acknowledge that in more complex decision-making scenarios, such a tool could be
highly valuable. Suggestions for future improvements were abundant and constructive. Participants
called for smarter, context-aware behaviour, for instance, generating suggestions and recommendations
based on the ongoing conversation; more intuitive interaction, such as, buttons or menus instead of
typed commands, improved naming conventions, and an easier way to modify or remove suggestions;
further support for visualising preferences and feedback; and finally clearer onboarding and better
feedback when something goes wrong. The absence of helpful error messages or step-by-step guidance
made troubleshooting dificult and added to confusion. Many envisioned the bot evolving into an
intelligent assistant that simplifies group coordination, increases fairness, and enables more eficient
decision-making in real-world chat environments.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusion</title>
      <p>In this paper, we have presented the system usability analysis of the CHARM-bot and an initial
comparison of group decision-making processes with and without its assistance. Our findings suggest
that even in its current, simple form, a domain independent and purely reactive bot like CHARM
can ofer some support to group decision-making by reducing perceived dificulty and by lowering
the total number of exchanged messages. We attribute this result to CHARM’s design goal of easing
cognitive load, especially by tracking suggestions and opinions. However, no diferences in choice
satisfaction were found, which we believe may be due to the bot ensuring complete transparency of the
decision-making process: when all suggestions and feedback remain visible and accessible at any time,
there is less room for manipulation or persuasion, and participants are also less likely to forget their
initially preferred options. Although in both cases (with and without CHARM assistance), the average
choice satisfaction score was well above 4 (out of 5).</p>
      <p>The usage of bot functionalities was generally balanced, except for the higher use of the suggestion
function. The summarization command was seen as most helpful, while 11 out of 74 participants
did not find CHARM useful. Most of these users felt the task was too simple to require assistance
and encountered issues with core features. Importantly, participants ofered valuable suggestions for
improving the bot, many of which align with current directions in GRS research [26, 27, 39], such as
generating summaries of the discussion, automatically extracting suggestions from chat, and providing
additional recommendations through web search.</p>
      <p>We acknowledge the limitations of our study, including the relatively small sample size, as well as
the convenient sample, i.e., all participants were students, and the preliminary nature of our analysis.</p>
      <p>In our upcoming work, our plan is to first improve the robustness of the bot by eliminating failures
in its core functionalities. With the new CHARM version, we then intend to conduct a more in-depth,
content-focused comparison of decision-making processes with and without CHARM. Specifically, we
aim to investigate the following hypotheses:</p>
      <p>H1: CHARM fosters a more task-focused and structured decision-making process — as suggested by the
observed reduction in exchanged messages, we assume that the bot helps groups stay oriented on their
goal by providing simple mechanisms to track options and preferences.</p>
      <p>H2: CHARM increases transparency in group decisions — by keeping all suggestions and feedback
constantly visible, the bot makes it harder to disregard opinions or dominate the discussion, ensuring
more transparent and balanced participation.</p>
      <p>H3: Decisions reached with CHARM are perceived as fairer — we hypothesize that improved
transparency and equal access to information lead participants to feel more represented and treated equitably.</p>
      <p>H4: CHARM promotes exploration of a broader set of alternatives — since suggesting and recalling
options is simplified, we expect participants to contribute more diverse proposals, ultimately enriching
the decision space.</p>
      <p>Furthermore, by adding simple functionalities to CHARM-bot, such as calling less active (or even
passive) users to take part in providing their feedback, we aim to investigate how such interventions
would afect various group decision-making aspects, including evenness of participation, fairness,
satisfaction, and perceived complexity. Finally, in situations where conflicts are detected (e.g., opposing
views or polarization), we envision extending the bot’s functionality to find alternatives outside the
user-made suggestions that can better satisfy multiple perspectives.</p>
      <p>In the longer term, we also plan to analyse the influence of social relationships, participant roles,
and emotional dynamics during group discussions, tailoring the bot’s proactiveness accordingly. This
includes enabling the bot to engage in one-on-one conversations with individual users when appropriate,
with the goal of easing conflict resolution and guiding groups toward reaching more balanced and
satisfactory decisions overall.</p>
    </sec>
    <sec id="sec-7">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the author(s) used Chat-GPT-4 for sentence polishing. After using
this tool, the author(s) reviewed and edited the content as needed and take(s) full responsibility for the
publication’s content.
(2021) 1–27.
[20] L. Ardissono, A. Goy, G. Petrone, M. Segnan, P. Torasso, Intrigue: personalized recommendation
of tourist attractions for desktop and hand held devices, Applied artificial intelligence 17 (2003)
687–714.
[21] A. Venturini, F. Ricci, Applying trip@ dvice recommendation technology to www. visiteurope.</p>
      <p>com, Frontiers in Artificial Intelligence and Applications 141 (2006) 607.
[22] K. McCarthy, M. Salamó, L. Coyle, L. McGinty, B. Smyth, P. Nixon, The needs of the many: a
case-based group recommender system, Advances in Case-Based Reasoning (2006) 196–210.
[23] F. Guzzi, F. Ricci, R. D. Burke, Interactive multi-party critiquing for group recommendation, in:</p>
      <p>RecSys, ACM, 2011, pp. 265–268.
[24] J. O. Á. Márquez, J. Ziegler, Preference elicitation and negotiation in a group recommender system,
in: Human-Computer Interaction, Springer, 2015, pp. 20–37.
[25] D. Contreras, M. Salamó, L. Boratto, Integrating collaboration and leadership in conversational
group recommender systems, ACM Transactions on Information Systems (TOIS) 39 (2021) 1–32.
[26] F. Ricci, A. Delić, Widening the role of group recommender systems with cajo, 2025. URL: https:
//arxiv.org/abs/2504.05934. arXiv:2504.05934.
[27] D. Jannach, A. Delić, F. Ricci, M. Zanker, Rethinking group recommender systems in the era of
generative ai: From one-shot recommendations to agentic group decision support, arXiv preprint
arXiv:2507.00535 (2025).
[28] T. N. Nguyen, F. Ricci, A chat-based group recommender system for tourism, Information</p>
      <p>Technology &amp; Tourism 18 (2018) 5–28.
[29] N. Gurkan, B. Yan, Chatbot catalysts: Improving team decision-making through cognitive diversity
and information elaboration, in: ICIS 2023 Proceedings, ???? URL: https://par.nsf.gov/biblio/
10545067.
[30] B. Yan, N. Gurkan, It depends on the timing: The ripple efect of ai on team decision-making, in:</p>
      <p>Proceedings of the Hawaii International Conference on System Sciences, 2023.
[31] A. X. Zhang, J. Cranshaw, Making sense of group chat through collaborative tagging and
summarization, Proc. ACM Hum.-Comput. Interact. 2 (2018). doi:10.1145/3274465.
[32] S. Kim, J. Eun, C. Oh, B. Suh, J. Lee, Bot in the bunch: Facilitating group chat discussion by
improving eficiency and participation with a chatbot, in: Proceedings of the 2020 CHI Conference
on Human Factors in Computing Systems, 2020, pp. 1–13.
[33] S. Kim, J. Eun, J. Seering, J. Lee, Moderator chatbot for deliberative discussion: Efects of discussion
structure and discussant facilitation, Proc. ACM Hum. Comput. Interact. 5 (2021) 87:1–87:26.
doi:10.1145/3449161.
[34] S.-C. Lee, J. Song, E.-Y. Ko, S. Park, J. Kim, J. Kim, Solutionchat: Real-time moderator support for
chat-based structured discussion, in: CHI ’20, 2020, pp. 1–12. doi:10.1145/3313831.3376609.
[35] A. Delic, H. Emamgholizadeh, F. Ricci, J. Masthof, Supporting group decision-making: Insights
from a focus group study, in: UMAP, ACM, 2024, pp. 301–306.
[36] A. Delic, H. Emamgholizadeh, T. N. Nguyen, F. Ricci, Charm: a group decision making support
chatbot, in: Companion Proceedings of the 29th International Conference on Intelligent User
Interfaces, 2024, pp. 7–10.
[37] H. Emamgholizadeh, A. Delic, F. Ricci, Supporting a group member to make a group choice, in:
Adjunct Proceedings of the 31st ACM conference on user modeling, adaptation and personalization,
2023, pp. 96–99.
[38] J. Brooke, et al., Sus-a quick and dirty usability scale, Usability evaluation in industry 189 (1996)
4–7.
[39] E. Karahodža, A. Delić, F. Ricci, Conceptual framework for group dynamics modeling from
group chat interactions, in: Adjunct Proceedings of the 33rd ACM Conference on User Modeling,
Adaptation and Personalization, 2025, pp. 23–27.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J.</given-names>
            <surname>Masthof</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Delic</surname>
          </string-name>
          ,
          <article-title>Group recommender systems: Beyond preference aggregation</article-title>
          ,
          <source>in: Recommender Systems Handbook</source>
          , Springer US,
          <year>2022</year>
          , pp.
          <fpage>381</fpage>
          -
          <lpage>420</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.</given-names>
            <surname>Felfernig</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Boratto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Stettinger</surname>
          </string-name>
          , M. Tkalčič (Eds.),
          <source>Group recommender systems, Signals and communication technology</source>
          , Springer Nature Switzerland,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Jameson</surname>
          </string-name>
          ,
          <article-title>More than the sum of its members: challenges for group recommender systems</article-title>
          ,
          <source>in: Proceedings of the working conference on Advanced visual interfaces</source>
          ,
          <year>2004</year>
          , pp.
          <fpage>48</fpage>
          -
          <lpage>54</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>T. N.</given-names>
            <surname>Nguyen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Ricci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Delic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. G.</given-names>
            <surname>Bridge</surname>
          </string-name>
          ,
          <article-title>Conflict resolution in group decision making: insights from a simulation study, User Model</article-title>
          .
          <source>User Adapt. Interact</source>
          .
          <volume>29</volume>
          (
          <year>2019</year>
          )
          <fpage>895</fpage>
          -
          <lpage>941</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J.</given-names>
            <surname>Masthof</surname>
          </string-name>
          , Group recommender systems: aggregation, satisfaction and group attributes, in: F.
          <string-name>
            <surname>Ricci</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Rokach</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          Shapira (Eds.),
          <source>Recommender Systems Handbook</source>
          , Springer,
          <year>2015</year>
          , pp.
          <fpage>743</fpage>
          -
          <lpage>776</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>I.</given-names>
            <surname>Ali</surname>
          </string-name>
          , S.-W. Kim, Group recommendations:
          <article-title>approaches and evaluation</article-title>
          ,
          <source>in: Proceedings of the 9th International Conference on Ubiquitous Information Management and Communication</source>
          , ACM,
          <year>2015</year>
          , p.
          <fpage>105</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Berkovsky</surname>
          </string-name>
          , J. Freyne,
          <article-title>Group-based recipe recommendations: analysis of data aggregation strategies</article-title>
          ,
          <source>in: Proceedings of the fourth ACM conference on Recommender systems, ACM</source>
          ,
          <year>2010</year>
          , pp.
          <fpage>111</fpage>
          -
          <lpage>118</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>L.</given-names>
            <surname>Quijano-Sanchez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Recio-Garcia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Diaz-Agudo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Jimenez-Diaz</surname>
          </string-name>
          ,
          <article-title>Social factors in group recommender systems</article-title>
          ,
          <source>ACM Transactions on Intelligent Systems and Technology (TIST) 4</source>
          (
          <issue>2013</issue>
          )
          <article-title>8</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>E.</given-names>
            <surname>Quintarelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Rabosio</surname>
          </string-name>
          , L. Tanca,
          <article-title>Recommending new items to ephemeral groups using contextual user influence</article-title>
          ,
          <source>in: Proceedings of the 10th ACM Conference on Recommender Systems</source>
          , ACM,
          <year>2016</year>
          , pp.
          <fpage>285</fpage>
          -
          <lpage>292</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>M.</given-names>
            <surname>Salamó</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>McCarthy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Smyth</surname>
          </string-name>
          ,
          <article-title>Generating recommendations for consensus negotiation in group personalization services</article-title>
          ,
          <source>Personal and Ubiquitous Computing</source>
          <volume>16</volume>
          (
          <year>2012</year>
          )
          <fpage>597</fpage>
          -
          <lpage>610</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>L. A. M. C.</given-names>
            <surname>Carvalho</surname>
          </string-name>
          , H. T. Macedo,
          <article-title>Users' satisfaction in recommendation systems for groups: an approach based on noncooperative games</article-title>
          ,
          <source>in: Proceedings of the 22nd International Conference on World Wide Web</source>
          ,
          <year>2013</year>
          , pp.
          <fpage>951</fpage>
          -
          <lpage>958</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>L.</given-names>
            <surname>Baltrunas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Makcinskas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Ricci</surname>
          </string-name>
          ,
          <article-title>Group recommendations with rank aggregation and collaborative filtering</article-title>
          ,
          <source>in: Proceedings of the fourth ACM conference on Recommender systems</source>
          ,
          <year>2010</year>
          , pp.
          <fpage>119</fpage>
          -
          <lpage>126</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>H.-N.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Bloess</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>El</surname>
          </string-name>
          <string-name>
            <surname>Saddik</surname>
          </string-name>
          ,
          <article-title>Folkommender: a group recommender system based on a graph-based ranking algorithm</article-title>
          ,
          <source>Multimedia systems 19</source>
          (
          <year>2013</year>
          )
          <fpage>509</fpage>
          -
          <lpage>525</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>D.</given-names>
            <surname>Cao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Miao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>An</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Hong</surname>
          </string-name>
          , Attentive group recommendation,
          <source>in: The 41st International ACM SIGIR conference on research &amp; development in information retrieval</source>
          ,
          <year>2018</year>
          , pp.
          <fpage>645</fpage>
          -
          <lpage>654</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Yin</surname>
          </string-name>
          ,
          <article-title>Double-scale self-supervised hypergraph learning for group recommendation</article-title>
          ,
          <source>in: Proceedings of the 30th ACM international conference on information &amp; knowledge management</source>
          ,
          <year>2021</year>
          , pp.
          <fpage>2557</fpage>
          -
          <lpage>2567</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Zhan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Cai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <article-title>A novel group recommendation model with two-stage deep learning</article-title>
          ,
          <source>IEEE Transactions on Systems, Man, and Cybernetics: Systems</source>
          <volume>52</volume>
          (
          <year>2021</year>
          )
          <fpage>5853</fpage>
          -
          <lpage>5864</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>H.</given-names>
            <surname>Yin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <article-title>Social influence-based group representation learning for group recommendation</article-title>
          ,
          <source>in: 2019 IEEE 35th International Conference on Data Engineering (ICDE)</source>
          , IEEE,
          <year>2019</year>
          , pp.
          <fpage>566</fpage>
          -
          <lpage>577</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>A.</given-names>
            <surname>Sankar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Sundaram</surname>
          </string-name>
          ,
          <article-title>Groupim: A mutual information maximization framework for neural group recommendation</article-title>
          ,
          <source>in: Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>1279</fpage>
          -
          <lpage>1288</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>L.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Yin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , K. Zheng,
          <article-title>Hierarchical hyperedge embedding-based representation learning for group recommendation</article-title>
          ,
          <source>ACM Transactions on Information Systems (TOIS) 40</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>