<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>M. Schedl, H. Zamani, C.-W. Chen, Y. Deldjoo, M. Elahi, Current challenges and visions in music
recommender systems research, International Journal of Multimedia Information Retrieval</journal-title>
      </journal-title-group>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.1145/3406522.3446033</article-id>
      <title-group>
        <article-title>Efects on Experience</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Shah Noor Khan</string-name>
          <email>s.n.khan1@uu.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jesse Nieuwkoop</string-name>
          <email>j.j.m.nieuwkoop@students.uu.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Judith Masthof</string-name>
          <email>j.f.m.masthoff@uu.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Workshop</string-name>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Music Recommender Systems, Fairness, User driven</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Utrecht University</institution>
          ,
          <addr-line>Princetonplein 5, 3584 CC Utrecht</addr-line>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2021</year>
      </pub-date>
      <volume>7</volume>
      <issue>2018</issue>
      <fpage>0000</fpage>
      <lpage>0001</lpage>
      <abstract>
        <p>This study investigates how user-driven customization of fairness and diversity afects satisfaction in music recommender systems. We developed a prototype allowing listeners to adjust four fairness dimensions: popularity, artist gender, nationality, and genre diversity. In 42 sessions with Dutch participants, interactive controls substantially improved perceived fairness, control, and added value. Genre diversity was most influential, while nationality was least engaged, with gender and popularity falling in between. Findings highlight that parametric design-not algorithmic complexity-drives improved user experience. We show that transparent, customizable fairness levers can make recommendation systems both fairer and more engaging.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
    </sec>
    <sec id="sec-2">
      <title>2. Related</title>
    </sec>
    <sec id="sec-3">
      <title>Work</title>
      <p>
        Fairness and diversity in RS are crucial, especially in cultural domains like music [
        <xref ref-type="bibr" rid="ref2">10, 2</xref>
        ]. While research
has focused on algorithmic fairness, user perceptions of fairness and their impact on satisfaction remain
underexplored. This study explores users’ perceptions of user-driven fairness customization in MRS.
      </p>
      <p>CEUR</p>
      <p>ceur-ws.org</p>
      <p>
        Fairness Challenges in Music Recommendation: MRS algorithms often exhibit structural biases,
particularly popularity bias [11], which favors mainstream content and marginalizes lesser-known
artists, reducing diversity for niche audiences [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Female artists are also consistently underrepresented
in generated playlists, underscoring the need for demographic equity and systemic representation
rather than one-size-fits-all corrections [ 7, 12, 13]. Most existing fairness interventions operate from a
system-driven perspective with little user input, and algorithmic accuracy does not guarantee a fair
user experience. This discrepancy between metric-based and perceived fairness [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] highlights the
importance of involving users in collaboratively defining fairness [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        Multi-Dimensional Fairness and Personalization: Fairness in recommendation is increasingly
recognized as multi-dimensional, with growing evidence that allowing users to personalize fairness
goals across dimensions can enhance satisfaction [
        <xref ref-type="bibr" rid="ref5">5, 14</xref>
        ]. Adaptive, participatory systems that respond
to user-defined fairness criteria tend to feel more equitable. Re-ranking approaches that incorporate
user preferences have been shown to improve the user experience, especially when individuals control
how fairness is applied to their lists [15]. Preferences for diversity and popularity vary widely across
cultures [16], while heavy reliance on historical genre labels can lead to filter bubbles and repetitive
recommendations [
        <xref ref-type="bibr" rid="ref6">6, 17</xref>
        ]. Diversity-focused approaches mitigate these issues by incorporating
crossgenre exposure and balancing relevance with novelty through multi-objective optimization [18, 19],
enriching listening experiences by introducing fresh yet contextually fitting tracks.
      </p>
      <p>User Perception and Fairness Awareness: Perceived fairness is shaped by how familiar and
relevant recommendations feel; both excessive familiarity and excessive novelty reduce satisfaction,
emphasizing the need for calibrated diversity that balances personal taste with exploration [19]. Subtle
interface design interventions—such as making genre visibility adjustable or guiding user
interactions—can improve perceived fairness and exploration without reducing comfort [20]. Interactive
controls also boost perceived control and trust by making recommendation processes more transparent
and interpretable [21].</p>
      <p>
        Algorithmic Decisions and User Trust: Fairness constraints applied without user understanding
or input can undermine trust, making transparency and user agency central, particularly in systems
afecting demographic exposure [ 7]. In line with HCI principles of interpretability and participatory
design [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], personalized fairness settings enhance user control and reduce dissatisfaction arising from
rigid, opaque interventions [9]. While personalization and fairness-aware re-ranking approaches exist,
few studies examine their efects on perceived recommendation quality, and the psychological impacts
of user-controlled fairness remain underexplored [20, 15]. Ofline studies using static fairness settings
fail to capture real user experience. Collecting live feedback from users interacting with customizable
fairness dimensions ofers more actionable insights for system design [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        This study introduces a system that lets users configure fairness preferences and examines how these
choices afect satisfaction and perceived fairness. By emphasizing user agency, it advances a view of
fairness as a dynamic user experience rather than a fixed system metric [
        <xref ref-type="bibr" rid="ref4">4, 14, 9</xref>
        ].
      </p>
    </sec>
    <sec id="sec-4">
      <title>3. Methodology</title>
      <p>We conducted a user study with 42 participants residing in the Netherlands, recruited through
convenience sampling, to explore how giving users control over fairness and diversity settings in an
MRS afects their satisfaction and listening experience. The study focused on user perception rather
than system accuracy. Participants used a simple, locally run web app to try the music recommender.
The Ethics and Privacy Quick Scan by the Utrecht University Research Institute of Information and
Computing Sciences deemed the research as low-risk, requiring no further assessment.</p>
      <sec id="sec-4-1">
        <title>3.1. Study Setup</title>
        <p>The study was conducted in person. The procedure was as follows:</p>
        <p>Step 0: Pre-survey. After providing informed consent, participants completed a short survey with
their demographics (age, gender, nationality). Next, they were directed to the custom app.</p>
        <p>Step 1: Seed track selection. Participants selected 10 seed tracks based on their preferences, as
illustrated in Figure 1. The selection is facilitated through a search bar connected to the Spotify API,
allowing users to find and add songs by title and artist. Once added, the songs appear in a dynamic
list with a counter on the right that displays the number of selected tracks. Participants could listen to
an audio preview of maximum 30 seconds per song, similar to a study by Fatahi et al. [22]. Each track
can be individually removed, and the ”Confirm Selection” button remains disabled until 10 tracks have
been added. This number of tracks was chosen to prevent any single track from disproportionately
influencing the recommendations. An open-ended search was used instead of a predefined list to
promote user autonomy and reduce bias.</p>
        <p>Step 2: Setting fairness sliders. After confirming their seed track selection, participants provided
their fairness preferences (see Figure 2), using four sliders, each representing a fairness dimension
(see selection rationale in Section3.2). The dimensions used were: track popularity (ranging from
niche to popular), artist gender (from the same gender to diverse genders), artist nationality (from the
same nationality to diverse), and genre diversity (from the same genre to diverse). Each slider was
set to a neutral midpoint by default, allowing participants to express preferences in either direction.
Sliders were chosen for their clarity and ease of use. Positioning this customization step after seed track
selection reinforces that users actively shape their recommendation experience, setting the expectation
that their input will influence the content.</p>
        <p>Step 3: Recommendations. After submitting their fairness preferences, the participants were
provided with a loading screen informing them that their recommendations were being generated.
Once ready, participants were shown ten recommended tracks (Figure 3). Each recommendation was
displayed as a card containing the song’s title, the artist’s name, and a preview button linked to the
Spotify embedded audio player. This layout supports both evaluation and passive discovery.</p>
        <p>Step 4: Post Survey. After listening, participants returned to the survey, where they assessed
perceived added value, fairness, and sense of control of our system, and their current experience with
MRS in general using 7-point Likert scales, as research emphasizes the importance of subjective RS user
evaluations. For more details see Section 3.3. Additionally, they indicated for each fairness dimension
whether it was important to them (ticking important ones). This survey completes the ”feedback loop”
central to the study’s design: users express their preferences, receive tailored recommendations, and
evaluate whether the system respected their input. The study’s pacing and clarity were refined through
pilot tests to ensure that participants clearly understood each step and could complete the study easily.</p>
      </sec>
      <sec id="sec-4-2">
        <title>3.2. Selection of fairness dimensions</title>
        <p>
          These dimensions were chosen because prior research has identified them as common sources of bias
in music recommendation systems as mentioned in Section 2. Popularity was chosen as mainstream
hits overshadow niche works [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ], Artist-gender diversity as female and non-binary artists remain
under-represented [7], Nationality diversity as local artists’ visibility varies by region [16], and Genre
variety as limited cross-genre exposure widens filter bubbles [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ].
        </p>
      </sec>
      <sec id="sec-4-3">
        <title>3.3. Survey evaluation categories</title>
        <p>
          The survey evaluation categories were chosen based on prior literature on subjective system evaluation.
Perceived Added value is widely recognized as essential for assessing user experience, capturing not
only relevance and enjoyment but also perceived personalization quality [23, 24]. Perceived fairness
is central to our research, as prior work shows that algorithmic fairness metrics often diverge from
user perceptions, making direct user assessment critical [
          <xref ref-type="bibr" rid="ref2 ref4">4, 2</xref>
          ]. Sense of control aligns with our core
hypothesis: increasing user agency in shaping recommendations enhances both perceived fairness
and satisfaction, leading to a better overall listening experience [21, 24]. Finally, Current experience
with MRS (in terms of feeling treated fairly and in control) helps determine how our system performs
relative to existing systems and highlights whether user-driven fairness customization ofers a perceived
improvement over mainstream platforms.
        </p>
        <p>
          Survey items were adapted from validated constructs in Human-Computer Interaction (HCI) and
recommender systems research to ensure reliability. Items measuring sense of control draw on established
scales addressing user agency and transparency [24], while fairness perception items build on recent
studies examining how users interpret and evaluate algorithmic fairness in personalized contexts [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ].
The survey can be seen in the supplementary materials1.
        </p>
      </sec>
      <sec id="sec-4-4">
        <title>3.4. System Architecture</title>
        <p>The MRS back end stores user data, processes seed tracks and fairness preferences, generates
recommendations via OpenAI, enriches them with Spotify metadata, and delivers results to the front-end.</p>
        <p>Data Storage and Recommendation Generation. User inputs (seed tracks and fairness preferences)
are stored locally using JSON, for API compatibility, ease of debugging, and easy integration across
system components. After submission, the back-end triggers an asynchronous recommendation process,
while the front-end polls the server every few seconds. Results typically appear within 15–30 seconds.</p>
        <p>API Integration and Validation Logic. Spotify’s API (1) supports the seed track search bar and
(2) validates GPT-generated recommendations and locates playable tracks. OpenAI’s GPT API was
selected as it supports structured JSON outputs, can be steered with detailed prompts, integrates well
with Python, and allows incorporation of user-specified fairness dimensions.</p>
        <p>Prompt Engineering with Fairness Dimensions. Prompts sent to GPT include seed tracks and
normalized (0–100) values for popularity, gender diversity, nationality diversity, and genre diversity (e.g
0 to 10 = Only very niche/obscure/underground tracks, so likely diferent from seed tracks). Detailed
instructions guide GPT to respect these preferences while avoiding overfitting, hallucination, or invalid
outputs. GPT is instructed to return exactly ten tracks in a fixed JSON format, including title, artist, and
genre. Prompt design required extensive iteration. Early prompts led to extremes—recommending only
frequent seed artists or generating entirely unfamiliar, overly diverse results. Adjustments were made
to balance diversity with musical identity across genres and configurations. The prompts are available
in the supplementary materials2.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>4. Results</title>
      <p>In this section, first we report participant characteristics and the reliability of the measurement scales.
Second, we analyze subjective evaluations, including perceived fairness, control, added value, and
overall experience. Third, we examine behavioral preference measures derived from the fairness sliders
and consider the importance ratings and the alignment between preferences and importance.</p>
      <sec id="sec-5-1">
        <title>4.1. Demographics</title>
        <p>Among the 42 participants, 25 (59.5%) identified as male and 17 (40.5%) as female, providing a modest
male majority. Ages ranged from 19 to 25 years ( = 20.8 ,  = 1.6 ), reflecting a predominantly
early-twenties, university-age sample, making the findings most applicable to young adult listeners.</p>
      </sec>
      <sec id="sec-5-2">
        <title>4.2. Reliability of Scales</title>
        <p>To ensure the questionnaire reliably measured each category, we used Cronbach’s alpha ( ) with
higher values (closer to 1) indicating stronger internal consistency. The Perceived Fairness scale
showed excellent reliability ( = 0.95 ), while Perceived Control was acceptable ( = 0.79 ). The Added
Value/Impact scale had marginal reliability ( = 0.69 ), and the Current Experience scale showed low
consistency ( = 0.57 ), suggesting its items may capture varied aspects or be interpreted inconsistently.</p>
      </sec>
      <sec id="sec-5-3">
        <title>4.3. Subjective user evaluations</title>
        <p>Category Comparison &amp; Correlation. Figure 4 shows the average scores across the four categories.
Added Value scored highest (mid-5s to almost 6), suggesting users found the customization options
beneficial. Perceived Control followed with (around 5.5), indicating participants felt confident adjusting
the system. Perceived Fairness received moderate ratings (around 5), while Current Experience scored
about 3.5, suggesting traditional MRS are perceived worse than our system in terms of fairness and
control.</p>
        <p>Correlations (Table 1) reveal a strong positive relationship between Perceived Fairness and Perceived
Control ( = .84 ), suggesting fairness and agency are closely linked in shaping the user experience.
Fairness also correlated strongly with Added Value ( = .72 ), and Control with Added Value ( = .69 ),
indicating that both fairness and control enhance perception of the system’s usefulness. In contrast,
Current Experience was not significant correlated with any other construct, suggesting that satisfaction
with MRS in general was independent of perceptions of our system’s fairness, control, and added value.</p>
      </sec>
      <sec id="sec-5-4">
        <title>4.4. Behavioral Preference Measures</title>
        <p>Correlations Between Fairness Dimensions. Figure 5 summarizes mean slider positions (0–100)
across participants: Popularity ≈ 45, Genre ≈ 22, Gender ≈ 43, and Nationality ≈ 49. All averages fall
below the neutral midpoint (50), with Genre notably lowest and Nationality nearest to neutrality.</p>
        <p>Correlations between slider values (Table 2) show the strongest link between Gender and Nationality
( = .39 ), indicating that participants favoring gender diversity also diversified by nationality. Moderate
correlations emerged for Genre-Gender ( = .34 ), Popularity-Gender ( = .33 ), and Genre-Popularity
( = .31 ). Popularity-Nationality was not significant, suggesting mainstream versus niche choices were
independent of nationality preferences. These patterns reveal “global fairness” tendencies in some
participants, while others adjusted only one dimension, supporting the value of modular controls.</p>
        <p>Importance of Dimensions. Figure 6 shows shows participants’ post-survey responses on whether
they considered each dimension important, based on “Yes” (important) vs “No” (not important) answers.
Nearly all judged track genre as important, with only one dissenting response. Track popularity
followed, valued by roughly two-thirds of participants. Views on artist gender were more divided,
though a slight majority marked it as important. Artist nationality ranked lowest, with more deeming
it unimportant than important. These preferences contextualize slider behaviors by clarifying which
fairness levers users consciously prioritized.</p>
        <p>Preference Behavior vs. Ratings. Figure 7 compares the slider deviations from the midpoint (50)
between participants who judged a dimension “important” versus “not important”. Greater deviation
reflects stronger engagement, regardless of direction. Across all dimensions, ”important” participants
consistently moved sliders farther from 50, while others stayed near default. Patterns varied: for
Popularity, importance was associated with widely ranging adjustments, whereas non-importance
yielded near-zero deviations. For Genre, nearly all “important” participants moved the slider
substantially, but the low overall average (22; Figure 5) indicates only a limited desire for genre exploration.
Gender showed the strongest polarization: with extreme shifts for importance and minimal change for
non-importance. Nationality followed a similar but less pronounced pattern.</p>
        <p>Mann–Whitney U tests confirmed that absolute slider deviations were significantly greater among
participants who rated a dimension as important for Popularity ( = 83.0,  = .002 ), Gender ( =
66.5,  &lt; .001 ), and Nationality ( = 98.5,  = .003 ). For Genre, the diference was not statistically
significant (  = 0.5,  = .099 ) despite a large efect size, due to the very low number of “unimportant”
participants. Rank-biserial correlations ( ) indicate moderate-to-large efects for all dimensions.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>5. Discussion</title>
      <p>Our findings indicate that transparent, parametric controls substantially enhance user perceptions of
fairness and control, while also revealing that fairness priorities vary widely across individuals.</p>
      <p>Main Findings. Participants reported high levels of perceived fairness ( ≈ 5.0 ,  = .95 ) and
control ( &gt; 5.5 ,  = .79 ), with the two constructs strongly correlated ( = .84 ). This suggests that
in interactive recommender settings, fairness is closely tied to agency: users who felt empowered to
shape recommendations were also those most likely to perceive the system as fair. Both categories
were also positively associated with the sense that the system added value to the listening experience,
with fairness correlating at  = .72 and control at  = .69 . These efects occurred despite the underlying
recommendation system being relatively simple, reinforcing that the parametric design—not algorithmic
complexity—drove perceptions of value. The current experience category, which reflects satisfaction
with traditional MRS was not significantly correlated with any of these categories, and its lower average
indicates participants preferred our system in terms of fairness and control.</p>
      <p>Not all fairness dimensions were weighted equally. Genre diversity emerged as the most universally
important: nearly all participants considered it relevant and adjusted its slider. Nationality diversity, by
contrast, was less important, with many participants leaving this setting unchanged; however, those
who did prioritize it moved the slider significantly more (  = 98.5 ,  = .003 ,  = −0.54 ). Gender and
popularity fell between these extremes. For both, participants who rated these dimensions as important
made deliberate adjustments, with significant diferences for gender (  = 66.5 ,  &lt; .001 ,  = −0.69 ) and
popularity ( = 83.0 ,  = .002 ,  = −0.59 ). These patterns suggest that while some fairness concerns
are widely shared (e.g., genre), others reflect more individual preferences.</p>
      <p>Correlations between fairness dimensions revealed modest associations, such as gender with genre
( = .34 ) and popularity ( = .33 ), and a somewhat stronger relationship between gender and nationality
( = .39 ). This points to a general fairness orientation among some users but also indicates suficient
independence between dimensions to justify treating them as conceptually distinct.</p>
      <p>Overall, these findings indicate that while fairness preferences are diverse and personal, enabling
users to express them explicitly improves their experience and perception of the recommender.</p>
      <p>
        Connection with Existing Literature.The present findings resonate strongly with prior research
on user-centered fairness and transparency in RS. Much of the literature has focused on algorithmic
strategies to improve fairness, but recent work increasingly emphasizes user agency as a determinant
of fairness perceptions [
        <xref ref-type="bibr" rid="ref5">5, 8</xref>
        ]. The strong correlation between perceived fairness and perceived control
( = .84 ) supports claims by Dinnissen and Bauer [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and Burke [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] that fairness is not simply a statistical
outcome but a lived, interpretable experience tied to user influence. This aligns with broader principles
in fairness-aware recommendation, where procedural justice—the ability to influence the process—often
matters as much as distributive outcomes [9, 25].
      </p>
      <p>
        Unlike many prior studies that rely on passive feedback or post hoc surveys, this work used interactive
sliders, enabling users to modulate dimensions such as popularity, gender, nationality, and genre in
real time. This parametric design approach responds to recent calls for more transparent fairness
mechanisms [14, 21], particularly in music domains where taste and identity are tightly coupled [
        <xref ref-type="bibr" rid="ref6">6, 7</xref>
        ].
Whereas traditional re-ranking methods operate invisibly, the interface foregrounded user choices,
aligning with Liang and Willemsen’s [20] concept of “fairness nudges”. The finding that fairness and
control correlated strongly with added value ( = .72 ,  = .69 ) emphasizes that users derive satisfaction
not only from outcomes but also from their ability to shape them.
      </p>
      <p>
        The dimension-specific patterns observed here also connect with earlier findings. Genre diversity’s
near-universal importance echoes evidence that stylistic variety, when aligned with personal taste,
enhances satisfaction and engagement [17, 26]. Nationality, conversely, received the least engagement,
consistent with studies showing mixed user sensitivity to cultural or geographic diversity in
recommendations [16]. The diminished significance of nationality can be partially attributed to the phenomenon of
glocalisation[27]. Gender and popularity, while less broadly endorsed, elicited substantial adjustments
from those who prioritized them, reinforcing the significance of individual fairness preferences in
interactive RS [
        <xref ref-type="bibr" rid="ref3">28, 3</xref>
        ]. These results complement population-level work on bias and diversity in MRS
[
        <xref ref-type="bibr" rid="ref6">10, 7, 6</xref>
        ] by ofering a more granular perspective on the dimensions users themselves are most inclined
to control.
      </p>
      <p>
        Finally, these findings raise questions about the long-term efects of user-controlled fairness. This
study focused on a single interaction; future work could examine whether such control mechanisms
sustain trust, satisfaction, and diversity over repeated sessions [
        <xref ref-type="bibr" rid="ref1">1, 8</xref>
        ]. Expanding the controllable
dimensions (e.g., lyrical content, mood, or tempo) or tailoring sliders to specific user segments (e.g.,
minority or non-mainstream listeners [29]) could further enhance perceived relevance and fairness.
Collectively, these results contribute to the growing body of research that views fairness not as a fixed
property of recommendations, but as a dynamic, user-driven process.
      </p>
      <p>Limitations. The Fairness ( = .95 ) and Control ( = .79 ) scales showed strong reliability, but Added
Value ( = .69 ) and Current Experience ( = .57 ) were less reliable. The study used a homogeneous
sample of young Dutch adults, limiting generalizability. The design lacked a control condition without
fairness sliders, making it unclear how much customization drove the observed benefits. The study
tested only four fairness dimensions. Finally, only a single, brief interaction was studied. Longitudinal
research is needed to examine whether benefits of fairness slider persist or change over time.</p>
    </sec>
    <sec id="sec-7">
      <title>6. Conclusion</title>
      <p>This study explored how user-driven customization of fairness and diversity shapes satisfaction in
MRS. Allowing listeners to adjust four fairness dimensions improved their experience, showing that
parametric design - rather than algorithmic complexity or default settings - drives perceived value.</p>
      <p>Not all fairness dimensions held equal weight. Genre diversity was most frequently prioritized,
while nationality diversity drew the least engagement, though valued by a subset of users. Gender
and popularity fell between, with notable slider adjustments among those rating them as important.
Correlations among fairness dimensions (e.g., gender and nationality) suggest some users adopt a broad
fairness orientation, while others focus on specific axes.</p>
      <p>
        These findings show that modular, user-centric fairness controls bridge the gap between algorithmic
metrics and lived experience. By emphasizing user agency, such systems enhance perceived fairness,
control, and added value. This parametric approach supports viewing fairness as a dynamic negotiation
rather than a fixed rule [
        <xref ref-type="bibr" rid="ref4">14, 4</xref>
        ]. Platforms should consider ofering similar controls, tailoring available
dimensions to individual preferences. Future research should refine measures, diversify samples, add
control conditions, expand fairness axes, and assess efects longitudinally.
      </p>
      <p>In sum, transparent, user-driven fairness controls ofer a promising path to make MRS both equitable
and engaging. By combining live customization with fairness-aware algorithms, MRS can meet fairness
goals while accommodating diverse, subjective listening preferences.</p>
    </sec>
    <sec id="sec-8">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the authors used Grammarly in order to: Grammar and spelling
check. After using this tool, the authors reviewed and edited the content as needed and take full
responsibility for the publication’s content.
[23] P. Pu, L. Chen, R. Hu, A user-centric evaluation framework for recommender systems, in:
Proceedings of the Fifth ACM Conference on Recommender Systems, RecSys ’11, Association
for Computing Machinery, New York, NY, USA, 2011, p. 157–164. URL: https://doi.org/10.1145/
2043932.2043962. doi:10.1145/2043932.2043962.
[24] B. P. Knijnenburg, M. C. Willemsen, Z. Gantner, H. Soncu, C. Newell, Explaining the user experience
of recommender systems, User Modeling and User-Adapted Interaction 22 (2012) 441–504. URL:
https://doi.org/10.1007/s11257-011-9118-4. doi:10.1007/s11257-011-9118-4.
[25] A. Singh, T. Joachims, Policy learning for fairness in ranking, in: Advances in Neural
Information Processing Systems 31 (NeurIPS 2018), 2019. URL: https://arxiv.org/abs/1902.04056.
arXiv:1902.04056.
[26] L. Porcaro, E. Gómez, C. Castillo, Assessing the impact of music recommendation diversity on
listeners: A longitudinal study, 2022. URL: https://arxiv.org/abs/2212.00592. arXiv:2212.00592.
[27] W. Page, C. Dalla Riva, ’Glocalisation’of Music Streaming within and across Europe, Technical</p>
      <p>Report, EIQ Paper, 2023.
[28] G. Farnadi, P. Kouki, S. K. Thompson, S. Srinivasan, L. Getoor, A fairness-aware hybrid
recommender system, 2018. URL: https://arxiv.org/abs/1809.09030. arXiv:1809.09030.
[29] S. Khan, E. Herder, D. Kaya, Experiences of non-mainstream and minority users with music
recommendation systems, 2024. doi:10.18420/muc2024-mci-ws11-141.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>W.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Feng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Nie</surname>
          </string-name>
          , T.-S. Chua,
          <article-title>User-controllable recommendation against filter bubbles</article-title>
          ,
          <source>in: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '22</source>
          ,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          ,
          <year>2022</year>
          , p.
          <fpage>1251</fpage>
          -
          <lpage>1261</lpage>
          . URL: http://dx.doi.org/10.1145/3477495. 3532075. doi:
          <volume>10</volume>
          .1145/3477495.3532075.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M.</given-names>
            <surname>Ekstrand</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Tian</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Azpiazu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ekstrand</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Anuyah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>McNeill</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Pera, All the cool kids, how do they fit in?: Popularity and demographic biases in recommender evaluation and efectiveness</article-title>
          , in: S. Friedler, C. Wilson (Eds.),
          <source>Proceedings of the 1st Conference on Fairness, Accountability and Transparency</source>
          , volume
          <volume>81</volume>
          ,
          <string-name>
            <surname>PMLR</surname>
          </string-name>
          ,
          <year>2018</year>
          , pp.
          <fpage>172</fpage>
          -
          <lpage>186</lpage>
          . URL: https://proceedings.mlr.press/v81/ ekstrand18b.html.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>H.</given-names>
            <surname>Abdollahpouri</surname>
          </string-name>
          ,
          <article-title>Popularity bias in ranking and recommendation</article-title>
          ,
          <source>in: Proceedings of the 2019 AAAI/ACM Conference on AI</source>
          ,
          <string-name>
            <surname>Ethics</surname>
          </string-name>
          , and Society, AIES '19,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery, New York, NY, USA,
          <year>2019</year>
          , p.
          <fpage>529</fpage>
          -
          <lpage>530</lpage>
          . URL: https://doi.org/10.1145/3306618.3314309. doi:
          <volume>10</volume>
          .1145/3306618.3314309.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>K.</given-names>
            <surname>Dinnissen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bauer</surname>
          </string-name>
          ,
          <article-title>Fairness in music recommender systems: a stakeholder-centered mini review</article-title>
          ,
          <source>Frontiers in Big Data</source>
          <volume>5</volume>
          (
          <year>2022</year>
          )
          <fpage>1</fpage>
          -
          <lpage>9</lpage>
          . URL: https://doi.org/10.3389/fdata.
          <year>2022</year>
          .
          <volume>913608</volume>
          . doi:
          <volume>10</volume>
          . 3389/fdata.
          <year>2022</year>
          .
          <volume>913608</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>R.</given-names>
            <surname>Burke</surname>
          </string-name>
          ,
          <article-title>Multisided fairness for recommendation</article-title>
          ,
          <source>FATML'17</source>
          (
          <year>2017</year>
          ). URL: https://arxiv.org/abs/ 1707.00093. doi:
          <volume>10</volume>
          .48550/arXiv.1707.00093. arXiv:
          <volume>1707</volume>
          .
          <fpage>00093</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>D.</given-names>
            <surname>Kowald</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Schedl</surname>
          </string-name>
          ,
          <string-name>
            <surname>E. Lex,</surname>
          </string-name>
          <article-title>The unfairness of popularity bias in music recommendation: A reproducibility study</article-title>
          ,
          <source>in: Advances in Information Retrieval</source>
          , Springer International Publishing, Cham,
          <year>2020</year>
          , pp.
          <fpage>35</fpage>
          -
          <lpage>42</lpage>
          . URL: https://doi.org/10.1007/978-3-
          <fpage>030</fpage>
          -45442-
          <issue>5</issue>
          _5. doi:
          <volume>10</volume>
          .1007/ 978- 3-
          <fpage>030</fpage>
          - 45442-
          <issue>5</issue>
          _
          <fpage>5</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>