<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>September</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Exploring Cross-group Discrepancies in Calibrated Popularity for Accuracy/Fairness Trade-of Optimization ∗</article-title>
      </title-group>
      <pub-date>
        <year>2022</year>
      </pub-date>
      <volume>1</volume>
      <fpage>8</fpage>
      <lpage>23</lpage>
      <abstract>
        <p>Popularity bias is an important issue in recommender systems, as it afects end-users, content creators, and content provider platforms alike. It can cause users to miss out on less popular items that would fit their preference, prevent new content creators from finding their audience, and force providers to pay higher royalties for serving expensive popular content. Over the past years, various approaches to mitigate popularity bias in recommender systems have been proposed. Among them, post-processing methods are widely accepted due to their versatility and ease of implementation. While previous studies have investigated the efects of diferent post-processing techniques on accuracy and fairness of recommendations, the influence of diferent algorithms on diferent user groups have not received much attention in this context. Addressing this research gap, we study the efect of a recent mitigation strategy, Calibrated Popularity, in conjunction with a selection of state-of-the-art recommender algorithms including BPR, ItemKNN, LightGCN, MultiVAE, and NeuMF. We show that these algorithms demonstrate diferent characteristics in terms of the trade-of between accuracy and fairness, both within and between various user groups defined by gender and inclination towards consumption of mainstream items. Finally, we demonstrate how these discrepancies can be exploited to achieve a more efective trade-of between utility and fairness of recommender systems.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 INTRODUCTION</title>
      <sec id="sec-1-1">
        <title>Recommender systems (RSs) are ubiquitous decision support tools, assisting all kinds of users in their personal and business tasks. They help connect content creators and consumers on streaming platforms, suggest products in online stores, and even influence whether a person finds a fitting job. Considering the important role of RSs, it is crucial to monitor societal and statistical biases they often sufer from.</title>
        <p>
          While not all biases are harmful—recommendation results need to be biased in the sense of personalization to match
the end user’s preferences—data, algorithmic, and presentation biases may lead to unfair behavior of RSs, i. e., the
recommender “systematically and unfairly discriminates against certain individuals or groups of individuals in favor of
others” [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. Popularity bias corresponds to the tendency of some RSs to favor popular items over lesser popular items
and is considered to be a harmful phenomenon [
          <xref ref-type="bibr" rid="ref1 ref13 ref16 ref7">1, 7, 13, 16</xref>
          ]. Popularity bias has been a long-studied problem in the RSs
community (e. g., [
          <xref ref-type="bibr" rid="ref17 ref20 ref28 ref3 ref4">3, 4, 17, 20, 28</xref>
          ]). A RS with popularity bias creates recommendation lists with highly popular items
ranked on top, suppressing the exposure of long-tail items. This often leads to low satisfaction of end users (especially
those interested in niche items), unfairly limited exposure of new and niche item producers, and higher expenses for
content providers, as serving popular items on online platforms in most cases spells higher royalties. To measure and
∗Copyright 2022 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
Presented at the MORS workshop held in conjunction with the 16th ACM Conference on Recommender Systems (RecSys), 2022, in Seattle, USA.
†This is the corresponding author.
capture diferent aspects of popularity bias, various metrics have been introduced; for an overview, see Abdollahpouri
et al. [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. In this paper we concentrate on the user side of popularity bias.
        </p>
      </sec>
      <sec id="sec-1-2">
        <title>Over the past years, researchers proposed a multitude of bias mitigation strategies, working on diferent stages</title>
        <p>of the recommendation pipeline. One can distinguish pre-, in- and post-processing methods. The first act before the
main RS, often applying transformations to the data the RS is trained on, in an attempt to make its output less biased.</p>
      </sec>
      <sec id="sec-1-3">
        <title>In-processing methods usually include additional debiasing training objectives, e. g., through adversarial training [9, 21]</title>
        <p>
          or regularization [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ]. Post-processing methods act on the output of the RS, usually by re-ranking recommended items
to satisfy a certain fairness goal. Post-processing bias mitigation techniques have the advantage of versatility, being
independent of the main RS and thus able to work in conjunction with almost any algorithm. In addition, a number
of calibration-based post-processing techniques have been shown efective in popularity bias mitigation for matrix
factorization algorithms.
        </p>
      </sec>
      <sec id="sec-1-4">
        <title>Previous studies have shown that not only diferent RSs vary in the degree they are susceptible to popularity bias, but</title>
        <p>also diferent user groups sufer from it to various extents. These findings lead us to the following research questions
we tackle in this work: RQ1: Is the mitigation technique of post-processing equally efective for all algorithms? Do all
algorithms show the same character of trade-of between utility and calibration? RQ2: Are all user groups equally afected
by the mitigation procedure? Are the optimal mitigation parameters the same for all user groups? RQ3: To which extent can
the trade-of between utility and calibration be softened through using specific mitigation parameters for each user group?</p>
      </sec>
      <sec id="sec-1-5">
        <title>To answer these questions, we pair a recent mitigation strategy, Calibrated Popularity, with an array of recommender</title>
        <p>algorithms, analyzing the mitigation efectiveness and utility-fairness trade-of for each of them. We also consider the
efect of mitigation on diferent user groups, and through this characterize the scoring strategy of each recommendation
algorithm investigated. Finally, we conduct an experiment to evaluate potential gains of mitigation approaches tailored
specifically to each user group.
2</p>
        <p>
          RELATED WORK
Many studies formalize fairness of a RS on the user level through calibration of a certain item attribute (such as genre)
[
          <xref ref-type="bibr" rid="ref24">24</xref>
          ]. Meaning that recommendation is considered fair only when the distribution of the attribute over the recommended
list matches its distribution over some reference list (e. g., each user’s consumption history or the whole list of items in
the collection). A number of studies follow this approach to investigate and enforce fairness of the recommendations
[
          <xref ref-type="bibr" rid="ref14 ref5">5, 14</xref>
          ]. Lesota et al. [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ] study diferences between popularity distributions of consumed and recommended items for
each user, tackling the problem of measuring popularity bias as miscalibration between the two. They express it in
terms of the median as well as several statistical moments and similarity measures. In addition, they combine research
strands on popularity bias and gender bias by analyzing how female and male listeners are afected by popularity bias.
        </p>
      </sec>
      <sec id="sec-1-6">
        <title>Abdollahpouri et al. [3] show that state-of-the-art movie recommendation algorithms sufer from popularity bias, and introduce the delta-GAP metric to quantify the level of underrepresentation. Kowald et al. [16] reproduce these results for music domain.</title>
      </sec>
      <sec id="sec-1-7">
        <title>Works on bias mitigation often adopt post-processing strategies. Post-processing is a widely used family of bias</title>
        <p>mitigation techniques. They operate on the output of a RS, re-ranking the items, striving to create a list that satisfies
both utility and fairness objectives. A big advantage of post-processing is its flexibility to be used with almost any</p>
      </sec>
      <sec id="sec-1-8">
        <title>RS algorithm. A multitude of post-processing techniques for popularity bias mitigation have been proposed over the</title>
        <p>
          years. Abdollahpouri et al. [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] propose an algorithm calibrating proportions of head and tail items from the overall
popularity distribution. Zehlike et al. [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ] propose FA*IR, a method for boosting exposure of items of some protected
2
category (of low popularity). Abdollahpouri et al. [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] take a more user-oriented approach, called Calibrated Popularity
(CP), calibrating three-bin item popularity distributions between user consumption history and their recommendations.
        </p>
      </sec>
      <sec id="sec-1-9">
        <title>Klimashevskaia et al. [15] take a wide perspective on post-processing popularity bias mitigation techniques and analyze</title>
        <p>them on both platform-wide and user-preference levels. They show that CP is preferable for providing fairness on
per-user level.</p>
        <p>
          Usually bias mitigation algorithms allow to adjust the weight distribution between the utility and fairness objectives.
da Silva et al. [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] take this idea further, experimenting with learning personal weighting for every user to ensure proper
genre diversity in the recommendation lists. To the best of our knowledge, this approach has not been adopted for
popularity bias mitigation. In addition, most studies presenting mitigation techniques limit their demonstration to a
narrow scope of algorithms and mainly consider the population of users as a whole. We address these limitations in
our research, by (1) conducting a set of bias mitigation experiments on state-of-the-art RSs of diferent architectures,
(2) considering two ways of user grouping as well as the whole populations, and (3) carrying out an experiment with
learning bias mitigation weights for every group separately. We investigate two datasets: MovieLens-1M (ML-1M) [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]
from the movie domain and LFM-2b [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ] from the music domain.
3
        </p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>METHODOLOGY</title>
      <sec id="sec-2-1">
        <title>We base our study on the common assumption that consumers prefer calibrated recommendations [24], i. e., the distribution of item popularity in a user’s recommendation list should match that of their interaction history. We investigate the trade-of between popularity calibration and utility of recommendations considering diferent recommender algorithms, user groups, and settings of the mitigation technique.</title>
      </sec>
      <sec id="sec-2-2">
        <title>Item Popularity. Following common practice [4, 17], we define popularity of each item through the number of</title>
        <p>interactions with it. We distinguish Popular, Niche, and Mid categories of items. Popular items are represented by
items most interacted with and jointly receiving 20% of all user-item interactions. Similarly, Niche items are the least
interacted with, receiving 20% of aggregated user-item interactions. The rest of items falls into the category Mid.</p>
      </sec>
      <sec id="sec-2-3">
        <title>User Groups. This work concerns both overall user population as well as specific user groups. We investigate two</title>
        <p>
          ways of user grouping: by users’ gender1 and by their inclination towards consumption of popular items. With the latter,
similar to [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], we define three user groups: HighPop, MidPop, and LowPop, based on the proportion of popular items in
their consumption histories. The groups are defined by sorting users in descending order with respect to the proportion
of popular items they consume, and then selecting the top 20%, mid 60%, and bottom 20% of users, respectively.
        </p>
      </sec>
      <sec id="sec-2-4">
        <title>Metrics. In this work, we consider the trade-of between recommenders’ utility expressed through NDCG @10 metric</title>
        <p>and their proneness to popularity bias. Following previous work, we define the latter on per-user level as Jensen-Shannon</p>
      </sec>
      <sec id="sec-2-5">
        <title>Divergence between the popularity distribution of a user’s already consumed items and the top 10 recommended items.</title>
        <p>If  and  are item popularity (probability) distributions of consumption history and recommendation for user ,
respectively, we calculate Jensen-Shannon Divergence as:</p>
        <p>
          (,  ) = 21 ∑︁  ()2  (2)+() () + ∑︁  ()2  (2)+() () (1)
where  () is the proportion of items of popularity category  in the consumption history of user .   can be seen
as symmetrical version of Kullback–Leibler divergence. Note that using 2 we ensure that the value of   is bound
!
between 0 and 1 [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]. We express the degree of exposure to popularity bias of a user group  as  , defined as the
average   over all users in .
        </p>
      </sec>
      <sec id="sec-2-6">
        <title>Bias Mitigation Technique. Calibrated Popularity is a recent post-processing technique for popularity bias mitiga</title>
        <p>
          tion [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. It re-ranks a recommendation list ′ of  items initially recommended to each user, to create a personalized
popularity-aware recommendation list ∗ of  items ( &lt;&lt; ):
∗ = arg max (1 − ) · ( ) −  ·   (,  ( )) (2)
        </p>
        <p>, | |=
where  ( ) is the sum of relevance scores and  ( ) the item popularity distribution of the  candidate item list.</p>
      </sec>
      <sec id="sec-2-7">
        <title>To ensure consistency of the mitigation procedure across diferent recommenders, we re-scale the relevance scores</title>
        <p>
          constituting  ( ) to the interval [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ] where needed. The parameter  allows to prioritize between the utility (first
term) and bias mitigation (second term) objectives. Similar to [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] the final recommendation list ∗ is created through
the process of greedy optimization.
        </p>
      </sec>
      <sec id="sec-2-8">
        <title>Choosing Optimal Mitigation Parameters. To illustrate potential gains of choosing group-specific mitigation parameters</title>
        <p>for diferent user groups we introduce a way of selecting an optimal value of parameter  taking into account both
utility and fairness of the recommendation. For ease of notation, we denote   (,  ) = 1 −   (,  ) a fairness
measure; similar to NDCG, higher values are better. We define the optimal value of  for a user group  as follows:
 = arg max NDCG ·   (3)</p>
        <p>
          ∈ [
          <xref ref-type="bibr" rid="ref1">0,1</xref>
          ] NDCG +
        </p>
      </sec>
      <sec id="sec-2-9">
        <title>In other words, for a given group  we select  to maximize the harmonic mean between utility (NDCG) and fairness</title>
        <p>
          (JSF) for the group. The selection is done by conducting a grid search on the interval [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ].
4
        </p>
        <p>EXPERIMENT SETUP</p>
        <p>Datasets. We investigate two datasets, MovieLens-1M (ML-1M) 2 in the movie domain and LFM-2b 3 in the music
domain. The former provides ratings for 6K users and almost 4K movies. The latter is a Last.fm music listening dataset,
which we modify to fit our experimental setup. Firstly, we consider only listening events in the year 2019 of users with
meta-information regarding age, gender and country. Secondly, all user–item interactions with a playcount () of
&lt; 2 are removed to reduce the number of spurious interactions and noise. Thirdly, we only consider tracks that were
listened to by at least 5 diferent users (constraint 1) and we only consider users who have listened to at least 5 diferent
tracks (constraint 2). Lastly, we treat each user–item interaction in a binary way – 1 if the user has listened to a track, 0
otherwise. Furthermore, we sample 100K tracks uniformly-at-random to ensure items of diferent characteristics are
equally likely to be included in the final subset. We then reinforce constraints 1 and 2. This ultimately results in almost
10K users retained with a total of 10.7M listening events. See Table 1 for details.</p>
        <p>
          Algorithms and Baselines. We study popular collaborative filtering algorithms (i. e., neighborhood-based, neural
matrix factorization, autoencoders, and graph convolution networks), briefly described in the following. For consistency,
we use the algorithm implementations from the Recbole framework [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ].4 These are: Bayesian Personalized Ranking
(BPR) [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ] adopts an optimization function that ranks the items consumed by the users according to their preferences,
by defining an implicit order between pairs of items. Item k-Nearest Neighbors (KNN) [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] recommends items based on
item-to-item similarity. Specifically, an item is recommended to a user if it is similar (in terms of ratings or interactions)
        </p>
      </sec>
      <sec id="sec-2-10">
        <title>2https://grouplens.org/datasets/movielens/1m</title>
      </sec>
      <sec id="sec-2-11">
        <title>3http://www.cp.jku.at/datasets/LFM-2b</title>
      </sec>
      <sec id="sec-2-12">
        <title>4https://recbole.io/</title>
        <sec id="sec-2-12-1">
          <title>Dataset</title>
          <p>LFM-2b</p>
        </sec>
        <sec id="sec-2-12-2">
          <title>MovieLens-1M</title>
        </sec>
        <sec id="sec-2-12-3">
          <title>Demographic All</title>
        </sec>
        <sec id="sec-2-12-4">
          <title>Female</title>
        </sec>
        <sec id="sec-2-12-5">
          <title>Male All</title>
        </sec>
        <sec id="sec-2-12-6">
          <title>Female</title>
        </sec>
        <sec id="sec-2-12-7">
          <title>Male</title>
          <p>ItemKNN</p>
          <p>LightGCN</p>
          <p>MultiVAE</p>
          <p>NeuMF
Exploring Cross-group Discrepancies in Calibrated Popularity for
Accuracy/Fairness Trade-of Optimization
LightGCN</p>
          <p>LightGCN</p>
          <p>.14
BPR</p>
          <p>.10
BPR
.10
BPR
.15
.10
.05
.2
.1
0
.1
.25
.20
D
JS.15
.10
.2
D
S
J
.1
.2
D
S
J
.1
.20
.15
.10
.05
.2
.1
.2
.1
.24
.25
.13
.14
.15
.09
.10
.11
.04
ItemKNN</p>
          <p>MultiVAE</p>
          <p>
            NeuMF
.10
.24
.10
.04
on the basic matrix factorization approach but replaces the inner product with a neural architecture that can learn an
arbitrary function from the interaction data. Variational Autoencoders (MultiVAE) [
            <xref ref-type="bibr" rid="ref18">18</xref>
            ] estimates a probability
distribution over all items, given the user’s interaction vector.
          </p>
        </sec>
      </sec>
      <sec id="sec-2-13">
        <title>Training and Evaluation. To evaluate the aforementioned algorithms, we partition the interactions of each user in</title>
        <p>train/validation/test groups with a 60-20-20 ratio split. Therefore, 60% of all users’ interactions are used to train the
algorithms. We maximize the NDCG @10 metric over the validation set. All results are reported for the test set.</p>
        <p>Popularity Bias Mitigation. We conduct a series of tests comparing utility and fairness of recommendation lists
produced by the above mentioned algorithms and re-ranked by the CP post-processing mitigation technique with</p>
        <p>LowPop
MidPop</p>
        <p>HighPop
.04</p>
        <p>NeuMF
Male
Female
.04
D
S
J
.02
.04
D
S
J.02</p>
        <p>BPR
.316
BPR
.04
.02
.04
.02
.04
.02
.04
.02
.04
.02
.04
.02
ItemKNN</p>
        <p>LightGCN</p>
        <p>MultiVAE</p>
        <p>NeuMF
.344
.348</p>
        <p>.352
ItemKNN
.352 .356
nDCG@10
LightGCN
diferent settings. For every algorithm 5, we re-rank the list of top 100 recommendations to create the final list of 10
items for each user. We test it for ten values of the weighting parameter  from 0 (no mitigation) to 0.9 (weight 0.9 to
the fairness objective and 0.1 to utility) with step size 0.1. We aim to exploit potential diferences in the way various
user groups respond to the popularity bias mitigation to achieve a better trade-of between utility and fairness. To this
end, we split users uniformly-at-random into train and test sets of the same size striving to ensure all user groups are
represented in both. We search for optimal values of  for each user group and the whole population using the criterion
in Equation 3 on the train set. We then compute the new recommendation lists for the test users, applying mitigation
with the weights learned for their corresponding user groups.
5 RESULTS</p>
      </sec>
      <sec id="sec-2-14">
        <title>We approach RQ1 by analyzing popularity calibration over various factors, shown in Figures 1 and 2. The figures report</title>
        <p>NDCG (utility measure) against   (popularity bias measure) for every recommendation algorithm and the ten values
of , respectively, for LFM-2b and ML-1M dataset. On the plots, the opacity of the points corresponds to the value of
, such that the palest show the results of  = 0. Let us first look at the the top row of the plots in each figure which
shows the results achieved on the whole population of the dataset. On LFM-2b we notice diferences in behavior of
recommended lists produced by diferent recommendation algorithms. In particular, KNN shows an increase in NDCG
combined with an improvement in fairness at  = 0.1. BPR and LightGCN show steady progress towards debiased
results of lower utility with growing . At the same time, MultiVAE and NeuMF demonstrate a notably larger drop in
utility over the first step (from  = 0 to 0.1). Considering that every point on each plot corresponds to a new set of
items recommended to most of the users, we can examine the quality of the top 100 recommendation lists produced by
diferent algorithms. In this regard, a smooth decay of NDCG and   signify a better overall quality of the top 100 list
as it allows to debias the recommendation gradually without a sudden drop in utility. A sudden drop in both metrics
however would mean that the achieved utility to a large degree comes from concentration of the popular items at the</p>
      </sec>
      <sec id="sec-2-15">
        <title>5We re-scale relevance scores provided by the algorithms to the interval [0, 1] in order to ensure comparability of mitigation results, see Equation 2.</title>
        <p>6
the third row of Figure 1 show the results of the users grouped by genders6. Analyzing the results according to the
mainstreamness groups for LFM-2b, we observe initially all algorithms provide the best utility to the HighPop group,
the group most exposed to popularity bias varies from model to model. BPR, LightGCN and KNN show that LowPop
user group benefit from the mitigation method in terms of utility. KNN also shows the same for the MidPop group. In
most cases, the HighPop group experiences the largest drop in utility through popularity bias mitigation. These findings
support our hypothesis that selecting mitigation parameters separately for each user group potentially improves the
trade-of between utility and fairness. On the ML-1M dataset we see that all three groups maintain a certain level
of utility, while steadily decreasing the bias metric, showing that all five algorithms on this dataset can successfully
achieve good results in terms of bias mitigation while maintaining utility. Considering the results on genders, we do
not observe a notable diference in the overall patterns between the genders.</p>
        <p>Finally addressing RQ3, Table 2 shows the results of the experiment with group-specific values of . For every
algorithm and dataset, we report utility and bias metric under four conditions:  = 0 no mitigation,  where one
optimal parameter value is selected for the whole population,  where specific optimal parameter value selected for
each popularity inclination user group, and finally  indicating the selection of specific parameter value for each
gender. We observe that, except for NeuMF on ML-1M, group-specific s provide the best result on bias metric and
trade-of between utility and fairness, namely a lower value of   together with utility staying on the same level or
slightly decreasing. Among all algorithms LightGCN has shown the lowest sensitivity to the mitigation, as its values
do not notably drop in either utility or bias metrics. All algorithms show low bias metrics without any mitigation on</p>
      </sec>
      <sec id="sec-2-16">
        <title>ML-1M, nevertheless leveraging group specific  allows to improve trade-of between utility and fairness for BPR and</title>
      </sec>
      <sec id="sec-2-17">
        <title>NeuMF.</title>
      </sec>
      <sec id="sec-2-18">
        <title>6We do not report this grouping for ML-1M as all five algorithms display the same behavior for both genders: steady decrease of bias metric with only</title>
        <p>slight decrease in utility.
6</p>
        <p>CONCLUSION AND FUTURE WORK
We explore the efectiveness of a post-processing popularity bias mitigation technique, Calibrated Popularity, applied to
an array of state-of-the-art recommendation algorithms. We conduct experiments on two datasets from the music and
movie domain, of diferent size and sparsity, considering how various user groups (defined by gender and
mainstreaminess) are afected by bias mitigation. The larger music dataset LFM-2b exposes discrepancies in behavior of diferent
algorithms. NeuMF and MultiVAE show a notable drop in utility and bias metrics even with light bias mitigation
settings. KNN shows an increase in utility with moderate bias mitigation applied. We also show that for BPR, KNN,
and LightGCN users least interested in popular items can benefit in terms of utility from popularity bias mitigation
as opposed to other users. Our experiments show that diferent user groups respond to the mitigation diferently
depending on their inclination towards consumption of popular content. We also found responses of diferent gender
groups relatively similar. Finally, we conduct an experiment showing that selecting mitigation parameters individually
for every user group (by interest towards popular items) leads to a better trade-of between utility and fairness overall.</p>
      </sec>
      <sec id="sec-2-19">
        <title>In future work, other mitigation strategies as well as criteria for selecting optimal mitigation parameters could be tested.</title>
      </sec>
      <sec id="sec-2-20">
        <title>Also, additional user groups could be addressed, including choosing individual settings for each user.</title>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>ACKNOWLEDGMENTS</title>
      <p>LIT-2021-YOU-215.</p>
      <sec id="sec-3-1">
        <title>This work received financial support by the Austrian Science Fund (FWF): P33526 and DFH-23; and by the State of</title>
      </sec>
      <sec id="sec-3-2">
        <title>Upper Austria and the Federal Ministry of Education, Science, and Research, through grants LIT-2020-9-SEE-113 and</title>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Himan</given-names>
            <surname>Abdollahpouri</surname>
          </string-name>
          , Robin Burke, and
          <string-name>
            <given-names>Bamshad</given-names>
            <surname>Mobasher</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Controlling popularity bias in learning-to-rank recommendation</article-title>
          .
          <source>In Proceedings of the eleventh ACM conference on recommender systems</source>
          .
          <volume>42</volume>
          -
          <fpage>46</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Himan</given-names>
            <surname>Abdollahpouri</surname>
          </string-name>
          , Robin Burke, and
          <string-name>
            <given-names>Bamshad</given-names>
            <surname>Mobasher</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Managing popularity bias in recommender systems with personalized re-ranking. In The thirty-second international flairs conference</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Himan</given-names>
            <surname>Abdollahpouri</surname>
          </string-name>
          , Masoud Mansoury, Robin Burke, and
          <string-name>
            <given-names>Bamshad</given-names>
            <surname>Mobasher</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>The Unfairness of Popularity Bias in Recommendation</article-title>
          .
          <source>In Proceedings of the Workshop on Recommendation in Multi-stakeholder Environments co-located with the 13th ACM Conference on Recommender Systems (RecSys</source>
          <year>2019</year>
          ), Copenhagen, Denmark,
          <year>September 20</year>
          ,
          <source>2019 (CEUR Workshop Proceedings</source>
          , Vol.
          <volume>2440</volume>
          ).
          <article-title>CEUR-WS.org</article-title>
          . http://ceur-ws.org/Vol2440/paper4.pdf
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Himan</given-names>
            <surname>Abdollahpouri</surname>
          </string-name>
          , Masoud Mansoury, Robin Burke, Bamshad Mobasher, and
          <string-name>
            <surname>Edward</surname>
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Malthouse</surname>
          </string-name>
          .
          <year>2021</year>
          .
          <article-title>User-centered Evaluation of Popularity Bias in Recommender Systems</article-title>
          .
          <source>In Proceedings of the 29th ACM Conference on User Modeling, Adaptation and Personalization</source>
          ,
          <string-name>
            <surname>UMAP</surname>
          </string-name>
          <year>2021</year>
          ,
          <article-title>Utrecht, The Netherlands</article-title>
          , June,
          <fpage>21</fpage>
          -
          <lpage>25</lpage>
          ,
          <year>2021</year>
          ,
          <string-name>
            <given-names>Judith</given-names>
            <surname>Masthof</surname>
          </string-name>
          , Eelco Herder, Nava Tintarev, and Marko Tkalcic (Eds.). ACM,
          <volume>119</volume>
          -
          <fpage>129</fpage>
          . https: //doi.org/10.1145/3450613.3456821
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5] Diego Corrêa da Silva,
          <source>Marcelo Garcia Manzato, and Frederico Araújo Durão</source>
          .
          <year>2021</year>
          .
          <article-title>Exploiting personalized calibration and metrics for fairness recommendation</article-title>
          .
          <source>Expert Systems with Applications</source>
          <volume>181</volume>
          (
          <year>2021</year>
          ),
          <volume>115112</volume>
          . https://doi.org/10.1016/j.eswa.
          <year>2021</year>
          .115112
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Mukund</given-names>
            <surname>Deshpande</surname>
          </string-name>
          and
          <string-name>
            <given-names>George</given-names>
            <surname>Karypis</surname>
          </string-name>
          .
          <year>2004</year>
          .
          <article-title>Item-Based Top-N Recommendation Algorithms</article-title>
          .
          <source>ACM Trans. Inf. Syst</source>
          .
          <volume>22</volume>
          ,
          <issue>1</issue>
          (jan
          <year>2004</year>
          ),
          <fpage>143</fpage>
          -
          <lpage>177</lpage>
          . https://doi.org/10.1145/963770.963776
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Michael</surname>
            <given-names>D</given-names>
          </string-name>
          <string-name>
            <surname>Ekstrand</surname>
          </string-name>
          , Mucun Tian, Ion Madrazo Azpiazu,
          <string-name>
            <surname>Jennifer D Ekstrand</surname>
          </string-name>
          , Oghenemaro Anuyah,
          <string-name>
            <surname>David McNeill</surname>
            ,
            <given-names>and Maria Soledad</given-names>
          </string-name>
          <string-name>
            <surname>Pera</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>All the cool kids, how do they fit in?: Popularity and demographic biases in recommender evaluation and efectiveness</article-title>
          . In Conference on Fairness,
          <source>Accountability and Transparency</source>
          .
          <volume>172</volume>
          -
          <fpage>186</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Batya</given-names>
            <surname>Friedman</surname>
          </string-name>
          and
          <string-name>
            <given-names>Helen</given-names>
            <surname>Nissenbaum</surname>
          </string-name>
          .
          <year>1996</year>
          .
          <article-title>Bias in Computer Systems</article-title>
          .
          <source>ACM Trans. Inf. Syst</source>
          .
          <volume>14</volume>
          ,
          <issue>3</issue>
          (
          <year>1996</year>
          ),
          <fpage>330</fpage>
          -
          <lpage>347</lpage>
          . https://doi.org/10.1145/230538. 230561
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Christian</given-names>
            <surname>Ganhör</surname>
          </string-name>
          , David Penz,
          <string-name>
            <given-names>Navid</given-names>
            <surname>Rekabsaz</surname>
          </string-name>
          , Oleg Lesota, and
          <string-name>
            <given-names>Markus</given-names>
            <surname>Schedl</surname>
          </string-name>
          .
          <year>2022</year>
          .
          <article-title>Unlearning Protected User Attributes in Recommendations with Adversarial Training</article-title>
          .
          <source>In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval</source>
          (Madrid, Spain) (
          <source>SIGIR '22)</source>
          .
          <article-title>Association for Computing Machinery</article-title>
          , New York, NY, USA,
          <fpage>2142</fpage>
          -
          <lpage>2147</lpage>
          . https://doi.org/10.1145/3477495.3531820
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>F Maxwell</given-names>
            <surname>Harper and Joseph A Konstan</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>The movielens datasets: History and context</article-title>
          .
          <source>Acm transactions on interactive intelligent systems (tiis) 5</source>
          ,
          <issue>4</issue>
          (
          <year>2015</year>
          ),
          <fpage>1</fpage>
          -
          <lpage>19</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Xiangnan</surname>
            <given-names>He</given-names>
          </string-name>
          , Kuan Deng, Xiang Wang,
          <string-name>
            <given-names>Yan</given-names>
            <surname>Li</surname>
          </string-name>
          , YongDong Zhang, and
          <string-name>
            <given-names>Meng</given-names>
            <surname>Wang</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>LightGCN: Simplifying and Powering Graph Convolution Network for Recommendation. Association for Computing Machinery</article-title>
          , New York, NY, USA,
          <fpage>639</fpage>
          -
          <lpage>648</lpage>
          . https://doi.org/10.1145/3397271.3401063
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Xiangnan</surname>
            <given-names>He</given-names>
          </string-name>
          , Lizi Liao, Hanwang Zhang, Liqiang Nie, Xia Hu, and
          <string-name>
            <surname>Tat-Seng Chua</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Neural Collaborative Filtering</article-title>
          .
          <source>In Proceedings of the 26th International Conference on World Wide Web (Perth, Australia) (WWW '17)</source>
          .
          <source>International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, CHE</source>
          ,
          <fpage>173</fpage>
          -
          <lpage>182</lpage>
          . https://doi.org/10.1145/3038912.3052569
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Dietmar</surname>
            <given-names>Jannach</given-names>
          </string-name>
          , Lukas Lerche, Iman Kamehkhosh, and
          <string-name>
            <given-names>Michael</given-names>
            <surname>Jugovac</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>What recommenders recommend: an analysis of recommendation biases and possible countermeasures</article-title>
          .
          <source>User Modeling and User-Adapted Interaction 25</source>
          ,
          <issue>5</issue>
          (
          <year>2015</year>
          ),
          <fpage>427</fpage>
          -
          <lpage>491</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>Michael</given-names>
            <surname>Jugovac</surname>
          </string-name>
          , Dietmar Jannach, and
          <string-name>
            <given-names>Lukas</given-names>
            <surname>Lerche</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Eficient optimization of multiple recommendation quality factors according to individual user tendencies</article-title>
          .
          <source>Expert Systems with Applications</source>
          <volume>81</volume>
          (
          <year>2017</year>
          ),
          <fpage>321</fpage>
          -
          <lpage>331</lpage>
          . https://doi.org/10.1016/j.eswa.
          <year>2017</year>
          .
          <volume>03</volume>
          .055
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Anastasiia</surname>
            <given-names>Klimashevskaia</given-names>
          </string-name>
          , Mehdi Elahi, Dietmar Jannach, Christoph Trattner, and
          <string-name>
            <given-names>Lars</given-names>
            <surname>Skjaerven</surname>
          </string-name>
          .
          <year>2022</year>
          .
          <article-title>Mitigating Popularity Bias in Recommendation: Potential and Limits of Calibration Approaches</article-title>
          .
          <fpage>82</fpage>
          -
          <lpage>90</lpage>
          . https://doi.org/10.1007/978-3-
          <fpage>031</fpage>
          -09316-
          <issue>6</issue>
          _
          <fpage>8</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Dominik</surname>
            <given-names>Kowald</given-names>
          </string-name>
          , Markus Schedl, and
          <string-name>
            <given-names>Elisabeth</given-names>
            <surname>Lex</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>The Unfairness of Popularity Bias in Music Recommendation: A Reproducibility Study</article-title>
          .
          <source>In Advances in Information Retrieval - 42nd European Conference on IR Research</source>
          , ECIR
          <year>2020</year>
          , Lisbon, Portugal,
          <source>April 14-17</source>
          ,
          <year>2020</year>
          , Proceedings,
          <source>Part II (Lecture Notes in Computer Science</source>
          , Vol.
          <volume>12036</volume>
          ). Springer,
          <fpage>35</fpage>
          -
          <lpage>42</lpage>
          . https://doi.org/10.1007/978-3-
          <fpage>030</fpage>
          -45442-
          <issue>5</issue>
          _
          <fpage>5</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Oleg</surname>
            <given-names>Lesota</given-names>
          </string-name>
          , Alessandro B.
          <string-name>
            <surname>Melchiorre</surname>
            , Navid Rekabsaz, Stefan Brandl, Dominik Kowald, Elisabeth Lex, and
            <given-names>Markus</given-names>
          </string-name>
          <string-name>
            <surname>Schedl</surname>
          </string-name>
          .
          <year>2021</year>
          .
          <article-title>Analyzing Item Popularity Bias of Music Recommender Systems: Are Diferent Genders Equally Afected?</article-title>
          .
          <source>In RecSys '21: Fifteenth ACM Conference on Recommender Systems, Amsterdam, The Netherlands, 27 September 2021 - 1 October</source>
          <year>2021</year>
          ,
          <article-title>Humberto Jesús Corona Pampín, Martha A</article-title>
          .
          <string-name>
            <surname>Larson</surname>
          </string-name>
          , Martijn C.
          <article-title>Willemsen, Joseph A</article-title>
          .
          <string-name>
            <surname>Konstan</surname>
          </string-name>
          ,
          <string-name>
            <surname>Julian J. McAuley</surname>
          </string-name>
          ,
          <string-name>
            <surname>Jean</surname>
            Garcia-Gathright,
            <given-names>Bouke</given-names>
          </string-name>
          <string-name>
            <surname>Huurnink</surname>
          </string-name>
          , and Even Oldridge (Eds.). ACM,
          <volume>601</volume>
          -
          <fpage>606</fpage>
          . https://doi.org/10.1145/3460231.3478843
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Dawen</surname>
            <given-names>Liang</given-names>
          </string-name>
          , Rahul G. Krishnan,
          <string-name>
            <given-names>Matthew D.</given-names>
            <surname>Hofman</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Tony</given-names>
            <surname>Jebara</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Variational Autoencoders for Collaborative Filtering</article-title>
          .
          <source>In Proceedings of the 2018 World Wide Web Conference (Lyon, France) (WWW '18)</source>
          .
          <source>International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, CHE</source>
          ,
          <fpage>689</fpage>
          -
          <lpage>698</lpage>
          . https://doi.org/10.1145/3178876.3186150
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>J.</given-names>
            <surname>Lin</surname>
          </string-name>
          .
          <year>1991</year>
          .
          <article-title>Divergence measures based on the Shannon entropy</article-title>
          .
          <source>IEEE Transactions on Information Theory 37</source>
          ,
          <issue>1</issue>
          (
          <year>1991</year>
          ),
          <fpage>145</fpage>
          -
          <lpage>151</lpage>
          . https: //doi.org/10.1109/18.61115
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>Masoud</surname>
            <given-names>Mansoury</given-names>
          </string-name>
          , Himan Abdollahpouri, Mykola Pechenizkiy, Bamshad Mobasher, and
          <string-name>
            <given-names>Robin</given-names>
            <surname>Burke</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>Feedback Loop and Bias Amplification in Recommender Systems</article-title>
          .
          <source>In CIKM '20: The 29th ACM International Conference on Information and Knowledge Management</source>
          , Virtual Event, Ireland,
          <source>October 19-23</source>
          ,
          <year>2020</year>
          , Mathieu d'Aquin, Stefan Dietze, Claudia Hauf, Edward Curry, and
          <string-name>
            <surname>Philippe</surname>
          </string-name>
          Cudré-Mauroux (Eds.). ACM,
          <volume>2145</volume>
          -
          <fpage>2148</fpage>
          . https: //doi.org/10.1145/3340531.3412152
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Navid</surname>
            <given-names>Rekabsaz</given-names>
          </string-name>
          , Simone Kopeinik, and
          <string-name>
            <given-names>Markus</given-names>
            <surname>Schedl</surname>
          </string-name>
          .
          <year>2021</year>
          .
          <article-title>Societal Biases in Retrieved Contents: Measurement Framework and Adversarial Mitigation of BERT Rankers</article-title>
          .
          <source>In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (Virtual Event, Canada)</source>
          .
          <fpage>306</fpage>
          -
          <lpage>316</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <surname>Stefen</surname>
            <given-names>Rendle</given-names>
          </string-name>
          , Christoph Freudenthaler, Zeno Gantner, and
          <string-name>
            <surname>Lars</surname>
          </string-name>
          Schmidt-Thieme.
          <year>2009</year>
          .
          <article-title>BPR: Bayesian Personalized Ranking from Implicit Feedback</article-title>
          .
          <source>In Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence</source>
          (Montreal, Quebec, Canada) (
          <source>UAI '09)</source>
          . AUAI Press, Arlington, Virginia, USA,
          <fpage>452</fpage>
          -
          <lpage>461</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <surname>Markus</surname>
            <given-names>Schedl</given-names>
          </string-name>
          , Stefan Brandl, Oleg Lesota, Emilia Parada-Cabaleiro, David Penz,
          <string-name>
            <given-names>and Navid</given-names>
            <surname>Rekabsaz</surname>
          </string-name>
          .
          <year>2022</year>
          .
          <article-title>LFM-2b: A Dataset of Enriched Music Listening Events for Recommender Systems Research and Fairness Analysis</article-title>
          .
          <source>In ACM SIGIR Conference on Human Information Interaction and Retrieval</source>
          (Regensburg, Germany) (
          <source>CHIIR '22)</source>
          .
          <article-title>Association for Computing Machinery</article-title>
          , New York, NY, USA,
          <fpage>337</fpage>
          -
          <lpage>341</lpage>
          . https://doi.org/10.1145/3498366.3505791
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>Harald</given-names>
            <surname>Steck</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Calibrated recommendations</article-title>
          .
          <source>In Proceedings of the 12th ACM Conference on Recommender Systems, RecSys</source>
          <year>2018</year>
          , Vancouver, BC, Canada, October 2-
          <issue>7</issue>
          ,
          <year>2018</year>
          ,
          <string-name>
            <given-names>Sole</given-names>
            <surname>Pera</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Michael D.</given-names>
            <surname>Ekstrand</surname>
          </string-name>
          , Xavier Amatriain, and
          <string-name>
            <surname>John O'Donovan</surname>
          </string-name>
          (Eds.). ACM,
          <volume>154</volume>
          -
          <fpage>162</fpage>
          . https://doi.org/10.1145/ 3240323.3240372
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>Meike</surname>
            <given-names>Zehlike</given-names>
          </string-name>
          , Francesco Bonchi, Carlos Castillo, Sara Hajian, Mohamed Megahed, and
          <string-name>
            <surname>Ricardo</surname>
          </string-name>
          Baeza-Yates.
          <year>2017</year>
          .
          <article-title>FA*IR: A Fair Top-k Ranking Algorithm</article-title>
          .
          <source>In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management (Singapore, Singapore) (CIKM '17)</source>
          .
          <article-title>Association for Computing Machinery</article-title>
          , New York, NY, USA,
          <fpage>1569</fpage>
          -
          <lpage>1578</lpage>
          . https://doi.org/10.1145/3132847.3132938
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <surname>George</surname>
            <given-names>Zerveas</given-names>
          </string-name>
          , Navid Rekabsaz, Daniel Cohen, and
          <string-name>
            <given-names>Carsten</given-names>
            <surname>Eickhof</surname>
          </string-name>
          .
          <year>2022</year>
          .
          <article-title>Mitigating Bias in Search Results Through Contextual Document Reranking and Neutrality Regularization</article-title>
          .
          <source>In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval</source>
          (Madrid, Spain) (
          <source>SIGIR '22)</source>
          .
          <article-title>Association for Computing Machinery</article-title>
          , New York, NY, USA,
          <fpage>2532</fpage>
          -
          <lpage>2538</lpage>
          . https://doi.org/10.1145/3477495.3531891
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>Wayne</given-names>
            <surname>Xin</surname>
          </string-name>
          <string-name>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Shanlei</given-names>
            <surname>Mu</surname>
          </string-name>
          , Yupeng Hou, Zihan Lin,
          <string-name>
            <given-names>Kaiyuan</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Yushuo</given-names>
            <surname>Chen</surname>
          </string-name>
          , Yujie Lu, Hui Wang, Changxin Tian, Xingyu Pan, Yingqian Min, Zhichao Feng, Xinyan Fan, Xu Chen,
          <string-name>
            <surname>Pengfei</surname>
            <given-names>Wang</given-names>
          </string-name>
          , Wendi Ji,
          <string-name>
            <given-names>Yaliang</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Xiaoling</given-names>
            <surname>Wang</surname>
          </string-name>
          , and
          <string-name>
            <surname>Ji-Rong Wen</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>RecBole: Towards a Unified, Comprehensive and Eficient Framework for Recommendation Algorithms</article-title>
          . CoRR abs/
          <year>2011</year>
          .01731 (
          <year>2020</year>
          ). arXiv:
          <year>2011</year>
          .01731 https: //arxiv.org/abs/
          <year>2011</year>
          .01731
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <surname>Ziwei</surname>
            <given-names>Zhu</given-names>
          </string-name>
          , Yun He,
          <string-name>
            <surname>Xing Zhao</surname>
            ,
            <given-names>and James</given-names>
          </string-name>
          <string-name>
            <surname>Caverlee</surname>
          </string-name>
          .
          <year>2021</year>
          .
          <article-title>Popularity Bias in Dynamic Recommendation</article-title>
          .
          <source>In KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining</source>
          , Virtual Event, Singapore,
          <source>August 14-18</source>
          ,
          <year>2021</year>
          ,
          <string-name>
            <given-names>Feida</given-names>
            <surname>Zhu</surname>
          </string-name>
          , Beng Chin Ooi, and Chunyan Miao (Eds.). ACM,
          <volume>2439</volume>
          -
          <fpage>2449</fpage>
          . https://doi.org/10.1145/3447548.3467376
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>