<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>IIR</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Choice Models for Simulating the Consumption of Recommendations</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Discussion Paper</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Naieme Hazrati</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Francesco Ricci</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Free University of Bozen-Bolzano</institution>
          ,
          <addr-line>Bolzano</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2022</year>
      </pub-date>
      <volume>12</volume>
      <abstract>
        <p>Recent Recommender Systems (RSs) research has focused on identifying and understanding factors that determine the choice behaviour of their users. By simulating users' choices, influenced by RSs, it was shown that algorithmic biases, such as the tendency to recommend popular items, are transferred to the users' choices. In this position paper, we briefly summarise previous results showing that the efect of an RS on the quality and distribution of the users' choices can be influenced by the users' tendency to prefer certain types of items, i.e., popular, recent, or highly-rated items. To quantify this impact, we have defined alternative Choice Models (CMs) and simulated their efect when users are exposed to recommendations. We found that a bias determined by an RS, e.g., the tendency to concentrate the choices on a restricted number of items, can also be enforced by the CM. Moreover, we have discovered that the quality of the choices can be jeopardised by a CM. We also found that for some RSs, the impact of the CM is less prominent, and their biases are not modified by the CM. This research line shows the importance of assessing algorithmic biases in conjunction with a proper model of users' behaviour.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Simulation</kwd>
        <kwd>Recommender systems</kwd>
        <kwd>Choice model</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        The current analysis of the biases of Recommender Systems (RSs), e,g. the popularity bias, has
so far focused on the impact of the RS algorithm and training data on the distribution of the
produced recommendations [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. However, in practice, users are never passively picking the
recommended items; they compare them with benchmarks (decision goal), and finally they
make a choice. Hence, real users’ choices are surely determined by the RS, but also by the users’
choice behaviour. Hence, the users’ choices overall distribution and quality can be determined
by users’ tendency to choose items with specific properties, such as, those more popular or
recent [
        <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
        ]. For instance, this is clearly observed in how readers purchase books [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        Therefore, we are interested in understanding the quantitative efects of alternative and
“plausible” users’ choice behaviours on the distribution and quality of their choices [
        <xref ref-type="bibr" rid="ref3 ref5">3, 5</xref>
        ].
Aiming at that goal, we operationalise alternative choice models (CMs) that, by mining real
purchases data sets, appear to be adopted by real users (e.g., in the Amazon Apps and Games
ratings data sets). Then, we use these CMs to simulate users repeatedly choosing items during a
long time span, among those recommended by an RS [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The CMs that we consider,
PopularityCM, Rating-CM, and Age-CM, are influenced by three item properties that users may consider
as criteria for making a choice: item popularity, item rating and item age, which is the time
diference between the choice and when the chosen item was first available in the system. In
fact, these properties have already been studied in the literature [
        <xref ref-type="bibr" rid="ref6 ref7 ref8">6, 7, 8</xref>
        ] as they often influence
users in their decision making process. We then use these CMs in a simulation process where
users are exposed to recommendations and are simulated to make choices on the base of one
of these CMs. We also define a benchmark CM, Base-CM, where the simulated users always
select the top recommended item. Base-CM is used to measure the sole efect of the RS and to
diferentiate the efect of the RS from that of the user’s CM. In our empirical analysis we have
found interesting properties of the distribution and quality of the users’ choices, hence showing
the importance of studying the combined efect of a CM and an RS:
1. The CM can have a significant impact on the distribution and quality of the users’ choices. For
instance, when users tend to choose more popular items (Popularity-CM) the choices become
even more concentrated over a small set of items. While choosing newer items (i.e., adopting
Age-CM) can lead to more diverse choices but with lower quality.
2. Some important properties and biases of the RS, how they afect the distribution and quality of
the choices, are independent of the CM. In these cases the RS may have unavoidable efects
that are not changed by any CM. For instance, the strong choice concentration efect of
non-personalised RSs (they recommend the same items to all the users), is not reduced by
any of the considered CMs.
      </p>
      <p>Our research line has the potential to enlighten the not yet analysed efect of the users’ CM.
We aim at understanding the practical implications of the users’ population adopting a particular
CM. This is important to anticipate the long term efect of an RS on users’ decision making.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Simulation of Users’ Choices</title>
      <p>Our research method is based on simulating repeated choices in monthly time intervals, when
these choices are influenced by recommendations. In a timestamped data set of users’ choices
for items, we observe the choices up to a time point 0, and use them as initial training input for
the RS. We then simulate users’ choices among the recommendations, in the successive months,
and at the end of each month, we retrain the RS with the simulated choices of that month.</p>
      <p>
        Six alternative RSs are studied in our simulations. 1) Popularity-based Collaborative Filtering
(  ) is a nearest neighbourhood collaborative filtering (CF) RS that suggests the most popular
items among the choices of nearest neighbour users [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. 2) Low Popularity-based Collaborative
Filtering (  ) is similar to   , but it penalises the ranking score of popular items by
multiplying it with the inverse of their popularity. 3) Factor Model (  ) is a CF RS based on
matrix factorization [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. 4) Neural network-based Collaborative Filtering (  ) leverages a
multi-layer perceptron to learn the user-item interaction function that is used to recommend
top-k items to the target user [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. 5) Popularity-based (  ) is a non-personalised RS that
recommends the same most popular items to all the users. 6) Average Rating () is another
non-personalised RS that recommends items with the highest average ratings.
      </p>
      <p>
        We assume that the simulated user , when receives a set of recommendations , uses a
multinomial-logit CM to make one choice among the recommended items [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. The probability
of the user  to choose the item  is computed as follows:

( ℎ ) = ∑︀∈ 
where  is the utility of the item  for the user . || is set to 50 in our experiments. We
note that items with a larger utility are more likely to be chosen, but users do not necessarily
maximise utility. Based on that multinomial-logit model, we consider four alternative CMs that
difer in how the utility of a recommended item is assessed by the simulated user.
• Rating-CM: the utility of item  for the user  is equal to their rating prediction, ˆ . We use
Inverse Propensity Score Matrix Factorization model (IPS-MF) for such a prediction [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. We
note that Rating-CM is motivated by the assumption that RS users prefer items with larger
ratings [
        <xref ref-type="bibr" rid="ref13 ref2 ref5">13, 2, 5</xref>
        ].
• Popularity-CM: the utility of the item  is equal to:  =  *  , where  is the item 
popularity (at the time of the user choice), i.e., the number of times  has been chosen in 
days prior to the simulated choice divided by  (=90 in our study). This choice behaviour is
often observed and has been extensively studied [
        <xref ref-type="bibr" rid="ref14 ref6">6, 14</xref>
        ]. To have a fair comparison between
the considered CMs,  is a constant adjusted so that  ranges between 1 and 5, which is
the default range of utility values for the Rating-CM (five stars rating).
• Age-CM: the utility of item  is equal to:  =  * ( −  ), where  is the age of item 
(at the simulated choice time). Age is the time diference between the choice time and the
release date of the item  and  is the maximum item age in the entire data set. , as before,
adjusts the impact of the item age on the utility. In Age-CM, more recent items have a larger
utility, hence they tend to be preferred by the simulated users. Such a choice behaviour has
been observed in some domains [
        <xref ref-type="bibr" rid="ref15 ref16 ref17 ref8">8, 15, 16, 17</xref>
        ].
• Base-CM: the user always selects the top recommended item. To impose this choice, we set
the value of  to 1 if  is the first recommended item and 0 otherwise. The analysis of the
choices simulated with Base-CM will show the sole efect of the RS.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Experimental Analysis</title>
      <p>
        We have used some Amazon data sets to conduct simulation experiments, namely, Apps and
Games data sets [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. They contain timestamped ratings of users for purchased items. The
ratings are provided after the purchase and hence, they signal actual choices. We simulate
the final ten months of choice data, while previous months’ data were used to bootstrap the
simulation (RSs initial training data). We have analysed the full set of the simulated choices
using two metrics: (a) the Gini index of the chosen items [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], where a higher value of Gini
represents a lower diversity of these choices; and (b) Choice’s Rating which is the average
predicted rating (IPS-MF predictions) of the choices, which signals the quality of the choices.
      </p>
      <p>Gini index</p>
      <p>Choice’s Rating
Data set CM\RS</p>
      <p>PCF LPCF FM</p>
      <p>NCF POP AR</p>
      <p>PCF LPCF FM</p>
      <p>NCF POP AR
Base
Age
Popularity
Rating
Base
Age
Popularity
Rating</p>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusion</title>
      <p>In this position paper we have illustrated a research line that the authors are now conducting:
by using a properly defined simulation approach, we measure the efect of alternative users’
choice behaviours in the presence of an RS. We are interested in analysing the combined efect
of the CM and the RS on the diversity and quality of the choices. We believe that our study can
contribute to the start of a new line of research where alternative decision making approaches,
potentially followed by the users, are considered in assessing the impact of RS technologies.
one-class collaborative filtering, in: proceedings of the 25th international conference on
world wide web, 2016, pp. 507–517.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Gunawardana</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Shani</surname>
          </string-name>
          ,
          <article-title>A survey of accuracy evaluation metrics of recommendation tasks</article-title>
          ,
          <source>Journal of Machine Learning Research</source>
          <volume>10</volume>
          (
          <year>2009</year>
          )
          <fpage>2935</fpage>
          -
          <lpage>2962</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Szlávik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Kowalczyk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Schut</surname>
          </string-name>
          ,
          <article-title>Diversity measurement of recommender systems under diferent user choice models</article-title>
          , in: Fifth
          <source>International AAAI Conference on Weblogs and Social Media</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>N.</given-names>
            <surname>Hazrati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Ricci</surname>
          </string-name>
          ,
          <article-title>The impact of recommender systems and choice models on users' choices</article-title>
          , in: International Workshop on Algorithmic Bias in Search and Recommendation, Springer,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>D. W.</given-names>
            <surname>Heck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Seiling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bröder</surname>
          </string-name>
          ,
          <article-title>The love of large numbers revisited: A coherence model of the popularity bias</article-title>
          ,
          <source>Cognition</source>
          <volume>195</volume>
          (
          <year>2020</year>
          )
          <fpage>104069</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>N.</given-names>
            <surname>Hazrati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Ricci</surname>
          </string-name>
          ,
          <article-title>Recommender systems efect on the evolution of users' choices distribution</article-title>
          ,
          <source>Information Processing &amp; Management</source>
          <volume>59</volume>
          (
          <year>2022</year>
          )
          <fpage>102766</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>H.</given-names>
            <surname>Abdollahpouri</surname>
          </string-name>
          , G. Adomavicius,
          <string-name>
            <given-names>R.</given-names>
            <surname>Burke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Guy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Jannach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Kamishima</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Krasnodebski</surname>
          </string-name>
          , L. Pizzato, Beyond personalization: Research directions in multistakeholder recommendation, arXiv preprint arXiv:
          <year>1905</year>
          .
          <year>01986</year>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>D.</given-names>
            <surname>Kowald</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Schedl</surname>
          </string-name>
          ,
          <string-name>
            <surname>E. Lex,</surname>
          </string-name>
          <article-title>The unfairness of popularity bias in music recommendation: A reproducibility study</article-title>
          ,
          <source>in: European conference on information retrieval</source>
          , Springer,
          <year>2020</year>
          , pp.
          <fpage>35</fpage>
          -
          <lpage>42</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>J.</given-names>
            <surname>Bartels</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Reinders</surname>
          </string-name>
          ,
          <article-title>Consumer innovativeness and its correlates: A propositional inventory for future research</article-title>
          ,
          <source>Journal of Business Research</source>
          <volume>64</volume>
          (
          <year>2011</year>
          )
          <fpage>601</fpage>
          -
          <lpage>609</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>D.</given-names>
            <surname>Fleder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Hosanagar</surname>
          </string-name>
          ,
          <article-title>Blockbuster culture's next rise or fall: The impact of recommender systems on sales diversity</article-title>
          ,
          <source>Management science 55</source>
          (
          <year>2009</year>
          )
          <fpage>697</fpage>
          -
          <lpage>712</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Hu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Koren</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Volinsky</surname>
          </string-name>
          ,
          <article-title>Collaborative filtering for implicit feedback datasets</article-title>
          , in: 2008 Eighth IEEE International Conference on Data Mining, Ieee,
          <year>2008</year>
          , pp.
          <fpage>263</fpage>
          -
          <lpage>272</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>X.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Liao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Nie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Hu</surname>
          </string-name>
          , T.-S. Chua,
          <article-title>Neural collaborative filtering</article-title>
          ,
          <source>in: Proceedings of the 26th international conference on world wide web</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>173</fpage>
          -
          <lpage>182</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>T.</given-names>
            <surname>Schnabel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Swaminathan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Chandak</surname>
          </string-name>
          , T. Joachims,
          <article-title>Recommendations as treatments: Debiasing learning and evaluation</article-title>
          ,
          <source>arXiv preprint arXiv:1602.05352</source>
          (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>N.</given-names>
            <surname>Hazrati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Elahi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Ricci</surname>
          </string-name>
          ,
          <article-title>Simulating the impact of recommender systems on the evolution of collective users' choices</article-title>
          ,
          <source>in: Proceedings of the 31st ACM conference on hypertext and social media</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>207</fpage>
          -
          <lpage>212</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Caverlee</surname>
          </string-name>
          ,
          <article-title>Measuring and mitigating item under-recommendation bias in personalized ranking systems</article-title>
          ,
          <source>in: Proceedings of the 43rd international ACM SIGIR conference on research and development in information retrieval</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>449</fpage>
          -
          <lpage>458</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>F.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. J.</given-names>
            <surname>Yuan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Xie</surname>
          </string-name>
          ,
          <article-title>Mining novelty-seeking trait across heterogeneous domains</article-title>
          ,
          <source>in: Proceedings of the 23rd international conference on World wide web</source>
          ,
          <year>2014</year>
          , pp.
          <fpage>373</fpage>
          -
          <lpage>384</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>P.</given-names>
            <surname>Gravino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Monechi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Loreto</surname>
          </string-name>
          ,
          <article-title>Towards novelty-driven recommender systems</article-title>
          ,
          <source>Comptes Rendus Physique</source>
          <volume>20</volume>
          (
          <year>2019</year>
          )
          <fpage>371</fpage>
          -
          <lpage>379</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>J.</given-names>
            <surname>Eelen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Verlegh</surname>
          </string-name>
          ,
          <string-name>
            <surname>B. Van den Bergh</surname>
          </string-name>
          ,
          <article-title>Exploring the efectiveness of the label “new” in product packaging and advertising</article-title>
          , ACR North American Advances (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>R.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. McAuley</surname>
          </string-name>
          ,
          <article-title>Ups and downs: Modeling the visual evolution of fashion trends with</article-title>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>