<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Italian Information Retrieval Workshop, September</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.1145/3308558.3313710</article-id>
      <title-group>
        <article-title>Measuring the Ranking Quality of Recommendations in a Two-Dimensional Carousel Setting</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Nicolò Felicioni</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maurizio Ferrari Dacrema</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fernando B. Pérez Maurera</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Paolo Cremonesi</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>ContentWise</institution>
          ,
          <addr-line>Milan</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Politecnico di Milano</institution>
          ,
          <addr-line>Milan</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2021</year>
      </pub-date>
      <volume>1</volume>
      <fpage>3</fpage>
      <lpage>15</lpage>
      <abstract>
        <p>Movie-on-demand and music streaming services usually provide the user with multiple recommendation lists, i.e., carousels, in a two-dimensional user interface, each generated according to diferent criteria (e.g., TV series, popular artists, etc.). In this two-dimensional setting it is not appropriate to use traditional ranking metrics designed for a single ranking list. It is well known that users do not explore a two-dimensional interface one row at a time, but rather focus their attention in a triangular area at the top-left corner. Furthermore, it is frequent for user interfaces to hide some items or lists due to space constraints, which can be shown by performing certain actions (i.e., click, swipe). In this paper we extend the widely used NDCG to a two-dimensional recommendation setting with a formulation that allows to account both the two-dimensional user exploration behaviour and interface-specific design. We also compare the proposed extension against single-list NDCG highlighting that they can lead to a diferent choice of the optimal algorithm in ofline evaluation.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Recommender Systems</kwd>
        <kwd>User Interface</kwd>
        <kwd>Evaluation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Traditionally, in the Information Retrieval and Recommender Systems domains, the objective
has been to provide the user with the best possible ranked list of results [
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1, 2, 3</xref>
        ]. For this
reason, many metrics were developed to evaluate the quality of a one-dimensional ranked list.
A common assumption is that users will navigate the list according to its order, therefore it is
better for a correct recommendation to be at the beginning of the list.
      </p>
      <p>
        There are however several scenarios that do not fit into these assumptions, mainly when
the results are presented in a two-dimensional grid rather than a single list. This is true both
in information retrieval [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and in recommendation systems applications, in particular for
video-on-demand streaming services [
        <xref ref-type="bibr" rid="ref5 ref6 ref7">5, 6, 7</xref>
        ] and music streaming platforms [
        <xref ref-type="bibr" rid="ref8 ref9">8, 9</xref>
        ]. Those
services usually provide users with multiple rows of thematically coherent recommendations
(e.g., the most popular movies, a specific genre, new releases, and so on, see Figure 1). These
rows are referred to as widgets, shelves or as carousels.
      </p>
      <p>
        A simple way to adapt one-dimensional ranking metrics to a two-dimensional interface is to
concatenate all recommendation lists into a single one. This strategy does not make realistic
assumptions and, we argue, is not appropriate. First, it is known that users do not explore each
carousel sequentially from the first to the last, as concatenating them assumes. Rather, users
start from the top-left corner of the screen and proceed to explore the items both to the right
and to the bottom [
        <xref ref-type="bibr" rid="ref10 ref11">10, 11</xref>
        ]. This efect is also known as "golden triangle" or "F-pattern". A visual
example from an information retrieval application [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] is shown in Figure 2. Another example
from a video streaming service [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] is shown in Figure 3. In addition to this user behaviour,
many websites and mobile applications present carousels that are swipeable [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], i.e., the user
can swipe horizontally or vertically to reveal more items as well as lists that were not previously
visible. This is a common way to overcome the limited space available in the user interface
allowing to fit more recommendations and carousels that the user can easily browse. However
this puts additional overhead on the user that has to actively interact with the system to access
the recommendations. Hence, it is preferable for a correct recommendation to be visible with
the least possible number of user actions, as also noticed in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
      </p>
      <p>In order to take those factors into account, in this paper we propose to extend the
onedimensional NDCG metric to consider both the two-dimensional user exploration behaviour
and the user interface characteristics. We show that the two metrics can lead to diferent results
when used to select which recommenders to use in the carousel interface.</p>
      <p>The rest of the paper is organized as follows, in Section 2 we summarize the characteristics
of a carousel setting, in Section 3 we formulate an extended version of NDCG, in Section 4 we
perform an ofline comparison of the results in a single list and carousel interface. Finally in
Section 5 we draw the conclusions.
ion 2
t
i
s
o
P
ow 3
R
0
1
4
5
104
103
102
FFigiguurree 23::HVeiastumaalipzaotfiothneonfutmhbeenruomfibneterroafctuiosnesr per
posiintitoenraocntitohnesscorneeena. cMhopstoisnitteioranctoend aiteumsesrarinetleorcfaatecde i[n7]th.e
A drsetmroawrcsaatniodnobnetthweeernstthpeosfiitriostnsanodf tsheecolinstd. Vhaallufeosfare
log-scaled.</p>
      <p>the columns is visible.</p>
      <p>Episodes of TV series 123, 831 85.36%
2. MCovihesaracteristics o1f3, a733Caro9.u47s%el Settsiinnggle item are 23, 939 and 1 (6, 260 and 1 if counting interactions
TV Movies and shows 5, 722 3.94% from impressions), respectively.</p>
      <p>Movies and clips in series 1, 788 1.23% For impressions with direct links to interactions, the average
The carousel interface layo1u4t5, a07n4d the w10a0y% it is usunaullmybegreonf einrtaetraecdtiobnys rveciediveeod-openr-idmepmresasniodn ias n2,dwhere the</p>
      <p>
        Total
music streaming platforms has important characterimstaixci msutmhaatnddtihsetimnigniumiusmh niutmfrboermof aintseirnacgtiloen-slirsecteived by a
setup [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]: single item are 213 and 1, respectively.
5.1 Analysis of the dataset In Figure 2, we show a heatmap that indicates the most interacted
positions of the recommendation lists based on the row position on
CIonntteentrWf aisce eIm: prAesstiwonoscdonitmainesn1s0,io45n7,a8l10uisneterraicntitoenrs;f3a0c7e,45w3 ith mthuelstcirpeleen.cSapreociucsaelllys,.wSeosmeeethcatamrooustsientlesraocrtiornesch-appen
beimpressions with direct links to interactions; and 23, 342, 617
impressions owithout direct link to interactions. The dataset also
con
      </p>
      <p>mmendations may be hidden due to limited tpwaegenetshiezersatnthdreebreowacpcoseitsisonibs,laendotnhley rvstiatenuisteemr positions.
tains 42, 1a53ctuisoernss;1(4i5.,e0.7,4ciltiecmks, asnwd2ip8,e8)8.1 series.</p>
      <p>
        In Table 4, we highlight the distribution of the interactions when 5.2 Comparison with other datasets
grRouepceodbmy imnteeranctdioanttiyopen, ws:herTeh97e.8l%isotfsthsehdoatwasent its ocotmhperisuedsers aAres pgreevnioeursalytemdenwtiiotnheddiinferSeecntitonal2g, ocurrirtehnmtlys,noorimpressions
of view and access interactions. Similarly, in Table 5, we present datasets are publicly available to the community. As such, we
gaththe distribution of interpacrtoiovnisdbeyristeamntdypne,owshienrgel9e6.p23o%sto-fpthreocessienregd and reporatepdptlhieeidrs.tatistics useinagchtheinondeisvdiedsucraibled on works
by diferent step is While
interactiornesccoormresmpoennddtoaetpioisnodelsisotf TdVoseesriens oantdcmoonvtiaesi.nLadstulyp,licattehsa,t tuhseed sthaomseedaittaesmets.may appear in multiple
in Table 6c,wareoshuosweltsh.e1d2istribution of item types, where the same To the best of our knowledge, ContentWise Impressions is the
episodes of TV series and movies item types represent 94.83% of the rst dataset with impressions to be open-sourced. In previous
total items. years, other articles have used private datasets[
        <xref ref-type="bibr" rid="ref14 ref7">7, 14</xref>
        ], which were
UWseeorbsBerevhedathvaitouuserrs:, itTemhse, aunsdesrerwiesi,llprfeoscenutsloonng-ttahiledtisotrpi--left tnroitarneglelaeseodftoththeesccormemenunriatyt.hOethretrhs awnereexdpislcolorsiendgunder
nonbutions. Ftohr eusecras,r2o7u.9s6e%lmsossetpqoupeunlatriuaslelrys.areFausrsotchiaetremdwoirthe, theryediwstriilbluteioxnpclloaurseestohnechraellcenogmes[m1, e2,n1d3,a2t0i]o, nwsherienonly a few
80% of thedinifeterreancttions. For items, 12.06% most popular items corret-hey nreeseedarchers have access to them. Furrethveeramlotrhe,eCmon.te3ntWise
Imways according to which actions to perform in order to
spond with 80% of the interactions. For series, 4.05% most popular pressions provides both impressions present in the interactions
series appear in 80% of the interactions. and without any associated interaction. Both LinkedIn PYMK
Im
      </p>
      <p>
        ThWeahveirlaegeanucmabreoruofsienltelraacytioonustpemruasyer isse2e48m(22siif mcouilnatirngto a tprraedssiitoinosnaandl LminekregdIen-Sliksiltl Eenmdobrseedmdeinntg[
        <xref ref-type="bibr" rid="ref1 ref4">1,4</xref>
        ]walhsoeprreesent both
dimrecutlitnitperleactrieoncsofmrommiemnpdreastsiioonns)l,iwshtearreethceommaxbi minuemdainndttoheone, tihmipsreisssinonost. Othnethceaosthee.rIhnanad,roetahelrsdcaetansaetrsi[o1,, 1t3h]eornely provided
minimum number of interactions made by a single user are 13, 517 impressions present in the interactions.
and 2 (2, 886 and 1 if counting direct interactions from impressions), Another advantage of ContentWise Impressions is that it is
subrespec1tiAvelsyi.gnificant example are content aggregators, which comsabminpeledcainroauwsealys tforobemeadsiifelyreunsatbplerofovridreesresa:rcNh eptuflixrp, oses without
YoFourt uitbeem,sP,rthimeaevVeriadgeeon,uetmcb.er of interactions received per item requiring signi cant computation resources. While researchers can
is 72 (22F5oirf ecxouamntpinlge, iinntetrhacetiNonestflixfrohmomimeparegsesisohnos)w,wnhienreFitghuere 1 thinedTeeVd spereripersoceSspsaacnedFsourbcseamapppleebairgsgberotdhatianseths,eifTnVeeded, this
mCaoxmimeudmiesanadntdheNmewinRimeluemasnesu mcabreoruosfeilnst.eractions received by a may result in di erent articles relying on di erent subsampling,
3Usually users tend to navigate more easily with simple swipes rather than repeated mouse clicks, hence their
behaviour, as it is known, will change according to the device.
are multiple constraints. First, the carousels may be generated by diferent content providers,
each of them unaware of how the other lists are generated or by whom. This means that the
composition of the layout as well as the recommendations of the other providers are, in general,
not known. It is for this reason that diferent carousels may contain similar recommendations.
Furthermore, a content provider that wishes to select the optimal carousels to display has limited
degrees of freedom and can only alter the content and relative ordering of those it is tasked to
provide. Finding strategies to select the optimal carousel layout is a complex problem [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
      </p>
    </sec>
    <sec id="sec-2">
      <title>3. Extending one-dimensional NDCG</title>
      <p>
        One of the most used metrics for ranked list evaluation is the Discounted Cumulated Gain (DCG),
as well as its Normalized version (NDCG) [
        <xref ref-type="bibr" rid="ref15 ref16">15, 16</xref>
        ]. This metric comes from the information
retrieval domain and is widely used to evaluate recommendation systems. The DCG metric
relies on two assumptions:
      </p>
      <sec id="sec-2-1">
        <title>1. highly relevant results are more valuable for a user;</title>
        <p>2. within a list of results, it is preferable to have relevant results in the first positions
Let  be the recommendation list length, i.e., cutof, and () the relevance of the item in
position . The DCG is defined as the following discounted sum of gains:</p>
        <p>= ∑︁ () · ()</p>
        <p>=1
The  function is responsible for rewarding highly relevant results, while the 
function introduces a penalization that should increase the further the item is from the beginning
of the list.</p>
        <p>
          One of the most used formulations for the DCG is the following [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]:
        </p>
        <p>= ∑︁ 2() − 1</p>
        <p>=1 log2( + 1)</p>
        <p>
          Hence, () = 2() − 1 and () = log2(1+1) . Notice that this formulation is
only one of many possible formulations for the DCG. Several other ways of rewarding and
discounting results have been proposed in previous research [
          <xref ref-type="bibr" rid="ref18 ref19">18, 19</xref>
          ]. In the following, we will
start from this formulation and extend it since it is one of the most used. Other types of gain
and discount functions can be extended in an analogous way. We leave the analysis of diferent
gains and discounts as future work.
        </p>
        <p>In a two-dimensional scenario, the standard DCG definition could be naively adapted in
the following way. Let ℎ be the horizontal dimension of the interface (i.e., the length of each
carousel) and  the vertical dimension of the interface (i.e., the number of carousels). The
carousels can be concatenated in a single list of length  =  · ℎ items on which the standard
DCG formulation can be applied. This strategy assumes that the users will explore all carousels
sequentially, from the first to the last, which, as previously discussed, is not consistent to the
user behaviour and does not account for the interface navigation constraints. Therefore, we
suggest researchers do not apply this strategy as it does not represent a realistic scenario.</p>
        <p>
          Thus, inspired by [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ], we make the following assumptions the two-dimensional DCG should
meet:
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>1. highly relevant results are more valuable for a user;</title>
        <p>2. a relevant result is valuable to the user only when it is first seen;
3. within a grid of results, it is preferable to have relevant results close to the top-left corner
4. it is preferable that relevant items are immediately visible to the user or can be made
visible with few user actions</p>
        <p>In order to account for this set of assumptions, we propose to extend the metric in the
following way:</p>
        <p>ℎ
2 = ∑︁ ∑︁ (, ) · (, )</p>
        <p>=1 =1</p>
        <p>As in the one-dimensional version, the  function is responsible for rewarding highly
relevant results, according to assumptions (1) and (2). The  function, instead, should
account for the penalty related to the position and number of user actions, according to
assumptions (3) and (4).</p>
        <p>Inspired by the one-dimensional version, we fix (, ) = 2(,) − 1. Instead, the
 will depend on the position in the layout, allowing ample freedom on how to define it
in diferent use cases.</p>
        <p>The normalized version of this metric, N2DCG will be defined as as  2 = 2/2.
I2DCG will be the 2DCG of the ideal ranking. In a single list setting the ideal ranking is the list
which contains the relevant items in decreasing relevance from the beginning of the list. In the
generalized two-dimensional layout it contains the user’s most relevant items, ranked according
to decreasing relevance in positions with decreasing position discount. The ideal ranking meets
the following constraints: for any pair of cells (, ), (, ) of the matrix, (, ) ≥ (, )
if (, ) &gt; (, ).</p>
        <p>Relevance As stated in assumption (2), a relevant item is valuable for the user only when it is
ifrst encountered. This means that if a relevant item appears multiple times, each in a diferent
carousels, it should be considered as relevant only in its best position. We define such position
as the one with the highest discount. Function (, ) should be modified accordingly.
Single List Discount It is possible to represent in this formulation the traditional single
list DCG by calculating the position of cell in coordinates ,  if all carousels lists would be
concatenated:</p>
        <p>(, ) = (2(( − 1) · ℎ +  + 1))− 1</p>
        <p>As previously mentioned, this formulation is not grounded in a realistic scenario because it
does not reflect the user behaviour (see Figure 4a), therefore we argue it should not be applied.
(a) Single list.</p>
        <p>(b) Golden triangle behaviour.
ion 3
it
sseuopo
l
r
aC 4
1
2
5
6
1
2</p>
        <p>Reco3mmendation po4sition
5</p>
        <p>6
(c) Golden triangle and user actions penalty.</p>
        <p>Golden Triangle Discount In order to account for the golden triangle behaviour, as per
assumption (3), the position discount should decrease as the distance of the cell from the
top-right corner increases:</p>
        <p>(, ) = (2( ·  +  · ))− 1</p>
        <p>The coeficients  ,  are two weights that can be used to account for diferent types of user
behaviors. For instance, let us assume a scenario where users are more inclined to explore the</p>
        <p>...</p>
        <p>
          Figure 5: An example interface where 3 carousels, with 4 items each, are visible. A horizontal swipe
reveals 4 items, while a vertical swipe reveals one additional carousel.
vertical dimension. In this case,  should be set to a low value in order to penalize less the
vertical dimension. In order to make the discount start from 1,  and  should be ≥ 1 since
the base of the logarithm used is 2. Notice that this is true only because we are extending a
logarithmic discount function. For other discount functions [
          <xref ref-type="bibr" rid="ref18 ref19">18, 19</xref>
          ] the constraints can change.
        </p>
        <p>The resulting discount is shown in Figure 4b (we set  =  = 1 for simplicity).
User Actions Discount Lastly, in order to account for assumption (4) the position discount
should decrease the more actions are required by the user to make that position visible. In
a carousel interface there is an initial rectangular portion of the recommendations that are
immediately shown to the user. We refer to the number of items visible as ℎ and to the
number of carousels visible as , see Figure 5. In order to reveal more items, the user needs to
perform a certain action, i.e., click on a desktop, swipe on mobile devices. Each of these actions
will reveal a certain number of new items within the currently visualized recommendation lists.
Diferent platforms and devices will correspond to diferent swipe steps, i.e., the number of items
that will be revealed after a single swipe. We will call this quantity ℎ ∈ {1, 2, . . . , ℎ}.
For example, on Netflix every click will replace all items displayed on the clicked carousel, in
which case ℎ = ℎ. The same principle holds for the vertical dimension, where the user
can navigate performing actions that will each display  new carousels.</p>
        <p>Based on this definition, we now add to the triangle penalty a term to account for the number
of actions that the user will need to perform in order to visualize the item. To do so we define
some auxiliary functions. The first one is used to check whether at least a user action, i.e., swipe,
is needed to visualize that item in a certain position  given that the interface initially shows
 positions:
isSwipeNeeded(, ) =
{︃1, if  −  &gt; 0
0, otherwise
Then, we define a function to count the number of actions needed to visualize an item, given
that each action shows  positions:
swipes(, , ) = isSwipeNeeded(, ) ·
︂⌈  −  ⌉︂

In the particular case where  = , calculating the number of swipes becomes simpler:
swipes(, ) =
︂⌊  ⌋︂</p>
        <p>The final discount will account for both the triangle discount and the number of user actions,
as previously defined:
(, ) = (2( ·  +  ·  +  · swipes(, , ))</p>
        <p>+  · swipes(, ℎ, ℎ))− 1</p>
        <p>Notice that this formulation accounts for both vertical and horizontal swipes. The coeficients
 ,  ,  ,  are four positive weights that can be used to account for diferent types of user
behaviors. The first two weights (  and  ) control the general penalization of the vertical and
horizontal dimensions, respectively. As we previously said, they should be ≥ 1 in order for
the total discount to start from 1. Controlling  and  , instead, it is possible to penalize more
or less the user actions needed to reveal a certain item. For example, it could be that items
presented together in the same carousel have a similar probability of interaction (see the first 10
elements of the first carousel in Figure 3). Hence, the horizontal dimension should be penalized
less. Another possibility is that, on a desktop device, the horizontal swipe done with a mouse
click will have a higher weight than the same swipe done with a touch on a mobile device.</p>
        <p>For illustrative purposes, let us consider a possible scenario for a mobile device, where the
screen contains 4 carousels and 3 recommendations each. We set the horizontal and vertical
steps to 1, , , ,  are set to 1 as well. The resulting discount is shown in Figure 4c.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>4. Experiments</title>
      <p>
        In this section we provide an example of the diferent behaviour of NDCG and N2DCG in an
ofline experimental scenario. We consider a setting where given a set of recommendation
models and a certain number of carousels, the goal is to select which models to use to generate
each carousel. We show that the two metrics yield to diferent carousel layouts. In order to
represent a scenario where a carousel interface would be used, we selected the widely known
movie recommendations dataset MovieLens10M dataset [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ], containing 70k users, 10k items
and 10M ratings.
      </p>
      <p>
        The set of models that can be selected, i.e., M, contains several simple and widely known
models that have shown to provide competitive results in recent evaluations [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. For
NonPersonalized models we selected a TopPopular recommender. As KNN models we included
ItemKNN [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] and UserKNN [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ], both computing the similarity with cosine and shrinkage.
We included the Graph-based models P3 [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ] and RP3 [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ], which define a bipartite graph of
users and items and simulate a random walk. We added various Matrix Factorization models,
some developed for explicit interactions: PureSVD [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], FunkSVD [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] and Non-negative MF
(NMF) [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ]; as well as others developed for implicit interactions: MF BPR [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ], IALS [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]. We
included the widely known Item-Based machine learning models SLIM [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ], SLIM BPR and the
more recent EASE [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ]. Finally, we included the Content-based model ItemKNN CBF, which
computes the item similarities from item features. using cosine similarity with shrinkage.
      </p>
      <sec id="sec-3-1">
        <title>Optimizing NDCG</title>
      </sec>
      <sec id="sec-3-2">
        <title>Optimizing N2DCG</title>
      </sec>
      <sec id="sec-3-3">
        <title>UserKNN</title>
      </sec>
      <sec id="sec-3-4">
        <title>FunkSVD NMF</title>
      </sec>
      <sec id="sec-3-5">
        <title>IALS</title>
        <p>MF BPR</p>
      </sec>
      <sec id="sec-3-6">
        <title>SLIM</title>
        <p>SLIM</p>
      </sec>
      <sec id="sec-3-7">
        <title>FunkSVD</title>
        <p>UserKNN
MF BPR</p>
        <p>NMF
IALS</p>
        <p>
          We split the data by randomly selecting 80% of interactions for the training set and 10% for
validation and test set. Each model was optimized on the validation data, following the best
practices and value ranges reported in [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ], using a Bayesian search with 50 cases.
        </p>
        <p>Since the purpose of this paper is not to propose an algorithm for the selection of carousels
but to show that the two metrics lead to diferent results, we rely on a simple greedy strategy.
At the beginning the page is empty and all candidate algorithms are evaluated independently on
the validation data. The model with the best recommendation quality is selected as first carousel.
The process repeats for the following carousels, however, in this case, the candidate model will
be evaluated by taking into account all the previous carousels. According to the definition of
relevance provided in Section 3, a correct recommendation of an item by the candidate model
may overtake another of the same item in a previous carousels if it has a better position discount.
For example, a correct recommendation at the end of the second carousel could be overtaken by
the same recommendation but at the beginning of the third carousel, if it has a better position
discount.</p>
        <p>We repeated this procedure first optimizing NDCG, and then optimizing N2DCG. We consider
a hypothetical interface with a total of 6 carousels, each composed of 10 items. The interface
will initially show 3 carousels and 2 items. The user can display 1 additional item in a given
carousel with each horizontal swipe and 1 new carousel with a vertical swipe. For this interface,
we set  =  = 1 and  =  = 2, in order to penalize more the swipes.</p>
        <p>The resulting layouts are shown in Table 1. As we can see, the layouts have almost completely
diferent orders of the chosen algorithms. For instance, optimizing N2DCG results in selecting
SLIM as the first carousel, while the same algorithm was selected at the bottom of the layout that
optimizes one-dimensional NDCG. UserKNN instead was the first algorithm when optimizing
NDCG, but it is only the third carousel during N2DCG optimization.</p>
        <p>Notice also how the 6 algorithms selected in both procedures are the same, only the order
changes. Indeed, it is expected that NDCG and N2DCG will not produce completely diferent
layouts but will difer the longer and more pronounced the efects of user actions become.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>5. Conclusions</title>
      <p>In this paper we have described a user interface with multiple carousels, typical of
movie-ondemand and music streaming services, and based on its characteristics proposed an extended
version of the widely used NDCG metric. The proposed formulation accounts for the known
user behaviour of exploring the pages not one row at a time but focusing on the top-left corner
and then navigating in both directions. The proposed formulation also allows to penalize correct
recommendations that are only visible to the user after performing actions. Lastly, we show that
the two metrics can lead to the selection of a diferent carousel layout. Future works include
validating the proposed metric with user studies as well as applying it to select the optimal
carousel layout, by defining which is the best carousel to put in a certain position or which is
the best ordering of a given set of carousels. Also, further studies can be done on diferent gain
and discount functions, similar to previous research works conducted on the one-dimensional
DCG.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Herlocker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Konstan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. G.</given-names>
            <surname>Terveen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Riedl</surname>
          </string-name>
          ,
          <article-title>Evaluating collaborative filtering recommender systems</article-title>
          ,
          <source>ACM Trans. Inf. Syst</source>
          .
          <volume>22</volume>
          (
          <year>2004</year>
          )
          <fpage>5</fpage>
          -
          <lpage>53</lpage>
          . URL: https://doi.org/10.1145/ 963770.963772. doi:
          <volume>10</volume>
          .1145/963770.963772.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>P.</given-names>
            <surname>Cremonesi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Koren</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Turrin</surname>
          </string-name>
          ,
          <article-title>Performance of recommender algorithms on top-n recommendation tasks</article-title>
          , in: X.
          <string-name>
            <surname>Amatriain</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Torrens</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Resnick</surname>
          </string-name>
          , M. Zanker (Eds.),
          <source>Proceedings of the 2010 ACM Conference on Recommender Systems, RecSys</source>
          <year>2010</year>
          , Barcelona, Spain,
          <source>September 26-30</source>
          ,
          <year>2010</year>
          , ACM,
          <year>2010</year>
          , pp.
          <fpage>39</fpage>
          -
          <lpage>46</lpage>
          . URL: https://doi.org/10.1145/1864708. 1864721. doi:
          <volume>10</volume>
          .1145/1864708.1864721.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M.</given-names>
            <surname>Sanderson</surname>
          </string-name>
          , W. B.
          <string-name>
            <surname>Croft</surname>
          </string-name>
          ,
          <article-title>The history of information retrieval research</article-title>
          ,
          <source>Proc. IEEE</source>
          <volume>100</volume>
          (
          <year>2012</year>
          )
          <fpage>1444</fpage>
          -
          <lpage>1451</lpage>
          . URL: https://doi.org/10.1109/JPROC.
          <year>2012</year>
          .
          <volume>2189916</volume>
          . doi:
          <volume>10</volume>
          .1109/JPROC.
          <year>2012</year>
          .
          <volume>2189916</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>F.</given-names>
            <surname>Chierichetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Raghavan</surname>
          </string-name>
          ,
          <article-title>Optimizing two-dimensional search results presentation</article-title>
          , in: I.
          <string-name>
            <surname>King</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          <string-name>
            <surname>Nejdl</surname>
          </string-name>
          , H. Li (Eds.),
          <source>Proceedings of the Forth International Conference on Web Search and Web Data Mining, WSDM</source>
          <year>2011</year>
          ,
          <string-name>
            <given-names>Hong</given-names>
            <surname>Kong</surname>
          </string-name>
          , China, February 9-
          <issue>12</issue>
          ,
          <year>2011</year>
          , ACM,
          <year>2011</year>
          , pp.
          <fpage>257</fpage>
          -
          <lpage>266</lpage>
          . URL: https://doi.org/10.1145/1935826.1935873. doi:
          <volume>10</volume>
          .1145/1935826.1935873.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>C.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. V.</given-names>
            <surname>Alvino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. J.</given-names>
            <surname>Smola</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Basilico</surname>
          </string-name>
          ,
          <article-title>Using navigation to improve recommendations in real-time</article-title>
          , in: S. Sen,
          <string-name>
            <given-names>W.</given-names>
            <surname>Geyer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Freyne</surname>
          </string-name>
          , P. Castells (Eds.),
          <source>Proceedings of the 10th ACM Conference on Recommender Systems</source>
          , Boston, MA, USA, September
          <volume>15</volume>
          -
          <issue>19</issue>
          ,
          <year>2016</year>
          , ACM,
          <year>2016</year>
          , pp.
          <fpage>341</fpage>
          -
          <lpage>348</lpage>
          . URL: https://doi.org/10.1145/2959100.2959174. doi:
          <volume>10</volume>
          .1145/2959100. 2959174.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>E.</given-names>
            <surname>Elahi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Chandrashekar</surname>
          </string-name>
          ,
          <article-title>Learning representations of hierarchical slates in collaborative ifltering</article-title>
          , in: R. L.
          <string-name>
            <surname>T. Santos</surname>
            ,
            <given-names>L. B.</given-names>
          </string-name>
          <string-name>
            <surname>Marinho</surname>
            ,
            <given-names>E. M.</given-names>
          </string-name>
          <string-name>
            <surname>Daly</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Falk</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Koenigstein</surname>
          </string-name>
          , E. S. de Moura (Eds.),
          <source>RecSys 2020: Fourteenth ACM Conference on Recommender Systems</source>
          , Virtual Event, Brazil,
          <source>September 22-26</source>
          ,
          <year>2020</year>
          , ACM,
          <year>2020</year>
          , pp.
          <fpage>703</fpage>
          -
          <lpage>707</lpage>
          . URL: https://doi. org/10.1145/3383313.3418484. doi:
          <volume>10</volume>
          .1145/3383313.3418484.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>F. B.</given-names>
            <surname>Pérez Maurera</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. Ferrari</given-names>
            <surname>Dacrema</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Saule</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Scriminaci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Cremonesi</surname>
          </string-name>
          ,
          <article-title>Contentwise impressions: An industrial dataset with impressions included</article-title>
          , in: M.
          <string-name>
            <surname>d'Aquin</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Dietze</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Hauf</surname>
            , E. Curry,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Cudré-Mauroux</surname>
          </string-name>
          (Eds.),
          <source>CIKM '20: The 29th ACM International Conference on Information and Knowledge Management</source>
          , Virtual Event, Ireland,
          <source>October 19-23</source>
          ,
          <year>2020</year>
          , ACM,
          <year>2020</year>
          , pp.
          <fpage>3093</fpage>
          -
          <lpage>3100</lpage>
          . URL: https://doi.org/10.1145/3340531.3412774. doi:
          <volume>10</volume>
          .1145/3340531.3412774.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A.</given-names>
            <surname>Gruson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Chandar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Charbuillet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>McInerney</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Hansen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Tardieu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Carterette</surname>
          </string-name>
          ,
          <article-title>Ofline evaluation to make decisions about playlistrecommendation algorithms</article-title>
          , in: J. S. Culpepper,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mofat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. N.</given-names>
            <surname>Bennett</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.</surname>
          </string-name>
          Lerman (Eds.),
          <source>Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, WSDM</source>
          <year>2019</year>
          ,
          <article-title>Melbourne</article-title>
          ,
          <string-name>
            <surname>VIC</surname>
          </string-name>
          , Australia,
          <source>February 11-15</source>
          ,
          <year>2019</year>
          , ACM,
          <year>2019</year>
          , pp.
          <fpage>420</fpage>
          -
          <lpage>428</lpage>
          . URL: https://doi.org/10.1145/ 3289600.3291027. doi:
          <volume>10</volume>
          .1145/3289600.3291027.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>W.</given-names>
            <surname>Bendada</surname>
          </string-name>
          , G. Salha, T. Bontempelli,
          <article-title>Carousel personalization in music streaming apps with contextual bandits</article-title>
          , in: R. L.
          <string-name>
            <surname>T. Santos</surname>
            ,
            <given-names>L. B.</given-names>
          </string-name>
          <string-name>
            <surname>Marinho</surname>
            ,
            <given-names>E. M.</given-names>
          </string-name>
          <string-name>
            <surname>Daly</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Falk</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Koenigstein</surname>
          </string-name>
          , E. S. de Moura (Eds.),
          <source>RecSys 2020: Fourteenth ACM Conference on Recommender Systems</source>
          , Virtual Event, Brazil,
          <source>September 22-26</source>
          ,
          <year>2020</year>
          , ACM,
          <year>2020</year>
          , pp.
          <fpage>420</fpage>
          -
          <lpage>425</lpage>
          . URL: https://doi.org/10.1145/3383313.3412217. doi:
          <volume>10</volume>
          .1145/3383313.3412217.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Kammerer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Gerjets</surname>
          </string-name>
          ,
          <article-title>How the interface design influences users' spontaneous trustworthiness evaluations of web search results: comparing a list and a grid interface</article-title>
          , in: C. H.
          <string-name>
            <surname>Morimoto</surname>
            ,
            <given-names>H. O.</given-names>
          </string-name>
          <string-name>
            <surname>Istance</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Hyrskykari</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          Ji (Eds.),
          <source>Proceedings of the 2010 Symposium on Eye-Tracking Research &amp; Applications</source>
          ,
          <string-name>
            <surname>ETRA</surname>
          </string-name>
          <year>2010</year>
          , Austin, Texas, USA, March
          <volume>22</volume>
          -24,
          <year>2010</year>
          , ACM,
          <year>2010</year>
          , pp.
          <fpage>299</fpage>
          -
          <lpage>306</lpage>
          . URL: https://doi.org/10.1145/1743666.1743736. doi:
          <volume>10</volume>
          .1145/1743666.1743736.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>Q.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. M.</given-names>
            <surname>Harper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Konstan</surname>
          </string-name>
          ,
          <article-title>Gaze prediction for recommender systems</article-title>
          , in: S. Sen,
          <string-name>
            <given-names>W.</given-names>
            <surname>Geyer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Freyne</surname>
          </string-name>
          , P. Castells (Eds.),
          <source>Proceedings of the 10th ACM Conference on Recommender Systems</source>
          , Boston, MA, USA, September
          <volume>15</volume>
          -
          <issue>19</issue>
          ,
          <year>2016</year>
          , ACM,
          <year>2016</year>
          , pp.
          <fpage>131</fpage>
          -
          <lpage>138</lpage>
          . URL: https://doi.org/10.1145/2959100.2959150. doi:
          <volume>10</volume>
          .1145/2959100.2959150.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>K.</given-names>
            <surname>Järvelin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. L.</given-names>
            <surname>Price</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. M. L.</given-names>
            <surname>Delcambre</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. L.</given-names>
            <surname>Nielsen</surname>
          </string-name>
          ,
          <article-title>Discounted cumulated gain based evaluation of multiple-query IR sessions</article-title>
          , in: C.
          <string-name>
            <surname>Macdonald</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          <string-name>
            <surname>Ounis</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Plachouras</surname>
            ,
            <given-names>I. Ruthven</given-names>
          </string-name>
          ,
          <string-name>
            <surname>R. W.</surname>
          </string-name>
          White (Eds.),
          <source>Advances in Information Retrieval , 30th European Conference on IR Research</source>
          , ECIR
          <year>2008</year>
          , Glasgow, UK, March 30-April 3,
          <year>2008</year>
          . Proceedings, volume
          <volume>4956</volume>
          of Lecture Notes in Computer Science, Springer,
          <year>2008</year>
          , pp.
          <fpage>4</fpage>
          -
          <lpage>15</lpage>
          . URL: https://doi.org/10.1007/978-3-
          <fpage>540</fpage>
          -78646-
          <issue>7</issue>
          _4. doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>540</fpage>
          -78646-7\_4.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>N.</given-names>
            <surname>Felicioni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. Ferrari</given-names>
            <surname>Dacrema</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Cremonesi</surname>
          </string-name>
          ,
          <article-title>A methodology for the ofline evaluation of recommender systems in a user interface with multiple carousels</article-title>
          , in: J.
          <string-name>
            <surname>Masthof</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Herder</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Tintarev</surname>
          </string-name>
          , M. Tkalcic (Eds.),
          <source>Adjunct Publication of the 29th ACM Conference on User Modeling, Adaptation and Personalization</source>
          ,
          <string-name>
            <surname>UMAP</surname>
          </string-name>
          <year>2021</year>
          ,
          <article-title>Utrecht, The Netherlands</article-title>
          , June 21-25,
          <year>2021</year>
          , ACM,
          <year>2021</year>
          , pp.
          <fpage>10</fpage>
          -
          <lpage>15</lpage>
          . URL: https://doi.org/10.1145/3450614.3461680. doi:
          <volume>10</volume>
          .1145/3450614.3461680.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>M.</given-names>
            <surname>Ferrari Dacrema</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Felicioni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Cremonesi</surname>
          </string-name>
          ,
          <article-title>Optimizing the selection of recommendation carousels with quantum computing</article-title>
          ,
          <source>in: Proceedings of the Fifteenth ACM Conference on Recommender Systems</source>
          ,
          <year>2021</year>
          . doi:
          <volume>10</volume>
          .1145/3460231.3478853.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>K.</given-names>
            <surname>Järvelin</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. Kekäläinen,</surname>
          </string-name>
          <article-title>IR evaluation methods for retrieving highly relevant documents</article-title>
          , in: E. J.
          <string-name>
            <surname>Yannakoudakis</surname>
            ,
            <given-names>N. J.</given-names>
          </string-name>
          <string-name>
            <surname>Belkin</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Ingwersen</surname>
          </string-name>
          , M. Leong (Eds.),
          <source>SIGIR 2000: Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 24-28</source>
          ,
          <year>2000</year>
          , Athens, Greece,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          ,
          <year>2000</year>
          , pp.
          <fpage>41</fpage>
          -
          <lpage>48</lpage>
          . URL: https://doi.org/10.1145/345508.345545. doi:
          <volume>10</volume>
          .1145/345508.345545.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>K.</given-names>
            <surname>Järvelin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kekäläinen</surname>
          </string-name>
          ,
          <article-title>Cumulated gain-based evaluation of IR techniques</article-title>
          ,
          <source>ACM Trans. Inf. Syst</source>
          .
          <volume>20</volume>
          (
          <year>2002</year>
          )
          <fpage>422</fpage>
          -
          <lpage>446</lpage>
          . URL: http://doi.acm.
          <source>org/10</source>
          .1145/582415.582418. doi:
          <volume>10</volume>
          .1145/ 582415.582418.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>C. J. C. Burges</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Shaked</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Renshaw</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Lazier</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Deeds</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Hamilton</surname>
            ,
            <given-names>G. N.</given-names>
          </string-name>
          <string-name>
            <surname>Hullender</surname>
          </string-name>
          ,
          <article-title>Learning to rank using gradient descent</article-title>
          , in: L.
          <string-name>
            <surname>D. Raedt</surname>
          </string-name>
          , S. Wrobel (Eds.),
          <source>Machine Learning, Proceedings of the Twenty-Second International Conference (ICML</source>
          <year>2005</year>
          ), Bonn, Germany,
          <source>August</source>
          <volume>7</volume>
          -
          <issue>11</issue>
          ,
          <year>2005</year>
          , volume
          <volume>119</volume>
          of ACM International Conference Proceeding Series, ACM,
          <year>2005</year>
          , pp.
          <fpage>89</fpage>
          -
          <lpage>96</lpage>
          . URL: https://doi.org/10.1145/1102351.1102363. doi:
          <volume>10</volume>
          .1145/1102351. 1102363.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>E.</given-names>
            <surname>Kanoulas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Aslam</surname>
          </string-name>
          ,
          <article-title>Empirical justification of the gain and discount function for ndcg</article-title>
          , in: D. W. Cheung, I. Song,
          <string-name>
            <given-names>W. W.</given-names>
            <surname>Chu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Hu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. J.</given-names>
            <surname>Lin</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the 18th ACM Conference on Information and Knowledge Management</source>
          ,
          <string-name>
            <surname>CIKM</surname>
          </string-name>
          <year>2009</year>
          ,
          <string-name>
            <given-names>Hong</given-names>
            <surname>Kong</surname>
          </string-name>
          , China, November 2-
          <issue>6</issue>
          ,
          <year>2009</year>
          , ACM,
          <year>2009</year>
          , pp.
          <fpage>611</fpage>
          -
          <lpage>620</lpage>
          . URL: https://doi.org/10.1145/1645953.1646032. doi:
          <volume>10</volume>
          .1145/1645953.1646032.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>K.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Zha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <surname>G.</surname>
          </string-name>
          <article-title>Xue, Learning the gain values and discount factors of discounted cumulative gains</article-title>
          ,
          <source>IEEE Trans. Knowl. Data Eng</source>
          .
          <volume>26</volume>
          (
          <year>2014</year>
          )
          <fpage>391</fpage>
          -
          <lpage>404</lpage>
          . URL: https://doi.org/10.1109/TKDE.
          <year>2012</year>
          .
          <volume>252</volume>
          . doi:
          <volume>10</volume>
          .1109/TKDE.
          <year>2012</year>
          .
          <volume>252</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>F. M.</given-names>
            <surname>Harper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Konstan</surname>
          </string-name>
          ,
          <article-title>The movielens datasets: History and context</article-title>
          ,
          <source>ACM Trans. Interact. Intell. Syst</source>
          .
          <volume>5</volume>
          (
          <year>2016</year>
          )
          <volume>19</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>19</lpage>
          :
          <fpage>19</fpage>
          . URL: https://doi.org/10.1145/2827872. doi:
          <volume>10</volume>
          . 1145/2827872.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>M.</given-names>
            <surname>Ferrari Dacrema</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Boglio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Cremonesi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Jannach</surname>
          </string-name>
          ,
          <article-title>A troubling analysis of reproducibility and progress in recommender systems research</article-title>
          ,
          <source>ACM Trans. Inf. Syst</source>
          .
          <volume>39</volume>
          (
          <year>2021</year>
          ). URL: https://doi.org/10.1145/3434185. doi:
          <volume>10</volume>
          .1145/3434185.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>B.</given-names>
            <surname>Sarwar</surname>
          </string-name>
          , G. Karypis,
          <string-name>
            <given-names>J.</given-names>
            <surname>Konstan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Riedl</surname>
          </string-name>
          ,
          <article-title>Item-based collaborative filtering recommendation algorithms</article-title>
          ,
          <source>in: Proceedings of the 10th International Conference on World Wide Web (WWW '01)</source>
          ,
          <year>2001</year>
          , pp.
          <fpage>285</fpage>
          -
          <lpage>295</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>B. M.</given-names>
            <surname>Sarwar</surname>
          </string-name>
          , G. Karypis,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Konstan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Riedl</surname>
          </string-name>
          ,
          <article-title>Item-based collaborative filtering recommendation algorithms</article-title>
          , in: V. Y.
          <string-name>
            <surname>Shen</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Saito</surname>
            ,
            <given-names>M. R.</given-names>
          </string-name>
          <string-name>
            <surname>Lyu</surname>
          </string-name>
          , M. E. Zurko (Eds.),
          <source>Proceedings of the Tenth International World Wide Web Conference, WWW 10</source>
          ,
          <string-name>
            <surname>Hong</surname>
            <given-names>Kong</given-names>
          </string-name>
          , China, May 1-
          <issue>5</issue>
          ,
          <year>2001</year>
          , ACM,
          <year>2001</year>
          , pp.
          <fpage>285</fpage>
          -
          <lpage>295</lpage>
          . URL: https://doi.org/10.1145/371920.372071. doi:
          <volume>10</volume>
          .1145/371920.372071.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>C.</given-names>
            <surname>Cooper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Radzik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Siantos</surname>
          </string-name>
          ,
          <article-title>Random walks in recommender systems: exact computation and simulations</article-title>
          , in: C.
          <string-name>
            <surname>Chung</surname>
            ,
            <given-names>A. Z.</given-names>
          </string-name>
          <string-name>
            <surname>Broder</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Shim</surname>
          </string-name>
          , T. Suel (Eds.),
          <source>23rd International World Wide Web Conference, WWW '14</source>
          , Seoul, Republic of Korea, April 7-
          <issue>11</issue>
          ,
          <year>2014</year>
          , Companion Volume,
          <source>ACM</source>
          ,
          <year>2014</year>
          , pp.
          <fpage>811</fpage>
          -
          <lpage>816</lpage>
          . URL: https://doi.org/10.1145/ 2567948.2579244. doi:
          <volume>10</volume>
          .1145/2567948.2579244.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>B.</given-names>
            <surname>Paudel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Christofel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Newell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bernstein</surname>
          </string-name>
          , Updatable, accurate, diverse, and
          <article-title>scalable recommendations for interactive applications</article-title>
          ,
          <source>ACM Trans. Interact. Intell. Syst</source>
          .
          <volume>7</volume>
          (
          <issue>2017</issue>
          ) 1:
          <fpage>1</fpage>
          -
          <lpage>1</lpage>
          :
          <fpage>34</fpage>
          . URL: https://doi.org/10.1145/2955101. doi:
          <volume>10</volume>
          .1145/2955101.
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>A.</given-names>
            <surname>Cichocki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. H.</given-names>
            <surname>Phan</surname>
          </string-name>
          ,
          <article-title>Fast local algorithms for large scale nonnegative matrix and tensor factorizations</article-title>
          ,
          <source>IEICE Trans. Fundam. Electron. Commun. Comput. Sci</source>
          . 92-
          <fpage>A</fpage>
          (
          <year>2009</year>
          )
          <fpage>708</fpage>
          -
          <lpage>721</lpage>
          . URL: https://doi.org/10.1587/transfun.E92.A.
          <volume>708</volume>
          . doi:
          <volume>10</volume>
          .1587/transfun.E92.A.
          <volume>708</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>S.</given-names>
            <surname>Rendle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Freudenthaler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Gantner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Schmidt-Thieme</surname>
          </string-name>
          ,
          <article-title>BPR: bayesian personalized ranking from implicit feedback</article-title>
          , in: J. A.
          <string-name>
            <surname>Bilmes</surname>
          </string-name>
          , A. Y. Ng (Eds.),
          <source>UAI 2009, Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence</source>
          , Montreal, QC, Canada, June 18-21,
          <year>2009</year>
          , AUAI Press,
          <year>2009</year>
          , pp.
          <fpage>452</fpage>
          -
          <lpage>461</lpage>
          . URL: https://dslpitt.org/uai/ displayArticleDetails.jsp
          <article-title>?mmnu=1&amp;smnu=2&amp;article_id=1630&amp;proceeding</article-title>
          _id=
          <fpage>25</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Hu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Koren</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Volinsky</surname>
          </string-name>
          ,
          <article-title>Collaborative filtering for implicit feedback datasets</article-title>
          ,
          <source>in: Proceedings of the 8th IEEE International Conference on Data Mining (ICDM</source>
          <year>2008</year>
          ),
          <source>December 15-19</source>
          ,
          <year>2008</year>
          , Pisa, Italy, IEEE Computer Society,
          <year>2008</year>
          , pp.
          <fpage>263</fpage>
          -
          <lpage>272</lpage>
          . URL: https: //doi.org/10.1109/ICDM.
          <year>2008</year>
          .
          <volume>22</volume>
          . doi:
          <volume>10</volume>
          .1109/ICDM.
          <year>2008</year>
          .
          <volume>22</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>X.</given-names>
            <surname>Ning</surname>
          </string-name>
          , G. Karypis, SLIM:
          <article-title>Sparse linear methods for top-n recommender systems</article-title>
          ,
          <source>in: Proceedings of the 11th IEEE International Conference on Data Mining (ICDM '11)</source>
          ,
          <year>2011</year>
          , pp.
          <fpage>497</fpage>
          -
          <lpage>506</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>H.</given-names>
            <surname>Steck</surname>
          </string-name>
          ,
          <article-title>Embarrassingly shallow autoencoders for sparse data</article-title>
          , in: L. Liu,
          <string-name>
            <given-names>R. W.</given-names>
            <surname>White</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mantrach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Silvestri</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. J. McAuley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Baeza-Yates</surname>
          </string-name>
          , L. Zia (Eds.),
          <source>The World Wide Web Conference, WWW</source>
          <year>2019</year>
          , San Francisco, CA, USA, May
          <volume>13</volume>
          -17,
          <year>2019</year>
          , ACM,
          <year>2019</year>
          , pp.
          <fpage>3251</fpage>
          -
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>