<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>CRUISE on Quantum Computing for Feature Selection in Recommender Systems</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jiayang Niu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jie Li</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ke Deng</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yongli Ren</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>School of Computing Technologies, RMIT University</institution>
          ,
          <addr-line>Melbourne, Victoria 3000</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>Using Quantum Computers to solve problems in Recommender Systems that classical computers cannot address is a worthwhile research topic. In this paper, we use Quantum Annealers to address the feature selection problem in recommendation algorithms. This feature selection problem is a Quadratic Unconstrained Binary Optimization (QUBO) problem. By incorporating Counterfactual Analysis, we significantly improve the performance of the item-based KNN recommendation algorithm compared to using pure Mutual Information. Extensive experiments have demonstrated that the use of Counterfactual Analysis holds great promise for addressing such problems.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Quantum Computers</kwd>
        <kwd>Recommender Systems</kwd>
        <kwd>Counterfactual Analysis</kwd>
        <kwd>Feature Selection</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Collaborative filtering technology [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ], which predicts potential user-item interactions based on the
patterns of user behavior and item characteristics, is widely applied in recommendation algorithms,
Some well-known techniques in this field include matrix factorization methods [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], neighborhood-based
methods [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], deep learning approaches [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ], graph-based techniques [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ], factorization machines [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ],
hybrid methods [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], Bayesian methods [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], and large language models (LLMs) [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. However,
collaborative filtering technology [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] heavily relies on the quality of data. For instance, using user profiles,
item features, reviews, images, and other information can significantly improve the performance of
recommendation algorithms, but in some cases, it can also decrease their performance. Therefore, it’s
critical to distinguish what information are useful for recommendations so as to help the the
construction of eficient systems and reduction of energy consumption [ 13, 14, 15, 16]. Quantum computers,
with its use of qubits and quantum efects like superposition, entanglement, and quantum tunneling, is
an efective tool for identifying useful information from redundant data [ 17]. It significantly enhances
the processing speed of search problems and large integer factorization [18]. Therefore, in this paper,
we aim to find useful features for recommendations by leveraging quantum computing techniques. Our
goal is to improve the eficiency and accuracy of recommendation systems by identifying and utilizing
relevant data, thereby reducing computational requirements and energy consumption [18, 19, 20].
      </p>
      <p>In QuantumCLEF 2024, we focus on Task 1B, where 150 and 500 features are provided for each item,
respectively[21, 22]. We will analyze these features to extract the most relevant ones for recommender
systems. The task requires participants to use Quantum Annealing and Simulated Annealing to select
appropriate features from the given data for an Item-Based KNN recommendation algorithm
(ItemKNN). The organizers provided an example of feature selection by using Mutual Information [18].
However, our preliminary experiments showed that using only Mutual Information for feature selection
resulted in limited improvement in the performance of Item-KNN compared to using all features without
any selection. This is because Mutual Information only reflects the mutual relationship between two
variables and is not associated with the final goal of the recommendation algorithm. Therefore, to
achieve better performance, we propose taking the impact of features on recommendation quality into
consideration when performing feature selection.</p>
      <p>One approach to achieve this is through Counterfactual Analysis [23], which is a causal research tool
to examine the impact of a factor on the final result by hypothesizing the absence or alteration of that
factor. This approach mainly considers three aspects: Which factors need to be evaluated? What metrics
are used to assess the impact of these factors on the model’s outcomes? And what models are used to
derive the values of these metrics? In this work, due to the limited time for this task, we aim to measure
and explore the impact of item features by Counterfactually Analyzing their efect on nDCG [ 24]
performance of recommendation lists and we chose the KNN-based recommendation algorithm, a
commonly used method in collaborative filtering, to perform these measurements. Specifically, we
used Item-KNN to derive the change in nDCG values after removing a specific item feature. Since
Mutual Information can reflect the relationship between two features, which may positively afects
the final results, we did not discard it. Instead, we integrated the results of Counterfactual Analysis
into Mutual Information using a temperature coeficient, which is used to control the influence of
Counterfactual Analysis on the final results. Given the current limitations on the number of qubits in
Quantum Computers, directly performing Quantum Annealing on 500 variables remains a challenging
task. Therefore, in this task, we first partitioned the 500 features into subsets manageable by the
Quantum Computer, and then combined the results.</p>
      <p>The paper is organized as follows: Section 2 introduces related works; Section 3 describes the QUBO
formulation, how Mutual Information is applied to QUBO for feature selection, and our proposed method
of using Counterfactual Analysis for feature selection in QUBO; Section 4 explains our experimental
setup and experimental result; Section 5 discusses our main findings; finally, Section 6 draws some
conclusions and outlooks for future work.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <sec id="sec-2-1">
        <title>2.1. Quantum Computers</title>
        <p>In recent years, the rapid development of Quantum Computers has demonstrated their tremendous
potential in solving problems that Classical Computer cannot address, such as NP and NP-hard problems [25].
Based on their functionality and application scenarios, Quantum Computers can be categorized into
Universal Quantum Computers, Quantum Annealers, Quantum Machine Learning Accelerators, and
others [26]. Recent studies have utilized Quantum Annealers for feature selection to enhance the
performance of recommendation systems or retrieval systems [27, 28, 18]. Nembrini et al. [27] attempted to
apply Quantum Computers to recommendation systems by using Quantum Annealing to solve a hybrid
feature selection approach. Their work demonstrates that current Quantum Computers are already
capable of addressing real-world recommendation system problems. Nikitin et.al.[28] reproduced
Nembrini’s work and employed Tensor Train-based Optimization (TTOpt) as an optimizer for the cold
start problem in recommendation systems. MIQUBO [18] discussed the problem of feature selection
using Quantum Computers and formalizes it as a Quadratic Unconstrained Binary Optimization (QUBO)
problem. It demonstrates the potential of Quantum Computers to solve ranking and classification
problems more eficiently.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Counterfactual Analysis</title>
        <p>Existing deep learning models have complex decision-making processes that are dificult for people to
understand, often functioning as black-box models, Counterfactual Analysis is a highly efective method
for helping people understand these complex models and robust them [29]. For example, CF2 [30]
used Counterfactual Analysis to explore the explanations of Graph Neural Networks. In recommender
systems, Counterfactual Analysis is primarily used for explainability and to combat data sparsity.
ACCENT [31] was the first to apply Counterfactual Analysis to neural network-based recommendation
algorithms. CountER [32] utilizes Counterfactual Analysis to construct a low-complexity, high-strength
model for explaining recommendation systems. It also highlights that using Counterfactual Analysis
contributes to the interpretability and evaluation of recommendation systems. Zhang et al [33] designed
a CauseRec framework that utilizes Counterfactual to enhance representations in the data distribution,
aiming to mitigate data sparsity.</p>
        <p>In summary, Counterfactual Analysis can help people understand complex deep learning decision
systems and has the potential to analyze how various factors interact in recommendation systems.
Given the current advancements in Quantum Computers, utilizing Counterfactual Analysis combined
with the ability of Quantum Computers to handle NP problems presents a promising direction.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology</title>
      <sec id="sec-3-1">
        <title>3.1. Preliminary</title>
        <sec id="sec-3-1-1">
          <title>3.1.1. QUBO Formulation</title>
          <p>In this work, we follow the approach described in [18], which utilizes Quantum Annealing for feature
selection. To apply these methods, the feature selection problem is formulated as a Quadratic
Unconstrained Binary Optimization (QUBO) problem. The QUBO formulation can be used to solve certain NP
and NP-hard optimization problems and is defined as follows [18]:</p>
          <p>min  =  ,
where  is a binary vector of length , with each element of the vector being either 0 or 1.  is
a symmetric matrix, where each element represents the relationship between the elements of . 
denotes the number of features to be selected. In other words, the elements of vector  indicate whether
the corresponding features are selected, and the elements in  influence the search direction of the
function, determining feature selection.</p>
        </sec>
        <sec id="sec-3-1-2">
          <title>3.1.2. Feature Selection Based on Mutual Information</title>
          <p>Following [18], Mutual Information QUBO (MIQUBO) is a quadratic feature selection model based
on Mutual Information. MIQUBO aims to maximize the Mutual Information, which measures the
dependency between two variables, and the Conditional Mutual Information, which measures the
dependency between two variables given a target variable, of the selected features. In this context, the
matrix  in Equation 1 is defined as:
(1)
(2)
(3)
where MI(; ) is the Mutual Information between feature  and target feature , and CMI(;  |  )
is the Conditional Mutual Information between feature  and target feature  given feature  . Since
QUBO formulation is used to find the minimum state, a negative sign is required before MI and CMI.</p>
          <p>To control the number of selected features, a penalty term is added to Equation 1, which is then
transformed to:
min  =   +
︃( 
∑︁  − 
=1
)︃2
.</p>
          <p>This formula will be minimized when selecting  features, this also following the descriptions in [18].</p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Counterfactual Analysis</title>
        <p>To better identify features directly associated with recommendation performance, we integrate a widely
used recommendation ranking metric into Mutual Information through Counterfactual Analysis.</p>
        <p>E = nDCG(F) − nDCG(F∖),
where  represents the change in the nDCG result of the recommendation model  after removing
the feature . nDCG(F) represents the nDCG@10 value obtained by the  using all item features set
 , while nDCG(F∖) represents the nDCG@10 value obtained by the  using features set which is set
 removing feature . It is important to note that  ultimately reflects the impact of feature  on the
result. Since the final outcome is influenced by the interactions between all features, simply removing
features with positive  values does not yield the optimal feature selection solution.</p>
        <p>When  ≥ 0, it indicates that the algorithm’s performance decreases after removing the feature .
The extent of this decrease reflects the positive impact of this feature on the algorithm. Conversely, an
increase in the value reflects the negative impact of this feature on the algorithm. We hypothesize that if
the selected set of features is ( * ), the maximization the sum of  ( ∈ ( * )), the maximization
the performance improvement of the baseline algorithm. Since the QUBO problem is a minimization
optimization problem, we redefine  as follows:</p>
        <sec id="sec-3-2-1">
          <title>3.2.1. Counterfactual Analysis for Feature Selection</title>
          <p>Counterfactual Analysis [23] is usually used to examine the causal relationship between conditions,
decisions, and outcomes by hypothesizing how the results of observed events would change if the
conditions and decisions were altered. In the field of Recommender System, Counterfactual Analysis is
often used for the interpretability of recommendation models, helping researchers enhance algorithm
performance [32, 33]. Inspired by existing works [32, 33], the impact of item features can be explored
by excluding the corresponding feature and analyzing the diference in recommendation performance
between the recommendation lists generated by the model with and without the corresponding feature.</p>
          <p>In this work, we use the widely used Item-KNN recommendation algorithm, termed as model , and
employ the recommendation performance metric Normalized Discounted Cumulative Gain (nDCG) [24]
for Counterfactual Analysis. nDCG is defined as:
(4)
(5)
(6)
(7)
{︃−  (;  |  )</p>
          <p>if  ̸= 
−  (; ) −  E if  = 
where  is a coeficient used to control the influence of  on the search results. The larger the value of
 , the greater the influence of  on the final results. The overall process of the above algorithm, which
we refer to as Counterfactual Analysis QUBO (CAQUBO), is as follows in Algorithm 1.</p>
        </sec>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Handling Large Feature Set</title>
        <p>Although Quantum Computers are developing rapidly, the limitation in the number of qubits restricts
them to handling only a limited number of feature selection problems. For selecting from 500 features,
we partition them into several subsets and use Quantum Annealing (QA) or Simulated Annealing (SA)
to perform feature selection on these subsets individually, then combine the results.</p>
        <p>First, partition the 500 features into  subsets by order, 1, 2, · · · , , · · · , , where  is the -th
subset of features, and  is the number of subsets.</p>
        <p>1, 2, · · · , , · · · ,  = divide(F)</p>
        <p>˜ = ⋃︁ QA/SA(),</p>
        <p>=1
Then, use Quantum Annealing (QA) or Simulated Annealing (SA) to perform feature selection on each
subset, and combine the results:
where ˜ is the final selected features set, represents each partitioned subset of features, and QA/SA
(_) represents the selected features from subset  using QA and SA. The final feature set is obtained
by merging the selected features from all subsets.</p>
        <p>Algorithm 1 Counterfactual Analysis QUBO</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Experimental Setup</title>
      <p>Datasets: In this work, two tasks are undertaken: the first involves selecting appropriate features from
a set of 150 item features for training , and the second involves selecting features from a set of 500
item features. Three data sets are provided for these tasks: 150_ICM, 500_ICM, and URM. The 150_ICM
and 500_ICM contain item features, while the URM includes interaction data between 1,890 users and
18,022 interacted items.</p>
      <p>Experimental parameter setting: We used a self-implemented Item-KNN recommendation model
based on the problem statement to calculate . The interaction data was split into training and test sets
in an 80:20 ratio. It is worth noting that calculating  is very time-consuming, so we only used a subset
of items for the calculations. In the use of Quantum Annealing (QA) and Simulated Annealing(SA), the
coeficient  significantly afects the features selected by QA and SA. Due to the limited usage time of
the Quantum Annealer (QA), it is necessary to use Simulated Annealing (SA) to explore the efectiveness
of the selected features under diferent parameters  and  before using QA. In preliminary experiment,
we attempt [ : 0, 1e1, 1e3, 1e5, 1e7], [k: 50, 100, 130, 140, 145] in Feature 150 and [ : 0, 1e1,
1e3, 1e5, 1e7], [k: 300, 350, 400, 450, 470] in Feature 500. For the selection of 500 features, n (is
mentioned in Section 3.3) is set to 5. The preliminary experiment results can be found in Table 1.
Repeated Calculations: Due to the heuristic nature of Simulated Annealing (SA) and Quantum
Annealing (QA), the final results may vary even with fixed parameters. To mitigate this efect, we
perform multiple iterations of QA and SA under the same parameters and select the final feature set via
voting. For example, we repeated the experiment five times.  was not included in  * in any of the
ifve experiments, while  was included in  * in four out of the five experiments. Therefore, the final
submitted feature set  * does not include  but includes  .</p>
      <p>Parameters set
k=140  =1e7  =1e-5
k=140  =1e7  =1e-3
k=140  =1e7  =1e-3
.</p>
      <p>k
0
1
50
This table contains the final data submitted to the organizers, with data sourced from the organizers’ website1.
Due to the fact that when  is too large, the values of elements in Q become excessively large, which is detrimental
to the performance of QA and SA, a coeficient  is applied to all elements in Q. An asterisk (*) after the sub_ID
indicates that the selected features are the result of repeated calculations. Those submissions was repeated five
times to determine the final feature set.</p>
      <p>150 Feature submissions</p>
      <p>All Feature nDCG 0.0810</p>
      <p>Annealing Time</p>
      <p>Type
nº features sub_id</p>
      <sec id="sec-4-1">
        <title>1 https://qclef.dei.unipd.it/clef2024-results.html</title>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Results</title>
      <p>diferent parameters  and . When  = 0, QA and SA select features based solely on Mutual Information
(MI) and Conditional Mutual Information (CMI). Across diferent values of parameter , the performance
of selected features in  rarely surpasses the performance in Counterfactual Analysis QUBO. As the
parameter  increases, the performance of the features selected by QA and SA in the item-KNN shows
significant improvement compared to using all features. The efectiveness of feature selection shows no
significant improvement when  &gt;</p>
      <p>15 . This may be because as the value of  increases, the impact
of MI and CMI on feature selection diminishes, causing QA and SA to rely entirely on  for feature
selection.
 into , the features selected by QA and SA show a significant performance improvement in item-KNN
compared to using all features. An unusual observation is that, under the same parameters, the features
selected by QA generally do not perform as well as those selected by SA in item-KNN, and sometimes
do not even surpass the performance of using all features. During the experiments, we noticed that this
is due to QA often returning results before finding the optimal solution.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusions and Future Work</title>
      <p>In this paper, we present the explorations conducted by our team and the details of our final submission
for the QuantumCLEF 2024 activities. We used Counterfactual Analysis of individual item features to
select appropriate features for item-KNN using Quantum Annealing. Our preliminary experiments
and the results returned by QuantumCLEF 2024 demonstrated that our use of Counterfactual Analysis
significantly improved the performance of item-KNN.</p>
      <p>Within the limited time of QuantumCLEF, we attempted Counterfactual Analysis of individual
features. However, because the performance of collaborative filtering is actually the result of feature
interactions, Counterfactual Analysis of individual features has significant limitations. Additionally,
since Quantum Annealing cannot directly handle the selection of 500 features, we adopted a sequential
partitioning and merging approach. As negative features are not uniformly distributed by their indices
among all features, this sequential partitioning and merging method still requires improvement.
[13] S. Marchesin, A. Purpura, G. Silvello, Focal elements of neural information retrieval models. an
outlook through a reproducibility study, Information Processing &amp; Management 57 (2020) 102109.
[14] E. Strubell, A. Ganesh, A. McCallum, Energy and policy considerations for deep learning in nlp,
arXiv preprint arXiv:1906.02243 (2019).
[15] Y. Himeur, A. Alsalemi, A. Al-Kababji, F. Bensaali, A. Amira, C. Sardianos, G. Dimitrakopoulos,
I. Varlamis, A survey of recommender systems for energy eficiency in buildings: Principles,
challenges and prospects, Information Fusion 72 (2021) 1–21.
[16] G. Adomavicius, J. Zhang, Impact of data characteristics on recommender systems performance,</p>
      <p>ACM Transactions on Management Information Systems (TMIS) 3 (2012) 1–17.
[17] Y. Lu, A. Sigov, L. Ratkin, L. A. Ivanov, M. Zuo, Quantum computing and industrial information
integration: A review, Journal of Industrial Information Integration (2023) 100511.
[18] M. Ferrari Dacrema, F. Moroni, R. Nembrini, N. Ferro, G. Faggioli, P. Cremonesi, Towards feature
selection for ranking and classification exploiting quantum annealers, in: Proceedings of the 45th
International ACM SIGIR Conference on Research and Development in Information Retrieval,
2022, pp. 2814–2824.
[19] F. Glover, G. Kochenberger, Y. Du, Quantum bridge analytics i: a tutorial on formulating and using
qubo models, 4or 17 (2019) 335–371.
[20] G. Pilato, F. Vella, A survey on quantum computing for recommendation systems, Information 14
(2022) 20.
[21] A. Pasin, M. Ferrari Dacrema, P. Cremonesi, N. Ferro, QuantumCLEF 2024: Overview of the
Quantum Computing Challenge for Information Retrieval and Recommender Systems at CLEF,
in: Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), Grenoble,
France, September 9th to 12th, 2024, 2024.
[22] A. Pasin, M. Ferrari Dacrema, P. Cremonesi, N. Ferro, Overview of QuantumCLEF 2024: The
Quantum Computing Challenge for Information Retrieval and Recommender Systems at CLEF, in:
Experimental IR Meets Multilinguality, Multimodality, and Interaction - 15th International
Conference of the CLEF Association, CLEF 2024, Grenoble, France, September 9-12, 2024, Proceedings,
2024.
[23] J. Pearl, M. Glymour, N. P. Jewell, Causal inference in statistics: A primer, John Wiley &amp; Sons, 2016.
[24] K. Järvelin, J. Kekäläinen, Cumulated gain-based evaluation of ir techniques, ACM Transactions
on Information Systems (TOIS) 20 (2002) 422–446.
[25] L. Bittel, M. Kliesch, Training variational quantum algorithms is np-hard, Physical review letters
127 (2021) 120502.
[26] S. S. Gill, A. Kumar, H. Singh, M. Singh, K. Kaur, M. Usman, R. Buyya, Quantum computing: A
taxonomy, systematic review and future directions, Software: Practice and Experience 52 (2022)
66–114.
[27] R. Nembrini, M. Ferrari Dacrema, P. Cremonesi, Feature selection for recommender systems with
quantum computing, Entropy 23 (2021) 970.
[28] A. Nikitin, A. Chertkov, R. Ballester-Ripoll, I. Oseledets, E. Frolov, Are quantum computers practical
yet? a case for feature selection in recommender systems using tensor networks, arXiv preprint
arXiv:2205.04490 (2022).
[29] S. Verma, V. Boonsanong, M. Hoang, K. E. Hines, J. P. Dickerson, C. Shah, Counterfactual
explanations and algorithmic recourses for machine learning: A review, arXiv preprint arXiv:2010.10596
(2020).
[30] M. L. Olson, R. Khanna, L. Neal, F. Li, W.-K. Wong, Counterfactual state explanations for
reinforcement learning agents via generative deep learning, Artificial Intelligence 295 (2021) 103455.
[31] K. H. Tran, A. Ghazimatin, R. Saha Roy, Counterfactual explanations for neural recommenders, in:
Proceedings of the 44th International ACM SIGIR Conference on Research and Development in
Information Retrieval, 2021, pp. 1627–1631.
[32] J. Tan, S. Xu, Y. Ge, Y. Li, X. Chen, Y. Zhang, Counterfactual explainable recommendation, in:
Proceedings of the 30th ACM International Conference on Information &amp; Knowledge Management,
2021, pp. 1784–1793.
[33] S. Zhang, D. Yao, Z. Zhao, T.-S. Chua, F. Wu, Causerec: Counterfactual user sequence synthesis
for sequential recommendation, in: Proceedings of the 44th International ACM SIGIR Conference
on Research and Development in Information Retrieval, 2021, pp. 367–377.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>X.</given-names>
            <surname>Su</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. M.</given-names>
            <surname>Khoshgoftaar</surname>
          </string-name>
          ,
          <article-title>A survey of collaborative filtering techniques</article-title>
          ,
          <source>Advances in artificial intelligence 2009</source>
          (
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Lebanon</surname>
          </string-name>
          ,
          <article-title>A comparative study of collaborative filtering algorithms</article-title>
          ,
          <source>arXiv preprint arXiv:1205.3193</source>
          (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>N.</given-names>
            <surname>Koenigstein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Ram</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Shavitt</surname>
          </string-name>
          ,
          <article-title>Eficient retrieval of recommendations in a matrix factorization framework</article-title>
          ,
          <source>in: Proceedings of the 21st ACM international conference on Information and knowledge management</source>
          ,
          <year>2012</year>
          , pp.
          <fpage>535</fpage>
          -
          <lpage>544</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>D. A.</given-names>
            <surname>Adeniyi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Wei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yongquan</surname>
          </string-name>
          ,
          <article-title>Automated web usage data mining and recommendation system using k-nearest neighbor (knn) classification method</article-title>
          ,
          <source>Applied Computing and Informatics</source>
          <volume>12</volume>
          (
          <year>2016</year>
          )
          <fpage>90</fpage>
          -
          <lpage>108</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>B.</given-names>
            <surname>Hidasi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Karatzoglou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Baltrunas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Tikk</surname>
          </string-name>
          ,
          <article-title>Session-based recommendations with recurrent neural networks</article-title>
          ,
          <source>arXiv preprint arXiv:1511.06939</source>
          (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A.</given-names>
            <surname>Vaswani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Shazeer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Parmar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Uszkoreit</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Jones</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. N.</given-names>
            <surname>Gomez</surname>
          </string-name>
          , Ł. Kaiser,
          <string-name>
            <surname>I. Polosukhin</surname>
          </string-name>
          ,
          <article-title>Attention is all you need</article-title>
          ,
          <source>Advances in neural information processing systems</source>
          <volume>30</volume>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>X.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Feng</surname>
          </string-name>
          , T.-S. Chua,
          <article-title>Neural graph collaborative filtering</article-title>
          ,
          <source>in: Proceedings of the 42nd international ACM SIGIR conference on Research and development in Information Retrieval</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>165</fpage>
          -
          <lpage>174</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>X.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Deng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>Lightgcn: Simplifying and powering graph convolution network for recommendation</article-title>
          ,
          <source>in: Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>639</fpage>
          -
          <lpage>648</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>F.</given-names>
            <surname>Yuan</surname>
          </string-name>
          , G. Guo,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Jose</surname>
          </string-name>
          , L. Chen,
          <string-name>
            <given-names>H.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , Lambdafm:
          <article-title>Learning optimal ranking with factorization machines using lambda surrogates</article-title>
          ,
          <source>in: Proceedings of the 25th ACM international on conference on information and knowledge management</source>
          ,
          <year>2016</year>
          , pp.
          <fpage>227</fpage>
          -
          <lpage>236</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>G.</given-names>
            <surname>Adomavicius</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Tuzhilin</surname>
          </string-name>
          ,
          <article-title>Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions</article-title>
          ,
          <source>IEEE transactions on knowledge and data engineering 17</source>
          (
          <year>2005</year>
          )
          <fpage>734</fpage>
          -
          <lpage>749</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>R.</given-names>
            <surname>Lopes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Assunção</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. L.</given-names>
            <surname>Santos</surname>
          </string-name>
          ,
          <article-title>Eficient bayesian methods for graph-based recommendation</article-title>
          ,
          <source>in: Proceedings of the 10th ACM Conference on Recommender Systems</source>
          ,
          <year>2016</year>
          , pp.
          <fpage>333</fpage>
          -
          <lpage>340</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. S.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Park</surname>
          </string-name>
          , Gram:
          <article-title>Fast fine-tuning of pre-trained language models for content-based collaborative filtering</article-title>
          ,
          <source>arXiv preprint arXiv:2204.04179</source>
          (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>