<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Comparison of the Effectiveness of Various Algorithms on a Recommendation System</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Bektemyssova Gulnara</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Akhmer Yerassyl</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>International Information Technology University</institution>
          ,
          <addr-line>Almaty</addr-line>
          ,
          <country country="KZ">Kazakhstan</country>
        </aff>
      </contrib-group>
      <fpage>122</fpage>
      <lpage>130</lpage>
      <abstract>
        <p>Recommender systems attempt to identify user information by proposing related products or resources that customers may be interested. Recommender methods have attracted attention in the fields of information technology, ecommerce, and so on, by essentially fertilizing from a standard collection of decisions that led consumers to find information of interest. This research focuses on the three common recommendation systems: Collaborative Filtering, Content-Based Filtering, and Hybrid recommendation systems. For the purposes of this analysis, the well-known MovieLens dataset has been used. The assessment considered both the quantitative and qualitative dimensions of the recommendation systems. This paper describes the field of various recommendation approaches and related fundamental techniques. Any algorithm in this field has both benefits and drawbacks. The goal of the research is to bring various algorithms to the test in order to find the right one based on the layout of the dataset and the researchers' goals.</p>
      </abstract>
      <kwd-group>
        <kwd>Recommender Systems</kwd>
        <kwd>Collaborative Filtering</kwd>
        <kwd>Content-Based Filtering</kwd>
        <kwd>E-Commerce</kwd>
        <kwd>Hybrid Recommendation System</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Recommender systems are an integral part of e-commerce today. The active
transition from traditional offline sales to online makes the introduction of
machine learning technologies and algorithms for recommendations more
and more popular in retail. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Recommendations simplify shopping for store
customers, and allow sellers to increase customer loyalty by saving time and
an individual approach to product offerings, as well as increasing the product
matrix and average customer check. Unlike e-commerce, grocery chains do not
represent how customers react to promoted products in real time. However, thanks
to loyalty programs and check databases, it is possible to build a recommendation
system from scratch. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>In this paper, we will look at various concepts of recommender systems. We
will introduce how they perform, define their theoretical background, and start
debating their strengths and limitations for each of them. A comparative analysis
of these algorithms is carried out from the point of view of the criteria of the
accuracy of the results obtained and the performance.</p>
      <p>In the first part, we will address the two main methodologies of
recommender systems: collaborative and content-based approaches. The following two parts
would then go through different collaborative filtering methods, such as
useruser, item-item, and matrix factorization. The part that follows presents
contentsbased approaches and their operation. Finally, we will go over how to assess a
recommender system.</p>
      <p>
        In retail, three types of recommendations are commonly used: content,
collaborative, and hybrid. Recommendation systems are frequently divided into
three large categories:
• Content-Based systems, which are using keywords to propose products to
a client that are close to those historically favored [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ];
• Collaborative Filtering methods, that propose products based on
information recently seen or purchased.
• Hybrid Recommendation methods, which provide a variation of
ContentBased and Collaborative Filtering techniques to overcome some of the
shortcomings that occur in the above-mentioned systems.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Approaches</title>
      <sec id="sec-2-1">
        <title>2.1 Collaborative filtering</title>
        <p>
          Collaborative recommendation is quite certainly the commonly used, and
advanced of the approaches. Collaborative recommender frameworks combine
item ratings or suggestions, identify common threads among customers based
on their scores, and produce new suggestions based on inter-user correlations.
This approach may be Memory-Based Collaborative Filtering, which measures
customer’s access using similarity or other metrics, or Model-Based Collaborative
Filtering, that derives a template from past prescriptive analytics and uses it to
make forecasts. [
          <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
          ].
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2 Content-based filtering</title>
        <p>
          Even though Collaborative Filtering is well known and effective, it has
drawbacks. One of them is the sparsity dilemma, which happens when users
give no scores; throughout this situation, our model is unable to produce fair
suggestions. To address the sparsity problem, study suggests Content-based
Recommender Systems, which are focused on the analysis of adjunct data such
as text, photographs, and videos, as well as customers’ accounts. [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. Assume
anyone loves science fiction, romance, and action films but not fantasy films.
Through period, the algorithm could collect this knowledge and decide that the
client has a high approval rating for genres such as science fiction, romance,
and action, and a negative rating for fantasy. The algorithm could even discover
which actors the client likes and dislikes. Also with tiny remarks, the customer’s
choice may be inferred in this manner. The critical point between Content Based
Filtering and Collaborative Filtering is that Collaborative Filtering proposes new
products depending on the taste of the customers who have common preferences
for many other products, while Content Based Filtering is focused on the analysis
of source data and is not associated with the expectations of many other clients.
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3 Hybrid recommender system</title>
        <p>The term “hybrid recommendation strategy” applies to a recommendation
system that employs two or more sources of recommendation methods in order to
achieve better results while minimizing the disadvantages of each particular one.
Collaborative filtering is often paired with another method.
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Related works</title>
      <p>
        When working with items containing textual data, content-based systems
yield outcomes that are more accurate. However, these systems are incapable
of distinguishing between a well-written text definition and a poorly written
one, particularly when similar or different phrases are used [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Furthermore,
these systems are sometimes constrained by the over similarity issue; when a
system suggests products that have a higher correlation to a customer’s profile,
the client is likely to be recommended with products that are identical to those
which have already been seen [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Besides that, when a new customer enters
in the system with little or no rankings, he or she is very likely to be given low
accuracy suggestions (this is recognized as the cold-start or new-user problem)
[
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. As mentioned in [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Content-based systems need a great amount of scores
before recommending products to a consumer with high precision. Collaborative
Filtering methods, in comparison to content-based systems, result in bias due
to the sparsity problem [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Since the amount of items on e-commerce websites
is immense, the most frequent users normally rank only a portion of the given
data. It implies that some of the most common products have very few scores
and therefore have a low probability of being suggested by the system [
        <xref ref-type="bibr" rid="ref8 ref9">8, 9</xref>
        ].
Collaborative Filtering systems, like Content-based systems, should have a large
number of relevant data on a user account before producing correct predictions.
Furthermore, new products must be assessed by a wide range of users; otherwise,
the RS would be unable to offer suggestions for items [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. In specific, RS face
technical challenges; given the massive quantities of data available on websites
and apps, a significant amount of computing effort has been put to generate
suggestions [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
    </sec>
    <sec id="sec-4">
      <title>Preliminary experiments</title>
      <p>For preliminary study, we used the ‘MovieLens 1M Dataset.’ The dataset includes
1,000,209 anonymous reviews of roughly 3,900 movies submitted by 6,040
MovieLens subscribers who entered the site in 2000. We explicitly selected two
documents: ratings and movies. There were four fields in the ratings file. They
are as follows: UserID (scale from 1 to 6040), MovieID (varies from zero to
3952), Ratings (a 5-star ranking), and Timestamp (in seconds after the epoch).
Each consumer does have at least 20 ratings. There were three basic forms in the
movies log. They are as follows: MovieID, Title, and Genres. Titles are much the
same as given by IMDB (including year of release). Genres are tube and chosen
from the categories listed: Children’s, Comedy, Crime, Documentary, Drama,
Fantasy, Film-Noir, Horror, Musical, Mystery, Romance, Sci-Fi, Thriller, War
and Western.</p>
      <p>We conducted preliminary research study on the datasets. Figure 1 depicts
the histogram of average ratings posted by customers. As we can see, this plot
resembles a normal distribution with a strong left tail. The majority of users have
average scores between 3.5 and 4.</p>
      <p>Fig. 2 depicts a histogram of user-rated products. According to these two
graphs, most consumers score just a few objects.</p>
    </sec>
    <sec id="sec-5">
      <title>Results and discussion</title>
      <p>Quantitative analysis starts by examining the RMSE and MAE errors of a
Collaborative Filtering-based and a Hybrid system. Since the Content-Based
Filtering approach has quite a statistical attribute. In this section, we select the
toprecommended movies from both methods for ten clients and compute the RMSE
errors for each method for analysis. The RMSE graph for ten clients in Fig. 3
shows that perhaps the hybrid model has a relatively lower RMSE. Fig. 4’s
typical RMSE plot also illustrates the hybrid system’s supremacy.</p>
      <p>Next, we consider 5 batches of users with each batch containing 5 users for
whom we do the same test. We calculated the MAE of these sets of users that is
shown in Fig. 5 and the comparison shows Hybrid system performs
comparatively better. Fig. 6 shows the average MAE of Collaborative Filtering and Hybrid
Recommendation System.</p>
      <p>Fig. 7 shows that Collaborative Filtering will predict which films a client is
likely to score higher. However and therefore has no possibility of suggesting
related movies to a specific one suited to the consumer? The genres are all around
the place, as shown by the genre section. In this segment, we assume User 1 and
propose the top 20 movies that he is likely to appreciate high.</p>
      <p>A Content-Based Filtering recommendation framework, from the other side,
seems to have the opportunity to relate us so much similar movies to a specified
one, as seen in Figure 8, it has very little insight into whether a client will like that
or not. In this part, we select Movie Name: Toy Story 39 (1995) with Movie ID 1
and propose the top 20 films that are close to the film, Toy Story.
Fig. 8. Top 20 content-based filtering recommendation system recommendations
for a specific film.</p>
      <p>We get the best possible outcome in a hybrid system. In this section, we
identify User ID 1, Movie Toy Story (1995) with Movie ID 1, and suggest the top 20
films that are close to Toy Story and are probably to still be ranked highly by the
User 1. As a result, we may infer that perhaps a hybrid recommendation system
outperforms a separate Collaborative Filtering or Content-Based Filtering
recommendation system from both qualitative and quantitative terms.
6</p>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>Within the same dataset, three techniques were applied in the analysis to build
a recommendation method. By using possibly the best MovieLens dataset, we
examined various recommendation mechanisms such as Collaborative Filtering,
Content – Based Filtering and Hybrid recommendation systems. We contrasted
all three-suggestion mechanisms using a descriptive and analytical assessment of
the dataset. The need for a combined quantitative and qualitative analysis reflects
the fact that Content-Based Filtering processes cannot be easily evaluated.
Furthermore, for any recommender system, the qualitative analysis is vital. In
addition, that is why, in addition to the conventional methodology, we developed
our unique assessment process. We discovered that a hybrid recommendation
system outperforms a traditional recommendation system in all scenarios.
Following the example of the whole study, there have been possibilities for
additional research. In the suggestion method, for instance, we did not take into
account any demographic details about the client. Even so, considering this will
bring more dimension of complexity to the hybrid recommendation framework.
Furthermore, we just addressed genre in our Content-Based Filtering suggestion,
but one should check at production team as well as movie ratings for any further
similarities. A correlation of various Collaborative Filtering-based approaches
and consistency tests can also be of concern.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Kalitin</surname>
            <given-names>D.V.</given-names>
          </string-name>
          <article-title>Artificial neural networks [Electronic resource]: tutorial / Kalitin DV - Electron. Text data</article-title>
          . - Moscow: Misis Publishing House,
          <year>2018</year>
          . - 88 p
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Francesco</surname>
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lior</surname>
            <given-names>R.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>Bracha</given-names>
            <surname>Sh</surname>
          </string-name>
          .
          <article-title>Introduction to Recommender Systems Handbook</article-title>
          . Springer,
          <year>2011</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>35</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Markovsky</surname>
            <given-names>I.</given-names>
          </string-name>
          <string-name>
            <surname>Low-Rank</surname>
            <given-names>Approximation</given-names>
          </string-name>
          : Algorithms, Implementation, Applications, Springer,
          <year>2012</year>
          , ISBN 978-1-
          <fpage>4471</fpage>
          -2226-5.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Takacs</surname>
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pilaszy</surname>
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nemeth</surname>
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tikk</surname>
            <given-names>D.</given-names>
          </string-name>
          (
          <year>March 2009</year>
          ).
          <article-title>Scalable Collaborative Filtering Approaches for Large Recommender Systems (PDF)</article-title>
          .
          <source>Journal of Machine Learning Research</source>
          <volume>10</volume>
          :
          <fpage>623</fpage>
          -
          <lpage>656</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Brusilovsky</surname>
            <given-names>P.</given-names>
          </string-name>
          (
          <year>2007</year>
          ).
          <article-title>The Adaptive Web</article-title>
          . p.
          <fpage>325</fpage>
          .
          <source>ISBN 978-3-540-72078-2.</source>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>6. MovieLens dataset, https://grouplens.org/datasets/movielens</mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Konstan</surname>
            <given-names>J.A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Riedl</surname>
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2012</year>
          ).
          <article-title>Recommender systems: from algorithms to user experience</article-title>
          .
          <source>User Model. User-Adapt. Interact.</source>
          ,
          <volume>22</volume>
          (
          <issue>1-2</issue>
          ):
          <fpage>101</fpage>
          -
          <lpage>123</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Adomavicius</surname>
            <given-names>G.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Tuzhilin</surname>
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2005</year>
          ).
          <article-title>Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions</article-title>
          .
          <source>IEEE Trans. Knowl</source>
          . Data Eng.,
          <volume>17</volume>
          (
          <issue>6</issue>
          ):
          <fpage>734</fpage>
          -
          <lpage>749</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Pu</surname>
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            <given-names>L.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Hu</surname>
            <given-names>R.</given-names>
          </string-name>
          (
          <year>2012</year>
          ).
          <article-title>Evaluating recommender systems from the user's perspective: survey of the state of the art. User Model. User-Adapt. Inter act</article-title>
          .,
          <volume>22</volume>
          (
          <issue>4- 5</issue>
          ):
          <fpage>317</fpage>
          -
          <lpage>355</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Lu</surname>
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Medo</surname>
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yeung</surname>
            <given-names>C. H.</given-names>
          </string-name>
          , Zhang Y.-C., Zhang Z.
          <article-title>-</article-title>
          K., and
          <string-name>
            <surname>Zhou</surname>
            <given-names>T.</given-names>
          </string-name>
          (
          <year>2012</year>
          ).
          <article-title>Recommender systems</article-title>
          .
          <source>Physics Reports</source>
          ,
          <volume>519</volume>
          (
          <issue>1</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>49</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Ning</surname>
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Desrosiers</surname>
            <given-names>C.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Karypis</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>A comprehensive survey of neighborhoodbased recommendation methods</article-title>
          .
          <source>In Recommender Systems Handbook</source>
          , pages
          <fpage>37</fpage>
          -
          <lpage>76</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>