<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Leveraging Emotion Features in News Recommendations</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Nastaran Babanejad</string-name>
          <email>nasba@eecs.yorku.ca</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ameeta Agrawal</string-name>
          <email>ameeta@eecs.yorku.ca</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Heidar Davoudi</string-name>
          <email>heidar.davoudi@uoit.ca</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Aijun An</string-name>
          <email>aan@eecs.yorku.ca</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Manos Papagelis</string-name>
          <email>papaggel@eecs.yorku.ca</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Ontario Tech University</institution>
          ,
          <addr-line>Oshawa</addr-line>
          ,
          <country country="CA">Canada</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>York University</institution>
          ,
          <addr-line>Toronto</addr-line>
          ,
          <country country="CA">Canada</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Online news reading has become very popular as the web provides access to news articles from millions of sources around the world. As a specific application domain, news recommender systems aim to give the most relevant news article recommendations to users according to their personal interests and preferences. Recently, a family of models has emerged that aims to improve recommendations by adapting to the contextual situation of users. These models provide the premise of being more accurate as they are tailored to satisfy the continuously changing needs of users. However, little attention has been paid to the emotional context and its potential on improving the accuracy of news recommendations. The main objective of this paper is to investigate whether, how and to what extent emotion features can improve recommendations. Towards that end, we derive a large number of emotion features that can be attributed to both items and users in the domain of news. Then, we devise state-of-the-art emotion-aware recommendation models by systematically leveraging these features. We conducted a thorough experimental evaluation on a real dataset coming from news domain. Our results demonstrate that the proposed models outperform state-of-the-art non-emotion-based recommendation models. Our study provides evidence of the usefulness of the emotion features at large, as well as the feasibility of our approach on incorporating them to existing models to improve recommendations.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>CCS CONCEPTS</title>
      <p>• Information systems → Recommender systems; Sentiment
analysis; • Computing methodologies → Neural networks.
news recommender systems, contextual information, emotion
features</p>
    </sec>
    <sec id="sec-2">
      <title>INTRODUCTION</title>
      <p>
        Recommender systems (RS) have widely and successfully been
employed in domains as diverse as news and media, entertainment,
e-commerce and financial services, to name a few. The main
utility of such systems is their ability to suggest items to users that
they might like or find useful. Traditionally, research on
recommendation algorithms has focused on improving the accuracy of
predictive models based on a combination of descriptive features
of the items and users themselves (e.g., user behavior, interests and
preferences) and the history of a user’s interactions with the items
through ratings, reviews, clicks and more [
        <xref ref-type="bibr" rid="ref20 ref33 ref34">20, 33, 34</xref>
        ]. However,
little attention has been paid to the emotional context and its relation
to recommendations.
      </p>
      <p>
        While emotions can be manifested in various ways, we focus on
emotions expressed in textual information that is associated with
items or users in the system. For example, the content of a news
article, the content of an online review or the lyrics of a song are
good examples of textual information directly associated with an
item’s emotional context. On the other hand, the emotional profile
of a user can be determined through explicit or implicit feedback of
users to items. Explicit feedback, such as providing ratings and/or
submitting reviews to items, can represent an accurate reflection
of a user’s opinion about the item, but it is considered an intrusive
process that disrupts the user-system interaction and negatively
impacts user experience [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ]. In addition, while it might be
available for certain domains (e.g, product recommendations [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], movie
recommendations [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ], etc.), it is not easily obtainable in domains
such as news, where users typically interact with items at a fast
pace and are less inclined to provide feedback. In the absence,
sparsity or high cost of acquisition of explicit feedback, incorporating
implicit feedback, which is generally abundant and non-intrusive,
might be beneficial. Therefore, we focus on indirectly capturing
the emotional context of users’ activity by monitoring their
interactions with items over time. For instance, one can monitor the
tone of the stories in news article users are reading. Efectively, this
information can be used to model a user’s historical or temporal
emotional profile.
      </p>
      <p>To further motivate this, consider Figure 1 that illustrates the
emotional profiles of two users, U 1 and U2, based on eight basic
emotions, expressed in articles read by them over a period of three
months. One can notice that emotions of sadness and fear are mostly
expressed in the articles read by U1 while other emotions, such as
joy are less expressed. In addition, one can observe trends such as
the expression of anger increasing over time. On the other hand,
for U2, the emotions of joy and trust are mostly expressed and other
emotions, such as disgust are less expressed. Moreover, emotions
of fear and anticipation are increasingly expressed in the articles
read by this user. Although, the emotional tone derived from news
articles read by a user cannot justify the personality and state of
mind of the user, it can be considered as the taste or preference
of the user, where it shows the type of articles they are more
interested in. Inspired by these observations, recent advancement in
methods for emotion detection and the success of emotion-aware
recommendation algorithms, the main motivation of our research
is to investigate whether, how and to what extent emotion features
can improve the accuracy of recommendations.</p>
      <p>The Problem. More formally, the recommendation task can be
described as follows. Let a set of  users U = {1, 2, ...,  } and
a set of  items I = {1, 2, ...,  }. Let us also assume that each
user  has already interacted with a set of items I ⊆ I (e.g.,
consumed news articles). Then, the problem is to accurately
predict the probability ,  with which a user  ∈ U will like item
  ∈ I \ I . The task can also take the form of recommending a
set I ⊆ I \ I of  items that the user will find most interesting
(top- recommendations). For example, in the news domain, the
task is that of recommending an unread article.</p>
      <p>
        Challenges &amp; Approach. In order to evaluate the importance of
the emotional context to recommendations, we had to incorporate
emotional features [
        <xref ref-type="bibr" rid="ref2 ref36 ref45">2, 36, 45</xref>
        ] to state-of-the-art recommendation
algorithms and evaluate their accuracy performance. Figure 2, shows
a schematic diagram of the emotion-aware recommendation
algorithm process we designed, which consists of three main stages: i)
feature engineering, ii) model training, and iii) blending &amp; ensemble
learning. Each of these components, define a number of challenges
that need to be addressed. During feature engineering, we had to
generate a number of features attributed to both users and items.
Emphasis was given in capturing the most important non-emotional
and emotional features for the prediction task. Once features are
extracted, of-the-shelf feature selection methods are employed to
select a subset of them that are more relevant for use in model
construction. During model training, we experiment with a number
of state-of-the-art models for generating recommendations. During
blending &amp; ensemble we combine alternative models to obtain better
predictive models than any of the constituent models alone.
Stage 1
Stage 2
Stage 3
focus of
this paper
items
users
      </p>
      <p>Non-Emotion-based Features
 item-related
 user-related</p>
      <p>Emotion-based Features
 item-related
 user-related
user-specific properties
item-specific properties
user-item interactions
Feature Generation
Feature Extraction
Feature Selection</p>
      <p>Model Training</p>
      <p>Blending</p>
      <p>&amp;</p>
      <p>Ensemble</p>
      <p>Item</p>
      <p>Predictions
Contributions. The major contributions of this paper are as follows:
• We systematically identify, extract and select the most
relevant emotion-based features for use in news
recommendation models. These features are associated with both items
(e.g., news articles) and users (e.g., readers).
• We devise a number of state-of-the-art models for generating
recommendations that incorporate the additional emotion
features. These models include variations of gradient boosting
decision trees, deep matrix factorization methods and deep
neural network architectures. In addition, we use ensembling
methods to increase the predictive performance by blending
or combining the predictions of multiple constituent models.
• We propose EmoRec, an emotion-aware recommendation
model, which demonstrates the best accuracy performance
in news recommendation task. EmoRec itself is an ensemble
model.
• We conduct a thorough experimental evaluation on a real
dataset coming from news domain. Our results demonstrate
that the emotion-aware recommendation models
consistently outperform state-of-the-art non-emotion-based
recommendation models. Our study provides evidence of the
usefulness of the emotion features at large, as well as the
feasibility of our approach on incorporating them to existing
models to improve recommendations.</p>
    </sec>
    <sec id="sec-3">
      <title>RELATED WORK</title>
      <p>
        Prior research has found a range of features to be useful in the
context of news recommender systems, such as user location [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ],
time of the day [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ], demographic information [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ], or article social
media profile [
        <xref ref-type="bibr" rid="ref50">50</xref>
        ]. However, emotion, which is one of the important
elements of human nature that has a significant impact on our
behavior and choices [
        <xref ref-type="bibr" rid="ref49">49</xref>
        ], has received little attention. A number
of studies in the area of psychology, neurology, and behavioral
sciences have shown that individuals’ choices are related to their
feelings and mental moods [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ].
      </p>
      <p>
        In the context of recommender systems, one of the earliest
works [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], pointed out that emotions are crucial for users’
decision making and that users transmit their decisions together with
emotions. Tkalcic et al. [
        <xref ref-type="bibr" rid="ref42">42</xref>
        ] introduced a unifying framework for
using emotions in user interactions with a recommender system,
and suggested that while an implicit approach of user feedback
may be less accurate, it is well suited for user interaction purposes
since the user is not aware of it [
        <xref ref-type="bibr" rid="ref41">41</xref>
        ].
      </p>
      <p>
        While emotions as features have been studied in movie
recommendations [
        <xref ref-type="bibr" rid="ref28 ref29">28, 29</xref>
        ], music recommendations [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] and restaurant
recommendations [
        <xref ref-type="bibr" rid="ref44">44</xref>
        ], to name a few, much less work has explored
the role of emotion features in news recommender systems.
      </p>
      <p>
        Emotion in news articles has been studied for categorizing news
stories into eight emotion categories [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Specifically for
recommender systems, Parizi and Kazemifard [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ] introduced a model for
Persian news utilizing both, the emotion of news as well as user’s
preference. More recently, Mizgajski and Morzy [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ] introduced a
recommender system for recommending news items by leveraging
a multi-dimensional model of emotions, where emotion is derived
through user’s self-assessed reactions (i.e., explicit feedback) which
can be considered as intrusive collection. In contrast to previous
studies, our work focuses on studying the role of emotion features
in news recommender systems using implicit user feedback.
3
      </p>
    </sec>
    <sec id="sec-4">
      <title>FEATURES FOR RECOMMENDATION</title>
      <p>This section describes the feature extraction procedure which is
utilized in our proposed framework. The features are grouped into
two main categories: (i) emotion-based features for items and users,
and (ii) non-emotion-based features for items and users.
3.1</p>
    </sec>
    <sec id="sec-5">
      <title>Emotion-based Features</title>
      <p>The main objective of this paper is to improve the performance of
recommender system by leveraging the user/item emotion features.</p>
      <p>
        Figure 3 shows an example of textual content of items (i.e., an
article) in news domains. As it can be observed, there are several
words such as win and gratifying, expressing the emotion of
happiness. Moreover, interjections such as yay and oh can be indicators
of diferent emotions [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. In this section, we describe how we
extract such features to improve the recommendation system
effectiveness. In order to maintain consistency, each news article is
preprocessed by tokenizing into words, removing the stopwords
and POS-tagging to extract nouns, verbs, adverbs and adjectives. In
particular, we focus on two approaches for computing emotion
features: sentiment analysis, which classifies text into neutral, positive
and negative sentiments, and emotion analysis which categorizes
text into emotions such as happiness, sadness, anger, disgust, fear
and so on. Note that we extract emotion features for both users and
items.
3.1.1 Item Emotion-based Features.
      </p>
      <p>Number of Emotion Words: This feature represents the
number of words in an emotion lexicon (i.e., WordNet-Afect, see Table
1) that occur in the item (i.e., news article) more than once.</p>
      <p>
        Ekman’s Emotion Label: We count the number of emotion
words occurring in the text document for each emotion type
(Ekman’s six emotion categories [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]) and then the text is assigned
an emotion label with the highest number of emotion words
appearing in the text. If more than one emotion category has the
highest count, 0 is assigned to this feature, leaving the next feature
to indicate mixed emotions. A combination of diferent lexicons
(WordNet-Afect and NRC, see Table 1) is used to find the emotion
labels. We use multiple resources to have a bigger set of emotion
words for each emotion.
      </p>
      <p>Mixed Emotions: This feature indicates whether an item has
more than one document-level emotion labels based on Ekman’s
emotion model (i.e., if two or more emotions have the highest score,
this feature is valued at 1, otherwise 0). Since the initial annotation
efort (previous feature) illustrated that in many cases, a sentence
can exhibit more than one emotion, we have an additional category
called mixed emotion to account for all such instances.</p>
      <p>
        Sentiment Feature: The text is classified into three categories:
positive, negative and neutral. We utilize the approach introduced
in [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ] and use SentiWordNet [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>Interjections: This feature counts the number of interjections
in a document. A short sound, word or phrase spoken suddenly to
express an emotion, e.g., oh, look out!, ah, are called interjections1.
Our preliminary analysis found that interjections were common
in quotes in news articles, which can be detected for potential
emotions.</p>
      <p>
        Capitalized Words: This feature counts the number of words
in a document with all uppercase characters. People use capital
1List of interjections derived from: i) https://surveyanyplace.com/
the-ultimate-interjection-list, ii) https://7esl.com/interjections-exclamations,
and iii) https://www.thoughtco.com/interjections-in-english-1692798
words to express an emotion [
        <xref ref-type="bibr" rid="ref43">43</xref>
        ] and make it bold to the readers
(e.g., I said I am FINE).
      </p>
      <p>
        Punctuation: Two features are included to model the
occurrence of question marks and exclamation marks repeated more
than two times in a document. Using punctuation can clarify the
emotional content of the texts that are sometimes easy to miss [
        <xref ref-type="bibr" rid="ref43">43</xref>
        ].
      </p>
      <p>
        Grammatical Markers and Extended Words: This feature
counts the number of times words with a character repeated more
than two times (e.g., haaappy or oh yeah!!????) [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] as excessive
use of letters in a word (e.g., repetition) is one way to emphasize
feelings.
      </p>
      <p>
        Plutchik Emotion Scores: First, we measure the semantic
relatedness score between a word  in the text and an emotion
category   in the NRC lexicon (see Table 1) as follows [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]:
where  ( = 1 . . . ) is the th word of emotion category   .  
is the Pointwise Mutual Information calculated as follows:
 (,  ) = log
 (,  )
 ( ) ( )
where  ( ) and  ( ) are the probabilities that  and  occur
in a text corpus, respectively, and  (,  ) is the probability that
 and  co-occur within a sliding window in the corpus. Finally,
we calculate the average, maximum and minimum of score for all
words in the text for each emotion category and consider each as
an individual feature.
3.1.2 User Emotion-based Features.
      </p>
      <p>As we do not have access to users’ explicit emotion towards items,
we develop users’ implicit emotional profile based on their historical
interactions with items. By computing the emotion profile of the
items with which a user is interacting, we derive the emotional
taste of the user over that period of time over the set of items.</p>
      <p>User Emotions Across Items: We determine the emotion score
(i.e., Plutchik’s emotion scores) for the last accessed item before
subscription as well as for the last 20 items accessed by the user.
Then, we pick the top 3 frequent emotions.</p>
      <p>User Emotions Across Categories: We determine the emotion
of categories of items (e.g., sports in news domain) accessed by a
user by counting the number of items assigned to an emotion in
a specific category, with the most frequent emotion considered as
the emotion of the category. The feature is calculated for the whole
history of the user.
3.2</p>
    </sec>
    <sec id="sec-6">
      <title>Non-Emotion-based Features</title>
      <p>Non-emotion-based features can also be classified into item-based
and user-based features.
3.2.1 Item Non-Emotion-based Features.</p>
      <p>
        Item Topic: We extract topics in the article using Latent
Dirichlet Allocation (LDA) [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. In LDA, each topic is a distribution over
words, and each document is a mixture of topics. The number of
topics for the news articles are 112 , which were chosen empirically
to minimize the perplexity score of the LDA result. Thus, the item
topic is represented by a vector of length 112.
(1)
(2)
      </p>
      <p>
        Topic Label: We use lda2vec [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] to generate and label the topics
in an item (i.e., document), where each generated topic is labeled
by one of its top  words which is most semantically similar to
the other words in the top  word list. We then label the item (i.e.,
document) with the label of the most coherent topic among the top
 topics of the document. The word vector of this label word is
used as the value for this feature.
      </p>
      <p>
        TF-IDF: This feature represents items as n-grams (unigram,
bigram, trigram) with the TF-IDF weighting approach [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ].
      </p>
      <p>Coherence: We first calculate the cosine similarity scores
between all pair of words in an item using word2vec pre-trained word
vectors2, and then record average of similarity scores, standard
deviation of similarity scores, the lowest score that is higher than
the standard deviation, and the highest score that is lower than the
standard deviation as four features.</p>
      <p>
        Potential to Trigger Subscription: This feature represents the
total number of times the item was requested right before a paywall
was presented to a user who subsequently made a subscription [
        <xref ref-type="bibr" rid="ref10 ref11">10,
11</xref>
        ]. In a subscription-based item delivery model a paywall is the
page asking for subscription before allowing an unsubscribed user
to continue accessing items.
3.2.2 User Non-emotion-based Features.
      </p>
      <p>Visit Count: We calculate the average number of items (articles)
accessed by a user per visit. A visit is terminated if a user is inactive
for more than 30 minutes.</p>
      <p>User Spent Time: Two features are represented. One is the
average time the user spent per item, and the other is the average
time the user spent per visit.</p>
      <p>User Interest in Subcategory: This feature represents the
empirical probability of subcategory  given a user  and a category 
denoted as  ( |, ).</p>
      <sec id="sec-6-1">
        <title>2https://code.google.com/archive/p/word2vec/</title>
        <p>For example,  (election|, politics) can be determined by the total
number of articles the user read on election over the total number
of articles that the user read on politics. In our experiments, the
categories and subcategories were provided with the dataset and
we consider only the top 50 most frequently visited subcategories
for this feature.</p>
        <p>
          User Latent Vector: We calculate the latent vector for each user
based on matrix factorization introduced in [
          <xref ref-type="bibr" rid="ref40">40</xref>
          ]. This feature is
chosen so that we can compare our method with the Deep Matrix
Factorization model in [
          <xref ref-type="bibr" rid="ref47">47</xref>
          ], a state-of-the-art recommendation
method, which uses this feature as input for a deep neural network.
3.3
        </p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>Feature Selection</title>
      <p>
        One of the critical steps after feature extraction is to select important
features for recommendation. Table 2 reports the most important
features according to gain importance score for the news data set.
We evaluate feature importance by averaging over 10 training runs
of a gradient boosting machine learning model XGBoost [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] to
reduce variance3. Also, the model is trained using early stopping
with a validation set to prevent over-fitting to the training data. By
using the zero importance function, we find features that have zero
importance according to XGBoost.
4
      </p>
    </sec>
    <sec id="sec-8">
      <title>RECOMMENDATION MODEL</title>
      <p>In this section, we introduce a tailored structure of an
Emotionaware Recommender System Model (EmoRec) for personalized
recommendation. Our final model is an ensemble model of three
models leveraging both emotion/non-emotion-based features. We
describe the structure of the proposed model and the training
methods next.
4.1</p>
    </sec>
    <sec id="sec-9">
      <title>Model Training</title>
      <p>
        Model 1 (Boost Model): Gradient Boosting Decision Tree (GBDT)
methods are among the most powerful machine learning approaches
which have been efectively used in many domains [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] including
recommendation [
        <xref ref-type="bibr" rid="ref48">48</xref>
        ]. The basic idea in GBDT approaches is to
learn a set of base/weak learners (i.e., decision trees) sequentially by
using diferent training splits. More precisely, at each step, we learn
a new base model by fitting it to the error residuals (i.e., diference
between the current model predictions and the actual target values)
at that step. The new model outcome is the previous model outcome
plus the (weighted) new base learner outcome. Eventually, the final
model outcome is the weighted average of all base learners outcome,
where the weights are learned jointly with the base learners. We
train two state-of-the-art GBDT models, namely, XGBoost [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] and
Catboost [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], on our training datasets with the features selected
in Section 3.3 as the input.
      </p>
      <p>XGBoost uses pre-sorted/histogram-based algorithm to compute
the best split while CatBoost uses ordered boosting, a permeation
based algorithm, to learn the weak learners efectively. Moreover,
XGBoost uses one-hot encoding before supplying categorical data,
but CatBoost handles categorical features directly. We train both
models individually (three base models for each). The final model
output (i.e., probability that a user is interested in an item) is the
3Variance refers to the sensitivity of the learning algorithm to the specifics of the
training data (e.g., the noise and specific observations).
combination of all base models outcomes:
6
Õ

 
(3)
where  is the probability that the user is interested in the item
according to base model  and  is the weight of base model 
learned by XGboost/Catboost.</p>
      <p>
        Model 2 (Deep Neural Network (DNN)): Figure 4 shows our
proposed Deep Neural Network architecture for leveraging the
emotion features (and other commonly available features) for the
recommendation purpose. The input is divided into four groups [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]:
i) user non-emotion based features, ii) item non-emotion based
features, ii) user emotion-based features, and iv) item emotion-based
features. For the categorical inputs, we utilize one-hot encoding
(the second layer is look-up embeddings mapping each categorical
feature to a fixed length embedding vector). In the architect “Dense
Layer” can be formalized as: Dense( ) =  (  + bias) where 
and  are parameters,  is the layer input and  is the activation
function (for linear layer  is the identity function). We use 2
regularization to prevent over-fitting in embedding layer and use
back-propagation to learn the parameters.
      </p>
      <p>
        Model 3 (Deep Matrix Factorization (Deep MF)): Inspired by
the models proposed in [
        <xref ref-type="bibr" rid="ref19 ref47">19, 47</xref>
        ], we built our Deep MF (Figure 5)
to leverage extra user/item features (i.e., emotion and non-emotion
features) in the recommendation prediction task. In [
        <xref ref-type="bibr" rid="ref47">47</xref>
        ], they
construct a user-item matrix with explicit ratings and implicit
preference feedback, then with this matrix as the input, they present a
deep neural architecture to learn a low dimensional space for the
representation of both users and items. In [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ], by replacing the
inner product with a neural architecture, they learn an arbitrary
function to capture the interactions between user and item latent
vectors. Diferent from their work, we focused on modeling the
user/item with rich extra features, such as non-emotion and
emotion based features, as well as using embedding vectors learned in
our DNN model. The input of our proposed model is the same as
the DNN model where the categorical features are encoded using
one hot vectors. The second layer is the look-up embedding. In
this layer, we have both MF embedding vectors, which we estimate
through the learning process, and DNN embedding vectors, which
are concatenation of embedding vectors (for each similar input
group) learned from DNN model (they are fixed in this model).
Generalized Matrix Factorization (GMF) layer combines two
embeddings using dot product and applies some non-linearity. Similar
to DNN model, the output of the model is the probability that a
user is interested in an item.
      </p>
      <p>
        Ensemble/Blending Model: The final model EmoRec was the
weighted average of the three models’ predictions. We use
NelderMead Method [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ] to find the optimum weights of each models.
5
      </p>
    </sec>
    <sec id="sec-10">
      <title>EXPERIMENTS</title>
      <p>In this section, we introduce the data, evaluation protocols and the
specific configurations used in our experiments.
Our experiments are conducted on a real-world news dataset. The
Globe and Mail is one of the major newspapers4 in Canada. We use
the data spanning from January to July 2014 (a 6-month period)
in our experiments where the data in the first four months were
used for training, and the last two months for testing. The dataset
contains information for 359,145 articles in total and 88,648 users
in total, out of which 17,009 became subscribers during this period,
and 71,639 were non-subscribers. Every time a user reads an article,
watches a video or generally takes an action in the news portal, the
interaction is recorded as a hit. Typically, a hit contains information
like date, time, user id, visited article, special events of interest like
subscription, sign in, and so on.
5.2</p>
    </sec>
    <sec id="sec-11">
      <title>Evaluation Metrics</title>
      <p>We use F-score to measure the predictive performance of a
recommender system. For each user in the test data set, we use the
original set of read articles in the test period as the ground truth,
denoted as  . Assuming the set of recommended news articles
for the user is  , precision, recall, and F-measure are defined as
follows:</p>
      <p>Precision = | ∩  | , Recall = | ∩  |</p>
      <p>| | | |</p>
      <sec id="sec-11-1">
        <title>4https://www.theglobeandmail.com/</title>
        <p>• Single Boost Model: We run XGBoost and Catboost separately
to make predictions and collect the average of their F-scores.
• Boost Blend: This is the 6-model ensemble described in Model
1 in Section 4.1.
• Deep MF: This is the deep matrix factorization model
described in Section 4.1.
• Single DNN model: We run the DNN model for 5 times with
the same hyperparameters but diferent random seeds and
collect the average result over 5 runs.
• DNN Ensemble: An ensemble of 5 DNN models with diferent
hyperparameters (e.g., diferent learning rates, etc.) is run 5
times each with a diferent random seed. The average result
over the 5 runs is collected.
• Boost Blend + Deep MF: This is an ensemble consisting of</p>
        <p>Boost Blend and Deep MF.
• Boost Blend + DNN Ensemble: This an ensemble consisting
of Boost Blend and DNN Ensemble.
• Deep MF + DNN Ensemble: This is an ensemble consisting of</p>
        <p>Deep MF and DNN Ensemble.
• Boost Blend + Deep MF + DNN Ensemble: an ensemble
consisting of Boost Blend, Deep MF and DNN Ensemble.</p>
        <p>We train each of the above models using the training data of our
data set and use the trained model to make recommendations by
predicting a user’s interest in an item in the test data. Table 3 shows
the results (in F-score) of using these recommendation methods
with and without emotion features on the news data set, where the
whole set of emotion features described in Section 3.3 is used in
the results for "All", while none of the emotion features is used in
the results for "Non-Emo". As can be seen, adding emotion features
improves the predictive performance for all the recommendation
methods. Among the single recommendation models (i.e., Single
Boost Model, Deep MF and Single DNN Model), Deep MF performs
the best. The results also show that ensemble methods perform
better than single/component models. The best performance is
produced by the largest ensemble (i.e., Boost Blend + Deep MF +
DNN Ensemble). We refer to this best-performing model as our
EmoRec model.
5.4</p>
      </sec>
    </sec>
    <sec id="sec-12">
      <title>Comparison with Other Baselines</title>
      <p>We also compare our EmoRec model with the following three
stateof-the-art recommendation methods with well-tuned parameters
(that is, the parameters are optimally tuned to ensure the fair
comparison). The objective is to investigate whether emotion features
can smarten up these recommender systems. A brief description of
these three models is as follows:</p>
      <p>
        Basic MF : This is the simple matrix factorization model where
used for discovering latent features between two entities (i.e., user
and articles) [
        <xref ref-type="bibr" rid="ref40">40</xref>
        ]. Both user preferences and item characteristics are
mapped to latent factor vectors. Each element of the item-specific
factor vector measures the extent to which the item possesses one
feature. Accordingly,each element of the user-specific factor vector
measures the extent of the user preferences in that feature.
      </p>
      <p>
        FDEN and GBDT : an ensemble of diferent models, including
Field-aware Deep Embedding Networks and Gradient Boosting
Decision Trees [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The predictions of FDENs are from a bagging
ensemble using the arithmetic mean of many networks, each of
which has slight diferences on hyper-parameters, including the
forms of the activation.
      </p>
      <p>
        Truncated SVD-based Feature Engineering: a gradient boosted
decision trees model with truncated SVD-based embedding
features [
        <xref ref-type="bibr" rid="ref37">37</xref>
        ]. To overcome the cold start problem, a truncated
SVDbased embedding features were created using the embedding
features with four diferent statistical based features (users, items,
artists and time), the final model was the weighted average of the
ifve models’ predictions.
      </p>
      <p>The results are illustrated in Table 4, which shows that
emotion features can also improve the recommendation performance
of these three state-of-the-art baselines. In addition, our EmoRec
model performs significantly better than these three baselines in
both cases of using emotion features and not using emotion
features.
5.5</p>
    </sec>
    <sec id="sec-13">
      <title>Efect of Individual Emotion Features</title>
      <p>Table 5 presents the results of a feature ablation study in order
to further understand the efect of individual emotion features
used in EmoRec. In each run of this study, we keep all the
features except one type of emotion features. The results indicate that
removing Plutchik emotion scores (item feature), User emotions
across categories and User emotions across items (user features)
lead to considerable decline in the performance. It also shows that
our model is able to capture useful implicit user emotion efectively.</p>
      <p>To further validate the efectiveness of the top emotion features
as learned from our experiments, we run a further experiment
incorporating only the top three emotion features (i.e., Plutchik emotions,
User emotions across categories, and User emotions across items)
on six state-of-the-art recommendation models. As the results in
Table 6 show, only using these three emotion features can also
improve the recommender systems, with Basic MF showing the
most gain.
6</p>
    </sec>
    <sec id="sec-14">
      <title>CONCLUSIONS</title>
      <p>Motivated by the recent development in emotion detection methods
(in textual information), we considered the problem of leveraging
emotion features to improve recommendations. Towards that end,
we derived a large number of emotion features that can be
attributed to both items and users in news domain and can provide an
emotional context. Then, we devised state-of-the-art non-emotion
and emotion-aware recommendation models to investigate whether,
how and to what extent emotion features can improve
recommendations. To the best of our knowledge, this is the first attempt to
systematically and broadly evaluate the utility of a number of
emotion features for the recommendation task. Our results indicate
that emotion-aware recommendation models consistently
outperform state-of-the-art non-emotion-based recommendation models.</p>
      <p>Model
Basic MF
Boost Blend
FDEN and GBDT
Deep MF
Truncated SVD-based
DNN Ensemble
Furthermore, our study provided evidence of the usefulness of the
emotion features at large, as well as the feasibility of our approach
on incorporating them to existing models to improve
recommendations.</p>
      <p>As a more tangible outcome of the study, we proposed EmoRec,
an emotion-aware recommendation model, which demonstrates
the best predictive performance in news recommendation task.
EmoRec itself is an ensemble model combining three models (Boost
Blend + Deep MF + DNN Ensemble). It significantly outperforms
other state-of-the-art recommendation methods evaluated in our
experiments. We also evaluated the proposed emotion features
individually. Among the emotion features examined, the Plutchik
emotion scores of items (obtained by computing PMI scores between
words) and user emotion profiles (based on the emotion scores of
the items that the user accessed) are the most important.</p>
      <p>
        Employing emotional context in recommendations appears to be
a promising direction of research. While the scope of our current
study is limited to emotions extracted by textual information, there
is evidence that emotions can be extracted through other means of
communication, such as audio and video, or other cues [
        <xref ref-type="bibr" rid="ref38">38</xref>
        ].
7
      </p>
    </sec>
    <sec id="sec-15">
      <title>ACKNOWLEDGEMENTS</title>
      <p>This work is funded by Natural Sciences and Engineering Research
Council of Canada (NSERC), The Globe and Mail, and the Big Data
Research, Analytics, and Information Network (BRAIN) Alliance
established by the Ontario Research Fund Research Excellence
Program (ORF-RE). We would like to thank The Globe and Mail for
providing the dataset used in this research. In particular, we thank
Gordon Edall and the Data Science team of The Globe and Mail for
their insights and collaboration in our joint project.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Ameeta</given-names>
            <surname>Agrawal</surname>
          </string-name>
          and
          <string-name>
            <given-names>Aijun</given-names>
            <surname>An</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Unsupervised Emotion Detection from Text Using Semantic and Syntactic Relations</article-title>
          .
          <source>In 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology</source>
          , Vol.
          <volume>1</volume>
          . IEEE, Macau, China,
          <fpage>346</fpage>
          -
          <lpage>353</lpage>
          . https://doi.org/10.1109/WI-IAT.
          <year>2012</year>
          .170
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Ameeta</given-names>
            <surname>Agrawal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Aijun</given-names>
            <surname>An</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Manos</given-names>
            <surname>Papagelis</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Learning Emotionenriched Word Representations</article-title>
          .
          <source>In Proceedings of the 27th International Conference on Computational Linguistics</source>
          .
          <article-title>Association for Computational Linguistics</article-title>
          , Santa Fe, New Mexico, USA,
          <fpage>950</fpage>
          -
          <lpage>961</lpage>
          . https://www.aclweb.org/anthology/C18-1081
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Mostafa</given-names>
            <surname>Al Masum Shaikh</surname>
          </string-name>
          , Helmut Prendinger, and
          <string-name>
            <given-names>Mitsuru</given-names>
            <surname>Ishizuka</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Emotion Sensitive News Agent (ESNA): A system for user centric emotion sensing from the news</article-title>
          .
          <source>Web Intelligence and Agent Systems</source>
          <volume>8</volume>
          ,
          <issue>4</issue>
          (
          <year>2010</year>
          ),
          <fpage>377</fpage>
          -
          <lpage>396</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Stefano</given-names>
            <surname>Baccianella</surname>
          </string-name>
          , Andrea Esuli, and
          <string-name>
            <given-names>Fabrizio</given-names>
            <surname>Sebastiani</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Sentiwordnet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining.</article-title>
          .
          <source>In Lrec</source>
          , Vol.
          <volume>10</volume>
          .
          <fpage>2200</fpage>
          -
          <lpage>2204</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Bing</given-names>
            <surname>Bai</surname>
          </string-name>
          and
          <string-name>
            <given-names>Yushun</given-names>
            <surname>Fan</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Incorporating Field-aware Deep Embedding Networks and Gradient Boosting Decision Trees for Music Recommendation</article-title>
          .
          <source>In The 11th ACM International Conference on Web Search and Data Mining(WSDM)</source>
          . ACM, London, England,
          <volume>7</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>David</surname>
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Blei</surname>
            ,
            <given-names>Andrew Y.</given-names>
          </string-name>
          <string-name>
            <surname>Ng</surname>
            , and
            <given-names>Michael I.</given-names>
          </string-name>
          <string-name>
            <surname>Jordan</surname>
          </string-name>
          .
          <year>2003</year>
          .
          <article-title>Latent Dirichlet Allocation</article-title>
          .
          <source>Journal of Machine Learning Research 3 (March</source>
          <year>2003</year>
          ),
          <fpage>993</fpage>
          -
          <lpage>1022</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Mondher</given-names>
            <surname>Bouazizi</surname>
          </string-name>
          and
          <string-name>
            <given-names>Tomoaki</given-names>
            <surname>Otsuki</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>A Pattern-Based Approach for Sarcasm Detection on Twitter</article-title>
          .
          <source>IEEE Access</source>
          <volume>4</volume>
          (
          <year>2016</year>
          ),
          <fpage>5477</fpage>
          -
          <lpage>5488</lpage>
          . https://doi.org/ 10.1109/ACCESS.
          <year>2016</year>
          .2594194
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Li</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Guanliang</given-names>
            <surname>Chen</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Feng</given-names>
            <surname>Wang</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Recommender Systems Based on User Reviews: The State of the Art. User Modeling and User-Adapted Interaction 25, 2</article-title>
          (
          <year>June 2015</year>
          ),
          <fpage>99</fpage>
          -
          <lpage>154</lpage>
          . https://doi.org/10.1007/s11257-015-9155-5
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Tianqi</given-names>
            <surname>Chen</surname>
          </string-name>
          and
          <string-name>
            <given-names>Carlos</given-names>
            <surname>Guestrin</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>XGBoost: A Scalable Tree Boosting System</article-title>
          .
          <source>In Proceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '16)</source>
          . ACM, New York, NY, USA,
          <fpage>785</fpage>
          -
          <lpage>794</lpage>
          . https://doi.org/10.1145/2939672.2939785
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Heidar</surname>
            <given-names>Davoudi</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Aijun</given-names>
            <surname>An</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Morteza</given-names>
            <surname>Zihayat</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Gordon</given-names>
            <surname>Edall</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Adaptive Paywall Mechanism for Digital News Media</article-title>
          .
          <source>In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery &amp;#38; Data Mining (KDD '18)</source>
          . ACM, New York, NY, USA,
          <fpage>205</fpage>
          -
          <lpage>214</lpage>
          . https://doi.org/10.1145/3219819. 3219892
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>H.</given-names>
            <surname>Davoudi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zihayat</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>An</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Time-Aware Subscription Prediction Model for User Acquisition in Digital News Media</article-title>
          .
          <source>In Proceedings of the 2017 SIAM International Conference on Data Mining. Society for Industrial and Applied Mathematics</source>
          , Houston, Texas, USA,
          <fpage>135</fpage>
          -
          <lpage>143</lpage>
          . https://doi.org/10.1137/1. 9781611974973.16
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>Anna</given-names>
            <surname>Veronika</surname>
          </string-name>
          <string-name>
            <surname>Dorogush</surname>
          </string-name>
          , Vasily Ershov, and
          <string-name>
            <given-names>Andrey</given-names>
            <surname>Gulin</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>CatBoost: gradient boosting with categorical features support</article-title>
          .
          <source>(Oct</source>
          .
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>Paul</given-names>
            <surname>Ekman</surname>
          </string-name>
          .
          <year>1984</year>
          .
          <article-title>Expression and the nature of emotion. Approaches to emotion 3 (</article-title>
          <year>1984</year>
          ),
          <fpage>19</fpage>
          -
          <lpage>344</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Ji</surname>
            <given-names>Feng</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yang Yu</surname>
          </string-name>
          , and
          <string-name>
            <surname>Zhi-Hua Zhou</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Multi-Layered Gradient Boosting Decision Trees</article-title>
          .
          <source>In Advances in Neural Information Processing Systems</source>
          31, S. Bengio,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wallach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Larochelle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Grauman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Cesa-Bianchi</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>R.</given-names>
            <surname>Garnett</surname>
          </string-name>
          (Eds.). Curran Associates, Inc.,
          <fpage>3551</fpage>
          -
          <lpage>3561</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Blaž</surname>
            <given-names>Fortuna</given-names>
          </string-name>
          , Carolina Fortuna, and
          <string-name>
            <given-names>Dunja</given-names>
            <surname>Mladenić</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Real-time news recommender system</article-title>
          .
          <source>In Joint European Conference on Machine Learning and Knowledge Discovery in Databases</source>
          . Springer,
          <fpage>583</fpage>
          -
          <lpage>586</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>Clif</given-names>
            <surname>Goddard</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Interjections and Emotion (with Special Reference to “Surprise”</article-title>
          and “Disgust”).
          <source>Emotion Review</source>
          <volume>6</volume>
          ,
          <issue>1</issue>
          (Jan.
          <year>2014</year>
          ),
          <fpage>53</fpage>
          -
          <lpage>63</lpage>
          . https: //doi.org/10.1177/1754073913491843
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Gustavo</surname>
            <given-names>Gonzalez</given-names>
          </string-name>
          , Josep Lluis de la Rosa, Miquel Montaner, and
          <string-name>
            <given-names>Sonia</given-names>
            <surname>Delfin</surname>
          </string-name>
          .
          <year>2007</year>
          .
          <article-title>Embedding Emotional Context in Recommender Systems</article-title>
          .
          <source>In Proceedings of the 2007 IEEE 23rd International Conference on Data Engineering Workshop (ICDEW '07)</source>
          . IEEE Computer Society, Washington, DC, USA,
          <fpage>845</fpage>
          -
          <lpage>852</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Byeong-Jun</surname>
            <given-names>Han</given-names>
          </string-name>
          , Seungmin Rho, Sanghoon Jun, and
          <string-name>
            <given-names>Eenjun</given-names>
            <surname>Hwang</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Music emotion classification and context-based music recommendation</article-title>
          .
          <source>Multimedia Tools and Applications</source>
          <volume>47</volume>
          ,
          <issue>3</issue>
          (
          <year>2010</year>
          ),
          <fpage>433</fpage>
          -
          <lpage>460</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>Xiangnan</surname>
            <given-names>He</given-names>
          </string-name>
          , Lizi Liao, Hanwang Zhang, Liqiang Nie, Xia Hu, and
          <string-name>
            <surname>Tat-Seng Chua</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Neural Collaborative Filtering</article-title>
          .
          <source>In Proceedings of the 26th International Conference on World Wide Web - WWW '17</source>
          . ACM Press, Perth, Australia,
          <fpage>173</fpage>
          -
          <lpage>182</lpage>
          . https://doi.org/10.1145/3038912.3052569
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>Dhruv</surname>
            <given-names>Khattar</given-names>
          </string-name>
          , Vaibhav Kumar, Manish Gupta, and
          <string-name>
            <given-names>Vasudeva</given-names>
            <surname>Varma</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Neural Content-Collaborative Filtering for News Recommendation</article-title>
          . In NewsIR'18 Workshop. NewsIR@ECIR, Grenoble, France,
          <fpage>1395</fpage>
          -
          <lpage>1399</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Hong</surname>
            <given-names>Joo</given-names>
          </string-name>
          <string-name>
            <surname>Lee</surname>
          </string-name>
          and Sung Joo Park.
          <year>2007</year>
          .
          <article-title>MONERS: A news recommender for the mobile web</article-title>
          .
          <source>Expert Systems with Applications 32</source>
          ,
          <issue>1</issue>
          (
          <year>2007</year>
          ),
          <fpage>143</fpage>
          -
          <lpage>150</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <surname>Christopher</surname>
            <given-names>D</given-names>
          </string-name>
          <string-name>
            <surname>Manning</surname>
            ,
            <given-names>Prabhakar</given-names>
          </string-name>
          <string-name>
            <surname>Raghavan</surname>
            , and
            <given-names>Hinrich</given-names>
          </string-name>
          <string-name>
            <surname>Schütze</surname>
          </string-name>
          .
          <year>2009</year>
          . Introduction to Information Retrieval. (
          <year>2009</year>
          ),
          <fpage>569</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>Jan</given-names>
            <surname>Mizgajski</surname>
          </string-name>
          and
          <string-name>
            <given-names>Mikołaj</given-names>
            <surname>Morzy</surname>
          </string-name>
          . [n.d.].
          <article-title>Afective recommender systems in online news industry: how emotions influence reading choices</article-title>
          .
          <source>User Modeling and User-Adapted Interaction ([n. d.])</source>
          ,
          <fpage>1</fpage>
          -
          <lpage>35</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <surname>Saif</surname>
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Mohammad</surname>
            and
            <given-names>Felipe</given-names>
          </string-name>
          <string-name>
            <surname>Bravo-Marquez</surname>
          </string-name>
          .
          <year>2017</year>
          . WASSA-2017
          <source>Shared Task on Emotion Intensity. In In Proceedings of the EMNLP 2017 Workshop on Computational Approaches</source>
          to Subjectivity, Sentiment, and
          <article-title>Social Media (WASSA)</article-title>
          .
          <source>Association for Computational Linguistics</source>
          , Copenhagen, Denmark,
          <fpage>34</fpage>
          -
          <lpage>49</lpage>
          . https: //arxiv.org/abs/1708.03700
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>Saif</surname>
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Mohammad</surname>
          </string-name>
          and
          <string-name>
            <surname>Peter D. Turney</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Crowdsourcing a Word-Emotion Association Lexicon</article-title>
          .
          <source>Computational Intelligence</source>
          <volume>29</volume>
          ,
          <issue>3</issue>
          (Aug.
          <year>2013</year>
          ),
          <fpage>436</fpage>
          -
          <lpage>465</lpage>
          . https://doi.org/10.1111/j.1467-
          <fpage>8640</fpage>
          .
          <year>2012</year>
          .
          <volume>00460</volume>
          .x
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>Alejandro</given-names>
            <surname>Montes-García</surname>
          </string-name>
          , Jose María Álvarez-Rodríguez, Jose Emilio Labra-Gayo, and
          <string-name>
            <surname>Marcos</surname>
          </string-name>
          Martínez-Merino.
          <year>2013</year>
          .
          <article-title>Towards a journalist-based news recommendation system: The Wesomender approach</article-title>
          .
          <source>Expert Systems with Applications</source>
          <volume>40</volume>
          ,
          <issue>17</issue>
          (
          <year>2013</year>
          ),
          <fpage>6735</fpage>
          -
          <lpage>6741</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <surname>Christopher</surname>
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Moody</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec</article-title>
          .
          <source>arXiv:1605</source>
          .
          <year>02019</year>
          [cs] (May
          <year>2016</year>
          ). http://arxiv.org/abs/ 1605.02019 arXiv:
          <fpage>1605</fpage>
          .
          <year>02019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <surname>Yashar</surname>
            <given-names>Moshfeghi</given-names>
          </string-name>
          , Benjamin Piwowarski, and Joemon M Jose.
          <year>2011</year>
          .
          <article-title>Handling data sparsity in collaborative filtering using emotion and semantic based features</article-title>
          .
          <source>In Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval. ACM</source>
          ,
          <volume>625</volume>
          -
          <fpage>634</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <surname>Ante</surname>
            <given-names>Odić</given-names>
          </string-name>
          , Marko Tkalčič, Jurij F Tasič,
          <string-name>
            <given-names>and Andrej</given-names>
            <surname>Košir</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Predicting and detecting the relevant contextual information in a movie-recommender system</article-title>
          .
          <source>Interacting with Computers</source>
          <volume>25</volume>
          ,
          <issue>1</issue>
          (
          <year>2013</year>
          ),
          <fpage>74</fpage>
          -
          <lpage>90</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>Sylvester</given-names>
            <surname>Olubolu</surname>
          </string-name>
          <string-name>
            <given-names>Orimaye</given-names>
            ,
            <surname>Saadat M. Alhashmi</surname>
          </string-name>
          , and
          <string-name>
            <surname>Siew</surname>
          </string-name>
          Eu-gene.
          <year>2012</year>
          .
          <article-title>Sentiment Analysis Amidst Ambiguities in Youtube Comments on Yoruba Language (Nollywood) Movies</article-title>
          .
          <source>In Proceedings of the 21st International Conference on World Wide Web (WWW '12 Companion)</source>
          . ACM, New York, NY, USA,
          <fpage>583</fpage>
          -
          <lpage>584</lpage>
          . https://doi.org/10.1145/2187980.2188138
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <surname>Yoshihiko</surname>
            <given-names>Ozaki</given-names>
          </string-name>
          , Masaki Yano, and
          <string-name>
            <given-names>Masaki</given-names>
            <surname>Onishi</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Efective hyperparameter optimization using Nelder-Mead method in deep learning</article-title>
          .
          <source>IPSJ Transactions on Computer Vision and Applications</source>
          <volume>9</volume>
          ,
          <issue>1</issue>
          (Nov.
          <year>2017</year>
          ),
          <volume>20</volume>
          . https://doi.org/10.1186/ s41074-017-0030-7
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>Maja</given-names>
            <surname>Pantic</surname>
          </string-name>
          and
          <string-name>
            <given-names>Alessandro</given-names>
            <surname>Vinciarelli</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Implicit human-centered tagging [Social Sciences]</article-title>
          .
          <source>IEEE Signal Processing Magazine</source>
          <volume>26</volume>
          ,
          <issue>6</issue>
          (
          <year>2009</year>
          ),
          <fpage>173</fpage>
          -
          <lpage>180</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>Manos</given-names>
            <surname>Papagelis</surname>
          </string-name>
          and
          <string-name>
            <given-names>Dimitris</given-names>
            <surname>Plexousakis</surname>
          </string-name>
          .
          <year>2005</year>
          .
          <article-title>Qualitative analysis of userbased and item-based prediction algorithms for recommendation agents</article-title>
          .
          <source>Engineering Applications of Artificial Intelligence</source>
          <volume>18</volume>
          ,
          <issue>7</issue>
          (
          <year>2005</year>
          ),
          <fpage>781</fpage>
          -
          <lpage>789</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <surname>Manos</surname>
            <given-names>Papagelis</given-names>
          </string-name>
          , Dimitris Plexousakis, and
          <string-name>
            <given-names>Themistoklis</given-names>
            <surname>Kutsuras</surname>
          </string-name>
          .
          <year>2005</year>
          .
          <article-title>Alleviating the sparsity problem of collaborative filtering using trust inferences</article-title>
          .
          <source>In International Conference on Trust Management</source>
          . Springer,
          <fpage>224</fpage>
          -
          <lpage>239</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>Ali</given-names>
            <surname>Hakimi</surname>
          </string-name>
          Parizi and
          <string-name>
            <given-names>Mohammad</given-names>
            <surname>Kazemifard</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Emotional news recommender system</article-title>
          .
          <source>In 2015 Sixth International Conference of Cognitive Science (ICCS)</source>
          . IEEE,
          <fpage>37</fpage>
          -
          <lpage>41</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>Mikhail</given-names>
            <surname>Rumiantcev</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Music adviser : emotion-driven music recommendation ecosystem</article-title>
          .
          <source>Ph.D. Dissertation</source>
          . Department of Mathematical Information Technology Oleksiy Khriyenko. https://jyx.jyu.fi/handle/123456789/53196
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [37]
          <string-name>
            <surname>Nima</surname>
            <given-names>Shahbazi</given-names>
          </string-name>
          , Mohamed Chahhou, and
          <string-name>
            <given-names>Jarek</given-names>
            <surname>Gryz</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Truncated SVD-based Feature Engineering for Music Recommendation</article-title>
          .
          <source>In The 11th ACM International Conference on Web Search and Data Mining(WSDM)</source>
          . ACM, London, England,
          <volume>7</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [38]
          <string-name>
            <surname>Mohammad</surname>
            <given-names>Soleymani</given-names>
          </string-name>
          , Sadjad Asghari-Esfeden,
          <string-name>
            <given-names>Yun</given-names>
            <surname>Fu</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Maja</given-names>
            <surname>Pantic</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Analysis of EEG signals and facial expressions for continuous emotion detection</article-title>
          .
          <source>IEEE Transactions on Afective Computing</source>
          <volume>7</volume>
          ,
          <issue>1</issue>
          (
          <year>2016</year>
          ),
          <fpage>17</fpage>
          -
          <lpage>28</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [39]
          <string-name>
            <surname>Carlo</surname>
            <given-names>Strapparava</given-names>
          </string-name>
          , Alessandro Valitutti, and others.
          <year>2004</year>
          .
          <article-title>WordNet Afect: an Afective Extension of WordNet.</article-title>
          .
          <string-name>
            <surname>In</surname>
            <given-names>LREC</given-names>
          </string-name>
          , Vol.
          <volume>4</volume>
          .
          <string-name>
            <given-names>European</given-names>
            <surname>Language Resources Association</surname>
          </string-name>
          (ELRA), Lisbon, Portugal,
          <fpage>1083</fpage>
          -
          <lpage>1086</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          [40]
          <string-name>
            <surname>Gábor</surname>
            <given-names>Takács</given-names>
          </string-name>
          , István Pilászy, Bottyán Németh, and
          <string-name>
            <given-names>Domonkos</given-names>
            <surname>Tikk</surname>
          </string-name>
          .
          <year>2008</year>
          .
          <article-title>Investigation of various matrix factorization methods for large recommender systems</article-title>
          .
          <source>In 2008 IEEE International Conference on Data Mining Workshops. IEEE</source>
          ,
          <fpage>553</fpage>
          -
          <lpage>562</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          [41]
          <string-name>
            <surname>Marko</surname>
            <given-names>Tkalčič</given-names>
          </string-name>
          , Urban Burnik, Ante Odić, Andrej Košir, and
          <string-name>
            <given-names>Jurij</given-names>
            <surname>Tasič</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Emotion-aware recommender systems-a framework and a case study</article-title>
          .
          <source>In International Conference on ICT Innovations</source>
          . Springer,
          <fpage>141</fpage>
          -
          <lpage>150</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          [42]
          <string-name>
            <surname>Marko</surname>
            <given-names>Tkalcic</given-names>
          </string-name>
          , Andrej Kosir, Jurij Tasivc, and
          <string-name>
            <given-names>Matevž</given-names>
            <surname>Kunaver</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Afective recommender systems: the role of emotions in recommender systems</article-title>
          . 9-
          <fpage>13</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          [43]
          <string-name>
            <given-names>Oren</given-names>
            <surname>Tsur</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Dmitry</given-names>
            <surname>Davidov</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Ari</given-names>
            <surname>Rappoport</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>ICWSM-A Great Catchy Name: Semi-Supervised Recognition of Sarcastic Sentences in Online Product Reviews.</article-title>
          .
          <source>In In Fourth International AAAI Conference on Weblogs and Social Media. Association for Computational Linguistics</source>
          , Washington, DC,
          <fpage>107</fpage>
          -
          <lpage>116</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          [44]
          <string-name>
            <given-names>Blanca</given-names>
            <surname>Vargas-Govea</surname>
          </string-name>
          , Gabriel González-Serna, and
          <string-name>
            <surname>Rafael</surname>
          </string-name>
          Ponce-Medellın.
          <year>2011</year>
          .
          <article-title>Efects of relevant contextual features in the performance of a restaurant recommender system</article-title>
          .
          <source>ACM RecSys 11</source>
          ,
          <issue>592</issue>
          (
          <year>2011</year>
          ),
          <fpage>56</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          [45]
          <string-name>
            <surname>Karzan</surname>
            <given-names>Wakil</given-names>
          </string-name>
          , Rebwar Bakhtyar, Karwan Ali, and
          <string-name>
            <given-names>Kozhin</given-names>
            <surname>Alaadin</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Improving Web Movie Recommender System Based on Emotions</article-title>
          .
          <source>International Journal of Advanced Computer Science and Applications 6</source>
          ,
          <issue>2</issue>
          (
          <year>2015</year>
          ),
          <article-title>9</article-title>
          . https: //doi.org/10.14569/IJACSA.
          <year>2015</year>
          .060232
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          [46]
          <string-name>
            <given-names>H. G.</given-names>
            <surname>Wallbott</surname>
          </string-name>
          and
          <string-name>
            <given-names>K. R.</given-names>
            <surname>Scherer</surname>
          </string-name>
          .
          <year>1986</year>
          .
          <article-title>How universal and specific is emotional experience? Evidence from 27 countries on five continents</article-title>
          .
          <source>Social Science Information</source>
          <volume>25</volume>
          ,
          <issue>4</issue>
          (Dec.
          <year>1986</year>
          ),
          <fpage>763</fpage>
          -
          <lpage>795</lpage>
          . https://doi.org/10.1177/053901886025004001
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          [47]
          <string-name>
            <surname>Hong-Jian</surname>
            <given-names>Xue</given-names>
          </string-name>
          , Xinyu Dai, Jianbing Zhang, Shujian Huang, and
          <string-name>
            <given-names>Jiajun</given-names>
            <surname>Chen</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Deep Matrix Factorization Models for Recommender Systems</article-title>
          .
          <source>In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence . International Joint Conferences on Artificial Intelligence Organization</source>
          , Melbourne, Australia,
          <fpage>3203</fpage>
          -
          <lpage>3209</lpage>
          . https://doi.org/10.24963/ijcai.
          <year>2017</year>
          /447
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          [48]
          <string-name>
            <surname>Qian</surname>
            <given-names>Zhao</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Yue</given-names>
            <surname>Shi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Liangjie</given-names>
            <surname>Hong</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>GB-CENT: Gradient Boosted Categorical Embedding and Numerical Trees</article-title>
          .
          <source>In Proceedings of the 26th International Conference on World Wide Web (WWW '17)</source>
          .
          <source>International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva</source>
          , Switzerland.
        </mixed-citation>
      </ref>
      <ref id="ref49">
        <mixed-citation>
          [49]
          <string-name>
            <surname>Yong</surname>
            <given-names>Zheng</given-names>
          </string-name>
          , Robin Burke, and
          <string-name>
            <given-names>Bamshad</given-names>
            <surname>Mobasher</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>The Role of Emotions in Context-aware Recommendation</article-title>
          .
          <source>In RecSys workshop in conjunction with the 7th ACM conference on Recommender Systems. RecSys workshop in conjunction with the 7th ACM conference on Recommender Systems</source>
          , Hong Kong, China.,
          <volume>8</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref50">
        <mixed-citation>
          [50]
          <string-name>
            <surname>Morteza</surname>
            <given-names>Zihayat</given-names>
          </string-name>
          , Anteneh Ayanso,
          <string-name>
            <given-names>Xing</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <source>Heidar Davoudi, and Aijun An</source>
          .
          <year>2019</year>
          .
          <article-title>A utility-based news recommendation system</article-title>
          .
          <source>Decision Support Systems</source>
          <volume>117</volume>
          (
          <year>2019</year>
          ),
          <fpage>14</fpage>
          -
          <lpage>27</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>