<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Prediction of Eudaimonic and Hedonic Movie Characteristics From Subtitles</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Elham Motamedi</string-name>
          <email>elham.motamedi@famnit.upr.si</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marko Tkalcic</string-name>
          <email>marko.tkalcic@famnit.upr.si</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Primorska</institution>
          ,
          <addr-line>Koper 6000</addr-line>
          ,
          <country country="SI">Slovenia</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Personalization and explaining the recommendations of recommender systems (RSs) have recently gotten more attention from researchers. We would like to use eudaimonic and hedonic perceptions of users from movies to personalize the movie recommendations. To achieve this goal, first, we need to predict the movies' eudaimonic/hedonic quality features. This research study presents the pipeline and preliminary results for predicting movies' eudaimonic and hedonic characteristics.</p>
      </abstract>
      <kwd-group>
        <kwd>personalization</kwd>
        <kwd>recommender system</kwd>
        <kwd>eudaimonic perception</kwd>
        <kwd>hedonic perception</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Movie recommender systems have been studied extensively in the last two decades [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Roughly,
we distinguish three groups of algorithmic approaches: (i) content-based (CBR), where the
recommendations are based on item characteristics, (ii) collaborative, where the predictions are
based on similarities between users and items [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and (iii) hybrid, where the recommendations
are based on item characteristic and similarities between users and items. This work addresses
CBR. The proposed idea can also be exploited in hybrid approaches. Related work has exploited
a wide variety of item characteristics for movie recommendations, such as genre, year, actors,
directors, etc. While these characteristics have been proven to contain information about user
preferences, we conjecture that item characteristics that have a more intimate reflection of the
user perception of the item might carry more information about user preferences, hence leading
to better recommendations. We aim at investigating a novel set of item characteristics, namely
the hedonic and eudaimonic qualities of a movie. The eudaimonic quality is related to the
meaning or the goal of the life that one pursues, while the hedonic quality is more associated
with the plain pleasure that one experiences [
        <xref ref-type="bibr" rid="ref3 ref4 ref5">3, 4, 5</xref>
        ]. The more meaningful a multimedia item is
for a user, the higher the value of eudaimonic perception is for that person. Similarly, the hedonic
perception score is the degree of plain pleasure that the user experience while consuming the
multimedia item. Therefore, each multimedia item and in particular each movie, can get a score
of eudaimonic/hedonic perceptions for each user. Each item has a distribution of eudaimonic/
hedonic perceptions of users. We have first tried to predict the average eudaimonic/hedonic
qualities for movies. However, for future work, we can try to model users and predict for each
user what would be the value of the eudaimonic/ hedonic perception of a multimedia item. As
a first step towards designing eudaimonia/hedonia-based recommender systems, we need to
be able to label movies with their hedonic and eudaimonic qualities. This paper presents the
preliminary results of a prediction model that marks movies using their subtitles.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        Multimedia items can be described with some characteristics. Typical movie characteristics that
have been used extensively in recent studies are content-centric such as title, genre, director,
actor and length. These characteristics are only related to the item and do not consider how
the users perceive the items. The characteristics of the multimedia items can be calculated
from text, audio, visual modalities or any combinations of these modalities. Deldjoo et al.
[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] have proposed a content-based movie recommender system using diferent sets of such
content-centric features. The weights of the features have been learned based on the behaviour
of the users. They have deployed their recommender system for 13 months and have compared
the results in terms of hit count using diferent feature sets. The features used in this study
have included actors, directors, genre, and keywords that are all related to the content of the
item and not to users’ perceptions. In another study, Deldjoo et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] have computed aesthetic
low-level features from visual information of movie trailers and have exploited these features
in recommender systems. They have shown that their proposed system has outperformed some
base recommender systems using common high-level features such as genre. Chang and Ki
[8] have proposed a new theoretical framework to predict the theatrical movie’s success. They
have found that diferent features, including length of run, objective elements, sequel actor,
budget, genre and release periods, were significantly related to the box ofice performance.
      </p>
      <p>Many studies have been using content-centric items’ characteristics in diferent applications,
including recommender systems. However, in recent years, there has been a growing interest
in the new category of characteristics. Researchers have started to study characteristics of the
items that depend on the users’ perceptions of the items. One of the widely used examples
is induced emotions. Induced emotion is the characteristic of the item and is based on which
emotion it induces in users. Yang et al. [9] have manipulated the surgical images and have
exposed users to the original and manipulated images. They have measured the users’ emotional
reactions and have found that the negative emotional reactions caused by the original images
were higher than the manipulated images. Manipulating the emotional design of diferent
multimedia learning materials and exploring the efect on other parameters such as learning
outcomes and the users’ cognitive process have also been studied in the literature [10, 11].</p>
      <p>
        Other user-centric characteristics, such as induced stress levels and users’ eudaimonic/
hedonic perceptions, have gotten less attention. Eudaimonic/hedonic qualities are concepts that
are mostly studied in the psychology domain. Some works have introduced these concepts in
the domain of recommender systems. Tkalčič and Ferwerda [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] have adapted these concepts to
the domain of movies and have given a score of the eudaimonic/hedonic qualities to each movie
of the dataset. They have found two clusters of users: (i) pleasure seekers, who prefer movies
with higher value of hedonic quality and (ii) meaning seekers, who prefer movies with a higher
score of eudaimonic quality. Chu et al. [12] have proposed an audio recommender system that
chooses the music based on the events found in the text messages of a user’s phone. Their
idea has relied on this assumption that the users’ preference for meaning in music is based on
what is happening to them individually. Further work is needed to explore the efectiveness of
exploiting eudaimonic/ hedonic qualities in other applications including recommender systems.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Preliminary Analysis of the Dataset</title>
      <p>We have used the dataset collected by Puc [13], containing data on 177 users. They have filled-in
a questionnaire and provided more information about five movies they have watched from a
pool of 30 movies. Users’ information contains the following information: genre, education,
age, genre preferences, personality traits, sophistication index factors, and eudaimonic-hedonic
orientation. Users’ perceptions from movies include eudaimonic-hedonic perceptions,
preference score, and REIVO factors. REIVO stands for relative assessment of orientation to extrinsic
motivation rather than intrinsic motivation. If one does an activity for the extrinsic advantages
of the activity rather than inner satisfaction, that leads to a higher score of REIVO. The REIVO’s
main factors are: financial success, popularity, image, community feeling, afiliation, and
selfacceptance. The first three factors appear with positive weight in the formula of REIVO as they
are more related to extrinsic motivation, and the last three factors with negative weight. The
questions related to assessing each factor have been adapted for the movie domains. Genre,
education, age, and genre preferences have been asked explicitly. However, personality traits,
sophistication index factors, and eudaimonic-hedonic perceptions have been calculated from
users’ responses to the respective questions.</p>
      <p>The Big five personality model has been used for describing users’ personality traits. The
iflm sophistication index is composed of three main factors: 1) active engagement, 2) perceptual
abilities, and 3) emotions. Active engagement assesses the degree of being actively engaged
in movies (as an example, one question here has been how often the person reads or searches
on the internet for the things or activities related to movies). The perceptual ability assesses the
ability to judge and notice diferent concepts about movies by watching a movie. For example,
if a person can notice the genre of the movie by watching a movie, that increases the score
of perceptual ability. Emotion is more related to the degree of being emotionally induced
by movies. Another piece of information is the eudaimonic-hedonic orientation of users. It
captures how much a person seeks meaning in movies or to which degree a person likes to
watch movies that are just entertaining. A movie gets a higher score of eudaimonic quality if it
contains a complex or deep meaning, and it gets a higher score of hedonic quality if it is more
entertaining. Eudaimonic-hedonic perception of a movie is how much the user judges a movie
as eudaimonic or hedonic.</p>
      <p>For each user, we have had a score of eudaimonic and hedonic quality. We have been interested
in checking if there is any significant correlation between other users’ characteristics and the
eudaimonic/ hedonic quality. Among diferent genres that users have been interested in, some
have correlated with eudaimonic/hedonic quality significantly. For drama preference, there
has been a significant correlation of +0.7 and −0.7 with eudaimonic and hedonic attributes,
respectively. The correlations with eudaimonic quality for action and comedy preferences
have been -0.5 and -0.4 and with hedonic quality +0.6 and +0.7, respectively. The values of
correlations are not surprising as we expected that users who are more into comedy and action
genres are likely to be from users who are more interested in being entertained than going
deep into the thoughts raised by a movie. We also conjecture that the users who prefer drama
are more meaning-seekers. Therefore, we have expected to have a higher score of eudaimonic
quality for them. The same goes for two other genres of biographies and documentaries. We
have found a correlation of +0.4 and +0.5 with eudaimonic quality and −0.5 for hedonic quality
for two genres of biographies and documentaries, respectively.</p>
      <p>Among many personality models, the big five personality traits have been extensively used
by psychology’s researchers to analyse people’s behaviours. The five personality traits are
openness, conscientiousness, extraversion, agreeableness and neuroticism. We wanted to see if
there is a high correlation between some of these factors and the eudaimonic/hedonic qualities.
Based on the results, users with a higher score of openness traits have been among users with
a higher value of eudaimonic quality. The correlation between eudaimonic and openness has
been calculated as +0.6, and the correlation between hedonic and openness has been −0.4. The
correlation between extraversion and eudaimonic quality has been −0.3, and the correlation
with hedonic quality has been +0.3. The correlation between other traits in the Big five-factor
trait model and eudaimonic hedonic quality has not been noticeable.</p>
      <p>The sophistication index includes three factors. Emotion is one of the factors that highly
correlates with eudaimonic/hedonic quality. The easier the movies can induce emotions in a
person, the higher the value of emotion in the sophistication index. The correlation between
emotion and eudaimonic quality is +0.8, and the correlation between emotion and hedonic
quality is −0.6, which is noticeably high.</p>
      <p>Self-acceptance and community feeling are two factors of REIVO that correlate to
eudaimonic/hedonic perception qualities. Correlation among diferent factors of REIVO and
eudaimonic/ hedonic qualities are demonstrated as a heat map in Figure 1.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Methodology and Results</title>
      <p>We conjecture that the subtitles of the movies include information about the eudaimonic/hedonic
qualities of the movies. Therefore, we have aimed at developing a model that predicts the average
eudaimonic/hedonic scores of movies given by users from the movie subtitles. In order to achieve
this goal, we have considered the prediction problem as regression and classification.</p>
      <p>In the regression problem, the aim is to predict the value of eudaimonic/hedonic scores. In the
classification problem, we try to identify the eudaimonic/hedonic class of the movies. We have
defined the eudaimonic and hedonic classes of the movies: 1) If the movies’ eudaimonic score is
higher than the median and the hedonic score is less than the median, the movie falls into the
eudaimonic category. 2) If the hedonic score is higher than the median, and the eudaimonic
score is less than the median, it falls into the hedonic category. 3) All other movies that do not
belong to the eudaimonic or hedonic classes are labelled as other movies.</p>
      <p>The subtitles are first preprocessed by applying some common preprocessing techniques
used in NLP, including stemming, lemmatization, tokenization and removing stop words. Then
the features have been extracted from the preprocessed subtitles. We have used TF-IDF and
FASTTEXT pretraining models for feature extractions [14]. The list of algorithms that we used
and the pipeline of the work is presented in Figure 2.</p>
      <p>We have used a nested k-fold cross-validation algorithm to evaluate the algorithms and
compare the results with the base algorithms. In the regression algorithms, the predicted label
by the base algorithm is the average of the labels in the training dataset. The base classification
algorithm chooses the most frequent class as the new item’s label.</p>
      <p>The results of the regression algorithms are presented in Table 1. Decision tree, random forest
regressor and XGboost have performed better in terms of root mean square error (RMSE) values
in both eudaimonic and hedonic predictions. However, the mean absolute error (MAE) results
are not always better than the base algorithms for the mentioned algorithms. We conjecture
that the diference is related to the existence of some outliers in the dataset. The results of
the classification algorithms are reported in Table 2. The accuracy and recall values of all
the classification algorithms are the same or better than the baseline. However, the decision
tree, random forest and XGB classifiers are not always better based on precision, F1-score or
ROC AUC values. The achieved results of this study may not be conclusive due to the limitation
of having enough data. For future work, we have a plan to collect more data to improve the
training models.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Discussion and Future Work</title>
      <p>In this work, we devised a model for predicting movies’ eudaimonic/hedonic qualities from
subtitles. To the best of our knowledge, there were no attempts to predict movies’ eudaimonic/
hedonic qualities from subtitles prior to this study. The results of the regression and classification
algorithms are, in most cases, better than the base algorithms. However, we assume that the
results are not conclusive due to not having enough data. For future work, we are going to
collect more data to having more generalizable results. Moreover, we would like to model users
to predict the eudaimonic/hedonic perception of each user for diferent movies. We are also
interested in exploiting users’ eudaimonic/hedonic perceptions in recommender systems for
personalizing the recommendations.
[8] B.-H. Chang, E.-J. Ki, Devising a practical model for predicting theatrical movie success:</p>
      <p>Focusing on the experience good property, Journal of Media Economics 18 (2005) 247–269.
[9] H. Yang, J. Han, K. Min, Eeg-based estimation on the reduction of negative emotions for
illustrated surgical images, Sensors 20 (2020) 7103.
[10] L. Stark, R. Brünken, B. Park, Emotional text design in multimedia learning: A
mixedmethods study using eye tracking, Computers &amp; Education 120 (2018) 185–196.
[11] J. Li, C. Luo, Q. Zhang, R. Shadiev, Can emotional design really evoke emotion in multimedia
learning?, International Journal of Educational Technology in Higher Education 17 (2020)
1–18.
[12] S. L. Chu, S. Brown, H. Park, B. Spornhauer, Towards personalized movie selection for
wellness: Investigating event-inspired movies, International Journal of Human–Computer
Interaction 36 (2020) 1514–1526.
[13] E. Puc, Movie Recommender System – Psychological Constructs and General Movie
Sophistication or Movie Likeability Score and Movie Genre, Ph.D. thesis, University of
Primorska, 2021.
[14] E. Grave, P. Bojanowski, P. Gupta, A. Joulin, T. Mikolov, Learning word vectors for 157
languages, in: Proceedings of the International Conference on Language Resources and
Evaluation (LREC 2018), 2018.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>F.</given-names>
            <surname>Ricci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Rokach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Shapira</surname>
          </string-name>
          ,
          <source>Recommender Systems: Introduction and Challenges, in: Recommender Systems Handbook</source>
          , volume
          <volume>54</volume>
          ,
          <string-name>
            <surname>Springer</surname>
            <given-names>US</given-names>
          </string-name>
          ,
          <year>2015</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>34</lpage>
          . URL: http://www.springerlink.com/index/10.1007/978-0-
          <fpage>387</fpage>
          -
          <fpage>85820</fpage>
          -3http://link.springer.
          <source>com/10.1007/978-1-4899-7637-6_1. doi:1 0 . 1 0</source>
          <volume>0 7 / 9 7 8 - 1 - 4 8 9 9 - 7 6 3 7 - 6</volume>
          _
          <fpage>1</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>M. de Gemmis</surname>
            , P. Lops,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Musto</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Narducci</surname>
          </string-name>
          , G. Semeraro,
          <article-title>Semantics-Aware ContentBased Recommender Systems</article-title>
          , in: Recommender Systems Handbook, Springer US, Boston, MA,
          <year>2015</year>
          , pp.
          <fpage>119</fpage>
          -
          <lpage>159</lpage>
          . URL: http://link.springer.
          <source>com/10.1007/978-1-4899-7637-6_4. doi:1 0 . 1 0</source>
          <volume>0 7 / 9 7 8 - 1 - 4 8 9 9 - 7 6 3 7 - 6</volume>
          _
          <fpage>4</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M. B.</given-names>
            <surname>Oliver</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. A.</given-names>
            <surname>Raney</surname>
          </string-name>
          ,
          <article-title>Entertainment as Pleasurable and Meaningful: Identifying Hedonic and Eudaimonic Motivations for Entertainment Consumption</article-title>
          ,
          <source>Journal of Communication</source>
          <volume>61</volume>
          (
          <year>2011</year>
          )
          <fpage>984</fpage>
          -
          <lpage>1004</lpage>
          .
          <source>doi:1 0 . 1 1 1 1 / j . 1 4</source>
          <volume>6 0 - 2 4 6 6 . 2 0 1 1 . 0 1 5 8 5</volume>
          . x .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M.</given-names>
            <surname>Tkalčič</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Ferwerda</surname>
          </string-name>
          , Eudaimonic Modeling of Moviegoers,
          <source>in: Proceedings of the 26th Conference on User Modeling, Adaptation and Personalization</source>
          , ACM, New York, NY, USA,
          <year>2018</year>
          , pp.
          <fpage>163</fpage>
          -
          <lpage>167</lpage>
          . URL: http://dl.acm.org/citation.cfm?doid=
          <fpage>3209219</fpage>
          .3209249https: //dl.acm.org/doi/10.1145/3209219.3209249.
          <source>doi:1 0 . 1 1</source>
          <volume>4 5 / 3 2 0 9 2 1 9 . 3 2 0 9 2 4 9 .</volume>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A.</given-names>
            <surname>Delle Fave</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Massimini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Bassi</surname>
          </string-name>
          ,
          <article-title>Hedonism and eudaimonism in positive psychology, in: Psychological selection and optimal experience across cultures</article-title>
          , Springer,
          <year>2011</year>
          , pp.
          <fpage>3</fpage>
          -
          <lpage>18</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Deldjoo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Elahi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Cremonesi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Garzotto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Piazzolla</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Quadrana</surname>
          </string-name>
          ,
          <article-title>Content-based video recommendation system based on stylistic visual features</article-title>
          ,
          <source>Journal on Data Semantics</source>
          <volume>5</volume>
          (
          <year>2016</year>
          )
          <fpage>99</fpage>
          -
          <lpage>113</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Deldjoo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Elahi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Quadrana</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Cremonesi</surname>
          </string-name>
          ,
          <article-title>Using visual features based on mpeg7 and deep learning for movie recommendation</article-title>
          ,
          <source>International journal of multimedia information retrieval 7</source>
          (
          <year>2018</year>
          )
          <fpage>207</fpage>
          -
          <lpage>219</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>