=Paper= {{Paper |id=Vol-2068/exss5 |storemode=property |title=Explainable Movie Recommendation Systems by using Story-based Similarity |pdfUrl=https://ceur-ws.org/Vol-2068/exss5.pdf |volume=Vol-2068 |authors=O-Joun Lee,Jason J. Jung |dblpUrl=https://dblp.org/rec/conf/iui/LeeJ18 }} ==Explainable Movie Recommendation Systems by using Story-based Similarity== https://ceur-ws.org/Vol-2068/exss5.pdf
     Explainable Movie Recommendation Systems by using
                    Story-based Similarity
                      O-Joun Lee                                                        Jason J. Jung∗
          Department of Computer Engineering,                                 Department of Computer Engineering,
                 Chung-Ang University                                                Chung-Ang University
          Dongjak-gu, Seoul, Republic of Korea                                Dongjak-gu, Seoul, Republic of Korea
                 concerto34@cau.ac.kr                                                  j3ung@cau.ac.kr


ABSTRACT                                                                However, the previous studies could not identify what the
The goal of this paper is to provide a story-based explana-             users will gain or feel from the recommended items. Most
tion for movie recommendation systems, achieved by a multi-             of the studies only took into account the ‘scrutability’ for the
aspect explanation and narrative analysis methods. We explain           recommended items. Moreover, content of the items were not
how and why particular movies are similar based on following            considered.
two aspects: (i) composition of movie characters and (ii) in-
                                                                        For recommending the narrative works, in this work, we as-
teractions among the characters. These aspects correspond to
                                                                        sume that the content of the items directly affect the users’
story-based features of the movies that are extracted from char-
                                                                        preference. We focus on analyzing and exploiting the major
acter networks (i.e., social networks among the characters).
                                                                        characteristics of the content (such as drawing styles of comics
By using the story-based features, we can explain the reason
                                                                        or stories of movies).
why two arbitrary movies are similar or not. We anticipate
that the proposed method could improve the explainability of            We have conducted a simple user survey among 97 users of
the recommender systems for movies.                                     ‘webtoon’, which is a novel media distributing comics through
                                                                         the web. The survey simply consisted of one question, which
ACM Classification Keywords                                              allowed plural responses: “What are criteria that affect your
H.5.m. Information Interfaces and Presentation (e.g. HCI):               preferences for webtoons?”. Most of the users wrote two cri-
Miscellaneous; J.5. Computer Applications: Arts and Human-               teria: stories (98.96%) and drawing styles (97.93%); 96.90%
ities; I.2.4. Artificial Intelligence: Knowledge Representation          of the users selected both the criteria. Also, we interviewed
Formalisms and Methods                                                   Lehzin Comics1 , which is one of the major webtoon publishers
                                                                         in Korea. For a question: “Why you do not use recommender
                                                                         engines for your platform?”, they answered that users mostly
Author Keywords                                                          consume webtoons within a few limited genres and drawing
Explainable Recommender System; Movies; Character                        styles.
Network; Story Analysis; Computational Narrative.
                                                                        Based on the results, we have found out that the following
INTRODUCTION
                                                                        patterns can be used for explaining the recommendations.
Various online services have been employing recommender                 • stories contained in the narrative works
module to provide users with the most relevant items. How-
ever, with only a list of recommended items, it is difficult for        • how the stories are physically described.
the users to understand why such items are selected. The users
should spend additional resources (mostly time or money) for            The goal of this study is to improve the explainability of the
identifying whether the recommended items are really prefer-            recommender systems by using the similarity among the sto-
able. The problem becomes even worse for recommending                   ries. Nevertheless, as an ongoing study, we limited our target
narrative works (e.g., movies, TV series, novels, and so on).           domain into the movies. Also, using character networks (i.e.,
For example, we have experienced giving up watching TV                  social networks among the characters), we preserved the ex-
series after a few episodes.                                            pandability of the proposed method for other types of narrative
                                                                        works.
Hence, explaining the reason on the recommendation has been
regarded as an important research issue. There have been                EXPLAINING STORY-BASED SIMILARITY
various studies [7, 2] on building ‘explainable’ recommender
                                                                        Expected results of the proposed method are similar to what
systems.
                                                                        Netflix2 is already providing, as displayed in Fig. 1. Netflix
* Corresponding author.                                                 recommends sets of movies or TV series with some reasons;
                                                                        e.g., “Because you watched Madam Secretary.”
© 2018. Copyright for the individual papers remains with the authors.   1 https://www.lezhin.com/ko/
Copying permitted for private and academic purposes. ExSS ’18, March    2 https://www.netflix.com/browse
11, Tokyo, Japan.
                                                                    In order to compare the compositions of characters, we clas-
                                                                    sified the characters with two criteria: (i) importance of their
                                                                    roles and (ii) proximity with a protagonist. In our former
                                                                    studies, we proved that roles of the characters are easily dis-
                                                                    tinguishable. We classified the characters’ roles into three
                                                                    categories (i.e., main characters, minor characters, and ex-
  Figure 1: Examples of explanations provided by Netflix.           tras) by using their centralities on the character networks [6].
                                                                    The centralities were estimated by the linear combination of
                                                                    standard node centrality measurements: the degree centrality,
                                                                    closeness centrality, and betweenness centrality.
However, we targeted on more detailed explanations than
Netflix’s. Our proposed method focused on providing causal          In addition, we distinguished the protagonists who are the
evidence based on the movies’ stories. For example, when we         most spotlighted and the antagonists who are secondly focused
recommend “Cinderella” for the users, we can say “Cinderella        [6]. We considered the remaining ones as tritagonists. The
is similar with Snow White with focus on relationships among        tritagonists were categorized into three sides: friendly, neutral,
characters appearing in the movies.”                                and hostile ones for the protagonists.
To achieve these goals, this study aimed for following ob-          The categorization was conducted by measuring social ties
jectives: (i) discovering story-based features, (ii) estimating     among the characters. If there are three characters (cP , cA , and
story-based similarity among the movies, (iii) providing ex-        ci ), cP is a protagonist, and cA is an antagonist, we can identify
plainable recommendation on the basis of the story-based            which character is closer with ci than the other by comparing
similarity, and (iv) making the proposed method expendable          ci ’s social ties for cP and cA . It can be formulated as
to other media.
                                                                                                 T (ci , cP ) > T (ci , cA ),
                                                                                   
                                                                                   
                                                                                      P, if T (ci , cP ) > median T (c j , cP ),
                                                                                   
For the former two objectives, we attempted to discover fea-                       
                                                                                   
                                                                                                              ∀c j
                                                                                   
                                                                                   
tures with two aspects: composition of the characters and                          
interactions among the characters. Based on the story-based                   ci ∈               T (ci , cA ) > T (ci , cP ),    ,  (1)
                                                                                     A, if T (ci , cA ) > median T (c j , cA ),
features, we developed measurements to display the differ-                         
                                                                                   
                                                                                   
                                                                                                             ∀c j
ences between two arbitrary stories. Also, with focus on the                       
                                                                                     N, otherwise.
expandability, data sources of the features were limited within
the character network.                                              where T (c j , ck ) indicates the degree of social tie between c j
                                                                    and ck , P and A indicate sets of the characters who are friendly
Character Network                                                   with the protagonist and the antagonist, respectively, N denotes
Our previous studies [1, 5, 4, 3] have proposed SNA-based           a set of the characters who take the neutral positions between
methods for computationally analyzing the movies’ stories.          the protagonist and the antagonist, median∀c j T (c j , cP ) and
We modeled the stories by using the character network that          median∀c j T (c j , cA ) refer to median values of the social ties
was defined as follows;                                             of all the characters for cP and cA , respectively.
   D EFINITION 1 (C HARACTER N ETWORK ). Suppose                    To measure the degree of social ties, we considered the fre-
that N is the number of characters that appeared in a movie,        quency of interactions and the number of paths between target
Cα . When N(Cα ) indicates a character network of Cα ,              characters. It is formulated as:
N(Cα ) can be described as a matrix ∈ RN×N . It consists                                                   |pk |
of N × N components which are social affinities among the                       T (ci , c j ) =     ∑ ∏ f (nl−1 , nl ),            (2)
characters where, ai, j is the social affinity of ci for c j when                                 ∀pk ∈Pi, j l=2
Cα is an universal set of characters that appeared in Cα and
ci is an i-th element of Cα .                                       where Pi, j is a set of possible paths between ci and c j , pk
                                                                    indicates a k-th path in Pi, j , nl denotes a l-th node (character)
In this study, we used frequency of the dialogues among the         on pk , |pk | is the number of nodes included in the pk , and
characters as the social affinity among them. The dialogues          f (nl−1 , nl ) means a weighting value (interaction frequency)
were extracted from the movies’ scripts that were collected         between nl−1 and nl .
from the Internet Movie Script Database (IMSDb) 3 .
                                                                    By combining the two classification criteria, we categorized
                                                                    the characters into six groups, as displayed in Fig. 2.
Composition of the Characters
The directors have to compose characters with a focus on rep-       The number of characters in each category was represented as
resentation of their stories. In other words, the users might       a 2 × 3 matrix. We call this matrix a ‘character composition
recognize the movies’ stories from the composition of char-         matrix’. As a naive approach, the difference between two
acters, whether it is intuitively or analytically. Hence, the       movies can be estimated by the Frobenius distance among
similarity among movies’ stories is recognizable from the           their character composition matrices as:
difference among compositions of characters.                                         DC (Cα , Cβ ) =k Cα − Cβ kF ,                 (3)
3 http://www.imsdb.com/
                                           Importance                Interactions among the Characters
                                                                     Although the protagonists and antagonists interact with most
                                       Main         Minor            of the characters, the others are mostly bounded in particular
                                                                     communities. For example, acquaintances of the protagonists
                                                                     usually interact and appear with the protagonists. If they start
                       Protagonist    Friendly     Friendly          appearing frequently with the antagonists, there is a possible
                                       Main         Minor            indication that a conflict or a crisis (e.g., betrayal, convert, kid-
                                     Characters   Characters         napping, etc.) is likely to happen. In other words, interactions
                                                                     among the characters’ groups reflect methods for developing
                                                                     the stories.
           Proximity

                        Neutral




                                      Neutral      Neutral           Based on this intuition, we compared the stories by using two
                                       Main        Minor             criteria: (i) frequency: proportion of inter-group interactions
                                     Characters   Characters         and (ii) aggressiveness: external adjacency of the groups.
                                                                     To utilize the two metrics, we had to discover the characters’
                                                                     groups. The groups were built on the basis of P, A, and N that
                       Antagonist




                                      Hostile      Hostile           were composed in Sect. 2.2. Procedures for discovering the
                                       Main        Minor             groups can be summarized as follows.
                                     Characters   Characters         1. Subsume extras under P, A, and N with the same method
                                                                        that is used for the main and minor characters.
     Figure 2: The proposed category of the characters.
                                                                     2. Calculate the internal compactness and the external adja-
                                                                        cency of each group.

                                                                     3. If the external adjacency of a particular group is too high,
                                                                        compared with its internal compactness, partition the group.
where k · kF denotes the Frobenius norm. In here, DC (Cα , Cβ )
has smaller value, as Cα and Cβ have more similar number of          4. Iterate Step. 2 and Step. 3, until the groups have adequate
characters for all the categories. It means that DC is highly           quality.
affected by the number of characters.                                The external adjacency is measured on the basis of the external
Therefore, we normalized the character composition matrix            interactions’ frequency and the out-degree of the groups, as:
as:
                                Cα
                                                                                                                  ∑ d( j, l)
                                                                                                                 cl ∈Gk
                     CαNorm =        ,                (4)                          I (Gk ) =      ∑ ai, j ×                     ,     (7)
                               |Cα |                                                                                    |Gk |
                                                                                               ∀ci ∈Gk ,
                                                                                               ∀c j ∈Gk ,
where |Cα | is the number of characters that appeared in Cα .                                   ci ,c j
By comparing the normalized composition matrices, we got
a scale-tolerant distance. Also, the number of characters was                                       
directly compared with other movie’s. It can be formulated as:                                          1,   if a j,l , 0,
                                                                                       d( j, l) =                          ,          (8)
                                                                                                        0,   otherwise.
           DCNorm (Cα , Cβ ) =k CαNorm − CβNorm kF ,           (5)
                                                                     where Gk denotes a k-th group, |Gk | indicates the number of
                                      max(|Cα |, |Cβ |)              characters included in Gk , and d( j, l) refers to an indicator
                 DCScale (Cα , Cβ ) =                   .      (6)   function that indicates existence of interactions between c j and
                                      min(|Cα |, |Cβ |)              cl . On the contrary, the internal compactness was estimated
                                                                     from the internal interactions’ frequency and the in-degree of
Thus, the explanations to the users can be composed by con-          the groups.
sidering the (i) proximity and (ii) importance. For example,
‘The Day After Tomorrow (2004)’ and ‘Gravity (2013)’ are                                                         ∑ d( j, l)
commonly disaster movies. In these two movies, nature takes                                                     cl