=Paper= {{Paper |id=Vol-2068/exss5 |storemode=property |title=Explainable Movie Recommendation Systems by using Story-based Similarity |pdfUrl=https://ceur-ws.org/Vol-2068/exss5.pdf |volume=Vol-2068 |authors=O-Joun Lee,Jason J. Jung |dblpUrl=https://dblp.org/rec/conf/iui/LeeJ18 }} ==Explainable Movie Recommendation Systems by using Story-based Similarity== https://ceur-ws.org/Vol-2068/exss5.pdf

Explainable Movie Recommendation Systems by using
Story-based Similarity
O-Joun Lee Jason J. Jung∗
Department of Computer Engineering, Department of Computer Engineering,
Chung-Ang University Chung-Ang University
Dongjak-gu, Seoul, Republic of Korea Dongjak-gu, Seoul, Republic of Korea
concerto34@cau.ac.kr j3ung@cau.ac.kr

ABSTRACT However, the previous studies could not identify what the
The goal of this paper is to provide a story-based explana- users will gain or feel from the recommended items. Most
tion for movie recommendation systems, achieved by a multi- of the studies only took into account the ‘scrutability’ for the
aspect explanation and narrative analysis methods. We explain recommended items. Moreover, content of the items were not
how and why particular movies are similar based on following considered.
two aspects: (i) composition of movie characters and (ii) in-
For recommending the narrative works, in this work, we as-
teractions among the characters. These aspects correspond to
sume that the content of the items directly affect the users’
story-based features of the movies that are extracted from char-
preference. We focus on analyzing and exploiting the major
acter networks (i.e., social networks among the characters).
characteristics of the content (such as drawing styles of comics
By using the story-based features, we can explain the reason
or stories of movies).
why two arbitrary movies are similar or not. We anticipate
that the proposed method could improve the explainability of We have conducted a simple user survey among 97 users of
the recommender systems for movies. ‘webtoon’, which is a novel media distributing comics through
the web. The survey simply consisted of one question, which
ACM Classification Keywords allowed plural responses: “What are criteria that affect your
H.5.m. Information Interfaces and Presentation (e.g. HCI): preferences for webtoons?”. Most of the users wrote two cri-
Miscellaneous; J.5. Computer Applications: Arts and Human- teria: stories (98.96%) and drawing styles (97.93%); 96.90%
ities; I.2.4. Artificial Intelligence: Knowledge Representation of the users selected both the criteria. Also, we interviewed
Formalisms and Methods Lehzin Comics1 , which is one of the major webtoon publishers
in Korea. For a question: “Why you do not use recommender
engines for your platform?”, they answered that users mostly
Author Keywords consume webtoons within a few limited genres and drawing
Explainable Recommender System; Movies; Character styles.
Network; Story Analysis; Computational Narrative.
Based on the results, we have found out that the following
INTRODUCTION
patterns can be used for explaining the recommendations.
Various online services have been employing recommender • stories contained in the narrative works
module to provide users with the most relevant items. How-
ever, with only a list of recommended items, it is difficult for • how the stories are physically described.
the users to understand why such items are selected. The users
should spend additional resources (mostly time or money) for The goal of this study is to improve the explainability of the
identifying whether the recommended items are really prefer- recommender systems by using the similarity among the sto-
able. The problem becomes even worse for recommending ries. Nevertheless, as an ongoing study, we limited our target
narrative works (e.g., movies, TV series, novels, and so on). domain into the movies. Also, using character networks (i.e.,
For example, we have experienced giving up watching TV social networks among the characters), we preserved the ex-
series after a few episodes. pandability of the proposed method for other types of narrative
works.
Hence, explaining the reason on the recommendation has been
regarded as an important research issue. There have been EXPLAINING STORY-BASED SIMILARITY
various studies [7, 2] on building ‘explainable’ recommender
Expected results of the proposed method are similar to what
systems.
Netflix2 is already providing, as displayed in Fig. 1. Netflix
* Corresponding author. recommends sets of movies or TV series with some reasons;
e.g., “Because you watched Madam Secretary.”
© 2018. Copyright for the individual papers remains with the authors. 1 https://www.lezhin.com/ko/
Copying permitted for private and academic purposes. ExSS ’18, March 2 https://www.netflix.com/browse
11, Tokyo, Japan.
In order to compare the compositions of characters, we clas-
sified the characters with two criteria: (i) importance of their
roles and (ii) proximity with a protagonist. In our former
studies, we proved that roles of the characters are easily dis-
tinguishable. We classified the characters’ roles into three
categories (i.e., main characters, minor characters, and ex-
Figure 1: Examples of explanations provided by Netflix. tras) by using their centralities on the character networks [6].
The centralities were estimated by the linear combination of
standard node centrality measurements: the degree centrality,
closeness centrality, and betweenness centrality.
However, we targeted on more detailed explanations than
Netflix’s. Our proposed method focused on providing causal In addition, we distinguished the protagonists who are the
evidence based on the movies’ stories. For example, when we most spotlighted and the antagonists who are secondly focused
recommend “Cinderella” for the users, we can say “Cinderella [6]. We considered the remaining ones as tritagonists. The
is similar with Snow White with focus on relationships among tritagonists were categorized into three sides: friendly, neutral,
characters appearing in the movies.” and hostile ones for the protagonists.
To achieve these goals, this study aimed for following ob- The categorization was conducted by measuring social ties
jectives: (i) discovering story-based features, (ii) estimating among the characters. If there are three characters (cP , cA , and
story-based similarity among the movies, (iii) providing ex- ci ), cP is a protagonist, and cA is an antagonist, we can identify
plainable recommendation on the basis of the story-based which character is closer with ci than the other by comparing
similarity, and (iv) making the proposed method expendable ci ’s social ties for cP and cA . It can be formulated as
to other media.
T (ci , cP ) > T (ci , cA ),


P, if T (ci , cP ) > median T (c j , cP ),

For the former two objectives, we attempted to discover fea- 

∀c j


tures with two aspects: composition of the characters and 
interactions among the characters. Based on the story-based ci ∈ T (ci , cA ) > T (ci , cP ), , (1)
 A, if T (ci , cA ) > median T (c j , cA ),
features, we developed measurements to display the differ- 


 ∀c j
ences between two arbitrary stories. Also, with focus on the 
 N, otherwise.
expandability, data sources of the features were limited within
the character network. where T (c j , ck ) indicates the degree of social tie between c j
and ck , P and A indicate sets of the characters who are friendly
Character Network with the protagonist and the antagonist, respectively, N denotes
Our previous studies [1, 5, 4, 3] have proposed SNA-based a set of the characters who take the neutral positions between
methods for computationally analyzing the movies’ stories. the protagonist and the antagonist, median∀c j T (c j , cP ) and
We modeled the stories by using the character network that median∀c j T (c j , cA ) refer to median values of the social ties
was defined as follows; of all the characters for cP and cA , respectively.
D EFINITION 1 (C HARACTER N ETWORK ). Suppose To measure the degree of social ties, we considered the fre-
that N is the number of characters that appeared in a movie, quency of interactions and the number of paths between target
Cα . When N(Cα ) indicates a character network of Cα , characters. It is formulated as:
N(Cα ) can be described as a matrix ∈ RN×N . It consists |pk |
of N × N components which are social affinities among the T (ci , c j ) = ∑ ∏ f (nl−1 , nl ), (2)
characters where, ai, j is the social affinity of ci for c j when ∀pk ∈Pi, j l=2
Cα is an universal set of characters that appeared in Cα and
ci is an i-th element of Cα . where Pi, j is a set of possible paths between ci and c j , pk
indicates a k-th path in Pi, j , nl denotes a l-th node (character)
In this study, we used frequency of the dialogues among the on pk , |pk | is the number of nodes included in the pk , and
characters as the social affinity among them. The dialogues f (nl−1 , nl ) means a weighting value (interaction frequency)
were extracted from the movies’ scripts that were collected between nl−1 and nl .
from the Internet Movie Script Database (IMSDb) 3 .
By combining the two classification criteria, we categorized
the characters into six groups, as displayed in Fig. 2.
Composition of the Characters
The directors have to compose characters with a focus on rep- The number of characters in each category was represented as
resentation of their stories. In other words, the users might a 2 × 3 matrix. We call this matrix a ‘character composition
recognize the movies’ stories from the composition of char- matrix’. As a naive approach, the difference between two
acters, whether it is intuitively or analytically. Hence, the movies can be estimated by the Frobenius distance among
similarity among movies’ stories is recognizable from the their character composition matrices as:
difference among compositions of characters. DC (Cα , Cβ ) =k Cα − Cβ kF , (3)
3 http://www.imsdb.com/
Importance Interactions among the Characters
Although the protagonists and antagonists interact with most
Main Minor of the characters, the others are mostly bounded in particular
communities. For example, acquaintances of the protagonists
usually interact and appear with the protagonists. If they start
Protagonist Friendly Friendly appearing frequently with the antagonists, there is a possible
Main Minor indication that a conflict or a crisis (e.g., betrayal, convert, kid-
Characters Characters napping, etc.) is likely to happen. In other words, interactions
among the characters’ groups reflect methods for developing
the stories.
Proximity

Neutral

Neutral Neutral Based on this intuition, we compared the stories by using two
Main Minor criteria: (i) frequency: proportion of inter-group interactions
Characters Characters and (ii) aggressiveness: external adjacency of the groups.
To utilize the two metrics, we had to discover the characters’
groups. The groups were built on the basis of P, A, and N that
Antagonist

Hostile Hostile were composed in Sect. 2.2. Procedures for discovering the
Main Minor groups can be summarized as follows.
Characters Characters 1. Subsume extras under P, A, and N with the same method
that is used for the main and minor characters.
Figure 2: The proposed category of the characters.
2. Calculate the internal compactness and the external adja-
cency of each group.

3. If the external adjacency of a particular group is too high,
compared with its internal compactness, partition the group.
where k · kF denotes the Frobenius norm. In here, DC (Cα , Cβ )
has smaller value, as Cα and Cβ have more similar number of 4. Iterate Step. 2 and Step. 3, until the groups have adequate
characters for all the categories. It means that DC is highly quality.
affected by the number of characters. The external adjacency is measured on the basis of the external
Therefore, we normalized the character composition matrix interactions’ frequency and the out-degree of the groups, as:
as:
Cα
∑ d( j, l)
cl ∈Gk
CαNorm = , (4) I (Gk ) = ∑ ai, j × , (7)
|Cα | |Gk |
∀ci ∈Gk ,
∀c j ∈Gk ,
where |Cα | is the number of characters that appeared in Cα . ci ,c j
By comparing the normalized composition matrices, we got
a scale-tolerant distance. Also, the number of characters was
directly compared with other movie’s. It can be formulated as: 1, if a j,l , 0,
d( j, l) = , (8)
0, otherwise.
DCNorm (Cα , Cβ ) =k CαNorm − CβNorm kF , (5)
where Gk denotes a k-th group, |Gk | indicates the number of
max(|Cα |, |Cβ |) characters included in Gk , and d( j, l) refers to an indicator
DCScale (Cα , Cβ ) = . (6) function that indicates existence of interactions between c j and
min(|Cα |, |Cβ |) cl . On the contrary, the internal compactness was estimated
from the internal interactions’ frequency and the in-degree of
Thus, the explanations to the users can be composed by con- the groups.
sidering the (i) proximity and (ii) importance. For example,
‘The Day After Tomorrow (2004)’ and ‘Gravity (2013)’ are ∑ d( j, l)
commonly disaster movies. In these two movies, nature takes cl