<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Estimation of Character Diagram from Open Movie Database using Markov Logic Network</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Yuta Ohwatari</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Takahiro Kawamura</string-name>
          <email>kawamura@ohsuga.is.uec.ac.jp</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yuichi Sei</string-name>
          <email>sei@is.uec.ac.jp</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yasuyuki Tahara</string-name>
          <email>tahara@is.uec.ac.jp</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Akihiko Ohsuga</string-name>
          <email>ohsuga@uec.ac.jp</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Graduate School of Information Systems, University of Electro-Communications</institution>
          ,
          <addr-line>Tokyo</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper, we propose the estimation method of interpersonal relationships of characters from movie script databases on the Web using Markov Logic Network. By using Markov Logic Network, we can infer while allowing the violation of rules. In experiments, we con rmed that our proposed method can estimate favors between the characters in a movie with a precision of 69.8%.</p>
      </abstract>
      <kwd-group>
        <kwd>Markov Logic Network</kwd>
        <kwd>Semantic Analysis</kwd>
        <kwd>Open Movie Database</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Every year, a large number of movies have been released. If a user want to quickly
know about a movie, he/she will see the summary of the movie. Therefore, It is
effective summarization of a movie is required in order for better understanding
of the movie.</p>
      <p>An overview of our proposed method is illustrated in Figure 1. Our method
is separated into the estimation of interpersonal relationships and the generation
of character diagrams.</p>
      <p>First, we prepared script data for learning and inferring by extracting who
speak what to whom from a movie database. Then, we estimate the sentiment
($79'!,67$%98+!
8"+$'&amp;'(!</p>
      <p>&amp;'6"$$&amp;'(!
2+$=7&gt;,?7(&amp;),@"#A7$=!</p>
      <p>B8)*"%C!</p>
      <p>!"#"$%&amp;'&amp;'(!
)*+$+)#"$,!&amp;+($+%!
D;&lt;%+&lt;7',B("'#,76,E*+$+)#"$,-&amp;+($+%!
polarity for lines in the script and the favorable impression between a speaker
and a listener in a movie using Markov Logic Network. Finally, we generate the
character diagram of a movie from the estimated interpersonal relationships.</p>
      <p>
        A rst-order knowledge base can be seen as a set of hard constraints on the
set of possible worlds. However, the solution in the real world is often on the set
of impossible worlds. In contrast, Markov Logic Network solves this problem by
associating weight that re ects how strong a constraint is with each formulas.
Also, it is laborious to construct Markov Networks. Markov Logic Network can be
viewed as a template for constructing Markov Networks. Markov Logic Network
(MLN) is a probabilistic extension of a nite rst-order logic[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], which makes up
the disadvantages of Markov Networks and a rst-order logic.
      </p>
      <p>Note that we used the learning and inference algorithms provided in the
open-source Alchemy 1 as an implementation of the MLN in this paper.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        Tanaka et al[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] presents interpersonal relationships extracted from sentence
structures as a summary of a story. We considered that it is effective to present the
relationships of characters as a summary. On the other hand, analysis of e-mails[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
and estimation from co-occurrence of the name[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] are studies of estimating the
relationships of persons in the real world. However, it has not been studied about
the estimation of interpersonal relationships of ctional characters.
      </p>
      <p>
        There are many studies using MLN, for example, entity resolution[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ],
information extraction[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. These studies focus on global constraints, and built a
model by using MLN. We also targets a text and extracts infomation on global
constraints.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>De ned Rules</title>
      <p>
        We de ned rules to estimate interpersonal relationships for MLN. These rules
determine the sentiment polarity for lines in the script using sentiment polarity
for the word and favor between characters using the sentiment polarity for lines.
To use sentiment polarity of words, we incorporated as the Semantic Orientations
of Words Dictionary that is built by Takamura et al[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. This assigns a real value
in the range from -1 to +1 to where the words assigned with values close to -1
are supposed to be negative, and the words assigned with values close to +1 are
supposed to be positive. Vocabulary was extracted from WordNet2.
      </p>
      <p>In this paper, we limited to two-valued attribute of positive(+1) and
negative( 1). A observed predicate is a predicate with all arguments given by
inferring and training. A hidden predicate is a predicate with an argument not
given by inferring but given by training. Observed predicates and hidden
predicates in this paper are shown in Table 1.</p>
      <sec id="sec-3-1">
        <title>1 http://alchemy.cs.washington.edu/ 2 http://wordnet.princeton.edu</title>
        <p>Line(text, speaker, listener) speaker speak text to listener
Word(text, position, word) word in text and the position is position
Wpol(word, pol) The sentiment polarity of word is pol
Lpol(text, pol)
Likes(person, person)</p>
        <p>The sentiment polarity of text is pol</p>
        <p>Favor</p>
        <p>We describe some of the logical rules for each script line below. t and l is
variable. A constant is enclosed in double quotes. Underscore means an arbitrary
value. If (+) gets attached to the front of the variable, it is replaced by all the
constants that is deployed from the actual data (grounding).</p>
        <p>W ord(t; l; +w) ^ W pol(+w; +p) ) Lpol(t; +p)
Line(t; +sp; +li) ^ Lpol(t; "P ") ) Likes(+sp; +li)
Line(t; +sp; +li) ^ Lpol(t; "N ") ) :Likes(+sp; +li)
.
.
.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Experiment on Relation Extraction</title>
      <sec id="sec-4-1">
        <title>Datasets</title>
        <p>In the experiment, we used movie script data from IMSDb: The Internet
Movie Script Database3 on the Web. The title of movies used in the experiment
are Back to the Future (1985), Good Will Hunting (1997), Harry Potter And
The Sorcerer's Stone (2001), The Lord of the Rings The Fellowship of the Ring
(2001), and Star Wars Episode I The Phantom Menace (1999). The average
number of lines and characters are 704.6 and 42.6, respectively.</p>
      </sec>
      <sec id="sec-4-2">
        <title>Setting</title>
      </sec>
      <sec id="sec-4-3">
        <title>Result</title>
        <p>We used a movie for testing, and the remaining 4 movies as training data.
We treated as true above the mean value of the probability, because estimation
results are expressed in a probability. Note, we ask the person for a description
of Likes predicates in the training data that is familiar with the movies and has
seen actually.</p>
        <p>The experimental results are shown in Table 2. The training time was about
19 hours in total, and the inferring time was about 3 hours in total. As a result,
recall is lower than precision. In addition, Figure 2 shows an example of the
generated character diagram from the estimated interpersonal relationships. In
this gure, a node represents a person, an edge represents a relationship. The</p>
        <sec id="sec-4-3-1">
          <title>3 http://www.imsdb.com/</title>
          <p>information with edge shows the estimated probability of the predicate Likes()
and the mean of the probability (like or not like). A dashed edge means false
estimation. This gure generally represents the interpersonal relationships of
Star Wars Episode I The Phantom Menace (1999).
mean
-#./0&amp;!/%.#*#!
!"#$%"&amp;'$()#*$+,!</p>
          <p>9:9B=9&lt;;B?*%$+A!
9:9BC9&lt;;&lt;?*%$+A!
9:9B&gt;9&lt;;;?*%$+A!
12-3!</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conculusion</title>
      <p>In this paper, using MLN on the movie script database, we estimated the
sentiment polarity of script lines and the interpersonal relationships of the characters
in a movie. In the experiments, we con rmed that our proposed method
estimated favors between the characters in a movie with a precision of 69.8%. In
the future, we will improve the model to achieve the higher accuracy.</p>
      <sec id="sec-5-1">
        <title>Acknowledgements</title>
        <p>This work was supported by JSPS KAKENHI Grant Numbers 24300005,
26330081, 26870201.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Adamic</surname>
            ,
            <given-names>L.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Adar</surname>
          </string-name>
          , E.:
          <article-title>Friends and neighbors on the web</article-title>
          .
          <source>Social networks 25(3)</source>
          ,
          <volume>211</volume>
          {
          <fpage>230</fpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Matsuo</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tomobe</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hashida</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nakajima</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ishizuka</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Social network extraction from the web information</article-title>
          .
          <source>Transactions of the Japanese Society for Arti cial Intelligence</source>
          <volume>20</volume>
          (
          <issue>1</issue>
          ),
          <volume>46</volume>
          {
          <fpage>56</fpage>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Poon</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Domingos</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Joint inference in information extraction</article-title>
          .
          <source>In: AAAI</source>
          . vol.
          <volume>7</volume>
          , pp.
          <volume>913</volume>
          {
          <issue>918</issue>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Richardson</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Domingos</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Markov logic networks</article-title>
          .
          <source>Machine learning 62(1-2)</source>
          ,
          <volume>107</volume>
          {
          <fpage>136</fpage>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Singla</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Domingos</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Entity resolution with markov logic</article-title>
          .
          <source>In: Data Mining</source>
          ,
          <year>2006</year>
          . ICDM'
          <fpage>06</fpage>
          . Sixth International Conference on. pp.
          <volume>572</volume>
          {
          <fpage>582</fpage>
          .
          <string-name>
            <surname>IEEE</surname>
          </string-name>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Takamura</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Inui</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Okumura</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Extracting semantic orientations of words using spin model</article-title>
          .
          <source>In: Proc. the 43rd Annual Meeting on Association for Computational Linguistics</source>
          . pp.
          <volume>133</volume>
          {
          <fpage>140</fpage>
          .
          <article-title>Association for Computational Linguistics (</article-title>
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Tanaka</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Okabe</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Onai</surname>
          </string-name>
          , R.:
          <article-title>Interactive narrative summarization</article-title>
          .
          <source>Workshop on Interactive Systems and Software</source>
          pp.
          <volume>06</volume>
          {
          <issue>01</issue>
          {
          <issue>06</issue>
          {
          <issue>6</issue>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>