<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Estimation of Character Diagram from Open Movie Database using Markov Logic Network</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Yuta</forename><surname>Ohwatari</surname></persName>
							<email>y-ohwatari@ohsuga.is.uec.ac.jp</email>
							<affiliation key="aff0">
								<orgName type="department">Graduate School of Information Systems</orgName>
								<orgName type="institution">University of Electro-Communications</orgName>
								<address>
									<settlement>Tokyo</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Takahiro</forename><surname>Kawamura</surname></persName>
							<email>kawamura@ohsuga.is.uec.ac.jp</email>
							<affiliation key="aff0">
								<orgName type="department">Graduate School of Information Systems</orgName>
								<orgName type="institution">University of Electro-Communications</orgName>
								<address>
									<settlement>Tokyo</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Yuichi</forename><surname>Sei</surname></persName>
							<email>sei@is.uec.ac.jp</email>
							<affiliation key="aff0">
								<orgName type="department">Graduate School of Information Systems</orgName>
								<orgName type="institution">University of Electro-Communications</orgName>
								<address>
									<settlement>Tokyo</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Yasuyuki</forename><surname>Tahara</surname></persName>
							<email>tahara@is.uec.ac.jp</email>
							<affiliation key="aff0">
								<orgName type="department">Graduate School of Information Systems</orgName>
								<orgName type="institution">University of Electro-Communications</orgName>
								<address>
									<settlement>Tokyo</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Akihiko</forename><surname>Ohsuga</surname></persName>
							<email>ohsuga@uec.ac.jp</email>
							<affiliation key="aff0">
								<orgName type="department">Graduate School of Information Systems</orgName>
								<orgName type="institution">University of Electro-Communications</orgName>
								<address>
									<settlement>Tokyo</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Estimation of Character Diagram from Open Movie Database using Markov Logic Network</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">2EC7D4DEF319AC905EF64A8ADEFD12FD</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-23T23:30+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Markov Logic Network</term>
					<term>Semantic Analysis</term>
					<term>Open Movie Database</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In this paper, we propose the estimation method of interpersonal relationships of characters from movie script databases on the Web using Markov Logic Network. By using Markov Logic Network, we can infer while allowing the violation of rules. In experiments, we confirmed that our proposed method can estimate favors between the characters in a movie with a precision of 69.8%.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Every year, a large number of movies have been released. If a user want to quickly know about a movie, he/she will see the summary of the movie. Therefore, It is effective summarization of a movie is required in order for better understanding of the movie.</p><p>An overview of our proposed method is illustrated in Figure <ref type="figure">1</ref>. Our method is separated into the estimation of interpersonal relationships and the generation of character diagrams.</p><p>First, we prepared script data for learning and inferring by extracting who speak what to whom from a movie database. Then, we estimate the sentiment</p><formula xml:id="formula_0">!"#"$%&amp;'&amp;'(! )*+$+)#"$,!&amp;+($+%! -.,, /"0(0,123-45! 67$,8"+$'&amp;'(! ($79'!,67$%98+! 8"+$'&amp;'(! &amp;'6"$$"!, &amp;'#"$:"$;7'+8,$"8+&lt;7';*&amp;:! :$":$7)";;&amp;'(! 67$,&amp;'6"$$&amp;'(! &amp;'6"$$&amp;'(! 2+$=7&gt;,?7(&amp;),@"#A7$=! ($79'!,67$%98+! B8)*"%C! D;&lt;%+&lt;7',B("'#,76,E*+$+)#"$,-&amp;+($+%!</formula><p>EBFGB1@,H,I";J,;&amp;$K, LM1NOP@,H,/Q0P5,G"88, #*"%,A",A&amp;;*,#7, 47+$!,+#,7')"0, EBFGB1@,H,I";J,;&amp;$0! ?&amp;=";/LM1OP@JEBFGB1@5, R0RRSRTUS, ?&amp;=";/LM1OP@JLM1OP@5, R0RVRRTU, ?&amp;=";/LM1OP@J@MGD5, R0RRTRTUW! Fig. <ref type="figure">1</ref>. Method workflow polarity for lines in the script and the favorable impression between a speaker and a listener in a movie using Markov Logic Network. Finally, we generate the character diagram of a movie from the estimated interpersonal relationships.</p><p>A first-order knowledge base can be seen as a set of hard constraints on the set of possible worlds. However, the solution in the real world is often on the set of impossible worlds. In contrast, Markov Logic Network solves this problem by associating weight that reflects how strong a constraint is with each formulas. Also, it is laborious to construct Markov Networks. Markov Logic Network can be viewed as a template for constructing Markov Networks. Markov Logic Network (MLN) is a probabilistic extension of a finite first-order logic <ref type="bibr" target="#b3">[4]</ref>, which makes up the disadvantages of Markov Networks and a first-order logic.</p><p>Note that we used the learning and inference algorithms provided in the open-source Alchemy<ref type="foot" target="#foot_0">1</ref> as an implementation of the MLN in this paper.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Related Work</head><p>Tanaka et al <ref type="bibr" target="#b6">[7]</ref> presents interpersonal relationships extracted from sentence structures as a summary of a story. We considered that it is effective to present the relationships of characters as a summary. On the other hand, analysis of e-mails <ref type="bibr" target="#b0">[1]</ref> and estimation from co-occurrence of the name <ref type="bibr" target="#b1">[2]</ref> are studies of estimating the relationships of persons in the real world. However, it has not been studied about the estimation of interpersonal relationships of fictional characters.</p><p>There are many studies using MLN, for example, entity resolution <ref type="bibr" target="#b4">[5]</ref>, information extraction <ref type="bibr" target="#b2">[3]</ref>. These studies focus on global constraints, and built a model by using MLN. We also targets a text and extracts infomation on global constraints.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Defined Rules</head><p>We defined rules to estimate interpersonal relationships for MLN. These rules determine the sentiment polarity for lines in the script using sentiment polarity for the word and favor between characters using the sentiment polarity for lines. To use sentiment polarity of words, we incorporated as the Semantic Orientations of Words Dictionary that is built by Takamura et al <ref type="bibr" target="#b5">[6]</ref>. This assigns a real value in the range from -1 to +1 to where the words assigned with values close to -1 are supposed to be negative, and the words assigned with values close to +1 are supposed to be positive. Vocabulary was extracted from WordNet<ref type="foot" target="#foot_1">2</ref> .</p><p>In this paper, we limited to two-valued attribute of positive(+1) and negative(−1). A observed predicate is a predicate with all arguments given by inferring and training. A hidden predicate is a predicate with an argument not given by inferring but given by training. Observed predicates and hidden predicates in this paper are shown in Table <ref type="table" target="#tab_0">1</ref>. We describe some of the logical rules for each script line below. t and l is variable. A constant is enclosed in double quotes. Underscore means an arbitrary value. If (+) gets attached to the front of the variable, it is replaced by all the constants that is deployed from the actual data (grounding).</p><p>W ord(t, l, +w) ∧ W pol(+w, +p) ⇒ Lpol(t, +p) Line(t, +sp, +li) ∧ Lpol(t, "P ") ⇒ Likes(+sp, +li) Line(t, +sp, +li) ∧ Lpol(t, "N ") ⇒ ¬Likes <ref type="bibr">(+sp, +li)</ref> . . .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Experiment on Relation Extraction</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Datasets</head><p>In the experiment, we used movie script data from IMSDb: The Internet Movie Script Database<ref type="foot" target="#foot_2">3</ref> on the Web. The title of movies used in the experiment are Back to the Future <ref type="bibr">(1985)</ref>, Good Will Hunting (1997), Harry Potter And The Sorcerer's Stone <ref type="bibr">(2001)</ref>, The Lord of the Rings The Fellowship of the Ring (2001), and Star Wars Episode I The Phantom Menace <ref type="bibr">(1999)</ref>. The average number of lines and characters are 704.6 and 42.6, respectively.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Setting</head><p>We used a movie for testing, and the remaining 4 movies as training data. We treated as true above the mean value of the probability, because estimation results are expressed in a probability. Note, we ask the person for a description of Likes predicates in the training data that is familiar with the movies and has seen actually.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Result</head><p>The experimental results are shown in Table <ref type="table" target="#tab_1">2</ref>. The training time was about 19 hours in total, and the inferring time was about 3 hours in total. As a result, recall is lower than precision. In addition, Figure <ref type="figure">2</ref> shows an example of the generated character diagram from the estimated interpersonal relationships. In this figure, a node represents a person, an edge represents a relationship. The information with edge shows the estimated probability of the predicate Likes() and the mean of the probability (like or not like). A dashed edge means false estimation. This figure generally represents the interpersonal relationships of Star Wars Episode I The Phantom Menace (1999). </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Conculusion</head><p>In this paper, using MLN on the movie script database, we estimated the sentiment polarity of script lines and the interpersonal relationships of the characters in a movie. In the experiments, we confirmed that our proposed method estimated favors between the characters in a movie with a precision of 69.8%. In the future, we will improve the model to achieve the higher accuracy.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 .</head><label>1</label><figDesc>Observed and hidden predicates</figDesc><table><row><cell></cell><cell>Predicate</cell><cell>Description</cell></row><row><cell>Observed</cell><cell cols="2">Line(text, speaker, listener) speaker speak text to listener</cell></row><row><cell>predicates</cell><cell cols="2">Word(text, position, word) word in text and the position is position</cell></row><row><cell></cell><cell>Wpol(word, pol)</cell><cell>The sentiment polarity of word is pol</cell></row><row><cell>Hidden</cell><cell>Lpol(text, pol)</cell><cell>The sentiment polarity of text is pol</cell></row><row><cell>predicates</cell><cell>Likes(person, person)</cell><cell>Favor</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 .</head><label>2</label><figDesc>Estimated relationships and the number of grounded rules Part of character diagram generated from the estimated relationship</figDesc><table><row><cell></cell><cell cols="2">Precision Recall</cell><cell cols="3">F-measure ground clauses</cell></row><row><cell>mean</cell><cell>69.752</cell><cell>45.332</cell><cell>53.392</cell><cell>46,203</cell></row><row><cell></cell><cell></cell><cell cols="2">4#,56&amp;7#8*!</cell><cell></cell></row><row><cell></cell><cell></cell><cell></cell><cell cols="2">9:9B&lt;9&lt;;C?*%$+A&amp;</cell></row><row><cell></cell><cell cols="2">9:99D9&lt;=2?"@5&amp;*%$+A&amp;</cell><cell></cell><cell></cell></row><row><cell cols="2">9:99;9&lt;=&gt;?"@5&amp;*%$+A!</cell><cell cols="2">9:9BC9&lt;;&lt;?*%$+A! !"#$%"&amp;'$()#*$+,!</cell><cell cols="2">9:9B=9&lt;;B?*%$+A!</cell></row><row><cell cols="2">-#./0&amp;!/%.#*#!</cell><cell></cell><cell cols="2">9:9B&gt;9&lt;;;?*%$+A!</cell><cell>12-3!</cell></row><row><cell>Fig. 2.</cell><cell></cell><cell></cell><cell></cell><cell></cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">http://alchemy.cs.washington.edu/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">http://wordnet.princeton.edu</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">http://www.imsdb.com/</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgements</head><p>This work was supported by JSPS KAKENHI Grant Numbers 24300005, 26330081, 26870201.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Friends and neighbors on the web</title>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">A</forename><surname>Adamic</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Adar</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Social networks</title>
		<imprint>
			<biblScope unit="volume">25</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="211" to="230" />
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Social network extraction from the web information</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Matsuo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Tomobe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Hashida</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Nakajima</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Ishizuka</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Transactions of the Japanese Society for Artificial Intelligence</title>
		<imprint>
			<biblScope unit="volume">20</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="46" to="56" />
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Joint inference in information extraction</title>
		<author>
			<persName><forename type="first">H</forename><surname>Poon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Domingos</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">AAAI</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="page" from="913" to="918" />
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Markov logic networks</title>
		<author>
			<persName><forename type="first">M</forename><surname>Richardson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Domingos</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Machine learning</title>
		<imprint>
			<biblScope unit="volume">62</biblScope>
			<biblScope unit="issue">1-2</biblScope>
			<biblScope unit="page" from="107" to="136" />
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Entity resolution with markov logic</title>
		<author>
			<persName><forename type="first">P</forename><surname>Singla</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Domingos</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ICDM&apos;06. Sixth International Conference on</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2006">2006. 2006</date>
			<biblScope unit="page" from="572" to="582" />
		</imprint>
	</monogr>
	<note>Data Mining</note>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Extracting semantic orientations of words using spin model</title>
		<author>
			<persName><forename type="first">H</forename><surname>Takamura</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Inui</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Okumura</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. the 43rd Annual Meeting on Association for Computational Linguistics</title>
				<meeting>the 43rd Annual Meeting on Association for Computational Linguistics</meeting>
		<imprint>
			<date type="published" when="2005">2005</date>
			<biblScope unit="page" from="133" to="140" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Interactive narrative summarization</title>
		<author>
			<persName><forename type="first">S</forename><surname>Tanaka</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Okabe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Onai</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Workshop on Interactive Systems and Software</title>
				<imprint>
			<date type="published" when="2011">2011</date>
			<biblScope unit="page" from="6" to="7" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
