<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">ScANT: A Small Corpus of Scene-Annotated Narrative Texts[resource papers]</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Tarfah</forename><surname>Alrashid</surname></persName>
							<email>ttalrashid1@sheffield.ac.uk</email>
							<affiliation key="aff0">
								<orgName type="institution">University of Sheffield</orgName>
								<address>
									<settlement>Sheffield</settlement>
									<country key="GB">UK</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="institution">University of Jeddah</orgName>
								<address>
									<settlement>Jeddah</settlement>
									<country key="SA">Saudi Arabia</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Robert</forename><surname>Gaizauskas</surname></persName>
							<email>r.gaizauskas@sheffield.ac.uk</email>
							<affiliation key="aff1">
								<orgName type="institution">University of Jeddah</orgName>
								<address>
									<settlement>Jeddah</settlement>
									<country key="SA">Saudi Arabia</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">ScANT: A Small Corpus of Scene-Annotated Narrative Texts[resource papers]</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">C845C3DC5F2A819F47CFEA9AEBB6ADB6</idno>
					<idno type="DOI">10.15131/shef.data.21517908</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-04-29T06:29+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>SceneML</term>
					<term>narrative text</term>
					<term>scenes</term>
					<term>text segmentation</term>
					<term>corpus</term>
					<term>dataset</term>
					<term>annotation</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>We present the first publicly available dataset of English narrative texts annotated in compliance with SceneML, a framework for annotating scenes in narrative text. The dataset is composed of selected chapters from six narrative texts -two children's stories and four novels from Project Gutenberg. We give a brief overview of SceneML, describe the corpus sources and the annotation process and provide details of the resulting annotations and inter-annotator agreement.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction and Related Work</head><p>Narrative, or storytelling, is a fundamental mode of human discourse, found across all cultures and all times, and in many different forms, including writing (both fiction and non-fiction), spoken storytelling, film, video games, and so on <ref type="bibr" target="#b0">[1]</ref>. A basic structural unit of narrative is the scene, "a unit of a story in which the elements of time, location, and main characters are constant" <ref type="bibr" target="#b1">[2]</ref>. Narratives tend to progress as a sequence of scenes, though of course the sequence of scenes in a narration need not be the same as the temporal sequence of the narrated events in the storyworld <ref type="bibr" target="#b2">[3]</ref> the narrative is describing. Furthermore, one scene may be expressed in multiple non-contiguous text segments in the narration. Additionally, what are generally deemed narrative texts may include various non-narrative elements, e.g. authorial comment. Thus, the task of identifying those chunks in a narrative text which correspond to scenes in the storyworld and temporally ordering these chunks is a non-trivial challenge. It is an important challenge both for the insights it gives us into the structure of narratives and for possible applications, which include automatic story illustration, aligning books and movies, automatic generation of image descriptions and automatic generation of narratives.</p><p>In previous work we have introduced SceneML as a framework for annotating scenes in narrative text <ref type="bibr" target="#b3">[4]</ref> and discussed issues arising in a pilot annotation exercise which focussed on the scene identification task <ref type="bibr" target="#b1">[2]</ref>. In this paper we present ScANT, the first publicly available dataset of English narrative texts annotated in compliance with SceneML. While the corpus is small -just 14 chapters from 6 narrative sources -our hope is that the wider community will find this useful both for converging on annotation standards for scene identification and for initial training and testing of automatic scene identification algorithms. The corpus and annotation guidelines are available at https://doi.org/10.15131/shef.data.21517908 and are made available under the CC By-NC 4.0 licence <ref type="foot" target="#foot_0">1</ref> .</p><p>There has been a growing interest in computational analysis of narrative. Ranade et al. <ref type="bibr" target="#b4">[5]</ref> provide a thorough overview of recent work on computational understanding of narrative and Santana et al. <ref type="bibr" target="#b5">[6]</ref> provide an extended survey on narrative extraction from textual data. However, neither of these addresses the issue of identifying scenes in narrative text. The only other work on annotation of scenes in narrative texts of which we are aware is that of Zehe et al. <ref type="bibr" target="#b6">[7]</ref>. Their work differs from ours in several respects. First, their definition of scene states that a scene is a segment in a narrative in which the time, place and characters remain constant and which centres around one action. This contrasts with our definition that does not take into account the actions in a scene and allows multiple actions to happen in one scene (see Gaizaukas and Alrashid <ref type="bibr" target="#b3">[4]</ref> for discussion around our choice of definition). Secondly, their scheme is less comprehensive -it does not define narrative progression links between scenes or scene transition segments, and only distinguishes scene and non-scene segments. Thirdly, they follow a container principle (small places make up larger places) to detect a change in place, e.g. if the action of characters moves from a corridor to dining room that will not indicate a change in place as they both part of a hotel, where as our definition counts these as two different places. Finally, they work on German texts while we are working on English texts.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Methods and Resources</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">SceneML</head><p>SceneML is an evolving framework for annotating scenes in narrative text. The latest specification and annotation guidelines are available along with the corpus at the DOI referenced above. Here we summarise the core concepts in SceneML.</p><p>A scene is defined as a unit of narrative in which the time, location and principal characters are constant and in which specific events which constitute the narrative are recounted. Any change in time, location or characters indicates a change in the scene. A scene is realised in text (for written forms of narrative) through one or more scene description segments (SDSs). The SDS mechanism allows for the relation of one scene in a narrative to be embedded within another, as for example, in flashback or flashforward. The task of scene identification thus becomes the task of identifying the boundaries of SDSs, and linking SDSs for the same scene together<ref type="foot" target="#foot_1">2</ref> . SceneML also specifies a set of four narrative progression relations (sequence, analepsis, prolepsis and concurrence) that are used to capture the temporal relations between scenes.</p><p>Typically, not all text in a narrative is part of a scene description. Some passages describe not one scene or another but rather the transition between scenes. For example, in Conan Doyle's The Man With The Twisted Lip the first scene takes place in Watson's house and the second in the East End of London, where Watson goes to seek a missing man. Between the two we have the short passage: "And so in ten minutes I had left my armchair and cheery sitting-room behind me, and was speeding eastward in a hansom on a strange errand...". Such elements SceneML refers to as scene transition segments (STSs). Other sorts of non-scene elements are also present in narrative. These include general philosophising or opinion segments, background information segments, and narrative summary or narrative catchup. These passages serve a variety of functions but do not relate specific, situated events involving protagonists in the story. All such passages SceneML designates as non-scene elements.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Corpus Sources</head><p>The dataset is composed of selected chapters from children's stories and from out-of-copyright adult novels. The former were hypothesised as likely to have a simpler narrative structure and hence to be a good place to trial our approach; the latter as likely to possess more complex narrative structure and hence pose a more challenging test to our approach. The sources are:</p><p>(1) Bunnies from the Future, a middle grade children's story by Joe Corcoran <ref type="foot" target="#foot_2">3</ref> . The author has personally granted permission for us to release annotated chapters of this work. ( <ref type="formula">2</ref>) The Wonderful Wizard of Oz, originally released as part of the Brown Corpus <ref type="foot" target="#foot_3">4</ref> and free for noncommercial purposes. (3) Pride and Prejudice, A Tale of Two Cities, The Adventures of Sherlock Holmes and The Great Gatsby from Project Gutenberg<ref type="foot" target="#foot_4">5</ref> . These are out of copyright in the US and UK and freely re-distributable subject to Project Gutenberg's terms and conditions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">The Annotation Process</head><p>In an earlier pilot study <ref type="bibr" target="#b1">[2]</ref> we investigated how well-defined the SceneML definitions and annotation framework were with respect to scene boundary identification. Analysis of the annotations in that study revealed several causes of observed disagreement: (1) lack of understanding of the guidelines and task, (2) lack of clarity or specificity in the guidelines, (3) failure of non-native English speakers to fully grasp the meaning of certain expressions (e.g. idioms). We have addressed these issues in the construction of ScANT through the following steps: (1) A more thorough training process that included both an initial training session with a presentation, demonstration and hands-on exercise for the trainees, plus a follow-on take-away exercise that was scored against gold-standard annotations produced by the authors and then discussed with the trainees, (2) Improvement of the initial guidelines to remove sources of confusion revealed in the earlier pilot, (3) recruitment of native English-speaking annotators with sensitivity to text analysis (two PhD students, one in English Literature and one in Computational Linguistics).</p><p>The annotation process was carried out through a web-based interface to a local instance of Brat Annotation Tool <ref type="foot" target="#foot_5">6</ref> . Annotators used swipe and click operations to annotate SDSs and STs. Multiple SDSs that are part of the same scene were linked using the Brat relation annotation tool to signal that a same-scene-as relation holds between them. The annotated data is stored and made available in BRAT standoff annotation format <ref type="foot" target="#foot_6">7</ref> .</p><p>The corpus consists of fourteen chapters from six different narrative sources <ref type="foot" target="#foot_7">8</ref> . In each chapter SDSs, STs and same-scene-as relations were annotated by two annotators and saved in a separate text file. Both annotators' annotations are supplied with the corpus. Further annotations together with a consensus annotation may be made available in the future.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">The ScANT Corpus</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">Corpus Statistics</head><p>Table <ref type="table" target="#tab_0">1</ref> shows summary statistics for the ScANT corpus and associated annotations. Note that the relation between scenes and SDSs is largely one-to-one. With one exception this is always true for the children's stories, while there is somewhat more variation, suggesting more complex narrative form, in the adult novels.</p><p>The variation between annotators A1 and A2 is relatively small in terms of SDSs and Scenes. However, they are far apart regarding both scene transition segments and non-scene segments. We discuss this further below in Section 4.3. First, however, we examine inter-annotator agreement regarding SDSs in more detail. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">Inter-annotator Agreement</head><p>Table <ref type="table" target="#tab_1">2</ref> shows inter-annotator agreement results for SDSs using Cohen's Kappa <ref type="bibr" target="#b7">[8]</ref>. To calculate Kappa, each sentence is given a tag, 1 for sentences on the boundary of an SDS (either beginning or end) and 0 otherwise. Boundaries of STs are ignored as it is clear the two annotators' understanding of the task is so different that precise quantitative analysis is not merited. Aside from calculating only exact matches as agreement (𝑁 = 0 in Table <ref type="table" target="#tab_1">2</ref> ) we also investigated a more lenient approach to calculate the agreement in which annotators are deemed to agree if they place a sentence boundary within N sentences. This was prompted by the observation that in many cases annotators seemed to be placing SDS boundaries relatively close to each other, but not exactly in the same place. Kappa scores have been calculated for various N sizes: 𝑁 = 30% of the median SDS sentence length in each chapter, N=1, N=3 and N=5. We have omitted one chapter from Table <ref type="table" target="#tab_1">2</ref> -A Tale of Two Cities, chapter 1, because one annotator believed it contained nothing but non-scene segments, while the other thought it contained 5 scenes. This gave a kappa score of 0, which skewed the rest of the results.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.3.">Discussion</head><p>Regarding differences between our annotators, it is clear that the two annotators have a different conception of what STs and NSSs are. This is probably due to the fact that the children's stories which we used as training materials contain very few of either of these, particulary of NSSs (in fact this appears to be an interesting difference between children's and adult narrative).</p><p>Any future annotation effort should ensure that these concepts are more clearly understood by annotators. On examination our view is that A1 has followed the guidelines much more closely regarding STs and NSSs and therefore, if one is to train or test a classifier on these materials, our recommendation would be to use the A1 annotations only. However recent work such as that reported in Uma et al. <ref type="bibr" target="#b8">[9]</ref> highlights the potential value of learning from disagreement, so we have included both sets of annotations in the corpus.</p><p>Concerning the kappa scores for SDS agreement, they fall in the range that has been interpreted as "fair" or "fair to good". However, kappa scores are known to be lower when there are fewer labels (just two in our case) and where the labels are not equiprobably occurring (also true in our case, since 0 labels are much more frequent than 1's) so results should be viewed in this light <ref type="bibr" target="#b9">[10]</ref>. Percentage agreement scores for SDS annotations are around 90%. Note that kappa scores rise significantly if we are prepared to allow some leniency in terms of non-exact matching. How legitimate this is needs further examination to determine whether the improvement is a reflection of genuine uncertainty about the precise boundary between what the annotators clearly agree are distinct scenes or whether it is the result of conflating separate scenes, according to the different annotators' perceptions.</p><p>While the corpus is too small to start making generalisations about stylistic differences between different authors, it is worth noting that the amount of non-scene content in the adult novels (6.21% of the total sentences if we accept A1's NSS annotations, which we believe are more accurate) is vastly greater than that in the children's stories (0.14%), suggesting that much beyond simple event narration goes on in adult fiction.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Conclusion and Future Work</head><p>We have presented the first dataset of English narrative texts annotated in compliance with SceneML. The dataset consists of fourteen chapters of novels and children's stories annotated for scene description segments and scene transitions segments as defined in SceneML. A total of almost 200 scenes have been annotated.</p><p>Future work plans include various activities. These include: 1. gathering further annotations for the ScANT source texts to increase robustness of the annotations and guidelines; 2. extending the annotation to include SceneML narrative progression links; 3. training a model on the corpus to investigate automating the task of scene boundary detection and to ascertain the sufficiency of ScANT for this task; 4. adding other text types to the annotated dataset, such as biography, plays and film scripts; 5. expanding the dataset to include texts in other languages. 6. exploring whether there is interest in a shared task challenge on scene boundary detection.</p><p>We hope the community finds ScANT of use and welcome comment on our work.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Summary Statistics for the ScANT corpus, showing for each annotated text the count of sentences and words and of SDSs, STs, Scenes and Non-scene sentences (NSSs) for each annotator (A1 and A2).</figDesc><table><row><cell>Text</cell><cell cols="2">Sents Words</cell><cell cols="2">SDSs</cell><cell>STs</cell><cell></cell><cell cols="2">Scenes</cell><cell cols="2">NSSs</cell></row><row><cell></cell><cell></cell><cell></cell><cell>A1</cell><cell>A2</cell><cell cols="3">A1 A2 A1</cell><cell cols="2">A2 A1</cell><cell>A2</cell></row><row><cell>Bunnies Ch3</cell><cell>124</cell><cell>2756</cell><cell>8</cell><cell>10</cell><cell>1</cell><cell>0</cell><cell>8</cell><cell>9</cell><cell>0</cell><cell>0</cell></row><row><cell>Bunnies Ch4</cell><cell>65</cell><cell>1775</cell><cell>10</cell><cell>8</cell><cell>0</cell><cell>0</cell><cell>9</cell><cell>7</cell><cell>0</cell><cell>0</cell></row><row><cell>Bunnies Ch5</cell><cell>173</cell><cell>3514</cell><cell>10</cell><cell>7</cell><cell>0</cell><cell>3</cell><cell>10</cell><cell>7</cell><cell>0</cell><cell>0</cell></row><row><cell>Bunnies Ch6</cell><cell>117</cell><cell>2911</cell><cell>10</cell><cell>10</cell><cell>0</cell><cell>6</cell><cell>10</cell><cell>10</cell><cell>0</cell><cell>0</cell></row><row><cell>WOZ CH2</cell><cell>132</cell><cell>2449</cell><cell>4</cell><cell>8</cell><cell>0</cell><cell>2</cell><cell>4</cell><cell>8</cell><cell>0</cell><cell>1</cell></row><row><cell>WOZ CH3</cell><cell>123</cell><cell>2361</cell><cell>9</cell><cell>8</cell><cell>1</cell><cell>7</cell><cell>9</cell><cell>8</cell><cell>1</cell><cell>1</cell></row><row><cell>Sherlock Holmes Ch1 P1</cell><cell>268</cell><cell>4200</cell><cell>11</cell><cell>10</cell><cell>0</cell><cell>4</cell><cell>11</cell><cell>10</cell><cell>17</cell><cell>0</cell></row><row><cell>Sherlock Holmes Ch1 P2</cell><cell>277</cell><cell>4784</cell><cell>23</cell><cell>11</cell><cell>1</cell><cell>6</cell><cell>20</cell><cell>11</cell><cell>0</cell><cell>0</cell></row><row><cell>Sherlock Holmes Ch1 P3</cell><cell>93</cell><cell>1333</cell><cell>8</cell><cell>6</cell><cell>0</cell><cell>5</cell><cell>8</cell><cell>6</cell><cell>2</cell><cell>0</cell></row><row><cell>Sherlock Holmes Ch6</cell><cell>561</cell><cell>10974</cell><cell>31</cell><cell>34</cell><cell cols="2">2 15</cell><cell>26</cell><cell>34</cell><cell>8</cell><cell>0</cell></row><row><cell>Pride and Prejudice Ch1</cell><cell>60</cell><cell>1018</cell><cell>1</cell><cell>3</cell><cell>0</cell><cell>2</cell><cell>1</cell><cell>3</cell><cell>6</cell><cell>0</cell></row><row><cell>Pride and Prejudice Ch3</cell><cell>86</cell><cell>1984</cell><cell>12</cell><cell>10</cell><cell>0</cell><cell>5</cell><cell>12</cell><cell>10</cell><cell>0</cell><cell>0</cell></row><row><cell>A Tale of Two Cities Ch1</cell><cell>19</cell><cell>1140</cell><cell>0</cell><cell>5</cell><cell>0</cell><cell>4</cell><cell>0</cell><cell>5</cell><cell>19</cell><cell>0</cell></row><row><cell>A Tale of Two Cities Ch3</cell><cell>73</cell><cell>1920</cell><cell>4</cell><cell>13</cell><cell>0</cell><cell>4</cell><cell>4</cell><cell>13</cell><cell>11</cell><cell>0</cell></row><row><cell>The Great Gatsby Ch1</cell><cell>337</cell><cell>7209</cell><cell>19</cell><cell>43</cell><cell cols="2">1 12</cell><cell>19</cell><cell>43</cell><cell>52</cell><cell>0</cell></row><row><cell>The Great Gatsby Ch3</cell><cell>288</cell><cell>5307</cell><cell>31</cell><cell>24</cell><cell cols="2">0 10</cell><cell>29</cell><cell>24</cell><cell>13</cell><cell>0</cell></row><row><cell>Total</cell><cell>2796</cell><cell cols="3">55635 191 210</cell><cell cols="5">6 85 180 208 129</cell><cell>2</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2</head><label>2</label><figDesc>IAA Results, showing Cohen's kappa under varying degrees of leniency, where 𝑁 indicates the number of sentences apart SDS boundaries may be to count as a match or, where 𝑁 = 30%, the number of sentences expressed as a percentage of the median SDS sentence length for that text.</figDesc><table><row><cell>Chapter</cell><cell>SDS</cell><cell cols="5">N = 30% N = 0 N = 1 N = 3 N = 5</cell></row><row><cell></cell><cell>Median</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>Bunnies Chapter 3</cell><cell>7</cell><cell>0.79</cell><cell>0.74</cell><cell>0.74</cell><cell>0.79</cell><cell>0.79</cell></row><row><cell>Bunnies Chapter 4</cell><cell>5.5</cell><cell>0.60</cell><cell>0.60</cell><cell>0.60</cell><cell>0.60</cell><cell>0.60</cell></row><row><cell>Bunnies Chapter 5</cell><cell>14.5</cell><cell>0.45</cell><cell>0.29</cell><cell>0.29</cell><cell>0.45</cell><cell>0.45</cell></row><row><cell>Bunnies Chapter 6</cell><cell>5.75</cell><cell>0.68</cell><cell>0.47</cell><cell>0.47</cell><cell>0.72</cell><cell>0.76</cell></row><row><cell>WOZ Chapter 2</cell><cell>19.75</cell><cell>0.47</cell><cell>0.11</cell><cell>0.11</cell><cell>0.27</cell><cell>0.41</cell></row><row><cell>WOZ Chapter 3</cell><cell>11</cell><cell>0.77</cell><cell>0.57</cell><cell>0.57</cell><cell>0.72</cell><cell>0.77</cell></row><row><cell>Sherlock Holmes Chapter1 P1</cell><cell>10.5</cell><cell>0.16</cell><cell>0.07</cell><cell>0.07</cell><cell>0.16</cell><cell>0.16</cell></row><row><cell>Sherlock Holmes Chapter1 P2</cell><cell>8</cell><cell>0.53</cell><cell>0.42</cell><cell>0.42</cell><cell>0.53</cell><cell>0.53</cell></row><row><cell>Sherlock Holmes Chapter1 P3</cell><cell>8.75</cell><cell>0.75</cell><cell>0.75</cell><cell>0.75</cell><cell>0.75</cell><cell>0.75</cell></row><row><cell>Sherlock Holmes Chapter 6</cell><cell>8.5</cell><cell>0.37</cell><cell>0.25</cell><cell>0.25</cell><cell>0.37</cell><cell>0.40</cell></row><row><cell>Pride and Prejudice Chapter 1</cell><cell>16</cell><cell>0.65</cell><cell>0.65</cell><cell>0.65</cell><cell>0.65</cell><cell>0.65</cell></row><row><cell>Pride and Prejudice Chapter 3</cell><cell>6.25</cell><cell>0.65</cell><cell>0.43</cell><cell>0.43</cell><cell>0.65</cell><cell>0.65</cell></row><row><cell>Tale of Two Cities Chapter 3</cell><cell>6.5</cell><cell>0.21</cell><cell>0.01</cell><cell>0.01</cell><cell>0.30</cell><cell>0.39</cell></row><row><cell>The Great Gatsby Chapter 1</cell><cell>7.5</cell><cell>0.34</cell><cell>0.22</cell><cell>0.22</cell><cell>0.34</cell><cell>0.39</cell></row><row><cell>The Great Gatsby Chapter 3</cell><cell>6</cell><cell>0.49</cell><cell>0.38</cell><cell>0.38</cell><cell>0.58</cell><cell>0.66</cell></row><row><cell>Average</cell><cell>8.94</cell><cell>0.53</cell><cell>0.40</cell><cell>0.40</cell><cell>0.53</cell><cell>0.56</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">https://creativecommons.org/licenses/by-nc/4.0/.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">Full scene annotation in SceneML also involves annotating the time, place and characters (named entities) in the scene, using existing annotation standards (ISO-TimeML,ISO-Space and the ACE NE guidelines). However, in ScANT we focus on scene segmentation only</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">https://freekidsbooks.org/author/joe-corcoran/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_3">https://www.nltk.org/nltk_data/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_4">https://www.gutenberg.org</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_5">https://brat.nlplab.org</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="7" xml:id="foot_6">It can be converted to JSONL format using the tool at: https://github.com/astutic/brat-standoff-to-json/.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="8" xml:id="foot_7">As one of the chapters is quite long it has been divided into three parts for analysis.</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>The authors thank the Text2Story reviewers for their helpful comments. The first author acknowledges support from the University of Jeddah in the form of a PhD studentship.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<author>
			<persName><surname>Wikipedia</surname></persName>
		</author>
		<ptr target="https://en.wikipedia.org/wiki/Narrative" />
		<title level="m">Narrative</title>
				<imprint>
			<date type="published" when="2023-03-25">2023. 25 March, 2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">A pilot study on annotating scenes in narrative text using SceneML</title>
		<author>
			<persName><forename type="first">T</forename><surname>Alrashid</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Gaizauskas</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 4th international workshop on narrative extraction from texts</title>
				<meeting>the 4th international workshop on narrative extraction from texts<address><addrLine>Text2Story</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2021">2021. 2021</date>
			<biblScope unit="page" from="7" to="14" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<title level="m" type="main">Narratology: An introduction</title>
		<author>
			<persName><forename type="first">W</forename><surname>Schmid</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2010">2010</date>
			<publisher>Walter de Gruyter</publisher>
			<pubPlace>Berlin</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">SceneML: A proposal for annotating scenes in narrative text</title>
		<author>
			<persName><forename type="first">R</forename><surname>Gaizauskas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Alrashid</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 15th Workshop on Interoperable Semantic Annotation (ISA-15)</title>
				<meeting>the 15th Workshop on Interoperable Semantic Annotation (ISA-15)<address><addrLine>Gothenburg, Sweden</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Computational understanding of narratives: A survey</title>
		<author>
			<persName><forename type="first">P</forename><surname>Ranade</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Dey</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Joshi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Finin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Access</title>
		<imprint>
			<biblScope unit="volume">10</biblScope>
			<biblScope unit="page" from="101575" to="101594" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">A survey on narrative extraction from textual data</title>
		<author>
			<persName><forename type="first">B</forename><surname>Santana</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Campos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Amorim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Jorge</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Silvano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Nunes</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Artificial Intelligence Review</title>
		<imprint>
			<biblScope unit="page" from="1" to="43" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Detecting scenes in fiction: A new segmentation task</title>
		<author>
			<persName><forename type="first">A</forename><surname>Zehe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Konle</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">K</forename><surname>Dümpelmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Gius</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Hotho</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Jannidis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Kaufmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Krug</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Puppe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Reiter</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume</title>
				<meeting>the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume</meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="3167" to="3177" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">A coefficient of agreement for nominal scales</title>
		<author>
			<persName><forename type="first">J</forename><surname>Cohen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Educational and psychological measurement</title>
		<imprint>
			<biblScope unit="volume">20</biblScope>
			<biblScope unit="page" from="37" to="46" />
			<date type="published" when="1960">1960</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Learning from disagreement: A survey</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">N</forename><surname>Uma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Fornaciari</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Hovy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Paun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Plank</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Poesio</surname></persName>
		</author>
		<idno type="DOI">10.1613/jair.1.12752</idno>
		<ptr target="https://doi.org/10.1613/jair.1.12752.doi:10.1613/jair.1.12752" />
	</analytic>
	<monogr>
		<title level="j">J. Artif. Int. Res</title>
		<imprint>
			<biblScope unit="volume">72</biblScope>
			<biblScope unit="page" from="1385" to="1470" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<title level="m" type="main">s kappa</title>
		<author>
			<persName><surname>Wikipedia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">'</forename><surname>Cohen</surname></persName>
		</author>
		<ptr target="https://en.wikipedia.org/wiki/Cohen%27s_kappa" />
		<imprint>
			<date type="published" when="2022">2023. 26 March, 2022</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
