<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Breaking the Narrative: Scene Segmentation through Sequential Sentence Classification</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Murathan</forename><surname>Kurfalı</surname></persName>
							<email>murathan.kurfali@ling.su.se</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Linguistics</orgName>
								<orgName type="institution">Stockholm University Stockholm</orgName>
								<address>
									<country key="SE">Sweden</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Mats</forename><surname>Wirén</surname></persName>
							<email>mats.wiren@ling.su.se</email>
							<affiliation key="aff1">
								<orgName type="department">Department of Linguistics</orgName>
								<orgName type="institution">Stockholm University Stockholm</orgName>
								<address>
									<country key="SE">Sweden</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Breaking the Narrative: Scene Segmentation through Sequential Sentence Classification</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">AE687298906ECBC236C6A111506216E1</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T14:04+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In this paper, we describe our submission to the Shared Task on Scene Segmentation (STSS). The shared task requires participants to segment novels into coherent segments, called scenes. We approach this as a sequential sentence classification task and offer a BERTbased solution with a weighted cross-entropy loss. According to the results, the proposed approach performs relatively well on the task as our model ranks first and second, in official in-domain and out-domain evaluations, respectively. However, the overall low performances (0.37 F 1 -score) suggest that there is still much room for improvement.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Scene segmentation is a novel task introduced in <ref type="bibr" target="#b9">(Zehe et al., 2021a</ref>) that aims to divide long narrative texts, e.g. novels, into smaller coherent segments or scenes, as they are called. Scenes, in this context, can be roughly defined as "a segment of a text where the story time and the discourse time are more or less equal, the narration focuses on one action and space and character constellations stay the same" <ref type="bibr" target="#b9">(Zehe et al., 2021a)</ref>. 1 The task of scene segmentation is of great value on several ends: (i) it can be directly employed in several digital humanities tasks, e.g. plot reconstruction; (ii) segmenting longer texts into smaller coherent pieces help other NLP tasks, e.g. co-reference resolution, that struggle with texts longer than a couple of paragraphs <ref type="bibr" target="#b5">(Joshi et al., 2020)</ref>; (iii) as a novel task that requires high-level modeling of long texts, it offers itself as a valuable probing task to evaluate language models on long-context scenarios which is an active research area <ref type="bibr" target="#b7">(Tay et al., 2020)</ref>. 1 Interested readers are referred to annotation guidelines available at https://zenodo.org/record/4457177 for further details.</p><p>Our main interest in the current paper is to explore whether scene segmentation can be handled as a sequential sentence classification task. To this end, we follow the methodology proposed in <ref type="bibr" target="#b2">Cohan et al. (2019)</ref>, which encodes all sentences in a sequence jointly through BERT <ref type="bibr" target="#b4">(Devlin et al., 2019)</ref> to directly leverage the contextual information from all tokens in the sequence at the same time. The model of <ref type="bibr" target="#b2">Cohan et al. (2019)</ref> is further adapted to the task via introduction of a weighted cross-entropy loss in order to account for the imbalanced distribution of the labels in the dataset.</p><p>According to the official results, our model achieves the best performance on the in-domain texts, significantly outperforming the secondranking system. However, the performance drops when evaluated on out-of-domain novels, suggesting that the proposed methodology only poorly generalizes over different domains. We release our system to facilitate reproducibility and future work.<ref type="foot" target="#foot_0">2</ref> 2 System Overview</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Task Details</head><p>The scene segmentation task can be framed in several ways. Within the shared task, it is defined as the identification of the boundaries that delimit the consecutive segments <ref type="bibr" target="#b10">(Zehe et al., 2021b)</ref>. The boundaries between segments are labeled according to the types of segments they delimit. Specifically, a boundary can belong one of the following three classes: Scene-Scene; Nonscene-Scene; Scene-Nonscene. <ref type="foot" target="#foot_1">3</ref>The participating teams are evaluated only according to their success at finding and labeling these boundaries. That is to say, classification of an individual sentence as belonging to a Scene or a Nonscene means very little in the evaluations. Of the possible three transitions, Scene-Scene is the most common one as Nonscenes are significantly less frequent in data (see Table <ref type="table" target="#tab_0">1</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Our Model</head><p>We model scene segmentation as a Sequence Sentence Classification (SSC) task where the goal is to understand whether a given sentence is segmentinitial or not along with the type of segment it belongs to. Similarly to the more common token classification tasks, e.g. POS-tagging or NER, we employ the IOB2 format and assign a tag to each sentence. Specifically, we label segment-initial sentences (boundaries) as #X-B and other sentences as merely #X where X indicates the type of segment (Scene or Nonscene).</p><p>Our classifier closely follows the methodology proposed in <ref type="bibr" target="#b2">Cohan et al. (2019)</ref>. Here, the authors employed BERT to perform several document-level classification tasks, e.g. abstract sentence classification, where the aim is to classify sentences in a scientific abstract into their rhetorical roles such as introduction, method, etc. The rest of this section describes the model along with our modifications.</p><p>The proposed methodology follows the standard way of using BERT through fine-tuning on the target task but uses a novel input representation. The classifier used in the experiments is illustrated in Figure <ref type="figure" target="#fig_0">1</ref>. As input, a sequence of N sentences is concatenated by BERT's special delimiter token [SEP], yielding one long sequence. This sequence, after the insertion of the standard [CLS] token at the beginning, is fed into BERT. However, unlike the standard way of using the [CLS] token as the representation of the input sequence, the representations of the individual [SEP] tokens are used as the representations of the sentences that precede them. Hence, instead of the [CLS] token, [SEP] representations are classified by a multi-layer feedforward network to reach labels.</p><p>The rationale for using [SEP] as sentence representation has to do with the next-sentence objective of BERT: "Intuitively, through BERT's pretraining, the [SEP] tokens learn sentence structure and relations between continuous sentences" <ref type="bibr" target="#b2">(Cohan et al., 2019)</ref>. During fine-tuning, the model is further primed to assign appropriate weights to <ref type="bibr">[SEP]</ref> tokens to encode necessary contextual information for classification. Fine-tuning BERT in this way has the benefit of simultaneously leveraging the contextual information from all sentences in the sequence.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Loss function</head><p>The model is trained to minimize the cross-entropy loss between the probabilities over the possible labels computed using a softmax activation and the target distribution. However, during the initial experiments, we observed that the model severely suffered from the highly skewed label distribution, namely the low number of boundary sentences in comparison to nonboundary ones. <ref type="foot" target="#foot_3">4</ref> In order to mitigate this issue, following the previous studies <ref type="bibr" target="#b6">(Rotsztejn et al., 2018;</ref><ref type="bibr" target="#b3">Cui et al., 2019;</ref><ref type="bibr" target="#b8">Yang et al., 2019)</ref>, we introduce a weighting factor to the loss function where each class is assigned a weight that is inversely proportionally to their frequency in the training set:</p><formula xml:id="formula_0">weight c = i freq(i) freq(c)</formula><p>where freq indicates the count of a certain class.</p><p>Overall, the weighted cross-entropy becomes</p><formula xml:id="formula_1">Loss W CE = − C c w c t c log(s c</formula><p>) where w c is the weight, t c is the gold truth value (taking either 0 or 1), and s c is the corresponding Softmax probability of the class c.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Experimental Setup</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Data</head><p>The dataset used in this shared task is based on an expanded version of the annotation effort introduced in <ref type="bibr" target="#b9">(Zehe et al., 2021a)</ref>  <ref type="table" target="#tab_0">1</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Parameter Setting</head><p>We follow the implementation of <ref type="bibr" target="#b2">Cohan et al. (2019)</ref>. <ref type="foot" target="#foot_5">6</ref> As the language model, we use the large German BERT model from <ref type="bibr" target="#b1">(Chan et al., 2020)</ref> (dubbed GBERT-large<ref type="foot" target="#foot_6">7</ref> ) due its superior performance over the existing German models. The batch size of 8 and gradient accumulation steps of 4 are used to reach effective batch size of 32. All experiments are run on a single V100 GPU. We set the learning rate to 5e-6 and the training is run for the maximum of 100 epochs with the early stopping applied (patience = 20) based on the performance on the development set. Due to BERT's inherit sequence size limit, we set a threshold of 25 sentences in each sequence which is chosen empirically (i.e., according to the performance on the in-house test set) among the set of {20, 25, 30, 50}.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Results and Discussion</head><p>The official evaluation is performed on two different test sets:</p><p>i. Test suite 1 focuses on in-domain evaluation and consists of 5 annotated dime novels,</p><p>ii. Test suite 2 focuses on out-of-domain evaluation and consists of 2 annotated contemporary high-literature texts.</p><p>Table <ref type="table">2</ref> presents the breakdown of our results into each possible transition whereas the official ranking of the participating systems, according to the mean micro-averaged F 1 scores, is provided in Table <ref type="table" target="#tab_2">3</ref>. According to the official rankings, the proposed approach is good at segmenting in-domain novels and outperforms the second best system by some margin. However, the performance significantly drops when evaluated on out-of-domain novels, suggesting that the system generalizes poorly across domains.</p><p>According to Table <ref type="table">2</ref>, our model is best at recognizing Scene to Scene transitions; however, it is almost completely incapable of finding the borders between non-scenes and scenes. Suggested by the high-recall, low-precision scores, our model tends to over-segment the novels. On average, the system divides the in-domain novels into 1.76 and outof-domain novels into 1.61 times more segments. This tendency towards over-segmenting hints at over-sensitivity to certain markers which is further discussed in the next section.</p><p>Overall, the results clearly demonstrate that the task is extremely challenging even in the in-domain setting. The poor performance of solutions based on contextual embeddings<ref type="foot" target="#foot_7">8</ref> highlight the need for novel architectures. One obvious drawback with BERT-based models is their inability to encode long sequences. Hence, a straightforward extension of the current model would be to employ a model which supports longer contexts, e.g. Longformer <ref type="bibr" target="#b0">(Beltagy et al., 2020)</ref>; however, such a model is unfortunately not available for German at the time of writing.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Error Analysis</head><p>In addition to the official evaluation, we performed a manual error analysis of our model's predictions on the in-house test set (see Section 3.1). One observation was that in certain cases, although the model correctly recognized the type of transition (e.g. Scene-Scene), it misplaced the boundary only  by a single sentence. An instance of this can be seen in Example (1), where the predicted boundary appears just before sentence (1a), whereas the gold boundary appears just before the subsequent sentence (1b): 9</p><p>(1) a. Und bald darauf fuhr der Wagen aus dem Wald und einen allmählich ansteigenden Berg hinan.</p><p>(And soon afterwards the car drove out of the forest and up a gradually rising mountain.)</p><p>b. Dort oben lag das Schloss Treuenfels.</p><p>(Treuenfels Castle was up there.)</p><p>Furthermore, as mentioned in the previous section, the system tends to over-segment the novels. A manual inspection of false positives (sentences that are erroneously identified as segment boundaries) reveals that despite being incorrect, these predictions are not completely random. Most of the false positives involve an adverbial or other kind of phrase which signals a shift in time and/or place. Some cherry-picked examples are given in (5) Er wandte sich um und ging wieder zurück, bis in das Zimmer, wo der Schreibtisch der Gräfin Alice stand.</p><p>(He turned and went back to the room where Countess Alice's desk was.).</p><p>Similar to the behavior of the baseline system proposed in <ref type="bibr" target="#b9">Zehe et al. (2021a)</ref>, these examples highlight the model's sensitivity to the local cues rather than the larger context. That is, to a certain extent, the system makes its predictions according to the individual phrases that signal shifts in time or place, paying too little attention to the global context.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Conclusion</head><p>The current paper summarizes our submission to the Shared Task on Scene Segmentation (STSS). We handle scene segmentation as a sequential sentence classification task and offer a BERT-based solution. The proposed model achieves the best performance in the in-domain evaluations but falls short of transferring its performance across domains. Error analysis further reveals that the predictions are more sensitive to local cues rather than the global structure of the text, highlighting the need for better document-level modeling.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Overview of the system architecture. Each sentence is represented by the respective [SEP] token which is used to predict the label. Figure copied from Cohan et al. (2019).</figDesc><graphic coords="2,307.56,62.79,217.71,121.58" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 :</head><label>1</label><figDesc>, and consists of 20 German novels in total, excluding the blind test sets Segment| |Avg. Sent| Count |Avg. Segment| |Avg. Sent| Characteristics of the train/dev/test splits used in model development. The numbers in the columns refer to number of segments, the average size of a segment (in terms of # of sentences) and the average size of a sentence (in terms of # of words) for Scenes and Non-scenes separately.</figDesc><table><row><cell>Split</cell><cell></cell><cell>Scene</cell><cell></cell><cell></cell><cell>Non-scene</cell><cell></cell></row><row><cell cols="3">Count |Avg. Training 1075 45.33</cell><cell>10.58</cell><cell>51</cell><cell>16.11</cell><cell>15.39</cell></row><row><cell>Dev</cell><cell>127</cell><cell>69.5</cell><cell>12.12</cell><cell>7</cell><cell>5.6</cell><cell>18.20</cell></row><row><cell>Test</cell><cell>46</cell><cell>38.16</cell><cell>8.05</cell><cell>7</cell><cell>19.14</cell><cell>11.15</cell></row><row><cell cols="4">used in the official evaluations. During model de-</cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="4">velopment, we create custom development and test</cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="4">sets by randomly allocating one file for each, using the remaining 18 files for training. 5 The statistics</cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="4">regarding the training/dev/test splits used during</cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="3">model development are provided in Table</cell><cell></cell><cell></cell><cell></cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3 :</head><label>3</label><figDesc>Official rankings and results of all participating systems, according to the micro-averaged F 1 scores, averaged over all the novels in the corresponding suite. Results of our submission is highlighted in boldface.</figDesc><table /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_0">https://github.com/MurathanKurfali/scene segmentation</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_1">Unlike Scenes, Nonscenes, naturally, are not distinguished from one another; hence, Nonscene-Nonscene is not a valid transition.Copyright © 2021 for this paper by its authors. Use permitted under Creative Commons License</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_2">Attribution 4.0 International (CC BY 4.0).</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_3">The most frequent label, Scene, single-handedly accounts for 96.1% of the training data.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_4">Files 9783740941093 and 9783732522033 are used the dev and test set, respectively.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_5">https://github.com/allenai/sequential sentence classification</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="7" xml:id="foot_6">https://huggingface.co/deepset/gbert-large</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="8" xml:id="foot_7">A BERT-based baseline in the original resource paper similarly fails on this task<ref type="bibr" target="#b9">(Zehe et al., 2021a)</ref>.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="10" xml:id="foot_8">In these examples, the system has predicted the shift immediately before the sentences displayed.</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>This work has been partly funded by an infrastructure grant from the Swedish Research Council (SWE-CLARIN, 2019-24; contract no. 2017-00626). We thank the Swedish National Infrastructure for Computing (SNIC) for providing computational resources under Project 2020/33-26.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<author>
			<persName><forename type="first">Iz</forename><surname>Beltagy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Matthew</forename><forename type="middle">E</forename><surname>Peters</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Arman</forename><surname>Cohan</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2004.05150</idno>
		<title level="m">Longformer: The long-document transformer</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">German&apos;s next language model</title>
		<author>
			<persName><forename type="first">Branden</forename><surname>Chan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Stefan</forename><surname>Schweter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Timo</forename><surname>Möller</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 28th International Conference on Computational Linguistics</title>
				<meeting>the 28th International Conference on Computational Linguistics</meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="6788" to="6796" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Pretrained language models for sequential sentence classification</title>
		<author>
			<persName><forename type="first">Arman</forename><surname>Cohan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Iz</forename><surname>Beltagy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Daniel</forename><surname>King</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bhavana</forename><surname>Dalvi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dan</forename><surname>Weld</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)</title>
				<meeting>the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)<address><addrLine>Hong Kong, China</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="3693" to="3699" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Class-balanced loss based on effective number of samples</title>
		<author>
			<persName><forename type="first">Yin</forename><surname>Cui</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Menglin</forename><surname>Jia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Tsung-Yi</forename><surname>Lin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yang</forename><surname>Song</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Serge</forename><surname>Belongie</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the IEEE/CVF conference on computer vision and pattern recognition</title>
				<meeting>the IEEE/CVF conference on computer vision and pattern recognition</meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="9268" to="9277" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">BERT: Pre-training of deep bidirectional transformers for language understanding</title>
		<author>
			<persName><forename type="first">Jacob</forename><surname>Devlin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ming-Wei</forename><surname>Chang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kenton</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kristina</forename><surname>Toutanova</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</title>
		<title level="s">Long and Short Papers</title>
		<meeting>the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies<address><addrLine>Minneapolis, Minnesota</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2019">2019</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="4171" to="4186" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Spanbert: Improving pre-training by representing and predicting spans</title>
		<author>
			<persName><forename type="first">Mandar</forename><surname>Joshi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Danqi</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yinhan</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Luke</forename><surname>Daniel S Weld</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Omer</forename><surname>Zettlemoyer</surname></persName>
		</author>
		<author>
			<persName><surname>Levy</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Transactions of the Association for Computational Linguistics</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="page" from="64" to="77" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Eth-ds3lab at semeval-2018 task 7: Effectively combining recurrent and convolutional neural networks for relation classification and extraction</title>
		<author>
			<persName><forename type="first">Jonathan</forename><surname>Rotsztejn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Nora</forename><surname>Hollenstein</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ce</forename><surname>Zhang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of The 12th International Workshop on Semantic Evaluation</title>
				<meeting>The 12th International Workshop on Semantic Evaluation</meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="689" to="696" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<title level="m" type="main">Long range arena: A benchmark for efficient transformers</title>
		<author>
			<persName><forename type="first">Yi</forename><surname>Tay</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mostafa</forename><surname>Dehghani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Samira</forename><surname>Abnar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yikang</forename><surname>Shen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dara</forename><surname>Bahri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Philip</forename><surname>Pham</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jinfeng</forename><surname>Rao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Liu</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sebastian</forename><surname>Ruder</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Donald</forename><surname>Metzler</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2011.04006</idno>
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<title level="m" type="main">Emotionx-ku: Bertmax based contextual emotion classifier</title>
		<author>
			<persName><forename type="first">Kisu</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dongyub</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Taesun</forename><surname>Whang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Seolhwa</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Heuiseok</forename><surname>Lim</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1906.11565</idno>
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Detecting scenes in fiction: A new segmentation task</title>
		<author>
			<persName><forename type="first">Albin</forename><surname>Zehe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Leonard</forename><surname>Konle</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Lea</forename><surname>Katharina Dümpelmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Evelyn</forename><surname>Gius</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Andreas</forename><surname>Hotho</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Fotis</forename><surname>Jannidis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Lucas</forename><surname>Kaufmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Markus</forename><surname>Krug</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Frank</forename><surname>Puppe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Nils</forename><surname>Reiter</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume</title>
				<meeting>the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume</meeting>
		<imprint>
			<date type="published" when="2021">2021a</date>
			<biblScope unit="page" from="3167" to="3177" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Shared task on scene segmentation@konvens2021</title>
		<author>
			<persName><forename type="first">Albin</forename><surname>Zehe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Leonard</forename><surname>Konle</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Svenja</forename><surname>Guhr</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Lea</forename><surname>Katharina Dümpelmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Evelyn</forename><surname>Gius</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Andreas</forename><surname>Hotho</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Fotis</forename><surname>Jannidis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Lucas</forename><surname>Kaufmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Markus</forename><surname>Krug</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Frank</forename><surname>Puppe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Nils</forename><surname>Reiter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Annekea</forename><surname>Schreiber</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Shared Task on Scene Segmentation</title>
				<imprint>
			<date type="published" when="2021">2021b</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
