<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Recognising non-named spatial entities in literary texts: a novel spatial entities classifier ⋆</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Daniel</forename><surname>Kababgi</surname></persName>
							<email>daniel.kababgi@uni-bielefeld.de</email>
							<affiliation key="aff0">
								<orgName type="institution">Universität Bielefeld</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Gulia</forename><surname>Grisot</surname></persName>
							<affiliation key="aff1">
								<orgName type="institution">University of Cambridge</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Federico</forename><surname>Pennino</surname></persName>
							<email>federico.pennino2@unibo.it</email>
							<affiliation key="aff2">
								<orgName type="institution">Università di Bologna</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Berenike</forename><surname>Herrmann</surname></persName>
							<email>berenike.herrmann@uni-bielefeld.de</email>
							<affiliation key="aff0">
								<orgName type="institution">Universität Bielefeld</orgName>
							</affiliation>
						</author>
						<title level="a" type="main">Recognising non-named spatial entities in literary texts: a novel spatial entities classifier ⋆</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">7960785DB6627B883E992685990A2075</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T19:49+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Computational Literary Studies, language model, spatial humanities, token classification 0009-0002-0990-6418 (D. Kababgi)</term>
					<term>0000-0002-3038-6202 (G. Grisot)</term>
					<term>0000-0001-7563-070X (F. Pennino)</term>
					<term>0000-0002-5256-0566 (B. Herrmann)</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Predicting spatial representations in literature is a challenging task that requires advanced machine learning methods and manual annotations. In this paper, we present a study that leverages manual annotations and a BERT language model to automatically detect and recognise non-named spatial entities in a historical corpus of Swiss novels. The annotated data, consisting of Swiss narrative texts in German from the period of 1840 to 1950, was used to train the machine learning model and fine-tune a deep learning model specifically for literary German. The annotation process, facilitated by the use of Prodigy, enabled iterative improvement of the model's predictions by selecting informative instances from the unlabelled data. Our evaluation metrics (F1 score) demonstrate the model's ability to predict various categories of spatial entities in our corpus. This new method enables researchers to explore spatial representations in literary text, contributing both to digital humanities and literary studies. While our study shows promising results, we acknowledge challenges such as representativeness of the annotated data, biases in manual annotations, and domain-specific language. By addressing these limitations and discussing the implications of our findings, we provide a foundation for future research in sentiment and spatial analysis in literature. Our findings not only contribute to the understanding of literary narratives but also demonstrate the potential of automated spatial analysis in historical and literary research.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Building on previous work examining fictional space and sentiment in Swiss-German narrative <ref type="bibr" target="#b6">[7]</ref>, this paper reports on the development and evaluation of a novel machine learning model for the analysis of fictional space in literary texts. 1  In recent criticism across disciplines there has been an increased emphasis towards considering place and space as crucial factors in understanding social, cultural, and historical phenomena. This perspective on spatiality is generally referred to as the 'spatial turn' <ref type="bibr" target="#b8">[9,</ref><ref type="bibr" target="#b9">10,</ref><ref type="bibr" target="#b18">19]</ref>, and, in literary studies, it highlights the integral components of space representation in how we under-stand and contextualise narrative and fictional texts. The exploration of spatial representations in literary works offers valuable insights into the landscapes, the constructed environments and their social implications within narratives, as well as into the cultural and socio-political constructs surrounding certain images. while there are several valuable proposals to a quantitative approach to spatial research <ref type="bibr" target="#b23">[24,</ref><ref type="bibr" target="#b1">2]</ref>, including for example a differentiation of space as background and place more specifically as locus of events <ref type="bibr" target="#b22">[23,</ref><ref type="bibr" target="#b14">15]</ref>, we however set a different focus.</p><p>In this paper, we present a case study on the prediction of what we call 'non-named spatial entities' (NNSE) in a historical corpus of Swiss-German novels using a deep learning model in conjunction with BERT and Prodigy. By combining manual annotations and advanced machine learning methods, we aim to automatically detect and recognise NNSE within the literary narratives via a similar approach to named entity recognition (NER).</p><p>NER techniques are used to identify and categorise text segments that refer to entities such as people, places, or companies, and that 'constitute proper names' <ref type="bibr" target="#b10">[11]</ref>. The latest NER techniques rely on manually annotated text corpora, which are automatically analysed to build models that capture language use and grammar. These models can then identify and classify entities in new, unprocessed documents. State-of-the-art NER systems come with pre-built models trained on extensive collections of annotated documents, like news articles. These models typically perform well and are ideal for specific applications such as analysing customer feedback or extracting locations and characters. However, when applied to documents with linguistic features not well-represented in the training data, such as literary texts, NER performance can decline, increasing the likelihood of errors, also increased for most languages other than English.</p><p>Within literary studies, various scholars have used NER, particularly to identify fictional characters <ref type="bibr" target="#b19">[20]</ref> , build social networks <ref type="bibr" target="#b4">[5]</ref>, identify geographical locations <ref type="bibr" target="#b2">[3]</ref>, to assign headings to novels <ref type="bibr" target="#b15">[16]</ref>, or to analyse relationships between literary works <ref type="bibr" target="#b20">[21,</ref><ref type="bibr" target="#b21">22,</ref><ref type="bibr" target="#b16">17]</ref>. Only few researchers until recently however focused on the identification of NNSE, i.e. those terms, or elements of space representation, which are not necessarily named geographical locations, like Berlin, London, or Zurich, and that therefore typically cannot be located on a map. It is this kind of entity that generally contributes most to effectively to create the so-called 'storyworld' <ref type="bibr" target="#b17">[18]</ref>: simple terms or phrases such as 'mountain', 'bridge', 'beach' or 'cave', as well as objects and architectural parts that make them tangible, such as 'window', 'table', or 'wall'.</p><p>A similar perspective has been considered by Schumacher, Flüh, and Nantke <ref type="bibr" target="#b13">[14]</ref>, who used conditional random fields to automatically annotate among other things non-named places, which is conceptually similar to NNSEs. However, the operationalization of space in their research is focused on places that can be found on a map, leaving therefore the broader concept of NNSEs unexplored. Also, the popular BookNLP toolkit by Bamman is able to work to identify what they call 'locations' (for natural entities) and facilities' (for man-made structures) in English-language texts with an accuracy of up to 90%<ref type="foot" target="#foot_0">2</ref> . While BookNLP is able to differentiate NNSEs somewhat according to our own needs, we propose a distinction for 'facilities' into more discrete classes shown in Section 2.1. This paper sets out to fill this gap, training a model that will help us identify spatial elements in narrative. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Method</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Spatial categories</head><p>In order to train our model to recognise literary space, we decided to train it not only to be able to recognise non-named spatial entities (NNSEs), but also to distinguish among four different types of spatial environments. We decided to base our categorisation on the research by Grisot and Herrmann <ref type="bibr" target="#b6">[7]</ref>, who looked at the sentiment encoded in narrative texts in relation to both named and non-named entities. They used a dictionary based approach, collecting spatial terms for geographical locations as well as for non-named entities, distinguishing in particular the categories 'rural', 'natural' and 'urban'. While these three categories offered a promising base, we felt that for a more comprehensive perspective on the spaces and places rendered in fictional texts we also needed to include spatial elements describing the interiors/indoor space of buildings and rooms. We therefore created an additional category of NNSE, 'interior'. Some examples for each category are shown in Table <ref type="table" target="#tab_0">1</ref> above.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Annotations and model training</head><p>To produce the training set, two annotators were provided with written guidelines and trained in person to understand the difference between the various NNSE categories. They were then instructed to use the platform Prodigy <ref type="bibr" target="#b11">[12]</ref>, which allowed them to read sentences from the training set in random order, and to annotate NNSEs directly on the interface by adding labels to individual tokens. For the annotation process, six novels were sampled from the complete corpus of Swiss-German novels <ref type="bibr" target="#b7">[8]</ref>. <ref type="foot" target="#foot_1">3</ref> The novels were then split into sentences (N=9,062), which were manually annotated by the annotators. The annotators featured a high interannotator agreement (Cohen's Kappa) for the NNSE types 'interior', 'natural', and 'rural', and medium agreement for 'urban' <ref type="foot" target="#foot_2">4</ref> , as well as a high agreement for the distinction of NNSEs against a not-NNSE token (Cohen's Kappa = 0.898).</p><p>The values for annotators' agreement in relation to individual types are shown in Table <ref type="table" target="#tab_1">2</ref>. Also shown in Table <ref type="table" target="#tab_1">2</ref> are the number of sentences in our dataset in which the various classes of spatial entities were identified by manual annotation. Type O shows the number of sentences where no NNSEs were identified in the annotation process. These amount to over 86% of the dataset, making the distribution of the five categories in our dataset unbalanced. However, the NNSE categories are much closer together, ranging between 2.2% (urban) and 4.5% (interior) of all sentences.</p><p>After the annotation process, sentences were randomly assigned to either the training dataset (80%, N=7,249) or the test dataset (20%, N=1,813). This was done according to common best practices for training a machine learning model <ref type="bibr" target="#b5">[6]</ref>. To train the classifier, PyTorch version 2.1.1 was used as the deep learning framework <ref type="bibr" target="#b12">[13]</ref>. In conjunction with PyTorch, the popular hugging face library (version 4.35.2) was used to load and interact with the language model <ref type="bibr" target="#b24">[25]</ref>.</p><p>The model gbert-large by deepset was utilised as input layer for the token classifier, since it outperformed all other language models <ref type="bibr" target="#b3">[4]</ref>. The model classifies each token of a given text, and attempts to predict whether the token under consideration can be classified as a NNSE (one of the four types mentioned above) or whether it can be considered not a spatial term (O). It was tested if the model performs better with the complete, unbalanced training dataset (N=7,249 sentences) or with a more balanced, downsampled training dataset (N=2,004 sentences). The downsampled training dataset was composed of sentences including at least one NNSE (N=1,002) and a random selection of sentences with no NNSE of equal size. The downsampled train dataset was then split again into a final training dataset (N=1,603) and a validation set (N=401).</p><p>The training was repeated for 17 epochs with a learning rate of 5e-6 and a dropout of 0.1. These parameters are determined by extensive hyperparameter testing. Since the maximum length of all sentences is 40 words or 61 tokens, the max length for the BERT model was set to 64 tokens.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Results</head><p>With the annotation and training process described above, we produced a classifier able to 1) identify NNSEs in a given sentence, and 2) classify the identified NNSE as belonging to one of the four discrete classes: 'rural', 'urban', 'natural' or 'interior'. <ref type="foot" target="#foot_3">5</ref> To find the best parameters, Figure <ref type="figure" target="#fig_0">1</ref> illustrates the performance evaluation of the classifier on the validation dataset across different epochs. Class O, which represents tokens not classified as NNSE according to our guidelines, consistently achieves an almost perfect F1 score of 1. However, after 17 epochs, there is a significant decline in performance. Among the other classes, the 'interior' class performs best with an F1 score of 0.792, while the 'urban' class performs worst with an F1 score of 0.632 on the validation dataset. For a more detailed analysis, Figure <ref type="figure" target="#fig_1">2</ref> presents the average F1 scores for the 'interior', 'urban', 'rural', and 'natural' classes on the validation dataset, excluding class O. The black dotted line indicates the highest overall F1 score of 0.743.</p><p>Table <ref type="table" target="#tab_2">3</ref> below presents the final scores on the test dataset. Class O, with an F1 score of 0.99, significantly outperforms all other classes, as is expected. The 'interior' class follows with an F1 score of 0.60. The remaining classes have F1 scores ranging between 0.64 and 0.53.</p><p>for categorization into interior-rural or interior-urban. We are planning to run a set of annotations on the interior items to add this type of information and explore the differences.  In addition to the general F1 scores for each class, we analysed false classifications across all five classes. The separation between NNSEs and class O was highly effective, with tokens belonging to class O rarely being incorrectly classified as NNSE. Conversely, the most common error for all types of spatial entities was their misclassification as class O. Notably, the differentiation between various spatial entities was generally accurate, except for the 'urban' class, which was occasionally misclassified as 'rural'.  The highest error rate for all four NNSE types involves their misclassification as class O, ranging from 6.9% for 'interior' to 14% for 'rural'. The biggest error after that is by misclassifing 'rural' as 'urban' 13% error rate for being classified as 'rural'.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Conclusion and further Research</head><p>In this work, we have developed a tool that already in its present state facilitates valuable quantitative spatial research on 19th and early 20th century German-language literary corpora. While out-of-the-box solutions typically only provide Named Entity Recognition (NER) models, to the best of the authors' knowledge, a classification of non-named spatial entities as conducted here, classifying each entity into different types, has never been published before for German. Schumacher, Flüh, and Nantke <ref type="bibr" target="#b13">[14]</ref> developed a classifier based on Conditional Random Fields (CRF), which includes several more categories than our classifier, but it can only detect places, not their types. Bamman's bookNLP is able to differentiate between locations, which alignes with our 'natural' type, and facilities, which covers 'urban' and 'rural' spatial entities. For comparison, we also tested the large language model Llama3 7b <ref type="bibr" target="#b0">[1]</ref> with a fewshot prompt for recognising unnamed spatial entities (NNSE). The automatic classification on the test dataset with Llama3 resulted in only a 5.6% partial match with the manual annotation, and a 0.7% perfect match. The main issue was the model's tendency to hallucinate new NNSEs when attempting to continue a sentence, contrary to instructions.</p><p>The high performance in distinguishing spatial entities from non-spatial tokens is unsurprising, as this was the least contentious aspect during the evaluation of the annotation process. The high error rate of 'rural' being misclassified as 'urban' but not vice versa can be explained by the prevalence of 'rural' space in the training data. Additionally, the boundary between 'rural' and 'urban', as described in the guidelines, is more 'fuzzy' compared to the respective distinctions to 'interior' and 'natural'. This fuzziness may be aggravated by the inherent ambiguity of using sentences as training units.</p><p>This classifier is considered a work in progress, as it has currently been exclusively trained on Swiss-German texts from the late 19th to early 20th century. Potential improvements include gathering more training data and adapting the gbert-large model to Swiss-German literary texts from the long 19th century and beyond, as well as remodelling the categories to include interiorurban and interior-rural.</p><p>We plan to utilise this classifier to explore the remainder of the Swiss-German novel corpus built by Herrmann and Grisot <ref type="bibr" target="#b7">[8]</ref>, qualitatively examining patterns in the representation of space, with a particular focus on interior items. Subsequently, we intend to extend our research to a broader corpus of German literature. Methodologically, we plan to evaluate the use of synthetic training data provided by generative AI to enhance our model. One key aspect of space that will be analysed in-depth in future research is the relationship between affect and space in literature, building upon the previous work of Grisot and Herrmann <ref type="bibr" target="#b6">[7]</ref>.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: F1 scores on the validation dataset for all classes, including O, over 160 epochs. Highest scores for all trained epochs are by class O, the lowest scores are for identifying urban NNSE</figDesc><graphic coords="5,89.28,84.17,416.72,312.54" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Mean F1 score for the classes interior, urban, rural, and natural over 160 epochs. The dotted black line indicates the highest mean F1-score of 0.743.</figDesc><graphic coords="6,89.28,84.17,416.72,312.54" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Confusion matrix for all five classes. All classes are classified correctly as themselves, while the biggest error is not being recognized as a NNSE. The biggest error after that is by misclassifing 'rural' as 'urban' 13% error rate for being classified as 'rural'.</figDesc><graphic coords="7,151.79,84.18,291.70,316.01" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 3</head><label>3</label><figDesc>Figure3displays a confusion matrix for all classes. The highest error rate for all four NNSE types involves their misclassification as class O, ranging from 6.9% for 'interior' to 14% for 'rural'. The biggest error after that is by misclassifing 'rural' as 'urban' 13% error rate for being classified as 'rural'.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Examples of words for each category of NNSE</figDesc><table><row><cell>Category Exampels</cell></row></table><note>interior Abstellkammer (storage room), Wohnzimmer (living room), Küche (kitchen) urban Bibliothek (library), Kloster (abby), Vorstadt (suburb) rural Bauernhaus (farmhouse), Garten (garden), Schweinestall (pigsty) natural Berg (mountain), Fluss (river), Wald (forest)</note></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2</head><label>2</label><figDesc>Types of NNSE in complete dataset with inter-annotator aggreement (Cohen's Kappa) between the two annotators for each class, number of sentences in the complete dataset in which the types of NNSE were identified by manual annotation. Class O marks sentences with no NNSE.</figDesc><table><row><cell>Class</cell><cell cols="3">Cohen's Kappa n Occurrences Percentage</cell></row><row><cell>interior</cell><cell>0.933</cell><cell>412</cell><cell>4.5</cell></row><row><cell>urban</cell><cell>0.608</cell><cell>196</cell><cell>2.2</cell></row><row><cell>rural</cell><cell>0.775</cell><cell>315</cell><cell>3.5</cell></row><row><cell>natural</cell><cell>0.857</cell><cell>328</cell><cell>3.6</cell></row><row><cell>O</cell><cell>0.896</cell><cell>7811</cell><cell>86.2</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3</head><label>3</label><figDesc>F1 scores per class on the test dataset. After class O the second best score is for the class interior. The worst performance is for class rural.</figDesc><table><row><cell>Class</cell><cell cols="3">F1 score Precision Recall</cell></row><row><cell>interior</cell><cell>0.6079</cell><cell>0.4673</cell><cell>0.8696</cell></row><row><cell>urban</cell><cell>0.5333</cell><cell>0.4034</cell><cell>0.7869</cell></row><row><cell>rural</cell><cell>0.5573</cell><cell>0.4620</cell><cell>0.7025</cell></row><row><cell>natural</cell><cell>0.6468</cell><cell>0.5353</cell><cell>0.8125</cell></row><row><cell>O</cell><cell>0.9984</cell><cell>0.9996</cell><cell>0.9972</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_0">As is shown on their ofÏcial GitHub page: https://github.com/booknlp/booknlp?tab=readme-ov-file</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_1">These are: Der Wetterwart (1905) by Jacob Christoph Heer, Heimatscholle (1914) by Maria Goswina von Berlepsch, Berge und Menschen (1911) by Heinrich Federer, Heidis Lehr-und Wanderjahre (1880) by Johanna Spyri, Friedli, der Kolderi (1891) by Carl Spitteler, and Martin Salander (1886) by Gottfried Keller.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_2">The low score for 'urban' is explainable by the low number of occurrencesof NNSE of this type in the dataset.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_3">The results reported here should be understood as a report of an ongoing process, rather than as a final product. For example, a theoretically more sound model may understand 'interior' not as a category of its own, but allow</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<author>
			<persName><surname>Aimeta</surname></persName>
		</author>
		<ptr target="https://github.com/meta-llama/llama3/blob/main/MODEL%5C%5FCARD.md" />
		<title level="m">Llama 3 Model Card</title>
				<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Konzept und Klassifikation literarischer Raumentitäten</title>
		<author>
			<persName><forename type="first">F</forename><surname>Barth</surname></persName>
		</author>
		<idno type="DOI">10.18420/inf2020\_120</idno>
	</analytic>
	<monogr>
		<title level="j">Informatik</title>
		<imprint>
			<biblScope unit="volume">2020</biblScope>
			<biblScope unit="page" from="1281" to="1293" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Digital Literary Mapping: I. Visualizing and Reading Graph Topologies as Maps for Literature</title>
		<author>
			<persName><forename type="first">S</forename><surname>Bushell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">O</forename><surname>Butler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Hay</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Hutcheon</surname></persName>
		</author>
		<idno type="DOI">10.3138/cart-2021-0008</idno>
	</analytic>
	<monogr>
		<title level="j">Cartographica: The International Journal for Geographic Information and Geovisualization</title>
		<imprint>
			<biblScope unit="volume">57</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="11" to="36" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">German&apos;s Next Language Model</title>
		<author>
			<persName><forename type="first">B</forename><surname>Chan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Schweter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Möller</surname></persName>
		</author>
		<idno>arXiv: 2010.1 0906</idno>
		<ptr target="http://arxiv.org/abs/2010.10906.Pre-published" />
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Evaluating Named Entity Recognition Tools for Extracting Social Networks from Novels</title>
		<author>
			<persName><forename type="first">N</forename><surname>Dekker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Kuhn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Van Erp</surname></persName>
		</author>
		<idno type="DOI">10.7717/peerj-cs.189</idno>
	</analytic>
	<monogr>
		<title level="j">PeerJ Computer Science</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="page">e189</biblScope>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Hands-on Machine Learning with Scikit</title>
		<author>
			<persName><forename type="first">A</forename><surname>Géron</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems</title>
				<meeting><address><addrLine>Beijing Boston Farnham Sebastopol Tokyo</addrLine></address></meeting>
		<imprint>
			<publisher>O&apos;Reilly</publisher>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page">819</biblScope>
		</imprint>
	</monogr>
	<note>Covid-19 Collection</note>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Examining the Representation of Landscape and Its Emotional Value in German-Swiss Fiction between 1840 and 1940</title>
		<author>
			<persName><forename type="first">G</forename><surname>Grisot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Herrmann</surname></persName>
		</author>
		<idno type="DOI">10.22148/001c.84475</idno>
	</analytic>
	<monogr>
		<title level="j">Journal of Cultural Analytics</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="issue">1</biblScope>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Swiss German Novel Collection (ELTeC-gsw)</title>
		<author>
			<persName><forename type="first">G</forename><surname>Grisot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Herrmann</surname></persName>
		</author>
		<idno type="DOI">10.5281/zenodo.4584544</idno>
	</analytic>
	<monogr>
		<title level="j">Version</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="issue">0</biblScope>
			<date type="published" when="2021">2021</date>
			<publisher>Zenodo</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<author>
			<persName><forename type="first">W</forename><surname>Hallet</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Neumann</surname></persName>
		</author>
		<title level="m">Raum Und Bewegung in der Literatur: Die Literaturwissenschaften Und Der Spatial Turn</title>
				<meeting><address><addrLine>Bielefeld</addrLine></address></meeting>
		<imprint>
			<publisher>transcript Verlag</publisher>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Spatial Turn: On the Concept of Space in Cultural Geography and Literary Theory</title>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">W B</forename><surname>Hess-Lüttich</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Meta-Carto-Semiotics</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="27" to="37" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<title level="m" type="main">Speech and Language Processing</title>
		<author>
			<persName><forename type="first">D</forename><surname>Jurafsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">H</forename><surname>Martin</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Prodigy: A Modern and Scriptable Annotation Tool for Creating Training Data for Machine Learning Models</title>
		<author>
			<persName><forename type="first">I</forename><surname>Montani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Honnibal</surname></persName>
		</author>
		<ptr target="https://prodi.gy/" />
	</analytic>
	<monogr>
		<title level="j">Explosion</title>
		<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">PyTorch: An Imperative Style, High-Performance Deep Learning Library</title>
		<author>
			<persName><forename type="first">A</forename><surname>Paszke</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Gross</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Massa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Lerer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Bradbury</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Chanan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Killeen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Lin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Gimelshein</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Antiga</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Desmaison</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Kopf</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Devito</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Raison</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Tejani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Chilamkurthy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Steiner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Fang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Bai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Chintala</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Advances in Neural Information Processing Systems 32</title>
				<imprint>
			<publisher>Curran Associates, Inc</publisher>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="8024" to="8035" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Place and Space in Literature Named Entity Recognition as a Possibility for Spatial Modelling in Computational Literary Studies</title>
		<author>
			<persName><forename type="first">M</forename><surname>Schumacher</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Flüh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Nantke</surname></persName>
		</author>
		<idno type="DOI">10.14361/9783839469187-006</idno>
	</analytic>
	<monogr>
		<title level="m">Geographical Research in the Digital Humanities</title>
				<editor>
			<persName><forename type="first">F</forename><surname>Dammann</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">D</forename><surname>Kremer</surname></persName>
		</editor>
		<imprint>
			<publisher>transcript Verlag</publisher>
			<date type="published" when="2024">2024</date>
			<biblScope unit="page" from="83" to="112" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<title level="m" type="main">Orte und Räume in Romanen</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">K</forename><surname>Schumacher</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-662-66035-5\_1</idno>
		<imprint>
			<date type="published" when="2023">2023</date>
			<publisher>Springer</publisher>
			<biblScope unit="page">234</biblScope>
			<pubPlace>Berlin; Heidelberg; Berlin Heidelberg</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Text Mining and Subject Analysis for Fiction; or, Using Machine Learning and Information Extraction to Assign Subject Headings to Dime Novels</title>
		<author>
			<persName><forename type="first">M</forename><surname>Short</surname></persName>
		</author>
		<idno type="DOI">10.1080/01639374.2019.1653413</idno>
	</analytic>
	<monogr>
		<title level="j">Cataloging &amp; Classification Quarterly</title>
		<imprint>
			<biblScope unit="volume">57</biblScope>
			<biblScope unit="issue">5</biblScope>
			<biblScope unit="page" from="315" to="336" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<idno type="DOI">10.1057/9780230337930</idno>
		<title level="m">Geocritical Explorations: Space, Place, and Mapping in Literary and Cultural Studies</title>
				<editor>
			<persName><forename type="first">R</forename><forename type="middle">T</forename><surname>Tally</surname></persName>
		</editor>
		<meeting><address><addrLine>New York</addrLine></address></meeting>
		<imprint>
			<publisher>Palgrave Macmillan US</publisher>
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">The Space of the Novel</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">T</forename><surname>Tally</surname></persName>
		</author>
		<idno type="DOI">10.1017/9781316659694.011</idno>
	</analytic>
	<monogr>
		<title level="m">The Cambridge Companion to the Novel</title>
				<editor>
			<persName><forename type="first">E</forename><surname>Bulson</surname></persName>
		</editor>
		<imprint>
			<publisher>Cambridge University Press</publisher>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="152" to="167" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<title level="m" type="main">Topophrenia: Place, Narrative, and the Spatial Imagination</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">T</forename><surname>Tally</surname></persName>
		</author>
		<idno type="DOI">10.2307/j.ctv7r40df</idno>
		<idno>JSTOR: 10.2307/j.ctv7r40df</idno>
		<imprint>
			<publisher>Indiana University Press</publisher>
			<biblScope unit="page" from="2019" to="2020" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">His Coachman, and the Archbishop Walk into a Bar but Only One of Them Gets Recognized: On The DifÏculty of Detecting Characters in Literary Texts</title>
		<author>
			<persName><forename type="first">H</forename><surname>Vala</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Jurgens</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Piper</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Ruths</surname></persName>
		</author>
		<author>
			<persName><surname>Mr</surname></persName>
		</author>
		<author>
			<persName><surname>Bennet</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/D15-1088</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing</title>
				<meeting>the 2015 Conference on Empirical Methods in Natural Language Processing<address><addrLine>Lisbon, Portugal</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2015">2015</date>
			<biblScope unit="page" from="769" to="774" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Names in Novels: An Experiment in Computational Stylistics</title>
		<author>
			<persName><forename type="first">K</forename><surname>Van Dalen-Oskam</surname></persName>
		</author>
		<idno type="DOI">10.1093/llc/fqs007</idno>
	</analytic>
	<monogr>
		<title level="j">Literary and Linguistic Computing</title>
		<imprint>
			<biblScope unit="volume">28</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="359" to="370" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Named Entity Recognition and Resolution for Literary Studies</title>
		<author>
			<persName><forename type="first">K</forename><surname>Van Dalen-Oskam</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Marx</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Sijaranamual</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Depuydt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Verheij</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Geirnaert</surname></persName>
		</author>
		<ptr target="http://www.clinjournal.org/node/62" />
	</analytic>
	<monogr>
		<title level="j">Computational Linguistics in the Netherlands Journal</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="page" from="121" to="136" />
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Zur Erkennung von Raum in Narrativen Texten: Spatial Frames Und Raumsemantik Als Modelle Für Eine Digitale Narratologie Des Raums</title>
		<author>
			<persName><forename type="first">G</forename><surname>Viehhauser</surname></persName>
		</author>
		<idno type="DOI">10.1515/9783110693973-015</idno>
	</analytic>
	<monogr>
		<title level="m">Reflektierte Algorithmische Textanalyse</title>
				<editor>
			<persName><forename type="first">N</forename><surname>Reiter</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Pichler</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Kuhn</surname></persName>
		</editor>
		<imprint>
			<publisher>De Gruyter</publisher>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="373" to="388" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Towards a Digital Narratology of Space</title>
		<author>
			<persName><forename type="first">G</forename><surname>Viehhauser-Mery</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Barth</surname></persName>
		</author>
		<ptr target="https://dh2017.adho.org/abstracts/413/413.pdf" />
	</analytic>
	<monogr>
		<title level="m">Book of Abstracts DH2017. Dh2017</title>
				<meeting><address><addrLine>Montreal</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">HuggingFace&apos;s Transformers: Stateof-the-art Natural Language Processing</title>
		<author>
			<persName><forename type="first">T</forename><surname>Wolf</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Debut</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Sanh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Chaumond</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Delangue</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Moi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Cistac</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Rault</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Louf</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Funtowicz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Davison</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Shleifer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Von Platen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Ma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Jernite</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Plu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Xu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">L</forename><surname>Scao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Gugger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Drame</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Lhoest</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">M</forename><surname>Rush</surname></persName>
		</author>
		<idno type="DOI">10.48550/arxiv.1910.03771</idno>
	</analytic>
	<monogr>
		<title level="j">Version</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
	<note>Pre-published</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
