<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Literary Time Travel: Distinguishing Past and Contemporary Worlds in Danish and Norwegian Fiction</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Jens</forename><surname>Bjerring-Hansen</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Department of Nordic Studies and Linguistics</orgName>
								<orgName type="institution">University of Copenhagen</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Ali</forename><surname>Al-Laith</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Department of Nordic Studies and Linguistics</orgName>
								<orgName type="institution">University of Copenhagen</orgName>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="department">Department of Computer Science</orgName>
								<orgName type="institution">University of Copenhagen</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Daniel</forename><surname>Hershcovich</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Department of Computer Science</orgName>
								<orgName type="institution">University of Copenhagen</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Alexander</forename><surname>Conroy</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Department of Nordic Studies and Linguistics</orgName>
								<orgName type="institution">University of Copenhagen</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Sebastian</forename><forename type="middle">Ørtoft</forename><surname>Rasmussen</surname></persName>
							<affiliation key="aff2">
								<orgName type="department">Department of Comparative Literature and Rhetoric</orgName>
								<orgName type="institution">Aarhus University</orgName>
							</affiliation>
						</author>
						<title level="a" type="main">Literary Time Travel: Distinguishing Past and Contemporary Worlds in Danish and Norwegian Fiction</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">BADF9D799868CA2C9972FFB9C883E550</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T19:49+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Historical Text</term>
					<term>Text Classification</term>
					<term>Danish</term>
					<term>Norwegian</term>
					<term>Literature</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The classification of historical and contemporary novels is a nuanced task that has traditionally relied on expert literary analysis. This paper introduces a novel dataset comprising Danish and Norwegian novels from the last 30 years of the 19 th century, annotated by literary scholars to distinguish between historical and contemporary works. While this manual classification is time-consuming and subjective, our approach leverages pre-trained language models to streamline and potentially standardize this process. We evaluate their effectiveness in automating this classification by examining their performance on titles and the first few sentences of each novel. After fine-tuning, the models show good performance but fail to fully capture the nuanced understanding exhibited by literary scholars. This research underscores the potential and limitations of NLP in literary genre classification and suggests avenues for further improvement, such as incorporating more sophisticated model architectures or hybrid methods that blend machine learning with expert knowledge. Our findings contribute to the broader field of computational humanities by highlighting the challenges and opportunities in automating literary analysis.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Some novels are set in the past, offering readers a glimpse into historical periods, while others reflect the time in which they were written, dealing with contemporary issues. For example, the novel Lolotte. En Roman fra den Gustavianske Tid (Lolotte. A Novel from the Gustavian Period, 1898) by Marie Henckel clearly signals its historical nature through its subtitle, which specifies a late 18th-century setting. In contrast, Albert Gnudtzman's Ridder Thorvald. En lille københavnsk Roman (Knight Thorvald. A small Copenhagen novel, 1899) initially misleads with the historical-sounding keyword "knight" in the title, but the opening scene set in a modern urban café distinctly establishes it as a contemporary novel.</p><p>The question of whether a novel is set in modern days or historical times (contemporary or historical novel?) was by no means uncontroversial in the time of the so-called Modern Breakthrough in Scandinavian literature circa 1870-1900 <ref type="bibr" target="#b5">[6]</ref>. On the contrary, it was a question of taste (good vis-à-vis bad) and, accordingly, a detection and quantification of the historical novel give insight into the cultural divides of the period. Modern realist aesthetics ostracized the historical novel and insisted that literature should be situated in the present and address current problems. In 1871, famously and characteristically, the influential Danish critic Georg Brandes, referring to Scott's Waverley novels from the early 1800s, rejected the historical novel as "an unfortunate and now abandoned genre, imported from Scotland and invented by a pureblooded Tory, which originated in a state of mind similar to ours, one with all its ideals in the past" <ref type="bibr" target="#b6">[7]</ref>. And at least for a while, the historical novel was aesthetically and socially demoted to the realm of popular literature, which no one, except for readers and consumers, cared about. In the 20 th century advanced definitions of the historical novel and its complex relationship to its political contexts and to the development of the genre towards realism and modernism have been major points of discussion in the historiography of the novel. In our paper, we approach the question more directly by tentatively putting ourselves in the place of the historical actors, professional tastemakers like Brandes or conventional consumers in the literary marketplace, and emphasizing some immediate, easily decodable genre signals from paratext (titles and subtitles) and text (the opening of the novels).</p><p>We introduce a dataset of Danish and Norwegian novels from the last 30 years of the 1800s, annotated by literary scholars according to whether they are historical or contemporary. The novels are taken from the MeMo (Measuring Modernity) corpus <ref type="bibr" target="#b4">[5]</ref>, comprising 859 novels. We assess the ability of language models to generalize this generic distinction as expressed in their titles and first few sentences, by training them on a portion of the dataset and evaluating them on unseen novels. While fine-tuned Danish language models show good performance in the task, error analysis reveals they still lack sensitivity to salient cues that literary scholars observe. <ref type="foot" target="#foot_0">1</ref></p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Related Work</head><p>Text classification is a pivotal task in natural language processing (NLP) that entails categorizing text into predefined labels or classes. It has a broad spectrum of applications, including sentiment analysis <ref type="bibr" target="#b0">[1]</ref>, word sense disambiguation <ref type="bibr" target="#b21">[22]</ref>, named entity recognition <ref type="bibr" target="#b11">[12,</ref><ref type="bibr" target="#b3">4,</ref><ref type="bibr" target="#b18">19]</ref>, and genre classification <ref type="bibr" target="#b31">[32,</ref><ref type="bibr" target="#b20">21]</ref>. With the advent of pre-trained language models like BERT <ref type="bibr" target="#b9">[10]</ref>, GPT <ref type="bibr" target="#b34">[35]</ref>, and their variants, significant advancements have been achieved in this domain. These models leverage extensive text corpora to enhance the understanding of context and semantics <ref type="bibr" target="#b26">[27]</ref>, establishing new standards in accuracy and robustness. Consequently, they enable more nuanced and sophisticated text classification systems capable of handling diverse and complex textual data. Current research continues to investigate enhancements in model architectures, fine-tuning techniques, and domain-specific adaptations to further boost the performance of text classification tasks.</p><p>When dealing with literature, in academic as well as everyday contexts, taxonomic thinking and practices seem both habitual and inevitable. Literary genre studies got underway with the Ancient Greeks, from which the division of poetic literature in three main genres: lyric, epic and drama, often ascribed to Aristotle and his Poetics, has proliferated <ref type="bibr" target="#b16">[17]</ref>. Since then an enormous and ever-growing body of genre theory has developed <ref type="bibr" target="#b13">[14]</ref>. Of special interest to us is:</p><p>1. scholarship on the historical novel of the 19 th century which often serves as the predecessor and/or antidote to the modern realist (and contemporary set) novel <ref type="bibr" target="#b24">[25,</ref><ref type="bibr" target="#b12">13,</ref><ref type="bibr" target="#b39">40,</ref><ref type="bibr" target="#b46">47]</ref>, 2. historical studies, influenced by the sociology of literature and the history of the book, concerned with genre fiction and its aesthetic and commercial development in the 19 th century <ref type="bibr" target="#b35">[36,</ref><ref type="bibr" target="#b15">16]</ref>, and 3. (non-digital) quantitative approaches to the history of the novel <ref type="bibr" target="#b27">[28,</ref><ref type="bibr" target="#b32">33,</ref><ref type="bibr" target="#b29">30,</ref><ref type="bibr" target="#b14">15]</ref>. Within computational literary studies of recent years, genre has been an important touch point for NLP approaches and literary theory and historiography. Text genre classification is a crucial area of research that aids in systematically categorizing vast and diverse collections of literary works. This task involves distinguishing between various genres such as fiction and non-fiction <ref type="bibr" target="#b45">[46,</ref><ref type="bibr" target="#b44">45,</ref><ref type="bibr" target="#b33">34]</ref>, poetry <ref type="bibr" target="#b36">[37]</ref>, and drama <ref type="bibr" target="#b37">[38]</ref>, among others, within literary corpora. Also, significant efforts have been made to classify novels in various sub-genres, predominantly with a focus on volume-level similarity across a range of features that capture significant generic aspects <ref type="bibr" target="#b45">[46,</ref><ref type="bibr" target="#b7">8,</ref><ref type="bibr" target="#b43">44]</ref>. The advancements facilitated by NLP techniques and machine learning, including predictive modeling, are substantial, resulting in more accurate and automated genre classification while also embracing notions from contemporary literary scholarship that a literary genre comprises many features rather than a single defining characteristic <ref type="bibr" target="#b38">[39,</ref><ref type="bibr" target="#b43">44,</ref><ref type="bibr" target="#b22">23]</ref>. As Ted Underwood has argued, "[t]he best way to measure the differentiation between literary genres is probably to train supervised predictive models that attempt to distinguish works in one genre from other works in a given period or cultural milieu" <ref type="bibr" target="#b38">[39]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Historical vs. Contemporary Novels</head><p>There is a long and intensive research tradition that has been interested in the political and social-historical implications of the historical novel of the 19 th century and the decline of the genre in the latter part of the century as a reflex of a new aesthetic positions with a primacy of immediate perception and contemporaneity <ref type="bibr" target="#b24">[25,</ref><ref type="bibr" target="#b1">2,</ref><ref type="bibr" target="#b28">29]</ref>. In this context, complex definitions have been drafted on the basis of intensive close readings of particularly British, French and Russian novels. Lukacs' five principal claims about the genre is a pioneering example of this <ref type="bibr" target="#b24">[25]</ref>. However, in practice, the question of categorization poses fewer problems for both literary scholars and customers at bookstores. If you consider a common definition of the genre, such as this one from a literary reference work, it will correspond to most readers' common and more or less reflected perception of the genre:</p><p>A novel in which the action takes place during a specific historical period well before the time of writing (often one or two generations before, sometimes several centuries), and in which some attempt is made to depict accurately the customs and mentality of the period. <ref type="foot" target="#foot_1">2</ref>In this paper, we construct a dataset using the genre classification of the novels of the MeMo corpus performed by Bjerring-Hansen and Ørtoft <ref type="bibr" target="#b5">[6]</ref>, which has followed such pragmatic and intuitive understanding of the genre as something that can be decoded immediately by a quick inspection of the temporal coordinates of the individual texts. To carry out an analogous quantification of genre trends and proportions between historical and contemporary novels, the authors performed close readings of both (certain) paratexts and (particular) parts of each novel. More specifically, the annotation was carried out on the following premises:</p><p>1. Many historical novels "reveal" themselves already in the title (as is the case with Dronning Caroline Mathilde af Danmark = Queen Mathilde of Denmark by the pseudonym Caja from 1889) or in the subtitle (as is the case with the anonymously published Caroline. Bøhmens frygtelige Svøbe eller et Gammel Bjergslots Hemmelighed: Historisk-romantisk Fortaelling = Caroline. Böhmen's terrible scourge or the secret of an old mountain castle; Historical-romantic tale), or more redundantly both the one and the other (cf. H.F. Ewald's Griffenfeld. Historisk Roman = Griffenfeld. Historical Novel from 1888).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">If the titles do not contain clear paratextual signals, genre afÏliation is often indicated</head><p>on the first few pages of the novel (as, for example, in the case of Indianerpigen fra Cape Breton = The Indian Girl from Cape Breton by "L.M", which, although the title page does not indicate that we are dealing with a historical novel, immediately sets the temporal scene with the opening sentence: "It was in the year 1780 [...]").</p><p>So, generally, it is striking to what extent the historical novels of the 19 th century clearly and actively give away their generic afÏliation. As several literary scholars have pointed out, this is probably because the historical novel's foremost characteristic-and selling point-is its historical setting, which, then, producers and distributors clearly wants to mark for the intended readership and therefore already on the title page or the first pages "come clean" <ref type="bibr" target="#b46">[47,</ref><ref type="bibr" target="#b41">42]</ref>. It can be added that these guidelines only to a very limited extent can be reversed on the basis of a similar "scanning" of the paratextual and textual evidence. Non-historical, i.e. contemporary novels-the novels which aesthetics and the criticism in the late 1800s placed a decisive and favourable emphasis on-do not communicate their temporality in a similar way. The MeMo corpus entails a few handfuls of instances of emphatically contemporary subtitles (e.g. "Nutidsfortaelling" = story from the present day, "Samtidsroman" = contemporary novel etc.), but in general the contemporary novels are implying their genericity through silence on their temporal setting.</p><p>This transparent literary communication, or consumer information, which is of course less obvious in the few and often canonized instances of experimental novels that "play" with genre fiction such as the historical novel, can be said to be a general feature of popular literature, including also romances and detective stories etc., and the genre-fiction system, developing in the latter part of the 19 th century <ref type="bibr" target="#b27">[28,</ref><ref type="bibr" target="#b15">16]</ref>. The question is whether machines can learn to read these literary and cultural signals, apparent in the paratext and/or the opening pages of the novel, which for historical actors have seemed quite obvious? (Non-)Historical novel -yes or no? In other words, the genre distinctions that our method rely on are historically framed, meaning they are tied to specific periods and cultural contexts rather than having universal relevance across time and place.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Methodology</head><p>To address this question, we treat the problem from a machine learning perspective. We introduce an annotated corpus and fine-tune pre-trained transformer language models on it, evaluating their performance on a held-out test set.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">Dataset</head><p>We rely on the MeMo corpus <ref type="bibr" target="#b4">[5]</ref>, comprising 859 Danish and Norwegian novels spanning the last 30 years of the 19 th century, with more than 64 million tokens. The corpus is a rich and diverse collection of texts that provides valuable insights into the classification of novels as historical and contemporary during the period under investigation. Table <ref type="table" target="#tab_0">1</ref> shows statistical information about the corpus. We obtain the annotated dataset of novels from Bjerring-Hansen and Ørtoft <ref type="bibr" target="#b5">[6]</ref>. The final list of annotated novels consists of 859 novels, with 78% categorized as contemporary and 22% as historical. Figure <ref type="figure" target="#fig_0">1</ref> illustrates the temporal distribution of historical and contemporary novels in our corpus.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">Novel Classification</head><p>We use the dataset for training and evaluating transformer-based language models. Specifically, inspired by the observations made by Bjerring-Hansen and Ørtoft, we consider three settings:</p><p>1. Providing the title and sub-title of the novel as input to the model, 2. Providing the first 15 sentences of the novel as input to the model, 3. Concatenating the title, sub-title and first 15 sentences and providing them to the model. In all cases, we train the model to classify the novel according to the binary label obtained from the annotated dataset. Subsequently, we evaluate the models on a test set of held-out novels to assess their ability to generalize the ability to identify the cues learned during training. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Experiments and Results</head><p>We experiment with four pre-trained language models and three types of provided context, comparing their performance and using them as the basis for an elaborate analysis of errors and indicative features.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.">Pre-trained Language Models</head><p>The models evaluated in our novel classification experiments had been pre-trained on text corpora including Danish and Norwegian text. We train them on the task using supervised finetuning. Importantly, all models are selected based on their performance evaluated on Danish and Norwegian literary benchmark datasets <ref type="bibr" target="#b21">[22]</ref>, the Scandinavian Embedding Benchmark<ref type="foot" target="#foot_2">3</ref> and ScandEval,<ref type="foot" target="#foot_3">4</ref>  <ref type="bibr" target="#b30">[31]</ref> even though these models had not been trained primarily on historical Danish or Norwegian. We additionally experiment with a model (MeMo-BERT-03) specifically adapted for the MeMo corpus.</p><p>DanskBERT. DanskBERT,<ref type="foot" target="#foot_4">5</ref> a top-performing Danish language model noted for its success on the ScandEval benchmark <ref type="bibr" target="#b40">[41]</ref>, is based on the XLM-RoBERTa architecture and trained on the Danish Gigaword Corpus <ref type="bibr" target="#b42">[43]</ref>. It features 24 layers, a hidden dimension of 1024, 16 attention heads, and a subword vocabulary of 250,000. The model was trained with a batch size of 2,000 for 500,000 steps on 16 V100 GPUs over two weeks.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Danish Foundation Models sentence encoder.</head><p>A sentence-transformers model <ref type="bibr" target="#b10">[11]</ref> based on the BERT architecture, featuring 24 layers, 16 attention heads, and a hidden size of 1024. It incorporates a dropout rate of 0.1 for attention probabilities and hidden states, using GELU activation and supporting up to 512 position embeddings. With a vocabulary size of 50,000 tokens, this model, referred to as DFM (Large), excels in tasks such as Danish sentiment analysis and named entity recognition. <ref type="foot" target="#foot_5">6</ref>MeMo-BERT-03. Developed by continuing the pre-training of the pre-trained Transformer language model DanskBERT <ref type="bibr" target="#b21">[22]</ref>. <ref type="foot" target="#foot_6">7</ref> This foundation allows MeMo-BERT-3 to leverage extensive linguistic knowledge for NLP tasks in historical literary Danish including sentiment analysis and word sense disambiguation. The model outperformed different models in sentiment analysis and word sense disambiguation tasks <ref type="bibr" target="#b21">[22]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>NB-BERT-base.</head><p>A general-purpose BERT-base model was developed using the extensive digital collection at the National Library of Norway <ref type="bibr" target="#b19">[20]</ref>. <ref type="foot" target="#foot_7">8</ref> It follows the architecture of the BERT Cased multilingual model and has been trained on a diverse range of Norwegian texts, encompassing both Bokmål and Nynorsk from the past 200 years. This comprehensive training allows the NB-BERT-base to effectively handle a wide array of NLP tasks in Norwegian. The model achieved the second-highest performance ranking in the Norwegian Named Entity Recognition task compared to other models listed on the ScandEval benchmark for Norwegian natural language understanding.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2.">Experimental Setup</head><p>Our experiments involve fine-tuning the pre-trained language models on the annotated novels from our corpus. To enable testing of generalization in the face of temporal shift <ref type="bibr" target="#b25">[26]</ref>, the last 130 novels according to publication year (≈15%) are used as a testing set, while the remaining novels were randomly divided into training and validation with 70% and 15% respectively. The experiments involve fine-tuning the models on the dataset using a batch size of 32, training for 20 epochs with the AdamW optimizer <ref type="bibr" target="#b23">[24]</ref> at a learning rate of 10 −3 . During training, we monitor the performance on the validation set to assess model convergence and to prevent overfitting, keeping the checkpoint with the best validation score. For evaluation, we employ the F1-score metric due to its ability to balance precision and recall, particularly effective for tasks with imbalanced datasets. The performance of each model is evaluated on both validation and test sets, ensuring the robustness and generalizability of the models across different datasets and epochs. For comparison, due to the imbalanced nature of the dataset, with 22% of novels being historical overall and the percentage being 17% in the test set, a naive baseline that selects a label based on the training distribution would achieve about 70% weighted F1-score. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.3.">Novels Classification Experiments</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.3.1.">Titles and Sub-titles Classification</head><p>In this experiment, we concatenate the title and subtitle of each novel and perform classification by fine-tuning the aforementioned pre-trained language models with the novel labels, using the cross-entropy objective. Table <ref type="table" target="#tab_1">2</ref> (left) presents the fine-tuning results of the selected models. DFM (Large) achieved the highest performance on the validation set with an F1-score of 92%, while the MeMo-BERT-03 model excelled on the testing set with an F1-score of 91%.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.3.2.">First 15 Sentences Classification</head><p>We use the Danish pipeline in spaCy <ref type="bibr" target="#b17">[18]</ref> for sentence segmentation and extract the first 15 sentences from each novel. We then use each sentence as a separate input instance for finetuning the aforementioned pre-trained language models, with the same novel-level labels as previously now inducing sentence-level labels. To predict novel-level labels using the finetuned models, we apply them to the first 15 sentences of a (validation or testing) novel, and use majority voting to determine the novel-level predictions.</p><p>The results of fine-tuning the models is shown in Table <ref type="table" target="#tab_1">2</ref> (middle). DFM (Large) and MeMo-BERT-03 achieved the highest performance on the validation set with an F1-score of 81%, while DFM (Large) excelled on the testing set with an F1-score of 86%. Notably, for all models, using the first 15 sentences as input performs worse than using the title and sub-title.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.3.3.">Both Titles &amp; Sub-titles and First 15 Sentences Classification</head><p>In this experiment, we combine both the titles &amp; sub-titles and the first 15 sentences of each novel in the corpus: technically, we repeat the same setup as using the first 15 sentences, but additionally prepend the concatenated title and sub-title as if they were another sentence. The fine-tuning results of the four models of this experiments are shown in Table <ref type="table" target="#tab_1">2</ref> (right). DFM (Large), MeMo-BERT-03 and NB-BERT-Base achieve equal performance on the validation set with an F1-score of 82%, while DanskBERT and NB-BERT-Base perform best on the testing set with an F1-score of 84%. Overall, performance in this setting is similar to just using the first 15 sentences, but the best performance on the test set is in fact obtained when just using titles and sub-titles, and ignoring the first 15 sentences. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Discussion</head><p>While all models are highly accurate after fine-tuning, surprisingly, we observe that the best predictions are obtained by just using the title and sub-title as input, disregarding the first 15 sentences of the novel. This suggests either that the genre information is less salient in the first 15 sentences, or that the models are not as capable of extracting it from them. To analyze this further, we investigate model confidence on mislabeled predictions, and perform a fine-grained error analysis.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.1.">Model Calibration</head><p>When reading the text opening, introspection from expert annotation reveals that genre identification often hinges on specific key sentences (e.g., mentioning specific entities, events, or years) rather than the entire opening passage. While most sentences in the opening text do not clearly suggest one genre or another, these "giveaways" are sparse but salient. This nuance may be lost by the majority voting procedure over sentences, leading to misclassifications when the first 15 sentences are used as input. Therefore, we are interested in the model's confidence and whether it is calibrated to match the experts' uncertainty or disagreement about the labels <ref type="bibr" target="#b2">[3]</ref>.</p><p>To evaluate model calibration, we use Expected Calibration Error (ECE), a metric that measures how well the model's predicted probabilities reflect the true accuracy <ref type="bibr" target="#b8">[9]</ref>:</p><formula xml:id="formula_0">𝐸𝐶𝐸 = 𝐾 ∑ 𝑘=1 |𝐵 𝑘 | 𝑛 |𝑎𝑐𝑐(𝐵 𝑘 ) − 𝑐𝑜𝑛𝑓 (𝐵 𝑘 )|</formula><p>where 𝐾 = 10 is the number of bins (confidence intervals), |𝐵 𝑘 | is the number of samples in bin 𝑘, 𝑎𝑐𝑐(𝐵 𝑘 ) is the accuracy in bin 𝑘, and 𝑐𝑜𝑛𝑓 (𝐵 𝑘 ) is the average confidence in bin 𝑘.</p><p>When using titles and sub-titles as input, the best-performing model (MeMo-BERT-03) achieved a relatively low ECE of 0.062, as shown in Table <ref type="table" target="#tab_2">3</ref>, indicating that it was reasonably well-calibrated when relying on paratextual information. Titles and sub-titles often contain clear genre markers that allow the model to make high-confidence, mostly accurate predictions. However, the model still made misclassifications, particularly when historical-sounding titles misled the model. Despite this, the model's overall confidence generally matched its performance in this setting, and it exhibited the lowest calibration error compared to other settings. This result underscores the strength of using paratextual clues, though it also reveals that misleading terms in the title can cause overconfidence in wrong predictions.</p><p>In contrast, when the models used the first 15 sentences of the novels as input, calibration worsened across all models. For example, the ECE for DFM (Large) increased to 0.085, and other models similarly struggled with higher calibration errors (see Table <ref type="table" target="#tab_2">3</ref>). This is likely because, as noted in expert analyses, genre signals are not uniformly distributed across the opening sentences. Instead, they tend to appear in specific key sentences that reveal important genre-relevant details. In many cases, the first 15 sentences are ambiguous, lacking explicit time markers or character descriptions, which increases the model's uncertainty. However, rather than reflecting this uncertainty in their confidence scores, the models often exhibited overconfidence, resulting in higher calibration errors. This overconfidence indicates that the models are not adequately capturing the uncertainty present in the text openings, a discrepancy that reflects the challenge of extracting nuanced genre information from longer inputs.</p><p>Combining both titles, sub-titles, and the first 15 sentences of the novels did not uniformly improve calibration. For MeMo-BERT-03, the ECE increased to 0.094, suggesting that integrating both sources of information did not lead to better confidence alignment. Although the models had access to more context, they struggled to effectively weigh the paratext against the more ambiguous textual cues from the opening sentences. In some cases, conflicting signals between the title and the text may have caused the models to oscillate between genres, ultimately leading to poorer calibration. DanskBERT, however, exhibited slightly better calibration in this setting, indicating that it was more adept at integrating the two types of input compared to other models. This slight improvement over single-input settings suggests that certain models can benefit from additional context, though the integration process remains challenging for most.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.2.">Error Analysis</head><p>We discuss the errors encountered during the classification of historical and contemporary novels, focusing on prediction errors made by the best-performing models, as well as annotation errors identified by expert inspection of the mislabeled predictions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.2.1.">Prediction Errors</head><p>A notable pattern in the prediction errors is the tendency of the models to misclassify contemporary novels as historical based on titles (in 12 out of 17 cases where at least one model misclassified the label in this setting) and based on text openings (first 15 sentences) or the combination of titles and text openings (11 out of 11 cases). An illustrative example of the first type of error is Albert Gnudtzman's urban novel Ridder Thorvald. En lille københavnsk Roman (Knight Thorvald. A small Copenhagen novel, 1899). The title's keyword "knight" leads the models to misinterpret it as a historical romance. However, the opening scene set in a lively urban café correctly classifies it as a contemporary novel, highlighting the discrepancy between title-based and content-based classification.</p><p>When models misinterpret text openings (first 15 sentences), a common issue is their failure to recognize historical settings established through character introductions rather than explicit time clues. For instance, in Marie Henckel's Lolotte. En Roman fra den Gustavianske Tid (Lolotte. A Novel from the Gustavian Period, 1898), while the subtitle clearly indicates a late 18th-century setting, the models fail to date the characters like Prince Gustaf and Sofie Magdalene, leading to incorrect classifications.</p><p>Machine readings of genre clues also shed light on borderline cases, such as novels set in the near past relative to the modern breakthrough period (1870-99). Examples include novels set during the Danish-German wars of 1848-50 and 1864, like Chr. Christensen's Kaerlighedens Mysterier. En Historie fra 1848-50 (The Mysteries of Love. A Tale from 1848-50, 1899) and P.A. Worm's Forbrydelsernes Konge eller Den skalperede Praest (The King of Crime and the Scalped Priest, 1899). An intriguing case involves a novel beginning in the narrator's present with modern elements like electric light and a telephone but transitioning to a historical analepsis: U. Ravn's Interioerer fra vores Bedsteforaeldres Tid (Interiors from the Time of our Grandparents, 1899).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.2.2.">Annotation Errors</head><p>After in-depth expert analysis, eight of the models' "errors" in the test set turned out to be Bjerring-Hansen and Ørtoft's annotation errors, including four plain mistakes and four tricky in-between novels where further inspection validated the models' predictions. These erroneous annotations were evenly distributed between misclassified historical and contemporary novels.</p><p>An interesting case is the novel Hvorfor hun blev Nonne. En Fortaelling om fransk Klosterliv (Why she became a nun. A story about French monastic life, 1899) by the pseudonym "Herdis". Both models and annotators were misled by the title into thinking it was a medieval story. However, a close reading of the text's opening pages revealed it to be set in modern times, aligning with the neo-romantic current of the 1890s that revived the historical novel and Catholic themes.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.">Conclusion</head><p>In this study, we presented a dataset of Danish and Norwegian novels from the late 19 th century, classified as historical or contemporary by literary scholars. We investigated the performance of several pre-trained language models in distinguishing between these two genres based on titles and the first few sentences. While the models demonstrated commendable accuracy, the error analysis revealed limitations in capturing the nuanced cues recognized by human experts. These findings underscore the complexity of literary text classification and suggest that while NLP models can significantly aid in the categorization process, they still require further refinement to match human interpretative abilities fully. In our approach, we chose to limit the textual input to only the titles and the first 15 sentences of each novel. This decision was informed by the pretraining of the models, which predominantly focused on non-literary and contemporary content, meaning that historical figures, settings, and subtleties were likely underrepresented in the training data. As a result, we hypothesized that the opening framing of the novels, which often serves to establish the genre, would be more effective for detection than deeper content. This was evident in our findings, where the titles and subtitles emerged as the most straightforward indicators of the genre distinction. While expanding the analysis to include more content from the novels could potentially capture stronger genre signals, the results indicated that the models performed best on the titles and subtitles; in fact, the first 15 sentences did not surpass the effectiveness of the titles alone. This suggests that the model, like the historical readers, effectively identifies key genre signals from surface-level clues, such as titles and subtitles. While our method is aligned with historical signals by focusing on initial clues that reflect the genre distinctions recognized during the period, it also highlights the need for deeper content analysis, which would require models pretrained on a substantial amount of historical material extending well before the end of the 19 th century. To address this, future work could explore more sophisticated model architectures or hybrid approaches that combine machine learning with expert knowledge to enhance the accuracy and depth of genre classification in literary studies.</p><p>Our study shows that the historical novel was by no means an extinct genre in Danish-Norwegian literature at the end of the 19 th century, as the prevailing modern aesthetic would have it, and furthermore that this was a rather obvious fact, since genre decoding-historical novel -yes or no? -is a relatively trivial affair. It can be determined with great certainty, both by people and models, by reading the paratext and/or the first lines of the novels. Of course, this generic stability cannot be taken for granted or universalized if the fine-tuned models from our study are applied to other textual sources, such as 20 th century novels, where genre innovations are increasing and where the historical novel is also exposed to modernist experiments (an early Danish example of this is Nobel Prize winner Johannes V. Jensen's novel Kongens Fald (The fall of the king, 1900-01), which represents both a historical depiction of the dramatic events leading to the fall of the Kalmar Union and a modern flâneur novel. In future literary studies, we will be able to test the stability of the genre distinctions created during the 19 th century, when the literary field and the formation of taste were established, but for accuracy in prediction, we will most likely have to adjust our methods to consider the content of novels on a broader scale.</p><p>The comparison between qualitative annotations and machine predictions enhanced our understanding of the quantitative arguments applicable to the period's literature. It highlighted the coexistence and interaction of old and new forms and meanings within an aesthetic timespan. This approach aligns with a broader perspective on genre classification that leverages predictive modeling. As Ted Underwood suggests, our objective shifts from defining a genre to developing a model that can replicate the judgments made by specific historical observers <ref type="bibr" target="#b43">[44]</ref>. This paradigm not only advances our technical capabilities but also deepens our literary and historical understanding, bridging the gap between computational methods and humanistic inquiry.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Distribution of Historical and Contemporary Novels in the MeMo Corpus Over Time.</figDesc><graphic coords="6,89.28,84.17,416.72,248.71" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>MeMo corpus statistics.</figDesc><table><row><cell>Total novels</cell><cell>859</cell></row><row><cell>Total sentences</cell><cell>3,282,643</cell></row><row><cell>Total words</cell><cell>53,588,381</cell></row><row><cell cols="2">Average sentences per novel 3,821</cell></row><row><cell>Average words per novel</cell><cell>62,385</cell></row><row><cell cols="2">Average words per sentence 16.3</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2</head><label>2</label><figDesc>Fine-tuning Classification Results: F1-score for the four pre-trained Transformer language models in three input settings on both validation and tests sets.</figDesc><table><row><cell></cell><cell cols="4">Titles &amp; Sub-titles First 15 Sentences</cell><cell cols="2">Both</cell></row><row><cell>Model</cell><cell>Valid.</cell><cell>Test</cell><cell>Valid.</cell><cell>Test</cell><cell cols="2">Valid. Test</cell></row><row><cell>DanskBERT DFM (Large) MeMo-BERT-03 NB-BERT-base</cell><cell>0.91 0.92 0.89 0.91</cell><cell>0.88 0.88 0.91 0.89</cell><cell>0.80 0.81 0.81 0.79</cell><cell>0.82 0.86 0.85 0.83</cell><cell>0.81 0.82 0.82 0.82</cell><cell>0.84 0.83 0.83 0.84</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3</head><label>3</label><figDesc>Expected Calibration Error (ECE) for the models on the test set.</figDesc><table><row><cell>Model</cell><cell cols="3">Titles &amp; Sub-titles First 15 Sentences Both</cell></row><row><cell>DFM (Large)</cell><cell>0.040</cell><cell>0.085</cell><cell>0.094</cell></row><row><cell>DanskBERT MeMo-BERT-03 NB-BERT-base</cell><cell>0.043 0.062 0.028</cell><cell>0.087 0.076 0.156</cell><cell>0.081 0.098 0.070</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">Our dataset, code and models will be made publicly available upon publication.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">https://www.oxfordreference.com/display/10.1093/oi/authority.20111104173823536</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">https://kennethenevoldsen.github.io/scandinavian-embedding-benchmark/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_3">https://scandeval.com/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_4">https://huggingface.co/vesteinn/DanskBERT</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_5">https://huggingface.co/KennethEnevoldsen/dfm-sentence-encoder-large-exp2-no-lang-align</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="7" xml:id="foot_6">https://huggingface.co/MiMe-MeMo/MeMo-BERT-03</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="8" xml:id="foot_7">https://huggingface.co/NbAiLab/nb-bert-base</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Sentiment Classification of Historical Danish and Norwegian Literary Texts</title>
		<author>
			<persName><forename type="first">A</forename><surname>Allaith</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Degn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Conroy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Pedersen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Bjerring-Hansen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Hershcovich</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/2023.nodalida-1.34" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)</title>
				<editor>
			<persName><forename type="first">T</forename><surname>Alumäe</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Fishel</surname></persName>
		</editor>
		<meeting>the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)<address><addrLine>Tórshavn, Faroe Islands</addrLine></address></meeting>
		<imprint>
			<publisher>University of Tartu Library</publisher>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="324" to="334" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">From progress to catastrophe</title>
		<author>
			<persName><forename type="first">P</forename><surname>Anderson</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">London Review of Books</title>
		<imprint>
			<biblScope unit="volume">33</biblScope>
			<biblScope unit="issue">15</biblScope>
			<biblScope unit="page" from="24" to="28" />
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Stop Measuring Calibration When Humans Disagree</title>
		<author>
			<persName><forename type="first">J</forename><surname>Baan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Aziz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Plank</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Fernandez</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2022.emnlp-main.124</idno>
		<ptr target="https://aclanthology.org/2022.emnlp-main.124" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing</title>
				<editor>
			<persName><forename type="first">Y</forename><surname>Goldberg</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Z</forename><surname>Kozareva</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Y</forename><surname>Zhang</surname></persName>
		</editor>
		<meeting>the 2022 Conference on Empirical Methods in Natural Language Processing<address><addrLine>Abu Dhabi, United Arab Emirates</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="1892" to="1915" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">An annotated dataset of literary entities</title>
		<author>
			<persName><forename type="first">D</forename><surname>Bamman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Popat</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Shen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</title>
		<title level="s">Long and Short Papers</title>
		<meeting>the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="2138" to="2144" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">Mending Fractured Texts. A heuristic procedure for correcting OCR data</title>
		<author>
			<persName><forename type="first">J</forename><surname>Bjerring-Hansen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">D</forename><surname>Kristensen-Mclachlan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Diderichsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">H</forename><surname>Hansen</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Litteratursociologi og kvantitative litteraturstudier: Den historiske roman i det moderne gennembrud som case</title>
		<author>
			<persName><forename type="first">J</forename><surname>Bjerring-Hansen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">Ø</forename><surname>Rasmussen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Passage-Tidsskrift for litteratur og kritik</title>
		<imprint>
			<biblScope unit="volume">38</biblScope>
			<biblScope unit="page" from="171" to="189" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">The 1872 Introduction to Hovedstrømninger i det 19de Aarhundredes Litteratur (Main Currents of Nineteenth-Century Literature)</title>
		<author>
			<persName><forename type="first">G</forename><surname>Brandes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">R</forename><surname>Wilkinson</surname></persName>
		</author>
		<idno type="DOI">10.1632/pmla.2017.132.3.696</idno>
	</analytic>
	<monogr>
		<title level="j">PMLA/Publications of the Modern Language Association of America</title>
		<imprint>
			<biblScope unit="volume">132</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="696" to="705" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<title level="m" type="main">The novel in the Spanish Silver Age: a digital analysis of genre using machine learning</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">Calvo</forename><surname>Tello</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2021">2021</date>
			<publisher>Bielefeld University Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Calibration of Pre-trained Transformers</title>
		<author>
			<persName><forename type="first">S</forename><surname>Desai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Durrett</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2020.emnlp-main.21</idno>
		<ptr target="https://aclanthology.org/2020.emnlp-main.21" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP</title>
				<editor>
			<persName><forename type="first">B</forename><surname>Webber</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">T</forename><surname>Cohn</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Y</forename><surname>He</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Y</forename><surname>Liu</surname></persName>
		</editor>
		<meeting>the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP</meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="295" to="302" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding</title>
		<author>
			<persName><forename type="first">J</forename><surname>Devlin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M.-W</forename><surname>Chang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Toutanova</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/N19-1423</idno>
		<ptr target="https://aclanthology.org/N19-1423" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</title>
				<editor>
			<persName><forename type="first">J</forename><surname>Burstein</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">C</forename><surname>Doran</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">T</forename><surname>Solorio</surname></persName>
		</editor>
		<meeting>the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies<address><addrLine>Minneapolis, Minnesota</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="4171" to="4186" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Danish Foundation Models</title>
		<author>
			<persName><forename type="first">K</forename><surname>Enevoldsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Hansen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">S</forename><surname>Nielsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">A F</forename><surname>Egebaek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">V</forename><surname>Holm</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">C</forename><surname>Nielsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Bernstorff</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Larsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">B</forename><surname>Jørgensen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Højmark-Bertelsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">B</forename><surname>Vahlstrup</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Møldrup-Dalum</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Nielbo</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2311.07264</idno>
	</analytic>
	<monogr>
		<title level="m">(programming languages, logics, formal systems) that does not explicitly address natural-language issues broadly construed (natural-language processing, computational linguis</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note>Covers natural language processing. Roughly includes material in ACM Subject Class I.. tics, speech, text retrieval, etc.) is not appropriate for this area</note>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Challenges and solutions for Latin named entity recognition</title>
		<author>
			<persName><forename type="first">A</forename><surname>Erdmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Brown</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Joseph</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Janse</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Ajaka</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Elsner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M.-C</forename><surname>De Marneffe</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH)</title>
				<meeting>the Workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH)</meeting>
		<imprint>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="85" to="93" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<title level="m" type="main">The English historical novel</title>
		<author>
			<persName><forename type="first">A</forename><surname>Fleishman</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1971">1971</date>
			<publisher>Johns Hopkins University Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<title level="m" type="main">Genre</title>
		<author>
			<persName><forename type="first">J</forename><surname>Frow</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2014">2014</date>
			<publisher>Routledge</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<author>
			<persName><forename type="first">G</forename><surname>Furuland</surname></persName>
		</author>
		<title level="m">Romanen som vardagsvara: förläggare, författare och skönlitterära häftesserier i Sverige 1833-1851 från Lars Johan Hierta till Albert Bonnier</title>
				<imprint>
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
	<note type="report_type">PhD thesis</note>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Origins of the US genre-fiction system, 1890-1956</title>
		<author>
			<persName><forename type="first">A</forename><surname>Goldstone</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Book history</title>
		<imprint>
			<biblScope unit="volume">26</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="203" to="233" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<title level="m" type="main">Aristotle&apos;s poetics</title>
		<author>
			<persName><forename type="first">S</forename><surname>Halliwell</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1998">1998</date>
			<publisher>University of Chicago Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<monogr>
		<title level="m" type="main">spaCy: Industrial-strength Natural Language Processing in Python</title>
		<author>
			<persName><forename type="first">M</forename><surname>Honnibal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Montani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Van Landeghem</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Boyd</surname></persName>
		</author>
		<idno type="DOI">10.5281/zenodo.1212303</idno>
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Normalisation of 16th and 17th century texts in French and geographical named entity recognition</title>
		<author>
			<persName><forename type="first">E</forename><surname>Kogkitsidou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Gambette</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 4th ACM SIGSPATIAL Workshop on Geospatial Humanities</title>
				<meeting>the 4th ACM SIGSPATIAL Workshop on Geospatial Humanities</meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="28" to="34" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Operationalizing a National Digital Library: The Case for a Norwegian Transformer Model</title>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">E</forename><surname>Kummervold</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>De La Rosa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Wetjen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">A</forename><surname>Brygfjeld</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/2021.nodalida-main.3" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa)</title>
				<meeting>the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa)<address><addrLine>Reykjavik, Iceland; Sweden</addrLine></address></meeting>
		<imprint>
			<publisher>Linköping University Electronic Press</publisher>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="20" to="29" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Genre Classification and the Current State of Turkmen Musical Folklore</title>
		<author>
			<persName><forename type="first">D</forename><surname>Kurbanova</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Culture and Arts in the Modern World</title>
		<imprint>
			<biblScope unit="volume">24</biblScope>
			<biblScope unit="page" from="155" to="167" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Development and Evaluation of Pre-trained Language Models for Historical Danish and Norwegian Literary Texts</title>
		<author>
			<persName><forename type="first">A</forename><surname>Al-Laith</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Conroy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Bjerring-Hansen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Hershcovich</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/2024.lrec-main.431" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024</title>
				<editor>
			<persName><forename type="first">N</forename><surname>Calzolari</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M.-Y</forename><surname>Kan</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">V</forename><surname>Hoste</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Lenci</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Sakti</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">N</forename><surname>Xue</surname></persName>
		</editor>
		<meeting>the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024<address><addrLine>Torino, Italia</addrLine></address></meeting>
		<imprint>
			<publisher>ELRA and ICCL</publisher>
			<date type="published" when="2024">2024</date>
			<biblScope unit="page" from="4811" to="4819" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<monogr>
		<title level="m" type="main">DeepGenre: Deep Neural Networks for Genre Classification in Literary Works</title>
		<author>
			<persName><forename type="first">S</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Huang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Sun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Zhang</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Decoupled Weight Decay Regularization</title>
		<author>
			<persName><forename type="first">I</forename><surname>Loshchilov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Hutter</surname></persName>
		</author>
		<ptr target="https://api.semanticscholar.org/CorpusID:53592270" />
	</analytic>
	<monogr>
		<title level="m">International Conference on Learning Representations</title>
				<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<monogr>
		<author>
			<persName><forename type="first">G</forename><surname>Lukács</surname></persName>
		</author>
		<title level="m">Der Historische Roman. 1937</title>
				<meeting><address><addrLine>Berlin</addrLine></address></meeting>
		<imprint>
			<publisher>Aufbau-Verlag</publisher>
			<date type="published" when="1955">1955</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Sentiment analysis under temporal shift</title>
		<author>
			<persName><forename type="first">J</forename><surname>Lukes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Søgaard</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/W18-6210</idno>
		<ptr target="https://aclanthology.org/W18-6210" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis</title>
				<editor>
			<persName><forename type="first">A</forename><surname>Balahur</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><forename type="middle">M</forename><surname>Mohammad</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">V</forename><surname>Hoste</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">R</forename><surname>Klinger</surname></persName>
		</editor>
		<meeting>the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis<address><addrLine>Brussels, Belgium</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="65" to="71" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">Recent advances in natural language processing via large pre-trained language models: A survey</title>
		<author>
			<persName><forename type="first">B</forename><surname>Min</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Ross</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Sulem</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">P B</forename><surname>Veyseh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">H</forename><surname>Nguyen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Sainz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Agirre</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Heintz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Roth</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM Computing Surveys</title>
		<imprint>
			<biblScope unit="volume">56</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="1" to="40" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<analytic>
		<title level="a" type="main">Style, Inc. Reflections on Seven Thousand Titles (British Novels, 1740?1850</title>
		<author>
			<persName><forename type="first">F</forename><surname>Moretti</surname></persName>
		</author>
		<idno type="DOI">10.1086/606125</idno>
	</analytic>
	<monogr>
		<title level="j">Critical Inquiry</title>
		<imprint>
			<biblScope unit="volume">36</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="134" to="158" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<analytic>
		<title level="a" type="main">Fredric Jameson. The Antinomies of Realism</title>
		<author>
			<persName><forename type="first">R</forename><surname>Mucignat</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Orbis Litterarum</title>
		<imprint>
			<biblScope unit="volume">71</biblScope>
			<biblScope unit="issue">5</biblScope>
			<biblScope unit="page" from="430" to="431" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
	<note>: Verso, 2013, 326</note>
</biblStruct>

<biblStruct xml:id="b29">
	<monogr>
		<title level="m" type="main">Romanens århundrede: studier i den masselaeste oversatte roman i Danmark 1800-1870</title>
		<author>
			<persName><forename type="first">E</forename><surname>Munch-Petersen</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1978">1978</date>
		</imprint>
	</monogr>
	<note>No Title</note>
</biblStruct>

<biblStruct xml:id="b30">
	<monogr>
		<title level="m" type="main">Scandeval: A benchmark for Scandinavian natural language processing</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">S</forename><surname>Nielsen</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2304.00906</idno>
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b31">
	<analytic>
		<title level="a" type="main">Genre Classification of Books on Spanish</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">A</forename><surname>Nolazco-Flores</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">V</forename><surname>Guerrero-Galván</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Del-Valle-Soto</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">P</forename><surname>Garcia-Perera</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Access</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="page" from="132878" to="132892" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b32">
	<monogr>
		<title level="m" type="main">Technologies of the Novel: Quantitative Data and the Evolution of Literary Systems</title>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">D</forename><surname>Paige</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2020">2020</date>
			<publisher>Cambridge University Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b33">
	<analytic>
		<title level="a" type="main">A simple approach to classify fictional and non-fictional genres</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">R</forename><surname>Qureshi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Ranjan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Rajkumar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Shah</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Second Workshop on Storytelling</title>
				<meeting>the Second Workshop on Storytelling</meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="81" to="89" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b34">
	<monogr>
		<title level="m" type="main">Language Models are Unsupervised Multitask Learners</title>
		<author>
			<persName><forename type="first">A</forename><surname>Radford</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Child</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Luan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Amodei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Sutskever</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b35">
	<monogr>
		<title level="m" type="main">Reading the romance: Women, patriarchy, and popular literature</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">A</forename><surname>Radway</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2009">2009</date>
			<publisher>Univ of North Carolina Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b36">
	<analytic>
		<title level="a" type="main">Automated analysis of Bangla poetry for classification and poet identification</title>
		<author>
			<persName><forename type="first">G</forename><surname>Rakshit</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Ghosh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Bhattacharyya</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Haffari</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 12th international conference on natural language processing</title>
				<meeting>the 12th international conference on natural language processing</meeting>
		<imprint>
			<date type="published" when="2015">2015</date>
			<biblScope unit="page" from="247" to="253" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b37">
	<analytic>
		<title level="a" type="main">Stage Direction Classification in French Theater: Transfer Learning Experiments</title>
		<author>
			<persName><forename type="first">A</forename><surname>Schneider</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">R</forename><surname>Fabo</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 8th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature</title>
				<meeting>the 8th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature</meeting>
		<imprint>
			<publisher>LaTeCH-CLfL</publisher>
			<date type="published" when="2024">2024</date>
			<biblScope unit="volume">2024</biblScope>
			<biblScope unit="page" from="278" to="286" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b38">
	<analytic>
		<title level="a" type="main">The rise and fall of genre differentiation in English-language fiction</title>
		<author>
			<persName><forename type="first">A</forename><surname>Sharmaa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Hu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Shang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Singhal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Underwood</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">DH2020 (ADHO) Proceedings</title>
				<imprint>
			<date type="published" when="1973">2020. 0073</date>
			<biblScope unit="volume">1613</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b39">
	<monogr>
		<title level="m" type="main">The forms of historical fiction: Sir Walter Scott and his successors</title>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">E</forename><surname>Shaw</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1983">1983</date>
			<publisher>Cornell University Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b40">
	<analytic>
		<title level="a" type="main">Transfer to a Low-Resource Language via Close Relatives: The Case Study on Faroese</title>
		<author>
			<persName><forename type="first">V</forename><surname>Snaebjarnarson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Simonsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Glavaš</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Vulić</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)</title>
				<meeting>the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)<address><addrLine>Tórshavn, Faroe Islands; Sweden</addrLine></address></meeting>
		<imprint>
			<publisher>Linköping University Electronic Press</publisher>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b41">
	<analytic>
		<title level="a" type="main">At fortaelle historier om historien: Om den historiske roman i relation til Poul Vads Rubruk (1972) og Ib Michaels Troubadurens laerling</title>
		<author>
			<persName><forename type="first">L</forename><surname>Søndergaard</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Fortaellingen i Norden efter 1960</title>
				<imprint>
			<publisher>Aalborg Universitetsforlag</publisher>
			<date type="published" when="1983">1983. 2004</date>
			<biblScope unit="page" from="404" to="412" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b42">
	<analytic>
		<title level="a" type="main">The Danish Gigaword Corpus</title>
		<author>
			<persName><forename type="first">L</forename><surname>Strømberg-Derczynski</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Ciosici</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Baglini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">H</forename><surname>Christiansen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">A</forename><surname>Dalsgaard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Fusaroli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">J</forename><surname>Henrichsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Hvingelby</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Kirkedal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">S</forename><surname>Kjeldsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Ladefoged</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">Å</forename><surname>Nielsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Madsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">L</forename><surname>Petersen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">H</forename><surname>Rystrøm</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Varab</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/2021.nodalida-main.46" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa)</title>
				<meeting>the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa)<address><addrLine>Reykjavik, Iceland; Sweden</addrLine></address></meeting>
		<imprint>
			<publisher>Linköping University Electronic Press</publisher>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="413" to="421" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b43">
	<monogr>
		<title level="m" type="main">The life cycles of genres</title>
		<author>
			<persName><forename type="first">T</forename><surname>Underwood</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b44">
	<monogr>
		<title level="m" type="main">The transformation of gender in Englishlanguage fiction</title>
		<author>
			<persName><forename type="first">T</forename><surname>Underwood</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Bamman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Lee</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b45">
	<analytic>
		<title level="a" type="main">Genre, computation, and the varieties of twentieth-century US fiction</title>
		<author>
			<persName><forename type="first">M</forename><surname>Wilkens</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Cultural Analytics</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="issue">2</biblScope>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b46">
	<monogr>
		<title level="m" type="main">Fortiden som spejl</title>
		<author>
			<persName><forename type="first">M</forename><surname>Winge</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2016">2016</date>
			<publisher>Lindhardt og Ringhof</publisher>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
