<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Rediscovering the 1890s: A Norwegian Poetry Corpus ⋆</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Ranveig</forename><surname>Kvinnsland</surname></persName>
							<email>ranveig.kvinnsland@ibsen.uio.no</email>
							<affiliation key="aff0">
								<orgName type="department">Centre for Ibsen Studies</orgName>
								<orgName type="institution">University of Oslo</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Ingerid</forename><surname>Løyning</surname></persName>
							<email>ingerid.dale@nb.no</email>
							<affiliation key="aff1">
								<orgName type="institution">The National Library of Norway</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Lars</forename><forename type="middle">Magne</forename><surname>Tungland</surname></persName>
							<email>lars.tungland@nb.no</email>
							<affiliation key="aff1">
								<orgName type="institution">The National Library of Norway</orgName>
							</affiliation>
						</author>
						<title level="a" type="main">Rediscovering the 1890s: A Norwegian Poetry Corpus ⋆</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">AD9E7408070A7D3C8BF3D8D2E9FB82E8</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T19:50+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>poetry</term>
					<term>corpus construction</term>
					<term>Norwegian literary history</term>
					<term>computational literary analysis</term>
					<term>rhyme</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>What language was used most for written poetry in the 1890s in Norway? What was the gender distribution of published poets in that decade? To what degree does the lyrical subject "reveal" itself, explicitly or implicitly, in the poems? This article presents answers to these questions in relation to corpus of poetry written by Norwegian and Danish poets, published between 1890 and 1899. The corpus contains 3,440 poems from 81 books, encoded in TEI XML, and annotated with author and publication metadata, as well as lyrical features such as rhyme schemes and presence of the lyrical subject. The objective of constructing this corpus is to fill a resource gap for research on Norwegian poetry. It provides empirical data for investigating historical claims about the literary period, and comparing findings from both close and distant readings of the data. It is structured in a well-known format (TEI) that enables testing and development of automatic poetry analysis tools, as well as the addition of further annotations. We have published the data in a public repository, making it easily findable, accessible, interoperable and resuable (FAIR).</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>The purpose of this paper is to present a new corpus of digitized poetry published between 1890 and 1899 by Norwegian and Danish poets for a Norwegian audience. The construction of this specific dataset is part of the research project Norwegian Romantic Nationalisms (NORN) at the University of Oslo, aiming to model nineteenth century literature. <ref type="bibr" target="#b0">1</ref> NORN Poems is a corpus that encapsulates many different linguistic and genre-specific features from a decade with a large literary production in Norway. While previous literary research on the period has mainly focused on a few canonized authors, this corpus enables research on lyrical features both as a cultural trend and as a marker of its time of production.</p><p>There are two goals connected to this material. The first and main goal is to use computational approaches to examine genre-specific trends in a wide variation of poems, produced during a historical period that existing literary studies claim to be influential for modern Norwegian poetry. Previous studies has mainly focused on a handful of canonized terms and poets, while this corpus will enable comprehensive investigations of the genre as a whole during the 1890s. The second goal is to explore new analytical methods and fill a gap of Norwegianspecific digital tools for literary studies.</p><p>We describe this gap in section 2, and the data selection process and the curation of the corpus metadata in section 3. The process of extracting the poems and encoding them in TEI is summarized in section 4. Section 5 presents some key features and distributions of the corpus metadata that help us answer our overarching questions about the 1890s poetry scene. Section 6 describes the annotation and distribution of lyrical features in the text data. We conclude the corpus construction work in section 7 and outline planned future work.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Background</head><p>De Sisto et al. (2024) present an overview of existing digital tools and data for automatic poetry analysis, focusing on resources for European languages. [7] Among the 57 described resources, none were developed for any of the Scandinavian languages. The Corpus of Czech Verse and The Diachronic Spanish Sonnet Corpus (DISCO) are most similar to the corpus we present in this article in terms of annotations and size respectively. The Corpus of Czech Verse is impressively large with close to 80 000 poems, and is annotated extensively on the token and verse level. [14] Most of the annotations are closely linked to linguistic particularities of the Czech language, and the wide selection of annotation types allows for many different statistical analyses.</p><p>DISCO is a smaller corpus, containing 4,085 Spanish-written sonnets from the fifteenth through the nineteenth centuries. [18] Annotations include author information, metrical encoding and enjambment. The diachronicity of the data lends itself to time series analyses, and enable comparisons of features over time.</p><p>These two corpora provide empirical data for poetry analysis in their respective languages and for their respective literary histories, and the lack of such a resource for Norwegian led us to create NORN Poems. Like Czech Verse, our corpus include phonemic transcriptions for each verse line, as well as a rhyme tag, and a collated rhyme scheme for each stanza. Like DISCO, we encode our corpus in TEI and share it in public repositories on widely used platforms. <ref type="bibr" target="#b1">2</ref> The corpora are similar in size and enable many of the same analyses and discussions. However, DISCO is made exclusively out of sonnets which streamlines the computational analyses in some ways while NORN Poems is composed of a wider variation of poems.</p><p>This lyric type variation in NORN Poems enables more comprehensive literary analyses of the genre as a whole in the given time period: What was the gender distribution among published poets? What language did they write poetry in, after almost half a century of discussions about what written Norwegian should look like? What does the distribution of works by canonical authors look like, compared to non-canonical? Can we find non-canonical authors that were popular in their own time, only to be forgotten by the literary histories?</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Data Selection</head><p>There are 22,591 books published between 1800 and 1899 that have been digitized by the Norwegian National Library. Researchers from the Norwegian literary research project ImagiNation has manually annotated 489 books of poetry from the nineteenth century, published between 1814 and 1905. <ref type="bibr" target="#b2">3</ref>  [11] Out of these, the corpus we present here is made up of the 81 first edition books of poetry published in Norway and Denmark from 1890 through 1899, in their original language. <ref type="bibr" target="#b3">4</ref> The total number of digitized publications of poetry from this decade is 135, however these books include re-publications of older poetry. We have left these out in order to establish the qualities the new poetry from the 1890s shares; this allows us to discuss the labels and features of the decade as identified in literary histories.</p><p>There are several reasons for limiting the corpus to these 81 books of poetry. Firstly, according to many literary historians, this decade allegedly marks significant changes in the lyric genre. Something "new" happens when poetry supposedly reappears after a downturn in the 1870s and 1880s. [2, 4, 9, 12] Secondly, the critical reception suggests that this new wave is an expression of decadence, symbolism, neo-romanticism, and early modernism. These four terms have been connected to the lyric genre in different ways, but there are discrepancies and loose ends. We hypothesize that this corpus can provide empirical data for the claims made by literary studies.</p><p>Finally, for practical reasons the corpus had to be small enough to easily toggle analysis methods. At the same time, it should be large enough to ensure that statistical patterns and analyses are generalizable to the lyric genre of the period, and not skewed too much by outliers and discrepancies.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Corpus Construction and Data Processing</head><p>All the poems in our corpus were originally printed in books, and digitized and made available by the National Library of Norway. <ref type="bibr" target="#b4">5</ref> One of the main challenges in this project was structuring the dataset with the poem as the main object of study, instead of the book. The process involved several manual steps, including listing the title of each poem, identifying the pages each poem was located on, and assessing each split.</p><p>We received a collection of Alto XML files from the National Library's DH-lab, containing the optical character recognised (OCR) text from the 81 handpicked, scanned books of poetry. <ref type="bibr" target="#b5">6</ref> We extracted the individual poems and indexed them. A thorough manual cleaning process was conducted on the resulting text files. We corrected line breaks and indentations, fixed OCR errors, and removed unwanted artifacts such as page numbers, stamp text, and page breaks. For data analysis and processing, as well as preservation and sharing, we structured the poem texts and metadata in the TEI format. <ref type="bibr" target="#b6">7</ref> The hierarchy of textual elements in each poem consists of stanzas, verses, word tokens, and syllables. Additionally, rhyme schemes are encoded with the TEI attribute @rhyme on the stanza text elements.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Corpus Overview</head><p>We have encoded certain key features in the metadata of the corpus which will enable further investigations. Easily quantified measures include gender and language distributions. A more interpretive feature is whether or not the poets are canonized in literary historical accounts of the period. We have encoded canonization of an author in three ways: <ref type="bibr" target="#b7">8</ref> 1. Whether or not an article is written about them in the main online Norwegian Encyclopedia, Store Norske Leksikon.<ref type="foot" target="#foot_7">9</ref> 2. The amount of times they are mentioned in any of the six most prominent Norwegian literary histories of the twentieth century: Elster (1934), Winsnes (1937), Beyer (1952), Dahl (1984), Beyer and Beyer (1996), Andersen (2012). 3. The number of times they are mentioned in any of the articles published in the most prominent Norwegian journal for literary studies, Edda.<ref type="foot" target="#foot_8">10</ref> </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.">Language</head><p>The NORN Poems are written in three different languages: Danish, Landsmål and Riksmål. While Danish had been the standard written language in Norway for many centuries, the nation-building project of the 1800s included debates about the development of a standardized Norwegian language. These debates resulted in the establishment of two new written languages: Landsmål, based on Norwegian dialects, and Riksmål, based on orthographic modification of Danish, today known as Nynorsk and Bokmål. Riksmål is overrepresented in the corpus with 60 of the 81 publications, while 16 books are written solely in Landsmål. Two books alternate between Landsmål and Riksmål for each poem, and only three of the publications are written in Danish. The distribution is visualized in Figure <ref type="figure">1</ref>. The three languages pose a linguistic challenge for the project, both in terms of inconsistencies in the optical character recognition that generated the digital text, as well as the linguistic features and vocabularies.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2.">Gender</head><p>From the ImagiNation corpus as a whole, we have manually identified fifteen female poets of lyric poetry in the nineteenth century. [11] Four of these female poets are represented in our corpus. 56 authors are male, bringing the total to 60 poets in our poetry selection from the 1890s. <ref type="bibr" target="#b10">11</ref> One of the female poets stands out in quantitative terms: Karen Nilsen is represented with four books in this corpus. She wrote another during the same time period, which has not been digitized and is therefore not included here. These five books make her one of the most productive poets of this decade, only matched by Vilhelm Krag.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Figure 1:</head><p>The poetry in the corpus is predominantly written by male authors in Riksmål.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.3.">Canonization</head><p>Sigbjørn Obstfelder is by far the most frequently mentioned poet in the six canonical literary histories, as can be seen by the overview in table A. At the same time, there are 11 poets that are not mentioned anywhere in the literary reference works. The 48 other poets in the corpus are either mentioned a handful of times, or with a frequency of double digits. There is not necessarily a connection between the number of produced works and their position in literary history. For instance, it is interesting to note that Vilhelm Krag is one of the most prolific authors, but is mostly remembered by a single poem, "Fandango". Menawhile, Sigbjørn Obstfelder published only one book of poems in his lifetime, which includes the poem "Jeg ser" which has become emblematic for the entire decade.</p><p>Karen Nilsen is not mentioned in any of the canonical literary histories before 2012, and was first described explicitly in 1988 in Norsk kvinnelitteraturhistorie [12]. According to Steinar Gimnes, Karen Nilsen must have been one of the most read authors at the time. [12, p. 141] Her first book was printed three times and her second book was published in an ornamental edition, which indicates that she had many interested readers.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Data Exploration</head><p>The field of lyric theory has generated a number of different definitions of the lyric genre. Many of the most prominent features of modern genre theory, can be difÏcult to operationalize. However, some genre features from the nineteenth century still hold a place in Western lyric traditions. The features include that the poems are relatively short, often separated into stanzas with a certain pattern, and often contain the presence of a lyric speaker. [5, p. 89] In these preliminary explorations of NORN Poems, we want to examine how these three traditionally important lyric features appear in the corpus.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.1.">Poem Length</head><p>By means of counting verse lines in NORN Poems, we can confirm that the majority of the poems are short. We find only a couple of outliers with closer to 1000 lines as shown in Figure <ref type="figure" target="#fig_1">2</ref>. More than 75% of all poems are shorter than 42 lines. Additionally, 50% of the corpus have 26 lines or shorter, which confirm the claim that poem length is a distinctive genre feature in NORN Poems.</p><p>Poem length is also a distinctive feature within the longest poems in the corpus, although these include greater variety in form. The 25% longest poems span between 42 and 993 lines, and only 55 of the poems are longer than 200 lines, as illustrated by Figure <ref type="figure" target="#fig_1">2</ref>. However, the 55 longest poems are written by 22 different poets, which can indicate an interest in playing with the poem length tradition. These long poems have a large variation in line length, stanza length, and whether they are even stanzaic, which confirms the claim that the long poem as a concept is "a generic hybrid".  </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 1</head><p>Example of annotated rhyme schemes for the poem "Introduction" by Hjalmar Meidell in Ranker: et bundt Digte. The orthographic text of each of the stanza's verses are listed alongside the phonemic transcription of the last stressed syllable(s) with IPA notation: periods mark syllable boundaries and the preliminary ticks mark stress and toneme.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Stanza Transcription Scheme</head><p>Jeg vil binde mit sinds, de tunge tanker. tA.k@r a og hefte dem sammen til blomsterranker.</p><p>rA.k@r a Og alle de blomster -med bladene mørke -"mr.k@ b som spirte i regn, og som vokste i tørke, "tr.k@ b dem tager jeg med i de blomsterranker, rA.k@r a som jeg vil binde af mine tanker.</p><p>tA.k@r a</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.2.">Rhyme Schemes</head><p>Rhyme and rhythm (as sound patterning) play a crucial role in the lyric genre's aural dimension. Jonathan Culler claims that "[s]uch patterning is highly seductive, which is in part to say that a given sequence of sounds does not have some fixed, necessary effect, but invites readers to an experience. " [5, p. 134] One of our objectives is to annotate the NORN Poems with end rhyme schemes, because we are interested in recurring patterns in stanzaic poems, and we want to examine the distribution of end rhyme schemes. The annotations will later serve as training data for automatic poetry analysis tools as well.</p><p>Reddy and Knight ( <ref type="formula">2011</ref>) trained a generative model on data with pre-annotated rhyme schemes, without any phonetic information. [15] They relied on aggregating rhyming word pairs that occurred frequently in their training data, but limited the model's possible rhyme scheme output to the annotations that already existed in their dataset. Petr Plecháč (2018) developed further on Reddy and Knight's idea with a generative model, and trained a collocationdriven rhyme tagger which identifies rhyming word pairs by aggregating the frequencies that end-of-line-words occur within the same stanza. [13]  Both Reddy and Knight (2011) and Plecháč (2018) disregard phonetic information, either due to lack of resources or the possible discrepancies in pronunciations over time. They both make use of large datasets that have already been annotated with rhyming patterns, and no such dataset previously exists for Norwegian poetry.</p><p>However, there are two recently developed resources for Norwegian pronunciation: one multidialectal pronunciation dictionary and grapheme-to-phoneme models for the written standard Bokmål. [17, 16] With this phonemic information we segment verse lines into syllable sequences, identify the last stressed syllable, and compare end rhymes across verses within a stanza, exemplified in Table <ref type="table">1</ref>. Our approach is rule based and naively annotates any list of lists with letter sequences. <ref type="bibr" target="#b11">12</ref> In future work, we will explore whether using modern pronunciation information works better than not using it at all for the initial rhyme scheme annotation.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.3.">The Presence of a Lyric Subject</head><p>Lyric poetry is often recognized by the presence of a speaker in the poem, and this subject can either point inwards through a subject, or outwards through a lyric address. Throughout the nineteenth century, the subject is often understood as an expression of the poet, which establishes the lyric genre as a specifically private genre. With the development of New Criticism in the mid-twentieth century, the lyric subject is understood not as an act of the poet, but as the "speech of a persona". [5, p. 84]  The lyric subject may be present without an explicit mark by personal pronoun, which emphasizes the experience of a speaker in the poem. In order to explore the presence of the lyric subject in NORN Poems, and annotate unambiguously, we defined four groups of words that all indicate that the speaker is present in the poem's context, and assigned labels for the type of lyric speaker: Explicit subject, explicit object, implicit subject, and deixis. Table <ref type="table">2</ref> lists all the search terms that we used to probe the poems for different ways that the lyric subject appears. The "Indicating words" also illustrate the subtle differences in spelling between the three languages in NORN Poems.</p><p>The search results displayed in Figure <ref type="figure" target="#fig_2">3</ref> show that every poem in NORN Poems contains a reference to the lyric subject by use of one of the indicating words. The distribution plot confirms that the lyric subject is a prominent genre feature in this corpus. On its own, it interesting to note how omnipresent the lyric subject is during this decade. However, further investigations on how the speaker is conveyed and how it acts in the poems will be best explored through close reading.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 2</head><p>Categories of lyric subject. Given the presence of any of the indicating words, the category label is assigned to the poem. The definition specifies the linguistic features that are shared among the indicating words.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Category</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Indicating words Definition</head><p>Explicit </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.">Conclusion</head><p>This article presents the corpus NORN Poems that enables further research into literary and linguistic features of poetry published for Norwegian readers in the 1890s. We have highlighted three contextual features among the encoded metadata in Section 5, which enable relevant data partitioning, selection, and exploration. We explored more poem specific qualities in Section 6, and ways to operationalize features of the lyric genre.</p><p>Going forward, we aim to investigate other lyric features such as alliteration, anaphores, enjambment, themes and topics, and connections between poets and their poetry. A poetry analysis tool for Norwegian, both for annotating and visualizing the poems is also in development.  </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Author Canonization</head></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head></head><label></label><figDesc>[10, p. 2]   (a) The majority of the poems are short.(b) Exponentially increasing poem length by lines.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: The two plots illustrate how the vast majority of poems are below 200 lines.</figDesc><graphic coords="6,89.28,414.07,200.02,156.37" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: The lyric subject is present in all the NORN Poems, most notably marked by the Implicit subject.</figDesc><graphic coords="9,89.28,84.18,416.72,264.77" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 3 :</head><label>3</label><figDesc>Overview of author names mentioned by various literary reference works. Author names are listed in the index. Reference works are indicated by the column headers. Two authors are listed here with their real names and an asterisk, whereas the works in the NORN poems corpus were published with pseudonyms: Olaf Berg went by the name of "Martin Kvaennavika", and Sofie Sigurdsen wrote as "Edith". The row values show how often a poet is mentioned in a given literary history, whether an article is written about the poet in SNL (denoted in binary values by 1 or 0 regardless of how many times the name is mentioned), and the hit result when searching for the poet's name in Edda's archive search. The highest value in each column is formatted in boldface.</figDesc><table><row><cell>Author Andersen, Tryggve Berg, Olaf* Blaumüller, Edvard Boye, Eyvind Brandes, Georg Braenden, Lars Bull, Jacob Breda Caspari, Theodor</cell><cell>1934 1937 1952 1984 1996 2012 SNL Edda 14 50 12 15 18 26 1 30 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 80 11 2 20 14 1 292 0 0 0 0 0 0 0 0 2 2 1 6 2 3 1 2</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_0">https://github.com/norn-uio/norn-poems</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_1">ImagiNation: https://www.ntnu.edu/isl/imagination</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_2">Although the majority of the literary production in Norway was published in Norwegian and in Norwegian cities, there were still a large influence by the Danish literary scene. Most major Norwegian writers were published in Denmark until the turn of the century.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_3">The National Library of Norway's online library: https://www.nb.no/search</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_4">The DH-lab: dh.nb.no</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="7" xml:id="foot_5">TEI P5: https://www.tei-c.org/release/doc/tei-p5-doc/en/html/VE.html</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="8" xml:id="foot_6">A full overview can be found in appendix A.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="9" xml:id="foot_7">https://snl.no</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="10" xml:id="foot_8">https://www.idunn.no/journal/edda</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="11" xml:id="foot_9">This number does not include one poetry collection called "Første greb på lyren: digte udgivne af gymnasialsamfundet 'Fram' (1883-1891)" made up by numerous anonymous authors, all connected to the same school.<ref type="bibr" target="#b0">[1]</ref> </note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="12" xml:id="foot_10">Link to https://github.com/norn-uio/poetry-analysis</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>This work is funded by the European Union. The views and opinions expressed are, however, those of the authors only and do not necessarily reflect those of the European Union or European Commission. Neither the European Union nor the granting authority can be held responsible for them.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">Første greb på lyren : Digte udgivne af gymnasialsamfundet</title>
		<author>
			<persName><forename type="first">G</forename><surname>Fram</surname></persName>
		</author>
		<ptr target="https://urn.nb.no/URN:NBN:no-nb%5C%5Fdigibok%5C%5F2014041406092" />
		<imprint>
			<date type="published" when="1883">1883. 1891. 1892</date>
			<publisher>Fram</publisher>
			<pubPlace>Hamar</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">T</forename><surname>Andersen</surname></persName>
		</author>
		<title level="m">Norsk Litteraturhistorie</title>
				<imprint>
			<publisher>Universitetsforlaget</publisher>
			<date type="published" when="2012">2012</date>
			<biblScope unit="volume">2</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<title level="m" type="main">Norsk litteraturhistorie</title>
		<author>
			<persName><forename type="first">H</forename><surname>Beyer</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1952">1952</date>
			<publisher>Aschehoug</publisher>
			<pubPlace>Oslo</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<author>
			<persName><forename type="first">H</forename><surname>Beyer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Beyer</surname></persName>
		</author>
		<title level="m">Norsk litteraturhistorie</title>
				<imprint>
			<publisher>Tano Aschehoug</publisher>
			<date type="published" when="1996">1996</date>
			<biblScope unit="volume">5</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">Theory of the Lyric</title>
		<author>
			<persName><forename type="first">J</forename><surname>Culler</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2017">2017</date>
			<publisher>Harvard University Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<author>
			<persName><forename type="first">W</forename><surname>Dahl</surname></persName>
		</author>
		<title level="m">Norges litteratur II : Tid og tekst 1884-1935</title>
				<meeting><address><addrLine>Oslo</addrLine></address></meeting>
		<imprint>
			<publisher>Aschehoug</publisher>
			<date type="published" when="1984">1984</date>
			<biblScope unit="volume">2</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Understanding Poetry Using Natural Language Processing Tools: A Survey</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">De</forename><surname>Sisto</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Hernández-Lorenzo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>De La Rosa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Ros</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>González-Blanco</surname></persName>
		</author>
		<idno type="DOI">10.1093/llc/fqae001</idno>
	</analytic>
	<monogr>
		<title level="m">Digital Scholarship in the Humanities</title>
				<imprint>
			<date type="published" when="2024">2024</date>
			<biblScope unit="page">e001</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<author>
			<persName><forename type="first">K</forename><surname>Elster</surname></persName>
		</author>
		<title level="m">Illustert Norsk Litteraturhistorie</title>
				<meeting><address><addrLine>Oslo</addrLine></address></meeting>
		<imprint>
			<publisher>Gyldendal norsk forlag</publisher>
			<date type="published" when="1934">1934</date>
			<biblScope unit="volume">5</biblScope>
		</imprint>
	</monogr>
	<note>2nd ed.</note>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title/>
		<author>
			<persName><forename type="first">H</forename><surname>Jaeger</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Illustreret norsk literaturhistorie : B</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<date type="published" when="1896">1896</date>
			<publisher>Bigler</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<title level="m" type="main">Forms of Expansion: Recent Long Poems by Women</title>
		<author>
			<persName><forename type="first">L</forename><surname>Keller</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1997">1997</date>
			<publisher>University of Chicago Press</publisher>
			<pubPlace>Chicago</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">ImagiNation-korpuset</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">S</forename><surname>Malvik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">L</forename><surname>Boasson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">O</forename><surname>Eliassen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">G</forename><surname>Johnsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">L</forename><surname>Dale</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">B</forename><surname>Birkenes</surname></persName>
		</author>
		<idno type="DOI">10.18261/edda.111.3.3</idno>
		<ptr target="https://www.idunn.no/doi/abs/10.18261/edda.111.3.3" />
	</analytic>
	<monogr>
		<title level="j">Edda</title>
		<imprint>
			<biblScope unit="volume">111</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="120" to="137" />
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<title level="m">Norsk kvinnelitteraturhistorie : B</title>
				<editor>
			<persName><forename type="first">H</forename><surname>Engelstad</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Hareide</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">I</forename><surname>Iversen</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">T</forename><surname>Steinfeld</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Øverland</surname></persName>
		</editor>
		<meeting><address><addrLine>Oslo</addrLine></address></meeting>
		<imprint>
			<publisher>Pax</publisher>
			<date type="published" when="1988">1988</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="1600" to="1900" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">A Collocation-Driven Method of Discovering Rhymes (in Czech, English, and French Poetry)</title>
		<author>
			<persName><forename type="first">P</forename><surname>Plecháč</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-319-98017-1\_5</idno>
	</analytic>
	<monogr>
		<title level="m">Taming the Corpus</title>
				<editor>
			<persName><forename type="first">M</forename><surname>Fidler</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">V</forename><surname>Cvrček</surname></persName>
		</editor>
		<editor>
			<persName><surname>Cham</surname></persName>
		</editor>
		<imprint>
			<publisher>Springer International Publishing</publisher>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="79" to="95" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">The Corpus of Czech Verse</title>
		<author>
			<persName><forename type="first">P</forename><surname>Plecháč</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Kolár</surname></persName>
		</author>
		<idno type="DOI">10.12697/smp.2015.2.1.05</idno>
		<ptr target="http://ojs.utlib.ee/index.php/smp/article/view/smp.2015.2.1.05" />
	</analytic>
	<monogr>
		<title level="j">Studia Metrica et Poetica</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="107" to="118" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Unsupervised Discovery of Rhyme Schemes</title>
		<author>
			<persName><forename type="first">S</forename><surname>Reddy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Knight</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies</title>
				<editor>
			<persName><forename type="first">D</forename><surname>Lin</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Y</forename><surname>Matsumoto</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">R</forename><surname>Mihalcea</surname></persName>
		</editor>
		<meeting>the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies<address><addrLine>Portland, Oregon, USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2011">2011</date>
			<biblScope unit="page" from="77" to="82" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<title level="m" type="main">Grapheme-to-Phoneme Models for Norwegian</title>
		<author>
			<persName><forename type="first">M</forename><surname>Røsok</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">L</forename><surname>Dale</surname></persName>
		</author>
		<ptr target="https://github.com/Sprakbanken/g2p-nb" />
		<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">NB Uttale: A Norwegian Pronunciation Lexicon with Dialect Variation</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">I</forename><surname>Røsok</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">L</forename><surname>Dale</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024</title>
				<editor>
			<persName><forename type="first">N</forename><surname>Calzolari</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M.-Y</forename><surname>Kan</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">V</forename><surname>Hoste</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Lenci</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Sakti</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">N</forename><surname>Xue</surname></persName>
		</editor>
		<meeting>the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024</meeting>
		<imprint>
			<publisher>ELRA and ICCL</publisher>
			<date type="published" when="2024">2024</date>
			<biblScope unit="page" from="12087" to="12092" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">The Diachronic Spanish Sonnet Corpus: TEI and Linked Open Data Encoding, Data Distribution, and Metrical Findings</title>
		<author>
			<persName><forename type="first">P</forename><surname>Ruiz Fabo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Bermúdez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Sabel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Martínez Cantón</surname></persName>
		</author>
		<author>
			<persName><surname>González-Blanco</surname></persName>
		</author>
		<idno type="DOI">10.1093/llc/fqaa035</idno>
		<ptr target="https://academic.oup.com/dsh/article/36/Supplement%5C%5F1/i68/5902019" />
	</analytic>
	<monogr>
		<title level="m">Digital Scholarship in the Humanities</title>
				<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="volume">36</biblScope>
			<biblScope unit="page" from="68" to="80" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">H</forename><surname>Winsnes</surname></persName>
		</author>
		<title level="m">Norges litteratur: fra Februarrevolutionen til verdenskrigen</title>
				<meeting><address><addrLine>Oslo</addrLine></address></meeting>
		<imprint>
			<publisher>Aschehoug</publisher>
			<date type="published" when="1937">1937</date>
			<biblScope unit="volume">2</biblScope>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
