<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">The GOLEM Triple Store: A Graph-based Representation of Narrative and Fiction</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Franziska</forename><surname>Pannach</surname></persName>
							<email>f.a.pannach@rug.nl</email>
							<affiliation key="aff0">
								<orgName type="department">Center for Language and Cognition (CLCG)</orgName>
								<orgName type="institution">University of Groningen</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Xiaoyan</forename><surname>Yang</surname></persName>
							<email>xiaoyan.yang@rug.nl</email>
							<affiliation key="aff0">
								<orgName type="department">Center for Language and Cognition (CLCG)</orgName>
								<orgName type="institution">University of Groningen</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Noa</forename><forename type="middle">Visser</forename><surname>Solissa</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Center for Language and Cognition (CLCG)</orgName>
								<orgName type="institution">University of Groningen</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Ze</forename><surname>Yu</surname></persName>
							<email>z.yu@rug.nl</email>
							<affiliation key="aff0">
								<orgName type="department">Center for Language and Cognition (CLCG)</orgName>
								<orgName type="institution">University of Groningen</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Andreas</forename><surname>Van Cranenburgh</surname></persName>
							<email>van.cranenburgh@rug.nl</email>
							<affiliation key="aff0">
								<orgName type="department">Center for Language and Cognition (CLCG)</orgName>
								<orgName type="institution">University of Groningen</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Michiel</forename><surname>Van Der Ree</surname></persName>
							<email>michiel.van.der.ree@rug.nl</email>
							<affiliation key="aff1">
								<orgName type="department">Center for Information Technology (CIT)</orgName>
								<orgName type="institution">University of Groningen</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Federico</forename><surname>Pianzola</surname></persName>
							<email>f.pianzola@rug.nl</email>
							<affiliation key="aff0">
								<orgName type="department">Center for Language and Cognition (CLCG)</orgName>
								<orgName type="institution">University of Groningen</orgName>
							</affiliation>
						</author>
						<title level="a" type="main">The GOLEM Triple Store: A Graph-based Representation of Narrative and Fiction</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">662AD4456AD68336ABDCEB5264DC1E79</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T19:28+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In this paper, we present the GOLEM triple store, a massive triple store resource for fiction and narrative. This triple store is the first step towards a large-scale knowledge graph for stories, as well as characters and events in narratives. At the moment, it contains more than 8 million stories collected from the Archive of Our Own (AO3) [1], providing scholars with a tool to derive unique insights into fan narratives and storytelling trends over time.</p><p>Semantic Methods for Events and Stories (SEMMES) Workshop, 2024</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>In this article we introduce a new resource for the large-scale study of fiction on the basis of metadata and "derived data" <ref type="bibr" target="#b1">[2]</ref>, or "mesodata" <ref type="bibr" target="#b2">[3]</ref>, that is, various textual features that allow documents to be compared without accessing their full text. The idea is similar to that of the HathiTrust Extracted Features dataset <ref type="bibr" target="#b3">[4]</ref>, but the features encoded in the GOLEM ("Graphs and Ontologies for Literary Evolution Models") triple store are much richer, also covering narrative and stylistic elements and reader response data (e.g. characters, relationships, topics, readability, sentiment of comments received by the story, etc.). Similar projects exist on a smaller scale for a selection of texts in English <ref type="bibr" target="#b4">[5]</ref>, Dutch <ref type="bibr" target="#b5">[6]</ref> and German <ref type="bibr" target="#b6">[7]</ref>. The creation of the GOLEM triple store has been inspired by such work but will operate on a completely different scale, which requires automating the extraction of textual features for millions of stories.</p><p>The core concept of the GOLEM infrastructure is that of "programmable corpora", i.e. "research-oriented corpora providing an API" <ref type="bibr" target="#b7">[8]</ref>, which makes it easy to reapply scripts, notebooks, and analysis pipelines to all texts in the corpora, inasmuch as they are encoded following the same principles and can be queried via the same API and SPARQL endpoint. Since GOLEM focuses primarily on derived data, there is no need for a resource-intensive XML database of texts encoded in TEI (Text Encoding Initiative) format, like that created by <ref type="bibr" target="#b7">[8]</ref>. 
Only statements about the texts and their reception are stored in the database.</p><p>The first batch of texts made available in the GOLEM triple store was gathered from the largest and most popular fanfiction platform, Archive of Our Own (AO3) <ref type="foot" target="#foot_0">2</ref>. The spread of digital and social media in the 21st century has reconfigured many literary dynamics: it has reduced the influence of literary institutions such as critics, publishers, and schools, creating more spaces and occasions for niche and amateur fiction to become popular through reader-to-reader interactions. Fanfiction platforms and websites for book reviews and discussion (e.g. Goodreads) are now some of the most thriving environments in which to study narrative, fiction, and reader response. In the fanfiction space, readers become writers, viewers become creators, and recipients become participants in their favorite (fictional) universes. Since fanfiction writers publish their own works independently of publishing houses and editors, their creativity knows no bounds and is not subject to limitations or censorship. Writers easily cross from one fictional universe (fandom) into another, or place themselves (or the reader) as characters in their favorite stories. Fanfiction has become an integral part of transformative fan culture, a cultural phenomenon in its own right.</p><p>Up to 2022 (the cut-off point of our data collection), more than 8.7 million stories were published on AO3 in the English language alone. Additionally, there is wide coverage of other languages, including Chinese, Italian, and Korean, as well as many so-called low-resource languages, such as Bahasa Indonesia or isiZulu. Therefore, the fanfiction domain holds immense potential for the study of user-produced narratives, readers' response, semantic and narrative modeling approaches, and for the development of natural language processing (NLP) tools for low- or under-resourced languages. 
While certain individual features of the fanfiction domain have been investigated, such as user-selected and user-provided tags <ref type="bibr" target="#b8">[9]</ref>, the narratives in their entirety are largely under-studied.</p><p>The GOLEM triple store is a large and easily accessible resource for querying AO3 data; more data sources will be added later on. We demonstrate the potential of the triple store in three case studies.</p><p>The article is structured as follows: Section 2 describes the data and its representation in the triple store. Section 3 outlines the conversion workflow. Section 4 presents some illustrative case studies. Section 5 contains the discussion, and Section 6 describes future work and planned extensions.</p><p>User-selected and user-generated tags are maintained via the predicate golem:keyword. Notably, the GOLEM triple store does not provide access to the full text, which remains accessible solely through the Archive of Our Own. However, text-based features relating to events and characters are planned to be incorporated in the knowledge graph, see Section 6. Table <ref type="table" target="#tab_0">1</ref> explains the relevant data fields and gives examples where needed. Some values that were originally aggregated in lists, such as additional tags in Figure <ref type="figure" target="#fig_0">1</ref>, are split into multiple triples, e.g. golem:keyword, to make the data easier to explore (for instance, by querying specific keywords in fandoms).</p><p>For internal use, the data was first harvested from the archive.org archive <ref type="foot" target="#foot_1">3</ref> and stored in an internal Elasticsearch database with the help of a custom ingest script <ref type="foot" target="#foot_2">4</ref>. This database has now been converted into triple store data and is available at http://graph.golemlab.eu:8890/sparql via an institutional Virtuoso server <ref type="foot" target="#foot_3">5</ref>. Virtuoso was chosen because it scales well with growing knowledge graphs. 
Even when holding multiple billions of triples in a single-instance, single-machine setup, Virtuoso still performs well in a real-life setting <ref type="foot" target="#foot_4">6</ref>.</p><p>To date, metadata for 8 million stories has been made available. The data in the GOLEM triple store comprises the story-related metadata in AO3 up to and including December 2022. With this cut-off we want to limit the number of stories that were potentially written with the aid of large language models, allowing for a more reliable investigation of human storytelling. While comparing human-generated stories with narratives produced by large language models could be an interesting area of research, it is not currently within the scope of the project. Extending the knowledge graph with more recent (human) user-produced stories from AO3 and other fanfiction platforms is planned for future updates.</p></div>
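<div xmlns="http://www.tei-c.org/ns/1.0"><p>As a rough illustration of programmatic access, the following Python sketch sends a query to the SPARQL endpoint named above and flattens the JSON result. It assumes that the endpoint follows the standard SPARQL 1.1 Protocol (query passed as a GET parameter, results requested as application/sparql-results+json); the helper names build_request, parse_bindings and run_query are hypothetical and not part of any GOLEM tooling.</p>
<p><code lang="python"><![CDATA[
```python
import json
import urllib.parse
import urllib.request

# Endpoint URL as given in the text.
ENDPOINT = "http://graph.golemlab.eu:8890/sparql"


def build_request(query, endpoint=ENDPOINT):
    """Build a GET request that asks for SPARQL JSON results."""
    url = endpoint + "?" + urllib.parse.urlencode({"query": query})
    return urllib.request.Request(
        url, headers={"Accept": "application/sparql-results+json"}
    )


def parse_bindings(payload):
    """Flatten a SPARQL JSON result into a list of {variable: value} dicts."""
    return [
        {var: cell["value"] for var, cell in row.items()}
        for row in payload["results"]["bindings"]
    ]


def run_query(query, endpoint=ENDPOINT, timeout=60):
    """Send the query over the network and return the flattened rows."""
    request = build_request(query, endpoint)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return parse_bindings(json.load(response))
```
]]></code></p>
<p>Calling run_query with any SELECT query returns one dictionary per result row; all values arrive as strings, so counts need an explicit int() conversion before further processing.</p></div>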
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Workflow</head><p>We transferred the Elasticsearch data into the triple store in a series of steps. First, the database was queried for stories in languages other than English. The smaller language sets were converted to triples in one step. Larger language sets, such as Russian and Chinese, were processed in batches of 50,000 stories. The English data was queried from the database by fandom. The larger fandoms were processed in batches, while the smaller fandoms, i.e. those with few or very few stories, were queried sequentially from the database before they were converted into triples according to the schema above. This process has two time-consuming bottlenecks: the download and the import into the Virtuoso instance. This is illustrated using the example of English fanfiction stories for the Attack on Titan anime in Table <ref type="table" target="#tab_1">2</ref>. The Elasticsearch data (jsonl format) for this fandom has a size of 4.9 GB, resulting in 60,503 triple store entries.</p></div>
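<div xmlns="http://www.tei-c.org/ns/1.0"><p>The conversion step described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the project's actual ingest code: the JSON field names (id, title, language, fandoms, tags), the field-to-predicate mapping, and the story IRI scheme are hypothetical; only the golem prefix and the batch size of 50,000 come from the text. List-valued fields are split into one triple per value, as described for golem:keyword.</p>
<p><code lang="python"><![CDATA[
```python
import json
from itertools import islice

GOLEM = "https://golemlab.eu/graph/"  # prefix given in the text

# Hypothetical mapping from JSON field names to golem predicates.
FIELD_TO_PREDICATE = {
    "title": "title",
    "language": "language",
    "fandoms": "fandom",
    "tags": "keyword",
}


def turtle_literal(value):
    """Serialize a value as a Turtle string literal with basic escaping."""
    text = str(value)
    # Backslashes must be escaped first so later escapes are not doubled.
    for raw, escaped in (("\\", "\\\\"), ('"', '\\"'), ("\n", "\\n"), ("\r", "\\r")):
        text = text.replace(raw, escaped)
    return f'"{text}"'


def story_to_triples(record):
    """Turn one story record into Turtle statements, one per value."""
    subject = f"<{GOLEM}story/{record['id']}>"  # hypothetical IRI scheme
    lines = []
    for field, predicate in FIELD_TO_PREDICATE.items():
        value = record.get(field)
        if value is None:
            continue
        values = value if isinstance(value, list) else [value]
        for item in values:
            lines.append(f"{subject} golem:{predicate} {turtle_literal(item)} .")
    return lines


def batches(iterable, size):
    """Yield successive lists of at most `size` items."""
    iterator = iter(iterable)
    while chunk := list(islice(iterator, size)):
        yield chunk


def jsonl_to_turtle(jsonl_lines, batch_size=50_000):
    """Yield one Turtle document (as a string) per batch of stories."""
    for chunk in batches(jsonl_lines, batch_size):
        body = [f"@prefix golem: <{GOLEM}> ."]
        for line in chunk:
            if line.strip():
                body.extend(story_to_triples(json.loads(line)))
        yield "\n".join(body) + "\n"
```
]]></code></p>
<p>Each yielded document could be written to its own TTL file before import, which keeps memory use bounded for large fandoms and language sets.</p></div>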
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Querying the triple store</head><p>Three small case studies are presented here to demonstrate how to use the triple store and to give more insight into the data it contains. First, to find out which languages are contained in the data and how many stories there are per language (stories can have more than one associated language), we can write a simple SPARQL query using COUNT. The result is a list of 110 languages in total, with the top 10 by story count presented in Table <ref type="table" target="#tab_2">3</ref>. The same query can be made for fandoms, yielding an ordered list of the fandoms with the most stories (e.g. Harry Potter J.K. Rowling: 324,767, Marvel Cinematic Universe: 252,605, Supernatural: 244,182). Next, to find out the distribution of stories per rating (e.g. Mature or Explicit) for the fandom "Artemis Fowl - Eoin Colfer", we can use an analogous query over golem:rating that is restricted to this fandom via golem:fandom. It yields a distribution of ratings of stories within the fandom, which is normalized and illustrated in Figure <ref type="figure" target="#fig_4">2</ref>. We can see that this particular fandom produces stories that are largely targeted at general audiences or teen and older audiences, with only a few explicit or mature stories. In contrast, the same query for the fandom BTS (a popular Korean boy band) produces a different distribution, with a larger proportion of explicit and mature stories (see Figure <ref type="figure" target="#fig_5">3</ref>). Content-related fields are interesting for processing the fanfiction data in downstream tasks and for deriving additional semantic information about the stories. Therefore, the last example shows how to query for a list of summaries of fanfiction stories in a specific fandom and language that are tagged with a specific keyword.</p><p>PREFIX golem: &lt;https://golemlab.eu/graph/&gt;
SELECT ?o WHERE {
  ?s golem:keyword "Angst" .
  ?s golem:fandom "Attack on Titan" .
  ?s golem:language "English" .
  ?s golem:summary ?o .
}</p><p>The results of this query are available at https://github.com/GOLEM-lab/triple_store/.</p></div>
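<div xmlns="http://www.tei-c.org/ns/1.0"><p>The second case study (rating counts per fandom, then normalization) can be sketched in Python as below. The sketch assumes that fandom names are stored as plain string literals, as in the queries shown in this section; the function names sparql_string, rating_query and normalize are hypothetical, and the query text follows the rating query described above.</p>
<p><code lang="python"><![CDATA[
```python
def sparql_string(value):
    """Quote a Python string as a SPARQL string literal."""
    escaped = (
        value.replace("\\", "\\\\")  # escape backslashes first
        .replace('"', '\\"')
        .replace("\n", "\\n")
    )
    return f'"{escaped}"'


def rating_query(fandom):
    """Build the rating-distribution query for a single fandom."""
    return (
        "PREFIX golem: <https://golemlab.eu/graph/>\n"
        "SELECT ?o (COUNT(?o) AS ?oCount) WHERE {\n"
        "  ?s golem:rating ?o .\n"
        f"  ?s golem:fandom {sparql_string(fandom)} .\n"
        "}\n"
        "GROUP BY ?o\n"
        "ORDER BY DESC(?oCount)"
    )


def normalize(counts):
    """Convert absolute counts per rating into shares that sum to 1."""
    total = sum(counts.values())
    if total == 0:
        return {rating: 0.0 for rating in counts}
    return {rating: count / total for rating, count in counts.items()}
```
]]></code></p>
<p>Normalizing the counts makes fandoms of very different sizes comparable, which is what allows the two rating distributions to be contrasted directly.</p></div>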
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Discussion</head><p>In this short paper, we present the GOLEM triple store, our effort towards a comprehensive semantic representation of fannish narratives. It provides users with manifold possibilities to study fanfiction from different viewpoints, e.g. by inspecting keywords and tags provided by the users or the distribution of romantic pairings across different fandoms. To date, the triple store contains more than 8 million stories. An overview of the statistics of the GOLEM triple store is given in Table <ref type="table" target="#tab_3">4</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Future Work</head><p>The presented triple store is the first step towards a broader knowledge base for fanfiction narratives. In the short term, the triple store will be extended with additional reader response data, such as the number of times users have bookmarked a story. It will further be extended from a story-centric view to a more complete data model based on the existing AO3 metadata, e.g. by modelling content collections according to various criteria. In the medium term, the GOLEM triple store will be extended towards a full-fledged knowledge graph of characters and events in the fanfiction domain. This includes the results of character analysis, modelling essential properties of fictional characters, i.e. physiological and psychological traits, as well as the narrative function of a character. The full knowledge graph will also contain additional data on reader response (e.g. emotions felt). The project is currently developing a comprehensible ontology <ref type="bibr" target="#b12">[13]</ref> for the modelling of (fan) narratives, which will be aligned as closely as possible with other relevant ontologies in order to maximize interoperability with other relevant projects, such as Wikidata and MiMoText <ref type="bibr" target="#b6">[7]</ref>.</p><p>We are additionally planning to regularly report up-to-date statistics on the quality of the knowledge graph (such as consistency) on the project website.</p><p>Currently, the triple store only contains stories from AO3. However, we are working on including data from other sources, such as Wattpad and fanfiction.net.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.">Acknowledgements</head><p>This work is part of the Golem Lab: Graphs and Ontologies for Literary Evolution Models project, a 5-year (2023-2027) research project funded by the European Commission (ERC StG).</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Example Metadata for AO3 data, Source: https://archiveofourown.org/</figDesc><graphic coords="3,89.29,84.19,416.68,134.52" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>SPARQL query: number of stories per language</head><figDesc>PREFIX golem: &lt;https://golemlab.eu/graph/&gt;
SELECT ?o (COUNT(?o) AS ?oCount) WHERE
{
  ?s golem:language ?o .
}
GROUP BY ?o
ORDER BY DESC(?oCount)</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>SPARQL query: number of stories per rating in one fandom</head><figDesc>PREFIX golem: &lt;https://golemlab.eu/graph/&gt;
SELECT ?o (COUNT(?o) AS ?oCount) WHERE
{
  ?s golem:rating ?o .
  ?s golem:fandom "Artemis Fowl - Eoin Colfer" .
}
GROUP BY ?o
ORDER BY DESC(?oCount)</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Distribution of Content-Ratings in Fandom "Artemis Fowl"</figDesc><graphic coords="6,172.63,278.22,250.02,135.76" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Distribution of Content-Ratings in Fandom "BTS"</figDesc><graphic coords="6,172.63,465.56,250.02,140.92" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Triple Store Predicates</figDesc><table>
<row><cell>Predicate</cell><cell>Explanation</cell><cell>Example</cell></row>
<row><cell>golem:author</cell><cell>Username Author (anonymised)</cell><cell></cell></row>
<row><cell>golem:characters</cell><cell>Characters appearing in the story</cell><cell>Molly Weasley</cell></row>
<row><cell>golem:collections</cell><cell>Title of the collection that a story is part of</cell><cell>Good Omens Minisode Minibang 2024</cell></row>
<row><cell>golem:contentWarning</cell><cell>Content warnings regarding level of violence/sexuality</cell><cell>Graphic Depictions Of Violence</cell></row>
<row><cell>golem:datePackaged</cell><cell>Date packaged for the project database</cell><cell></cell></row>
<row><cell>golem:datePublished</cell><cell>Date published on AO3</cell><cell></cell></row>
<row><cell>golem:dateModified</cell><cell>Date updated by the author</cell><cell></cell></row>
<row><cell>golem:fandom</cell><cell>Fictional universe(s) of the story</cell><cell>Good Omens (TV Show)</cell></row>
<row><cell>golem:keyword</cell><cell>User-provided content keywords</cell><cell>Loch-Ness Monster</cell></row>
<row><cell>golem:language</cell><cell>Language in which the story is written</cell><cell>English, Italiano</cell></row>
<row><cell>golem:numberOfChapters</cell><cell>Number of chapters</cell><cell></cell></row>
<row><cell>golem:numberOfComments</cell><cell>Number of comments</cell><cell></cell></row>
<row><cell>golem:numberOfKudos</cell><cell>Number of user-approvals (similar to likes)</cell><cell></cell></row>
<row><cell>golem:numberOfWords</cell><cell>Number of words</cell><cell></cell></row>
<row><cell>golem:publicationStatus</cell><cell>In-Progress or Completed</cell><cell></cell></row>
<row><cell>golem:publisher</cell><cell>Source platform</cell><cell>archiveofourown.org</cell></row>
<row><cell>golem:rating</cell><cell>Content-rating, level of sexuality/violence</cell><cell>Teen and Up Audiences</cell></row>
<row><cell>golem:romanticCategory</cell><cell>Classification for romantic relationships within the story</cell><cell>F/M, Gen (no rel.)</cell></row>
<row><cell>golem:socialRelationships</cell><cell>Social, e.g. romantic or sexual relationships between characters</cell><cell>Arthur/Molly Weasley</cell></row>
<row><cell>golem:series</cell><cell>Series the work is a part of, if any</cell><cell></cell></row>
<row><cell>golem:summary</cell><cell>Text of the summary</cell><cell></cell></row>
<row><cell>golem:title</cell><cell>Title</cell><cell></cell></row>
</table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2</head><label>2</label><figDesc>Attack on Titan Example Workflow</figDesc><table>
<row><cell>Step</cell><cell>Time (min:sec)</cell></row>
<row><cell>Elasticsearch Export</cell><cell>20:00</cell></row>
<row><cell>Copy jsonl files</cell><cell>1:47</cell></row>
<row><cell>Convert to TTL format</cell><cell>3:20</cell></row>
<row><cell>Import to Virtuoso</cell><cell>9:57</cell></row>
<row><cell>Copy TTL files for backup</cell><cell>0:11</cell></row>
</table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3</head><label>3</label><figDesc>Results for the first case study: stories per language</figDesc><table>
<row><cell>Language</cell><cell>Count</cell></row>
<row><cell>English</cell><cell>7,129,450</cell></row>
<row><cell>中文-普通话</cell><cell>448,268</cell></row>
<row><cell>Русский</cell><cell>148,981</cell></row>
<row><cell>Español</cell><cell>96,477</cell></row>
<row><cell>Français</cell><cell>41,006</cell></row>
<row><cell>Italiano</cell><cell>27,762</cell></row>
<row><cell>Português brasileiro</cell><cell>22,115</cell></row>
<row><cell>Bahasa Indonesia</cell><cell>21,605</cell></row>
<row><cell>Deutsch</cell><cell>17,757</cell></row>
<row><cell>Polski</cell><cell>15,551</cell></row>
</table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 4</head><label>4</label><figDesc>Triple Store Statistics</figDesc><table><row><cell>Count</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_0">https://archiveofourown.org</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_1">https://archive.org/details/AO3_final_location</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_2">Available at https://github.com/GOLEM-lab/golem-ingest</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_3">https://virtuoso.openlinksw.com/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_4">See UniProt https://www.w3.org/wiki/LargeTripleStores#OpenLink_Virtuoso_v7.2B_.2894.2B.2B_explicit.2C_uncounted_virtual.2Finferred.2C_in_1_instance_on_1_machine.29</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Data</head><p>Apart from the textual data provided by fanfiction writers and common metadata such as author and title, AO3 provides a wide array of additional metadata, such as user-selected content tags, characters appearing in the story, and their relationships. Particularly popular are romantic (canon and non-canon) character pairings. Users can praise and react to each other's work by giving kudos or leaving comments.</p><p>Individual stories in the triple store are identified by their story ID. Each story has a number of associated metadata items, such as summary, word count, date published, and more. To date, all predicates in the triple store use the golem prefix (https://golemlab.eu/graph/), derived in part from properties of CIDOC-CRM <ref type="bibr" target="#b9">[10]</ref>, Schema.org <ref type="bibr" target="#b10">[11]</ref>, and LRMoo <ref type="bibr" target="#b11">[12]</ref>. The triple store maintains the user-selected (upper case) and user-generated (lower case) tags via the predicate golem:keyword.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">An archive of their own: A case study of feminist HCI and values in design</title>
		<author>
			<persName><forename type="first">C</forename><surname>Fiesler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Morrison</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">S</forename><surname>Bruckman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2016 CHI conference on human factors in computing systems</title>
				<meeting>the 2016 CHI conference on human factors in computing systems</meeting>
		<imprint>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="2574" to="2585" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<ptr target="https://stats.oecd.org/glossary/detail.asp?ID=5130" />
		<title level="m">Derived data element</title>
				<imprint>
			<publisher>OECD</publisher>
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<author>
			<persName><forename type="first">P</forename><surname>Boot</surname></persName>
		</author>
		<title level="m">Mesotext: Digitised Emblems, Modelled Annotations and Humanities Scholarship</title>
				<imprint>
			<publisher>Amsterdam University Press</publisher>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<author>
			<persName><forename type="first">J</forename><surname>Jett</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Capitanu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Kudeki</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Cole</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Hu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Organisciak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Underwood</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Dickson Koehl</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Dubnicek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">S</forename><surname>Downie</surname></persName>
		</author>
		<idno type="DOI">10.13012/R2TE-C227</idno>
		<ptr target="https://wiki.htrc.illinois.edu/pages/viewpage.action?pageId=79069329" />
		<title level="m">The HathiTrust Research Center Extracted Features Dataset</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">The CONLIT Dataset of Contemporary Literature</title>
		<author>
			<persName><forename type="first">A</forename><surname>Piper</surname></persName>
		</author>
		<idno type="DOI">10.5334/johd.88</idno>
		<ptr target="http://openhumanitiesdata.metajnl.com/articles/10.5334/johd.88/" />
	</analytic>
	<monogr>
		<title level="j">Journal of Open Humanities Data</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="page">24</biblScope>
			<date type="published" when="2022">2022</date>
			<publisher>Ubiquity Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Psycholinguistic dataset on language use in 1145 novels published in English and Dutch</title>
		<author>
			<persName><forename type="first">S</forename><surname>Luoto</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Van Cranenburgh</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.dib.2020.106655</idno>
		<ptr target="https://www.sciencedirect.com/science/article/pii/S2352340920315353" />
	</analytic>
	<monogr>
		<title level="j">Data in Brief</title>
		<imprint>
			<biblScope unit="volume">34</biblScope>
			<biblScope unit="page">106655</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Smart Modelling for Literary History</title>
		<author>
			<persName><forename type="first">C</forename><surname>Schöch</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Hinzmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Röttgermann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Dietz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Klee</surname></persName>
		</author>
		<idno type="DOI">10.3366/ijhac.2022.0278</idno>
		<ptr target="https://www.euppublishing.com/doi/10.3366/ijhac.2022.0278" />
	</analytic>
	<monogr>
		<title level="j">International Journal of Humanities and Arts Computing</title>
		<imprint>
			<biblScope unit="volume">16</biblScope>
			<biblScope unit="page" from="78" to="93" />
			<date type="published" when="2022">2022</date>
			<publisher>Edinburgh University Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<title level="m" type="main">Programmable Corpora: Introducing DraCor, an Infrastructure for the Research on European Drama</title>
		<author>
			<persName><forename type="first">F</forename><surname>Fischer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Börner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Göbel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Hechtl</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Kittel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Milling</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Trilcke</surname></persName>
		</author>
		<idno type="DOI">10.5281/zenodo.4284002</idno>
		<ptr target="https://zenodo.org/record/4284002" />
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<author>
			<persName><forename type="first">L</forename><surname>Price</surname></persName>
		</author>
		<title level="m">Fandom, folksonomies and creativity: the case of the Archive of Our Own</title>
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">The CIDOC CRM, an Ontological Approach to Schema Heterogeneity</title>
		<author>
			<persName><forename type="first">M</forename><surname>Doerr</surname></persName>
		</author>
		<idno type="DOI">10.4230/DagSemProc.04391.22</idno>
		<ptr target="https://drops-dev.dagstuhl.de/entities/document/10.4230/DagSemProc.04391.22" />
	</analytic>
	<monogr>
		<title level="m">Semantic Interoperability and Integration</title>
		<editor>
			<persName><forename type="first">Y</forename><surname>Kalfoglou</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Schorlemmer</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Sheth</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Staab</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Uschold</surname></persName>
		</editor>
		<meeting><address><addrLine>Germany</addrLine></address></meeting>
		<imprint>
			<publisher>Dagstuhl</publisher>
			<date type="published" when="2005">2005</date>
			<biblScope unit="volume">4391</biblScope>
			<biblScope unit="page" from="1" to="5" />
		</imprint>
	</monogr>
	<note>Dagstuhl Seminar Proceedings (DagSemProc)</note>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Schema.org: evolution of structured data on the web</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">V</forename><surname>Guha</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Brickley</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Macbeth</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Communications of the ACM</title>
		<imprint>
			<biblScope unit="volume">59</biblScope>
			<biblScope unit="page" from="44" to="51" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">FRBRoo, the IFLA Library Reference Model, and now LRMoo: a circle of development</title>
		<author>
			<persName><forename type="first">P</forename><surname>Riva</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Žumer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IFLA WLIC 2018 - Kuala Lumpur, Malaysia - Transform Libraries, Transform Societies</title>
		<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">The Golem Ontology: Theoretical and data-driven modelling of narrative and fiction</title>
		<author>
			<persName><forename type="first">X</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Pianzola</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Pannach</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">In preparation</title>
		<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
