<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Non-Canonical Acts and their Topical Distribution</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Christian</forename><surname>Vrangbaek</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Aarhus University</orgName>
								<address>
									<addrLine>Ringgaden 1</addrLine>
									<postCode>8000</postCode>
									<settlement>Aarhus C</settlement>
									<country key="DK">Denmark</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Eva</forename><surname>Vrangbaek</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Aarhus University</orgName>
								<address>
									<addrLine>Ringgaden 1</addrLine>
									<postCode>8000</postCode>
									<settlement>Aarhus C</settlement>
									<country key="DK">Denmark</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Márton</forename><surname>Kardos</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Aarhus University</orgName>
								<address>
									<addrLine>Ringgaden 1</addrLine>
									<postCode>8000</postCode>
									<settlement>Aarhus C</settlement>
									<country key="DK">Denmark</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Kristoffer</forename><surname>Nielbo</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Aarhus University</orgName>
								<address>
									<addrLine>Ringgaden 1</addrLine>
									<postCode>8000</postCode>
									<settlement>Aarhus C</settlement>
									<country key="DK">Denmark</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Jacob</forename><surname>Mortensen</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Aarhus University</orgName>
								<address>
									<addrLine>Ringgaden 1</addrLine>
									<postCode>8000</postCode>
									<settlement>Aarhus C</settlement>
									<country key="DK">Denmark</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">†</forename><forename type="middle">C</forename><surname>Vrangbaek</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Aarhus University</orgName>
								<address>
									<addrLine>Ringgaden 1</addrLine>
									<postCode>8000</postCode>
									<settlement>Aarhus C</settlement>
									<country key="DK">Denmark</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Non-Canonical Acts and their Topical Distribution</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">DD9A571F2B69D1459A90DCD4CE5A4DC9</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T17:42+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Apocrypha, New Testament Studies, topic modelling, classification Orcid 0009-0005-2011-3082 (C. Vrangbaek)</term>
					<term>0000-0002-4941-3434 (E. Vrangbaek)</term>
					<term>0000-0001-9652-4498 (M. Kardos)</term>
					<term>0000-0002-5116-5070 (K. Nielbo)</term>
					<term>0000-0002-7153-3707 (J. Mortensen)</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This paper investigates how we can use topic modelling to characterize and place four apocryphal, i.e. non-canonical, "Acts stories" in a corpus of ancient Greek texts. In the research field of New Testament Apocrypha, there remains uncertainty concerning the classification of apocryphal text The analysis serves the purpose of creating a structured ontology to be used in classifying New Testament Apocrypha. We attempt to show that topic modelling can be a viable tool in classifying and characterizing these texts. The results show that a) our four target texts of non-canonical "Acts stories" are ambiguous and multifaceted in their topical distribution compared to other texts in the corpus, and b) that topic modelling is a viable tool in this analysis.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>In the field of New Testament Studies, classification of the heterogenous group of non-canonical texts, i.e. apocrypha, is disputed. The main problem is that the taxonomy of modern scholars largely reproduces the ancient classifications which were shaped during the 4th and 5th-century debate of canon which tends to lead to a binary classification between either canonical or noncanonical, and, moreover, to utilize conceptualized labels such as "Gnosticism" and "Encratite" which does not do justice to the complexity of the topical variety in these texts <ref type="bibr" target="#b0">[1,</ref><ref type="bibr">2,</ref><ref type="bibr">3]</ref>. To contribute to this debate, we want to create an ontology, i.e., a structured framework, out of (apocryphal) textual data with the overarching goal of establishing a computationally driven classification system for New Testament Apocrypha within the context of the semantic web. We will investigate the topical distribution of four non-canonical acts. Non-canonical acts are stories from roughly 2nd-3rd century CE about early Christian apostles' legendary deeds and speeches <ref type="bibr">[4,</ref><ref type="bibr">2,</ref><ref type="bibr">5]</ref>. In this paper, we investigate the stories called Acta Joannis, Acta Thomae, Acta Barnabae and Acta Philippi <ref type="bibr">[6]</ref>. These texts have been chosen as tests to create the basis of including more texts. We want to contribute to define categories, textual characteristics, and relationships between texts by building a structured ontology. While the creation of this ontology extends beyond the scope of this paper, our present study of integrating topic modelling in research on New Testament Apocrypha serves a crucial step towards this endeavor. By utilizing ontological relationships and semantic annotations, in this context as topical distributions, this study explores the potential for integrating the classical theological concepts present in New Testament Apocrypha with modern computational methods, thereby bridging the gap between traditional textual analysis and advanced semantic technologies within the context of the semantic web. Our working question is how and in what way can this model contribute to the discussion of classifying non-canonical acts in a corpus of ancient Greek texts?</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Data and Methods</head><p>Our starting point is that digital methods are not quick and magic tools to solve complex questions <ref type="bibr">[7]</ref>, rather we find that constant and critical exchange between traditional and new methods in qualitative collaboration is the way forward <ref type="bibr" target="#b14">[8,</ref><ref type="bibr">9,</ref><ref type="bibr" target="#b16">10]</ref>. Due to the circumstance that this study originates from a research discipline where computational methods are not embedded, we find it important to provide a detailed description -tending to tedious -of the methodological process, since we believe that this can help to bridge the disciplinary divides with a low-practical how-to-style</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Text Corpus and Preprocessing</head><p>The first step is to provide our text corpus. The corpus serves in our experiments as the literary context for the target texts, for which reason a historical relation between corpus and target texts are needed. Our target texts, the four apocryphal acts, are written in the Ancient Greek language, for which reason we have gathered a database consisting of 2153 Ancient Greek Texts. These texts are retrieved from the Perseus Corpus, First1KGreek, Pseudepigrapha.org and Deutsche Bibelgesellschaft <ref type="bibr" target="#b17">[11,</ref><ref type="bibr" target="#b18">12]</ref>. The selection of this corpus is chosen based on its relevance for our target texts. The corpus largely sets the parameters for our topic modelling at the later stages of the experiments. The Ancient Greek texts are in a solid machine-readable state. For the corpus text to be prepared for calculations we perform a set of preprocessing steps, so that the text is cleaned and lemmatized. We clean out for any Latin characters, digits, extra whitespaces and stop words. The textual cleaning follows the logic of standard natural language processing utilities. The models for cleaning and parsing the text were built with a transformer-based pipeline called OdyCy <ref type="bibr" target="#b19">[13]</ref>. This pipeline is a single-transformer pipeline that uses the following workflow: The transformer is built on Ancient Greek Bert <ref type="bibr" target="#b20">[14]</ref>. The parser, morphologizer and lemmatizer follows the infrastructure of SpaCy <ref type="bibr" target="#b21">[15]</ref>. Before running topic modelling, we had already preprocessed the input text, so we set max_df and min_df to 1.0, since we did not want to ignore any terms in this experiment <ref type="bibr" target="#b22">[16]</ref>. When the textual preprocessing is done, we move on to vectorization. Vectorisation is a highly qualitative choice, almost a language philosophical step, in which we must decide how to represent our textual corpus as numerical vectors <ref type="bibr" target="#b23">[17]</ref>. In this case, we employ the method of term frequency-inverted document frequency (TF-IDF) which represents the logarithmic scale between a term's frequency and the inverted frequency of total documents in the corpus <ref type="bibr" target="#b24">[18,</ref><ref type="bibr" target="#b23">17,</ref><ref type="bibr" target="#b22">16]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Topic Modelling: Non-Negative Matrix Factorization</head><p>Topic modelling is a much-utilized tool when engaging in natural language processing classification tasks, mostly for the purpose of assigning a category based on the most probable topic to a text in a corpus <ref type="bibr" target="#b25">[19,</ref><ref type="bibr" target="#b26">20,</ref><ref type="bibr" target="#b27">21]</ref>. This is also partly our aim, although we do see possibilities of detecting more complex layers in the topical distribution besides a text being only a part of one single category, rather, topic modelling gives due credit to the complex topical distribution and its significance for the position of our target texts in the corpus. Topic modelling is a way of structuring our textual data to be included in a future knowledge graph and similar semantic web technologies. Topic modelling assumes that a corpus of texts consists of topics and that these topics are comprised by the words of the corpus. We utilize the kind of topic modelling called Non-negative Matrix Factorization (NMF) <ref type="bibr" target="#b28">[22,</ref><ref type="bibr" target="#b29">23]</ref>. NMF is an approach that decomposes a high-dimensional term-document matrix, i.e., a matrix consisting of the corpus documents in columns, the words in rows, and their occurrence-values in the cell entries. The occurrence-values are, as mentioned above, chosen to be TF-IDF. Based on optimization, the method of NMF calculates topical patterns by associating words with topics and topics with documents. The NMF model, so to say, produces latent topical patterns by grouping similar and co-occurring words in the corpus <ref type="bibr" target="#b30">[24]</ref>. These topics are then backtracked to fit to the documents. The number of topics to choose is important for the analytical task. If we choose too many topics, there were no interpretable coherence, so based on our knowledge of the corpus and trial-and-error process the most robust output in topics was 10 topics <ref type="bibr" target="#b31">[25]</ref>.</p><p>The 10 topics and a selection of their top words are: Testament Gospels. These genre categories did not influence the topic model's calculations but can be used by us as a navigating tool to interpret the topics. For example, we can see that Topic 7 and Topic 6 have overlapping words about justice and city, but we can see that the texts that are dominated by Topic 6 are rhetorical texts like Demosthenes, whereas Topic 7 dominate many of Philo of Alexandria's texts as well as Libanius' Declamationes which are more philosophical. These extra steps enable us to distill the words of the topic into a qualified label or notion about how to describe the topic presented.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Analysis 1: Topical Distribution of Four Non-Canonical Acts Stories</head><p>In our first analysis, we interpret the topical distribution of the Acta Joannis, Acta Thomae, Acta Barnabae and Acta Philippi <ref type="bibr" target="#b33">[27]</ref>. The results can be seen in Figure <ref type="figure" target="#fig_0">1</ref>, which shows four pie charts over the topical distribution of the four non-canonical acts. Overall, all the four target texts are dominated by topic 2 and generally they share similar groups of topics, but the distribution is not equal. If we wanted to group the target texts together in the corpus on a coarse level, they would be set in the group of topic 2. However, this would not be a discovery. Where topic modelling can lead us further is in the presence and distribution of minor topics compared to each other. The topics of Acta Joannis are distributed over mainly topics 2, 1, 9, 3, 0, 4 and 6. Topic 1 is, like topic 2, also a theological topic. The presence of topic 9, 3 and 4 is revealing of the content of the story, since these topics represent philosophically oriented words which tell about how the story engages in Hellenistic religion and philosophy. Topic 0 is the historical-political topic which is understandable since the text narrates sequences of events and speeches. We also see a small portion of Topic 6, which concerns justice and law, which resonates with a few scenes in the narrative. From the topical distribution, we can generally characterize Acta Joannis as being situated in a Jewish-Christian theological context where Hellenistic anthropocentric philosophy and religion is also present. Concerning Acta Thomae , the topical distribution is similar to that of Acta Joannis, with a large topic 2 and 1, but Acta Thomae has a more equal distribution between its subtopics, 3, 9, 4 and 0. The Acta Thomae-story is set in legendary India, where Thomas is sent off as a missionary. From manual reading, Acta Thomae and Acta Joannis are comparable in the sense of the mix between narrative and preaching, where the preaching might drag in the more philosophically weighted topics, and this relation is also visible in the topical distribution. When we inspect the topics of Acta Philippi, then the distribution is markedly different from the two previous narratives with Topic 0, the historical-political topic being the third most prominent. Although Acta Philippi in its content has a much more adventurous and mythical tone with, for example, the protagonists arriving to a city of snakes, the structural dynamic of this narrative is dominated by sequence narration where we follow event after event <ref type="bibr">[5,</ref><ref type="bibr">2]</ref>. This makes the Acta Philippi into a drier, journal-like text. The large presence of Topic 0 is also visible in the topical distribution of Acta Barnabae. This circumstance is probably linked to the fact that Acta Barnabae is situated in a church-political context of legitimizing the so-called autocephalous, i.e., independent ecclesiastical unity in Creta in the 4th century <ref type="bibr" target="#b34">[28]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Analysis 2: Position in Corpus</head><p>In the second part of the analysis, we want to address how the topical distribution analyzed in the previous part affects texts' position in the corpus. The position of each of the four target texts are visualized in Figure <ref type="figure" target="#fig_1">2</ref> The clusters are formed based on texts with a dominant topic. Those texts which have an almost unequivocal dominance of one topic, e.g., Aristotle's Problemata and The Book of Jeremiah are placed in the outskirts of each colored cluster dragging away from the center of complexity. Conversely, those texts that are close to the center and also other topical groups display diversity and complexity in their topical distribution. Some clusters appear like relatively demarcated cone-shapes like e.g. Topic 4 of Hellenistic religion and philosophy and Topic 2 of god, people, human, whereas topic 3 of Greek element-philosophy is more flat, which indicates that this topic is dispersed more evenly in the corpus. The topical distribution on the corpus level gives a navigational tool with which texts can be grouped with relative clarity. For example, it is noteworthy that classic, and almost foundational texts, of Ancient Greek language, the Iliad and Odyssey, are clearly situated in topic 4 of Greek religion and philosophy, and that later texts that are trying to imitate these, like Tryphiodorus' Sack of Troy (4th century CE) and Nonnus' Dionysiaca from the 5th century CE, almost a thousand years after Homer, have an almost identical topical distribution. All of the Acts stories are placed in the dark green cluster of Topic 2, the topic of god, people, human. But it is noticeable that they all show diversity in the distribution. Acta Joannis and Acta Thomae are placed more securely in the Topic 2 cluster, whereas Acta Philippi and Acta Barnabae are drawn toward the center which might be explained by their higher percentage of the historical-political topic Topic 0. When texts are placed in this corpus, it becomes important to iterate the basic assumption of topic modelling: that all topics are produced based on words in the corpus. This means that the corpus words are constituent of the created topics which calls for a qualitative choice of corpus texts. Our corpus consists, as mentioned, of texts that historically have shaped our four target texts, either directly or indirectly due to the circumstance that the (mostly anonymous) authors were educated people in the Greco-Roman world about whom it can be assumed that they had basic knowledge of the texts in their historical and literary context. The method of topic modelling, then, almost backtracks the literary world of the authors of our texts, of course, with the important acknowledgement that we do not have all texts which made up the author's literary context.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Concluding Remarks</head><p>In this analysis assisted by topic modelling, we were able to characterize and place four noncanonical acts based on their topical distribution. The topical distribution of the analyzed texts will be used to map ontological relationships and enhance semantic annotations in order to classify New Testament Apocrypha, among which the topical distribution is a major component. The results of this topic modelling analysis will thus be able to be included in a future New Testament Apocryphal Ontology. The navigational advantages of topic modelling allowed us to inspect the target texts in a qualitatively selected corpus consisting of texts from a similar historical and literary horizon. This ensured meaningful topics. Instead of characterizing and classifying the texts based on abstractions and taxonomy from the 4th and 5th century church-political discussions on canonization, we could characterize and classify, or at least contribute to these tasks on the basis of raw content, both on a small, close scale in the topical distribution from text to text, but also on a larger scale based on the entire corpus.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Pie chart visualization of topical distribution of the four target texts in the corpus of Ancient Greek texts. From top left to bottom right the four Acts stories are marked in the order Acta Joannis, Acta Thomae, Acta Philippi, Acta Barnabae.</figDesc><graphic coords="4,89.29,84.19,416.72,260.40" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Four scatterplot snapshots of the same corpus with different markings of a target text. The plots' positions are based on their topical distribution. From top left to bottom right the four Acts stories are marked in the order Acta Joannis, Acta Thomae, Acta Philippi, Acta Barnabae. The figure is a frozen image from one angle of a multidimensional space. The scatterplot is created based on the topical distribution of the four texts</figDesc><graphic coords="6,89.29,84.19,416.72,269.64" type="bitmap" /></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>The research in this article is funded by the Carlsberg Foundation in the Semper Ardens: Accelerate-project "Computing Antiquity: Computational Research in Ancient Text Corpora. " We would also like to thank Deutsche Bibelgesellschaft, Pseudepigrapha.org, The Perseus-Project and First1KGreek for providing texts.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<author>
			<persName><forename type="first">Hellene</forename></persName>
		</author>
		<title level="m">πόλις, πὸλεμος, βασιλεύς, ἕλλην</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<author>
			<persName><surname>God</surname></persName>
		</author>
		<title level="m">soul, heaven, word, Christ [θεός, ψυχή</title>
				<imprint/>
	</monogr>
	<note>χριστός</note>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<title level="m">god, lord, people, human being [θεός, κυριός, λαός</title>
				<imprint/>
	</monogr>
	<note>ἄνθρωπος</note>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m">body, matter, air, earth, water, fire [σῶμα, ὕλη</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<author>
			<persName><surname>Zeus</surname></persName>
		</author>
		<title level="m">Cypris, Apollo [ζέυς, παῖς, ἔρως</title>
				<imprint/>
	</monogr>
	<note>ἀπόλλων</note>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<title level="m">part, character, being, necessity, cause [μόριον, τρόπος, οὐσία, άνάγκη</title>
				<imprint/>
	</monogr>
	<note>αίτία</note>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m">law, justice, city, witness, possession</title>
				<imprint/>
	</monogr>
	<note>νόμος, δίκη, πὸλις, μὰρτυς, χρῆμα</note>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<idno type="DOI">10.2307/j.ctv2rh2cqj</idno>
		<title level="m">like, for example, Jewish Philosophy, Tragedies and New References</title>
				<editor>
			<persName><forename type="first">O</forename><surname>Lehtipuu</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Petersen</surname></persName>
		</editor>
		<meeting><address><addrLine>Atlanta</addrLine></address></meeting>
		<imprint>
			<publisher>SBL Press</publisher>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="13" to="32" />
		</imprint>
	</monogr>
	<note>Ancient Christian Apocrypha</note>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Die kanonische apostelgeschichte und die apokryphen apostelakten</title>
		<author>
			<persName><forename type="first">F</forename><surname>Bovon</surname></persName>
		</author>
		<idno type="DOI">10.1515/9783110216325</idno>
	</analytic>
	<monogr>
		<title level="m">Die Apostelgeschichte im Kontext antiker und frühchristlicher Historiographie</title>
				<meeting><address><addrLine>Berlin, New York</addrLine></address></meeting>
		<imprint>
			<publisher>Walter de Gruyter</publisher>
			<date type="published" when="2009">2009</date>
			<biblScope unit="volume">162</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">New testament apocrypha: Introduction and critique of a modern category</title>
		<author>
			<persName><forename type="first">D</forename><surname>Martin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Ancient Christian Apocrypha</title>
				<editor>
			<persName><forename type="first">C</forename><forename type="middle">W J C</forename><surname>Edwards</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">C</forename><surname>Evans</surname></persName>
		</editor>
		<meeting><address><addrLine>Grand Rapids</addrLine></address></meeting>
		<imprint>
			<publisher>Zondervan Academic</publisher>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="490" to="511" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<author>
			<persName><forename type="first">H.-J</forename><surname>Klauck</surname></persName>
		</author>
		<title level="m">The Apocryphal Acts of The Apostles: An Introduction</title>
				<imprint>
			<publisher>Baylor Univ. Press</publisher>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Canonical and apocryphal acts of apostles</title>
		<author>
			<persName><forename type="first">F</forename><surname>Bovon</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Early Christian Studies</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="page" from="165" to="194" />
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><surname>Bonnet</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">A</forename><surname>Lipsius</surname></persName>
		</author>
		<title level="m">Acta Apostolorum Apocrypha. Vols. 1-3</title>
				<meeting><address><addrLine>Mendelssohn, Leipzig</addrLine></address></meeting>
		<imprint>
			<date type="published" when="1903">1903</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">The order of things. a study on topic modelling of literary texts</title>
		<author>
			<persName><forename type="first">I</forename><surname>Uglanova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Gius</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Workshop on Computational Humanities Research (CHR 2020)</title>
				<meeting>the Workshop on Computational Humanities Research (CHR 2020)<address><addrLine>Amsterdam, the Netherlands</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="57" to="76" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Digital methods for intertextuality studies</title>
		<author>
			<persName><forename type="first">P</forename><surname>Molitor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Ritter</surname></persName>
		</author>
		<idno type="DOI">10.1515/itit-2020-0006</idno>
	</analytic>
	<monogr>
		<title level="j">IT -Information Technology</title>
		<imprint>
			<biblScope unit="volume">62</biblScope>
			<biblScope unit="page" from="49" to="51" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Blunt instrumentalism: On tools and methods</title>
		<author>
			<persName><forename type="first">D</forename><surname>Tenen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Debates in the Digital Humanities 2016</title>
				<editor>
			<persName><forename type="first">M</forename><forename type="middle">K</forename><surname>Gold</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><forename type="middle">F</forename><surname>Klein</surname></persName>
		</editor>
		<imprint>
			<publisher>University of Minnesota Press</publisher>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Composition and change in de ciuitate dei: A case study of computationally assisted methods</title>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">E H</forename><surname>Vrangbaek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">L</forename><surname>Nielbo</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Papers presented at the Eighteenth International Conference on Patristic Studies held in Oxford 2019</title>
				<imprint>
			<publisher>Peeters</publisher>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="149" to="164" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<monogr>
		<title level="m" type="main">Code for corpus</title>
		<ptr target="https://github.com/centre-for-humanities-computing/computing-antiquity" />
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<title level="m" type="main">Code for parser</title>
		<ptr target="https://github.com/centre-for-humanities-computing/odyCy" />
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Odycy -a general-purpose nlp pipeline for ancient greek</title>
		<author>
			<persName><forename type="first">J</forename><surname>Kostkan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Kardos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">P B</forename><surname>Mortensen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">L</forename><surname>Nielbo</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/2023.latechclfl-1.14" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 7th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, Association for Computational Linguistics</title>
				<meeting>the 7th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, Association for Computational Linguistics<address><addrLine>Dubrovnik, Croatia; Accessed</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2023-02">2023. Jun. 02, 2023</date>
			<biblScope unit="page" from="128" to="134" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">A pilot study for bert language modelling and morphological analysis for ancient and medieval greek</title>
		<author>
			<persName><forename type="first">P</forename><surname>Singh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Rutten</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Lefever</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2021.latechclfl-1.15</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 5th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, Association for Computational Linguistics</title>
				<editor>
			<persName><forename type="first">N</forename><forename type="middle">R S</forename><surname>Degaetano-Ortlieb</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Kazantseva</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Szpakowicz</surname></persName>
		</editor>
		<meeting>the 5th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, Association for Computational Linguistics<address><addrLine>Punta Cana, Dominican Republic</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="128" to="137" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">spacy: Industrial-strength natural language processing in python</title>
		<author>
			<persName><forename type="first">M</forename><surname>Honnibal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Montani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">V</forename><surname>Landeghem</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Boyd</surname></persName>
		</author>
		<idno type="DOI">10.5281/zenodo.1212303</idno>
		<ptr target="https://doi.org/10.5281/zenodo.1212303" />
	</analytic>
	<monogr>
		<title level="j">IEEE</title>
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Api design for machine learning software: experiences from the scikitlearn project</title>
		<author>
			<persName><forename type="first">L</forename><surname>Buitinck</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ECML PKDD Workshop: Languages for Data Mining and Machine Learning</title>
				<imprint>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="108" to="122" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Introduction to text classification: Impact of stemming and comparing tf-idf and count vectorization as feature extraction technique</title>
		<author>
			<persName><forename type="first">A</forename><surname>Wendland</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Zenere</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Niemann</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-030-85521-5_19</idno>
	</analytic>
	<monogr>
		<title level="m">Communications in Computer and Information Science</title>
				<editor>
			<persName><forename type="first">R</forename><forename type="middle">M M</forename><surname>Yilmaz</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">P</forename><surname>Clarke</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Reiner</surname></persName>
		</editor>
		<meeting><address><addrLine>Cham</addrLine></address></meeting>
		<imprint>
			<publisher>Springer International Publishing</publisher>
			<date type="published" when="2021">2021</date>
			<biblScope unit="volume">1442</biblScope>
			<biblScope unit="page" from="289" to="300" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Introduction to tf-idf: To represent importance of keyword within whole dataset</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">D</forename><surname>Mehare</surname></persName>
		</author>
		<idno type="DOI">10.22214/ijraset.2018.3369</idno>
	</analytic>
	<monogr>
		<title level="j">IJRASET</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="page" from="2321" to="2323" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Topic modelling with nmf vs. expert topic annotation: The case study of russian fiction</title>
		<author>
			<persName><forename type="first">T</forename><surname>Sherstinova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Mitrofanova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Skrebtsova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Zamiraylova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Kirina</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-030-608</idno>
	</analytic>
	<monogr>
		<title level="m">Advances in Computational Intelligence</title>
		<title level="s">Lecture Notes in Computer Science</title>
		<editor>
			<persName><forename type="first">L</forename><surname>Martínez-Villasenor</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">O</forename><surname>Herrera-Alcántara</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">H</forename><surname>Ponce</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">F</forename><forename type="middle">A</forename><surname>Castro-Espinoza</surname></persName>
		</editor>
		<meeting><address><addrLine>Cham</addrLine></address></meeting>
		<imprint>
			<publisher>Springer International Publishing</publisher>
			<date type="published" when="2020">2020</date>
			<biblScope unit="volume">12026</biblScope>
			<biblScope unit="page" from="134" to="151" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">Topic modeling: A comprehensive review</title>
		<author>
			<persName><forename type="first">P</forename><surname>Kherwa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Bansal</surname></persName>
		</author>
		<idno type="DOI">10.4108/eai.13-7-2018.159623</idno>
		<ptr target="https://eudl.eu/doi/10.4108/eai.13-7-2018.159623" />
	</analytic>
	<monogr>
		<title level="j">EAI Endorsed Transactions on Scalable Information Systems</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<date type="published" when="2019-03-12">2019. Mar. 12, 2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<analytic>
		<title level="a" type="main">The interpretation of topic models for scholarly analysis: An evaluation and critique of current practice</title>
		<author>
			<persName><forename type="first">M</forename><surname>Gillings</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Hardie</surname></persName>
		</author>
		<idno type="DOI">10.1093/llc/fqac075</idno>
	</analytic>
	<monogr>
		<title level="j">Digital Scholarship in the Humanities</title>
		<imprint>
			<biblScope unit="volume">38</biblScope>
			<biblScope unit="page" from="530" to="543" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<analytic>
		<title level="a" type="main">Learning the parts of objects by non-negative matrix factorization</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">D</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">S</forename><surname>Seung</surname></persName>
		</author>
		<idno type="DOI">10.1038/44565</idno>
	</analytic>
	<monogr>
		<title level="j">Nature</title>
		<imprint>
			<biblScope unit="volume">401</biblScope>
			<biblScope unit="page" from="788" to="791" />
			<date type="published" when="1999">1999</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<monogr>
		<title level="m" type="main">Blueprints for Text Analytics Using Python: Machine Learning-Based Solutions for Common Real World (NLP) Applications</title>
		<author>
			<persName><forename type="first">J</forename><surname>Albrecht</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Ramachandran</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Winkler</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2021">2021</date>
			<publisher>O&apos;Reilly Media</publisher>
			<pubPlace>Sebastopol, CA</pubPlace>
		</imprint>
	</monogr>
	<note>1st ed</note>
</biblStruct>

<biblStruct xml:id="b30">
	<monogr>
		<title level="m" type="main">The why and how of nonnegative matrix factorization</title>
		<author>
			<persName><forename type="first">N</forename><surname>Gillis</surname></persName>
		</author>
		<ptr target="http://arxiv.org/abs/1401.5226" />
		<imprint>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<monogr>
		<title level="m" type="main">Practical Natural Language Processing: A Comprehensive Guide to Building Real-World NLP Systems</title>
		<author>
			<persName><forename type="first">S</forename><surname>Vajjala</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Majumder</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Gupta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Surana</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2020">2020</date>
			<publisher>O&apos;Reilly Media</publisher>
			<pubPlace>Beijing Boston Farnham Sebastopol Tokyo</pubPlace>
		</imprint>
	</monogr>
	<note>1st ed</note>
</biblStruct>

<biblStruct xml:id="b32">
	<analytic>
		<title level="a" type="main">Reading tea leaves: How humans interpret topic models</title>
		<author>
			<persName><forename type="first">J</forename><surname>Chang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Gerrish</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">L</forename><surname>Boyd-Graber</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Blei</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Advances in Neural Information Processing Systems 22</title>
				<editor>
			<persName><forename type="first">Y</forename><surname>Bengio</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">D</forename><surname>Schuurmans</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Lafferty</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">C</forename><surname>Williams</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Culotta</surname></persName>
		</editor>
		<imprint>
			<publisher>NIPS</publisher>
			<date type="published" when="2009">2009. 2009</date>
			<biblScope unit="page" from="288" to="296" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b33">
	<monogr>
		<ptr target="https://github.com/ankyloHryax/classic-topic" />
		<title level="m">Code for topic modelling</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b34">
	<analytic>
		<title level="a" type="main">An early byzantine pseudepigraphon: the apocryphal acta barnabae</title>
		<author>
			<persName><forename type="first">F</forename><surname>Cairns</surname></persName>
		</author>
		<idno type="DOI">10.1515/bz-2019-0004</idno>
	</analytic>
	<monogr>
		<title level="j">Byzantinische Zeitschrift</title>
		<imprint>
			<biblScope unit="volume">112</biblScope>
			<biblScope unit="page" from="47" to="66" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
