<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Estimating the Loss of Medieval Literature with an Unseen Species Model from Ecodiversity</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Mike</forename><surname>Kestemont</surname></persName>
							<email>mike.kestemont@uantwerpen.be</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Literature</orgName>
								<orgName type="institution">University of Antwerp</orgName>
								<address>
									<settlement>Antwerp</settlement>
									<country key="BE">Belgium</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Folgert</forename><surname>Karsdorp</surname></persName>
							<email>folgert.karsdorp@meertens.knaw.nl</email>
							<affiliation key="aff1">
								<orgName type="department">Royal Netherlands Academy of Arts and Sciences</orgName>
								<orgName type="institution">Meertens Institute</orgName>
								<address>
									<settlement>Amsterdam</settlement>
									<country key="NL">The Netherlands</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Estimating the Loss of Medieval Literature with an Unseen Species Model from Ecodiversity</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">2E56D359D5F95A0B160B237DA9F25566</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-23T22:10+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>medieval literature</term>
					<term>book history</term>
					<term>unknown species problem</term>
					<term>Middle Dutch</term>
					<term>ecodiversity</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The century-long loss of documents is one of the major impediments to the study of historic literature.</p><p>Here we focus on Middle Dutch chivalric epics (ca. 1200-1450), a genre for which little archival records exist that shed light on the survival rates of works and documents. We cast the quantitative estimation of these survival rates as a variant of the unseen species problem from ecodiversity. We apply an established non-parametric method (Chao1) and compare it to a number of common alternatives on simulated data. Finally, we discuss the implications of our results for conventional philology: our numbers suggest that the losses sustained on the level of works may be more dramatic than previously imagined, whereas those at the document-level align surprisingly well with existing estimates in book history, although these were based on completely different data sources.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The century-long loss of material artifacts is one of the major impediments to the study of the history of human culture. Across various domains in the humanities, scholars must base their study on incomplete archival collections that offer but a tiny fraction of the wealth of historical specimens that originally existed. In this contribution, we focus on the domain of literature from the High Medieval period in Western Europe, which has sustained significant losses in the past centuries. Previous work has argued that unseen species models from ecodiversity can be used to estimate the number of works (multi-copy documents) that have been lost to us [e.g., <ref type="bibr" target="#b15">16,</ref><ref type="bibr" target="#b25">26]</ref>. Although these models have already yielded interesting insights for early modern printed works, they have hardly been applied to premodern handwritten literature so far [exceptions include <ref type="bibr" target="#b12">13,</ref><ref type="bibr" target="#b25">26]</ref>. Here, we apply Chao1, a non-parametric estimator of asymptotic species richness, to a representative corpus of Middle Dutch romances and quantitatively evaluate its performance on simulated datasets. A novelty of this contribution is that we do not only estimate the proportion of lost works in this dataset, but also the number of lost documents, through an extension of Chao1, which aims to gauge the number of additional samples that would be minimally required to reach the asymptote of the species accumulation curve, estimated by the original model.</p><p>In traditional philology, a theoretical distinction is typically drawn between the abstract notion of a "work" and the physical "documents" (witnesses, carriers) in which the work is attested in some version <ref type="bibr" target="#b35">[36]</ref>. Throughout the Middle Ages (ca. 500-1500 ad), handwritten media, such as manuscripts or scrolls, were the primary physical medium for the sustainable exchange of literary texts <ref type="bibr" target="#b27">[28,</ref><ref type="bibr" target="#b1">2]</ref>. Before the advent of printing, all witnesses of a text were hand-copied from pre-existing exemplars, a practice that yielded textual traditions in which intricate interdependencies exist between copies. The document tree resulting from this process is known as the stemma codicum and such trees are nowadays studied in the field of phylogenetics <ref type="bibr" target="#b0">[1]</ref>.</p><p>Medieval text traditions, however, rarely survive in full, as many documents have now been lost, due to a variety of historical reasons, including natural or infrastructural disasters (e.g. library fires, such as the famous example of Alexandria) but also the wilful destruction by humans, such as the controlled disposition of duplicates by heritage institutions or collectors <ref type="bibr" target="#b1">[2]</ref>. Moreover, many sources have only survived fragmentarily and often the severely damaged remnants of the same book are nowadays even scattered across various locations. <ref type="foot" target="#foot_0">1</ref> This is related to the fact that, in the premodern period, book binders regularly recycled parchment codices into "maculature" that was used, for instance, to strengthen the spines of newer books, which eventually ended up in different locales <ref type="bibr" target="#b18">[19]</ref>.</p><p>We can assume that a large fraction of premodern documents, if not an absolute majority, is nowadays unknown to us <ref type="bibr" target="#b31">[32]</ref>, either because the documents no longer exist, or because they have not been recovered yet (e.g., due to cataloguing initiatives that are lagging behind) <ref type="bibr" target="#b21">[22]</ref>. Consequently, a great deal of works are also unknown to us, in the obvious case where all the documents representing a work are currently unknown <ref type="bibr" target="#b39">[40,</ref><ref type="bibr" target="#b23">24]</ref>. These assumptions are not only justified by the many references in historic sources to works that we no longer know, but also by the constant stream of new material findings nowadays -which has clearly been intensified in recent decades by the emergence of the internet and social media <ref type="bibr" target="#b20">[21]</ref>. Understanding the literary preferences of the past, and explaining historical shifts therein, is one of the core tasks of cultural studies and a prerequisite for producing valid literary histories. Nevertheless, it is clear that the situation of partial observability, outlined above, severely compromises this task: the available data only constitute a very limited sample of an original population of literature that was much larger and more diverse. In statistical terms, our present-day perspective is by necessity biased towards the materials that actually survived. Understandably, scholars invariably agree that it is vital to correct these biased preconceptions and account for the materials which are are no longer known to us <ref type="bibr" target="#b39">[40,</ref><ref type="bibr" target="#b23">24,</ref><ref type="bibr" target="#b1">2]</ref>.</p><p>Methodologically, it is important to separate the loss of documents, from the loss of works which it entails. As to the second matter, the loss of works, there has been very little empirical work in the field of medieval studies, beyond the descriptive analysis of historic references to lost works <ref type="bibr" target="#b39">[40,</ref><ref type="bibr" target="#b23">24]</ref>. There has been some empirical research into the first matter. Book historical studies (such as <ref type="bibr" target="#b1">[2]</ref>) have mainly studied the survival rate of documents on the basis of the limited set of medieval collections, of which the composition is exactly known at specific points in time. This allows one to quantify the gradual, diachronic loss of documents from these collections. While these estimates are currently among the best we have, it is clear that it can be hard to extrapolate these numbers to other regions, languages or collection environments (e.g. monastic vs. lay book possession), so that alternative approaches to complement this methodology would be a valuable addition to the field. Finally, it is worth mentioning the polemic that ensued the 2005 high-profile publication by John L. Cisne in Science <ref type="bibr" target="#b10">[11]</ref>. This paper used methods from geology and population biology to estimate the rate of manuscript loss for a set of early medieval text traditions, but has almost instantly been met with severe, yet well-founded criticism from a number of well-placed medievalists <ref type="bibr" target="#b14">[15,</ref><ref type="bibr" target="#b34">35]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Related work: bibliometry and the unknown species problem</head><p>In a pioneering contribution, Egghe &amp; Proot <ref type="bibr" target="#b15">[16]</ref> have proposed a probabilistic model that attempts to estimate the level of loss for historic works (multi-copy documents), based on the frequency with which retrieved copies of such works survive. Their case study was based on bibliometric data, drawn from a short-title catalogue of printed works from the Low Countries. Follow-up work has confirmed the practical usefulness of their approach for printed works <ref type="bibr" target="#b33">[34,</ref><ref type="bibr" target="#b22">23,</ref><ref type="bibr" target="#b21">22,</ref><ref type="bibr" target="#b32">33]</ref>. Their model can be formalized as follows:</p><formula xml:id="formula_0">f 0 ˆ= ( 1 1 + 2f 2 (a−1)f 1 ) a (1)</formula><p>In this formula, f 1 is the number of works in a given corpus that survive in exactly one copy and f 2 the number of works that survive in exactly two copies; a is the number of copies that were produced of each work, which is the so-called "run" for printed works (which they set to 500 copies). These coefficients are then used to estimate f 0 ˆ, or the proportion of lost works from the total, original population of works. In a sagacious response to Egghe &amp; Proot <ref type="bibr" target="#b15">[16]</ref>, Burrell <ref type="bibr" target="#b3">[4]</ref> has noted that their task could be considered as a variant of a much older problem, namely the "unseen species problem". This problem is studied in various fields, ranging from ecology to genetics, where scholars have to estimate aspects of species diversity (e.g. biota richness) in a specific assemblage on the basis of highly incomplete samples of the full population <ref type="bibr" target="#b13">[14]</ref>. This task has a rich tradition in biostatistics, reaching back to the 1940s, with the work of Alexander Steven Corbet, who had been trapping and inventorizing new butterflies species in British Malaya for two years <ref type="bibr" target="#b30">[31]</ref>. In collaboration with the statistician R.A. Fisher <ref type="bibr" target="#b17">[18]</ref>, he formulated a model to estimate the number of new species he would discover, if he were to continue his trapping efforts for another two years.</p><p>Nowadays there exists a variety of statistical approaches to the unseen species problem that can be borrowed from ecodiversity, an interdisciplinary domain where researchers study, amongst other things, the biota richness in ecosystems. Monitoring the number of unique species, for example, is a key task for various environmental reasons, for instance, to assess the impact of natural disasters <ref type="bibr" target="#b13">[14]</ref>. These approaches are well established <ref type="bibr" target="#b28">[29,</ref><ref type="bibr" target="#b6">7,</ref><ref type="bibr" target="#b19">20]</ref> but not all of these are applicable to our kind of data. Applying the pioneering model by Egghe &amp; Proot <ref type="bibr" target="#b15">[16]</ref>, for instance, is not without theoretical issues, because the concept of a print run is almost meaningless in this context (cf. the a coefficient in Eq. 1), even though the authors show that its effect is limited. The serial production of handwritten text carriers was extremely uncommon throughout the Middle Ages, as books were still highly customized luxury objects that were never mass-produced. It seems impossible to provide an estimate for this parameter, also because the available evidence suggests that the number of original copies per work was heavily skewed (dependent on factors such as genre, language or general prestige). It is likely that the large majority of medieval works already originally only existed in very few copies (i.e. singletons or doubletons).</p><p>To a reasonable extent, these methodological caveats are mitigated by the Chao1 estimator <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b4">5]</ref>, a non-parametric method that is robust (even universally valid) in the face of unknown species abundance distributions and enables the comparison of species richness across multiple assemblages <ref type="bibr" target="#b6">[7]</ref>. Previous, exploratory work in literary studies <ref type="bibr" target="#b25">[26]</ref> has shown that this estimator produces interesting results for handwritten sources. This method is especially attractive for highly diverse, log-normally distributed assemblages, typical of human cultural production, where many species are infrequent and thus hard to detect. In such cases, it is futile to try and offer a precise point estimate; Chao1 therefore rather offers an accurate lower bound of the number of undetected species in a sample. The estimator is given by <ref type="bibr" target="#b7">[8]</ref>:</p><formula xml:id="formula_1">f 0 ˆ=        (n − 1) n f 2 1 (2f 2 ) if f 2 &gt; 0; (n − 1) n f 1 (f 1 − 1) 2 if f 2 = 0 (2)</formula><p>Here, f 1 is the number of species sighted exactly once in the sample (singletons), f 2 the number of species that were sighted twice (doubletons), and n the observed, total sample size (cf. Eq. 1). Finally, f 0 ˆis the estimated lower bound for the number of species that do exist in the assemblage, but which were sighted zero times, i.e. the number of undetected species. To obtain a confidence interval, a simple bootstrap procedure can be applied, in which the available data is iteratively resampled <ref type="bibr" target="#b7">[8]</ref>.</p><p>An attractive feature of this estimator is that it can be naturally extended to estimate the number of lost documents (instead of the number of lost works) <ref type="bibr" target="#b9">[10]</ref>. Field workers tasked with biodiversity sampling often do not observe a substantial fraction of the biota that live in a certain assemblage. While Chao1 can estimate how many of this low-abundance species have (minimally) gone undetected, it does not tell us how much additional effort would be required to observe these, i.e. how many additional m individuals would have to be sampled to observe all of the biota at least once. Put informally, with respect to the species accumulation curve (cf. Fig. <ref type="figure" target="#fig_3">2</ref>), we would like to find out in which area the asymptote starts to kick in.</p><p>Using the same abundance data as above, this extension of Chao1 tries to estimate at which point every species would have been observed at least as a singleton. The singletons in the enlarged sample of size m + n (where n is still the number of previously observed individuals in the sample) would fall apart in two distinct categories <ref type="bibr" target="#b9">[10]</ref>: (1) singletons from the original sample, for which no additional individuals are detected by the enlarged sample, and (2) previously undetected species for which exactly one individual is observed during the additional sampling. The estimator aims to calculate the proportion between (1) and ( <ref type="formula">2</ref>) to determine m on the basis of two functions. The first function, h(x) = 2f 1 (1 + x), is a linear transformation of x, whereas the second function, v(x) = exp[x(2f 2 /f 1 )], is an exponentially increasing function; v is bound to intersect h at a certain x * &gt; 0. The number of additional m individuals that are theoretically required to observe the full richness of a population is given by: m = nx * . Here too, a bootstrapping procedure can be used to estimate a confidence interval.</p><p>Regarding historic literature, the analogy in applying this method is straightforward: how many additional documents would have to be rediscovered in the future to observe all works at least once? While this estimate has very useful, practical implications for philologists scanning archives for new fragments, the resulting number, m + n, also has theoretical relevance, since it would be reasonably close to the actual size of the original population of documents. Thus, m + n would allow us to estimate the historic loss of documents, based on a type of data that is complementary to (and even completely independent from) the archival library records mentioned above. Because of the log-normal distribution of literary works over documents, we expect that most works were of an extremely low historic abundance, i.e. they were already originally produced in very low numbers of copies. We can therefore assume that the majority of works that are currently unknown will in the future only be detected in a low number of documents. Once we would have observed all works, we can therefore expect that we would also have observed most documents. Thus, while the outcome of this method should not be treated as a precise point estimate -like Chao1, it too estimates the minimal sampling effort requiredwe argue that it offers a useful approximation of the historical loss of documents. Nevertheless, we should emphasize that this method likely yields an underestimation of the original document richness and it would not account for specific aspects of the historic document mass, for instance, in cases where presently unknown works actually survive in more than one (so far undetected) documents. We have collected the surviving works and documents from the genre of Middle Dutch chivalric epics (ridderepiek) as abundance data, where we record in how many documents a particular work has been "sighted". This data is mainly drawn from Kienhorst's acclaimed repertory <ref type="bibr" target="#b26">[27]</ref> but we have updated this information with newer, and even very recent findings (situation as of 10 July 2020). <ref type="foot" target="#foot_1">2</ref> The main bibliographic information can be gleaned from Table <ref type="table">1</ref>, showing, in the last row, how the 75 presently known works are distributed over the 167 documents (=n) that have been retrieved. 45 works are attested as unica in only a single source (=f 1); 13 works are doubletons (=f 2).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Estimating the loss of Middle Dutch chivalric epics</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Loss of works</head><p>We can plug these numbers into the equations presented above and arrive at the estimates presented in Table <ref type="table" target="#tab_1">2</ref>. Here, we additionally give the estimate of the so-called Jackknife procedure (following the reference implementation in <ref type="bibr" target="#b37">[38]</ref>), a historic alternative to the more recent estimators that generally aim to reduce the bias in estimators <ref type="bibr" target="#b2">[3]</ref>. Importantly, this approach lacks theoretical justification <ref type="bibr" target="#b11">[12]</ref> but offers a surprisingly solid baseline in many practical applications <ref type="bibr" target="#b6">[7,</ref><ref type="bibr" target="#b28">29]</ref>. We mention in passing that such techniques have already been applied in domains that border on the Humanities, such as archaeology <ref type="bibr" target="#b24">[25,</ref><ref type="bibr" target="#b16">17]</ref>. We also present the confidence intervals (CI) obtained from the bootstrap procedure: these are fairly wide but show considerable overlap, thus stressing the relative agreement between the three estimators. The distribution of the bootstrap values is shown in the rainplots (Fig. <ref type="figure" target="#fig_1">1</ref>), except for the Jackknife (for which the CI is calculated analytically). We observe that Chao1 gives the most conservative estimate for the loss of works, which was to be expected, given the fact that it estimates a lower bound for the loss. The Jackknife and EP procedure both estimate a higher loss rate (yet both in the same range).  Crucial for the discussion below, is that all three estimators for the loss of works suggest that only half (and potentially even less) of the original works that once existed are currently known to us. The final row in Table <ref type="table">1</ref> gives the estimate (with CI) for the loss of documents. While we should account for an extremely wide CI in this case, the number suggest a survival rate of ≈8.15%, i.e. of an original population of 2047 documents, only 167 have survived. We offer a final and joint visualization of the results in Fig. <ref type="figure" target="#fig_3">2</ref>. This plot shows what is known as a "species accumulation curve" <ref type="bibr" target="#b8">[9]</ref>. The blue line plots the number of retrieved works as an (asymptotic) function of the number of documents recovered in this assemblage. The full line indicates the situation for the observed sample, whereas the dashed part concerns the hypothetical increase, in the case where more "sightings" would occur in the future. The grey distribution shows the bootstrap values resulting from the minimum sampling effort estimator, broadly indicating the region where we expect the curve to hit the asymptote.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Loss of documents</head><p>The green distribution in Fig. <ref type="figure" target="#fig_3">2</ref> requires additional explanation. The available models for estimating the population size (as opposed to species diversity) of an under-sampled assemblage typically assume that we have capture-recapture information available, instead of mere  abundance data <ref type="bibr" target="#b4">[5]</ref>. We cannot extract such information from our data -because a workdocument pair can in principle only be "sighted" once and after that it is not released again "into the wild". Nevertheless, in the case of manuscripts that have been recycled into maculature, the remnants of the same document have often reappeared in different locations -an extreme example is the Roman der Lorreinen-codex of which 9 fragments resurfaced, scattered across 7 different libraries <ref type="bibr" target="#b26">[27]</ref>. We can apply Chao1 to the documents in our corpus that survive fragmentarily and represent them as abundance data, on the basis of the number of fragments that resurfaced of them. This yields an assemblage of 141 documents surviving in 181 fragments, with f 1 = 118 and f 2 = 14. The application of Chao1 yields the following estimate: 635.54 CI(449.85 -947.25) (cf. the green area in Fig. <ref type="figure" target="#fig_3">2</ref>). Note that this number does not estimate the total number of documents that once existed, but rather the size of the subset of manuscripts that were recycled into maculature. In combination with the other estimate, our analyses suggest that ≈31% of the original population of documents with chivalric Middle Dutch epics was recycled into maculature.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Simulations</head><p>In this section, we compare the performance of the three estimators for species diversity using simulated data. In the aforementioned seminal paper <ref type="bibr" target="#b17">[18]</ref>, Fisher proposed to model the abundance of species in an assemblage as S n = αx n /n, where S n is the number of species with an abundance of n, x a positive constant (0 &lt; x &lt; 1) which generally approaches 1 and α is the number of singleton species in the assemblage. This logseries is still in wide use and and can be used to define a discrete probability distribution, parameterized by two values: (i) the number of singleton species in the population and (ii) the maximum abundance for a single species (to put a practical cap on the distribution). In an iterative process, we have generated assemblages from a logseries distribution for 250 works, for a fixed f 1 = 75 and x = .99. Next, we mimicked a distribution of these works over a variable number of documents (in a linear range [500, 2500]). We then modelled historic document loss as a fully stochastic process, in which documents are randomly dropped at a certain loss rate (in the linear range [0.05-0.95]). We repeated each experiment 50 times with different random seeds. We can then assess the performance of each estimator with respect to the ground truth of 250 works. The violin plots in Fig. <ref type="figure" target="#fig_4">3</ref> show that Chao1 is the most conservative evaluator that generally realizes the smallest deviation from the ground truth (cf. dashed grey line). Fig. <ref type="figure" target="#fig_6">4</ref> plots the absolute error per estimator as a function of the varying loss rate. Here, we see that Chao1 is most robust estimator throughout, except for extremely small document keep rates (&lt; 0.1).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Discussion</head><p>In his acclaimed 2006 history of Middle Dutch literature, Van Oostrom estimated that the corpus of Middle Dutch chivalric epics must originally have comprised "at least 100 texts" <ref type="bibr" target="#b36">[37]</ref>. All the estimators considered here agree that this in all likelihood too low an estimate: it seems likely that at the very least 152 texts once existed, and potentially even more, of which only 75 (≈49%) survive now, providing an even firmer basis to the claims from a previous study <ref type="bibr" target="#b25">[26]</ref>. Chao1 proved a more reliable estimator in our simulations than the other two methods studied here, which tended to overshoot and thus overestimate the loss of works. Middle Dutch studies might have overestimated the representativeness of the surviving corpus, and future studies should attempt to account for this bias.</p><p>Although there are few previous estimates regarding the loss of works, we are on more solid grounds regarding the loss of documents. In book history, scholars have studied the loss rates for medieval documents, based on data for the sparse set of manuscript collections of which the historic composition is known, so that they can be compared to the books from these collections that are still extant today <ref type="bibr" target="#b1">[2]</ref>. Such studies have estimated a cumulative survival rate of 7% for the sort of non-illustrated manuscripts in which Middle Dutch romances typically were copied <ref type="bibr" target="#b38">[39,</ref><ref type="bibr" target="#b36">37,</ref><ref type="bibr" target="#b29">30]</ref>. While we should present these results with extreme caution for now, it  is remarkable that our analysis suggest an estimate that is in a surprisingly similar range, i.e. ≈8.15% (167/2047 documents), although with a very wide CI (1064-4006). This approach might nevertheless present an exciting new research avenue that could complement the existing insights on the basis of a fully independent kind of evidence than the data used so far. Finally, our analyses suggest that of the original population of documents with chivalric Middle Dutch epics, ≈31% was recycled into maculature (i.e. 635/2047). While more research is needed to support this claim, it is the very first time to the best of our knowledge that this proportion has been estimated in a quantitative manner. This proportion is surprisingly high, which is maybe good news for the philologist, who is after all more likely to discover fragmentary sources than intact sources.</p><p>A number of issues remain with the application of these methods that require further attention. Problematic, for instance, is our assumption that document loss has been a fully stochastic process (which is the way in which we naively simulate this phenomenon here). Although there certainly are random aspects to this process, we know from traditional book history that some codices were less likely to be lost: texts in convolutes had higher survival chances, for instance, and the same has been hypothesized for higher-end (e.g. illustrated) manuscripts <ref type="bibr" target="#b38">[39]</ref>. Future research should develop more principled, perhaps agent-based, models to simulate document loss than the fully stochastic approach adopted here. Finally, it would be interesting to extend this approach to a wider geographic and linguistic range, since these methods allow for an interesting cross-cultural comparison regarding the survival of medieval literature. This geographic variation will be a central component of our future work.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head></head><label></label><figDesc>the number of works</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Distribution of bootstrap estimations for Chao1 and ep on the Middle Dutch data. The Jackknife estimate (with its CI) is added with vertical lines.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Species accumulation curve (blue), including bootstrap distribution for minimum additional document sampling (in grey; the dashed vertical grey line indicates the non-bootstrapped estimate) and for the maculature diversity (in green).</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Results for the three estimators (with α = 50 for the Egghe &amp; Proot method) for artificial assemblages of 250 works (see dashed vertical grey line) that were stochastically downsampled.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head></head><label></label><figDesc>reconstruction error (as a function of the per-simulation loss rate) estimator Egghe &amp; Proot Jackknife Chao1</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_6"><head>Figure 4 :</head><label>4</label><figDesc>Figure4: The absolute error for each estimator in each simulation, as a function of the loss rates considered (given a ground truth of 250), with a cubic fit per method.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 :</head><label>2</label><figDesc>Diversity estimates for the Middle Dutch chivalric epics (with CI). Last row is the result for minimum additional sampling.</figDesc><table><row><cell>Method</cell><cell>Estimate</cell><cell>CI</cell></row><row><cell>Chao1</cell><cell>152.42</cell><cell>110.11 -222.98</cell></row><row><cell>Jackknife</cell><cell>177.00</cell><cell>127.81 -226.19</cell></row><row><cell>EP</cell><cell>170.71</cell><cell>116.77 -268.49</cell></row><row><cell>Minsample</cell><cell cols="2">2047.77 1064.19 -4006.42</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">A dramatic example is the Beauvais Missal, of which the dismembered folios are currently being pieced together again in a virtual reconstruction. Updates on the Broken Books project by Lisa Fagin Davis can be followed here: https://web.archive.org/save/https://brokenbooks2.omeka.net/.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">Data and code supporting this paper have been publicly archived: https://doi.org/10.5281/zenodo.4030681.</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.">Acknowledgments</head><p>The authors would like to thank Elisabeth de Bruijn and Remco Sleiderink (University of Antwerp, BE) for the stimulating discussions, in particular about maculature. Additionally, we would like to acknowledge the helpful bibliographic input of the participants at the Dark Archives 20/20 conference (https://web.archive.org/save/https://aevum.space/darkarchives) where this work was previously presented.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">The phylogeny of The Canterbury Tales</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">C</forename><surname>Barbrook</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Nature</title>
		<imprint>
			<biblScope unit="volume">394</biblScope>
			<biblScope unit="page">839</biblScope>
			<date type="published" when="1998">1998</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title level="m" type="main">Medieval Manuscript Production in the Latin West, Explorations with a Global Database</title>
		<author>
			<persName><forename type="first">E</forename><surname>Buringh</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2011">2011</date>
			<publisher>Brill</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Robust Estimation of Population Size When Capture Probabilities Vary Among Animals</title>
		<author>
			<persName><forename type="first">K</forename><surname>Burnham</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Overton</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Ecology</title>
		<imprint>
			<biblScope unit="volume">60</biblScope>
			<biblScope unit="issue">5</biblScope>
			<biblScope unit="page" from="927" to="936" />
			<date type="published" when="1979">1979</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Some comments on &quot;The estimation of lost multi-copy documents: A new type of informetrics theory&quot; by Egghe and Proot</title>
		<author>
			<persName><forename type="first">Q</forename><surname>Burrell</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Informetrics</title>
		<idno type="ISSN">1751-1577</idno>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="101" to="105" />
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Estimating the Population Size for Capture-Recapture Data with Unequal Catchability</title>
		<author>
			<persName><forename type="first">A</forename><surname>Chao</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Biometrics</title>
		<imprint>
			<biblScope unit="volume">43</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="783" to="791" />
			<date type="published" when="1987">1987</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Nonparametric Estimation of the Number of Classes in a Population</title>
		<author>
			<persName><forename type="first">A</forename><surname>Chao</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Scandinavian Journal of Statistics</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="265" to="270" />
			<date type="published" when="1984">1984</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">Species Richness: Estimation and Comparison</title>
		<author>
			<persName><forename type="first">A</forename><surname>Chao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C.-H</forename><surname>Chiu</surname></persName>
		</author>
		<idno type="DOI">10.1002/9781118445112.stat03432.pub2</idno>
		<imprint>
			<date type="published" when="2016-08">Aug. 2016</date>
			<biblScope unit="page" from="1" to="26" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Estimating diversity and entropy profiles via discovery rates of new species</title>
		<author>
			<persName><forename type="first">A</forename><surname>Chao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Jost</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Methods in Ecology and Evolution</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="issue">8</biblScope>
			<biblScope unit="page" from="873" to="882" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Entropy and the species accumulation curve: a novel entropy estimator via discovery rates of new species</title>
		<author>
			<persName><forename type="first">A</forename><surname>Chao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><forename type="middle">T</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Jost</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Methods in Ecology and Evolution</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="issue">11</biblScope>
			<biblScope unit="page" from="1091" to="1100" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Sufficient sampling for asymptotic minimum species richness estimators</title>
		<author>
			<persName><forename type="first">A</forename><surname>Chao</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Ecology</title>
		<imprint>
			<biblScope unit="volume">90</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="1125" to="1133" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">How Science Survived: Medieval Manuscripts&apos; &quot;Demography&quot; and Classic Texts&apos; Extinction</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">L</forename><surname>Cisne</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Science</title>
		<imprint>
			<biblScope unit="volume">307</biblScope>
			<biblScope unit="page" from="1305" to="1307" />
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Log-Linear Models for Capture-Recapture</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">M</forename><surname>Cormack</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Biometrics</title>
		<imprint>
			<biblScope unit="volume">45</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="395" to="413" />
			<date type="published" when="1989">1989</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Tipping the Iceberg: Missing Italian Polyphony from the Age of Schism</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">S</forename><surname>Cuthbert</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Musica Disciplina</title>
		<imprint>
			<biblScope unit="volume">54</biblScope>
			<biblScope unit="page" from="39" to="74" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Ecological Diversity: Measuring the Unmeasurable</title>
		<author>
			<persName><forename type="first">A</forename><surname>Daly</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Baetens</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">De</forename><surname>Baets</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Mathematics</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="issue">7</biblScope>
			<biblScope unit="page">119</biblScope>
			<date type="published" when="2018-07">July 2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Comment on &quot;How Science Survived: Medieval Manuscripts&apos; &quot;Demography&quot; and Classic Texts&apos; Extinction</title>
		<author>
			<persName><forename type="first">G</forename><surname>Declercq</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Science</title>
		<imprint>
			<biblScope unit="volume">310</biblScope>
			<biblScope unit="page" from="1618" to="1618" />
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">The estimation of the number of lost multi-copy documents: A new type of informetrics theory</title>
		<author>
			<persName><forename type="first">L</forename><surname>Egghe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Proot</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Informetrics</title>
		<idno type="ISSN">1751-1577</idno>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="257" to="268" />
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Estimating the Richness of a Population When the Maximum Number of Classes Is Fixed: A Nonparametric Solution to an Archaeological Problem</title>
		<author>
			<persName><forename type="first">M</forename><surname>Eren</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">PLOS ONE</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="issue">5</biblScope>
			<biblScope unit="page" from="1" to="11" />
			<date type="published" when="2012-05">May 2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">The Relation Between the Number of Species and the Number of Individuals in a Random Sample of an Animal Population</title>
		<author>
			<persName><forename type="first">R</forename><surname>Fisher</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">S</forename><surname>Corbet</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Williams</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">The Journal of Animal Ecology</title>
		<imprint>
			<biblScope unit="volume">12</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="42" to="58" />
			<date type="published" when="1943">1943</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Membra disiecta&quot;: banden met het versneden verleden</title>
		<author>
			<persName><forename type="first">D</forename><surname>Geirnaert</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Een inleiding tot de Middelnederlandse letterkunde</title>
				<editor>
			<persName><forename type="first">R</forename><surname>Jansen-Sieben</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Janssens</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">F</forename><surname>Willaert</surname></persName>
		</editor>
		<editor>
			<persName><surname>Verloren</surname></persName>
		</editor>
		<imprint>
			<date type="published" when="2000">2000</date>
			<biblScope unit="page" from="85" to="101" />
		</imprint>
	</monogr>
	<note>Medioneerlandistiek</note>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Measuring and Estimating Species Richness, Species Diversity, and Biotic Similarity from Sampling Data</title>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">J</forename><surname>Gotelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Chao</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Encyclopedia of Biodiversity</title>
				<editor>
			<persName><forename type="first">S</forename><forename type="middle">A</forename><surname>Levin</surname></persName>
		</editor>
		<meeting><address><addrLine>Waltham</addrLine></address></meeting>
		<imprint>
			<publisher>Academic Press</publisher>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="195" to="211" />
		</imprint>
	</monogr>
	<note>Second Edition</note>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Digital manuscripts as sites of touch: using social media for &apos;hands-on&apos; engagement with medieval manuscript materiality</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">M</forename><surname>Green</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Archive Journal</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<date type="published" when="2018-09">Sept. 2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Lost Incunable Editions: Closing in on an Estimate</title>
		<author>
			<persName><forename type="first">J</forename><surname>Green</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Mcintyre</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Reconstructing the Print World of Pre-Industrial Europe</title>
				<editor>
			<persName><forename type="first">F</forename><surname>Bruni</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Pettegree</surname></persName>
		</editor>
		<editor>
			<persName><surname>Brill</surname></persName>
		</editor>
		<imprint>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="55" to="72" />
		</imprint>
	</monogr>
	<note>Lost Books</note>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">The Shape of Incunable Survival and Statistical Estimation of Lost Editions</title>
		<author>
			<persName><forename type="first">J</forename><surname>Green</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Mcintyre</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Needham</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">The Papers of the Bibliographical Society of America</title>
		<imprint>
			<biblScope unit="volume">105</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="141" to="175" />
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<monogr>
		<author>
			<persName><forename type="first">T</forename><surname>Haye</surname></persName>
		</author>
		<title level="m">Verlorenes Mittelalter: Ursachen und Muster der Nichtüberlieferung mittellateinischer Literatur</title>
				<imprint>
			<publisher>Brill</publisher>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Measuring Archaeological Diversity: An Application of the Jackknife Technique</title>
		<author>
			<persName><forename type="first">D</forename><surname>Kaufman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">American Antiquity</title>
		<imprint>
			<biblScope unit="volume">63</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="73" to="85" />
			<date type="published" when="1998">1998</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Het Atlantis van de Middelnederlandse ridderepiek. Een schatting van het tekstverlies met methodes uit de ecodiversiteit</title>
		<author>
			<persName><forename type="first">M</forename><surname>Kestemont</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Karsdorp</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Spiegel der Letteren</title>
		<imprint>
			<biblScope unit="volume">61</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="271" to="290" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<monogr>
		<author>
			<persName><forename type="first">H</forename><surname>Kienhorst</surname></persName>
		</author>
		<title level="m">De handschriften van de Middelnederlandse ridderepiek</title>
				<imprint>
			<publisher>Sub Rosa</publisher>
			<date type="published" when="1988">1988</date>
			<biblScope unit="volume">1</biblScope>
		</imprint>
	</monogr>
	<note>Deventer studieën 9</note>
</biblStruct>

<biblStruct xml:id="b27">
	<monogr>
		<title level="m" type="main">Books Before Print: Electronic Representations of Literary Texts</title>
		<author>
			<persName><forename type="first">E</forename><surname>Kwakkel</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2018">2018</date>
			<publisher>Amsterdam University Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<monogr>
		<title level="m" type="main">Practical Estimation of Diversity from Abundance Data</title>
		<author>
			<persName><forename type="first">E</forename><surname>Marcon</surname></persName>
		</author>
		<ptr target="https://hal-agroparistech.archives-ouvertes.fr/hal-01212435" />
		<imprint>
			<date type="published" when="2015-10">Oct. 2015</date>
		</imprint>
	</monogr>
	<note>working paper or preprint</note>
</biblStruct>

<biblStruct xml:id="b29">
	<monogr>
		<author>
			<persName><forename type="first">U</forename><surname>Neddermeyer</surname></persName>
		</author>
		<title level="m">Schriftlichkeit und Leseinteresse im Mittelalter und in der frühen Neuzeit. Quantitative und qualitative Aspekte</title>
				<imprint>
			<publisher>Harrassowitz</publisher>
			<date type="published" when="1998">1998</date>
		</imprint>
	</monogr>
	<note>Von der Handschrift zum gedruckten Buch</note>
</biblStruct>

<biblStruct xml:id="b30">
	<analytic>
		<title level="a" type="main">Optimal prediction of the number of unseen species</title>
		<author>
			<persName><forename type="first">A</forename><surname>Orlitsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">T</forename><surname>Suresh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Wu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Proceedings of the National Academy of Sciences of the United States of America</title>
		<imprint>
			<biblScope unit="volume">113</biblScope>
			<biblScope unit="issue">47</biblScope>
			<biblScope unit="page" from="13283" to="13288" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<analytic>
		<title level="a" type="main">The Legion of the Lost. Recovering the Lost Books of Early Modern Europe</title>
		<author>
			<persName><forename type="first">A</forename><surname>Pettegree</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Reconstructing the Print World of Pre-Industrial Europe</title>
				<editor>
			<persName><forename type="first">F</forename><surname>Bruni</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Pettegree</surname></persName>
		</editor>
		<editor>
			<persName><surname>Brill</surname></persName>
		</editor>
		<imprint>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="1" to="27" />
		</imprint>
	</monogr>
	<note>Lost Books</note>
</biblStruct>

<biblStruct xml:id="b32">
	<analytic>
		<title level="a" type="main">Survival Factors of Seventeenth-Century Hand-Press Books Published in the Southern Netherlands: The Importance of Sheet Counts, Sämmelbande and the Role of Institutional Collections</title>
		<author>
			<persName><forename type="first">G</forename><surname>Proot</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Reconstructing the Print World of Pre-Industrial Europe</title>
				<editor>
			<persName><forename type="first">F</forename><surname>Bruni</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Pettegree</surname></persName>
		</editor>
		<editor>
			<persName><surname>Brill</surname></persName>
		</editor>
		<imprint>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="160" to="201" />
		</imprint>
	</monogr>
	<note>Lost Books</note>
</biblStruct>

<biblStruct xml:id="b33">
	<analytic>
		<title level="a" type="main">Estimating Editions on the Basis of Survivals: Printed Programmes of Jesuit Plays in the Provincia Flandro-Belgica before 1773, with a Note on the &quot;Book Historical Law</title>
		<author>
			<persName><forename type="first">G</forename><surname>Proot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Egghe</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">The Papers of the Bibliographical Society of America</title>
		<imprint>
			<biblScope unit="volume">102</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="149" to="174" />
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b34">
	<analytic>
		<title level="a" type="main">Treating Medieval Manuscripts as Fossils</title>
		<author>
			<persName><forename type="first">N</forename><surname>Pyenson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Pyenson</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Science</title>
		<imprint>
			<biblScope unit="volume">309</biblScope>
			<biblScope unit="page" from="698" to="701" />
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b35">
	<monogr>
		<title level="m" type="main">From Gutenberg to Google: Electronic Representations of Literary Texts</title>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">L</forename><surname>Shillingsburg</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2006">2006</date>
			<publisher>Cambridge University Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b36">
	<monogr>
		<author>
			<persName><forename type="first">F</forename><surname>Van Oostrom</surname></persName>
		</author>
		<title level="m">Stemmen op schrift. Geschiedenis van de Nederlandse literatuur van het begin tot 1300</title>
				<imprint>
			<publisher>Prometheus</publisher>
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b37">
	<analytic>
		<title level="a" type="main">SPECIES: An R Package for Species Richness Estimation</title>
		<author>
			<persName><forename type="first">J.-P</forename><surname>Wang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Statistical Software</title>
		<imprint>
			<biblScope unit="volume">40</biblScope>
			<biblScope unit="issue">9</biblScope>
			<biblScope unit="page" from="1" to="15" />
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b38">
	<monogr>
		<author>
			<persName><forename type="first">H</forename><surname>Wijsman</surname></persName>
		</author>
		<title level="m">Luxury Bound. Illustrated Manuscript Production and Noble and Princely Book Ownership in the Burgundian Netherlands</title>
				<imprint>
			<publisher>Brepols</publisher>
			<date type="published" when="1400">1400. 2010</date>
		</imprint>
	</monogr>
	<note>-1550</note>
</biblStruct>

<biblStruct xml:id="b39">
	<monogr>
		<title level="m" type="main">The lost literature of medieval England</title>
		<author>
			<persName><forename type="first">R</forename><surname>Wilson</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1952">1952</date>
			<publisher>Methuen &amp; Co</publisher>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
