<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Weak Genres: Modeling Association Between Poetic Meter and Meaning in Russian Poetry</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Artjoms</forename><surname>Šeļa</surname></persName>
							<email>artjoms.sela@ijp.pan.pl</email>
							<affiliation key="aff0">
								<orgName type="department">Institute of Polish Language (Polish Academy of Sciences)</orgName>
								<address>
									<addrLine>al. Mickiewicza 31</addrLine>
									<postCode>31-120</postCode>
									<settlement>Kraków</settlement>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="institution">University of Tartu</orgName>
								<address>
									<addrLine>Ülikooli 18</addrLine>
									<postCode>50090</postCode>
									<settlement>Tartu</settlement>
									<country key="EE">Estonia</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Boris</forename><surname>Orekhov</surname></persName>
							<affiliation key="aff2">
								<orgName type="department">Higher School of Economics</orgName>
								<address>
									<addrLine>ul. Miasnitskaia 20</addrLine>
									<postCode>101000</postCode>
									<settlement>Moscow</settlement>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Roman</forename><surname>Leibov</surname></persName>
							<email>roman.leibov@gmail.com</email>
							<affiliation key="aff1">
								<orgName type="institution">University of Tartu</orgName>
								<address>
									<addrLine>Ülikooli 18</addrLine>
									<postCode>50090</postCode>
									<settlement>Tartu</settlement>
									<country key="EE">Estonia</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Weak Genres: Modeling Association Between Poetic Meter and Meaning in Russian Poetry</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">CC2461BCDE11190D1CDE9E035F05146B</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-23T22:09+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>poetry</term>
					<term>semantics</term>
					<term>meters</term>
					<term>topic modeling</term>
					<term>clustering</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This paper aims to formalize an established theory in versification studies known as "semantic halo of a meter" which states that different metrical forms in modern poetry accumulate and retain distinct semantic associations. We use LDA topic modeling on a large-scale corpus of Russian poetry  to represent each poem in one topic space and then proceed to represent each meter as a distribution of aggregated topic probabilities. Using unsupervised classification and extensive sampling we show that robust form-meaning associations are present both within and between metrical forms: two samples of the same meter tend to appear most similar, while two metrical forms of the same family tend to group together. This effect is present if corpus is controlled for chronology and is not an artifact of population size. We argue that similar approach could be used to align and compare semantic halos across languages and traditions to give meaningful general-level answers to questions of literary history.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>The existence of a connection between a poetic form and its meaning may seem trivial. Historically, metrical differentiation was driving distinction in genres or types of poetic speech, all the way to theorized Indo-European "long" meter of epic and "short" meter of lyric verse <ref type="bibr" target="#b12">[13,</ref><ref type="bibr" target="#b27">28]</ref>. We generally do not expect an introspective meditation from a limerick, while we may expect it from a sonnet. European imitations of Dactylic Hexameter or elegiac distich in modern versification systems are thematically bound to their Classic Age sources. Does an association between poetic form and its semantics also survive in the "general-use" meters, in a modern poetic tradition where the normative connections between a genre and a form quickly decayed? The agreed answer is yes.</p><p>The ability of poetic meters to accumulate and retain distinct semantic features over time is also known as "the semantic halo of meter" in Russian school of quantitative metrical studies <ref type="bibr" target="#b48">[50,</ref><ref type="bibr" target="#b42">44,</ref><ref type="bibr" target="#b51">53]</ref>. Initial observations were based on the usage of meters by single poets <ref type="bibr" target="#b52">[54]</ref> or on anecdotal evidence (notably a few scattered in time poems composed in Trochaic Pentameter). Early scholars saw meter-meaning association as organic, i.e. some intrinsic features of rhythm shaped meter's semantics <ref type="bibr" target="#b23">[24,</ref><ref type="bibr" target="#b47">49]</ref>. Based on the close reading of thousands of 19th century poems Mikhail Gasparov demonstrated that the connection should be historical and is determined by a meter's origins in a local tradition and usage over time that accumulates a distributed, yet distinct semantic profile <ref type="bibr" target="#b13">[14]</ref>.</p><p>Despite the attractiveness of the findings, lack of formalization makes the semantic halo a target easy to criticize and hard to defend. Even if some specific "halos" are not a product of a simple sampling error, any generalizations about the mechanism itself and the structure of relationships between metrical forms remain elusive. A few previous empirical attempts to approach meter-meaning association in Russian <ref type="bibr" target="#b34">[35]</ref> and Bashkir <ref type="bibr" target="#b32">[33]</ref> poetry were able to broadly confirm lexical differences between metrical forms while relying on wordlists comparison, which provides us an entry point to the problem.</p><p>This study tries to address the presence of the semantic halo in Russian poetry using a set of abstracted semantic features (topics) that describe each individual poem in a uniform way. Having all texts aligned within one model allows for performing flexible tests and using classification algorithms to explicate and verify scholarly assumptions. We rely on hierarchical clustering to assess the level of within-meter semantic similarities (are meters similar to themselves) and between-meter relationships (how metrical forms relate to each other). Following the analysis we discuss how formalizing the semantic halo of meter could enhance our understanding of it as a mechanism of cultural transmission and how a similar approach could be used to study the halo effect across various languages and traditions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Corpus</head><p>Data used in this study comes from Poetry Sub-collection of the Russian National Corpus <ref type="bibr">[40]</ref> that includes texts spanning from the 18th to the late 20th century. It roughly covers the whole history of modern Russian versification that started with the introduction of German accentual-syllabic verse in 1730s. The corpus has a clear canonicity bias in its design: 18th and 19th century texts were included in the collection based on their availability in the 20th century scholarly editions <ref type="bibr" target="#b24">[25]</ref>. This leaves a lot of earlier poetic production outside of academic canon unaccounted for and partially drives the inequality in chronological distribution of the poems: more than 75% of texts come from the 20th century. This is also a very non-uniform pool of texts because starting with 1917 Russian poetry split in three generally isolated traditions -Soviet, emigrée and unofficial underground. Having no automated way to separate them, we limit the corpus by the year of 1950, which roughly excludes most of the underground works and stops the timer before the noticeable drift towards the non-classical versification begins. After all subsetting operations and preprocessing steps (see Section 3) we are left with 47,804 texts <ref type="bibr">(2,275,233 words)</ref>.</p><p>This study is mainly focused on the accentual-syllabic (AS) metrical (and usually rhymed) poetry, which survived in the Russian versification much longer than in the Western traditions that turned to verse libre <ref type="bibr" target="#b16">[17]</ref>. AS systems of versification are based on strict limitations for both the number of stresses and the number of syllables in a line, as compared to purely accentual (only stress count matters) or purely syllabic (only syllable count matters). The AS meters are built of recurring smaller units of rhythm -feet that organize stressed and unstressed syllables in patterns, usually of two or three (binary or ternary feet). Since metrical scheme is an abstraction of a poetic rhythm and is constantly altered (expected stressed positions left unstressed and vice versa), we usually speak of strong vs. weak positions in a meter, instead of "stressed" or "unstressed". Table <ref type="table">B</ref>.3 provides a summary of all the classical AS forms that were used in this study. The exception are so-called "dolniks", which step away from AS by loosening rules for syllable count, but their abundance in the 20th century cannot be ignored.</p><p>We utilize the existing corpus metadata that includes annotations for poetic form to, if possible, label each poem with a single unambiguous metrical formula. Corpus annotation was done institutionally under the supervision of experts in linguistics and prosody, however, annotators' agreement or error rate was not reported <ref type="bibr" target="#b15">[16]</ref>. We expect the accuracy to be very high, especially in classic AS forms that are easy distinguishable even with the minimal training. We asked three literary scholars to verify 100 original corpus annotations for metrical forms: on average, they marked 97.7% labels as "true". Mean inter-annotator agreement was 96.6% (low agreement on what to consider "false" labels).</p><p>We were conservative in labeling texts with corpus metadata, preferring homogeneous metrical notations and excluding most of the complex cases of polymetry, logaeds or other heterogeneous forms. We also used simplified information on stanza, relying on just a general clausula pattern. Throughout this paper we use a metrical notation derived from the Russian school of metrics, e.g. Iamb-4-fm stands for Iambic Tetrameter with regularly alternating lines of feminine and masculine clausula (or acatalectic and catalectic lines).</p><p>For the purposes of this study we infer three levels of metrical expression from a single metrical formula:</p><p>1. A general family of metrical pattern (e.g., Trochee, a meter based on binary feet with the strong position on the first syllable); 2. A meter of the poem based on the number of feet (e.g., Trochee-5, Trochaic Pentameter composed of five trochaic feet) 3. A catalectic variant of the meter that describes the pattern of non-stressed syllables after the last stressed one (e.g., Trochee-5-fm; f -stands for feminine (Xu), m -masculine (X), d -dactylic (Xuu) ending of a line).</p><p>Fig. <ref type="figure" target="#fig_0">1</ref> captures this three-level distribution of poems relative to six most frequent metrical families in the corpus (only two most frequent variants per meter are displayed) and provides absolute counts of poems in a family. The dominance of Iambic meters and specifically Iamb-4 as the "normative" meter of Russian accentual-syllabic versification is self-evident. To deal with this extreme inequality in metrical forms further in the paper we heavily rely on random sampling and iterative experiments.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Modeling semantics</head><p>We aim to model meter-meaning association through the semantic features of individual poems. To do so, we train one Latent Dirichlet Allocation (LDA) <ref type="bibr" target="#b4">[5]</ref> topic model on the whole corpus, without any aggregation of poems, writing metrical labels and other metadata in document names.</p><p>Topic models is a collective name for a large family of information extraction algorithms that look for groups of co-occurring elements in a collection of documents. These groups are labelled topics (the original goal was text mining), but models are transferable to, e.g. molecules <ref type="bibr" target="#b53">[55]</ref>, music <ref type="bibr" target="#b26">[27]</ref> and genes <ref type="bibr" target="#b3">[4]</ref>, or any task that requires to abstract groups of similar behaviour from large number of features (words, chords, genes, chemical elements, etc.). Topic modeling is now widely used for text mining and classification in humanities and social sciences <ref type="bibr" target="#b19">[20,</ref><ref type="bibr" target="#b54">56,</ref><ref type="bibr" target="#b7">8</ref>, 42]; it was also shown multiple times that LDA is applicable to the corpora of smaller poetic texts <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b30">31,</ref><ref type="bibr" target="#b18">19,</ref><ref type="bibr" target="#b35">36]</ref>.</p><p>Topic models were promisingly used for modeling general questions of cultural history: rate of change in popular music <ref type="bibr" target="#b26">[27]</ref>, modes of scientific exploration of information <ref type="bibr" target="#b29">[30]</ref>, or innovation and retention in historical political discourse <ref type="bibr" target="#b2">[3]</ref>. In these cases topical representation of entities served as a mere approximation of "contents". We aim for a similar abstracted representation of poetic language, very vaguely mimicking scholars who operated high-order semantic labels to describe meanings specific to meters like Night, Road or Death (themes that, according to Gasparov, collectively express some of the main semantic directions of Trochaic Pentameter in Russian poetry <ref type="bibr" target="#b13">[14]</ref>).</p><p>LDA is a generative probabilistic model that is based on a few very important assumptions: 1) each text in the collection is assumed to be generated from k number of topics; 2) each topic is a probability distribution over all available features (where most of the features are very unlikely). LDA represents each document as probability distribution over all k topics, so that all documents could essentially be described by the equal-sized vectors in one "topic space". In other words, LDA tries to infer a specific number of groups of co-occuring words from a corpus automatically; as a consequence, each document becomes represented as combination of these groups. We consider the use of topic models crucial to our goals, because 1) LDA allows to do uniform semantic abstraction on the level of single poems; 2) it expresses each document with potentially low number of interpretable dimensions; 3) topic probabilities of poems allow for a straightforward follow-up analysis; 4) topic models make our approach independent from language and specific domain expertise.</p><p>We followed several corpus preprocessing steps before training a model:</p><p>1. All texts were lemmatized using mystem 3.1 <ref type="bibr" target="#b41">[43]</ref>; 2. General-purpose stop-word list was applied to the corpus (removing conjunctions, particles, prepositions, pronouns and numerals); 3. We wanted to reduce lexical variance of the corpus, taking only 5,000 most frequent words to build a model. LDA output usually improves with less sparse data, so removing rare words is a common procedure. However, since our goal was semantic simplification of poetic language we used a separate word embedding model trained on the same corpus to replace words outside of this "core" 5,000 (word2vec implementation via gensim Python library, vector size=300). A word was replaced if it had a semantic neighbour among its 10 contextually most similar words (measured in cosine similarity of corresponding vectors) that was also a member of the top-1000 words. This procedure allowed us to replace words with their hyponyms, more frequent synonyms or grammatical variants (e.g. replacing diminutive forms) and, in some cases, to explicate traditional metonymy of poetic language (e.g. replacing "Pontus" with "ocean"). The procedure was not perfect and introduced some noise, which however did not have noticeable effect on the model. We also note here that our results do not alter radically if this contextual replacement does not happen or if we use another limit on most frequent words (e.g. 1,000). Despite insignificant effects, we still report results for data with contextual replacements, since we believe that the chosen direction towards the semantic abstraction is important and should be improved in the future. We provide all main results for the unaltered corpus in the Appendix (Table <ref type="table">B</ref>.5). 4. The corpus was also limited by text size to introduce the LDA with at least comparable range of word distributions in a document. We removed extra-small (less than 4 lines) and extra-large (more than 100 lines) poems which left us with approximately 95% of total texts. We further trimmed the corpus based on word counts, leaving out the texts between .10 and .90 percentile of size distribution (between 20 &amp; 102 words, which approx. corresponds to 12 &amp; 50 lines poems when accounted for stop-words removal). These limitations mean that our model primarily is focused on short lyrical poetry (a dominant form in Russian tradition that experienced rapid shrinkage of mean poem length <ref type="bibr" target="#b43">[45]</ref>). We believe however, that whatever results we have should also apply to long narrative poetry, where semantic traditions of metrical usage were predicted to be much more pronounced <ref type="bibr" target="#b13">[14]</ref>. Final text count after all operations is 47,804 (of which 39,220 texts have a single label for their form, derived from the corpus annotations).</p><p>There is no universally recognized way to determine an optimal number of topics for the model <ref type="bibr" target="#b39">[41]</ref>: in this paper we report results for LDA trained with 80 topics, which was a midpoint model in a trade-off between topic coherence (log-likelihood) and perplexity ("surprise" of the model when predicting unseen data). Main study procedures were also applied to a range of LDA models with variable number of topics (from 10 to 200) that showed a robust performance overall (Table <ref type="table">B</ref>.5 in the Appendix). We set LDA priors to alpha=0.1 (we do not expect many topics generating single text, since we do not want to swamp distribution) and beta=0.3 (we do not expect too many words contributing to a topic, but some).</p><p>To perform a quick sanity check of the final model we can look at the distinctive topics in a few meters that were described before qualitatively (Table <ref type="table">B</ref>.4 in the Appendix). While some topics could be seen as compatible with assumed semantic halo of meters, there is, of course, no direct relationship. Topics do not correspond to the abstracted metrical "themes" (Gasparov also did not use them systematically across his descriptions of different meters) but still they appear interpretable and could be used for our purposes of the distributed representation of a meter's "content".</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Tracing the halo</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">Within-meter similarities</head><p>The theory of "semantic halo of meter" assumes that meaning is non-randomly distributed across metrical forms, that each meter historically builds a unique semantic valency. The theory also (somewhat implicitly) considers the halo effect cumulative: we would not be able to reconstruct a meter's semantics looking at an isolated poem, but a distinct pattern will emerge from much larger sample of meter's usage in a tradition. We can rephrase these premises to say that meter-meaning association assumes some kind of self-similarity within a poetic form. If the halo effect exists, then two independent pools of poems coming from the same meter should appear semantically closer to each other than to other samples coming from different meters.</p><p>Let us say we are an observer of the whole tradition, looking at the metrical halos from the year of 1950 (our corpus upper chronological boundary). To test if the meaning-meter association is noticeable on a general level we perform unsupervised classification on two random samples (without replacement) of 200 poems for each meter that has at least 500 poems. Per each sample we calculate mean topic probabilities to represent aggregated topic distribution within a meter. Since we are dealing with probability distributions, we proceed to calculate Jensen-Shannon divergence (symmetrical Kullback-Leibler divergence <ref type="bibr" target="#b25">[26]</ref>) between all samples and build hierarchical clusters from the resulting distances. Resampling and recalculation then continues for 100 times. This results in 100 dendrograms with clustering information that is used to build a "majority-rule" consensus tree <ref type="bibr" target="#b10">[11]</ref>: it draws branches that correspond to 50% agreement across all dendrograms, so that two branches will not be connected if they did not cluster together at least in half of the trees (Fig. <ref type="figure" target="#fig_1">2a</ref>).</p><p>The same procedure could be applied at the level of metrical variants. Because of data sparsity and noise in metrical annotation, we use only variants of Iamb-4 that have at least 200 poems, while removing the most frequent variant for its diffused semantics (Iamb-4-fm). This leaves us with only four forms of Iamb-4 (Fig. <ref type="figure" target="#fig_1">2b</ref>).</p><p>Without addressing further complications of this approach, it is clear that within-meter semantic similarities are present in the corpus. It is also apparent that semantic difference could be traced in specific metrical variations, although this level of detail will require much better annotations and stanza information. Topic information alone is enough to consistently group two arguably large samples coming from the same meter together (if the median size of a poem in our corpus is 50 words, then meter-meaning association is quite pronounced in samples of 10,000 words). We can check the "cumulative" effect of metrical halo by looking at how the performance of hierarchical clustering change with the sample size.</p><p>To evaluate unsupervised classification we use two metrics: simple Cluster Purity (CP) <ref type="bibr" target="#b6">[7]</ref> (sum of cluster matches divided by number of individual samples) and Adjusted Rand Index (ARI) <ref type="bibr" target="#b22">[23]</ref> that are designed to compare two classifications with the latter also accounting for classification by chance (returning values around 0). We use the same threshold of 500 available poems per meter and the same procedure on the increasing sample sizes (up to 250 poems), calculating CP and ARI for each case of clustering. As expected, clustering accuracy grows with the increasing number of poems in a sample, up to the median ARI=0.73 and CP=0.90 (Fig. <ref type="figure" target="#fig_1">2c</ref>). However, it is important that non-random clustering can be noticed early and some semantic patterns in meters are recognized at 20-40 poems per sample.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">Between-meter similarities</head><p>Consensus tree on Fig. <ref type="figure" target="#fig_1">2a</ref> also hints at the overarching semantic relationships between particular metrical forms. Some iambic meters tend to cluster together, the same happens with ternary forms that clearly make one group (with low distinction between Amphibrach forms). We do not really have a "ground truth" for how poetic forms should relate to each other seman- tically, except for some observations of their historical usage, similar origins, etc. However, we may expect that the transmission of meaning is at least partly bounded by metrical families (i.e. Iambic or Trochaic), since they cast broadly similar rhythmic and grammatical boundaries on language <ref type="bibr" target="#b11">[12]</ref>. In some cases meters of one family also share historical connections. For example, most Iambic meters were initially used in high-prestige genres: Iamb-4 in ode, Iamb-5 in drama, Iamb-6 in elegy. At the same time, Trochaic forms were often perceived as a counterpart to elite iambic verse; some of their rhythmical features overlapped with an oral tradition which defined their use as folklore imitations and set up corresponding association range.</p><p>We design a fairly conservative experiment to address the assumed "family influence": since the number of available meters per family is not equal, we consider only those families that have at least two populated meters (&gt; 400 poems). We take two meters per family 20 times at random; in each set of meters every meter is represented by 300 random poems; clusters are calculated the same way as described in Section 4.1 , except that now we verify clustering of forms against their family "ground truth" (Iamb, Trochee, etc.). The process is repeated 100 times for each of 20 sets of meters. We report the distribution of mean ARI and CP values per instance of sampled meters as compared to randomly assigned clusters (Fig. <ref type="figure" target="#fig_2">3a</ref>).</p><p>While the resulting clustering evaluation may not seem high (median ARI is around 0.44, CP -0.76), these values are enough to confirm that, at least to some extent, between-meter relationships are driven by the metrical families. To better demonstrate this effect, we calculated a consensus tree out of 100 clustering results without any restriction on number of meters per metrical family (Fig. <ref type="figure" target="#fig_2">3b</ref>. Iambs, Dactyls, Anapests and some Trochees tend to consistently cluster together; semantics of ternary forms remain somewhat diffused, but they still form one cluster with each other.</p><p>Cases of "wrong attribution" are potentially informative and align with the scholarly work on the subject. The similarity of Iamb-3 and Trochee-4 is well-known <ref type="bibr" target="#b13">[14]</ref>: they both originate in the 18th century's anacreontics and share "song" semantics across many variants. Iamb-43 with regular alternation of lines of different feet count also grew from a lyrical song and a ballad and was associated with lyro-epic poetry.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.3.">Semantic halo in time</head><p>It is time to abandon the position of an observer who is "looking back" at the whole tradition. Fig. <ref type="figure" target="#fig_1">2</ref> and Fig. <ref type="figure" target="#fig_2">3</ref> make it clear that clustering is to some extent enhanced by the difference in meters' distribution in time. Since LDA algorithm exploits patterns of word co-occurrence in documents, it naturally ends up with groups of words coming from different times (e.g. "folk" topic, "Soviet" topic or naturalistic war topic). In turn, this drives the divergence calculations: Soviet topic has close to zero probability in 18-19th century texts. We can see this from Dolnik and Trochee-5 consistently appearing in one cluster -two very popular 20th century forms, both rare earlier.</p><p>"Free" Iamb could be a good example here. This form was composed of unregulated alternation of iambic lines of varying feet length and almost exclusively used for specific set of genres: poetic epistolary, fables and epigrams, It was also entirely abandoned after 1850s. In all 1,200 poems of free Iamb there are only two topics that account together for 20% probability <ref type="bibr">(48,animals &amp; 46,communication)</ref>. Despite its name, this meter is frozen in time with a combination of genres imprinted on it, two samples of which is easy to cluster together. In short, our semantic abstraction is not abstract enough to disregard chronological differences.</p><p>The time, however, does not invalidate the general presence of metrical halo; after all, asynchronous development and fashion fluctuations of metrical forms shape their perceived differences and pin their associations to distinct periods (Iamb-4-fm to the "Golden Age" of 1820-1830s, Dactyl-3 to civic and political sentiment of 1850-1880s, Dolnik to modernism poetry). Controlling the corpus for chronology is useful not only for testing the halo effect on a smaller scale, it also creates opportunities to examine the meter-meaning association mechanism behavior in time. We only briefly address it here, since it is a separate problem that deserves its own experimental design and comparative data.</p><p>First, we want to see if within-meter semantic similarities are present if all poems samples are coming from the same time frame. We divide our corpus to 30-year bins (excluding the 18th century because of low variety in popular meters). For each time frame we take its six most frequent meters and run 100 iterations of clustering, taking two random samples of poems per meter and report mean ARI values (Table <ref type="table" target="#tab_0">1</ref>). These values are not directly comparable since we use different pools of meters and sample sizes (half of the minimum meter count per period, floored), but they are enough to point out that halo effect could be traced within limited time frames and, in fact, in samples much smaller than expected (see Fig. <ref type="figure" target="#fig_1">2</ref>) for some periods.</p><p>Second, it is possible to utilize chronological information to ask questions about halo behavior and semantic accumulation. If semantic halo of a meter is taken as a historical phenomenon and not as an organic one, we expect a non-uniform, decreasing strength of association between meter and meaning across time. Specifically, we expect to find a discrepancy in clustering evaluation between the early 19th century poetry and the second half of the century. This would confirm a historical diffusion of metrical semantics and unchaining the connection between form &amp; genre imposed on poetry by normative aesthetics. Since our approach to classification depends on sample size, we simply split 19th century data in two halves and observe distribution of ARI values when clustering is performed on the strictly the same set of meters and sample sizes for each of the two chronological groups.</p><p>The difference between two periods turns out to be significant (Table <ref type="table" target="#tab_1">2</ref>). Samples of 40 poems -the maximum possible sample size to use for two groups -were better distinguishing meters in the first half of the 19th century, signaling a focused metrical usage. If we increase sample size for the second half of the 19th century, then mean clustering accuracy predictably goes up (sample size=100, ARI=0.43), which points to the process of semantic accumulation in meters -they become more diffused semantically, but not unrecognizable.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.4.">Topic expression and frequency of a meter</head><p>From the start of this paper there was one daunting question: what if the whole halo effect arises from the simple fact that all metrical forms differ in popularity? It is easy to draw any kind of topic configuration from Iamb-4, less so from Iamb-3, while options for Trochee-3-dm are almost non-existent. The semantic halo thus may be born from a combination of sampling error, chronological differences and a confirmation bias in scholars reading same-meter poems similarly.</p><p>It becomes more complicated, since scholars tell us to expect that the distinctiveness of a halo is naturally reducing with the increase in a frequency of a meter. There is just no space for semantic variation in rare forms. Thus we assume that 1) the relationship between the strength of expression of a meter and its frequency should be linear; 2) if semantics are randomly assigned to meters a trend should be different.</p><p>To measure how "distinct" is semantics within a single meter we can utilize the curves of topic probabilities distribution and look at them from the perspective of "inequality". In less populated meters we expect to find fewer topics that dominate the distribution than in highly frequent forms. Fig. <ref type="figure" target="#fig_3">4a</ref> shows an example: topic probabilities are aggregated from all poems for each of the two metrical variants and arranged according to their overall contribution to meter semantics. For each of the resulting curves a Gini coefficient -originally designed for measuring national inequality in wealth distribution -could be calculated. Gini takes a value of 1 when a distribution is demonically unequal (a single topic is 100% probable) and 0 when it is perfectly equal (each of the 80 topics is 1.25% probable). It is apparent that this coefficient is enough to capture how focused a meter's semantics is at least relatively, while Gini's absolute values would be influenced by LDA priors (0.1 alpha assumes more inequality in topic probabilities of a poem than 0.5 alpha).</p><p>To verify the diffusion of a halo and its (non-)random nature we first calculate Gini coefficients for each metrical variant which was used in corpus at least 10 times. Then the same calculation is done when semantics are redistributed -per each frequency n of a real metrical form we randomly sample (without replacement) n poems from the corpus. In the end all poems end up being randomly reassigned to empty "meter bins" and this redistribution is done 20 separate times. If the halo could not be simply sampled at random, then some divergence in how inequality correlates with sample size between two groups of points should be noticeable.</p><p>Fig. <ref type="figure" target="#fig_3">4b</ref> shows differences between inequality distributions in randomly aggregated poems and actual metrical forms on log-log scales. A decrease of semantic inequality based on a meter's frequency happens as expected with very telling outliers that signal concentrated halo in frequent metrical forms (most notably free Iamb). On the other hand, inequality in randomly redistributed data decays quicker, leaving lots of actual meters above the line. This suggests that even if comparable levels of topical inequality in the very infrequent forms might occur randomly, there is no reason to expect it for semantic halo in general. It is highly improbable to randomly sample semantic curve even of Iamb-4-fm which was always considered a neutral and a general-use form.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Discussion</head><p>We were able to show that topical information alone could be used to recognize a poetic form. Poems originating in one meter retain semantic similarity to themselves. Separate meters demonstrate stable relationships between them which in part are driven by their origins in metrical family. Historical differences in classification accuracy also suggested semantic accumulation in metrical forms and a diffusion of meter's "meaning" over time without swamping it beyond recognition. These findings, we believe, confirm the theory of semantic halo and its main assumptions at least on a general level.</p><p>In the future the effect of metrical halo could be better understood in cultural evolution framework <ref type="bibr" target="#b28">[29,</ref><ref type="bibr" target="#b46">48]</ref>, which provides plenty of options to explicate the way of how we think about the workings of historical processes and cultural transmission. Cultural evolution is an emergent field that studies variation, retention and diffusion of cultural information (which is usually defined as any information acquired via social learning). This framework encompasses the research across various disciplines and domains: it was used to reconstruct cultural phylogenies of archaeological artifacts <ref type="bibr" target="#b31">[32]</ref>, folktales <ref type="bibr" target="#b45">[47]</ref> and medieval manuscripts <ref type="bibr" target="#b1">[2,</ref><ref type="bibr" target="#b57">59]</ref>; understand how people learn and contribute to cumulative forces of culture <ref type="bibr" target="#b50">[52]</ref>; address innovation rates in popular music <ref type="bibr" target="#b26">[27]</ref>; study macro patterns in language evolution <ref type="bibr" target="#b14">[15,</ref><ref type="bibr" target="#b5">6]</ref>, or the effects of population size on the diffusion and retention of cultural information <ref type="bibr" target="#b21">[22,</ref><ref type="bibr" target="#b36">37]</ref>.</p><p>It would not be too far-fetched to say that all new poems originate from previous ones. Most often they are products of imitation: it is extremely rare for a poet to completely escape any exposure to a tradition or single-handedly create a whole versification system. Should it happen, chances are high that these individual efforts will not survive for too long, simply because there will not be enough followers. Poetic forms are persistent and conservative: things like Iambic Pentameter, or rhyming, or a sonnet pattern could survive for centuries. This means new poems share plenty of formal features -like meter -with their predecessors, effectively replicating the previously used form. Arguably nothing should stop a poet in a modern highly individualized tradition of lyrical self-expression from using a metrical form absolutely free, independent from its semantic inertia. We see this is not the case.</p><p>Meters and poetic forms could be seen as behaving similarly to "transmission isolating mechanisms" <ref type="bibr" target="#b9">[10,</ref><ref type="bibr" target="#b49">51]</ref> of culture. These mechanisms are certain conditions (marriage traditions, household organization, etc.) that maintain some level of "vertical" (parents to offspring) transmission of information in culture which is usually seen as the domain of extensive "horizontal" connections (peers to peers). Meters, similarly, are limiting the semantic possibilities of poems and pushing meaning creation into fuzzy, yet distinct pathways. This ensures that meters during the modern poetic histories act as "weak genres", replicating an expanding set of features in poems that also share similar formal origins.</p><p>Why this formal isolation should happen in poetry at all? An obvious answer lies in meter's capacity as an effective mnemonic device that pushes language into a high-order pattern and enhances memorability <ref type="bibr" target="#b16">[17]</ref>. Poetic forms originated in oral traditions, which, in turn, relied on many formal mechanisms (meter, rhyme, formulaic language, formulaic plots, etc.) that limited options for how a text could be made and retold to facilitate its memorization and transmission <ref type="bibr" target="#b38">[39]</ref>. Apparently the mnemonic strength of a meter would matter in written traditions too. It is not just a form that is remembered and reproduced; by simply turning its wheels a meter carries its ever-expanding baggage further. The individual histories of metrical forms in Russian poetry had a lot of twists and turns and consciously guided revolutions in metrical expectations (because these expectations existed). However, it seems that no one could truly escape the mnemonic tyranny of the accentual-syllabic verse.</p><p>We expect the semantic halo to be present (to larger or smaller extent) in any poetic tradition based on any versification system that allows for distinct and stable poetic forms across time. Topic models could give us a set of abstract derived measures (classification accuracy, inequality, etc.) that could be compared across languages and traditions. This, in turn, provides an access to general questions of literary history: how poetic genres "fall" in time, to what extent meters continue to be recognizable (if at all), how the invention of new forms happen or what is the role of individual poets and poems in shaping semantic halo.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Code and data availability</head><p>Document-Term Matrix, final preprocessing steps, final models &amp; full analysis are openly available: https://github.com/perechen/semantic_halo_rus We have used R 4.0.2. <ref type="bibr" target="#b37">[38]</ref> for the analysis, LDA implementation in topicmodels package <ref type="bibr" target="#b17">[18]</ref>, relied on tidytext wrappings for the model's output <ref type="bibr" target="#b44">[46]</ref>, phylentropy <ref type="bibr" target="#b8">[9]</ref> and ineq <ref type="bibr" target="#b56">[58]</ref> for calculations, ggtree for drawing trees <ref type="bibr" target="#b55">[57]</ref>, patchwork for assembling plots <ref type="bibr" target="#b33">[34]</ref>. ghibli package <ref type="bibr" target="#b20">[21]</ref> provided the color palette "MononokeMedium" for the plots. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Appendix</head></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Relative distribution of metrical forms in a family. For each meter that had at least 200 poems two of its most frequent variants are represented. Absolute poem counts include everything.</figDesc><graphic coords="4,74.40,70.14,446.48,279.05" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: a, majority-rule consensus tree showing clustering agreement (Jensen-Shannon divergence, complete linkage) in 100 iterations, 2 random samples per 9 meters, 200 poems per sample. b, consensus tree showing agreement between clustering of iambic variants, 100 poems per sample. c, Performance of hierarchical clustering in CP and ARI against "ground truth" (metrical lables) and against randomly assigned clusters. Run on the same set of meters (at least 500 poems per each), 100 iterations per sample size.</figDesc><graphic coords="7,74.40,70.14,446.48,390.67" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: a, The strength of meter association by their corresponding family, clusters are calculated in equal number of meters per family (k=6, n=12, sample size = 300). 100 iterations per each 20 metrical sets. b, Consensus tree demonstrating stable semantic associations between meters (k=6, n=19, sample size = 300, trees = 100).</figDesc><graphic coords="8,74.40,70.14,446.48,223.24" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 4 :</head><label>4</label><figDesc>Figure 4: a, The difference in topic inequality (Gini coefficient) between a general-use Iamb-4-fm (0.12) and a specific rare form An-43-mf (0.23). b, Semantic inequality decay based on meter's popularity (black) vs. randomly redistributed poems (grey), 20 independent redistributions. The slopes of two linear models are different (-0.28, -0.39), and the model describes more variation in redistributed data (R 2 = 0.96) than in empirical (R 2 = 0.81).</figDesc><graphic coords="11,74.40,70.14,446.48,279.05" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Meter-meaning association in different chronological frames, 100 iterations per period</figDesc><table><row><cell>Period</cell><cell cols="3">Median ARI Poems per sample Meters</cell></row><row><cell>1800-1829</cell><cell>0.51</cell><cell>30</cell><cell>I4 If T4 I6 I5 I3</cell></row><row><cell>1830-1859</cell><cell>0.47</cell><cell>50</cell><cell>I4 T4 I6 I5 If Amph3</cell></row><row><cell>1860-1889</cell><cell>0.23</cell><cell>30</cell><cell>I4 T4 I6 I5 An3 An43</cell></row><row><cell>1890-1919</cell><cell>0.77</cell><cell>270</cell><cell>I4 I5 T4 I6 An3 T5</cell></row><row><cell>1920-1949</cell><cell>0.58</cell><cell>250</cell><cell>I4 I5 T4 T5 Dk3 An3</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2</head><label>2</label><figDesc>First and second half of the 19th century, aligned clustering. 1,000 sampling iterations for each period. Difference in ARIs distributions is significant (t-test, t=19.6, p &lt; 0.0001)</figDesc><table><row><cell>Period</cell><cell cols="3">Median ARI Poems per sample Meters</cell></row><row><cell>1800-1849</cell><cell>0.48</cell><cell>40</cell><cell>I4 T4 If I6 I5 Amph4</cell></row><row><cell>1850-1899</cell><cell>0.33</cell><cell></cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table B . 3</head><label>B3</label><figDesc>All major accentual-syllabic meters used in the study. 1 -denotes strong position in a foot (stress expected), 0 -weak. Brackets envelop syllables that may or may not appear (end of a line; dolnik's interval). English examples provided for classical meters. slee|ping, by | a bro|ther's hand Of life,| of crown,| of queen,| at once| dispatch'd</figDesc><table><row><cell>Meter</cell><cell>Type</cell><cell cols="2">Foot Example</cell><cell>Comment</cell></row><row><cell>Iamb</cell><cell>binary</cell><cell>01</cell><cell>01|01|01|01|01(00)</cell><cell>Iambic Pentameter</cell></row><row><cell cols="4">Thus was|I, Trochee binary 10 10|10|10|1(00)</cell><cell>Trochaic Tetrameter</cell></row><row><cell></cell><cell></cell><cell></cell><cell>Tell me| not in| mournful | numbers,</cell><cell></cell></row><row><cell></cell><cell></cell><cell></cell><cell>Life is | but an | empty | dream</cell><cell></cell></row><row><cell>Dactyl</cell><cell cols="3">ternary 100 100|100|100|1(00)</cell><cell>Dactylic Tetrameter</cell></row><row><cell></cell><cell></cell><cell></cell><cell>Brightest and| best of the | sons of the| morning</cell><cell></cell></row><row><cell cols="4">Amphibrach ternary 010 010|010|010|01(00)</cell><cell>Amphibrachic Tetrameter</cell></row><row><cell></cell><cell></cell><cell></cell><cell>Oh, hush thee,| my baby|, thy sire was | a knight</cell><cell></cell></row><row><cell></cell><cell></cell><cell></cell><cell>Thy mother | a lady| both lovely | and bright</cell><cell></cell></row><row><cell>Anapest</cell><cell cols="3">ternary 001 001|001(00)</cell><cell>Anapestic Dimeter</cell></row><row><cell></cell><cell></cell><cell></cell><cell>He is gone | on the moun|tain</cell><cell></cell></row><row><cell></cell><cell></cell><cell></cell><cell>He is lost | to the for|est</cell><cell></cell></row><row><cell>Dolnik</cell><cell>/</cell><cell>/</cell><cell>(00)1(0)01(0)01(00)</cell><cell>3-ictus Dolnik</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell>based on number of</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell>stressed positions (3)</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell>but unstressed syllable</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell>interval is limited</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell>to 1-2 syllables</cell></row></table></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>AŠ was funded by "Large-Scale Text Analysis and Methodological Foundations of Computational Stylistics"(NCN 2017/26/E/ HS2/01019) project supported by Polish National Science Centre. We would like to thank two anonymous reviewers for their attentive reading that allowed us to make this paper better. We thank Joanna Byszuk, Maciej Eder, Antonina Martynenko, Vera Polilova and Oleg Sobchuk for all of their contribution, help and support.</p></div>
			</div>

			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Table <ref type="table">B</ref>. <ref type="bibr" target="#b3">4</ref> Distinctive topics (words translated) in three meters compared to Gasparov's descriptions <ref type="bibr" target="#b13">[14]</ref>. Top 10 topics with most deviation from the mean are listed. Topics that could be relevant for halo are highlighted</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Meter</head><p>Halo <ref type="bibr">(Gasparov)</ref> Topic Top words Median Adjusted Rand Index values for within-meter clustering (Fig. <ref type="figure">2c</ref>) in different samples (ARI @ n poems per sample), using LDA models with varying k number of topics . ARI family column holds ARI values for between-meter similarity tests (Fig. <ref type="figure">3a</ref>). The robust performance of clustering in various models shows that number of topics has little influence on the experimental results, and, thus, surprisingly, is not really an influential variable. LDA with 20 topics shows one of the highest performances at 250 poems per sample, which further highlights the effectiveness of the reduction of semantic information. There is, however, a slight noticeable trade-off between local (within-meter) and global (between-meter) recognition (models with 80 and 100 topics seem to provide the most balanced performance). Additional tests were made for LDA models trained on the original Document-Term Matrix (without replacements of less frequent words with their more frequent semantic neighbours).</p><p>DTM k ARI@10 ARI@60 ARI@100 ARI@150 ARI@250 ARI family </p></div>			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Confirming the themes and interpretive unity of Ghazal poetry using topic models</title>
		<author>
			<persName><forename type="first">E</forename><surname>Asgari</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Ghassemi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Finlayson</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Neural Information Processing Systems (NIPS) Workshop for Topic Models</title>
				<imprint>
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">The phylogeny of The Canterbury Tales</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">C</forename><surname>Barbrook</surname></persName>
		</author>
		<idno type="DOI">10.1038/29667</idno>
	</analytic>
	<monogr>
		<title level="j">Nature</title>
		<idno type="ISSN">1476-4687</idno>
		<imprint>
			<biblScope unit="volume">394</biblScope>
			<biblScope unit="page" from="839" to="839" />
			<date type="published" when="1998-08-09">Aug. 1998. Visited on 09/17/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Individuals, institutions, and innovation in the debates of the French Revolution</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">T J</forename><surname>Barron</surname></persName>
		</author>
		<idno type="DOI">10.1073/pnas.1717729115</idno>
		<idno>Vis- ited on 07/26/2020</idno>
	</analytic>
	<monogr>
		<title level="j">Proceedings of the National Academy of Sciences</title>
		<idno type="ISSN">0027-8424</idno>
		<imprint>
			<biblScope unit="volume">115</biblScope>
			<biblScope unit="issue">18</biblScope>
			<biblScope unit="page" from="4607" to="4612" />
			<date type="published" when="2018-05">May 2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Investigating Topic Models&apos; Capabilities in Expression Microarray Data Classification</title>
		<author>
			<persName><forename type="first">M</forename><surname>Bicego</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE/ACM Transactions on Computational Biology and Bioinformatics</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="1831" to="1836" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Latent Dirichlet Allocation</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Blei</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Machine Learning Research</title>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="page" from="993" to="1022" />
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Mapping the Origins and Expansion of the Indo-European Language Family</title>
		<author>
			<persName><forename type="first">R</forename><surname>Bouckaert</surname></persName>
		</author>
		<idno type="DOI">10.1126/science.1219669</idno>
	</analytic>
	<monogr>
		<title level="j">Science</title>
		<idno type="ISSN">0036-8075</idno>
		<imprint>
			<biblScope unit="volume">337</biblScope>
			<biblScope unit="page" from="957" to="960" />
			<date type="published" when="2012-08-09">Aug. 2012. Visited on 09/17/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Why Molière most likely did write his plays</title>
		<author>
			<persName><forename type="first">F</forename><surname>Cafiero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J.-B</forename><surname>Camps</surname></persName>
		</author>
		<idno type="DOI">10.1126/sciadv.aax5489</idno>
	</analytic>
	<monogr>
		<title level="j">Science Advances</title>
		<idno type="ISSN">2375-2548</idno>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="issue">11</biblScope>
			<date type="published" when="2019-11-12">Nov. 2019. Visited on 12/16/2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Exploiting affinities between topic modeling and the sociological perspective on culture: Application to newspaper coverage of U.S. government arts funding</title>
		<author>
			<persName><forename type="first">P</forename><surname>Dimaggio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Nag</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Blei</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.poetic.2013.08.004</idno>
	</analytic>
	<monogr>
		<title level="j">Poetics</title>
		<idno type="ISSN">0304422X</idno>
		<imprint>
			<biblScope unit="volume">41</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="570" to="606" />
			<date type="published" when="2013-12-03">Dec. 2013. Visited on 03/10/2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Philentropy: Information Theory and Distance Quantification with R</title>
		<author>
			<persName><forename type="first">H.-G</forename><surname>Drost</surname></persName>
		</author>
		<idno type="DOI">10.21105/joss.00765</idno>
	</analytic>
	<monogr>
		<title level="j">Journal of Open Source Software</title>
		<idno type="ISSN">2475-9066</idno>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<date type="published" when="2018-06-09">June 2018. Visited on 09/13/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Advances in Evolutionary Culture Theory</title>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">H</forename><surname>Durham</surname></persName>
		</author>
		<idno type="DOI">10.1146/annurev.an.19.100190.001155</idno>
	</analytic>
	<monogr>
		<title level="j">Annual Review of Anthropology</title>
		<imprint>
			<biblScope unit="volume">19</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="187" to="210" />
			<date type="published" when="1990">1990</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Confidence Limits on Phylogenies: An Approach Using the Bootstrap</title>
		<author>
			<persName><forename type="first">J</forename><surname>Felsenstein</surname></persName>
		</author>
		<idno type="DOI">10.1111/j.1558-5646.1985.tb00420.x</idno>
	</analytic>
	<monogr>
		<title level="j">Evolution</title>
		<idno type="ISSN">1558-5646</idno>
		<imprint>
			<biblScope unit="volume">39</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="783" to="791" />
			<date type="published" when="1985">1985. Visited on 07/26/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">The Linguistics of Verse</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">L</forename><surname>Gasparov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Tarlinskaja</surname></persName>
		</author>
		<idno type="DOI">10.2307/20459662</idno>
	</analytic>
	<monogr>
		<title level="j">The Slavic and East European Journal</title>
		<idno type="ISSN">0037-6752</idno>
		<imprint>
			<biblScope unit="volume">52</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="198" to="207" />
			<date type="published" when="2008">2008. Visited on 07/26/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<title level="m" type="main">A History of European Versification</title>
		<author>
			<persName><forename type="first">M</forename><surname>Gasparov</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1996-07">July 1996</date>
			<publisher>Oxford University Press</publisher>
			<pubPlace>Oxford, New York</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Izda</title>
		<author>
			<persName><forename type="first">M</forename><surname>Gasparov</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Metr i smysl: ob odnom iz mekhanizmov kulturnoi pamiati</title>
				<meeting><address><addrLine>Moscow</addrLine></address></meeting>
		<imprint>
			<publisher>telskii tsentr RGGU</publisher>
			<date type="published" when="1999">1999. Visited on 07/26/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Language trees support the express-train sequence of Austronesian expansion</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">D</forename><surname>Gray</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">M</forename><surname>Jordan</surname></persName>
		</author>
		<idno type="DOI">10.1038/35016575</idno>
	</analytic>
	<monogr>
		<title level="j">Nature</title>
		<idno type="ISSN">1476- 4687</idno>
		<imprint>
			<biblScope unit="volume">405</biblScope>
			<biblScope unit="page" from="1052" to="1055" />
			<date type="published" when="2000-06-09">June 2000. Visited on 09/17/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Poeticheskii korpus v ramkah NKRIA: obschaia struktura i perspektivy ispolzovania</title>
		<author>
			<persName><forename type="first">E</forename><surname>Grishina</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Natsionalnii korpus russkogo iazyka: 2006-2008</title>
				<meeting><address><addrLine>St. Petersburg</addrLine></address></meeting>
		<imprint>
			<publisher>Nestor-Istoria</publisher>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="71" to="113" />
		</imprint>
	</monogr>
	<note>Novye rezultaty i perspektivy</note>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<title level="m" type="main">Cognitive Poetics and Cultural Memory: Russian Literary Mnemonics</title>
		<author>
			<persName><forename type="first">M</forename><surname>Gronas</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2010">2010</date>
			<publisher>Routledge</publisher>
			<pubPlace>New York</pubPlace>
		</imprint>
	</monogr>
	<note>1 edition</note>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">topicmodels: An R Package for Fitting Topic Models</title>
		<author>
			<persName><forename type="first">B</forename><surname>Grün</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Hornik</surname></persName>
		</author>
		<idno type="DOI">10.18637/jss.v040.i13</idno>
	</analytic>
	<monogr>
		<title level="j">Journal of Statistical Software</title>
		<imprint>
			<biblScope unit="volume">40</biblScope>
			<biblScope unit="issue">13</biblScope>
			<biblScope unit="page" from="1" to="30" />
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Diachronic Topics in New High German Poetry</title>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">N</forename><surname>Haider</surname></persName>
		</author>
		<ptr target="https://dev.clariah.nl/files/dh2019/boa/1031.html" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the International Digital Humantities Conference</title>
				<meeting>the International Digital Humantities Conference<address><addrLine>Utrecht</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019-09-17">2019. 09/17/2020</date>
			<biblScope unit="page" from="1" to="7" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Studying the History of Ideas Using Topic Models</title>
		<author>
			<persName><forename type="first">D</forename><surname>Hall</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Jurafsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">D</forename><surname>Manning</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Conference on Empirical Methods in Natural Language Processing. EMNLP &apos;08</title>
				<meeting>the Conference on Empirical Methods in Natural Language Processing. EMNLP &apos;08<address><addrLine>Stroudsburg, PA, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2008">2008. Visited on 12/11/2019</date>
			<biblScope unit="page" from="363" to="371" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<title level="m" type="main">ghibli: Studio Ghibli Colour Palettes</title>
		<author>
			<persName><forename type="first">E</forename><surname>Henderson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Desrosiers</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Chirico</surname></persName>
		</author>
		<ptr target="https://CRAN.R-project.org/package=ghibli(visitedon" />
		<imprint>
			<date type="published" when="2020-04">Apr. 2020. 09/13/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Demography and Cultural Evolution: How Adaptive Cultural Processes can Produce Maladaptive Losses: The Tasmanian Case</title>
		<author>
			<persName><forename type="first">J</forename><surname>Henrich</surname></persName>
		</author>
		<idno type="DOI">10.2307/4128416</idno>
	</analytic>
	<monogr>
		<title level="j">American Antiquity</title>
		<idno type="ISSN">0002-7316</idno>
		<imprint>
			<biblScope unit="volume">69</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="197" to="214" />
			<date type="published" when="2004">2004. Visited on 09/17/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Comparing partitions</title>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">J</forename><surname>Hubert</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Arabie</surname></persName>
		</author>
		<idno type="DOI">10.1007/BF01908075</idno>
	</analytic>
	<monogr>
		<title level="j">Journal of Classification</title>
		<idno type="ISSN">0176-4268</idno>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="issue">2-3</biblScope>
			<biblScope unit="page" from="193" to="218" />
			<date type="published" when="1985">1985</date>
		</imprint>
	</monogr>
	<note>Print)</note>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Toward a Description of Mácha&apos;s Verse</title>
		<author>
			<persName><forename type="first">R</forename><surname>Jakobson</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="s">Selected Writings.</title>
		<editor>R. Jakobson</editor>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="page" from="433" to="485" />
			<date type="published" when="1979">1979</date>
			<publisher>The Hague</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Poezija XX veka v poeticheskom podkorpuse Natsional&apos;nogo korpusa russkogo iazyka: problema reprezentativnosti</title>
		<author>
			<persName><forename type="first">K</forename><surname>Korchagin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Trudy instituta im. V.V. Vinogradova</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="page" from="235" to="256" />
			<date type="published" when="2015">2015. Visited on 07/26/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">On Information and Sufficiency</title>
		<author>
			<persName><forename type="first">S</forename><surname>Kullback</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">A</forename><surname>Leibler</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">The Annals of Mathematical Statistics</title>
		<idno type="ISSN">0003-4851</idno>
		<imprint>
			<biblScope unit="volume">22</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="79" to="86" />
			<date type="published" when="1951">1951. Visited on 07/26/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">The evolution of popular music: USA 1960-2010</title>
		<author>
			<persName><forename type="first">M</forename><surname>Mauch</surname></persName>
		</author>
		<idno type="DOI">10.1098/rsos.150081(visitedon</idno>
		<ptr target="http://dx.doi.org/10.1098/rsos.150081(visitedon" />
	</analytic>
	<monogr>
		<title level="j">Royal Society Open Science</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<date type="published" when="2015-07-26">2015. 07/26/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<analytic>
		<title level="a" type="main">Les origines indo-européennes des mètres grecs</title>
		<author>
			<persName><forename type="first">A</forename><surname>Meillet</surname></persName>
		</author>
		<ptr target="http://archive.org/details/lesoriginesindoe00meiluoft(visitedon" />
	</analytic>
	<monogr>
		<title level="m">Les Presses universitaires de France</title>
				<meeting><address><addrLine>Paris</addrLine></address></meeting>
		<imprint>
			<date type="published" when="1923-07-26">1923. 07/26/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><surname>Mesoudi</surname></persName>
		</author>
		<title level="m">Cultural Evolution: How Darwinian Theory Can Explain Human Culture and Synthesize the Social Sciences</title>
				<imprint>
			<publisher>University of Chicago Press</publisher>
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<analytic>
		<title level="a" type="main">Exploration and Exploitation of Victorian Science in Darwin&apos;s Reading Notebooks</title>
		<author>
			<persName><forename type="first">J</forename><surname>Murdock</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Allen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Dedeo</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.cognition.2016.11.012</idno>
	</analytic>
	<monogr>
		<title level="j">Cognition</title>
		<idno type="ISSN">00100277</idno>
		<imprint>
			<biblScope unit="volume">159</biblScope>
			<biblScope unit="page" from="117" to="126" />
			<date type="published" when="2017-02-07">Feb. 2017. Visited on 07/26/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b30">
	<analytic>
		<title level="a" type="main">On Poetic Topic Modeling: Extracting Themes and Motifs From a Corpus of Spanish Poetry</title>
		<author>
			<persName><forename type="first">B</forename><surname>Navarro-Colorado</surname></persName>
		</author>
		<idno type="DOI">10.3389/fdigh.2018.00015</idno>
	</analytic>
	<monogr>
		<title level="j">Frontiers in Digital Humanities</title>
		<idno type="ISSN">2297-2668</idno>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="page">15</biblScope>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<analytic>
		<title level="a" type="main">Evolutionary archeology: Current status and future prospects</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>O'brien</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">L</forename><surname>Lyman</surname></persName>
		</author>
		<idno type="DOI">10.1002/evan.10007</idno>
	</analytic>
	<monogr>
		<title level="j">Evolutionary Anthropology: Issues, News, and Reviews</title>
		<idno type="ISSN">1520-6505</idno>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="26" to="36" />
			<date type="published" when="2002">2002. Visited on 09/17/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b32">
	<monogr>
		<author>
			<persName><forename type="first">B</forename><surname>Orekhov</surname></persName>
		</author>
		<title level="m">Bashkirskii stikh XX veka. Korpusnoe issledovanie</title>
				<meeting><address><addrLine>St. Petersburg</addrLine></address></meeting>
		<imprint>
			<publisher>Aletejia</publisher>
			<date type="published" when="2019">2019. Visited on 07/26/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b33">
	<monogr>
		<title level="m" type="main">patchwork: The Composer of Plots</title>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">L</forename><surname>Pedersen</surname></persName>
		</author>
		<ptr target="https://CRAN.R-project.org/package=patchwork(visitedon" />
		<imprint>
			<date type="published" when="2020-06">June 2020. 09/13/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b34">
	<analytic>
		<title level="a" type="main">Semantic halo of a meter: a keyword-based approach</title>
		<author>
			<persName><forename type="first">A</forename><surname>Piperski</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Kompiuternaia lingvistika: lingvisticheskiie issledovaniia</title>
				<meeting><address><addrLine>Moscow</addrLine></address></meeting>
		<imprint>
			<publisher>RGGU</publisher>
			<date type="published" when="2017">2017</date>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="342" to="354" />
		</imprint>
	</monogr>
	<note>Kompiuternaia lingvistka i intellectualnyie tekhnoloigii</note>
</biblStruct>

<biblStruct xml:id="b35">
	<monogr>
		<title level="m" type="main">Mapping Topic Evolution Across Poetic Traditions</title>
		<author>
			<persName><forename type="first">P</forename><surname>Plechac</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">N</forename><surname>Haider</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2006.15732</idno>
		<ptr target="http://arxiv.org/abs/2006.15732" />
		<imprint>
			<date type="published" when="2020-08">Aug. 2020. 09/28/2020</date>
			<biblScope unit="page" from="1" to="6" />
		</imprint>
	</monogr>
	<note>cs, stat</note>
</biblStruct>

<biblStruct xml:id="b36">
	<analytic>
		<title level="a" type="main">Late Pleistocene Demography and the Appearance of Modern Human Behavior</title>
		<author>
			<persName><forename type="first">A</forename><surname>Powell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Shennan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">G</forename><surname>Thomas</surname></persName>
		</author>
		<idno type="DOI">10.1126/science.1170165</idno>
	</analytic>
	<monogr>
		<title level="j">Science</title>
		<idno type="ISSN">0036-8075</idno>
		<imprint>
			<biblScope unit="volume">324</biblScope>
			<biblScope unit="page" from="1298" to="1301" />
			<date type="published" when="2009-06-09">June 2009. Visited on 09/17/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b37">
	<analytic>
		<title level="a" type="main">R: A Language and Environment for Statistical Computing</title>
		<author>
			<persName><forename type="first">Team</forename><surname>Core</surname></persName>
		</author>
		<ptr target="https://www.R-project.org/" />
	</analytic>
	<monogr>
		<title level="m">tria: R Foundation for Statistical Computing</title>
				<meeting><address><addrLine>Vienna, Aus</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b38">
	<monogr>
		<title level="m" type="main">Memory in Oral Traditions: The Cognitive Psychology of Epic, Ballads, and Counting-Out Rhymes</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">C</forename><surname>Rubin</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1995">1995</date>
			<publisher>Oxford University Press USA</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b39">
	<analytic>
		<title level="a" type="main">Topic modeling, long texts and the best number of topics. Some Problems and solutions</title>
		<author>
			<persName><forename type="first">S</forename><surname>Sbalchiero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Eder</surname></persName>
		</author>
		<idno type="DOI">10.1007/s11135-020-00976-w</idno>
		<ptr target="https://doi.org/10.1007/s11135-020-00976-w(visitedon09/17/2020" />
	</analytic>
	<monogr>
		<title level="j">Quality &amp; Quantity</title>
		<idno type="ISSN">1573-7845</idno>
		<imprint>
			<biblScope unit="volume">54</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="1095" to="1108" />
			<date type="published" when="2020-08">Aug. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b40">
	<analytic>
		<title level="a" type="main">Topic Modeling Genre: An Exploration of French Classical and Enlightenment Drama</title>
		<author>
			<persName><forename type="first">C</forename><surname>Schöch</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Digital Humanities Quarterly</title>
		<idno type="ISSN">1938-4122</idno>
		<imprint>
			<biblScope unit="volume">011</biblScope>
			<biblScope unit="issue">2</biblScope>
			<date type="published" when="2017-05">May 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b41">
	<analytic>
		<title level="a" type="main">A Fast Morphological Algorithm with Unknown Word Guessing Induced by a Dictionary for a Web Search Engine</title>
		<author>
			<persName><forename type="first">I</forename><surname>Segalovich</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">MLMTA</title>
		<imprint>
			<biblScope unit="page" from="273" to="280" />
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b42">
	<analytic>
		<title level="a" type="main">Semanticheskii oreol metra&quot;: termin i poniatie</title>
		<author>
			<persName><forename type="first">M</forename><surname>Shapir</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Universum versus: iazyk, stikh, smysl v russkoi poezii XVIII-XX vekov</title>
				<editor>
			<persName><forename type="first">M</forename><surname>Shapir</surname></persName>
		</editor>
		<meeting><address><addrLine>Moskva</addrLine></address></meeting>
		<imprint>
			<publisher>Iazyki slavianskoi kultury</publisher>
			<date type="published" when="2015">2015</date>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="395" to="404" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b43">
	<analytic>
		<title level="a" type="main">The shortest species: how the length of Russian poetry changed (1750-1921)</title>
		<author>
			<persName><forename type="first">A</forename><surname>Shelya</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Sobchuk</surname></persName>
		</author>
		<idno type="DOI">10.12697/smp.2017.4.1.03</idno>
	</analytic>
	<monogr>
		<title level="j">Studia Metrica et Poetica</title>
		<idno type="ISSN">2346-691X</idno>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="page" from="66" to="84" />
			<date type="published" when="2017-08-07">Aug. 2017. Visited on 07/26/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b44">
	<monogr>
		<author>
			<persName><forename type="first">J</forename><surname>Silge</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Robinson</surname></persName>
		</author>
		<ptr target="https://www.tidytextmining.com/(visitedon07/" />
		<title level="m">Text Mining with R: A tidy approach</title>
				<imprint>
			<date type="published" when="2020">2020. 27/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b45">
	<analytic>
		<title level="a" type="main">Comparative phylogenetic analyses uncover the ancient roots of Indo-European folktales</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">G</forename><surname>Da Silva</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">J</forename><surname>Tehrani</surname></persName>
		</author>
		<idno type="DOI">10.1098/rsos.150645</idno>
	</analytic>
	<monogr>
		<title level="j">Royal Society Open Science</title>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page">150645</biblScope>
			<date type="published" when="2016">2016. Visited on 09/17/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b46">
	<monogr>
		<title level="m" type="main">Charting artistic evolution: an essay in theory</title>
		<author>
			<persName><forename type="first">O</forename><surname>Sobchuk</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2018">2018. Visited on 07/26/2020</date>
		</imprint>
		<respStmt>
			<orgName>Tartu: Tartu University</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">PhD thesis</note>
</biblStruct>

<biblStruct xml:id="b47">
	<analytic>
		<title level="a" type="main">O vzaimootnosheniah stikhotvornogo metra i tematiki</title>
		<author>
			<persName><forename type="first">K</forename><surname>Taranovskii</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">American Contributions to the Fifth International Congress of Slavists</title>
				<imprint>
			<date type="published" when="1963">1963</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="287" to="332" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b48">
	<analytic>
		<title level="a" type="main">Meter and Meaning: The Semantic Halo of Verse Form in English Romantic Lyrical Poems (Iambic and Trochaic Tetrameter)</title>
		<author>
			<persName><forename type="first">M</forename><surname>Tarlinskaja</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Oganesova</surname></persName>
		</author>
		<idno type="DOI">10.5840/ajs198643/422</idno>
		<idno>doi:</idno>
		<ptr target="https://doi.org/10.5840/ajs198643/422" />
	</analytic>
	<monogr>
		<title level="j">The American Journal of Semiotics</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="issue">3/4</biblScope>
			<biblScope unit="page" from="85" to="106" />
			<date type="published" when="1986">1986. Visited on 07/26/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b49">
	<analytic>
		<title level="a" type="main">Do Transmission Isolating Mechanisms (TRIMS) influence cultural evolution? Evidence from patterns of textile diversity within and between Iranian tribal groups</title>
		<author>
			<persName><forename type="first">J</forename><surname>Tehrani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Collard</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Understanding Cultural Transmission in Anthropology: A Critical Synthesis</title>
				<editor>
			<persName><forename type="first">R</forename><surname>Ellen</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Lycett</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Johns</surname></persName>
		</editor>
		<editor>
			<persName><surname>Berghahn</surname></persName>
		</editor>
		<imprint>
			<date type="published" when="2013">2013</date>
			<biblScope unit="volume">26</biblScope>
			<biblScope unit="page" from="148" to="164" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b50">
	<analytic>
		<title level="a" type="main">Ratcheting up the ratchet: on the evolution of cumulative culture</title>
		<author>
			<persName><forename type="first">C</forename><surname>Tennie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Call</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Tomasello</surname></persName>
		</author>
		<idno type="DOI">10.1098/rstb.2009.0052</idno>
	</analytic>
	<monogr>
		<title level="j">Philosophical Transactions of the Royal Society B: Biological Sciences</title>
		<idno type="ISSN">0962-8436</idno>
		<imprint>
			<biblScope unit="volume">364</biblScope>
			<biblScope unit="page" from="2405" to="2415" />
			<date type="published" when="2009-08-17">Aug. 2009. /17/2020</date>
		</imprint>
	</monogr>
	<note>Visited on 09</note>
</biblStruct>

<biblStruct xml:id="b51">
	<analytic>
		<title level="a" type="main">Towards the concept of semantic halo</title>
		<author>
			<persName><forename type="first">M</forename><surname>Trunin</surname></persName>
		</author>
		<idno type="DOI">10.12697/smp.2017.4.2.03</idno>
	</analytic>
	<monogr>
		<title level="j">Studia Metrica et Poetica</title>
		<idno type="ISSN">2346-691X</idno>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="page" from="41" to="66" />
			<date type="published" when="2017-12-07">Dec. 2017. Visited on 07/26/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b52">
	<analytic>
		<title level="a" type="main">Vol&apos;nye iamby Pushkina</title>
		<author>
			<persName><forename type="first">G</forename><surname>Vinokur</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Pushkin i ego sovremenniki: Materialy i issledovania</title>
				<meeting><address><addrLine>Leningrad</addrLine></address></meeting>
		<imprint>
			<date type="published" when="1930">1930</date>
			<biblScope unit="volume">38</biblScope>
			<biblScope unit="page" from="23" to="26" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b53">
	<analytic>
		<title level="a" type="main">Linguistic measures of chemical diversity and the &quot;keywords&quot; of molecular collections</title>
		<author>
			<persName><forename type="first">M</forename><surname>Woźniak</surname></persName>
		</author>
		<idno type="DOI">10.1038/s41598-018-25440-6</idno>
	</analytic>
	<monogr>
		<title level="j">Scientific Reports</title>
		<idno type="ISSN">2045-2322</idno>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page">7598</biblScope>
			<date type="published" when="2018-05-26">May 2018. /26/2020</date>
		</imprint>
	</monogr>
	<note>Visited on 07</note>
</biblStruct>

<biblStruct xml:id="b54">
	<analytic>
		<title level="a" type="main">Topic Modeling on Historical Newspapers</title>
		<author>
			<persName><forename type="first">T.-I</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Torget</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Mihalcea</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities</title>
				<meeting>the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities<address><addrLine>Portland OR</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2011">2011</date>
			<biblScope unit="page" from="96" to="104" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b55">
	<analytic>
		<title level="a" type="main">Two Methods for Mapping and Visualizing Associated Data on Phylogeny Using Ggtree</title>
		<author>
			<persName><forename type="first">G</forename><surname>Yu</surname></persName>
		</author>
		<idno type="DOI">10.1093/molbev/msy194</idno>
	</analytic>
	<monogr>
		<title level="j">Molecular Biology and Evolution</title>
		<idno type="ISSN">0737-4038</idno>
		<imprint>
			<biblScope unit="volume">35</biblScope>
			<biblScope unit="issue">12</biblScope>
			<biblScope unit="page" from="3041" to="3043" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b56">
	<monogr>
		<title level="m" type="main">ineq: Measuring Inequality, Concentration, and Poverty</title>
		<author>
			<persName><forename type="first">A</forename><surname>Zeileis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Kleiber</surname></persName>
		</author>
		<ptr target="https://CRAN.R-project.org/package=ineq(visitedon" />
		<imprint>
			<date type="published" when="2014-07">July 2014. 09/13/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b57">
	<analytic>
		<title level="a" type="main">Computational methods and tools</title>
		<author>
			<persName><forename type="first">J</forename><surname>Van Zundert</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Handbook of Stemmatology</title>
				<editor>
			<persName><forename type="first">P</forename><surname>Roelli</surname></persName>
		</editor>
		<editor>
			<persName><surname>De Gruyter Reference</surname></persName>
		</editor>
		<imprint>
			<publisher>De Gruyter</publisher>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="292" to="356" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
