=Paper=
{{Paper
|id=Vol-3290/short_paper5417
|storemode=property
|title=Measuring Rhythm Regularity in Verse: Entropy of Inter-Stress
Intervals
|pdfUrl=https://ceur-ws.org/Vol-3290/short_paper5417.pdf
|volume=Vol-3290
|authors=Artjoms Šeļa,Mikhail Gronas
|dblpUrl=https://dblp.org/rec/conf/chr/SelaG22
}}
==Measuring Rhythm Regularity in Verse: Entropy of Inter-Stress
Intervals==
Measuring Rhythm Regularity in Verse: Entropy of Inter-Stress Intervals Artjoms Šeļa1,2,∗,† , Mikhail Gronas3,† 1 Institute of Polish Language (Polish Academy of Sciences), al. Mickiewicza 31, 31-120 Kraków 2 University of Tartu, Ülikooli 18, 50090 Tartu, Estonia 3 Dartmouth college, Hanover, NH 03755, USA Abstract Recognition of poetic meters is not a trivial task, since metrical labels are not a closed set of classes. Out- side of classical meters, describing the metrical structure of a poem in a large corpus requires expertise and a shared scienti昀椀c theory. In a situation when both components are lacking, alternative and contin- uous measures of regularity can be envisioned. This paper focuses on poetic rhythm to propose a simple entropy-based measure for poem regularity using counts of non-stressed intervals. The measure is val- idated using subsets of a well-annotated Russian poetic corpus, prose, and quasi-poems (prose chopped into lines). The regularity measure is able to detect a clear di昀昀erence between various organizational principles of texts: average entropy rises when moving from accentual-syllabic meters to accentual vari- ations to free verse and prose. Interval probabilities, when taken as a vector of features, also allow for classi昀椀cation at the level of individual poems. This paper argues that distinguishing between meter as a cultural idea and rhythm as an empirical sequence of sounds can lead to better understanding of form recognition and prosodic annotation problems. Keywords rhythm, meter, poetry, regularity, entropy, diversity 1. Introduction Identifying the meter of a poem is one of those tasks that is deceptively simple. The problem lies not with methods: a wide array of successful solutions exist, ranging from rule-based [3, 1, 18] to probabilistic [16] to deep learning [17, 8]. The problem, as o昀琀en happens, is conceptual: metrical forms are treated as a closed set of classes, when in fact they are not; far from it. There is much more variation in organizational principles than common iambic or trochaic patterns. Not having a matching label for some unusual arrangement of stressed and unstressed syllables in a poem does not make that poem non-regular, or even non-metrical. And, indeed, quite o昀琀en we do not have labels. Here are some numbers from well-annotated, large poetic corpora that are indispensable in today’s work. Only 2% of lines in the Russian corpus do not conform to any of the 昀椀ve classical CHR 2022: Computational Humanities Research Conference, December 12 – 14, 2022, Antwerp, Belgium ∗ Corresponding author. † These authors contributed equally. £ artjoms.sela@ijp.pan.pl (A. Šeļa); mikhail.gronas@dartmouth.edu (M. Gronas) ȉ 0000-0002-2272-2077 (A. Šeļa) © 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR Workshop Proceedings http://ceur-ws.org ISSN 1613-0073 CEUR Workshop Proceedings (CEUR-WS.org) 231 meters (iamb, trochee, dactyl, anapest, amphibrach); for Czech this number grows to 14%, in the Dutch song collection it is 15%, and 昀椀nally the German corpus has an incredible rate of 68% ‘unrecognized’ lines (out of 170,000 in total).1 This is not simply a failure of recognition systems—even if you suppose there is some noise. This is a problem of domain expertise and levels of variation in modern verse forms. Germanic versi昀椀cation systems, for example, stem from alliterative tonic verse and widely employ meters that are not based on stable, recurring units of rhythm (metrical feet) [6]. With enough domain expertise and a shared taxonomy (that stems from shared theory), all poems in a corpus, in principle, could be described with a meaningful metrical label.2 That is how the Russian poetic corpus is described now [7]. Needless to say, ‘enough domain expertise’ and ‘shared theory’ are luxuries. More o昀琀en than not, we will not have enough resources for the former nor enough scholarly consensus for the latter. Semi-annotated, theory-agnostic and unstructured data is the primary reality of computational work today. That is why parallel approaches—that are not based on labels—for describing metrical and rhythmic features of verse might be considered. This paper proposes a simple way to measure rhythm regularity as one continuous value based on measures from information theory and ecological diversity. Conceptually, our approach focuses on rhythm alone and brackets the ‘meter’ (as an already known template) out. We try to ask how regularly the rhythmical features are organized in a text, to de昀椀ne a single scale that has highly regular iambs on one side, and irregular prose on the other—with many more intermediate and heterogeneous cases in between. Similar at- tempts to de昀椀ne one space for metrical forms have been made before [16], but su昀昀ered from not distinguishing between the metricality and regularity of verse. At the same time, scholars working with syllabic verse (mostly Romanic) o昀琀en looked in the same direction, because sim- ilarly organized isosyllabic verse might have di昀昀erent rhythmic features across languages [4], or authors [13]. Our reasoning here is informed by the so-called ‘Russian school’ of metrics that have es- tablished a distinction between poetic meter and rhythm. In the classic formulation, meter is a theoretical abstracted scheme, while rhythm is an empirical realization of (and deviation from) that scheme in a given poem (with ‘wrong’ stresses, additional syllables, distribution of word boundaries, etc.; for a brief English overview see Starostin & Pilshchikov [22]). Czech, English, Russian and German literatures all had an idea of iambic meter by the 19th century, but the meter’s rhythm was di昀昀erent everywhere: English iambs allowed the most freedom and scheme transgressions (impossible in Czech and Russian); however, Slavic iambic meters still had enough rhythmic variation compared to German that was the strictest in following the iambic pattern (unstressed syllable followed by a stressed one) [23, 14]. Conversely, Span- ish syllabic verse had strong iambic tendencies in its rhythm [4, 14], without having an iambic meter in its cultural repertoire per se. This fundamental distinction was taken further by Maksim Shapir [20], who suggested that the relationship between meter and rhythm is dialectic, continuous, and non-hierarchical (in the sense that rhythm is not an outcome of a meter). Meter is just a rhythm that is repeated 1 Numbers were provided by Petr Plecháč. For the detailed description of the mentioned corpora, see [19]. 2 Even if the label is just ‘free verse’, ‘tonic verse’, or something more exotic like: ‘3-ictus line with maximum inter-stress interval of 2 syllables and regular alternation of masculine and feminine clausulae’. 232 enough times and standardized in a tradition. For example, Russian folksongs allowed a re- curring rhythmical sequence of 昀椀ve syllables with a strong preference for stress on position 3, ‘00100’ (o昀琀en as a hemistich in 10-syllable line). This rhythm was a mere tendency, until it was recognized by poets and literati and parsed as metrical foot [2, 5]. It even received its early name—‘reduplicated amphibrach’—despite being open to equally plausible scansions within an already existing system: either as a trochaic ‘(1)0-10-0’ or anapestic ‘001-00’ meter.3 This is how a ‘penton’ (piatislozhnik) was born in literary tradition for writing stylized folk-inspired poetry—when one of the rhythmical possibilities in folksongs became a rule. Conversely, a meter could be ‘rhythmicized’ by breaking expectations and inertia in a given poem: actual rhythm can be so individual that it would not be possible to make a judgement about a governing scheme. From this perspective, meter is more of a cultural phenomenon, while rhythm is of prosodic nature. Meters, or measures, are crystallized technologies for the organization of speech: it is possible for them to be written down in poetics manuals, taught in schools, transmitted. They are (mis-)recognized by poets, debated, weaponized for ideological reasons or completely deconstructed [11]. Rhythm is a source of cultural conventions and a sub-product of linguistic a昀昀ordances. Here, we concentrate on information from the sub- product alone, while tracing its relationship to culturally recognized meters using existing deep annotations in the corpus of Russian poetry. 2. Materials and methods The underlying idea of a regularity measurement is simple. Any systematic outside force— like meter—that organizes speech prosody will leave its trace in the distribution of possible unstressed intervals. By measuring the shape, or (un)evenness of these distributions we ac- quire a proxy for rhythmical regularity. Ideal iambic meter allows only one type of unstressed interval—one syllable. In practice, it frequently allows one, o昀琀en three; rarely 昀椀ve (pyrrhics happen instead of fully 昀氀edged iamb feet). In prose (at least in most of it) there is no reason to suspect systematic limitations on the allowed inter-stress interval. We can also think about it as an inequality problem—meters are tyrannical forces that allow only a fraction of linguis- tically possible intervals to dominate, while prosaic language has a more democratic (or just disinterested) outlook on the distribution of the possible intervals. Our approach is summarized in Figure 1. Given a binary rhythmical annotation of an in- dividual text, we extract all inter-stress intervals. We include in the notion of ’interval’ an unstressed syllable that precedes the 昀椀rst instance of stress in a line (anacrusis), but we ex- clude unstressed syllables that follow the last stressed syllable (clausula).4 The types (intervals of particular lengths) are then counted and transformed to probabilities. To measure how ‘un- even’ the resulting probability distribution is we use classic Shannon entropy. This measures the uncertainty of a probability distribution and has two features that are useful for us: it 3 To be fair, both trochees and anapests were heavily employed as meters for imitation of folksongs—dactylic line endings played an important role here. 4 Clausulae are dominantly regular in verse. However, the same pattern of line endings (that might depend on the prosody of rhyming words) can be shared across poetic forms of very di昀昀erent organization (syllabic, tonic, accentual-syllabic). Since the appearance of clausulae is an almost constant feature of traditional verse, counting them will not add much to our task of di昀昀erentiating verse of di昀昀erent organization. 233 Figure 1: Summary of the method and two short examples. As expected, Whitman’s free verse in Leaves of Grass has greater entropy of intervals than Milton’s iambic pentameter, even if the latter has irregularities. The scansion was done by a native English speaker. It is open to debate, but a few changes would not alter the overall picture. grows with the number of the outcomes (i.e. possible interval types) and decreases when some outcomes are more likely than others; a fair die will have a larger entropy compared to an even slightly unfair die. In our case, lower entropy value will signal increased regularity, since some intervals end up being much more likely than others (unfair). Entropy is widely used as a diversity measure in ecology and is closely related to a family of measures that, to varying degrees, capture richness (amount of types) and evenness (how some types are more probable than others). To adopt a holistic approach to rhythm diversity, we also calculate Hill numbers [9] for interval probabilities that summarize diversity information in one curve (cf. their recent use to calculate surviving manuscript diversity by Kestemont et al. [10]). To predict stressed syllables in a text, we use a pre-trained, bidirectional RNN model that performed better than dictionary-based methods [15] and was further 昀椀ne-tuned for Russian poetic prosody (‘ru-accent-poet’ Python module). This model has one important limitation: it leaves all monosyllables unstressed. This will introduce noise to our measures, but we want to demonstrate that high-accuracy performance is not necessary when dealing directly with enough rhythmical features. To determine if the entropy of inter-stress intervals is able to capture di昀昀erences in regularity across di昀昀erent types of texts, we measure entropy at the level of individual texts that are sampled from several di昀昀erently organized domains. Poetry comes from the Russian National 234 Corpus [7], prose from the collection of 19th c. narrative 昀椀ction [21]. • Classic meters Sample of 700 poems for each of the classic accentual-syllabic meters (iamb, trochee, anapest, dactyl, amphibrach); • Baseline prose Sample of 1000 paragraphs of Russian 19th century literary prose (2 paragraphs sampled from each of 500 texts); • Chunked prose The calculated regularity of poetry, at least partly, emerges from di- vision into lines that can arti昀椀cially cut mid-sentence and mid-unstressed intervals. To emulate this behavior, we cut continuous prose into quasi-poems. For each chunk, its line length in syllables and overall length in lines is determined randomly by drawing values from the empirical distribution in the poetry corpus. In the end, our quasi-poems closely resemble the average dimensions of actual poetry. We sample 1000 quasi-poems (2 from each of 500 texts) ; • Free verse Verse that is labeled ‘free’ in corpus annotations, and that should not be governed by any surface-level prosodic pattern (700 poems); • Accentual, 1-2 (A1-2) Poems that take an intermediate position between accentual and accentual-syllabic verse by allowing variation of 1-2 syllables in inter-ictus intervals. A meter also known as dolnik (700 poems); • Accentual, 1-3 (A1-3). The same as A1-2, but allowing greater variation of 1-3 syllables. A meter known as taktovik (700 poems); • Accentual (A). Pure accentual: inter-ictus intervals are not under any regulation, the only measure is the tendency for a constant number of strong positions in a line (700 poems). Intuitively, all these forms should be positioned at di昀昀erent parts of the scale of regularity. We can expect that the regularity of Classic meters > A1-2 > A1-3 > A > Free verse (Chun- ked prose) > Baseline prose. This is what we set out to test using the per-poem distributions of regularity measures. 3. Results Figure 2 shows the distribution of entropy measures for individual texts in each category, ar- ranged by median value. While the variance of observations is large, there is a clear increase in average entropy (decrease in regularity) from accentual-syllabic (median around 1) to base- line prose (median above 2). Hill numbers calculated for bootstrapped samples also show a clear di昀昀erence in diversity pro昀椀les between prose, classical meters, and intermediate forms (see Appendix B). Overall, this supports our expectation of how things should be arranged on a regularity scale. The interval distributions also reveal a shortcoming of this measure based only on inter- stress intervals: pure accentual, free verse and chunked prose all show similar median regu- larity. While for free verse and chunked prose this is not surprising, we know that accentual verse has a governing principle and is regular, it is just that this regularity is not re昀氀ected in the distribution of rhythmical patterns. Additionally, trochaic meters show unexpectedly large 235 Figure 2: Distribution of entropy measures per di昀昀erent corpus subsets, arranged by median value. Red boxplots mark all classic accentual-syllabic meters, whiskers correspond to 95% confidence interval. diversity of inter-stress intervals compared to all other accentual-syllabic meters. Partly this is a natural outcome of the measurement: trochee allows even-length intervals from anacrusis (unlike iamb), which can in昀氀ate entropy. However, it is important to note that Russian trochee can allow more metrical freedom than other meters, because of its historical source in folk- songs. The perceived irregularity of the source material became a stylistic marker for folksong imitations that were o昀琀en rendered in trochaic meters. There is an another potential problem with our observations—the length of the line. As one might expect, it is hard to cram a long unstressed interval into a short line. To account for this, we can formalize regularity di昀昀erences across categories by building a linear model that estimates mean entropy for each group, conditioned on the average length of lines in a poem (for more details, see Appendix C). Posterior predictions made for the global average length across corpora are fully consistent with the observed trend. Based on this model, the di昀昀erence that we are seeing does not come from di昀昀erence in length of lines. It is also possible to use raw probabilities of individual poem intervals to perform recognition of classic meters if every text is put in the same feature space. Figure 3 shows a UMAP [12] projection of a subset of our data, with each point corresponding to a single poem. Even with noisy, imperfect scansion, the potential for clustering is evident (�㕘-means clustering with �㕘 set to the number of meters shows an Adjusted Rand Index of 0.6, suggesting a decent clustering force at the scale of individual poems). 236 Figure 3: UMAP projection of interval data using 5000 texts. Each point corresponds to a single poem. Prose and ‘Accentual,1-2’ were added for comparison. Adjusted Rand Index for �㕘-means clustering for data on this plot plot is 0.41. 4. Discussion Our entropy-based, continuous measure of rhythm regularity is able to adequately describe di昀昀erences in verse organization at the level of individual poems. It is also independent from language and versi昀椀cation system, and could be used to compare poetic forms across languages and across meters—as long as the scansion is available. Of course the nature of this measure- ment makes it more suitable for accentual-syllabic foot-based verse, as that style has the great- est in昀氀uence on the distribution of allowed inter-stress intervals. As we have seen in Figure 2, it is hard to distinguish pure accentual and free verse; purely syllabic poems will most likely show a similar range of entropy values. To account for this, regularity measurement would need to include both the regularity of stressed/strong positions (for tonic verse) and the regu- larity of line lengths (for syllabic verse)—but introducing additional dimensions would make comparisons on a single, continuous scale meaningless. There is little point in asking whether tonic verse is ‘more regular’ than syllabic. We hope that this measure might be used for cross-linguistic comparisons, but its simplicity makes it especially useful in unseen and unstructured data scenarios, where expert annotators are unlikely to be available. This includes collections of self-published poetry, rap, and song lyrics. The regularity scale can be also used to answer long-standing questions about tenden- cies that are hard to see: e.g. whether all ‘free verse’ is completely free, or if some authors prefer 237 it with a bit of regularity (like micropolymetry)? We have seen that it is almost impossible to distinguish free verse from quasi-poems that were cut from prose; however, these quasi-poems have much more regulated line length per text, and, under certain conditions, visibly higher average entropy than free verse (see Appendix C). One of the key features of this regularity measure is that it does not depend at all on the art of scansion. Imperfect, automated rhythmical annotation provides enough information both for entropy scores to make sense, and for meters to form distinct clusters. This reminds us that the rhythm of a poem as an empirical sequence of sounds is not equal to (and not always dependent on) meter as a cultural idea. By distinguishing the two concepts, we can focus our e昀昀orts on improving the rhythm annotation: deriving metrical labels is a parallel, and o昀琀en much more costly, process. Acknowledgments AŠ was supported by the “Large-Scale Text Analysis and Methodological Foundations of Com- putational Stylistics” project (SONATA-BIS 2017/26/E/HS2/01019). We are deeply grateful to Benjamin Nagy and Petr Plecháč for their advice and discussion of the early dra昀琀s of the pa- per. We also would like to thank two anonymous reviewers for their helpful and thoughtful feedback. References [1] A. Anttila and R. Heuser. “Phonological and Metrical Variation across Genres”. In: Pro- ceedings of the Annual Meetings on Phonology 3.0 (2016). doi: 10.3765/amp.v3i0.3679. [2] J. Bailey. “Literary Usage of a Russian Folk Song Meter”. In: The Slavic and East European Journal 14.4 (1970), pp. 436–452. doi: 10.2307/305737. [3] K. Bobenhausen and B. Hammerich. “Métrique littéraire, métrique linguistique et métrique algorithmique de l’allemand mises en jeu dans le programme Metricalizer2”. In: Langages 199.3 (2015), pp. 67–88. doi: 10.3917/lang.199.0067. [4] M. De Sisto. “The interaction between phonology and metre. Approaches to Romance and West Germanic Renaissance metre”. PhD Thesis. Radboud University, 2020. [5] M. L. Gasparov. “Russkii narodnii stikh i ego literaturnyie imitatsii”. In: Izbrannyie trudy. Vol. Iii. Moscow: Iazyki russkoi kulʹtury, 1997, pp. 51–131. [6] M. Gasparov. A History of European Versi昀椀cation. Oxford, New York: Oxford University Press, 1996. [7] E. Grishina, K. Korchagin, V. Plungian, and D. Sichinava. “Poeticheskii korpus v ramkah NKRIA: obschaia struktura i perspektivy ispolzovania”. In: Natsionalnii korpus russkogo iazyka: 2006-2008. Novye rezultaty i perspektivy. St. Petersburg: Nestor-Istoria, 2009, pp. 71–113. 238 [8] T. Haider. “Metrical Tagging in the Wild: Building and Annotating Poetry Corpora with Rhythmic Features”. In: Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume. Online: Association for Compu- tational Linguistics, 2021, pp. 3715–3725. doi: 10.18653/v1/2021.eacl-main.325. [9] M. O. Hill. “Diversity and Evenness: A Unifying Notation and Its Consequences”. In: Ecology 54.2 (1973), pp. 427–432. doi: 10.2307/1934352. [10] M. Kestemont, F. Karsdorp, E. d. Bruijn, M. Driscoll, K. A. Kapitan, P. Ó. Macháin, D. Sawyer, R. Sleiderink, and A. Chao. “Forgotten books: The application of unseen species models to the survival of culture”. In: Science 375.6582 (2022), pp. 765–769. doi: 10.1126 /science.abl7655. [11] M. Martin. The Rise and Fall of Meter: Poetry and English National Culture, 1860–1930. Princeton University Press, 2012. doi: 10.23943/princeton/9780691152738.001.0001. [12] L. McInnes, J. Healy, and J. Melville. “UMAP: Uniform Manifold Approximation and Pro- jection for Dimension Reduction”. In: arXiv:1802.03426 [cs, stat] (2020). url: http://arxiv .org/abs/1802.03426. [13] A. Mittmann, P. H. Pergher, and L. d. S. Alckmar. “What Rhythmic Signature Says About Poetic Corpora”. In: Quantiative Approaches to Versi昀椀cation. 2019, p. 20. [14] P. Plecháč and D. Birnbaum. “Assessing the Reliability of Stress as a Feature of Author- ship Attribution in Syllabic and Accentual Syllabic Verse”. In: Quantitative Approaches to Versi昀椀cation. 2019, pp. 201–210. [15] M. Ponomareva, K. Milintsevich, E. Chernyak, and A. Starostin. “Automated Word Stress Detection in Russian”. In: Proceedings of the First Workshop on Subword and Character Level Models in NLP. Copenhagen, Denmark: Association for Computational Linguistics, 2017, pp. 31–35. doi: 10.18653/v1/W17-4104. [16] J. Porter. The Space of Poetic Meter – Stanford Literary Lab. 2018. url: https://litlab.stanf ord.edu/hooddistance/. [17] J. de la Rosa, Á. Pérez, M. de Sisto, L. Hernández, A. Dı́az, S. Ros, and E. González- Blanco. “Transformers analyzing poetry: multilingual metrical pattern prediction with transfomer-based language models”. In: Neural Computing and Applications (2021). doi: 10.1007/s00521-021-06692-2. [18] J. d. l. Rosa, Á. Pérez, L. Hernández, S. Ros, and E. González-Blanco. “Rantanplan, Fast and Accurate Syllabi昀椀cation and Scansion of Spanish Poetry”. In: Procesamiento del Lenguaje Natural 65.0 (2020), pp. 83–90. url: http://journal.sepln.org/sepln/ojs/ojs/index.php/pln /article/view/6280. [19] A. Šeļa, P. Plecháč, and A. Lassche. “Semantics of European poetry is shaped by conser- vative forces: The relationship between poetic meter and meaning in accentual-syllabic verse”. In: Plos One 17.4 (2022), e0266556. doi: 10.1371/journal.pone.0266556. [20] M. Shapir and M. Shapir. “Metrum et rhythmus sub specie semioticae”. In: Universum versus. Vol. 1. Moscow: Iazyki russkoi kulʹtury, 2000, pp. 91–128. 239 Figure 4: Hill numbers up to diversity order 4 for di昀昀erent corpus subsets. [21] O. Sobchuk and E. Lekarevitch. Corpus of Russian narrative prose of the 19th century. 2020. doi: 10.31860/openlit-2020.10-C004. [22] A. Starostin and I. Pilshchikov. “Automated Analysis of Poetic Texts and the Problem of Verse Meter”. In: Current Trends in Metrical Analysis. Vol. 2. Littera. 2011, pp. 133–140. [23] K. Taranovskii. “Rannie russkie iamby i ikh nemetskie obraztsy”. In: O poezii i poetike. Moscow: Iazyki russkoi kulʹtury, 2000, pp. 283–290. A. Code and data The data pre-processing and analysis pipeline is openly available in a repository: https://github.com/perechen/verse_regularity B. Rhythm diversity, Hill numbers Figure 4 displays the decay of Hill numbers (diversity curves) for corpus subsets up to diversity order (�㕞) 4. �㕞 = 0 corresponds to the simple number of types; �㕞 = 1 to Shannon diversity (our regularity measure); �㕞 = 2 to Simpson’s index. Higher positions on a curve signal higher relative diversity. We see that classical meters have the least diversity of unstressed intervals, followed by accentual variants and then followed by the highest, and distinct, curve for prose. The only exception is the behavior of trochee that displays a diversity of intervals (a昀琀er �㕞 = 1) approaching accentual verse and quasi poems. Lines show average Hill numbers for 1000 bootstrapped subsets (100 texts in each run), shaded areas correspond to 95% interval. 240 C. Regularity and line length: posterior predictions To formally model regularity di昀昀erence between corpus subsets given the relationship entropy ∼ line length, we build a Bayesian regression model using the ‘brms’ interface in R, where we estimate the average entropy (�㕅) for each group of texts (�㔶), conditioned on poem’s average line length in syllables (�㕆) within each group (interaction). We add a quadratic term for �㕆, since the �㕅 ∼ �㕆 relationship di昀昀ers across meters and is not linear. We also log-transform �㕆 to have better predictions in the presence of outliers. In ‘brms’ formula notation (modeling �㕅 using normal distribution): R ∼ C * (S + I(S^2)) The le昀琀 side of Figure 5 shows posterior predictions for 昀椀xed global average line length (dotted lines on the right side plot). Since there is an interaction between each subset and length, each prediction for each group uses its own scale. Estimates are consistent with the empirically observed di昀昀erences in Figure 2, with one exception: quasi-poems at the average corpus line length show greater average entropy than accentual and free verse. Note that while some forms show a slight increase in entropy as line lengths increases, the relationship is not clear in many cases, except for free verse and chunked prose that are unconstrained by metrical regularities. Additionally, we omit prose samples from modeling, because the division to lines was absent from them, which means that ‘line length’ was just equal to the length of a paragraph (in syllables). This would introduce unreasonable predictions for unobserved lengths, like iambic poems with an average line length of, say 400 syllables (200-foot iamb! iambic diakosiameter!). The right side of the Figure 5 already shows wide con昀椀dence intervals for extremely short and extremely long lengths for which we don’t have many (if any) observations. 241 Figure 5: Le昀琀: posterior estimates (black) for corpus subsets superimposed on empirical data. Pre- dictions are made for global average line length (9.4 syllables). Right: posterior relationships between entropy and line length for each group. Vertical line marks the point that was used for predictions on the le昀琀. 242