Global Coherence, Local Uncertainty – Towards a Theoretical Framework for Assessing Literary Quality Yuri Bizzoni1,† , Pascale Feldkamp1,† and Kristoffer Nielbo1,† 1 Center for Humanities Computing Aarhus, Jens Chr. Skous Vej 4, Building 1483, DK-8000 Aarhus C, Denmark Abstract A theoretical framework for evaluating literary quality through analyzing narrative structures using simplified narrative representations in the form of story arcs is presented. This framework proposes two complementary models: the first employs Approximate Entropy to measure local unpredictability, while the second utilizes fractal analysis to assess global coherence. When applied to a substantial corpus of 9,089 novels, the findings indicate that narratives characterized by high literary quality, as indicated by reader ratings, exhibit a balance of local unpredictability and global coherence. This dual approach provides a formal and empirical basis for assessing literary quality and emphasizes the importance of considering intrinsic properties and reader perception in literary studies. Keywords literature, information theory, fractal theory, aesthetic theory 1. Introduction Quality assessment of literature is a highly contested matter. Positions in the debate range from constructivist context dependency (‘the success of a work of literature depends entirely on its context’) to work internalism (‘success depends on work-internal features’) [3, 36]. While con- text dependency is evidenced by the variety and seemingly chaotic dynamics of, e.g., bestseller lists, the constructivist argument ignores the convergence of an empirical ‘canon’ for many readers over time and space [34, 59]. Moreover, to attribute the longevity or popularity of cer- tain books to purely contextual factors would seem to be at odds with large-scale consensus among readers, which appear far from volatile [1, 60, 39]. Several shifts have played a role in making terms like “literary quality” or “classics” unpop- ular in the discipline of literary studies, even said to belong to the “precritical era of criticism itself” [23]: Methodological shifts that move the scholarly focus from evaluation to interpre- tation [9], an expansion of the conceptual boundaries of “literature” to encompass texts that challenge traditional ideas of beauty or enjoyment [62], but also constructivist and postcolonial shifts that bring attention to the context of literary evaluation and tradition, canon representa- tivity [24] and the inequality of cultural production [55, 53]. CHR 2024: Computational Humanities Research Conference, December 4–6, 2024, Aarhus, Denmark † These authors contributed equally. £ yuri.bizzoni@cc.au.dk (Y. Bizzoni); pascale.moreira@cc.au.dk (P. Feldkamp); kln@cas.au.dk (K. Nielbo) ȉ 0000-0002-6981-7903 (Y. Bizzoni); 0000-0002-2434-4268 (P. Feldkamp); 0000-0002-5116-5070 (K. Nielbo) © 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). 172 CEUR ceur-ws.org Workshop ISSN 1613-0073 Proceedings Context-dependent positions on literary appreciation range from simply prioritizing context over work-internal factors [23], to beliefs on the contextual determinacy of aesthetic evaluation [19, 15]. An extreme perspective holds that critical evaluation reflects culturally dominant voices, arguing that canon judgments “are only the instruments of entrenched interests” [24, p. iv]. In this sense, a disparity appears to have arisen between a scholarly “denial of quality” [62, 46], and the multitude of quality judgements that are practiced within the literary culture (literary awards, classics book series, prescriptive creative writing courses, etc.). Conversely, work-internalist positions are closer, at least in their purest form, to universal- ist claims, where aesthetic judgments are seen as arising solely from the interaction between the individual reader and writer [10]. On this position, aesthetic pleasure tends to be consid- ered as an a-historical or culturally universal experience and sidetracks as unimportant any “historically contingent” aspect of literature [16]. We can simplify the concept of literary quality by modeling perceived literary quality, thereby differentiating between a contested intrinsic value and the perceived value of a work, which anchors the quality assessment in the reader’s aesthetic experience (i.e., a situated experience). Empirical aesthetics offers well-formed theoretical work supported by empirical findings on the nature of aesthetic experience[22] and preferences at the psychological level [47, 11], as well as cultural and contextual influences on the aesthetic experience [12]. Yet, empirical aesthetics has predominantly focused on the visual modality, in particular of pictorial art, attempting to find patterns correlating with the appreciation of paintings [22]. Within the domain of paintings, prevalent research topics include aesthetic appreciation and judgment [37], their perceptual and cognitive underpinnings [38, 61], and, finally, emotional response [54, 58, 47]. The appreciation of literary prose has received significantly less attention than pictorial art [22, 11], and studies tend to focus on general aspects of aesthetic appreciation of language [33] and literary response [41, 42, 18]. Approaches in empirical aesthetics that are based on fractal theory are particularly promis- ing since they cross aesthetic modalities, that is, they seem to be valid for pictorial [56, 51] liter- ary [43, 44] and musical [40, 25] arts. Collectively, these approaches state that the appreciation of art depends on the presence of fractal-like scale-invariant properties. Although the expla- nation for this can vary, it is typically grounded in neural mechanisms of sensory coding [51]: a prototypical example is Jackson Pollock’s abstract expressionism, where his drip painting technique was used to compose self-similar patterns repeated at multiple scales, which enables “Pollock authentication” through fractal analysis [56]. The same properties can be found in lit- erature, where linguistic properties tend to display fractal behavior [43], but where positively evaluated literary narratives tend to be characterized by a higher degree of self-similarity [8, 4, 5]. These findings of multiscale self-similar repetitions in a set of linguistic properties (e.g., sentence structure, literary entities, or lines) are generally translated into global coherence or consistency of the aesthetic object [5, 7, 50]. In literary prose, for example, it is argued that a high-quality story is characterized by a narrative coherence and multiscale predictability that distinguish it from bland and unpredictable stories [28]. Similarly, high-quality paintings have visual elements that remain consistent at different magnification levels. A recent addition to this global quality property is a local property in literary prose that can identify quality [43, 44]. More specifically, canonical works show higher degrees of sequential unpredictability compared to noncanonical fiction [44]. Combining these two quality indicators, we expect 173 that high-quality literature, that is, literature that is appreciated by a majority, should display a tension between global coherence and local unpredictability and, furthermore, that a model of perceived literary quality should account for both. This paper will presents theoretical framework on two models of literary quality that utilize simplified narrative representations in the form of story arcs [29, 49, 45]. The first model uses approximate entropy to capture local unpredictability of the story arc [44], and the second model is based on fractal analysis to characterize the global coherence of the story arc [28]. Finally, we present an application of the models to a large collection of literary texts that when applied to the texts’ story arcs, covary in non-trivial ways. 2. Proposal and Methodology In this section, we present the theoretical foundation for models of literary quality, focusing on the integration of global coherence and local uncertainty within narrative structures. By employing metrics from empirical aesthetics, fractal theory and information theory, we pro- pose two complementary approaches to quantify these aspects: approximate entropy and the Hurst exponent. Specifically, we suggest the for literature to be appreciated, it should manifest a tension, or positive correlation, between global coherence and local unpredictability (Fig. 1). 2.1. Approximate entropy - a measure of local predictability Approximate entropy (𝐴𝑝𝐸𝑛) is a statistical metric that quantifies the complexity and regular- ity inherent in the time series data. Within the framework of narrative analysis, 𝐴𝑝𝐸𝑛 provides a means to assess the predictability and structural characteristics of story arcs [44]. A lower 𝐴𝑝𝐸𝑛 value signifies a more predictable and regular story arc, indicative of a repetitive and ‘boring’ narrative structure. In contrast, a higher 𝐴𝑝𝐸𝑛 value denotes a more complex and less predictable arc, suggesting a novel and intricate storyline, 1. This metric facilitates the quanti- tative analysis of local narrative predictability, enabling a formal evaluation of literary quality through empirical methodologies. For the specific question of evaluating literary quality, we expect that higher local 𝐴𝑝𝐸𝑛 will be associated with higher literary appreciation, c.f., [44]. In order to estimate 𝐴𝑝𝐸𝑛 for story arc 𝑋 = 𝑥(1), … , 𝑥(𝑛), subsequences of length 𝑚, 𝑦𝑖𝑚 = [𝑥(𝑖), … , 𝑥(𝑖+(𝑚 −1))], and tolerance 𝑟, we start by computing the Chebyshev distance between each sub-sequence 𝑦𝑖𝑚 and 𝑦𝑗𝑚 𝑑𝑖,𝑗𝑚 = max ∣ 𝑦𝑖𝑚 − 𝑦𝑗𝑚 (𝑘) ∣ 𝑘 for each sub-sequence 𝑦𝑖𝑚 to compute the count 𝐶 𝑛−𝑚+1 1 𝐶𝑖𝑚 (𝑟) = ∑ 𝐻 (𝑟 − 𝑑𝑖,𝑗𝑚 ) 𝑛 − 𝑚 + 1 𝑗=1 where 𝐻 (.) is the Heaviside function 174 1, 𝑥 > 0 𝐻 (𝑥) = { 0, 𝑥 ≤ 0 then define 𝑛−𝑚+1 1 𝜙 𝑚 (𝑟) = ∑ log(𝐶𝑖𝑚 (𝑟)) 𝑛 − 𝑚 + 1 𝑖=1 where log(.) is the natural logarithm. Repeat the above for all sub-sequences of length 𝑚 + 1 to compute 𝜙 ( 𝑚 + 1)(𝑟), then 𝐴𝑝𝐸𝑛 is 𝐴𝑝𝐸𝑛(𝑚, 𝑟) = 𝜙 𝑚 (𝑟) − 𝜙 𝑚+1 (𝑟) A note on parameter selections, the choice of 𝑚 and 𝑟 can influence the estimation of 𝐴𝑝𝐸𝑛. Typically, 𝑚 is set to 2 or 3 (in this study 𝑚 = 2), while 𝑟 is chosen as a small percentage of the standard deviation of the story arc (in this study 0.2 ∗ 𝑆𝐷). It is important to note that the values of the optimal parameters may vary depending on the specific application and the characteristics of the data. 2.2. The Hurst exponent - a measure of global coherence Story arcs can be modeled as fractal processes to understand their time-dependent self- similarity. By representing narratives as one-dimensional time series, we can apply fractal analysis to model the underlying pattern of coherence [28, 8, 6]. This approach allows us to quantify the degree of self-similarity in the narrative flow. Using the Hurst exponent, 𝐻 , which quantifies the persistence of a story arc, we can differentiate between persistent, anti-persistent, and short-memory processes, thus characterizing the temporal dynamics in three qualitative ranges [28], 1. We expect that literary appreciation will be associated with coherence, which translates to a persistent story arc such that narrative fluctuations at shorter time scales are approximate copies of narrative fluctuations at longer time scales. The Hurst 𝐻 exponent quantifies persistence or memory in time series, where 0 < 𝐻 < 0.5 is an anti-persistent process, 𝐻 = 0.5 is a short-memory process, and 0.5 < 𝐻 < 1 is a persistent process [20]. For the present proposal, a persistent process indicates continuity of the story arc (i.e., sentiment states will last for a long time). An anti-persistent indicates rigidity (i.e., sentiment states will rapidly decay to a mean state). Finally, short memory indicates a lack of continuity (sentiment states will only be correlated at short time scales). Detrended fluctuation analysis (DFA) [48] is a widely used method for estimating the Hurst exponent for a time series. DFA consists of five steps, 1) initially a random walk process is constructed from the time series: 𝑛 𝑢(𝑛) = ∑(𝑥𝑘 − 𝑥), 𝑛 = 1, 2, ⋯ , 𝑁 , 𝑘=1 where 𝑥 is the mean of the series 𝑥(𝑘), 𝑘 = 1, 2, ⋯ , 𝑁 ; 2) dividing the constructed random walk process into non-overlapping segments; 3) determining the local trends of each segment as the 175 best polynomial fit; 4) getting the variance of the differences between the random walk process and the local trends; and 5) determining the average variance over all the segments. DFA may involve discontinuities at the boundaries of adjacent segments. Such discontinuities can be detrimental when the data contain trends [27], non-stationarity [32], or nonlinear oscillatory components [13, 26]. Adaptive fractal analysis (AFA) provides a robust alternative to DFA that solves these issues [20]. The main advantage of AFA over DFA is that it identifies a global smooth trend, which is obtained by optimally combining local linear or polynomial fitting, and thus no longer suffers from DFA’s problem of discontinuities of adjacent segments. As a result, AFA can automatically deal with arbitrary, strong nonlinear trends that are not unusual to encounter in story arcs[20, 26, 28]. AFA is based on a multiscale non-linear adaptive decomposition algorithm [20]. The first step involves partitioning the time series under study into overlapping segments of length 𝑤 = 2𝑛 + 1, where neighboring segments overlap by 𝑛 + 1 points. In each segment, the time series is fitted with the best polynomial of order 𝑀, obtained using the standard least squares regression; the fitted polynomials in overlapped regions are combined to yield a single global smooth trend. Denoting the fitted polynomials for the 𝑖 − 𝑡ℎ and (𝑖 + 1) − 𝑡ℎ segments by 𝑦 𝑖 (𝑙1 ) and 𝑦 (𝑖+1) (𝑙2 ), respectively, where 𝑙1 , 𝑙2 = 1, ⋯ , 2𝑛 + 1, we define the fitting for the overlapped region as 𝑦 (𝑐) (𝑙) = 𝑤1 𝑦 (𝑖) (𝑙 + 𝑛) + 𝑤2 𝑦 (𝑖+1) (𝑙), 𝑙 = 1, 2, ⋯ , 𝑛 + 1, where 𝑤1 = (1 − 𝑙−1 𝑛 ) and 𝑤2 = 𝑙−1 𝑛 can be written as (1 − 𝑑𝑗 /𝑛) for 𝑗 = 1, 2, and where 𝑑𝑗 denotes the distances between the point and the centers of 𝑦 (𝑖) and 𝑦 (𝑖+1) , respectively. Note that the weights decrease linearly with the distance between the point and the center of the segment. Such weighting ensures symmetry and effectively eliminates any jumps or disconti- nuities around the boundaries of neighboring segments. As a result, the global trend is smooth at the non-boundary points and has the right and left derivatives at the boundary [52]. The parameters of each local fit are determined by maximizing the goodness of fit in each seg- ment. The different polynomials in the overlapping part of each segment are combined so that the global fit will be the best (smoothest) fit of the overall time series. Note that, even if 𝑀 = 1 is selected, i.e., the local fits are linear, the global trend signal will still be nonlinear. For an arbitrary window size 𝑤, we determine, for the random walk process 𝑢(𝑖), a global trend 𝑣(𝑖), 𝑖 = 1, 2, ⋯ , 𝑁 , where 𝑁 is the length of the walk. The residual of the fit, 𝑢(𝑖) − 𝑣(𝑖), char- acterizes fluctuations around the global trend, and its variance yields the Hurst parameter 𝐻 according to the following scaling equation: 𝑁 1/2 1 𝐹 (𝑤) = [ ∑(𝑢(𝑖) − 𝑣(𝑖))2 ] ∼ 𝑤 𝐻 . 𝑁 𝑖=1 By computing the global fits, the residual, and the variance between original random walk process and the fitted trend for each window size 𝑤, we can plot log2 𝐹 (𝑤) as a function of log2 𝑤. The presence of fractal scaling amounts to a linear relation in the plot, with the slope of the relation providing an estimate of 𝐻 . 176 Figure 1: To illustrate the ‘boldness’ of the proposed tension between global coherence and local unpre- dictability, we generated four time series characterized by decreasing levels of local unpredictability and increasing levels of global persistence. Starting from left to right, azure noise (𝐴𝑝𝐸𝑛 = 1.55, 𝐻 = 0.03), white noise (𝐴𝑝𝐸𝑛 = 1.33, 𝐻 = 0.54), 3) pink noise (𝐴𝑝𝐸𝑛 = 0.83, 𝐻 = 0.72), 4) brown/black noise (𝐴𝑝𝐸𝑛 = 0.31, 𝐻 = 1.56). It should be apparent the both intuitively and theoretically unpredictability and persistence are anti-correlated. 3. Application To illustrate the practical application of our proposed models, we apply the models to a sub- stantial corpus of literary texts. By analyzing the sentiment arcs of the narratives, we quantify the local unpredictability using 𝐴𝑝𝐸𝑛 and the global coherence using the Hurst exponent. This application demonstrates how our models align with quality indices, providing empirical sup- port for the notion that a balance of global coherence and local uncertainty enhances perceived literary quality. 3.1. Data To extract the sentiment arcs, we use a corpus of 9, 089 novels published in the US from 1880 to 2000, making it one of the largest corpora of manually cleaned contemporary literature [57, 14, 2]. It was assembled by Hoyt Long and Richard Jean So and was based on the number of libraries worldwide that held a copy of each title, favoring titles with a higher number of holdings. It comprises high-brow fiction (e.g., Joyce Carol Oates, Philip Roth) as well as highly known “genre fiction” (e.g., J.R.R Tolkien, Philip K. Dick), bestsellers (e.g., Agatha Christie, George R.R. Martin) and prestigious experimental literature (e.g., James Joyce). The works span from 341 − 714, 444 words, but more than 97% of the novels are longer than 35,000 words. Overall the corpus totals over one billion words. The corpus has an Anglophone bias: most are US or UK authors writing in English. While this allowed us some form of control on cultural variability within the corpus, it also presents a narrower perspective on literary dynamics. From each novel, we extracted the average valence sentence by sentence using the Syuzhet library [30].1 Syuzhet was chosen based on recent research showing that it performs particu- larly well on detrended narrative arcs, returning values close to human annotations [63]. For this corpus various quality proxies were collected: GoodReads average rating and rating 1 The custom Syuzhet dictionary is extracted from 165, 000 human coded sentences from contemporary literary novels [31]. 177 count,2 library holding numbers,3 and translation numbers.4 3.2. Findings Figure 2: Spearman correlation between the Hurst exponent and approximate entropy (ApEn) of novels in the corpus and four quality proxies: GoodReads average rating and rating count, library holding numbers and translation counts. For all correlations, 𝑝 < 0.01 We estimated 𝐴𝑝𝐸𝑛 and Hurst exponent for each story arc. We correlated these measures with available metadata quality proxies: GoodReads average rating and rating count, library holdings, and translation counts (see Figure 2). As predicted, we find empirical support that global coherence and local predictability are moderately correlated (𝜌 = .33, 𝑝 < .0001). Fur- thermore, average ratings are positively associated with both measures (𝜌𝐴𝑝𝐸𝑛 = .15 and 𝜌𝐻 = .19, 𝑝 < .0001). Interestingly, we observe that 𝐴𝑝𝐸𝑛 is more strongly associated with the additional quality proxies. One of the most interesting findings is, in our opinion, the positive correlation of Hurst exponent and 𝐴𝑝𝐸𝑛 in our corpus in general, and with at least one of the proxies of reception. As we show in 1, the Hurst exponent and 𝐴𝑝𝐸𝑛 tend to be divergent measures, and in fact they have negative correlations for artificial or simpler time series [21]. In other words, a completely random series will have both a higher 𝐴𝑝𝐸𝑛 and a lower Hurst exponent than a completely trending series. However, Hurst exponent and 𝐴𝑝𝐸𝑛 do not measure the same thing, and in series with complex behaviors it is not the case that the highest Hurst corresponds to the lowest entropy (see, e.g., Kristoufek and Vosvrda [35]). Overall, 𝐴𝑝𝐸𝑛 (as we have computed it) here compares each embedding to all the other em- beddings in the series. We can imagine a case in which a time series that has some degree of persistence in its overall trend, but which consists, at the sentence-level, of relatively unpre- dictable shorter sequences. Such a case would not result in a negative correlation between the measures. If the level of “noise” (unpredictability) is both relatively low and constant through- out a trending series, it is likely to result in a high 𝐴𝑝𝐸𝑛 and a high Hurst exponent. This effect may be due to the ability of 𝐴𝑝𝐸𝑛 to capture shorter-range variations in the sen- timental fluctuations of a novel, closer to style than to overall narrative structure. For example, 2 I.e., the average score assigned to a book by users, and how many GoodReads users assigned a score. 3 The number of libraries which hold a copy of the book as indexed on WorldCat. 4 The number of translations of a book in the Index Translationum database. 178 a relatively low 𝐴𝑝𝐸𝑛 could represent a text where the same small-scale sentiment patterns are repeated in a structured and predictable fashion. Reversely, a high Hurst with high 𝐴𝑝𝐸𝑛 might represent a text where the overall narrative arc is highly coherent (and predictable), but where the small-scale succession of positive and negative sentences is more chaotic or unpredictable. In this case, readers may find short-term uncertainty but an overall smoother and foreseeable experience: a trend is builds up through a very diverse set of sentimental “zig-zags”, rather than through a simple linear succession going from the saddest to the happiest point. Narra- tives that manage to strike a balance between short-term surprising patterns and long-term coherent structures might be one of the reasons for the observed correlations. Finally, while 𝐴𝑝𝐸𝑛 slightly correlates with all our metrics of reception, a linear positive relation with the Hurst exponent of sentiment arcs is present only in the case of average GoodReads ratings. Interestingly, this is the only metric that is not related to spread or popularity, while the other three more or less indirectly measure how popular, known or disseminated a text is [17]. 4. Concluding Remarks Our theoretical proposal aims to avoid the contentious issue of universal features by focusing on the reader’s perception of literary quality. By redefining literary quality as perceived literary quality, we emphasize the reader’s experience and the subjective nature of aesthetic appreci- ation. Despite this focus on perception, our approach still leans towards intrinsic properties, as our models of approximate entropy and the Hurst exponent are grounded in the structural characteristics of the narratives themselves. Our findings provide initial empirical support for the hypothesis that tension between global coherence and local unpredictability contributes to perceived literary quality, as reflected in readers’ average ratings. However, there is ample room for further research to explain more of the variance in literary quality assessments. Context-sensitive variables such as genre and reader demographics could significantly shape literary appreciation. Less indirect quality prox- ies with a higher temporal resolution, e.g., reading time, can also provide important correctives to the current proposal. By combining the strengths of empirical aesthetics with fractal theory and information the- ory, our proposal offers a robust framework for evaluating literary quality. For future research, our aim is to empirically investigate how the appreciation of literary language is influenced by or drawn to specific forms of temporal organization of language. Acknowledgments This research was supported by the Velux Foundation (Grant Name The Fabula-Net). Nielbo’s work was supported by the Carlsberg Foundation (Grant Number CF23-1583) and Aarhus Uni- versity Research Foundation (Grant Name The Golden Imprint). Bizzoni and Nielbo’s work was supported by the Innovation Fund Denmark (Grant Number 4298-00018B). 179 References [1] J. Archer and M. L. Jockers. The Bestseller Code. London: Penguin books, 2017. [2] Y. Bizzoni, I. M. S. Lassen, T. Peura, M. R. Thomsen, and K. L. Nielbo. “Predicting Literary Quality - How Perspectivist Should We Be?” In: 1st Workshop on Perspectivist Approaches to Disagreement in NLP, NLPerspectives 2022 as part of Language Resources and Evalua- tion Conference, LREC 2022 Workshop. European Language Resources Association, 2022, pp. 20–25. [3] Y. Bizzoni, P. Moreira, N. Dwenger, I. Lassen, M. Thomsen, and K. Nielbo. “Good Reads and Easy Novels: Readability and Literary Quality in a Corpus of US-published Fiction”. In: Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa). Tórshavn, Faroe Islands: University of Tartu Library, 2023, pp. 42–51. url: https://aclan thology.org/2023.nodalida-1.5. [4] Y. Bizzoni, P. Moreira, M. R. Thomsen, and K. Nielbo. “Sentimental Matters - Predicting Literary Quality by Sentiment Analysis and Stylometric Features”. In: Proceedings of the 13th Workshop on Computational Approaches to Subjectivity, Sentiment, & Social Media Analysis. Ed. by J. Barnes, O. De Clercq, and R. Klinger. Toronto, Canada: Association for Computational Linguistics, 2023, pp. 11–18. doi: 10.18653/v1/2023.wassa-1.2. url: https://aclanthology.org/2023.wassa-1.2. [5] Y. Bizzoni, P. F. Moreira, M. R. Thomsen, and K. L. Nielbo. “The Fractality of Sentiment Arcs for Literary Quality Assessment: the Case of Nobel Laureates”. In: Journal of Data Mining & Digital Humanities Nlp4dh (2023), p. 11406. doi: 10.46298/jdmdh.11406. url: https://jdmdh.episciences.org/11406. [6] Y. Bizzoni, P. F. Moreira, M. R. Thomsen, and K. L. Nielbo. “The Fractality of Sentiment Arcs for Literary Quality Assessment: the Case of Nobel Laureates”. In: Journal of Data Mining & Digital Humanities Nlp4dh (2023), p. 11406. doi: 10.46298/jdmdh.11406. url: https://jdmdh.episciences.org/11406. [7] Y. Bizzoni, T. Peura, M. R. Thomsen, and K. Nielbo. “Sentiment Dynamics of Success: Fractal Scaling of Story Arcs Predicts Reader Preferences”. In: Proceedings of the Work- shop on Natural Language Processing for Digital Humanities. Ed. by M. Hämäläinen, K. Alnajjar, N. Partanen, and J. Rueter. NIT Silchar, India: NLP Association of India (NL- PAI), 2021, pp. 1–6. url: https://aclanthology.org/2021.nlp4dh-1.1. [8] Y. Bizzoni, T. Peura, M. R. Thomsen, and K. L. Nielbo. “Fractal Sentiments and Fairy Tales: Fractal scaling of narrative arcs as predictor of the perceived quality of Andersen’s fairy tales”. In: Journal of Data Mining & Digital Humanities Nlp4dh (2022). doi: 10.46298/jd mdh.9154. [9] E. Bjerck Hagen, C. Hamm, F. Helmich Pedersen, J. M. Sejersted, and E. Vassenden. “Lit- erary Quality: Historical Perspectives”. In: Contested Qualities. Ed. by K. O. Eliassen, J. Hovden, and O. Prytz. Fagbokforlaget, 2018, pp. 47–74. url: https://interaccio.diba.cat/s ites/interaccio.diba.cat/files/contested%5C%5Fqualities%5C%5F0.pdf%5C#page=48. [10] H. Bloom. The Western canon : the books and school of the ages. Harcourt Brace, 1994. 180 [11] M. Burke. “The neuroaesthetics of prose fiction: pitfalls, parameters and prospects”. In: Frontiers in Human Neuroscience 9 (2015). doi: 10.3389/fnhum.2015.00442. url: http://jo urnal.frontiersin.org/Article/10.3389/fnhum.2015.00442/abstract. [12] J. Che, X. Sun, V. Gallardo, and M. Nadal. “Cross-cultural empirical aesthetics”. In: Progress in Brain Research. Vol. 237. Elsevier, 2018, pp. 77–103. doi: 10 . 1016 / bs . pbr . 2 018.03.002. url: https://linkinghub.elsevier.com/retrieve/pii/S0079612318300025. [13] Z. Chen, K. Hu, P. Carpena, P. Bernaola-Galvan, H. E. Stanley, and P. C. Ivanov. “Effect of nonlinear filters on detrended fluctuation analysis”. In: Phys. Rev. E 71.1 (2005), p. 011104. doi: 10.1103/PhysRevE.71.011104. url: https://link.aps.org/doi/10.1103/PhysRevE.71.01 1104. [14] J. Cheng. “Fleshing out models of gender in English-language novels (1850–2000)”. In: Journal of Cultural Analytics 5.1 (2020), p. 11652. doi: https://doi.org/10.22148/001c.116 52. [15] J. Culler. On Deconstruction: Theory and Criticism After Structuralism. Cornell University Press, 2007. [16] J. Derrida and D. Attridge. “”This Strange Institution Called Literature: An Interview with Jaques Derrida””. In: Acts of literature. London: Routledge, 2017, pp. 33–75. [17] P. Feldkamp, Y. Bizzoni, M. R. Thomsen, and K. L. Nielbo. Measuring Literary Quality. Proxies and Perspectives. Report. Darmstadt, 2024. doi: 10.26083/tuprints-00027391. url: https://tuprints.ulb.tu-darmstadt.de/27391/. [18] O. d. C. Fialho. “Foregrounding and refamiliarization: understanding readers’ response to literary texts”. In: Language and Literature: International Journal of Stylistics 16.2 (2007), pp. 105–123. doi: 10.1177/0963947007075979. url: http://journals.sagepub.com/doi/10.1 177/0963947007075979. [19] S. E. Fish. Is there a text in this class? the authority of interpretive communities. 12. print. Cambridge, Mass.: Harvard Univ. Press, 2003. [20] J. Gao, J. Hu, and W.-w. Tung. “Facilitating Joint Chaos and Fractal Analysis of Biosignals through Nonlinear Adaptive Filtering”. In: PLoS ONE 6.9 (2011). Ed. by M. Perc, e24331. doi: 10.1371/journal.pone.0024331. url: http://dx.plos.org/10.1371/journal.pone.002433 1. [21] E. Gospodinova. “Fractal time series analysis by using entropy and hurst exponent”. In: Proceedings of the 23rd International Conference on Computer Systems and Technologies. 2022, pp. 69–75. [22] F. Greb, P. Elvers, and T. Fischinger. “Trends in Empirical Aesthetics: A Review of the Journal Empirical Studies of the Arts from 1983 to 2014”. In: Empirical Studies of the Arts 35.1 (2017), pp. 3–26. doi: 10.1177/0276237415625258. url: http://journals.sagepub.com /doi/10.1177/0276237415625258. [23] J. Guillory. Cultural Capital: The Problem of Literary Canon Formation. University of Chicago Press, 1995. 181 [24] R. von Hallberg. “Editor’s Introduction ”Canons””. In: Critical Inquiry, Special Issue ”Canons” 10.1 (1983), pp. iii–vi. url: https://www.jstor.org/stable/1343403. [25] K. J. Hsü and A. Hsü. “Self-similarity of the ”1/f noise” called music.” In: Proceedings of the National Academy of Sciences 88.8 (1991), pp. 3507–3509. doi: 10.1073/pnas.88.8.3507. eprint: https://www.pnas.org/doi/pdf/10.1073/pnas.88.8.3507. url: https://www.pnas.or g/doi/abs/10.1073/pnas.88.8.3507. [26] J. Hu, J. Gao, and X. Wang. “Multifractal analysis of sunspot time series: the effects of the 11-year cycle and Fourier truncation”. In: Journal of Statistical Mechanics: Theory and Experiment 2009.02 (2009), P02066. doi: 10.1088/1742- 5468/2009/02/p02066. url: http://stacks.iop.org/1742-5468/2009/i=02/a=P02066?key=crossref.879d2c42ec88048312 02df82da8d7a1a. [27] K. Hu, P. C. Ivanov, Z. Chen, P. Carpena, and H. Eugene Stanley. “Effect of trends on detrended fluctuation analysis”. In: Physical Review E 64.1 (2001). doi: 10.1103/PhysRev E.64.011114. url: https://link.aps.org/doi/10.1103/PhysRevE.64.011114. [28] Q. Hu, B. Liu, M. R. Thomsen, J. Gao, and K. L. Nielbo. “Dynamic evolution of sentiments in Never Let Me Go: Insights from multifractal theory and its implications for literary analysis”. In: Digital Scholarship in the Humanities 36.2 (2020), pp. 322–332. doi: 10.1093 /llc/fqz092. url: https://doi.org/10.1093/llc/fqz092. [29] M. Jockers. More Syuzhet Validation. 2016. url: http://www.matthewjockers.net/2016/0 8/11/more-syuzhet-validation/. [30] M. Jockers. “Package ‘Syuzhet’”. In: URL: https://cran. r-project. org/web/packages/syuzhet (2017). [31] M. L. Jockers. Syuzhet: Extract Sentiment and Plot Arcs from Text. 2015. url: https://gith ub.com/mjockers/syuzhet. [32] J. W. Kantelhardt, S. A. Zschiegner, E. Koscielny-Bunde, S. Havlin, A. Bunde, and H. E. Stanley. “Multifractal detrended fluctuation analysis of nonstationary time series”. In: Physica A: Statistical Mechanics and its Applications 316.1-4 (2002), pp. 87–114. [33] C. A. Knoop, V. Wagner, T. Jacobsen, and W. Menninghaus. “Mapping the aesthetic space of literature “from below””. In: Poetics 56 (2016), pp. 35–49. doi: 10.1016/j.poetic.2016.02 .001. url: https://linkinghub.elsevier.com/retrieve/pii/S0304422X16000127. [34] C. Koolen, K. van Dalen-Oskam, A. v. Cranenburgh, and E. Nagelhout. “Literary Quality in the Eye of the Dutch Reader: The National Reader Survey”. In: Poetics 79 (2020), pp. 1– 13. doi: https://doi.org/10.1016/j.poetic.2020.101439. [35] L. Kristoufek and M. Vosvrda. “Measuring capital market efÏciency: long-term mem- ory, fractal dimension and approximate entropy”. In: The European Physical Journal B 87 (2014), pp. 1–9. [36] I. M. S. Lassen, Y. Bizzoni, T. Peura, M. R. Thomsen, and K. L. Nielbo. “Reviewer Prefer- ences and Gender Disparities in Aesthetic Judgments”. In: CEUR Workshop Proceedings. Antwerp, Belgium, 2022, pp. 280–290. url: https://ceur-ws.org/Vol-3290/short%5C%5 Fpaper1885.pdf. 182 [37] H. Leder, B. Belke, A. Oeberst, and D. Augustin. “A model of aesthetic appreciation and aesthetic judgments”. In: British Journal of Psychology 95.4 (2004), pp. 489–508. doi: 10.1 348/0007126042369811. url: https://bpspsychub.onlinelibrary.wiley.com/doi/10.1348/00 07126042369811. [38] P. J. Locher, P. J. Stappers, and K. Overbeeke. “An empirical evaluation of the visual tightness theory of pictorial composition”. In: Acta Psychologica 103.3 (1999), pp. 261– 280. doi: 10.1016/s0001-6918(99)00044-x. url: https://www.scopus.com/inward/record .uri?eid=2-s2.0-0033253633%5C&doi=10.1016%5C%2fs0001-6918%5C%2899%5C%290004 4-x%5C&partnerID=40%5C&md5=74a7e83ba2126c877f0d0409cba48d96. [39] S. Maharjan, J. Arevalo, M. Montes, F. A. González, and T. Solorio. “A Multi-task Ap- proach to Predict Likability of Books”. In: Proceedings of the 15th Conference of the Eu- ropean Chapter of the Association for Computational Linguistics: Volume 1, Long Papers. Valencia, Spain: Association for Computational Linguistics, 2017, pp. 1217–1227. url: https://aclanthology.org/E17-1114. [40] J. McDonough and A. Herczyński. “Fractal patterns in music”. In: Chaos, Solitons & Frac- tals 170 (2023), p. 113315. doi: 10.1016/j.chaos.2023.113315. url: https://linkinghub.else vier.com/retrieve/pii/S0960077923002163. [41] D. S. Miall and D. Kuiken. “Foregrounding, defamiliarization, and affect: Response to literary stories”. In: Poetics 22.5 (1994), pp. 389–407. doi: https://doi.org/10.1016/0304-4 22X(94)00011-5. url: https://www.sciencedirect.com/science/article/pii/0304422X94000 115. [42] D. S. Miall and D. Kuiken. “The form of reading: Empirical studies of literariness”. In: Poetics 25.6 (1998), pp. 327–341. doi: https://doi.org/10.1016/S0304- 422X(98)90003- 1. url: https://www.sciencedirect.com/science/article/pii/S0304422X98900031. [43] M. Mohseni, V. Gast, and C. Redies. “Fractality and Variability in Canonical and Non- Canonical English Fiction and in Non-Fictional Texts”. In: Frontiers in Psychology 12 (2021). url: https://www.frontiersin.org/article/10.3389/fpsyg.2021.599063. [44] M. Mohseni, C. Redies, and V. Gast. “Approximate Entropy in Canonical and Non- Canonical Fiction”. In: Entropy 24.2 (2022), p. 278. doi: 10.3390/e24020278. url: https ://www.mdpi.com/1099-4300/24/2/278. [45] E. Öhman, Y. Bizzoni, P. Feldkamp Moreira, and K. Nielbo. “EmotionArcs: Emotion Arcs for 9,000 Literary Texts”. In: Proceedings of the 8th Joint SIGHUM Workshop on Com- putational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature (LaTeCH-CLfL 2024). Ed. by Y. Bizzoni, S. Degaetano-Ortlieb, A. Kazantseva, and S. Sz- pakowicz. St. Julians, Malta: Association for Computational Linguistics, 2024, pp. 51–66. url: https://aclanthology.org/2024.latechclfl-1.7. [46] W. van Peer. “Ideology or aesthetic quality?” In: The quality of literature: linguistic studies in literary evaluation. Ed. by W. van Peer. Amsterdam ; Philadelphia: John Benjamins Publishing, 2008, pp. 17–29. 183 [47] M. Pelowski, G. Gerger, Y. Chetouani, P. S. Markey, and H. Leder. “But Is It really Art? The classification of images as ”Art”/”Not Art” and correlation with appraisal and viewer interpersonal differences”. In: Frontiers in Psychology 8.Oct (2017). doi: 10.3389/fpsyg.20 17.01729. url: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85030772360%5 C&doi=10.3389%5C%2ffpsyg.2017.01729%5C&partnerID=40%5C&md5=ef6d2c7f622be8 7d9ec2639df7fab9e1. [48] C. K. Peng, S. V. Buldyrev, S. Havlin, M. Simons, H. E. Stanley, and A. L. Goldberger. “Mosaic organization of DNA nucleotides”. In: Physical review E 49.2 (1994), p. 1685. [49] A. J. Reagan, L. Mitchell, D. Kiley, C. M. Danforth, and P. S. Dodds. “The emotional arcs of stories are dominated by six basic shapes”. In: EPJ Data Science 5.1 (2016), pp. 1–12. doi: https://doi.org/10.1140/epjds/s13688-016-0093-1. [50] C. Redies. “A universal model of esthetic perception based on the sensory coding of natural stimuli”. In: Spatial Vision 21.1-2 (2007), pp. 97–117. doi: 10.1163/1568568077827 53886. [51] C. Redies, J. Hasenstein, and J. Denzler. “Fractal-like image statistics in visual art: simi- larity to natural scenes”. In: Spatial Vision 21.1-2 (2008), pp. 137–148. doi: 10.1163/156856 807782753921. url: https://brill.com/view/journals/sv/21/1-2/article-p137%5C%5F9.xml. [52] M. A. Riley, S. Bonnette, N. Kuznetsov, S. Wallot, and J. Gao. “A tutorial introduction to adaptive fractal analysis”. In: Frontiers in Physiology 3 (2012). doi: 10.3389/fphys.2012.0 0371. url: http://journal.frontiersin.org/article/10.3389/fphys.2012.00371/abstract. [53] E. W. Said. Culture and Imperialism. 1st Vintage Books ed. New York: Vintage Books, 1994. [54] I. Schindler, G. Hosoya, W. Menninghaus, U. Beermann, V. Wagner, M. Eid, and K. R. Scherer. “Measuring aesthetic emotions: A review of the literature and a new assessment tool”. In: PLoS ONE 12.6 (2017). doi: 10.1371/journal.pone.0178899. url: https://www.sc opus.com/inward/record.uri?eid=2-s2.0-85020459006%5C&doi=10.1371%5C%2fjournal .pone.0178899%5C&partnerID=40%5C&md5=4da91e23d4877bafb6bb5297c7d15432. [55] G. C. Spivak. “Can the Subaltern Speak?” In: Die Philosophin 14.27 (1988), pp. 42–58. doi: 10.5840/philosophin200314275. [56] R. P. Taylor. “Order in Pollock’s Chaos”. In: Scientific American 287.6 (2002), pp. 116–121. doi: 10.1038/scientificamerican1202-116. url: https://www.scientificamerican.com/arti cle/order-in-pollocks-chaos. [57] T. Underwood, D. Bamman, and S. Lee. “The transformation of gender in English- language fiction”. In: Journal of Cultural Analytics 3.2 (2018), p. 11035. doi: https : / / d oi.org/10.22148/16.019. [58] O. Vartanian. “The Contributions of Emotion and Reward to Aesthetic Judgment of Vi- sual Art”. In: Brain, Beauty, and Art. Ed. by A. Chatterjee and E. Cardilo. Oxford Univer- sity PressNew York, 2022, pp. 83–87. doi: 10 . 1093 / oso / 9780197513620 . 003 . 0017. url: https://academic.oup.com/book/39122/chapter/338534586. 184 [59] M. Walsh and M. Antoniak. “The Goodreads “Classics”: A Computational Study of Read- ers, Amazon, and Crowdsourced Amateur Criticism”. In: Journal of Cultural Analytics 6.2 (2021). doi: 10.22148/001c.22221. [60] X. Wang, B. Yucesoy, O. Varol, T. Eliassi-Rad, and A.-L. Barabási. “Success in Books: Predicting Book Sales Before Publication”. In: EPJ Data Science 8.1 (2019), p. 31. doi: 10.1140/epjds/s13688-019-0208-6. [61] C. Weir and E. Mandes. Interpreting visual art: A survey of cognitive research about pictures. 2017, pp. 1–262. doi: 10.4324/9781351295444. url: https://www.scopus.com/inward/re cord.uri?eid=2-s2.0-85045869972%5C&doi=10.4324%5C%2f9781351295444%5C&partner ID=40%5C&md5=1caeacab0150b346ff11abfca21996a1. [62] R. Wellek. “The Attack on Literature”. In: The American Scholar 42.1 (1972), pp. 27–42. url: https://www.jstor.org/stable/41207073. [63] B. Yuri and F. Pascale. “Sentiment Analysis for Literary Texts: Hemingway as a Case- study”. In: Journal of Data Mining & Digital Humanities Nlp4dh (2024), p. 13155. doi: 10.46298/jdmdh.13155. url: https://jdmdh.episciences.org/13155. 185