=Paper= {{Paper |id=Vol-3290/short_paper9235 |storemode=property |title=Correlations between GoodReads Appreciation and the Sentiment Arc Fractality of the Grimm brothers' Fairy Tales |pdfUrl=https://ceur-ws.org/Vol-3290/short_paper9235.pdf |volume=Vol-3290 |authors=Yuri Bizzoni,Mads Rosendahl Thomsen,Ida Marie S. Lassen,Kristoffer Nielbo |dblpUrl=https://dblp.org/rec/conf/chr/BizzoniTLN22 }} ==Correlations between GoodReads Appreciation and the Sentiment Arc Fractality of the Grimm brothers' Fairy Tales== https://ceur-ws.org/Vol-3290/short_paper9235.pdf
Correlations between GoodReads Appreciation and
the Sentiment Arc Fractality of the Grimm brothers’
Fairy Tales
Yuri Bizzoni1 , Mads Rosendahl Thomsen2 , Ida Marie S. Lassen1 and
Kristo昀昀er Nielbo3
1
  Center for Humanities Computing Aarhus, Aarhus University, Jens Chr. Skous Vej 4, Building 1483,DK-8000 Aarhus
C
2
  School of Communication and Culture - Comparative Literature, Aarhus University, Langelandsgade 139, Building
1580, DK-8000 Aarhus C
3
  Center for Humanities Computing Aarhus & Interacting Minds Centre, Aarhus University, Jens Chr. Skous Vej 4,
Building 1483, 3rd 昀氀oor, DK-8000 Aarhus C


                                         Abstract
                                         Despite their widespread popularity, fairy tales are o昀琀en overlooked when studying literary quality with
                                         quantitative approaches. We present a study on the relation between sentiment fractality and literary
                                         appreciation by testing the hypothesis that fairy tales with a good balance between unpredictability and
                                         excessive self-similarity in their sentiment narrative arcs tend to be more popular and more appreciated
                                         by audiences of readers. In short, we perform a correlation study of the degree of fractality of the
                                         fairy tales of the Grimm brothers and their current appreciation as measured by their Goodreads scores.
                                         Moreover, we look at the popularity of these fairy tales through time, determining which ones have
                                         come to form a strong “internal canon” in the corpus of the authors and which one have fallen into
                                         relative obscurity.

                                         Keywords
                                         computational narratology, sentiment analysis, fractal analysis, literary quality assessment




1. Introduction
In recent years the increase in size of available corpora and the possibility of performing com-
plex operations on textual data have given rise not only to an explosion of tools for the ex-
ploratory analysis of literary collections [9], but also to new hypotheses in literary and aes-
thetic studies, all the while making existing complex hypotheses relatively easier to test [34,
39]. This change in the ways we can study texts has on one hand brought about the possibility
of testing for complex patterns in linguistic data, such as the presence of long-ranging regular-
ities and multifactorial structures, de昀椀ning new research questions for the 昀椀eld. On the other
hand, it has become possible to study traditional questions with brand new methodologies.

CHR 2022: Computational Humanities Research Conference, December 12 – 14, 2022, Antwerp, Belgium
£ yuri.bizzoni@cc.au.dk (Y. Bizzoni); madsrt@cc.au.dk (M. R. Thomsen); idamarie@cas.au.dk (I. M. S. Lassen);
kln@cas.au.dk (K. Nielbo)
ȉ 0000-0002-6981-7903 (Y. Bizzoni); 0000-0002-4975-6752 (M. R. Thomsen); 0000-0001-6905-5665 (I. M. S. Lassen);
0000-0002-5116-5070 (K. Nielbo)
                                       © 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
    CEUR
    Workshop
    Proceedings
                  http://ceur-ws.org
                  ISSN 1613-0073
                                       CEUR Workshop Proceedings (CEUR-WS.org)




                                                                                                        374
   One growingly popular object of research falling in the second category is the concept of
literary quality [43]. Making use of large-scale quantitative means, several works have tried
to de昀椀ne and study literary quality, focusing on di昀昀erent aspects of literary texts and their
contexts [14]. The riddle of quality is as intriguing as elusive: while large numbers of readers
can o昀琀en converge on their overall judgment of a piece of literature, they can o昀琀en be a loss
to justify why a di昀昀erent text, on similar topics or with a similar perspective, fails to produce
for them the same experience. At the same time, no single text in the history of literature has
attracted unanimous approval, and the reasons that di昀昀erent readers bring for their judgment
of a piece can be both varied and hard to de昀椀ne rigorously.
   The 昀椀rst major di昀케culty in this line of research is how to 昀椀nd proxies to represent quality
itself: should one do it through prestigious prices, book sales, or crowd voting? While this
question has been problematic in any study on quality, it becomes perhaps even more stringent
when dealing with quantitative analyses, since it is necessary to 昀椀nd a way to represent it into
a number or a boolean value.
   It has become relatively common among quantitative studies on literary quality to use large
online platforms for readers such as GoodReads [26] as a way to approximate the ‘mean per-
ceived quality” of a text. Platforms like GoodReads represent a particularly appealing source
of information since they gather data from unconstrained readers, over relatively long spans
of time, and can average on large numbers of individual scores [42]. Usually these studies do
not take into account the average number of raters in order to avoid completely con昀氀ating per-
ceived quality (how much readers liked a book) with popularity (how many readers rated the
book). Even so, using scores from a general purpose platform like GoodReads deals with qual-
ity as a statistical mean of all raters’ appreciation, which is a democratic but not unproblematic
take on concept. Since it allows to take into account the perspective of very di昀昀erent readers,
giving to each the same value, it could be argued that literary quality is a multi-dimensional
problem and by using GoodReads scores we are measuring one dimension of it, while other
means could be used to compound it with its other aspects.
   Beyond the question of how to represent quality, the second main problem in this line of
research is to decide which features we should select to explore it.
   Some studies have looked into classic stylometric features, while others have tried more
sophisticated properties, exploring the semantic and psycholinguistic dimensions of texts.
   A set of works has looked speci昀椀cally into the linearity of textual narratives, representing
them as series extended through the virtual time of reading (or listening). In such cases, the
focus has been on how to best represent such narratives lines or arcs and to de昀椀ne their most
typical or relevant shapes [36].
   A relatively prominent line of research has looked into the sentimental and emotional aspect
of literary texts [24, 25], as well as renown movies scripts or song lyrics [8], following the idea
that the emotions expressed in texts, as drawn through sentiment analysis resources [3, 1, 23,
30] and emotion recognition tools [2, 10] working at the word [31, 32, 22] or paragraph level
[27, 44], might play a fundamental role in readers’ engagement and response [15].
   This line of research seems to have returned some promising results [40], and o昀琀en these
studies focus on the cultural context of narratives in general, linking their sentimental or emo-
tional values to social or historical changes in the broadest sense [41, 17, 4].
   Finally, some studies have recently linked the linear and temporal dimension of texts with




                                               375
that of fractal analysis, the ensemble of techniques to assess the fractal characteristics of an
object and assign a fractal dimension to a dataset [13, 29, 6, 16]. Such studies hypothesize that
the quality of literary narratives might be partly due to fractal properties embedded in texts
[12, 19]. For example, Mohseni, Gast, and Redies [33] looked into the fractality of shallow
stylometric features such as TTR and POS-R, while Hu, Liu, Thomsen, Gao, and Nielbo [21]
explored the self-predictability of a novel’s narrative arc. Following the latter, Bizzoni, Peura,
Nielbo, and Thomsen [7] brought this analysis to the fairy tales of H.C. Andersen, 昀椀nding
a positive correlation between the level of fractality in the 昀椀ne-grained sentiment arc of a
narrative and its appreciation: on average, fairy tales with more “fractal” sentiment arcs would
receive higher scores on GoodReads.
   In this work we attempt to take this line of research slightly further, hypothesizing that while
the level of fractality might correlate with reader appreciation or perceived literary quality,
there most likely is a breaking point or ”sweet spot” a昀琀er which a text’s sentimental fractality
stops being a good thing.
   To check this hypothesis we reproduce the experimental setting of Bizzoni, Peura, Nielbo,
and Thomsen [7] on a selection of fairy tales from the Grimm brothers, and we test whether
(a) a correlation between sentiment fractality and GoodReads scores for the Grimm brothers
exists at all; and (b) whether this correlation is linear or follows a rise-and-fall pattern.


2. Data
We followed Taylor and Edwardes’s 2001 English translation of The Grimms’ Fairy Tales [20].
We chose this edition simply because it is the most popular on the Gutenberg Project, and that
would help us select the fairy tales from the Grimm brothers that are most likely to have re-
ceived ratings from more than one or two reviewers on GoodReads; this selection is composed
of 62 tales. We drew the text from Project Gutenberg’s website [38].
   We worked on English both to replicate as close as possible the work of Bizzoni, Peura,
Nielbo, and Thomsen [7] and to have access to some of the most tested resources for Sentiment
Analysis.
   This selection includes most of the best-known stories from the Grimm brothers such as Ra-
punzel, Hansel and Gretel, and Red Riding Hood. It also contains several titles that are relatively
less widespread, such as Clever Hans, The Salad or The Turnip. Most titles of these selections
are nonetheless read enough to have earned more than ten di昀昀erent ratings on GoodReads.
As we show in Figure 1, there seems to be a weak correlation between the number of ratings
and the average rating of a fairy tale, indicating that the most liked stories are also the ones
attracting most votes (link between popularity and appreciation).


3. Method
We performed the sentiment analysis of the fairy tales using the VAD lexicon [31], one of the
most popular word-based sentiment resources in computational linguistics [32]. Since we are
working on unusually short pieces of narratives, we drew our sentiment arcs from each word
score, without averaging them on sentences or paragraphs. This also allows us to operate at




                                               376
Figure 1: Number of GoodReads ratings (on the x axis) plotted against the average rating score (on the
y axis) for our selection of Grimm fairy tales. A growing number of ratings seems to overall correspond
to a higher average rating: more appreciated stories tend to become also more popular.




Figure 2: Number of GoodReads ratings (on the x axis) plotted against the Hurst exponent (on the y
axis) for our selection of Grimm fairy tales.


the 昀椀nest grain of sentiment analysis, keeping us at the interface between narrative and style,
which is particularly important when working with literary quality.
   As suggested by [21], we use the Hurst exponent to approximate the story arc’s inner coher-
ence. The Hurst exponent, �㔻 , is a measure of self-similar behavior. In the context of story arcs,
self-similarity means that the arc’s 昀氀uctuation patterns at faster time-scales resemble 昀氀uctua-
tion patterns at slower time scales [37]. We use Adaptive Fractal Analysis (AFA) to estimate
the Hurst exponent [18]. AFA is based on a nonlinear adaptive multi-scale decomposition algo-
rithm [18]. The 昀椀rst step of the algorithm involves partitioning an arbitrary time series under
study into overlapping segments of length Ā = 2�㕛 + 1, where neighboring segments overlap
by �㕛 + 1 points. In each segment, the time series is 昀椀tted with the best polynomial of order �㕀,
obtained by using the standard least-squares regression; the 昀椀tted polynomials in overlapped
regions are then combined to yield a single global smooth trend. Denoting the 昀椀tted poly-
nomials for the �㕖 − ýℎ and (�㕖 + 1) − ýℎ segments by �㕦 �㕖 (�㕙1 ) and �㕦 (�㕖+1) (�㕙2 ), respectively, where




                                                   377
�㕙1 , �㕙2 = 1, ⋯ , 2�㕛 + 1, we de昀椀ne the 昀椀tting for the overlapped region as

                           �㕦 (�㕐) (�㕙) = Ā1 �㕦 (�㕖) (�㕙 + �㕛) + Ā2 �㕦 (�㕖+1) (�㕙), �㕙 = 1, 2, ⋯ , �㕛 + 1
where Ā1 = (1 − �㕙−1
                  �㕛
                     ) and Ā2 = �㕙−1
                                  �㕛
                                     can be written as (1 − �㕑�㕗 /�㕛) for �㕗 = 1, 2, and where �㕑�㕗
denotes the distances between the point and the centers of �㕦 (�㕖) and �㕦 (�㕖+1) , respectively. Note
that the weights decrease linearly with the distance between the point and the center of the
segment. Such a weighting is used to ensure symmetry and e昀昀ectively eliminate any jumps or
discontinuities around the boundaries of neighboring segments. As a result, the global trend is
smooth at the non-boundary points, and has the right and le昀琀 derivatives at the boundary [37].
The global trend thus determined can be used to maximally suppress the e昀昀ect of complex
nonlinear trends on the scaling analysis. The parameters of each local 昀椀t is determined by
maximizing the goodness of 昀椀t in each segment. The di昀昀erent polynomials in overlapped part
of each segment are combined using Equation 5 so that the global 昀椀t will be the best (smoothest)
昀椀t of the overall time series. Note that, even if �㕀 = 1 is selected, i.e., the local 昀椀ts are linear,
the global trend signal will still be nonlinear. With the above procedure, AFA can be readily
described. For an arbitrary window size Ā, we determine, for the random walk process þ(�㕖),
a global trend ÿ(�㕖), �㕖 = 1, 2, ⋯ , �㕁 , where �㕁 is the length of the walk. The residual of the 昀椀t,
þ(�㕖) − ÿ(�㕖), characterizes 昀氀uctuations around the global trend, and its variance yields the Hurst
parameter �㔻 according to the following scaling equation:

                                                        �㕁               1/2
                                                    1
                                     �㔹 (Ā) = [       ∑(þ(�㕖) − ÿ(�㕖))2 ] ∼ Ā �㔻
                                                   �㕁 �㕖=1
   By computing the global 昀椀ts, the residual, and the variance between original random walk
process and the 昀椀tted trend for each window size Ā, we can plot log2 �㔹 (Ā) as a function of
log2 Ā. The presence of fractal scaling amounts to a linear relation in the plot, with the slope
of the relation providing an estimate of �㔻 1 . Accordingly, a �㔻 higher than .5 indicates a degree
of linear coherence (e.g., positive sentiments are followed by positive sentiments), while �㔻
lower than .5 indicates a series that tends to revert to the mean (e.g. a positive emotion always
follows a negative emotion).
   A昀琀er determining the Hurst exponent for each fairy tale, we correlated it with the average
scores on Goodreads. To compute the strength of the correlations, we used the most popular
metrics: Pearson [5] and Spearman [35] correlations. A昀琀er computing them on the whole
dataset, we calculated each of these measures on what we considered the possible “upward”
and “downward” trends of the data. To choose the breaking point to divide the data between
a potentially positive and a potentially negative correlation trend, we computed a correlation
matrix between all of the Hurst scores and all of the GoodReads values in our dataset (see
Figure 3).
   We also computed the probability distribution of seeing a highly rated fairy tale versus the
probability of seeing a non-highly rated fairy tale at di昀昀erent Hurst values (see Figure 4). To
do this, we 昀椀rst conventionally set a high rate threshold at 3.5 GoodReads stars, following the
practice of Maharjan, Arevalo, Montes, González, and Solorio [28], and then for each Hurst
1
    Code for computing DFA and AFA is available at https://github.com/knielbo/sa昀케ne




                                                               378
Figure 3: Correlation matrix between Hurst exponents and average GoodReads scores. The higher
correlations appear to be between Hurst exponents between 0.55 and 0.6, and avg. ratings between 3.0
and 4.5. The highest correlations seem set between H=0.56 and H=0.58.


value (H=0.5, H=0.51, and so forth) we computed the probability of seeing a highly rated tale
(higher than the threshold) versus the probability of seeing a non-highly rated tale.
   Both these tests point to a peaking positive correlation of arc fractality with GoodReads
scores between a Hurst exponent of 0.56 and a Hurst exponent of 0.6 circa, which appears in
line with the 昀椀ndings of Bizzoni, Peura, Nielbo, and Thomsen [7].


4. Results
Our analysis shows that the correlation between the number of ratings a fairy tale receives
and its Hurst exponent, on the other hand, does not appear so obvious (Figure 2). a positive
correlation between a tale’s Hurst exponent and its average rating up until a given Hurst value,
and a negative correlation a昀琀erward. As we said in Section 3, we de昀椀ned this breaking point
both by checking the probabilities of a “high scoring tale” (Figure 3) and by computing the cor-
relation matrix between Hurst exponent and GoodReads’ scores (Figure 4). We found that the
point that most clearly divides the positive from the negative correlation in our data appears to
be Hurst=0.57, although naturally it is the whole area between H=0.56 and H=0.59 that seems
to indicate an inversion in the correlation between these two variables. In this way, we can
easily identify two opposing trends in our data, that appear by computing both Pearson’s and
Spearman’s correlations. The Pearson correlation is of 0.5 (p-value: 0.01) for the Hurst interval
0.48-0.58, and of -0.3 (p-value: 0.08) for the interval 0.57-0.75. The Spearman correlation is of




                                                379
Figure 4: Probability of seeing a high scoring vs a non-high scoring fairy tale for di昀昀erent Hurst values.
The probability of a high-scoring tale seems to peak between Hurst=0.56 and Hurst=0.6 circa.


0.45 (p-value: 0.02) for the Hurst interval 0.48-0.58, and of -0.37 (p-value: 0.02) for the interval
0.57-0.75. Following standard boundaries for the strength of linear correlations’ ranges [11], we
can de昀椀ne all these correlations as moderate or robust. In average, all but the second Pearson
correlation have a p-value under the signi昀椀cance threshold of 0.05 2 . In all cases, the negative
correlations appear weaker (and less signi昀椀cant) than the positive ones. This seems to con昀椀rm
what can also be seen in Figure 5: a昀琀er the “ sweet spot” of arc fractality, the steepness of the
trendline is smoother and the link between a fairy tale’s average score and its Hurst exponent
decreases. Looking into the individual titles of the fairy tales is also interesting: the stories
falling into the “internal canon” of the Grimm brothers seems to form a loose cluster. For ex-
ample, fairy tales that have the highest number of readers on GoodReads, that have elicited
reproductions in movies or cartoons, or that tend to appear most o昀琀en in choice anthologies of
the authors appear to cluster at the center of the graph, namely at the high point of GoodReads’
appreciation and fractal equilibrium. As can be seen for example in Figure 6, Little Red Riding
Hood/Little Red-Cap (1900 ratings), Hansel and Gretel (2711 ratings), Rapunzel (2303 ratings)
fall on the upper right quadrant, while in Figure 7 Ashputtel, the Grimms’ version of Cinderella
(1706 ratings), King Grisly-Beard (230 ratings), The Travelling Musicians [of Bremen] (479 rat-
ings) tend to cluster on the upper le昀琀 quadrant of the graph. At the margins of the plots we
can 昀椀nd less widespread pieces like The Fox and the Horse (55 ratings), Sweetheart Roland (88
ratings) or Clever Hans (75 ratings). But independently from the number of ratings, the reader

2
    The second Pearson’s p-value is just slightly above the formal threshold of signi昀椀cance. What this indicates, rather
    than a Yes/No signi昀椀cance output, is that the correlation between Hurst values and readers’ scores becomes less
    obvious a昀琀er the peak area around H=.57. Both the value and the signi昀椀cance of the negative correlations are
    weaker than the positive correlations’ ones.




                                                           380
Figure 5: Correlations between Hurst exponent and average GoodReads rating, breaking point set at
H=0.57.


will recognize the stories in the center as being among the best known from the authors.


5. Discussion
We have computed a correlation between sentiment arcs’ Hurst exponents and average
GoodReads’ scores on a selection of Grimms’ fairy tales, both con昀椀rming the 昀椀ndings of previ-
ous literature on H.C. Andersen [7] and validating our hypothesis of the presence of an ideal
balance or “sweet spot” between de昀椀cient and excessive fractality. Based on our results, it
seems that the correlation between these two variables is positive and increasingly robust up
to a certain point, and becomes negative and weaker from that point on wards. The fact that
our correlations are never “very strong” (e.g. they are never higher than Pearson=0.5) appears
to us as reassuring rather than discouraging. The fractality of a fairy tale’s sentiment arc is
not, naturally, the sole cause for its literary quality or popular appreciation, and a large number
of factors are likely to in昀氀uence the appreciation of a text - we would not expect very strong
correlations from this one feature.
   Naturally, our study’s take on literary quality has a number of limitations. The average
ratings of GoodReads have the substantial advantage of representing a large number of read-
ers, who annotate a text in completely natural circumstances (e.g. they are not being paid or
brought in a lab to do the annotation) over a relatively long span of time. At the same time, like
all works using GoodReads scores or similar averages, we are consciously con昀氀ating quality
with popularity, a perspective that might contrast with a prestige-driven view of quality, in
which a smaller number of experts de昀椀nes the quality of a text. Also, using the simple average
of GoodReads reduces judgments that might be related to di昀昀erent aspects of the text to one




                                               381
Figure 6: Closer look into the titles in the upward trend part of the plot. Pearson correlation=0.44,
p-value=0.03.


single dimension. Although this problem is less severe when we consider a dataset of a single
genre, from a small group of authors, it would become increasingly relevant if we were to apply
this way of measuring quality to heterogeneous collections of texts.
   Overall, our study con昀椀rms the perhaps surprising 昀椀nding that quality (in terms of
GoodReads’ scores) and sentiment fractality hold a correlation for literary fairy tales, but adds
the insight that such correlation might not be always linear: there could be a “sweet spot” for
fractality’s e昀昀ects on readers. If we compare the study from Bizzoni, Peura, Nielbo, and Thom-
sen [7] with our own 昀椀ndings, we see that the most iconic titles from both collections fall in a
similar interval for Hurst scores: The Little Mermaid or The Ugly Duckling for H.C. Andersen,
Red Riding Hood or Hansel and Gretel for the Grimm brothers are between H=0.55 and H=0.60
approximately.
   In future, we will try to test other quality “proxies” (prestigious awards, established canons)
and to expand our feature set beyond sentiment arcs. It is reasonable to assume that di昀昀er-
ent stylistic characteristics complement each other. It would also be interesting to attempt to
predict the popularity of a text through automatic classi昀椀cation. Overall, sentiment fractality
appears to be a promising feature to further our understanding of literary quality.




                                                382
Figure 7: Closer look into the titles in the downwards trend part of the plot. Pearson correlation=-0.43,
p-value=0.03.


References
 [1] C. O. Alm. “Characteristics of high agreement a昀昀ect annotation in text”. In: Proceedings
     of the fourth linguistic annotation workshop. 2010, pp. 118–122.
 [2] C. O. Alm, D. Roth, and R. Sproat. “Emotions from text: machine learning for text-based
     emotion prediction”. In: Proceedings of human language technology conference and con-
     ference on empirical methods in natural language processing. 2005, pp. 579–586.
 [3] E. C. O. Alm. A昀昀ect in*text and speech. University of Illinois at Urbana-Champaign, 2008.
 [4] K. Bebbington, C. MacLeod, T. M. Ellison, and N. Fay. “The sky is falling: evidence of
     a negativity bias in the social transmission of information”. In: Evolution and Human
     Behavior 38.1 (2017), pp. 92–101.
 [5] J. Benesty, J. Chen, Y. Huang, and I. Cohen. “Pearson correlation coe昀케cient”. In: Noise
     reduction in speech processing. Springer, 2009, pp. 1–4.
 [6] J. Beran. Statistics for Long-Memory Processes. 1st ed. New York: Chapman and Hall/CRC,
     1994.
 [7] Y. Bizzoni, T. Peura, K. Nielbo, and M. Thomsen. “Fractal Sentiments and Fairy Tales-
     Fractal scaling of narrative arcs as predictor of the perceived quality of Andersen’s fairy
     tales”. In: Journal of Data Mining & Digital Humanities (2022).




                                                  383
 [8] C. O. Brand, A. Acerbi, and A. Mesoudi. “Cultural evolution of emotional expression in
     50 years of song lyrics”. In: Evolutionary Human Sciences 1 (2019).
 [9] J. Brooke, A. Hammond, and G. Hirst. “GutenTag: an NLP-driven tool for digital human-
     ities research in the Project Gutenberg corpus”. In: Proceedings of the Fourth Workshop
     on Computational Linguistics for Literature. 2015, pp. 42–47.
[10]   E. Cambria, D. Das, S. Bandyopadhyay, and A. Feraco. “A昀昀ective computing and senti-
       ment analysis”. In: A practical guide to sentiment analysis. Springer, 2017, pp. 1–10.
[11]   J. Cohen. “Quantitative methods in psychology: A power primer”. In: Psychological Bul-
       letin (1992), pp. 153–159.
[12]   J. Cordeiro, P. R. M. Inácio, and D. A. B. Fernandes. “Fractal Beauty in Text”. In: Progress
       in Arti昀椀cial Intelligence. Ed. by F. Pereira, P. Machado, E. Costa, and A. Cardoso. Lecture
       Notes in Computer Science. Cham: Springer International Publishing, 2015, pp. 796–802.
       doi: 10.1007/978-3-319-23485-4\_80.
[13]   T. M. Cox. “Long range dependence: A review”. In: Iowa State University. Press, 1984.
[14]   A. van Cranenburgh and C. Koolen. “Identifying literary texts with bigrams”. In: Pro-
       ceedings of the Fourth Workshop on Computational Linguistics for Literature. 2015, pp. 58–
       67.
[15]   I.-A. Drobot. “A昀昀ective Narratology. The Emotional Structure of Stories”. In: Philologica
       Jassyensia 9.2 (2013), p. 338.
[16]   A. Eke, P. Herman, L. Kocsis, and L. R. Kozak. “Fractal characterization of complexity
       in temporal physiological signals”. In: Physiological Measurement 23.1 (2002), R1. url:
       http://stacks.iop.org/0967-3334/23/i=1/a=201.
[17]   K. Eriksson and J. C. Coultas. “Corpses, maggots, poodles and rats: Emotional selection
       operating in three phases of cultural transmission of urban legends”. In: Journal of Cog-
       nition and Culture 14.1-2 (2014), pp. 1–26.
[18]   J. Gao, J. Hu, and W.-w. Tung. “Facilitating Joint Chaos and Fractal Analysis of Biosignals
       through Nonlinear Adaptive Filtering”. In: PLoS ONE 6.9 (2011). Ed. by M. Perc, e24331.
       doi: 10.1371/journal.pone.0024331. url: http://dx.plos.org/10.1371/journal.pone.002433
       1.
[19]   J. Gao, M. L. Jockers, J. Laudun, and T. Tangherlini. “A multiscale theory for the dynam-
       ical evolution of sentiment in novels”. In: 2016 International Conference on Behavioral,
       Economic and Socio-cultural Computing (BESC). Ieee. 2016, pp. 1–4.
[20]   J. Grimm and W. Grimm. Grimms’ Fairy Tales. Trans. by E. Taylor and M. Edwardes.
       Gutenberg Project, 2001.
[21]   Q. Hu, B. Liu, M. R. Thomsen, J. Gao, and K. L. Nielbo. “Dynamic evolution of sentiments
       in Never Let Me Go: Insights from multifractal theory and its implications for literary
       analysis”. In: Digital Scholarship in the Humanities 36.2 (2021), pp. 322–332.
[22]   S. M. Islam, X. L. Dong, and G. de Melo. “Domain-speci昀椀c sentiment lexicons induced
       from labeled documents”. In: Proceedings of the 28th International Conference on Compu-
       tational Linguistics. 2020, pp. 6576–6587.




                                               384
[23]   S. Jain, S. Malviya, R. Mishra, and U. S. Tiwary. “Sentiment Analysis: An Empirical Com-
       parative Study of Various Machine Learning Approaches”. In: Proceedings of the 14th In-
       ternational Conference on Natural Language Processing (ICON-2017). Kolkata, India: NLP
       Association of India, 2017, pp. 112–121. url: https://aclanthology.org/W17-7515.
[24]   M. Jockers. Syuzhet: Extracts Sentiment and Sentiment-Derived Plot Arcs from Text (Version
       1.0. 1). 2017.
[25]   E. Kim and R. Klinger. “A survey on sentiment and emotion analysis for computational
       literary studies”. In: arXiv preprint arXiv:1808.03137 (2018).
[26]   K. Kousha, M. Thelwall, and M. Abdoli. “Goodreads reviews to assess the wider impacts
       of books”. In: Journal of the Association for Information Science and Technology 68.8 (2017),
       pp. 2004–2016. doi: 10.1002/asi.23805. url: https://onlinelibrary.wiley.com/doi/abs/10.1
       002/asi.23805.
[27]   X. Li, L. Bing, W. Zhang, and W. Lam. “Exploiting BERT for end-to-end aspect-based
       sentiment analysis”. In: arXiv preprint arXiv:1910.00883 (2019).
[28]   S. Maharjan, J. Arevalo, M. Montes, F. A. González, and T. Solorio. “A multi-task ap-
       proach to predict likability of books”. In: Proceedings of the 15th Conference of the Eu-
       ropean Chapter of the Association for Computational Linguistics: Volume 1, Long Papers.
       2017, pp. 1217–1227.
[29]   B. Mandelbrot. The Fractal Geometry of Nature. Updated ed. edition. San Francisco: Times
       Books, 1982.
[30]   M. V. Mäntylä, D. Graziotin, and M. Kuutila. “The evolution of sentiment analysis–A
       review of research topics, venues, and top cited papers”. In: Computer Science Review 27
       (Feb. 1, 2018), pp. 16–32. doi: 10.1016/j.cosrev.2017.10.002. url: https://www.sciencedir
       ect.com/science/article/pii/S1574013717300606.
[31]   S. M. Mohammad and P. D. Turney. “Nrc emotion lexicon”. In: National Research Council,
       Canada 2 (2013).
[32]   S. M. Mohammad. “Obtaining Reliable Human Ratings of Valence, Arousal, and Domi-
       nance for 20,000 English Words”. In: Proceedings of The Annual Conference of the Associ-
       ation for Computational Linguistics (ACL). Melbourne, Australia, 2018.
[33]   M. Mohseni, V. Gast, and C. Redies. “Fractality and Variability in Canonical and Non-
       Canonical English Fiction and in Non-Fictional Texts”. In: Frontiers in Psychology 12
       (2021). url: https://www.frontiersin.org/article/10.3389/fpsyg.2021.599063.
[34]   F. Moretti. Distant reading. Verso Books, 2013.
[35]   L. Myers and M. J. Sirois. “Spearman correlation coe昀케cients, di昀昀erences between”. In:
       Encyclopedia of statistical sciences 12 (2004).
[36]   A. J. Reagan, L. Mitchell, D. Kiley, C. M. Danforth, and P. S. Dodds. “The emotional arcs
       of stories are dominated by six basic shapes”. In: EPJ Data Science 5.1 (2016), pp. 1–12.
[37]   M. A. Riley, S. Bonnette, N. Kuznetsov, S. Wallot, and J. Gao. “A tutorial introduction to
       adaptive fractal analysis”. In: Frontiers in Physiology 3 (2012). doi: 10.3389/fphys.2012.0
       0371. url: http://journal.frontiersin.org/article/10.3389/fphys.2012.00371/abstract.




                                               385
[38]   B. Stroube. “Literary freedom: Project gutenberg”. In: XRDS: Crossroads, The ACM Mag-
       azine for Students 10.1 (2003), pp. 3–3.
[39]   T. Underwood. Distant Horizons: Digital Evidence and Literary Change. University of
       Chicago Press, Feb. 14, 2019. doi: 10.7208/9780226612973. url: https://www.degruyt
       er.com/document/doi/10.7208/9780226612973/html.
[40]   S. Veleskia. “Weak Negative Correlation between the Present Day Popularity and the
       Mean Emotional Valence of Late Victorian Novels”. In: Proceedings http://ceur-ws. org
       ISSN 1613 (2020), p. 0073.
[41]   C. J. Walker and B. Blaine. “The virulence of dread rumors: A 昀椀eld experiment.” In: Lan-
       guage & Communication (1991).
[42]   M. Walsh and M. Antoniak. “The Goodreads ‘Classics’: A Computational Study of Read-
       ers, Amazon, and Crowdsourced Amateur Criticism”. In: Journal of Cultural Analytics 4
       (2021), pp. 243–287.
[43]   M. Wilkens. “Canons, close reading, and the evolution of method”. In: Debates in the
       digital humanities (2012), pp. 249–58.
[44]   H. Xu, B. Liu, L. Shu, and P. S. Yu. “DomBERT: Domain-oriented language model for
       aspect-based sentiment analysis”. In: arXiv preprint arXiv:2004.13816 (2020).




                                              386