<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>AndrewPiper</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hao Xu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Eric D.Kolaczyk</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>McGill University</institution>
          ,
          <addr-line>Montreal, QC H3A 2M7</addr-line>
          ,
          <country country="CA">CANADA</country>
        </aff>
      </contrib-group>
      <fpage>500</fpage>
      <lpage>511</lpage>
      <abstract>
        <p>A core aspect of human storytelling is the element of narrative time. In this paper, we propose a model of narrative revelation using the information-theoretic concept of relative entropy, which has been used in a variety of settings to understand textual similarity, along with methods in time-series analysis to model the properties of revelation over narrative time. Given a beginning state of no knowledge about a story (beyond paratextual clues) and an end state of full knowledge about a story's contents, what are the rhythms of dissemination through which we arrive at this 昀椀nal state? Using a dataset of over 2,700 books of contemporary English prose, we test for various time-dependent characteristics of narrative revelation against four stylistic categories of interest: audience age level, prestige, point-of-view, and 椀昀ctionality.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;narratology</kwd>
        <kwd>information theory</kwd>
        <kwd>contemporary literature</kwd>
        <kwd>discourse structure</kwd>
        <kwd>narrative revelation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Italo Calvino was fond of quoting a Sicilian expression that “time takes no time in a stor8y]”. [
A narrator can tell a story that traverses centuries in a few sentences or can slow time down
to the point where a few seconds takes minutes to describe. Such manipulations of time – one
of the great loves of narrative theory28[
        <xref ref-type="bibr" rid="ref30 ref31">, 31, 32</xref>
        ] – hide a more elementary fact about stories:
no matter how much they may compress or dilate time, they still take time to tell. All stories,
even the shortest, happen in time and cannot be told all at once.
      </p>
      <p>
        The fact that stories take time means that the dissemination of information – the ordering
and divulging of facts about the storyworld – plays an important role in the meaning of the
story. Independent ofwhat is told,how it is told is a key aspect of a story’s meaning.
Narrative theorists refer to this discrepancy as “discourse structur1e3”, [
        <xref ref-type="bibr" rid="ref1 ref6">1, 6</xref>
        ] and it has largely
been framed as an ordering problem, i.e. the discrepancy between how narrative information
is revealed and the underlying logic of events within the story. A sizable body of empirical
studies has shown, for example, the way modulations in narrative order – such as withholding
salient information or reordering events in non-linear fashion – can in昀氀uence the emotional
or a昀ective response of audiences [
        <xref ref-type="bibr" rid="ref3 ref6">6, 3</xref>
        ].
      </p>
      <p>
        Less attention has been paid to the more elementary question of the amount of novel
information imparted at any given moment in a story. Given a beginning state of no
knowledge about a story (beyond paratextual clues14[]) and an end state of full knowledge about
a story’s contents, what are the rhythms of dissemination through which we arrive at this
昀椀nal state? Does a narrator introduce more information early and then spend time going over
more familiar terrain or, conversely, withhold key pieces of information that we only learn
about towards the close of the story? Are there periods of local exploration, where narrators
spend more time introducing novel information, and periods of exploitation (to borrow a classic
framework from computer science), where narrators immerse audiences in already established
characters, themes, and situations? Do these practices exhibit predictable, periodic behavior
or are they more akin to random walks? Finally, how are such practices impacted by the social
situatedness of narrative 1[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]? When a narrator is cra昀琀ing a story for younger audiences or
telling a true versus 昀椀ctional story or appealing to literary elites on prize committees, do we
see modulations in the way narrative information is revealed?
      </p>
      <p>
        In this paper, we draw on the 昀椀elds of information theory and statistics (including time
series analysis, in the latter case) to develop a model of narrative revelation to capture the
relative amount of new information communicated by authors over narrative time. We use the
information-theoretic concept of relative entropy or Kullback-Leibler divergence to quantify
how much new information is introduced in a given book during a window of text at a given
time  , relative to the prior window at tim e− 1 . We then employ various techniques from
statistics and time series analysis to characterize the temporal dynamics of the resulting traces,
at both the aggregate level (across our corpus) and at the level of individual books. Relative
entropy has been applied to the study of textual di昀erence in numerous settings9][, including
parliamentary discourse2][ and the evolution of scienti昀椀c English [
        <xref ref-type="bibr" rid="ref10 ref4">10, 4</xref>
        ], as well as been shown
to be a good predictor of human visual attention16[], linguistic processing1[
        <xref ref-type="bibr" rid="ref19 ref8">8, 19</xref>
        ], and has
more recently been proposed as a model of implicit cultural learn3i4n]g. S[tatistics – and in
particular, time series analysis – provides us with a well-developed set of tools for detecting
and describing aspects of the temporal behavior in the relative entropies for our corpus, such
as trend, periodicity, and statistical dependency of the present on the past (e.g.7,])[.
      </p>
      <p>We apply our measure of narrative revelation to the CONLIT datas2e4t][, which includes
approximately2700 books from12 genres drawn from contemporary English prose published
since 2001. We use available partitions in the data to test the relationship between patterns of
narrative revelation and di昀erent social categories. In particular, we concentrate on the
following categories in our analysis, including the relevant classes from the CONLIT data:
昀椀ctionality (昀椀ction / non-昀椀ction), prestige (prizewinning novels / bestsellers), age level (YA + Middle
School / Adult Fiction), and point-of-view (昀椀rst person / third person). Note that all but the
椀昀rst condition on 昀椀ctional narratives.</p>
      <p>
        Understanding the dynamics of narrative revelation can provide an important window into
the nature of human storytelling using computational methods. First, it can provide an
objective measure of informational novelty within texts, which can then be associated with reader
judgments. While beyond the scope of the present work, future work will want to explore
this relationship between the rate of novel information and readers’ a昀ective states. Such a
measure can also provide insights into the e昀ects that social settings have on the revelation of
narrative information, such as audience type or the narrator’s goals regarding the
instrumentality of information being communicated (facticity/昀椀ctionality), as well as potentially reveal
audience preferences for story structure when it comes to the distribution of new information.
In particular, it can give us the means to model what is known as the explore/exploit trade-o昀
when it comes to narrative communication3[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. When telling a story we assume that
narrators will oscillate between periods of exploration (introducing and developing novel ideas and
characters) and periods of exploitation (deepening our understanding/attachment to the agents
and experiences already introduced). And yet we currently have little knowledge about how
these relationships evolve over narrative time as it relates to long narrative forms and whether
social factors impact this behavior. Our work thus attempts to provide a novel method for
modeling the dissemination of information over narrative time further contributing to more
general inquiry into the temporal properties of human storytelling.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        A number of approaches to the computational modeling of discourse structure have been
proposed. Schmidt [
        <xref ref-type="bibr" rid="ref29">30</xref>
        ] used topic modeling to identify thematic arcs in television screenplays,
while Thompson, Wojtowicz, and DeDeo3[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] used topic models to study thematic progression
in philosophical texts and social media. Reagan, Mitchell, Kiley, Danforth, and Dod2d7s] [
used sentiment analysis to model the concept of narrative fortun1e2][, for which Elkins 1[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
provides a more in-depth study of the validity of sentiment arcs as models of narrative
structure. Boyd, Blackburn, and Pennebaker5][ used particular word types to capture three primary
narrative stages, and Sap, Jafarpour, Choi, Smith, Pennebaker, and Horvit2z9][ used the
predictability of next sentences to capture the concept of narrative “昀氀ow,” though this is not applied
to questions of narrative time. Piper and Toubia26[] used word embeddings to model narrative
non-linearity using the traveling salesman problem. Ouyang and McKeow22n][and Piper [23]
devised methods for predicting narrative “turning points” as larger structural qualites, drawing
on Aristotelian and Augustinian theories of narrative respectively. Finally, McGrath, Higgins,
and Hintze [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] and Liddle 2[0] have used information theoretic frameworks to model stylistic
novelty over narrative time with respect to small collections of literary documents.
      </p>
      <p>
        Our work builds on this prior work in at least two important ways. First, we utilize a large
and diverse collection of publicly successful long narrative for2m4]s. [This overcomes
limitations surrounding prior work’s use of arti昀椀cially constructed corpo2r9a] o[r small
historical literary collection2s1[
        <xref ref-type="bibr" rid="ref20">, 20</xref>
        ]. Second, in using an information-theoretic model of narrative
revelation, quantifying surprise through similarity of word-count distributions in adjacent
windows of text, our models are agnostic with respect to linguistic or thematic content. In contrast,
prior work conditioned on topical distributio3n0s, [
        <xref ref-type="bibr" rid="ref32">33</xref>
        ], particular word types5[], or limited
semantic frameworks such as sentiment 2[
        <xref ref-type="bibr" rid="ref11 ref7">7, 11</xref>
        ]. In this sense, our models approach the question
of discourse structure from a more general perspective.
      </p>
      <p>
        Our reliance on Kullback-Leibler divergence as our principal measure of “information
revelation” also brings analytical a昀ordances. Prior work has shown its relevance for understanding
a variety of cultural domains (see Chang and DeDeo9][for an overview), including the study
of the novelty of parliamentary discours2e][,the evolution of scienti昀椀c English [
        <xref ref-type="bibr" rid="ref10 ref4">10, 4</xref>
        ], human
visual attention [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], linguistic processing1[
        <xref ref-type="bibr" rid="ref19 ref8">8, 19</xref>
        ], and implicit cultural learnin3g4[]. Other
modeling options such as word embeddings, PCA, or topic modeling require knowledge of the
entire text and thus would pollute our measurement of local information novelty relative to a
prior window, where the subsequent direction of the text is assumed to be unknown. While
transformer models or LLMs could potentially be useful for this task, they run the risk of
introducing cultural bias into our models due to the opacity of training data. KLD only measures
the particular linguistic shi昀琀s within a text bringing in no external information. We take up
limitations surrounding the use of KLD to capture the concept of narrative revelation in our
discussion section.
      </p>
      <p>Our work is perhaps closest in spirit to that of Thompson, Wojtowicz, and DeDe3o3][and
their conceptualization of the explore/exploit paradigm in a narrative setting, although it di昀ers
in three key ways. First, their data derive primarily from time-ordered acts of speech (including
from parliamentary and social media sources), rather than long narrative forms. Second, they
use distributions derived from topic modeling within adjacent windows, while we use
wordcount distributions when computing Kullback-Leibler divergence for adjacent time windows.
Third, whereas they use random-walks based on Levy Flights to model their resulting
timeindexed sequences of Kullback-Leibler divergences, with an eye speci昀椀cally towards capturing
narrative (dis)continuity, our focus is on more fundamental properties of time-indexed data like
average, trend, and, in particular, dependency structure, for which we use statistical regression
and time series analysis.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Methods</title>
      <p>We de昀椀ne narrative revelation as the practice of disseminating novel information over narrative
time with respect to a local prior window of text. Given what has come immediately before,
how surprising is any new passage? To capture this concept of surprise, we use
KullbackLeibler divergence (KLD), which calculates the relative entropy (or divergence) between two
probability distributions:
  (, ) =
∈
∑ () log ()
()
,
(1)
equal to zero, with equality holding if and onl y iafnd  are equal.
where  is a (discrete) state space and and  are probability mass functions de昀椀ned on ,
such that () = 0</p>
      <p>implies() = 0 . Note that the quantity in (1) is always greater than or</p>
      <p>For our purposes, the state space is time-varying, with de昀椀ned as the union of all words in
the  -th and ( −1) -st adjacent (non-overlapping) windows o1f000 words each. The function s
and  are estimated from the word frequencies in these two respective windows, using Laplace
smoothing to avoid0 values. While conditioning on word frequencies limits the amount of
semantic context that can be inferred from a given window of text, it has the advantage of

observing the literal distribution of information over narrative time. The end result of this
approach is a time series of KLD values, say{ 
 } = 1 , capturing the extent to which
information disseminated at time within a given narrative is novel compared to that disseminated
just previously at tim e− 1 , with larger KLD corresponding to greater novelty. As su ch, 
is intended to capture the amount of new information disseminated over narrative time. We
refer to these representations as the non-normalized time series. Figure 1 provides examples
of this approach and the resulting values and behavior.</p>
      <p>In order to test for an association between revelation and narrative time in aggregate, we
also create normalized representations  of  for each book to control for di昀ering book
lengths. To do so, we 昀椀rst subset all books into 50 equal parts, then subsamp1le000 words for
each part, and then compute   for the  = 49 resulting pairs of adjacent windows. We
refer to these representations as the normalized time series. As we discuss in Section 4.2 these
serve as the basis of our regression analysis to better understand the linear tren ds  of at
the aggregate level.</p>
      <p>Using these approaches, we formulate the following hypotheses:</p>
      <p>H1. Average Rate of Revelation. We expect average revelation at the book level to vary
by all of our measured social categories. Speci昀椀cally, prior wo2r9k] [has indicated that
昀椀ctional narratives are more predictable at the sentence level and thus we expect to see lower
levels of average revelation with respect to 昀椀ctionality at the document level. We also expect
average revelation to be negatively associated with reading level and positively associated with
prestige (more information being more “di昀케cult” for readers to process and thus potentially
more valued by elite readerships).</p>
      <p>
        H2. The Slope of Revelation. We expect there to be an association between revelation
and narrative time. Prior theoretical work has suggested that narratives exhibit structural
patterns [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], which has been con昀椀rmed in di昀erent ways through empirical work [
        <xref ref-type="bibr" rid="ref11 ref26 ref5">5, 27, 11</xref>
        ].
A general linear increase in surprise would support a theory of narrative investment in the
value of plot twists (or “surprise endings”), while a general linear decrease would support the
theory of narrative immersion, i.e. once novel information is introduced a narrative spends less
time introducing more information (exploration) and more time exploiting known information.
While we expect there to be an association between revelation and time, prior work does not
give clear indications of which directionality to expect.
      </p>
      <p>H3. Dependency Patterns of Revelation. Given assumptions about the value of narrative
structure to narrative meaning, we expect there to be discernible dependency patterns to the
rise and fall of revelation, with the present extent of revelation driven by that of the past in
non-trivial ways (e.g., lagged dependency). While no prior work has suggested that narrative
revelation should follow predictable dependency patterns, it could be the case that this is a
latent structural feature to narrative plotting and potentially drives reader enjoyment.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Results</title>
      <sec id="sec-4-1">
        <title>4.1. Average Revelation (H1)</title>
        <p>We quantify the average revelation by calculating, for each book, the average of the values
   over times for our non-normalized time series, standardized to account for the
considerable di昀erences in book length in the CONLIT data. To evaluate the support in our data for
the speci昀椀c hypotheses with respect to our various two-level factors of social categories, we
use two-sample -tests and report the results in the form of Cohen ’sas a measure of e昀ect
size (see Table 1). We 昀椀nd that average revelation is associated with all social variables in our
data set with the exception of point-of-view. The largest e昀ect size is reserved for the factor
of instrumentality: non-昀椀ction books engage in higher rates of average information revelation
over narrative time (see also Figure 2). Surprisingly, prestige as captured by prize-winning
novels exhibit e昀ects almost as large as instrumentality and greater than those associated with
reading level. This supports prior work that has shown signi昀椀cant stylistic di昀erences between
prizewinning and bestselling novels25[] and adds a further dimension to understand the ways
in which prestige-driven selection e昀ects prioritize distinctive stylistic traits.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. The Slope of Revelation (H2)</title>
        <p>A regression analysis was conducted to test the association of our narrative revelation variable
   with our narrative time variab le and 昀椀ctionality, as well as their interaction, here
using the normalized time series. All e昀ects were found to be statistically signi昀椀cant (regression
coe昀케cient  -values &lt;10−16). The association between the amount of narrative revelation and
narrative time was small but negative (slope coe昀케cient−0.0038, in comparison to an
intercept of 3.947), suggesting that as narratives progress, narrative revelation decreases slightly on
average in our entire corpus.</p>
        <p>Figure 2 provides the average KLD value for each narrative section by category along with
the standard error. For the purposes of visualization, we show our 50-part model as described
above as well as a 3-part model, where we divide each book’s KLD values for all windows into
three equal-sized sections and take the average. As we can see for our 50-part model, 昀椀ction
books had on average a lower intercep0t.(343 lower) and a steeper slope0.(002 steeper) than
non-昀椀ction, indicating that 昀椀ctional books have lower overall levels of narrative revelation (as
shown above in 4.1) and also a more pronounced decay in narrative revelation. We also note
that for both categories we observe increases in average KLD in the 昀椀nal 1-2 sections of the
50-part model, suggesting that a common approach to narrative closure involves introducing
increased levels of novel information toward the end (something we miss in the more
generalized 3-part model). Such distinctive structure towards the close of narratives is considerably
less pronounced however than the severity of decline of information revelation in the opening
sections of a book. Finally, we found that youth 昀椀ction was similarly associated with greater
decreases of revelation over narrative time, but that prestige and point-of-view were not.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Dependency Patterns of Revelation (H3)</title>
        <p>The results in the previous two sections pertain to the behavior of the average and slope of
narrative revelation in aggregate across books in the CONLIT data. Understanding the behavior

of revleation at the level of individual books is also of substantial interest but requires a more
nuanced analysis. The sequences{</p>
        <p>} = 2 are time series, not only of varying lengths but
also, as it turns out, of varying complexity.</p>
        <p>Exploratory analysis of the non-normalized KLD time series reveals that, while they in
general oscillate, they nevertheless do not typically have a dominant frequency (as determined
using the findfrequency function of the R Forecast package), suggesting the absence of
strictly periodic (and hence easily predicted) behavior of narrative revelation over narrative
time. Further exploration of the autocorrelation behavior of the KLD time series suggests the
use of ARIMA models. Such models are the workhorse of modern time series analysis and
consist of three components: autoregressive (AR), integrated (I), and moving average (MA). The
autoregressive component refers to behavior where the va l ue  at time  can be predicted
by earlier value 
s
 −1 , … ,</p>
        <p>− , for some lags, suggesting a regression-based
relationship with itself. The moving average component allows for this regression-based relationship
to have dependent errors, say over time scales of leng t.h And the integrated component
allows for such combined AR-MA behavior to ride on top of a polynomial trend of o r,daekrin
to the way a line with slope underlies a cloud of points within classical linear regression.</p>
        <p>We used the auto.arima function in theR Forecast package to 昀椀t a separate ARIMA model
to each KLD time series in the CONLIT data. This function includes data-driven selection of
the triple(, , )</p>
        <p>, which we take as the unit of primary interest in our analysis. Of th2e754
books analysed,59% exhibited a trend ( &gt; 0 ). Of those, 80% exhibited downward trends
(i.e. negative slopes). Non-昀椀ction books were 2x more likely to be in the positive slope class.
For our second variable3,9% of all books exhibited autoregressive behavio r&gt;(0 ), meaning
that in a strong minority of books the successive values of narrative revelation are correlated.
Within this group we see that 75% have 昀椀rst order dependencies (p=1) and another 20% have
second-order (p=2), accounting for almost all books with auto-regressive behavior. Where
there is a correlation between successive windows, it tends to reach only 1-2 windows back.
Finally, = 0 was selected for all books in the data set, indicating that these characteristics of
autoregression and/or trend can be viewed as occurring with a backdrop of white noise.</p>
        <p>As in Section 4.1, we test the distribution of our two variables of interest, trend (d) and
dependence (p) across our four social categories (Table 2). We report the percentage of books
associated with each kind of time-dependent behavior for each category. As we can see from
the breakdowns, there are only two scenarios where we observe meaningful di昀erences
between categories (&gt; 5% di昀erence among books). The 昀椀rst is at the level of dependence for
Youth books, where we see 7% fewer books exhibiting auto-regressive behavior. This suggests
that books targeting younger audiences skew in favor of less patterning and more consistency
when it comes to narrative revelation. This is underscored by the fact that this e昀ect is even
stronger for Middle School books compared to Young Adult books. The second notable
difference is similar to what we observed in Figure 2. Fiction books are more likely to exhibit a
detectable trend in the levels of revelation over narrative time, a trend which is overwhelmingly
negative (downward).</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Discussion</title>
      <p>Our work has aimed to continue prior e昀orts in modeling the temporal dimensions of narrative
communication. Narratives have a fundamental temporal dimension that impacts their
meaning. Accordingly, we have provided a novel method for capturing the dissemination of new
information over narrative time as well as highlighted the utility of well-established statistical
methods for capturing temporal relationships in time-series data. Our hope is that these
frameworks can be applied towards the further study of computational narrative understanding to
deepen our knowledge about the typicalities and particularities of human storytelling.</p>
      <p>Our models support prior wor2k9,[ 26] in showing how 昀椀ctional narratives exhibit
signi昀椀cantly more investment in patterns of narrative exploitation than narrative exploration. Fiction
tends to engage in lower levels of narrative revelation overall and those levels decline more
precipitously over narrative time. Fictional narratives invest more heavily in immersing readers
in well-known information rather than continuously introducing novel information, an e昀ect
that grows stronger over the course of a book’s narrative. Both 昀椀ction and non-昀椀ction exhibit
a tendency to increase narrative revelation in the 昀椀nal closing sections of a book, suggesting a
more universal narrative tendency with regards to narrative structu5]r.e [</p>
      <p>When it comes to books targeting di昀erent reading audiences, we see that the intended age
level of audiences and the selection preferences of elite audiences do appear to e昀ect levels of
narrative revelation (and for younger audiences lower levels of temporal dependence). Authors
engage in lower overall levels of revelation when writing for younger audiences, and higher
levels when attempting to appeal to elite audiences.</p>
      <p>One major open question for this line of research is the degree to which KLD covers the
diverse ways that “narrative revelation” may instantiate itself. Changes in vocabulary
distribution over narrative time that our models capture is one way of thinking about novel information
in a narrative. But we can also imagine how new or surprising information could be encoded in
very similar language but provides a key as yet unknown insight. The revelation of a murderer
in a mystery is the most obvious example where a single name would provide very high
levels of “revelation,” but low levels of KLD. The fact that our measures are inversely associated
with audience reading levels suggest that narrative revelation as we are modeling it may be
capturing the informationload and/or narrative complexity as much as the potential cognitive
disposition of “surprise” on the part of readers. On the other hand, the trend towards increases
in late-section rises of KLD that we are seeing suggests that our models may be capturing this
idea of narrative revelation as a function of novel information as it relates to key plot points.</p>
      <p>
        Similarly, because our models condition on local revelation – where the amount of novel
information is measured with respect to an immediate prior window – we cannot know if such
late-stage increases in revelation are absolutely novel or a return to information that references
earlier parts of the narrative (performing a sense of narrative “closure”). Future models could
explore the extent of revelation with respect to larger windows of text or even the entire text,
in essence capturing the absolute novelty of any new passage with respect to what a book has
divulged up to that point. Such work would also open the door to questions of non-linearity,
as when a passage refers back to a distant prior passage and continues the narrative a昀琀er some
interlude 2[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. We thus see a key avenue for future work to focus on validating and making
precise what aspects of narrative revelation KLD captures and what other kinds and structures
of revelation over narrative time are possible.
      </p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This research was generously supported by the Social Sciences and Humanities Research
Council of Canada (895-2013-1011).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>Bal</surname>
          </string-name>
          .Narratology:
          <article-title>Introduction to the Theory of Narrative</article-title>
          . University of Toronto Press,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A. T.</given-names>
            <surname>Barron</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. L.</given-names>
            <surname>Spang</surname>
          </string-name>
          , and
          <string-name>
            <surname>S. DeDeo.</surname>
          </string-name>
          “Individuals, Institutions, and
          <article-title>Innovation in the Debates of the French Revolution”</article-title>
          .
          <source>InP:roceedings of the National Academy of Sciences 115.18</source>
          (
          <year>2018</year>
          ), pp.
          <fpage>4607</fpage>
          -
          <lpage>4612</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>J.</given-names>
            <surname>Bermejo-Berros</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lopez-Diez</surname>
          </string-name>
          , and
          <string-name>
            <surname>M. A. G.</surname>
          </string-name>
          <article-title>Marńteız. “Inducing Narrative Tension in the Viewer through Suspense, Surprise, and Curiosity”</article-title>
          .
          <source>IPno:etics 93</source>
          (
          <year>2022</year>
          ), p.
          <fpage>101664</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Bizzoni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Degaetano-Ortlieb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Fankhauser</surname>
          </string-name>
          , and
          <string-name>
            <surname>E. Teich.</surname>
          </string-name>
          “
          <article-title>Linguistic Variation and Change in 250 years of English Scienti昀椀c Writing: A Data-Driven Approach”</article-title>
          .
          <source>InF:rontiers in Arti昀椀cial Intelligence</source>
          <volume>3</volume>
          (
          <year>2020</year>
          ), p.
          <fpage>73</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>R. L.</given-names>
            <surname>Boyd</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. G.</given-names>
            <surname>Blackburn</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J. W.</given-names>
            <surname>Pennebaker</surname>
          </string-name>
          . “
          <article-title>The Narrative Arc: Revealing Core Narrative Structures through Text Analysis”</article-title>
          .
          <source>ISnc:ience Advances 6.32</source>
          (
          <year>2020</year>
          ),
          <year>eaba2196</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>W. F.</given-names>
            <surname>Brewer</surname>
          </string-name>
          and
          <string-name>
            <given-names>E. H.</given-names>
            <surname>Lichtenstein</surname>
          </string-name>
          . “
          <article-title>Stories are to Entertain: A Structural-A昀ect Theory of Stories”</article-title>
          .
          <source>In:Journal of Pragmatics 6</source>
          .
          <fpage>5</fpage>
          -
          <lpage>6</lpage>
          (
          <year>1982</year>
          ), pp.
          <fpage>473</fpage>
          -
          <lpage>486</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>P. J.</given-names>
            <surname>Brockwell</surname>
          </string-name>
          and
          <string-name>
            <surname>R. A</surname>
          </string-name>
          . DavisT.
          <source>ime Series: Theory and Methods</source>
          . Springer science &amp; business media,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>I.</given-names>
            <surname>Calvino</surname>
          </string-name>
          .
          <article-title>Six Memos for the Next Millennium</article-title>
          . Harvard University Press,
          <year>1988</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>K. K.</given-names>
            <surname>Chang</surname>
          </string-name>
          and
          <string-name>
            <surname>S. DeDeo.</surname>
          </string-name>
          “
          <article-title>Divergence and the Complexity of Di昀erence in Text and Culture”</article-title>
          .
          <source>In:Journal of Cultural Analytics 5.2</source>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>S.</given-names>
            <surname>Degaetano-Ortlieb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Kermes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Khamis</surname>
          </string-name>
          , and
          <string-name>
            <surname>E. Teich.</surname>
          </string-name>
          “
          <article-title>An Information-Theoretic Approach to Modeling Diachronic Change in Scienti昀椀c English”</article-title>
          .
          <article-title>InF:rom Data to Evidence in English Language Research</article-title>
          . Brill,
          <year>2018</year>
          , pp.
          <fpage>258</fpage>
          -
          <lpage>281</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>K.</given-names>
            <surname>Elkins</surname>
          </string-name>
          .
          <article-title>The Shapes of Stories: Sentiment Analysis for Narrative</article-title>
          . Cambridge University Press,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>G.</given-names>
            <surname>Freytag</surname>
          </string-name>
          .
          <article-title>Technique of the Drama: An Exposition of Dramatic Composition and Art</article-title>
          . S. Griggs,
          <year>1895</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>G.</given-names>
            <surname>Genette. Narrative Discourse</surname>
          </string-name>
          :
          <article-title>An Essay in Method</article-title>
          . Vol.
          <volume>3</volume>
          . Cornell University Press,
          <year>1983</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>G.</given-names>
            <surname>Genette</surname>
          </string-name>
          .
          <source>Paratexts: Thresholds of Interpretation. 20</source>
          . Cambridge University Press,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>D.</given-names>
            <surname>Herman</surname>
          </string-name>
          . Basic Elements of Narrative. John Wiley &amp; Sons,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>L.</given-names>
            <surname>Itti</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Baldi</surname>
          </string-name>
          . “
          <article-title>Bayesian Surprise Attracts Human Attention”</article-title>
          .
          <source>InV:ision Research 49.10</source>
          (
          <year>2009</year>
          ), pp.
          <fpage>1295</fpage>
          -
          <lpage>1306</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>W.</given-names>
            <surname>Labov</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Waletzky</surname>
          </string-name>
          . “
          <article-title>Narrative Analysis: Oral Versions of Personal Experience</article-title>
          .” In: (
          <year>1967</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>R.</given-names>
            <surname>Levy</surname>
          </string-name>
          . “
          <article-title>Expectation-Based Syntactic Comprehension”</article-title>
          .
          <source>In:Cognition 106.3</source>
          (
          <issue>2008</issue>
          ), pp.
          <fpage>1126</fpage>
          -
          <lpage>1177</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>R.</given-names>
            <surname>Levy</surname>
          </string-name>
          . “
          <article-title>Memory and Surprisal in Human Sentence Comprehension”</article-title>
          .
          <source>ISne:ntence Processing</source>
          <volume>78</volume>
          (
          <year>2013</year>
          ), pp.
          <fpage>142</fpage>
          -
          <lpage>195</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>D.</given-names>
            <surname>Liddle</surname>
          </string-name>
          . “
          <article-title>Could Fiction Have an Information History? Statistical Probability and the Rise of the Novel”</article-title>
          .
          <source>In:Journal of Cultural Analytics 4.2</source>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>L.</given-names>
            <surname>McGrath</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Higgins</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Hintze</surname>
          </string-name>
          . “Measuring Modernist Novelty”.
          <source>InJo:urnal of Cultural Analytics 3.1</source>
          (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>J.</given-names>
            <surname>Ouyang</surname>
          </string-name>
          and
          <string-name>
            <given-names>K.</given-names>
            <surname>McKeown</surname>
          </string-name>
          . “
          <article-title>Modeling Reportable Events as Turning Points in Narrative”</article-title>
          .
          <source>In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing</source>
          . Lisbon, Portugal: Association for Computational Linguistics,
          <year>2015</year>
          , pp.
          <fpage>2149</fpage>
          -
          <lpage>2158</lpage>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>D15</fpage>
          -1257. url: https://www.aclweb.org/anthology/D15-125.7 [23] [24] [25] [26]
          <string-name>
            <given-names>A.</given-names>
            <surname>Piper</surname>
          </string-name>
          . “Novel Devotions: Conversional Reading, Computational Modeling,
          <article-title>and the Modern Novel”</article-title>
          .
          <source>In:New Literary History 46.1</source>
          (
          <issue>2015</issue>
          ), pp.
          <fpage>63</fpage>
          -
          <lpage>98</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <given-names>A.</given-names>
            <surname>Piper</surname>
          </string-name>
          . “
          <article-title>The CONLIT Dataset of Contemporary Literature”</article-title>
          .
          <source>InJo:urnal of Open Humanities Data</source>
          <volume>8</volume>
          (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <given-names>A.</given-names>
            <surname>Piper</surname>
          </string-name>
          and
          <string-name>
            <surname>E. Portelance.</surname>
          </string-name>
          “
          <article-title>How Cultural Capital Works: Prizewinning Novels, Bestsellers, and the Time of Reading”</article-title>
          .
          <source>InP:ost45 10</source>
          (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <given-names>A.</given-names>
            <surname>Piper</surname>
          </string-name>
          and
          <string-name>
            <given-names>O.</given-names>
            <surname>Toubia</surname>
          </string-name>
          .
          <article-title>“A Quantitative Study of Non-linearity in Storytelling”P</article-title>
          .Ione:tics
          <volume>98</volume>
          (
          <year>2023</year>
          ), p.
          <fpage>101793</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>A. J.</given-names>
            <surname>Reagan</surname>
          </string-name>
          , L. Mitchell,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kiley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. M.</given-names>
            <surname>Danforth</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P. S.</given-names>
            <surname>Dodds</surname>
          </string-name>
          . “
          <article-title>The Emotional Arcs of Stories are Dominated by Six Basic Shapes”</article-title>
          .
          <source>InE: PJ Data Science 5.1</source>
          (
          <issue>2016</issue>
          ), pp.
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>P.</given-names>
            <surname>Ricoeur</surname>
          </string-name>
          .
          <source>Time and Narrative</source>
          , Volume
          <volume>1</volume>
          . University of Chicago Press,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>M.</given-names>
            <surname>Sap</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Jafarpour</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Choi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. A.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. W.</given-names>
            <surname>Pennebaker</surname>
          </string-name>
          , and
          <string-name>
            <surname>E. Horvitz.</surname>
          </string-name>
          “
          <article-title>Quantifying the Narrative Flow of Imagined versus Autobiographical Stories”</article-title>
          .
          <source>PIrno:ceedings of the National Academy of Sciences 119.45</source>
          (
          <year>2022</year>
          ),
          <year>e2211715119</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>B. M.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          . “
          <article-title>Plot Arceology: A Vector-Space Model of Narrative Structure”2.0I1n5: IEEE International Conference on Big Data (Big Data)</article-title>
          .
          <source>Ieee</source>
          .
          <year>2015</year>
          , pp.
          <fpage>1667</fpage>
          -
          <lpage>1672</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>M.</given-names>
            <surname>Sternberg</surname>
          </string-name>
          . “
          <article-title>Telling in Time (I): Chronology and Narrative Theory”</article-title>
          .
          <source>PIone:tics Today 11.4</source>
          (
          <issue>1990</issue>
          ), pp.
          <fpage>901</fpage>
          -
          <lpage>948</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>M.</given-names>
            <surname>Sternberg</surname>
          </string-name>
          . “
          <article-title>Telling in Time (II): Chronology</article-title>
          , Teleology, Narrativity”P.oIent:
          <source>ics Today 13.3</source>
          (
          <issue>1992</issue>
          ), pp.
          <fpage>463</fpage>
          -
          <lpage>541</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>W. H.</given-names>
            <surname>Thompson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Wojtowicz</surname>
          </string-name>
          , and
          <string-name>
            <surname>S. DeDeo.</surname>
          </string-name>
          “
          <article-title>Lévy Flights of the Collective Imagination”</article-title>
          . In: arXiv preprint arXiv:
          <year>1812</year>
          .
          <volume>04013</volume>
          (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>S. P.</given-names>
            <surname>Veissière</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Constant</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Ramstead</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. J.</given-names>
            <surname>Friston</surname>
          </string-name>
          , and
          <string-name>
            <given-names>L. J.</given-names>
            <surname>Kirmayer</surname>
          </string-name>
          . “
          <article-title>Thinking Through Other Minds: A Variational Approach to Cognition and Culture”B.Ienh:avioral and</article-title>
          <source>Brain Sciences</source>
          <volume>43</volume>
          (
          <year>2020</year>
          ),
          <year>e90</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>