<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Generative Primes and Sum maries for Reading in Distractive Environments</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Tilman Dingler</string-name>
          <email>t.dingler@tudelft.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Namrata Srivastava</string-name>
          <email>namrata.srivastava@monash.edu</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Shaun Wallace</string-name>
          <email>shaun.wallace@uri.edu</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Delft University of Technology</institution>
          ,
          <country country="NL">Netherlands</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Monash University</institution>
          ,
          <country country="AU">Australia</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Rhode Island</institution>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The proliferation of mobile phones has made reading on these devices increasingly common, but it has also exposed readers to a plethora of distractions, both internal (e.g., incoming SMS or phone calls) and external (e.g., navigating a footpath). Resuming a reading task after a distraction can be a daunting and time-consuming endeavour. To address this challenge, our research explores using automated text summaries and visualizations, powered by Large Language Models (LLMs), to aid readers in distractive environments. These summaries and visualizations can be triggered both before and after distractions occur, ofering a flexible approach to mitigating interruptions. This paper delves into the concept of generative primes - text summaries and visualizations presented before a distraction happens - and post hoc text summaries and visualizations. We evaluate the efectiveness of these approaches in terms of their ability to faithfully represent the content and how efectively they assist readers in comprehending and recalling the text. Large Language Models open up new possibilities for developing reading interfaces that empower readers to seamlessly handle distractions and efortlessly pick up where they left of in their reading sessions as well as provide opportunities for memory consolidation. Our research sheds light on how these advances can positively impact the reading experience in today's increasingly distractive digital landscape.</p>
      </abstract>
      <kwd-group>
        <kwd>Reading interfaces</kwd>
        <kwd>attention management</kwd>
        <kwd>priming</kwd>
        <kwd>generative AI</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>CEUR
ceur-ws.org</p>
    </sec>
    <sec id="sec-2">
      <title>1. Introduction</title>
      <p>
        The advent of mobile technology has transformed the way we engage with written content.
From e-books and digital articles to social media feeds and messaging apps, our mobile devices
have become the primary medium through which we access and consume written information.
While this shift ofers unprecedented convenience and accessibility, it has also introduced new
challenges, particularly in environments where distractions abound [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]. Reading on mobile
phones in such distractive contexts presents a unique conundrum, as the constant influx of
notifications, both internal (e.g., incoming SMS or phone calls) and external (e.g., navigating a
busy street), disrupts the immersive reading experience. These distractions not only hinder the
enjoyment of reading but also harm comprehension and memory retention [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ].
nEvelop-O
(S. Wallace)
      </p>
      <p>
        As reading is increasingly becoming an activity that is performed on the go, it is vital to
address the impact of distractive environments. Recent research highlights the prevalence of
mobile phone usage for media consumption. According to a report by the World Economic
Forum, people now spend over four hours a day reading, listening, and watching on their mobile
phones [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. This shift in media consumption, including reading habits, raises questions about
the quality of information intake and the ability to maintain focus amid the constant barrage of
distractions.
      </p>
      <p>
        The interruptions experienced while reading on mobile devices are twofold. First, there are
internal disruptions, such as incoming text messages, phone calls, or app notifications, which
divert the reader’s attention away from the text [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Second, external factors, like navigating a
crowded sidewalk, further compound the challenge of maintaining reading continuity [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. As a
result, readers often find themselves in the unenviable position of having to navigate a maze
of distractions before they can resume their reading [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. This, in turn, can lead to frustration,
diminished comprehension, and dificulties in recalling information from the text.
      </p>
      <p>
        At the heart of interruptions lies the concept of switching costs, i.e., the time and mental
resources it takes to shift focus from one activity to another (and potentially back). Workers have
been shown to struggle to shift and resume their tasks after interruptions [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. One potential
solution to this dilemma lies in integrating cues for task resumption [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. In our work, we
investigate in which ways keywords, text summaries, and visualizations can be used as cues to
allow readers to resume reading activities and consolidate comprehension and memory. Here,
we make use of the so-called priming efect , i.e., an efect from psychology where exposure
to one stimulus influences the response to a subsequent stimulus. In essence, we investigate
how textual and visual previews of passages afect reading comprehension [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] and help readers
bridge interruptions [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>In recent studies, we have further focused on whether such text primes should be administered
after an interruption occurred or beforehand (e.g., by delaying an incoming call). The design of
these cues, however, has so far been labour-intensive as interruptions can occur at arbitrary
points during text reading and cues hence need to be generated for any number of text passages.</p>
      <p>In our era, where Large Language Models (LLMs) have become increasingly sophisticated,
these summaries and visualizations can be created in an automated fashion, however, and
customized to the reader’s preferences. LLMs have significantly advanced the field of Natural
Language Processing (NLP) and have been instrumental in tasks like text summarization,
language translation, and information retrieval [11, 12]. GPT-3, for example, has demonstrated
remarkable capabilities in understanding and generating human-like text, sparking interest in
leveraging such models to enhance the reading experience [Brown et al., 2020]. These LLMs
can be harnessed to power a new generation of reading interfaces that mitigate the negative
efects of distractions. By providing readers with informative summaries and visualizations,
these interfaces have the potential to facilitate smoother transitions between reading and
interruptions.</p>
      <p>This position paper delves into the concept of generative primes, i.e., content cues in the
form of text, graphical and image visualizations that help readers preview and review text
passages and mitigate the efects of interruptions. We investigate the benefits of presenting
such primes before a distraction occurs and explore post-hoc summaries and visualizations,
which help readers re-orient themselves after being interrupted. Our research evaluates how
well these generative primes represent the content and how efectively they assist readers in
comprehending and recalling the text.</p>
      <p>Our research aims to contribute to a deeper understanding of how generative primes,
empowered by LLMs, can revolutionize how we read in distractive environments. By ofering insights
into their efectiveness and applications, we hope to pave the way for a more seamless and
enjoyable reading experience in the era of constant digital distractions.
2. The Concept of Generative Primes
The concept of ”generative primes” represents an approach to address the challenges of reading in
distractive environments. Generative primes are cues that provide readers with text summaries
and visualizations either before a distraction occurs or after it has disrupted their reading. These
elements act as cognitive aids, preparing the reader for the upcoming content or assisting in
the resumption of the reading task after an interruption. By combining textual summaries
and visualizations, generative primes create a holistic reading experience that can enhance
comprehension, recall, and engagement.</p>
      <p>
        In the past, we utilized the priming efect to facilitate text comprehension [ 13] and tested
diferent visualizations, including text highlights, structured mind maps, and image galleries [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
Primes had to be manually crafted, which limited this type of research to controlled lab studies
with selected texts. by leveraging the power of LLMs to generate automated summaries and
images from text, these generative primes can finally be scaled and deployed in the wild.
      </p>
      <p>To implement generative primes, Large Language Models (LLMs) like ChatGPT are invaluable
for generating text summaries. ChatGPT is trained to understand and generate human-like
text, making it ideal for creating concise, coherent, and informative textual overviews of longer
passages. These summaries can serve as previews or reorientation tools, ensuring that readers
stay connected to the content despite distractions.</p>
      <p>Visualizations are essential components of generative primes that complement text summaries.
Software tools like Midjourney and DALL-E have been revolutionary in this regard. Midjourney
can create visualizations through image stories, distilling complex textual information into
visually appealing and easily comprehensible narratives. Meanwhile, DALL-E can generate
innovative visualizations, such as mind maps that visually represent the structure and connections
within the text.</p>
      <p>
        The synergy of text summaries from LLMs like ChatGPT and visualizations from software like
Midjourney and DALL-E enables the creation of generative primes that cater to diverse learning
and reading styles. These primes hold the potential to revolutionize the way we interact with
written content in distractive environments, making the reading experience more engaging,
informative, and resilient to interruptions.
3. Research Plan and Outlook
In our recent work, we explored the impact of interruptions on reading comprehension and
how these efects can be mitigated using reviews (summaries of already read content) and
previews (summaries of upcoming content) [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. We conducted a series of pilot studies involving
participants reading on mobile devices and being interrupted by various tasks. The studies
aimed to determine whether presenting these summaries before or after interruptions could aid
in re-engaging with the reading material and improving comprehension.
      </p>
      <p>We found that previews shown after interruptions significantly enhanced reading
comprehension compared to not showing any summaries. Reviews also helped, but to a lesser extent
than previews. The efectiveness of these priming cues varied based on the type of reading
material and the timing of the cue presentation.</p>
      <p>Despite the efectiveness of previews in aiding comprehension, most participants expressed a
preference for reviews. They found reviews more helpful in summarizing key points and aiding
in answering comprehension questions. A significant number of participants (79.3%), however,
preferred having some form of summary (either review or preview) compared to none.</p>
      <p>These studies suggest that smart reading applications could incorporate text summaries,
especially previews after interruptions, to facilitate better comprehension and re-engagement
with reading tasks. Hence, we are now in the process of utilizing LLMs to generate text
summaries and visualizations on the fly to create text primes. Figure 2 shows the process of
creating three types of primes—text summary, image story, and a mindmap—from prompting to
Image-Priming
Generation</p>
      <p>Please write a brief and concise summary of the
following article. It should provide an overview of the
main points, key arguments, and ndings, but should
not exceed 100 words. Ensure the summary
accurately re ects the essence of the article, giving the reader
an understanding of the content without having to
read the entire piece. Remember, this summary
should serve as a preview to the main article.
&lt;story goes here. .&gt;
Read the fol owing story and identify 4 key moments
in the story. I want to generate 4 images based on the
key moments. Describe each key moment in
adjectives and verbs so that I can describe it to an il
ustrator. Al the pictures should be described in a manner
that they are independent. Replace Chan with Yui
Aragaki, a famous Japanese actor.</p>
      <p>&lt;story goes here. .&gt;
Mindmap-Priming
Generation</p>
      <p>Read the fol owing passage and create a succinct
mind map in Markdown format for XMind
compatibility, suitable for mobile phone display with a tree table
structure.
&lt;story goes here. .&gt;
promptTheprompt oimagine
/imagine
prompt
pave the way for a more seamless
and
enjoyable
reading
experience in
the</p>
      <p>digital age.</p>
    </sec>
    <sec id="sec-3">
      <title>Conclusion</title>
      <p>Our work
defines
and
highlights
the
potential
of
generative
primes, which
include
text
summaries
and
visualizations, to
mitigate
the
challenges
of reading
in
distractive
environments.</p>
      <p>By
providing
readers
with
informative
cues
before
and
after
distractions
occur, these
generative
primes
have
the
ability
to
enhance
comprehension, recall, and
engagement.</p>
      <p>Further
research
may
focus
on
the
custo
mization
options
and
evaluate
the
efectiveness
of these
primes in
diferent reading
contexts.</p>
      <p>Ultimately, we aim
to
contribute
to
the
development
of reading
interfaces
that empower
readers to
navigate
distractions
and
sea
mlessly
continue
their
reading
tasks.
We thank the participants of our studies as well as the continuous support by Adobe Research
and the team at the Documents Intelligence Lab, foremost Rajiv Jain and Jennifer Healey.
[11] OpenAI, Chatgpt: Optimizing language models for dialogue, 2020.
[12] OpenAI, Gpt-4 technical report, 2023. arXiv:2303.08774.
[13] K. Angerbauer, T. Dingler, D. Kern, A. Schmidt, Utilizing the efects of priming to
facilitate text comprehension, in: Proceedings of the 33rd Annual ACM Conference
Extended Abstracts on Human Factors in Computing Systems, CHI EA ’15,
Association for Computing Machinery, New York, NY, USA, 2015, p. 1043–1048. URL: https:
//doi.org/10.1145/2702613.2732914. doi:10.1145/2702613.2732914.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>L.</given-names>
            <surname>Leiva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Böhmer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Gehring</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Krüger</surname>
          </string-name>
          ,
          <article-title>Back to the app: The costs of mobile application interruptions</article-title>
          ,
          <source>in: Proceedings of the 14th International Conference on Human-Computer Interaction with Mobile Devices and Services</source>
          , MobileHCI '12,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery, New York, NY, USA,
          <year>2012</year>
          , p.
          <fpage>291</fpage>
          -
          <lpage>294</lpage>
          . URL: https://doi.org/10.1145/2371574. 2371617. doi:
          <volume>10</volume>
          .1145/2371574.2371617.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.</given-names>
            <surname>Oulasvirta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Tamminen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Roto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kuorelahti</surname>
          </string-name>
          ,
          <article-title>Interaction in 4-second bursts: the fragmented nature of attentional resources in mobile hci</article-title>
          ,
          <source>in: Proceedings of the SIGCHI conference on Human factors in computing systems</source>
          ,
          <year>2005</year>
          , pp.
          <fpage>919</fpage>
          -
          <lpage>928</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>C. K.</given-names>
            <surname>Foroughi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. E.</given-names>
            <surname>Werner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Barragán</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. A.</given-names>
            <surname>Boehm-Davis</surname>
          </string-name>
          ,
          <article-title>Interruptions disrupt reading comprehension</article-title>
          .,
          <source>Journal of Experimental Psychology: General</source>
          <volume>144</volume>
          (
          <year>2015</year>
          )
          <fpage>704</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S. T.</given-names>
            <surname>Iqbal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. P.</given-names>
            <surname>Bailey</surname>
          </string-name>
          ,
          <article-title>Investigating the efectiveness of mental workload as a predictor of opportune moments for interruption</article-title>
          ,
          <source>in: CHI '05 Extended Abstracts on Human Factors in Computing Systems, CHI EA '05</source>
          ,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery, New York, NY, USA,
          <year>2005</year>
          , p.
          <fpage>1489</fpage>
          -
          <lpage>1492</lpage>
          . URL: https://doi.org/10.1145/1056808.1056948. doi:
          <volume>10</volume>
          .1145/ 1056808.1056948.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>World</given-names>
            <surname>Economic</surname>
          </string-name>
          <string-name>
            <surname>Forum</surname>
          </string-name>
          ,
          <article-title>The rise and rise of media on your mobile phone - in one chart</article-title>
          , Available online at: https://www.weforum.org/agenda/2021/02/ media-consumption
          <string-name>
            <surname>-</surname>
          </string-name>
          mobile-phone/,
          <year>2021</year>
          . Accessed:
          <volume>10</volume>
          .
          <fpage>11</fpage>
          .
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>D. P.</given-names>
            <surname>Brumby</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. L.</given-names>
            <surname>Cox</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Back</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. J.</given-names>
            <surname>Gould</surname>
          </string-name>
          ,
          <article-title>Recovering from an interruption: Investigating speed- accuracy trade-ofs in task resumption behavior</article-title>
          .,
          <source>Journal of Experimental Psychology: Applied</source>
          <volume>19</volume>
          (
          <year>2013</year>
          )
          <fpage>95</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>G.</given-names>
            <surname>Mark</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. M.</given-names>
            <surname>Gonzalez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Harris</surname>
          </string-name>
          ,
          <article-title>No task left behind? examining the nature of fragmented work</article-title>
          ,
          <source>in: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, CHI '05</source>
          ,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery, New York, NY, USA,
          <year>2005</year>
          , p.
          <fpage>321</fpage>
          -
          <lpage>330</lpage>
          . URL: https://doi.org/10.1145/1054972.1055017. doi:
          <volume>10</volume>
          .1145/1054972.1055017.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>C.</given-names>
            <surname>Schneegass</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Draxler</surname>
          </string-name>
          ,
          <article-title>Designing task resumption cues for interruptions in mobile learning scenarios</article-title>
          ,
          <source>in: Technology-Augmented Perception and Cognition</source>
          , Springer,
          <year>2021</year>
          , pp.
          <fpage>125</fpage>
          -
          <lpage>181</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>T.</given-names>
            <surname>Dingler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kern</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Angerbauer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          ,
          <article-title>Text priming - efects of text visualizations on readers prior to reading</article-title>
          , in: R. Bernhaupt,
          <string-name>
            <given-names>G.</given-names>
            <surname>Dalvi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Joshi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. K.</given-names>
            <surname>Balkrishan</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. O'Neill</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          Winckler (Eds.),
          <string-name>
            <surname>Human-Computer</surname>
            <given-names>Interaction - INTERACT</given-names>
          </string-name>
          <year>2017</year>
          , Springer International Publishing, Cham,
          <year>2017</year>
          , pp.
          <fpage>345</fpage>
          -
          <lpage>365</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>N.</given-names>
            <surname>Srivastava</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Jain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Healey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Bylinskii</surname>
          </string-name>
          , T. Dingler,
          <article-title>Mitigating the efects of reading interruptions by providing reviews and previews</article-title>
          ,
          <source>in: Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems, CHI EA '21</source>
          ,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery, New York, NY, USA,
          <year>2021</year>
          . URL: https://doi.org/10.1145/3411763. 3451610. doi:
          <volume>10</volume>
          .1145/3411763.3451610.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>