<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Dataset with Enhanced Attributes</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Lucien Heitz</string-name>
          <email>heitz@ifi.uzh.ch</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nicolas Mattis</string-name>
          <email>n.m.mattis@vu.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Oana Inel</string-name>
          <email>inel@ifi.uzh.ch</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Wouter van Atteveldt</string-name>
          <email>wouter.van.atteveldt@vu.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Communication Science, Vrije Universiteit Amsterdam</institution>
          ,
          <addr-line>Amsterdam</addr-line>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Informatics, University of Zurich</institution>
          ,
          <addr-line>Zurich</addr-line>
          ,
          <country country="CH">Switzerland</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Digital Society Initiative, University of Zurich</institution>
          ,
          <addr-line>Zurich</addr-line>
          ,
          <country country="CH">Switzerland</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper, we present the Informfully Dataset with Enhanced Attributes (IDEA) for news article recommendations. The dataset consists of an open-source collection of user profiles, news articles with a high topic and outlet diversity, item recommendations, and rich user-item interactions from a field study on behavioral changes in news consumption. The records include both quantitative data from real-time session tracking as well as self-reported data from user surveys on satisfaction with news, knowledge acquisition, and personal background information. This paper outlines the data collection procedure and potential use cases of the dataset for designing normative recommender systems. It provides the documentation of all data collections together with insights into the data quality.</p>
      </abstract>
      <kwd-group>
        <kwd>choice architecture</kwd>
        <kwd>news dataset news recommender design</kwd>
        <kwd>political learning</kwd>
        <kwd>topic personalization</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>CEUR
ceur-ws.org</p>
    </sec>
    <sec id="sec-2">
      <title>1. Introduction and Background</title>
      <p>
        User experiments are vital for the evaluation and understanding of the societal impact of content
curation in the news domain [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]. However, despite voices in the community demanding more
ifeld studies [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ], empirical research remains rare [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], especially in the normative domain [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
A complicating factor in this is that there are only a handful of datasets publicly available to
the research community that share results from user experiments with news [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ]. Table 1
provides an overview of the most prominent resources available. All datasets provide user-item
interactions, mostly in combination with the corresponding news articles. Additionally, some of
the existing data collections provide information on users [
        <xref ref-type="bibr" rid="ref8 ref9">9, 8, 10</xref>
        ] and article recommendations
[10, 11]. Unfortunately, many datasets have sparsity issues [12], a lack of meaningful content
diversity [11, 13], or omit descriptions of users’ backgrounds [14, 15, 11]. Furthermore, while
previous studies found that the image as well as the visualization/display style of a news article
are crucial for predicting user engagement [16, 17], such information is not present in any of
the existing datasets. To counteract and alleviate these weaknesses, we present the Informfully
(W. v. Atteveldt)
      </p>
      <p>The Informfully Dataset with Enhanced Attributes includes news articles (text and images
via URLs, with high topic and source diversity), interactions (including enhanced attributes,
e.g., reading progress, like/dislike ratings for each article, and session data), recommendation
lists, and visual references for the item presentation. Further enhancements include timestamps
for all datapoints and the entire navigation history for session reconstruction. By providing
information on users, items (news articles), user-item interactions, and the item presentation,
our dataset enables researchers to build diferent content-based and collaborative filtering
recommender systems (RS). Complementary to the in-app user data (i.e., interaction and page
view/session data), we provide self-reported survey measures for all users.</p>
      <p>The inclusion of these datapoints is motivated by the normative thinking at the heart of
the NORMalize Workshop [18]. In doing so, our dataset enables switching from
interactiondriven thinking (i.e., an assessment of users based solely on their interaction data, cf. [19]) to
normative-driven thinking (i.e., an assessment of users based on survey data to check if the
normative goals behind the recommender design have been fulfilled).</p>
      <p>
        Looking not only at the impact of algorithms on user engagement within RS, but also the
impact on user attitudes and behavior outside of RS is critical, as the relevant values [18]
promoted through normative curation strategies might not be visible in engagement alone.
After all, the goals of normative RS are not primarily a change in user interactions, but a change
towards a given normative goal that results from these interactions (e.g., knowledge acquisition,
critical thinking, or becoming aware of important societal issues). In news recommendations,
these normative goals can foster an increase of political participation, active deliberation, and
provide a voice to underrepresented minorities [
        <xref ref-type="bibr" rid="ref2">2, 20</xref>
        ].
      </p>
      <p>By including measures such as users’ political preferences, attitudes towards algorithms,
diversity needs, and satisfaction with the news, IDEA enables researchers to explore news
engagement in relation to its antecedent efects. Since IDEA’s enhanced set of attributes
goes beyond simple user-item interactions, it opens new ways of operationalizing norms and
values in the context of RS. It helps to assess the interplay of algorithmic recommendation and
individual-level preferences in determining normatively meaningful news engagement and
content curation strategies in the news domain.</p>
    </sec>
    <sec id="sec-3">
      <title>2. Dataset Creation</title>
      <p>We conducted a pre-registered field experiment in the United Kingdom in November and
December of 2023.2 The dataset was created using the Informfully Platform [21, 22, 23], allowing
us to track participants in real-time and record all their app interactions over the course of a
two-week long field study that required participants to use the app on a daily basis. 3</p>
      <sec id="sec-3-1">
        <title>2.1. Experimental Setting</title>
        <p>The experiment for creating IDEA is based on a two-week long user study in which users were
exposed to diferent nudging and personalization conditions: To complete the study, participants
needed to 1) express interest and fill in an online Qualtrics recruitment survey, 2) download
and install the Informfully app on their phone, 3) use Informfully during two consecutive
experimental weeks that difered in their experimental manipulations (see Section 2.3 for the
details on the experimental treatment), and 4) complete two in-app surveys.</p>
        <p>The recruitment survey measured a number of relevant control variables that can afect user
behavior irrespective of the experimental manipulations.4 The in-app surveys measured recall,
subjective knowledge, and user satisfaction—all of which could have been influenced by the
experimental manipulations themselves. During the experiment, we asked participants to use
the app on a daily basis and to spend at least five minutes reading news each day. To maximize
external validity, we asked participants to use the app in the exact same way as they would
use a commercial product. Thus, participants were free to choose what to read, when to read,
and—as long as the minimum level of engagement was achieved—how long to read.</p>
        <p>Within the app, we showed users news from six diferent news outlets (incl. The BBC,
The Guardian, The Independent, i news, Sky News, and the Evening Standard) that were
automatically scraped and recommended.5 We aimed to provide a diverse range of content from
news outlets that difer in their target audience as well as the style and focus of news coverage.
2We used a moving time window for recruitment, as the onboarding was done in multiple waves. While the efective
starting date varies between subjects, the overall duration of the experiment remains the same for everyone.
3GitHub repository of the platform: https://github.com/Informfully/Platform
4We chose to measure a broad range of attitudes, preferences, and behaviors that had been either theoretically or
empirically linked to news engagement. For example, we measured users’ interest in diferent news topics, political
interest and ideology, and attitudes toward algorithmic and journalistic news curation.
5GitHub repository of the scrapers: https://github.com/Informfully/Scrapers</p>
        <p>Respondents were shown 26 articles per day, covering 13 diferent news topics (two articles
per topic).6 During the experiment, users could see the expected reading time for the articles
inside the experimental environment, which is, on average, four to five minutes. 7 As such, we
predominantly recommended articles that were suitable for brief reading sessions.</p>
        <p>The Informfully Platform was used to automatically track and collect interaction data. It is
an all-in-one research platform for content distribution. The infrastructure of this platform
includes an app for content delivery, giving researchers complete control over when and what
items are shown to the experimental participants. Informfully has been evaluated and assessed
in previous studies (for details, please see [21, 22]), with its design and item visualization refined
by information retrieval insights [24, 25]. By using Informfully, we were able to control the
presentation of all news items and make it consistent across platforms (i.e., Android and iOS
devices). Figure 2 presents screenshots of how the news feed and items were displayed to
participants inside the app. A built-in tutorial enabled people to familiarize themselves with the
news apps and its features (e.g., setting bookmarks for creating a reading list or rating articles).
The home feed in Figure 2A is automatically updated each day, allowing participants to always
have access to the most recent news articles.
6The articles presented to participants present only a small subset of the total number of articles that were scraped
for the experiment. The dataset also includes articles that were not recommended to participants. These articles
can be leveraged to, e.g., calculate outlet-specific text complexity scores and understand the various viewpoints or
stances news outlets communicate for topics of interest, among others.
7Calculations are based on the average reading speed for native adult speakers of English.</p>
      </sec>
      <sec id="sec-3-2">
        <title>2.2. Participants</title>
        <p>Participants were recruited online via a third-party marketing agency. They were paid £3 for
completing the intake survey and another £15 for finishing the complete study. If they finished
the study, they were also eligible for one of two £100 prizes from a draw. Once participants had
expressed interest in the study and completed an initial intake survey, they were given login
details for Informfully, to then download and use for the remainder of the experiment. In total,
N = 593 users filled in the intake survey and used the app at least one time (for details, please
see Figure 1). Overall, the dataset includes 199 male, 387 female, and 7 non-binary participants,
who were on average 37 (M = 37.01; SD = 11.73) years old.8</p>
        <p>Participants were rather highly educated, having on average obtained a college degree, were
fairly interested in news (M = 5.03; SD = 2.17), diversity (M = 5.97; SD = 0.76), and had high
political interest (M = 5.19; SD = 1.34). Furthermore, their attitude towards both algorithmic
(M = 3.80; SD = 1.30) and journalistic (M = 3.55; SD = 1.35) preference was similarly close to
neutral. On average, respondents were active on 11 days (M = 10.61; SD = 4.02) and opened
6 unique articles per day (M = 5.98; SD = 4.30). While user engagement varied considerably
between participants at times, we opted to include as many respondents as possible in the
dataset. This gives researchers the freedom to decide on their own whom to include and at
what cut-of points they want to filter out participants.</p>
      </sec>
      <sec id="sec-3-3">
        <title>2.3. User Groups</title>
        <p>Table 2 presents an overview of the four diferent user feeds/groups of the experiments. These
feeds difered in terms of item placement and text complexity for one of the articles. Group A
received an original environmental news article (Env. OG) in the first position of their feed and
the most popular news items from the previous day in position five. Group B received a feed
identical to Group A, with the diference that the environmental article in the top position is
rewritten (Env. RW) to be more accessible. Group C has the feed of Group A, and Group D has
the feed of Group B, with the environmental article and popular article switching places.</p>
        <p>During the first week of the experiment, the random articles in positions 2-4 and 6-26 were
the exact same across all groups. To ensure topic diversity within the recommendation list,
each of these 24 random positions was populated using two articles from each of the twelve
available topics.9 In the second week, we introduced explicit and implicit conditions for topic
personalization for a subgroup of users. Table 3 provides an overview of the diferent conditions.
Conditions 1 means participants had the same recommender algorithm as in the first week.
Condition 2 and 3 were exposed to personalization.</p>
        <p>Overall, this created 3 × 4 diferent strategies for constructing news feeds. Implicit preferences
are based on the log files, where we calculated which article topics participants spent the most
time on during the first week. To determine explicit user preferences, we ask participants after
the first week to select their most liked topic in the in-app survey.
8Since our sampling included an element of self-selection, our final sample is not representative of the UK.
9We manually mapped the topics of each outlet to a unified topic list in order to have a consistent naming convention.
The topics present in the unified topic list are: business, crime, entertainment &amp; arts, football, health, life &amp; style,
politics, science, sport, technology, UK news, and world news. Environment was an additional topic, but it never
appeared in a random position (it was limited to position 1 or 5 of the feed).</p>
        <p>If topic personalization was present, we populated positions 2-4 and 6-8 according to their
preferences.10 The remaining positions, 9-26, consisted of nine preference-based articles (three
articles per topic preference) and nine filler articles (one random article for each non-preferred
topic to ensure suficient topic diversity across the news feed). Random articles were picked
from a shared pool, with three articles for each topic.11</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>3. Documentation and Analysis</title>
      <p>The Informfully Dataset with Enhanced Attributes features 593 users together with 10, 954
news articles and a total of 34, 890 user-article interactions. Overall, the dataset consists of nine
document collections that provide detailed tracking and interaction data across two weeks. All
data is exported from the Informfully back end together with user background information
from Qualtrics. Section 3.1 provides the description of each of these collections and Section 3.2
ofers insight into the dataset quality.
10The group allocation outlined here is for users that completed the onboarding and in-app surveys on time. A
subset of users, however, delayed activating the app and/or completing the survey. This is reflected in the data
records, as their group allocation schedule varies between the first and second week.
11More details on the curation process are listed in the online codebook.</p>
      <sec id="sec-4-1">
        <title>3.1. Document Collections</title>
        <p>The dataset includes the collection of news articles (Articles) retrieved from six diferent
news outlets, reading list and favorites (Bookmarks, Favorites), all user-item interactions
(Interactions, article ratings (Ratings), the list of article recommendations (Recommendation,
in-app user surveys (Surveys), users and their survey responses (Users), and the session
navigation history (Views).12 The collections contain the following information:
Articles: Collection that holds all articles that were retrieved from six diferent news outlets
(i.e., The BBC, The Guardian, The Independent, i news, Sky News, and the Evening
Standard) and displayed to users during the study. For each article, the collection holds
the title, lead, and accompanying metadata, such as the publication outlet, author, and
image URL.13 (Total size: 10, 954 entries.)
Bookmarks &amp; Favorites: Holds users’ bookmarks in the reading list and their favorites in the
archive. (Total size: 2, 479 bookmark entries and 3, 115 favorite entries.)
Interactions: Records of each time a user has selected and opened an article. The collection
stores both the item and user ID, together with a timestamp, the reading time, and the
maximum scroll percentage of an article. (Total size: 34, 890 entries.)
Ratings: This collection records each instance where a user agreed (thumbs up) or disagreed
(thumbs down) with a statement below an article. During the study, respondents could
indicate (dis)agreement with two statements, namely “I find this article interesting” and
“I find this article easy to read.” (Total size: 28, 382 entries.)
Recommendations: Contains all article recommendations that were made for any given user
over the course of the experiment. Holds the article and user ID together with a timestamp
and list position for each recommendation. Includes 26 recommendations for each user
per day.14 (Total size: 207, 220 entries.)
Surveys: Collection that stores the weekly in-app survey items. Among others, this collection
holds the wording of and response to each survey item that respondents were shown in
the questionnaires.15 (Total size: 43, 078 entries.)
Users: Holds a range of self-reported measures from the Qualtrics intake survey and context
variables such as respondents’ experimental conditions in the experiment’s first and
second week. Data on participants’ backgrounds consists of replies to questions on: 1)
internal political eficacy, 2) political interest and position, 3) news interest and
consumption habits, 4) environmental news interests, and 5) attitude towards algorithmic content
curation and diversity. (Total size: 593 users, 14 diferent self-reported measures.)
Views: Record of the navigation throughout the entire app. Each page/mask of the app (see
Figure 2A-D) has a unique ID (e.g., home screen or article view). This collection tracks
and timestamps the transition from one page to another, allowing to reconstruct all user
sessions in their entirety. (Total size: 84, 747 entries.)
12For an in-depth technical documentation, please see the online documentation: https://informfully.readthedocs.io/
en/latest/database.html; please see the codebook for a non-technical explanation of all attributes and reliability
analyses for all survey scales: https://github.com/Informfully/Datasets/blob/main/IDEA/Codebook.pdf
13For legal reasons, the article text and image are only shared via URL references in the dataset.
14Recommendations were accessible for 24h. They were removed upon inserting the new daily batch.
15The experiment included two in-app surveys that were shown to respondents after the first and second week.</p>
      </sec>
      <sec id="sec-4-2">
        <title>3.2. Data Analysis</title>
        <p>Figure 3 presents an overview of the quantitative data on daily active users (Figure 3A), daily
user-item interactions (Figure 3B), distribution of news topics among the opened and read
articles (Figure 3C), and article length (Figure 3D).16 We see almost constant daily active users
counts around 400 (M = 388.93, SD = 77.62) and more than two thousand (M = 2326.00,
SD = 632.02) daily interactions with news articles. When looking at the recommendation lists
and the user interactions, the dataset has a sparsity of 83% and an item Gini coeficient of 0.36.
The spike in active users and daily interaction on day six of the experiment coincided with us
sending out reminders for the upcoming survey at the end of the first week.</p>
        <p>We can also see that users’ news engagement was quite varied, spanning a broad selection of
topics.17 As such, the data leaves room for analyzing user behavior across diferent groups, over
time, and in combination with changing news supply. Further analyses of the news content
itself (e.g., in the form of annotations such as valence and viewpoints) may provide additional
insights into the overall selection patterns as well as their relation to self-reported measures.
16Please note only articles with more than 500 words have been recommended to users.
17“Filler articles” is a catch-all topic for randomly selected articles that were included in the news feed. These fillers
were present if there were too few articles to meet the quotas of the curation strategy.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>4. Discussion and Limitations</title>
      <p>The Informfully Dataset with Enhanced Attributes (IDEA) is a first controlled attempt at
providing a resource that documents the entire recommendation procedure and can be used
to inform the design and development of normative-aware news recommender systems. In
the following, we highlight the applicability of the dataset and point out its limitations. But
ifrst, we want to reiterate that the sampling procedure for study participants involved an
element of self-selection. The final sample was not representative of the UK. IDEA presents the
interaction profile of a specific part of the UK population at a specific point in time. Therefore,
the generalizability of deriving engagement dynamics for users is limited.</p>
      <p>One limitation of IDEA is that participants’ news engagement might not always be completely
genuine, as they needed to fulfill certain criteria/daily quotas to be eligible for remuneration.
Using a realistic news app and providing a broad range of content has likely alleviated this issue.
However, compared to other datasets, such as the MIND [11], IDEA ultimately comes from
a controlled experiment. It presents only a partial insight into the news consumption habits
of individuals and does not track engagement with external news resources that participants
might have used parallel to participating in the experiment.</p>
      <p>IDEA is composed of more than 10,000 high-quality news articles, covering six sources from
the United Kingdom, namely, The BBC, The Guardian, The Independent, i news, Sky News,
and the Evening Standard. Due to our controlled study design, participants were exposed to
only a fraction of the news articles included in our dataset. Nevertheless, the diverse and rich
selection of the outlets allows for designing news recommender systems incorporating several
dimensions of diversity, such as source diversity as well as more normative aspects focusing on
exposure to diverse topical or political viewpoints.</p>
      <p>Finally, leveraging IDEA to develop normative RS might necessitate additional analysis steps.
For example, if researchers want to examine normative aspects such as readers’ engagement
with opposing viewpoints or minority voices, these dimensions must first be extracted and
annotated from the body of the news articles. Nonetheless, by combining granular behavioral
and self-reported data, IDEA provides ample room to examine how person- and context-specific
characteristics co-determine user engagement. These insights could eventually inform and be
translated into normative RS designs. While collecting these rich characteristics in a controlled
study come at the expense of the dataset size, to the best of our knowledge, no other news
dataset (see Table 1) provides such an extensive list of features.</p>
    </sec>
    <sec id="sec-6">
      <title>5. Conclusion</title>
      <p>News engagement is a complex phenomenon and reducing it to clicks alone is reductive [26].
IDEA supports complex user engagement analyses by recording users’ news reading behavior
together with the associated reading time, scroll percentage, information on articles’
likes/dislikes, bookmarks, favorites, and references of how articles were presented. More importantly,
as opposed to the majority of existing datasets, these analyses can be correlated with rich
user attitudes and perceptions, such as political interest and orientation, diversity values, and
preferences for algorithmic curation and journalism, among others.</p>
      <p>With its combination of behavioral and self-reported data, IDEA allows researchers to explore
the drivers and dynamics of news engagement across a diverse range of topics and news outlets
within an externally valid field experiment. It covers the efects of diferent recommendation
algorithms of four groups across three conditions per group (random vs. based on explicit or
implicit user preferences) for a total of twelve experimental conditions. Thus, despite a rather
simplistic underlying recommendation logic, we hope the Informfully Dataset with Enhanced
Attributes can become a useful resource for researchers across various disciplines, ranging from
computer science and information retrieval to communication science and journalism.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>This work was partially funded by the Digital Society Initiative (DSI) of the University of Zurich
under a grant of the DSI Excellence Program, the Graduate Campus (GRC) of the University of
Zurich under a Travel Grant, GRC grant no. 2023_Q1_TG_095), as well as by the Dutch Research
Council (NWO), NWO grant no. 406.DI.19.073; project lead: Prof. Wouter van Atteveldt.
Nemig–a bilingual news collection and knowledge graph about migration, arXiv preprint
arXiv:2309.00550 (2023).
[10] B. Kille, F. Hopfgartner, T. Brodt, T. Heintz, The plista dataset, in: Proceedings of the 2013
international news recommender systems workshop and challenge, 2013, pp. 16–23.
[11] F. Wu, Y. Qiao, J.-H. Chen, C. Wu, T. Qi, J. Lian, D. Liu, X. Xie, J. Gao, W. Wu, et al., Mind: A
large-scale dataset for news recommendation, in: Proceedings of the 58th annual meeting
of the association for computational linguistics, 2020, pp. 3597–3606.
[12] M. Singh, Scalability and sparsity issues in recommender datasets: a survey, Knowledge
and Information Systems 62 (2020) 1–43.
[13] S. Vrijenhoek, Do you mind? reflections on the mind dataset for research on diversity in
news recommendations, in: International Workshop on Algorithmic Bias in Search and
Recommendation, Springer, 2023, pp. 147–154.
[14] J. A. Gulla, L. Zhang, P. Liu, Ö. Özgöbek, X. Su, The adressa dataset for news
recommendation, in: Proceedings of the international conference on web intelligence, 2017, pp.
1042–1048.
[15] G. de Souza Pereira Moreira, F. Ferreira, A. M. Da Cunha, News session-based
recommendations using deep neural networks, in: Proceedings of the 3rd workshop on deep
learning for recommender systems, 2018, pp. 15–23.
[16] J. Beel, H. Dixon, The ‘unreasonable’efectiveness of graphical user interfaces for
recommender systems, in: Adjunct Proceedings of the 29th ACM Conference on User Modeling,
Adaptation and Personalization, 2021, pp. 22–28.
[17] L. Heitz, Classification of normative recommender systems, in: Proceedings of the First</p>
      <p>Workshop on the Normative Design and Evaluation of Recommender Systems, 2023.
[18] S. Vrijenhoek, L. Michiels, J. Kruse, A. Starke, N. Tintarev, J. Viader Guerrero, Normalize:
The first workshop on normative design and evaluation of recommender systems, in:
Proceedings of the 17th ACM Conference on Recommender Systems, 2023, pp. 1252–1254.
[19] M. Kaminskas, D. Bridge, Diversity, serendipity, novelty, and coverage: a survey and
empirical analysis of beyond-accuracy objectives in recommender systems, ACM Transactions
on Interactive Intelligent Systems (TiiS) 7 (2016) 1–42.
[20] N. Mattis, P. Masur, J. Möller, W. Van Atteveldt, Nudging towards news diversity: A
theoretical framework for facilitating diverse news consumption through recommender
design, New Media &amp; Society (2022) 26. doi:0.1177/14614448221104413.
[21] L. Heitz, J. A. Lischka, A. Birrer, B. Paudel, S. Tolmeijer, L. Laugwitz, A. Bernstein, Benefits
of diverse news recommendations for democracy: A user study, Digital Journalism 10
(2022) 1710–1730.
[22] L. Heitz, J. A. Lischka, R. Abdullah, L. Laugwitz, H. Meyer, A. Bernstein, Deliberative
diversity for news recommendations: Operationalization and experimental user study, in:
Proceedings of the 17th ACM Conference on Recommender Systems, 2023, pp. 813–819.
[23] L. Heitz, J. A. Croci, M. Sachdeva, A. Bernstein, Informfully - research platform for
reproducible user studies, in: Proceedings of the 18th ACM Conference on Recommender
Systems, 2024.
[24] L. Rossetto, M. Baumgartner, R. Gasser, L. Heitz, R. Wang, A. Bernstein, Exploring
graphquerying approaches in lifegraph, in: Proceedings of the 4th Annual on Lifelog Search
Challenge, Springer, 2021, pp. 7–10.
[25] L. Rossetto, M. Baumgartner, N. Ashena, F. Ruosch, R. Pernisch, L. Heitz, A. Bernstein,
Videograph–towards using knowledge graphs for interactive video retrieval, in:
International Conference on Multimedia Modeling, Springer, 2021, pp. 417–422.
[26] T. Groot Kormelink, I. Costera Meijer, A user perspective on time spent: Temporal
experiences of everyday news use, Journalism Studies 21 (2020) 271–286.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>D.</given-names>
            <surname>Jannach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zanker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ge</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gröning</surname>
          </string-name>
          ,
          <article-title>Recommender systems in computer science and information systems-a landscape of research</article-title>
          , in: E-Commerce and Web Technologies: 13th International Conference, EC-Web
          <year>2012</year>
          , Vienna, Austria, September 4-
          <issue>5</issue>
          ,
          <year>2012</year>
          . Proceedings 13, Springer,
          <year>2012</year>
          , pp.
          <fpage>76</fpage>
          -
          <lpage>87</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>N.</given-names>
            <surname>Helberger</surname>
          </string-name>
          ,
          <article-title>On the democratic role of news recommenders</article-title>
          , in: Algorithms, Automation, and
          <string-name>
            <surname>News</surname>
          </string-name>
          , Routledge,
          <year>2021</year>
          , pp.
          <fpage>14</fpage>
          -
          <lpage>33</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Bernstein</surname>
          </string-name>
          ,
          <string-name>
            <surname>C. De Vreese</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Helberger</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          <string-name>
            <surname>Schulz</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Zweig</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Heitz</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Tolmeijer</surname>
          </string-name>
          , et al.,
          <article-title>Diversity in news recommendation</article-title>
          ,
          <source>Dagstuhl Manifestos</source>
          <volume>9</volume>
          (
          <year>2021</year>
          )
          <fpage>43</fpage>
          -
          <lpage>61</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>H.</given-names>
            <surname>Sargeant</surname>
          </string-name>
          , E. Pirkova,
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Kettemann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Wisniak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Scheinin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Bevensee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Pentney</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Woods</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Heitz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Kostic</surname>
          </string-name>
          , et al.,
          <article-title>Spotlight on artificial intelligence and freedom of expression: A policy manual, Organization for Security and Co-operation in Europe (</article-title>
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>D.</given-names>
            <surname>Jannach</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          <article-title>Bauer, Escaping the mcnamara fallacy: Towards more impactful recommender systems research</article-title>
          ,
          <source>Ai Magazine</source>
          <volume>41</volume>
          (
          <year>2020</year>
          )
          <fpage>79</fpage>
          -
          <lpage>95</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>L.</given-names>
            <surname>Heitz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Inel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Vrijenhoek</surname>
          </string-name>
          ,
          <article-title>Recommendations for the recommenders: Reflections on prioritizing diversity in the recsys challenge</article-title>
          ,
          <source>in: Proceedings of the Recommender Systems Challenge</source>
          <year>2024</year>
          ,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>C.</given-names>
            <surname>Treuillier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Castagnos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Dufraisse</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Brun</surname>
          </string-name>
          ,
          <article-title>Being diverse is not enough: Rethinking diversity evaluation to meet challenges of news recommender systems</article-title>
          ,
          <source>in: Adjunct Proceedings of the 30th ACM Conference on User Modeling, Adaptation and Personalization</source>
          ,
          <year>2022</year>
          , pp.
          <fpage>222</fpage>
          -
          <lpage>233</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Lucas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. F. G.</given-names>
            <surname>da Silva</surname>
          </string-name>
          , L. F. de Figueiredo,
          <article-title>Npr: a news portal recommendations dataset</article-title>
          ,
          <source>in: Proceedings of the First Workshop on the Normative Design and Evaluation of Recommender Systems</source>
          ,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A.</given-names>
            <surname>Iana</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Alam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Grote</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Nikolajevic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Ludwig</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Müller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Weinhardt</surname>
          </string-name>
          , H. Paulheim,
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>