<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Towards a Big Data Platform for News Angles?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Marc Gallofre Ocan~a</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lars Nyre</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andreas L. Opdahl</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bj rnar Tessem</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Christoph Trattner</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Csaba Veres</string-name>
          <email>Csaba.Veresg@uib.no</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Dept. of Information Science and Media Studies, University of Bergen</institution>
          ,
          <country country="NO">Norway</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Marc.Gallofre</institution>
          ,
          <addr-line>Lars.Nyre,Andreas.Opdahl,Bjornar.Tessem</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>Finding good angles on news events is a central journalistic and editorial skill. As news work becomes increasingly computer-assisted and big-data based, journalistic tools therefore need to become better able to support news angles too. This paper outlines a big-data platform that is able to suggest appropriate angles on news events to journalists. We rst clarify and discuss the central characteristics of news angles. We then proceed to outline a big-data architecture that can propose news angles. Important areas for further work include: representing news angles formally; identifying interesting and unexpected angles on unfolding events; and designing a big-data architecture that works on a global scale.</p>
      </abstract>
      <kwd-group>
        <kwd>Big data Journalistic tools News Semantic technologies</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Journalistic work is becoming increasingly reliant on computers and the
internet [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. Miroshnichenko [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ] argues strongly for arti cial intelligence (AI) in
journalism and points to four areas of impact: data mining, topic selection,
commentary moderation, and news writing. Journalistic robots developed by
commercial companies such as Narrative Science and Automated Insights can
already generate news stories in areas like nance and sports automatically [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ].
According to [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ], Automated Insight's Wordsmith tool wrote and published 1.5
billion news stories in 2016 alone, possibly more than all the human journalists
in the world combined.
      </p>
      <p>
        These developments in AI are driven in part by the availability of big and
open data sources that are relevant for journalism. For example, researchers have
investigated how news events can be extracted from big-data sources such as
Tweets [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] and other texts [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Maiden et al. [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] propose the INJECT tool to
support journalistic creativity during the early phases of news work. Their tool
suggests relevant news stories to trigger new ideas for story angles more quickly
and e ciently, and it has been tested in Norwegian and German newspapers.
      </p>
      <p>
        Researchers have also used semantic technologies, such as RDF and OWL [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ],
to make big and open data sources more readily available for journalistic
purposes [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ] | and for journalistic AI tools [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Fernandez et al. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] propose an
ontology for streamlining news production and distribution processes. Heravi
et al. [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] advocate social semantic journalism, which uses natural-language
processing (NLP) and semantic metadata together to detect news events from
socially-generated big data, verify information and its sources, identify
eyewitnesses, and contextualise news events and their coverage. Leban et al. [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] present
a platform that collects news messages and lifts them into a semantic knowledge
graph (in RDF) in order to detect and describe news events in real time. In
collaboration with Wolftech AB, a software company that delivers newsroom
systems to the international market, our research group has developed News
Hunter, an architecture and proof-of-concept prototype that supports
journalists and other news professionals by building and mining semantic knowledge
graphs that represent news-related information [
        <xref ref-type="bibr" rid="ref27 ref3">27,3</xref>
        ] (see Section 3).
      </p>
      <p>
        All newsworthy events have remarkable qualities whether or not journalists
are aware of them. Certain events are so remarkable that no e ort is needed
to nd the best angle, whereas other events need to be probed, explored, and
criticised to identify an angle that will interest the readers (or listeners, viewers).
Finding good angles on news events thus resembles topic selection [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ], but
also is technique for presenting news stories in interesting ways. The task has
traditionally been the responsibility of professional journalists [30, p. 115] and is
considered a journalistic \trick-of-the-trade". It is covered in most introductory
textbooks, but appears to have been little theorised in the research literature.
      </p>
      <p>As news work becomes increasingly computer-assisted and big-data based,
journalistic tools must become better able to identify and propose suitable angles
too. This paper therefore investigates whether and how our News Hunter
architecture and tool can be evolved to handle big data and extended to provide
support for news angles. We ask: Which characteristics of news angles need to
be captured and represented for them to be supported by journalistic tools? and
What are the open research issues related to creating a big-data platform that
supports news angles? Our aim is not to automate, but to support: we want to
aid journalists by detecting new events and by suggesting newsworthy angles on
them, along with relevant background information.</p>
      <p>To investigate these questions, the rest of the paper is organised as follows:
Section 2 rst clari es and discusses the central characteristics of news angles
and related terms. Section 3 proceeds to describe News Hunter, our evolving
big-data architecture and tool for journalistic work. Finally, Section 4 concludes
the paper by reviewing open research issues.</p>
      <p>Event: Football team A beats team B 2{0 in city C on date D.
Impact: \bHriisntkoroifcablalyn kimruppotrctya."nt team B is now relegated and on the
In uence: \vTiohleenrceesuilntsthofeitrehamomAe tcoowrrne.l"ate with civil unrest and domestic
Con ict: \Coach A publicly insults rival coach B!"
Con ict: \Supporters of these two teams have been ghting in the past."
Recency: \Join our feed for live results."
Actionability: \Join our newspaper's campaign to get rid of coach B!"
Proximity: \Goalkeeper B grew up down the street from our editorial o ce."
Milestone: \t3h8e mseirniuestetsoinptloaythais10m0a0tcmhi,ntuetaemswBitwhinllobpeetnhaeltyrastgatienasmt tehveemr.i"n
Human interest: \Left mid elder B plays in honour of his terminally ill sibling."
2</p>
    </sec>
    <sec id="sec-2">
      <title>What is a News Angle?</title>
      <p>
        Certain events are so remarkable that they are newsworthy in themselves. Other
events need to be presented in a certain way to become interesting for its
readers (listeners, viewers). Several decades ago already, Altheide [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] observed that
reporters rely on \`angles,' or story lines, which give the speci c events new
meaning", to which Shoemaker and Reese [30, p. 115] add that \[a] prede ned
story `angle,' [. . . ] provides reporters a theme around which to build a story".
They also mention \news values [that] distil what people nd interesting and
important to know about" [30, p. 106].
2.1
      </p>
      <sec id="sec-2-1">
        <title>De nition</title>
        <p>We de ne a news angle tentatively as how a journalist or other news
professional makes an event interesting for an audience. As an example, Table 1 lists
alternative angles on the same event: a football game (we will go on to analyse
the impact angle in more detail below). In addition to gaining the audience's
attention, a news angle such as these serves several additional purposes:
{ it provides a criterion for selecting events that are worth reporting;
{ it points towards additional facts to report;
{ it suggests which information sources to use; and
{ it can serve as a template for how to present the event.</p>
        <p>
          We focus more on the rst three than on the fourth. Using basic concepts from
literary theory [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], we focus more on what is told (the fabula) than how it is
told (the discourse), which together form a narrative. Hence, nding an angle
on an event is a creative but fact-based task. It takes as input a limited factual
description of an event and produces as output a richer description that contains
additional facts that are related to and augment the event and that connects the
core facts to the interests of the audience.
        </p>
        <p>
          Of course, there is no such thing as neutral factual content. Journalists and
editors continuously choose which events to report, how visible to make them,
who to interview, which other data sources to use, and how to word the nal
story | a phenomenon often referred to as news framing or slanting. Even
seemingly objective big data collected by surveillance cameras or other sensors are,
in the end, products of human choices of whether and where to place the
cameras and sensors and of how to analyse and disseminate the captured data [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ].
Yet computer-assisted journalism may in the future serve to limit | or o er
alternatives too | human framing and slanting of the news.
2.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Example: The impact angle</title>
        <p>
          Several researchers, such as [
          <xref ref-type="bibr" rid="ref30">30</xref>
          ], have listed common news angles used by
journalists. Additional lists have been provided by practitioners [
          <xref ref-type="bibr" rid="ref29 ref33">33,29</xref>
          ]. In future
work, we want to synthesise these and other reviews into a taxonomy of news
angles. As an example, this paper will discuss one of them, impact, in a little
more detail according to: how it is described in the literature, its most common
subtypes, its indicators, the data sources available to assess the indicators, and
whether and how the angle ampli es and/or is ampli ed by other angles. The
purpose is to better understand the requirements for a big-data platform that
can support this and other angles.
        </p>
        <p>
          Description The literature describes the impact angle in various ways (of which
some are perhaps angle subtypes):
{ Prominence: \The importance of a story is measured in its impact: how many
lives it a ects. Fatalities are more important than property damage." [
          <xref ref-type="bibr" rid="ref30">30</xref>
          ]
{ Disaster: \Describes the impact of negative situations (and usually either
what brought them about, how it's a ecting the new subject, or what's
being done about it)." [
          <xref ref-type="bibr" rid="ref33">33</xref>
          ]
{ An incident: \Anything that goes wrong has the potential to become
newsworthy, such as an industrial explosion, car crash or school shooting." [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ]
Types of impact Events can be impactful in several ways, including: loss of life,
physical injury, mental distress, damage to the environment, loss of property,
and damage to property, including public infrastructure. Impact can thus be
subdivided accordingly into: human impact, environmental impact, damage to
property, etc. We envisage a big-data architecture where specialised agents for
each subtype (or subsubtype etc.) continuously crawl a knowledge (RDF) graph
in search of impactful events that can trigger a variant of the impact angle. Other
agents can search for indicators of other angles, such as groups of reports that
describe the same event with very di erent sentiments (potentially a subtype of
con ict ) or events that are related to an in uential person.
Indicators and data sources Indicators of human impact are: loss of life, physical
injuries, and mental distress, which can be gleaned from analysis of small as
well as big data sets. Loss of life and physical injuries can be lifted in real-time
from the o cial social-media feeds and online hospital logs if they are available.
Otherwise, they must be synthesised from other news reports or, using
triangulation, from less trusted social-media sources. Mental distress in an area can also
be identi ed through large-scale sentiment analysis of social-media messages.
        </p>
        <p>Damaged infrastructure can be indicated by and triangulated from a range
of sources, such as surveillance cameras and other sensors, citizen reports on
social media, messages from public authorities, deviating arrival times of and
timetable changes for public transport. Environmental impact and damage to
property can be derived from many of the same sources.</p>
        <p>Estimating past impacts from archival materials can be much easier, as
authorities and open data sources maintain statistics, for example, of accidents and
disasters by type and various measures of impact.</p>
        <p>For impact types such as these to be identi able by the agents that operate
on the knowledge graph, the represented events must be continuously enriched
with additional types of information both from small-data sources like public
authorities, trusted news sources, and o cial social media accounts and from
big-data sources like social media and the Internet of Things (IoT).
Interactions High-impact events are newsworthy in themselves, and the core
facts established by the agents can be presented to the audience more or less as
is. Lower-impact events can also turn out to be interesting: either because there
are (potential) secondary consequences, such as a limited avalanche blocking
a train line during the holiday season, or because they are ampli ed through
interaction with other angles, such as proximity or in uential people: a minor
ood in a residential area can become global news if it lls the basement of a
celebrity's home.
2.3</p>
      </sec>
      <sec id="sec-2-3">
        <title>Audience and genre expectations</title>
        <p>
          A news angle is (almost always) relative to an audience: in case of the in uence
angle, di erent audiences may have widely di erent views of which people are
famous and, to a lesser extent, powerful. News angles rely on the type of events
that interest the intended audience [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]. For example, the angles and topics that
interest people who read a local newspaper context are quite di erent from those
of the international news section of BBC World. Indeed, analyses of media users
in order to better understand their preferences and habits is itself a big-data
analysis problem.
        </p>
        <p>
          News angles are also in uenced by the general characteristics of the news
market. Traditional journalism is undergoing an economic crisis due to online
news competition, and many newsrooms have had to trim their sta while
producing more news than ever [
          <xref ref-type="bibr" rid="ref31">31</xref>
          ]. This leads to variations of copy-paste journalism
and click baits. Adjustments have also been made to adapt to the online news
market [
          <xref ref-type="bibr" rid="ref11 ref5">5,11</xref>
          ]. Higher-level journalistic tools that support news angles is a
promising way of improving both quality and productiveness in a time of crisis and
hard competition.
        </p>
        <p>For each angle type and indicator, newsworthiness criteria can be established,
taking into account: the market addressed by the newspaper, the characteristics
of the audience, and the genre of the given news story. Optimising
newsworthiness criteria for di erent media forms and genres is an empirical problem that
can potentially be answered with big-data analytics, comparing factual
descriptions of past events with news criteria most prevalent in the audience of the
corresponding news reports. Multiple angles on the same event can be possible.
Sometimes, only the best one should be chosen; other times, two or more of
them could be combined to suggest a better story or to reach di erent niches of
readers (listeners, viewers).
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Towards a Big-Data Architecture for News Angles</title>
      <p>
        News Hunter is an evolving architecture and proof-of-concept prototype for
supporting journalistic work, which has been developed by our research group in
collaboration with Wolftech AB, a supplier of newsroom software systems for
the international market [
        <xref ref-type="bibr" rid="ref27 ref3">27,3</xref>
        ]. It has been designed to continually harvest news
items and social media messages from the web; analyse and represent them
semantically in a knowledge graph; classify, cluster, and label them; enrich them
with additional information from encyclopedic and other reference sources; and
present them in real time to journalists | either as tips about new events or as
background material for stories they are already working on.
3.1
      </p>
      <sec id="sec-3-1">
        <title>Current News Hunter architecture</title>
        <p>
          The current version of News Hunter comprises the following components:
{ Harvesters continuously download news texts and other relevant data items,
such as social-media messages, from the web.
{ Uploaders load harvested data into the appropriate database.
{ The TextDB stores textual data items such as news stories and social media
messages in raw form.
{ A (currently online) Translator translates other-language texts into the
canonical language, which is currently English.
{ The GraphDB represent harvested data items semantically as knowledge
graphs in RDF format. (In the GraphDB, each data item is also known as
an event, although they may not be important enough to be called news
events.)
{ The Lifter represents each text in the TextDB as a knowledge graph in
the GraphDB with the coordinated aid of several more speci c analysers:
concept extractors identify the central keywords in the text and disambiguate
their meaning. Topic analysers identify the central topics the text is about,
independent of the keywords that are used. Named-entity analysers identify
the own names of individuals, such as the people, organisations, and places
that are mentioned. Sentiment analysers identify the positive and negative
emotions in the text and its various phrases. Categorisers (or Labellers)
assess how well the text ts prede ned taxonomies, such as the IPTC News
Codes [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ].
{ An Event detector identi es bursts and other changes in the occurrence
frequencies of concepts, topics, named entities, and sentiments in a geographical
or social region.
{ A (currently limited) Enricher extends the core knowledge graphs produced
by the Lifter with additional semantic reference data retrieved from the LOD
cloud and from proprietary sources.
{ A Social networker performs basic social-network analyses on the graph
(currently limited to focussing on a nities).
{ The Editor lets journalists write up new stories, which the Lifter
continuously analyses semantically.
{ A (currently limited) Retriever uses the semantic analyses of the new story
to identify relevant background information in the GraphDB and retrieve
related stories and other texts from the TextDB.
{ The Front end (Figure 1) contains the editor and presents relevant
background information and related stories and other texts to journalists and
other news professionals.
3.2
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>Current News Hunter technologies</title>
        <p>
          The current prototype [
          <xref ref-type="bibr" rid="ref26 ref4">26,4</xref>
          ] is mainly written in Python and C# as an ASP.NET
application. Its components are interconnected through REST APIs [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] in a
Flask-based micro-service architecture.
        </p>
        <p>The Harvester component uses a Python script to collect news-related texts
(also called data items) from a variety of sources, such as Facebook, RSS, and
online newspapers. Current focus is on downloading and parsing RSS feeds into
JSON les using Feedparser. The JSON les are then stored in raw form in an
Elasticsearch TextDB and, if necessary, in English using Microsoft's Translate
API. The (English-language) JSON les are sent to a C# .NET pipeline that
analyses the texts semantically in terms of: concepts, topics, named entities,
sentiments, and categories. The lifted data is then stored in a BrightstarDB
GraphDB (or triple store), which can be queried using SPARQL though a
Microsoft LINQ .NET component. The news-related texts (data items) represented
in the knowledge graph are also clustered in order to detect new events.</p>
        <p>The di erent analysers use a variety of tools and techniques such as
TextRank, TF-IDF, SVM (support vector machines), MLP (multi-layer perceptron),
RAKE (Rapid Automatic Keyword Extraction), and DBSCAN clustering, among
others. Both SVM and MLP were implemented using the Keras Python library.
DBSCAN clustering was implemented using the Scikit-learn Python library.
TFIDFs were calculated with the Textacy and Spacy Python libraries. Sentiment
analysis was done using the AFINN Python library.</p>
        <p>
          The front-end application was written in HTML and CSS combined with
AngularJS, and it was prototyped using Sketch and Marvel. Froala was used as
text editor. An overview of News Hunter is presented in [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. For further details
on the tools, techniques, con gurations, and evaluations we have used, see [
          <xref ref-type="bibr" rid="ref26 ref4">26,4</xref>
          ].
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>Leveraging big data for news angles</title>
        <p>The current prototype does not yet scale to big data and does not support news
angles. We are therefore evolving News Hunter into a big-data architecture that
can be extended with components that support news angles.</p>
        <p>The new architecture must be open-ended in two ways: It must allow user
organisations to interface with their existing back-end tools, such as existing
semantic and other analysis services and existing text and graph databases. It
must also allow them to use open and proprietary data sources in combination.
For open data, News Hunter may provide storage and analysis as a service. But
news organisations may also want to use News Hunter to store and analyse
proprietary data, either self-produced or licensed. The architecture should therefore
be able to combine cloud and local data storage and analysis as seamlessly as
possible.</p>
        <p>The (orange/yellow) graph on top of Figure 2 illustrates some of the
highlevel reasoning steps needed to support news angles, whereas the (blue/green)
grid at the bottom shows the di erent types of information that must be dealt
with. Along the vertical axis in Figure 2, information can be either: Journalistic,
meaning that it is text written by professional journalists. Textual, meaning that
it is textual, but not written by professional journalists. Other, meaning that is
is non-textual, which currently means that it is represented as knowledge graphs
represented in RDF. Of course, future versions of our architecture may also cover
other information types than texts and knowledge graphs, such as other types
of structured data, along with images, audio, and video, introducing additional
rows in Figure 2 and requiring additional analysis and storage techniques such as
speech-to-text conversion, image/video analysis, and other types of databases.</p>
        <p>Along the horizontal time axis, the information can be either: Archival,
representing past events. Breaking, representing currently unfolding events.
Working, representing not-yet-reported events. Future, representing anticipated,
predicted, scheduled, or recurring events.</p>
        <p>Each event is represented as a small event graph, represented as RDF, with
both a core that describes the event directly and an extension that provides
context. The central resource (or node) in the core graph represents the event
itself, with related resources that result from lifting event data to semantic form.
For example, journalistic stories and other texts can be lifted using techniques
such as concept extraction, topic identi cation, named-entity recognition (of
people, places, organisations, works, etc.), and sentiment analysis. The
extension graph results from enriching the core graph with additional information
in RDF format, for example from open semantic reference data sets in the
Linked Open Data (LOD) cloud | such as DBpedia, Wikidata, GeoNames,
and LinkedGeoData | or from proprietary data sources that have been lifted
to semantic format.</p>
        <p>Event graphs will usually overlap to a large extent because they include the
same RDF resources for: people, places, and organisations; concepts, topics, and
categories; RDF types; etc. Figure 2 therefore indicates that multiple overlapping
(or otherwise similar) smaller graphs will be clustered and merged to form larger,
more detailed, and reliable graphs. Exploiting overlapping and similar event
graphs in this way is essential both for generating richer (more complete and
detailed) event descriptions and to corroborate them. In particular for social
media messages, unless the originator is known and trusted, triangulation of
information from several independent sources is essential to ensure that only
reliable event data are reported as news.</p>
      </sec>
      <sec id="sec-3-4">
        <title>A lambda architecture that supports news angles</title>
        <p>We are currently evolving News Hunter into a big-data architecture that can be
extended with components that support news angles along the lines shown in
Figure 3, on top of Apache's Kafka platform1.</p>
        <p>
          The new architecture is based on the Lambda architecture pattern [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ], which
is designed for service-oriented big-data processing and is able to analyse big data
from social media sources with satisfactory performance [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ]. An advantage of
the Lambda architecture | as opposed to the alternative Kappa architecture [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]
| is that it supports both real-time streaming analysis of all incoming data items
and batch-oriented deeper analyses (and re-analyses) of selected data items that
later turn out to be particularly interesting.
        </p>
        <p>Harvesting system The new architecture is designed for continuously gathering
news-related information from a variety of sources through the harvesting
system, which is conceived as a message publishing and subscribing system. This
pub-sub system lets News Hunter connect to a wide variety of external data
sources, from social networks via commercial news services to the Internet of
Things (IoT). It will lter and prioritise the incoming data streams and store
the raw data in a data lake. Built on top of Apache Kafka1, it provides a
scalable and parallel messaging mechanism. The harvesting system will comprise the
Harvester components in the current architecture and add a new Filter and/or
Prioritiser component.</p>
        <p>Data lake The data lake stores incoming data items in their raw form as Kafka
topics, along with their English translations. It will comprise the TextDB and
Translator components in the current architecture, and we will consider adding
more powerful big-data storage technologies as the tool evolves.
1 https://kafka.apache.org/
Knowledge graph (Semantic news) The knowledge graph contains semantic triples
that are lifted from the data lake in real time as new data items arrive. Data
lifting consists of concept extraction, topic identi cation, named-entity recognition,
sentiment analysis, categorisation/labelling, and relation extraction. The graph
will comprise the GraphDB and Lifter components in the current architecture,
and we plan introducing a Relation extractor component.</p>
        <p>News analysis The analysis layer analyses the lifted data items further, in
real time and possibly as batch. Streaming news analysis in real time takes
semantically-lifted data straight from lifting and is intended to provide
journalists with both (1) real-time updates of the stories they are working on and
(2) potentially newsworthy new events. Semantic news analysis in batch takes
semantically-lifted data stored in the knowledge graph and is intended to (3) provide
journalists with background information related to the stories they are working
on and (4) organising and enriching the knowledge graph with data from other
sources; performing social network analysis on the graph to identify super-nodes,
sub-networks, a nities; and detecting clusters of overlapping or similar events.</p>
        <p>The streaming news analysis will comprise the Event detector, Enricher, and
Social networker components in the current architecture. In order to support
news angles and other higher-level services to journalists, the analysis layer will
also explore new components such as Organisers that continuously assess and
improve the structure to the knowledge graph. Analogy reasoners that aim to
identify other less obvious but semantically deeper connections between past and
present events and stories and related background information. Anglers leverage
the semantic analyses and background information to identify, generate, and rank
candidate angles on potential news events that a journalist is already working
on or that have been detected in the knowledge graph.</p>
        <p>Service layer The service layer is in charge of making all the knowledge and
analysis results available to newsrooms through a GUI and a REST API. It is this
layer that makes the architecture and tool available to journalists and other news
professionals. It will o er dashboards and user interfaces that present potentially
relevant angles, stories, and other background information to journalists based
on their current activities and preferences. The service will comprise the Editor
and Front end components, and it will extend the Retriever.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>The paper has discussed how journalistic tools can be improved by combining
open and big data sources with the concept of news angles. This is an
important research problem, because news work is becoming increasingly
computerassisted and big-data based, creating opportunities for a new generation of tools
that provide even higher-level support for journalistic work. We have clari ed
and discussed the central characteristics of news angles and related terms and
outlined how our architecture and tool for journalistic work, News Hunter, can
be evolved and extended into a big-data platform that supports news angles.</p>
      <p>Our work is part of a research project, News Angler, that is carried out
in collaboration with Wolftech AB, a supplier of newsroom software systems
for the international market. The News Angler project has two primary goals:
(1) to improve and evolve the News Hunter architecture towards a big-data
architecture that scales to the needs of international news organisations and
(2) to extend News Hunter to support news angles. The project thus combines
a moderate-ambition research and development goal (1) with a higher-ambition
basic research goal (2). While eyeing the second, longer-term goal, the present
paper also lays out concrete steps towards the rst.</p>
      <p>Our work on news angles is only beginning, and a long line of research and
development issues remain. On the architecture level (1), are porting the News
Hunter prototype to Linux on top of Apache's big-data stack, leveraging tools
such as Kafka, Cassandra, and Spark. Most of the current components will have
to be reengineering or reimplemented as part of this e ort. Many of them should
also be extended and improved in the process (in particular the Enricher, Social
networker, and Retriever ), and some new components introduced (for example
the Filter, Relation extractor, Organiser, Analogy reasoner, and a locally-running
Translator ). In order to make our platform open-ended, we need to de ne
clearcut APIs between our components, (a) to make it easier for user organisations
to interface with their existing back-end tools, (b) to make it easier for user
organisations to use open and proprietary data sources in combination, and
(c) to make it easier for ourselves to combine and compare alternative component
implementations, such as multiple named-entity recognisers.</p>
      <p>On the news-angle level (2), we want to develop new components that
continuously analyse the input stream and knowledge graph for new events and
newsworthy angles. One component type will manage prerequisites for angles,
for example agents that identify approaching anniversaries; con icting
descriptions of an event; natural disasters and their impact; swift changes in popularity
or political power; etc. Another component type will match angles to (breaking
or historical) events, possibly combining multiple angles on the same event.</p>
      <p>
        We think our work on the News Hunter prototype and its News Angler
components can contribute to the wider research literature in several ways. Relation
extraction is an research area that is central for our project. Currently, our event
graphs tend to have a star-like structure, with a central RDF node representing
the event itself and with lots of related annotation nodes that are connected to
it, but less often to one another. Yet it is the relations between concepts, topics,
named entities, and sentiments that describe an event most precisely. Gangemi
et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] have recently proposed FRED, a library and online tool for relation
extraction that may be useful for our purposes. The emerging generation of
neural-network base compositional vector representations of word meanings [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ]
may also o er new ways to extract relations from text. Relating events by
analogy is another interesting research task, along with, e.g.: collecting and creating
taxonomies of news angles; developing a user-friendly way of reading and
writing news angles; identifying interesting and unexpected news angles; and fully
exploiting open data, in particular linked open data.
      </p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgement</title>
      <p>Early development of News Hunter was supported by NCE (Norwegian Centre of
Expertise) Media. News Angler is funded by the Norwegian Research Council's
IKTPLUSS programme as project 275872. The authors are indebted to Arne
Berven and Bjarte Djuvik N ss at Wolftech AB for fruitful discussions and to
Kamal Alipour, Ole Andreas Christensen, Kjetil Jacobsen Villanger, and Sindre
Moldeklev who made central contributions to the earlier versions of News Hunter.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Allemang</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hendler</surname>
          </string-name>
          , J.:
          <article-title>Semantic web for the working ontologist: E ective modeling in RDFS and OWL</article-title>
          .
          <string-name>
            <surname>Elsevier</surname>
          </string-name>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Altheide</surname>
            ,
            <given-names>D.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rasmussen</surname>
            ,
            <given-names>P.K.</given-names>
          </string-name>
          :
          <article-title>Becoming news: A study of two newsrooms</article-title>
          .
          <source>Sociology of Work and Occupations</source>
          <volume>3</volume>
          (
          <issue>2</issue>
          ),
          <volume>223</volume>
          {246 (May
          <year>1976</year>
          ). https://doi.org/10.1177/073088847600300206, http://journals.sagepub.com/ doi/10.1177/073088847600300206
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Berven</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Christensen</surname>
            ,
            <given-names>O.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moldeklev</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Opdahl</surname>
            ,
            <given-names>A.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Villanger</surname>
            ,
            <given-names>K.J.</given-names>
          </string-name>
          : News Hunter:
          <article-title>Building and mining knowledge graphs for newsroom systems</article-title>
          . NOKOBIT |
          <article-title>Norsk konferanse for organisasjoners bruk av informasjonsteknologi 26 (</article-title>
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Christensen</surname>
          </string-name>
          , Ole Andreas, Villanger, Kjetil Jacobsen:
          <article-title>News Hunter: A semantic news aggregator</article-title>
          .
          <source>Master's thesis, Univ. of Bergen</source>
          (
          <year>2017</year>
          ), http://hdl.handle. net/
          <year>1956</year>
          /16192
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Ekdale</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Singer</surname>
            ,
            <given-names>J.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tully</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Harmsen</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Making change: Di usion of technological, relational, and cultural innovation in the newsroom</article-title>
          .
          <source>Journalism &amp; Mass Communication Quarterly</source>
          <volume>92</volume>
          (
          <issue>4</issue>
          ),
          <volume>938</volume>
          {
          <fpage>958</fpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Fernandez</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fuentes</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sanchez</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fisteus</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          :
          <article-title>The NEWS ontology: Design and applications</article-title>
          .
          <source>Expert Systems with Applications</source>
          <volume>37</volume>
          (
          <issue>12</issue>
          ),
          <volume>8694</volume>
          {
          <fpage>8704</fpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Fielding</surname>
          </string-name>
          , R.T.:
          <article-title>Architectural styles and the design of network-based software architectures</article-title>
          .
          <source>Ph.D. thesis</source>
          , University of Californa, Irvine (
          <year>2000</year>
          ), https://www.ics. uci.edu/~fielding/pubs/dissertation/fielding_dissertation.pdf
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Gangemi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Presutti</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Reforgiato</given-names>
            <surname>Recupero</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Nuzzolese</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.G.</given-names>
            ,
            <surname>Draicchio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Mongiov</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          :
          <article-title>Semantic web machine reading with FRED</article-title>
          .
          <source>Semantic Web</source>
          <volume>8</volume>
          (
          <issue>6</issue>
          ),
          <volume>873</volume>
          {
          <fpage>893</fpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Garc</surname>
            <given-names>a</given-names>
          </string-name>
          , R.,
          <string-name>
            <surname>Perdrix</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gil</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oliva</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The semantic web as a newspaper media convergence facilitator</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          <volume>6</volume>
          (
          <issue>2</issue>
          ),
          <volume>151</volume>
          {
          <fpage>161</fpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Gervas</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Computational approaches to storytelling and creativity</article-title>
          .
          <source>AI</source>
          Magazine
          <volume>30</volume>
          (
          <issue>3</issue>
          ),
          <volume>49</volume>
          {62 (Jul
          <year>2009</year>
          ). https://doi.org/10.1609/aimag.v30i3.2250, https:// aaai.org/ojs/index.php/aimagazine/article/view/2250
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Gynnild</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Journalism innovation leads to innovation journalism: The impact of computational exploration on changing mindsets</article-title>
          .
          <source>Journalism</source>
          <volume>15</volume>
          (
          <issue>6</issue>
          ),
          <volume>713</volume>
          {
          <fpage>730</fpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Heravi</surname>
            ,
            <given-names>B.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McGinnis</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Introducing social semantic journalism</article-title>
          .
          <source>The Journal of Media Innovations</source>
          <volume>2</volume>
          (
          <issue>1</issue>
          ),
          <volume>131</volume>
          {
          <fpage>140</fpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Hogenboom</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Frasincar</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kaymak</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          , De Jong, F.:
          <article-title>An overview of event extraction from text</article-title>
          . In: Workshop on Detection, Representation, and
          <article-title>Exploitation of Events in the Semantic Web (DeRiVE</article-title>
          <year>2011</year>
          ) at Tenth International Semantic Web Conference (ISWC
          <year>2011</year>
          ). vol.
          <volume>779</volume>
          , pp.
          <volume>48</volume>
          {
          <fpage>57</fpage>
          .
          <string-name>
            <surname>Citeseer</surname>
          </string-name>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>14. International Press Telecommunications Council: Newscodes | IPTC, https:// iptc.org/standards/newscodes/</mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Jackoway</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Samet</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sankaranarayanan</surname>
          </string-name>
          , J.:
          <article-title>Identi cation of live news events using Twitter</article-title>
          .
          <source>In: Proceedings of the 3rd ACM SIGSPATIAL International Workshop on Location-Based Social Networks</source>
          . pp.
          <volume>25</volume>
          {
          <fpage>32</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Kitchin</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>The data revolution: Big data, open data, data infrastructures and their consequences</article-title>
          .
          <source>Sage</source>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Kreps</surname>
          </string-name>
          , J.:
          <article-title>Questioning the lambda architecture</article-title>
          , https://www.oreilly.com/ ideas/questioning
          <article-title>-the-lambda-architecture</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Latar</surname>
            ,
            <given-names>N.L.</given-names>
          </string-name>
          :
          <article-title>The robot journalist in the age of social physics: The end of human journalism? In: The new world of transitioned media</article-title>
          , pp.
          <volume>65</volume>
          {
          <fpage>80</fpage>
          . Springer (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Leban</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fortuna</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brank</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grobelnik</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Event Registry: Learning about world events from news</article-title>
          .
          <source>In: Proceedings of the 23rd International Conference on World Wide Web</source>
          . pp.
          <volume>107</volume>
          {
          <fpage>110</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Machill</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Beiler</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The importance of the internet for journalistic research: A multi-method study of the research performed by journalists working for daily newspapers, radio, television and online</article-title>
          .
          <source>Journalism Studies</source>
          <volume>10</volume>
          (
          <issue>2</issue>
          ),
          <volume>178</volume>
          {
          <fpage>203</fpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Maiden</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zachos</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brown</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brock</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nyre</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nygard</surname>
            <given-names>Tonheim</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Apsotolou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Evans</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.</surname>
          </string-name>
          :
          <article-title>Making the news: Digital creativity support for journalists</article-title>
          .
          <source>In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems</source>
          . p.
          <fpage>475</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Marz</surname>
          </string-name>
          , N.:
          <article-title>How to beat the CAP theorem</article-title>
          , http://nathanmarz.com/blog/ how-to
          <article-title>-beat-the-cap-theorem</article-title>
          .html
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>McNair</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>The sociology of journalism</article-title>
          . Arnold, London (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sutskever</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corrado</surname>
            ,
            <given-names>G.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dean</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Distributed representations of words and phrases and their compositionality</article-title>
          .
          <source>In: Advances in neural information processing systems</source>
          . pp.
          <volume>3111</volume>
          {
          <issue>3119</issue>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Miroshnichenko</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>AI to bypass creativity. Will robots replace journalists? (The answer is \yes"</article-title>
          ).
          <source>Information</source>
          <volume>9</volume>
          (
          <issue>7</issue>
          ) (
          <year>Jul 2018</year>
          ). https://doi.org/10.3390/info9070183, http://www.mdpi.com/2078-2489/9/7/183
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Moldeklev</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Improving usefulness and ease of use for a prototype tool for journalists</article-title>
          .
          <source>Master's thesis</source>
          , University of Bergen (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Opdahl</surname>
            ,
            <given-names>A.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Berven</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alipour</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Christensen</surname>
            ,
            <given-names>O.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Villanger</surname>
            ,
            <given-names>K.J.</given-names>
          </string-name>
          :
          <article-title>Knowledge graphs for newsroom systems</article-title>
          . NOKOBIT|
          <article-title>Norsk konferanse for organisasjoners bruk av informasjonsteknologi 24 (</article-title>
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Persico</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pescape</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Picariello</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sperl</surname>
          </string-name>
          , G.:
          <article-title>Benchmarking big data architectures for social networks data processing using public cloud platforms</article-title>
          .
          <source>Future Generation Computer Systems</source>
          <volume>89</volume>
          ,
          <fpage>98</fpage>
          {
          <fpage>109</fpage>
          (
          <year>2018</year>
          ). https://doi.org/10.1016/j.future.
          <year>2018</year>
          .
          <volume>05</volume>
          .068, http://www.sciencedirect.com/ science/article/pii/S0167739X17328303
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Phillips</surname>
          </string-name>
          , B.:
          <article-title>16 story angles that reporters relish</article-title>
          . https://www.prdaily.com/Main/ Articles/16_story_angles_that_reporters_relish_17748.
          <string-name>
            <surname>aspx</surname>
          </string-name>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>Shoemaker</surname>
            ,
            <given-names>P.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reese</surname>
          </string-name>
          , S.D.:
          <article-title>Mediating the message: Theories of in uences on mass media content (</article-title>
          <year>1995</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31. Sj vaag, H.:
          <article-title>Homogenisation or di erentiation? The e ects of consolidation in the regional newspaper market</article-title>
          .
          <source>Journalism Studies</source>
          <volume>15</volume>
          (
          <issue>5</issue>
          ),
          <volume>511</volume>
          {
          <fpage>521</fpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          32.
          <string-name>
            <surname>Troncy</surname>
          </string-name>
          , R.:
          <article-title>Bringing the IPTC news architecture into the semantic web</article-title>
          .
          <source>In: International Semantic Web Conference</source>
          . pp.
          <volume>483</volume>
          {
          <fpage>498</fpage>
          . Springer (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          33.
          <string-name>
            <surname>Upchurch</surname>
          </string-name>
          , W.:
          <article-title>Ten common news angles for media releases</article-title>
          . http: //www.streetdirectory.com/etoday/ten
          <article-title>-common-news-angles-for-mediareleases-uuofou</article-title>
          .
          <source>html</source>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>