<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>User Controlled News Recommendations</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jon Espen Ingvaldsen Jon Atle Gulla Özlem Özgöbek</string-name>
          <email>jonespi@idi.ntnu.no</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Norwegian University of Science and Norwegian University of Science and Department of Computer Engineering, Technology, Department of Computer Technology, Department of Computer Ege University, and Information Science, Trondheim and Information Science</institution>
          ,
          <addr-line>Trondheim Izmir</addr-line>
          ,
          <country>Norway Norway Turkey</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2008</year>
      </pub-date>
      <abstract>
        <p>The adoption of mobile devices is pushing the Internet into a more personal and context aware space. A common challenge for online news services is to deliver contents that are interesting to read. In this paper, we describe the user interface design of the SmartMedia news recommender prototype. Through deep analysis of textual news contents it is able to deliver local, recent and personalized news experiences, and the user interface is designed to give the users control over the news stream compositions. We will present its innovative user interface and the approach taken to transform raw textual data into well defined and meaning bearing entities.</p>
      </abstract>
      <kwd-group>
        <kwd>Recommender system</kwd>
        <kwd>news</kwd>
        <kwd>mobile</kwd>
        <kwd>user interfaces</kwd>
        <kwd>user control</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>The Smartmedia project1 at NTNU targets construction of context
aware news experiences based on deep understanding of text in
continuous news streams [4, 9]. The goal of the Smartmedia
project is to deliver a mobile and context aware news experience
based on deep understanding of textual contents, combining both
geo spatial exploration and context aware recommendations. The
system is designed with scalability in mind and ability to support
multiple languages.</p>
      <p>Privacy is an important aspect when engineering recommender
systems and exploitation of user interaction and context data.
When dealing with personal data and privacy, transparency tools
are tools that can provide to the concerned individual clear
visibility of aspects relevant to these data and the individual’s
privacy. The combination of transparency tools and user control
yields viable functionality to empower users to protect their
privacy [5].</p>
      <p>In the Smartmedia project, we want to build transparent news
recommender systems where the user can control gathered data
and how their news streams are composed based on geo spatial
1 http://research.idi.ntnu.no/SmartMedia
location, personal interest profile and time. When designing
userfriendly systems for mobile devices, we need to be careful about
the amount of buttons and menu items introduced. In this paper
we will describe the news recommender system prototype and its
mobile user interface where the users can control their news
stream recommendations from three toggleable buttons.</p>
    </sec>
    <sec id="sec-2">
      <title>2. IMPLEMENTATION</title>
      <p>The backend of the news recommender prototype developed is
constructed as a pipeline of operations harvesting and
transforming Rich Site Summary (RSS) entries and raw text data
into a semantic and searchable representation. The pipeline and its
operations are implemented with using Apache Storm2. This
distributed computing framework enable scalability and ability to
handle large amounts of news items from a magnitude of
publishers continuously.</p>
      <p>As shown in Figure 1, the news processing pipeline consists of
five steps. The first step creates an input stream by continuously
monitoring a large set of RSS feeds. Whenever a new news item
occurs, properties such as the title, lead text and HTML sources
are extracted. The HTML sources are parsed and cleaned to
extract a representative body text. In the second step, natural
language processing operations such as language identification,
sentence detection and part-of-speech tagging is applied to extract
entity mentions from the textual data. The third step uses
supervised models to map entity mentions to referent entities in
the WikiData3 and Geonames4 knowledge bases. These models
combine textual similarities, graph relations and entity frequency
and co-occurrence statistics to classify the relevance of multiple
referent candidates. First Story Detection (FSD) is applied in the
forth step to group news items describing the same news story. In
the fifth step this semantic representation is indexed and made
searchable. As this backend architecture is stream based, it is able
to index and promote recent news items.</p>
      <p>WikiData is the community-created knowledge base of Wikipedia
[12]. Since its public launch in 2012, the knowledge base has
gathered more than 15 millions entities, including more than 34
million statements and over 80 million labels and descriptions in
more than 350 languages [3]. Most geographical entities in
WikiData provide a reference to Geonames containing more
detailed geographical properties. In the implementation of the
Smartmedia prototype, the news and entity information including
news text, titles, publication timestamps, entity labels and</p>
      <sec id="sec-2-1">
        <title>2 http://storm.apache.org/</title>
      </sec>
      <sec id="sec-2-2">
        <title>3 https://www.wikidata.org/</title>
      </sec>
      <sec id="sec-2-3">
        <title>4 https://www.geonames.org/</title>
        <p>geospatial properties are indexed in a Lucene based search index.
This index makes the news items and their related entities
searchable and creates a foundation for detailed querying.
When a user is opening the news app on the mobile a request
containing user id, location and preferences are sent to the
backend. Here, a multi factor search query is formed to retrieve
relevant news entries from the index.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. USER INTERFACE</title>
      <p>A web-based user interface is developed to make the news stream
contents explorable on mobile devices. In this interface, the user is
allowed to extract news items that are relevant to the geo special
locality context, personal interests and given point of time. These
three relevance factors are customizable and the user can select
whether or not they should influence the retrieval and ranking of
available news items.</p>
      <p>To customize the geographical locality, the user specifies a
circular relevance region on a map. Figure 2a shows an example
of such a relevance region. By default, the relevance region is set
to users current GPS location with a 50 km radius. By moving the
region or modifying the radius, users can generate a local
newspaper for any region of the world. If the location factor is
disabled, it means that the system is recommending news from
any location in the world and news that are not containing
location information.</p>
      <p>In the current Smartmedia prototype, we have predefined a
handful of user interest profiles. Examples of such profiles are
stock trader, soccer fan, technology geek, etc. Each profile
consists of a weighted concept vector, where each entry is a
WikiData entry associated with an interest score between 0 and 1.
By selecting any of these interest profiles, the retrieved news will
be influenced and biased towards the interest topics. When the
personal interest factor is disabled, the user retrieve a news
composition which is general and without such bias.</p>
      <p>To customize the time-factor, the user is presented with a calendar
where it is possible to move in time and retrieve either recent or
historic news items. When, the time-factor is disabled the user
will retrieve news solely based on the other relevance factors
(location and personal interests).</p>
      <p>Figure 2b shows an example of how news stories are presented.
Here we see one news article “Theresa May urges media restraint
in coverage of terror suspects” from the Guardian about politics
and terror, followed by another news story from BBC. The three
circular buttons on the bottom of the screen allow users to toggle
whether their locality, personal interest profile and time setting
such influence news story retrieval.</p>
      <p>By clicking on a news story, the user gets the ingress of the news
story and a list of the most salient entities for the selected news
story. Figure 1c shows the ingress and relevant WikiData entities
from the news article about Theresa May. As we can see, our
news story about politics and terror related to Syria, Theresa May,
ISIL and Sky News. By hovering these items, the user is
presented with their textual WikiData description. On figure 2c,
we can see that the WikiData entity for Theresa May contains the
description “British politician”.</p>
      <p>In general, the three buttons at the bottom of the screen for
location, interest profile and time can at any time be activated and
de-activated to provide very different recommendation strategies.
For example, keeping all buttons active with default parameters
means that the system will recommend news articles that have
recently takes place in the vicinity of the reader and are consistent
with her profile. Figure 3 describes different combinations of
recommendation factors and summarizes how the user can control
the retrieval and composition of news items.</p>
    </sec>
    <sec id="sec-4">
      <title>4. RELATED WORK</title>
      <p>People nowadays have access to more worldwide news
information than ever before. As Internet services get more
information about their users and their context, they can deliver
personal and customized contents and user experiences.
The prototype system, described in this paper, share similarities to
other academic news applications such as NewsStand [8, 10] and
News@Hand [1, 2]. Both these systems map textual news
contents to entities defined in a knowledge base.</p>
      <p>NewsStand targets geo spatial exploration of news. It is an
example application of a general framework developed to enable
people to search for information using a map query interface. It
utilize maps both to explore and find news stories and to visualize
and present single news events.</p>
      <p>News@hand combines textual features and collaborative
information to make news suggestions. It uses Semantic Web
technologies to describe the news contents and user preferences.
Both news items and user profiles are represented in terms of
concepts appearing in domain ontologies, and semantic relations
among those concepts are exploited to enrich the above
representations, and enhance recommendations.</p>
      <p>Both these NewsStand and News@Hand have user interfaces
targeting desktops and larger device screens. They both provide
user control over the retrieved set of news, either through a map
or category based navigation or preferences settings.</p>
      <p>Tran and Herder [11] have looked at the studied news event
timelines and shown that manually constructed timelines are
subjective and often missing important dates or other information.
By complementing the timelines with elements extracted
algorithmically from multiple sources, it is possible to create more
objective and argumentative timelines. However, the manual
processing and editing efforts are still needed to enhance the
communicative qualities of the timelines, and to adapt it to the
needs of the readers
Parra et al. [6, 7] presents SetFusion, a visual user-controllable
interface for hybrid recommender system. Their approach enables
users explore and control the importance of recommender
strategies using an interactive Venn diagram visualization. Their
evaluations indicate that this interface had a positive effect on the
user experience and improved users engagement. Their idea of
using the Venn diagram to explain intersections among
recommendation approaches is transferable and valuable to the
news domain.</p>
      <p>Local
Retrieve news from a geospatial area</p>
      <sec id="sec-4-1">
        <title>Personal</title>
        <p>Retrieve news matching the interest profile of
the user</p>
      </sec>
      <sec id="sec-4-2">
        <title>Temporal</title>
        <p>Retrieve news published after a given date</p>
      </sec>
      <sec id="sec-4-3">
        <title>Local and personal</title>
        <p>Retrieve news that both relate to a geospatial
area and match users interest profile.</p>
      </sec>
      <sec id="sec-4-4">
        <title>Local and temporal</title>
        <p>Retrieve news related to a given geospatial
area and published after a given date.</p>
      </sec>
      <sec id="sec-4-5">
        <title>Personal and temporal</title>
        <p>Retrieve news matching the interest profiles
of the user and published after a given date.</p>
      </sec>
      <sec id="sec-4-6">
        <title>Local, personal and temporal</title>
        <p>Retrieve news with relevance for the selected
geospatial area and interest profile, and
published after a given date.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. CONCLUSIONS AND FUTURE WORK</title>
      <p>The predefined user profiles can be replaced or used in
combination with more personal profiles trained on traced
interaction logs from the system. As users leave interaction data
behind, we can gather knowledge about what the users interests
are. However, for new users where no past interaction records
exist, we have a cold-start problem where we still benefit on
predefined stereotypes.</p>
      <p>In future work we plan to use trained personal profiles with
predefined stereotypes in combination. We will also gather user
feedback and evaluate to which extent users want to control and
customize their news presentations and study how their
requirements can be met in a mobile user interface design.
Deep understanding of textual contents together with knowledge
base structures provides a fundament for innovative and
intelligent applications. This paper has described one such
innovation from the news domain, and how its mobile user
interface allow users to control the composition of news. A
screencast video demonstrating the prototype and its user interface
is available at: http://vimeo.com/121835936</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Cantador</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          et al.
          <year>2008</year>
          .
          <article-title>Ontology-based personalised and context-aware recommendations of news items</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <source>Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology. 1</source>
          , (
          <year>2008</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Erxleben</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          et al.
          <year>2014</year>
          .
          <article-title>Introducing Wikidata to the Linked Data Web. The Semantic Web-ISWC</article-title>
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Gulla</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          et al.
          <year>2013</year>
          .
          <article-title>Learning User Profiles in Mobile News Recommendation</article-title>
          .
          <source>Journal of Print and Media Technology Research</source>
          . II,
          <volume>3</volume>
          (
          <year>2013</year>
          ),
          <fpage>183</fpage>
          -
          <lpage>194</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Hansen</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>2008</year>
          .
          <article-title>Marrying transparency tools with usercontrolled identity management</article-title>
          .
          <source>The Future of Identity in the Information Society</source>
          . (
          <year>2008</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Parra</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          et al.
          <year>2014</year>
          .
          <article-title>See what you want to see</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <source>Proceedings of the 19th international conference on Intelligent User Interfaces - IUI '14</source>
          (New York, New York, USA, Feb.
          <year>2014</year>
          ),
          <fpage>235</fpage>
          -
          <lpage>240</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Parra</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Brusilovsky</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <year>2015</year>
          .
          <article-title>User-controllable personalization: A case study with SetFusion</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Samet</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          et al.
          <year>2014</year>
          .
          <article-title>Reading news with maps by exploiting spatial synonyms</article-title>
          .
          <source>Communications of the ACM</source>
          .
          <volume>57</volume>
          ,
          <issue>10</issue>
          (Sep.
          <year>2014</year>
          ),
          <fpage>64</fpage>
          -
          <lpage>77</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Tavakolifard</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          et al.
          <year>2013</year>
          .
          <article-title>Tailored news in the palm of your hand: a multi-perspective transparent approach to news recommendation</article-title>
          .
          <source>(May</source>
          <year>2013</year>
          ),
          <fpage>305</fpage>
          -
          <lpage>308</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Teitler</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Lieberman</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>2008</year>
          .
          <article-title>NewsStand: A new view on news</article-title>
          .
          <source>Proceedings of the 16th ACM SIGSPATIAL international conference on Advances in geographic information systems</source>
          . (
          <year>2008</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Tran</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Herder</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <year>2015</year>
          .
          <article-title>Detecting Filter Bubbles in Ongoing News Stories</article-title>
          .
          <source>Extended Proc. UMAP</source>
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Vrandečić</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Krötzsch</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>2014</year>
          .
          <article-title>Wikidata: a free collaborative knowledgebase</article-title>
          .
          <source>Communications of the ACM</source>
          . (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>