<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta />
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>There are five steps involved in the data processing. The first step creates
an input stream by continuously monitoring a set of RSS feeds from a
wide range of news publishers. Whenever a new news item occurs, RSS
entry properties such as the title, lead text and HTML sources are
retrieved. The HTML sources are parsed and cleaned to extract a
representative body text. In the second step, natural language processing
operations such as language identification, sentence detection and
part-ofspeech tagging is applied to extract entity mentions from the textual data.
The third step uses supervised models to map entity mentions to referent
entities in the WikiData knowledge bases. These models combine textual
similarities, WikiData graph relations and entity frequencies and
cooccurrence statistics to classify the relevance of multiple referent
candidates. First Story Detection is applied in the fourth step to group
news items describing the same news story. In the fifth step this semantic
representation is indexed and made searchable. As this backend
architecture is stream based, it is able to index and promote recent news
items soon after they are discovered.</p>
      <p>WikiData is the community-created knowledge base of Wikipedia[13].
Since its public launch in 2012, the knowledge base has gathered more
than 15 millions entities, including more than 34 million statements and
over 80 million labels and descriptions in more than 350 languages[4].
Most geographical entities in WikiData provide a reference to Geonames
containing more detailed geographical properties. In the implementation
of the Smartmedia prototype, the entity information from these knowledge
bases where indexed in a Lucene3 based search index. This index makes
the entities searchable and creates a foundation for addressing entity
labels, descriptions and aliases, entity relations and geospatial properties.</p>
      <p>When a user is opening the news app on the mobile a request containing
user id, location and preferences are sent to the backend. Here, a multi
factor search query is formed to retrieve relevant news entries from the
index.
4. USER INTERFACE
A web-based and responsive user interface is developed to make the news
stream contents explorable on mobile devices. In this interface, the user is
2 http://storm.apache.org/</p>
    </sec>
    <sec id="sec-2">
      <title>3 https://lucene.apache.org/core/</title>
    </sec>
    <sec id="sec-3">
      <title>4 http://geojson.org/</title>
      <p>allowed to extract news items that are relevant to the geo special locality
context, personal interests and given point of time. These three relevance
factors are customizable and the user can select whether or not they should
influence the retrieved news items.</p>
      <p>To customize the geographical locality, the user specifies a circular
relevance region on a map. Figure 2a shows an example of such a
relevance region. By default, the relevance region is set to users current
GPS location with a 50 km radius. By moving the region or modifying the
radius, users can generate a local newspaper for any region of the world. If
the location factor is disabled, it means that the system is recommending
news from any location in the world and news that are not containing
location information.</p>
      <p>In the current Smartmedia prototype, we have predefined a handful of user
interest profiles. Each user profile contains an alias and a weighted vector
of WikiData entities. Examples of predefined profiles in the system are
stock trader, soccer fan, technology geek, etc. By selecting any of these
interest profiles, the retrieved news will be influenced and biased towards
the interest topics. When the personal interest factor is disabled, the user
retrieve a news composition which is general and without such bias.</p>
      <p>By changing the time-factor, the user is presented with a calendar where
can move in time and retrieve either recent or historic news items. When,
the time-factor is disabled the user will retrieve news solely based on the
other relevance factors (location and personal interests).</p>
      <p>By clicking on a news story, the user gets the ingress of the news story and
a list of the most salient entities for the selected news story. Figure 2c
shows the ingress and relevant WikiData entities from the news article
about Theresa May. As we can see, our news story about politics and
terror related to Syria, Theresa May, ISIL and Sky News. By hovering
these items, the user is presented with their textual WikiData description.
On figure 1c, we can see that the WikiData entity for Theresa May
contains the description “British politician”.</p>
      <p>In general, the three buttons at the bottom of the screen for location,
interest profile and time can at any time be activated and de-activated in
combinations to provide very different recommendation strategies. For
example, keeping all buttons active with default parameters means that the
system will recommend news articles that have recently takes place in the
vicinity of the reader and are consistent with her profile. A screencast
video describing the features of the system and its user interface is
available at https://vimeo.com/121835936
5. CONCLUSIONS AND FUTURE WORK
Many see the full stack of semantic web technologies as a complex
implementation of some really simple and good ideas about adding
meaning to data. There are great rewards in understanding the full stack
and what it can do, but most news organizations find great rewards by
looking into linked data in combination with traditional information
retrieval techniques.</p>
      <p>In this paper we have shown a prototype of a news recommender system
that demonstrates some of the context and geo spatial aware features
online news services can achieve by using available and open knowledge
bases and data processing and storage technologies.</p>
      <p>Future work for the Smartmedia prototype will focus on improvement on
entity linking qualities and evaluations of user needs. The user evaluations
will look into to which extent users find the ability to control their news
feed in terms of location, interest profile and time valuable and useful.
}
1:</p>
      <p>{
}
2: {
a)
type: "Point"
coordinates: [ 2]
0: -0.129948
1: 51.4958
entityId: "Q264766"
name: "Theresa May"
description: "British politician"
associations: [ ... 21]}
entityId: "Q763388"
name: "Home Office"
description: "ministerial department of the Government of the United Kingdom"
associations: [ ... 3]
shape: {</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Asikin</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Wörndl</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          <year>2014</year>
          .
          <article-title>Stories around You: Locationbased Serendipitous Recommendation of News Articles</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <source>Proceedings of 2nd International Workshop on News Recommendation and Analytics</source>
          . (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Cantador</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bellogín</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Castells</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <year>2008</year>
          .
          <article-title>News@ hand: A semantic web approach to recommending news. Adaptive hypermedia and adaptive web-based systems</article-title>
          . (
          <year>2008</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Cantador</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bellogín</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Castells</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <year>2008</year>
          .
          <article-title>Ontology-based personalised and context-aware recommendations of news items</article-title>
          .
          <source>Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology. 1</source>
          , (
          <year>2008</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Erxleben</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Günther</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Krötzsch</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>2014</year>
          .
          <article-title>Introducing Wikidata to the Linked Data Web. The Semantic Web-ISWC</article-title>
          <year>2014</year>
          .
          <article-title>(</article-title>
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Goossen</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          and
          <string-name>
            <surname>IJntema</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          <year>2011</year>
          .
          <article-title>News personalization using the CF-IDF semantic recommender</article-title>
          .
          <source>Proceedings of the International Conference on Web Intelligence, Mining and Semantics (WIMS)</source>
          .
          <article-title>(</article-title>
          <year>2011</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Gulla</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ingvaldsen</surname>
            ,
            <given-names>J.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fidjestøl</surname>
            ,
            <given-names>A.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nilsen</surname>
            ,
            <given-names>J.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haugen</surname>
            ,
            <given-names>K.R.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Su</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          <year>2013</year>
          .
          <article-title>Learning User Profiles in Mobile News Recommendation</article-title>
          .
          <source>Journal of Print and Media Technology Research</source>
          . II,
          <volume>3</volume>
          (
          <year>2013</year>
          ),
          <fpage>183</fpage>
          -
          <lpage>194</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>IJntema</surname>
          </string-name>
          , W. and
          <string-name>
            <surname>Goossen</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <year>2010</year>
          .
          <article-title>Ontology-based news recommendation</article-title>
          .
          <source>Proceedings of the 2010 EDBT/ICDT Workshops</source>
          .
          <article-title>(</article-title>
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Meguebli</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Kacimi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>2014</year>
          .
          <article-title>Building rich user profiles for personalized news recommendation</article-title>
          .
          <source>Proceedings of 2nd International Workshop on News Recommendation and Analytics</source>
          . (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Ozgobek</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gulla</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Erdur</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <year>2014</year>
          .
          <article-title>A survey on challenges and methods in news recommendation</article-title>
          .
          <source>In Proceedings of the 10th International Conference on Web Information System and Technologies (WEBIST</source>
          <year>2014</year>
          ).
          <article-title>(</article-title>
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>and Teitler</surname>
            ,
            <given-names>B.E.</given-names>
          </string-name>
          <year>2014</year>
          .
          <article-title>Reading news with maps by exploiting spatial synonyms</article-title>
          .
          <source>Communications of the ACM</source>
          .
          <volume>57</volume>
          ,
          <issue>10</issue>
          (Sep.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Tavakolifard</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gulla</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Almeroth</surname>
            ,
            <given-names>K.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ingvaldesn</surname>
            ,
            <given-names>J.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nygreen</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Berg</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <year>2013</year>
          .
          <article-title>Tailored news in the palm of your hand: a multi-perspective transparent approach to news recommendation</article-title>
          .
          <source>WWW '13 Companion Proceedings of the 22nd International Conference on World Wide Web. (May</source>
          <year>2013</year>
          ),
          <fpage>305</fpage>
          -
          <lpage>308</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Teitler</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Lieberman</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>2008</year>
          .
          <article-title>NewsStand: A new view on news</article-title>
          .
          <source>Proceedings of the 16th ACM SIGSPATIAL international conference on Advances in geographic information systems.</source>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Vrandečić</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Krötzsch</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>2014</year>
          .
          <article-title>Wikidata: a free collaborative knowledgebase</article-title>
          .
          <source>Communications of the ACM.</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>