User Controlled News Recommendations Jon Espen Ingvaldsen Jon Atle Gulla Özlem Özgöbek Norwegian University of Science and Norwegian University of Science and Department of Computer Engineering, Technology, Department of Computer Technology, Department of Computer Ege University, and Information Science, Trondheim and Information Science, Trondheim Izmir, Norway Norway Turkey jonespi@idi.ntnu.no jag@idi.ntnu.no ozlem.ozgobek@ege.edu.tr ABSTRACT location, personal interest profile and time. When designing user- The adoption of mobile devices is pushing the Internet into a more friendly systems for mobile devices, we need to be careful about personal and context aware space. A common challenge for online the amount of buttons and menu items introduced. In this paper news services is to deliver contents that are interesting to read. In we will describe the news recommender system prototype and its this paper, we describe the user interface design of the mobile user interface where the users can control their news SmartMedia news recommender prototype. Through deep analysis stream recommendations from three toggleable buttons. of textual news contents it is able to deliver local, recent and personalized news experiences, and the user interface is designed to give the users control over the news stream compositions. We 2. IMPLEMENTATION will present its innovative user interface and the approach taken to The backend of the news recommender prototype developed is transform raw textual data into well defined and meaning bearing constructed as a pipeline of operations harvesting and entities. transforming Rich Site Summary (RSS) entries and raw text data into a semantic and searchable representation. The pipeline and its Categories and Subject Descriptors operations are implemented with using Apache Storm2. This H.4.7 [Information Systems Applications] Communications distributed computing framework enable scalability and ability to Applications – Information browsers handle large amounts of news items from a magnitude of publishers continuously. General Terms As shown in Figure 1, the news processing pipeline consists of Algorithms, Design, Experimentation, Human Factors. five steps. The first step creates an input stream by continuously monitoring a large set of RSS feeds. Whenever a new news item Keywords occurs, properties such as the title, lead text and HTML sources Recommender system, news, mobile, user interfaces, user control are extracted. The HTML sources are parsed and cleaned to extract a representative body text. In the second step, natural . language processing operations such as language identification, sentence detection and part-of-speech tagging is applied to extract entity mentions from the textual data. The third step uses 1. INTRODUCTION supervised models to map entity mentions to referent entities in The Smartmedia project1 at NTNU targets construction of context the WikiData3 and Geonames4 knowledge bases. These models aware news experiences based on deep understanding of text in combine textual similarities, graph relations and entity frequency continuous news streams [4, 9]. The goal of the Smartmedia and co-occurrence statistics to classify the relevance of multiple project is to deliver a mobile and context aware news experience referent candidates. First Story Detection (FSD) is applied in the based on deep understanding of textual contents, combining both forth step to group news items describing the same news story. In geo spatial exploration and context aware recommendations. The the fifth step this semantic representation is indexed and made system is designed with scalability in mind and ability to support searchable. As this backend architecture is stream based, it is able multiple languages. to index and promote recent news items. Privacy is an important aspect when engineering recommender WikiData is the community-created knowledge base of Wikipedia systems and exploitation of user interaction and context data. [12]. Since its public launch in 2012, the knowledge base has When dealing with personal data and privacy, transparency tools gathered more than 15 millions entities, including more than 34 are tools that can provide to the concerned individual clear million statements and over 80 million labels and descriptions in visibility of aspects relevant to these data and the individual’s more than 350 languages [3]. Most geographical entities in privacy. The combination of transparency tools and user control WikiData provide a reference to Geonames containing more yields viable functionality to empower users to protect their detailed geographical properties. In the implementation of the privacy [5]. Smartmedia prototype, the news and entity information including In the Smartmedia project, we want to build transparent news news text, titles, publication timestamps, entity labels and recommender systems where the user can control gathered data and how their news streams are composed based on geo spatial 2 http://storm.apache.org/ 3 https://www.wikidata.org/ 1 4 http://research.idi.ntnu.no/SmartMedia https://www.geonames.org/ geospatial properties are indexed in a Lucene based search index. personal interest factor is disabled, the user retrieve a news This index makes the news items and their related entities composition which is general and without such bias. searchable and creates a foundation for detailed querying. To customize the time-factor, the user is presented with a calendar When a user is opening the news app on the mobile a request where it is possible to move in time and retrieve either recent or containing user id, location and preferences are sent to the historic news items. When, the time-factor is disabled the user backend. Here, a multi factor search query is formed to retrieve will retrieve news solely based on the other relevance factors relevant news entries from the index. (location and personal interests). Figure 2b shows an example of how news stories are presented. Here we see one news article “Theresa May urges media restraint 3. USER INTERFACE in coverage of terror suspects” from the Guardian about politics A web-based user interface is developed to make the news stream and terror, followed by another news story from BBC. The three contents explorable on mobile devices. In this interface, the user is circular buttons on the bottom of the screen allow users to toggle allowed to extract news items that are relevant to the geo special whether their locality, personal interest profile and time setting locality context, personal interests and given point of time. These such influence news story retrieval. three relevance factors are customizable and the user can select whether or not they should influence the retrieval and ranking of By clicking on a news story, the user gets the ingress of the news available news items. story and a list of the most salient entities for the selected news story. Figure 1c shows the ingress and relevant WikiData entities To customize the geographical locality, the user specifies a from the news article about Theresa May. As we can see, our circular relevance region on a map. Figure 2a shows an example news story about politics and terror related to Syria, Theresa May, of such a relevance region. By default, the relevance region is set ISIL and Sky News. By hovering these items, the user is to users current GPS location with a 50 km radius. By moving the presented with their textual WikiData description. On figure 2c, region or modifying the radius, users can generate a local we can see that the WikiData entity for Theresa May contains the newspaper for any region of the world. If the location factor is description “British politician”. disabled, it means that the system is recommending news from any location in the world and news that are not containing In general, the three buttons at the bottom of the screen for location information. location, interest profile and time can at any time be activated and de-activated to provide very different recommendation strategies. In the current Smartmedia prototype, we have predefined a For example, keeping all buttons active with default parameters handful of user interest profiles. Examples of such profiles are means that the system will recommend news articles that have stock trader, soccer fan, technology geek, etc. Each profile recently takes place in the vicinity of the reader and are consistent consists of a weighted concept vector, where each entry is a with her profile. Figure 3 describes different combinations of WikiData entry associated with an interest score between 0 and 1. recommendation factors and summarizes how the user can control By selecting any of these interest profiles, the retrieved news will the retrieval and composition of news items. be influenced and biased towards the interest topics. When the Figure 1. Steps of the news stream processing pipeline. a) b) c) Figure 2. Screenshots from the Smartmedia prototype. a) The map query interface. b) Presentation of news stories. c) Presentation of news details. Both these NewsStand and News@Hand have user interfaces 4. RELATED WORK targeting desktops and larger device screens. They both provide People nowadays have access to more worldwide news user control over the retrieved set of news, either through a map information than ever before. As Internet services get more or category based navigation or preferences settings. information about their users and their context, they can deliver personal and customized contents and user experiences. Tran and Herder [11] have looked at the studied news event timelines and shown that manually constructed timelines are The prototype system, described in this paper, share similarities to subjective and often missing important dates or other information. other academic news applications such as NewsStand [8, 10] and By complementing the timelines with elements extracted News@Hand [1, 2]. Both these systems map textual news algorithmically from multiple sources, it is possible to create more contents to entities defined in a knowledge base. objective and argumentative timelines. However, the manual NewsStand targets geo spatial exploration of news. It is an processing and editing efforts are still needed to enhance the example application of a general framework developed to enable communicative qualities of the timelines, and to adapt it to the people to search for information using a map query interface. It needs of the readers utilize maps both to explore and find news stories and to visualize Parra et al. [6, 7] presents SetFusion, a visual user-controllable and present single news events. interface for hybrid recommender system. Their approach enables News@hand combines textual features and collaborative users explore and control the importance of recommender information to make news suggestions. It uses Semantic Web strategies using an interactive Venn diagram visualization. Their technologies to describe the news contents and user preferences. evaluations indicate that this interface had a positive effect on the Both news items and user profiles are represented in terms of user experience and improved users engagement. Their idea of concepts appearing in domain ontologies, and semantic relations using the Venn diagram to explain intersections among among those concepts are exploited to enrich the above recommendation approaches is transferable and valuable to the representations, and enhance recommendations. news domain. [2] Cantador, I. et al. 2008. Ontology-based personalised and context-aware recommendations of news items. Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Local Technology. 1, (2008). Retrieve news from a geospatial area [3] Erxleben, F. et al. 2014. Introducing Wikidata to the Personal Linked Data Web. The Semantic Web–ISWC 2014. Retrieve news matching the interest profile of (2014). the user Temporal Retrieve news published after a given date [4] Gulla, J.A. et al. 2013. Learning User Profiles in Mobile News Recommendation. Journal of Print and Media Local and personal Technology Research. II, 3 (2013), 183–194. Retrieve news that both relate to a geospatial area and match users interest profile. [5] Hansen, M. 2008. Marrying transparency tools with user- Local and temporal controlled identity management. The Future of Identity in Retrieve news related to a given geospatial area and published after a given date. the Information Society. (2008). Personal and temporal Retrieve news matching the interest profiles [6] Parra, D. et al. 2014. See what you want to see. of the user and published after a given date. Proceedings of the 19th international conference on Local, personal and temporal Intelligent User Interfaces - IUI ’14 (New York, New Retrieve news with relevance for the selected York, USA, Feb. 2014), 235–240. geospatial area and interest profile, and published after a given date. [7] Parra, D. and Brusilovsky, P. 2015. User-controllable personalization: A case study with SetFusion. International Journal of Human-Computer Studies. Figure 3. Combinations of selectable recommendation factors (2015). [8] Samet, H. et al. 2014. Reading news with maps by exploiting spatial synonyms. Communications of the 5. CONCLUSIONS AND FUTURE WORK ACM. 57, 10 (Sep. 2014), 64–77. The predefined user profiles can be replaced or used in combination with more personal profiles trained on traced [9] Tavakolifard, M. et al. 2013. Tailored news in the palm interaction logs from the system. As users leave interaction data of your hand: a multi-perspective transparent approach to behind, we can gather knowledge about what the users interests news recommendation. (May 2013), 305–308. are. However, for new users where no past interaction records exist, we have a cold-start problem where we still benefit on predefined stereotypes. [10] Teitler, B. and Lieberman, M. 2008. NewsStand: A new view on news. Proceedings of the 16th ACM In future work we plan to use trained personal profiles with SIGSPATIAL international conference on Advances in predefined stereotypes in combination. We will also gather user geographic information systems. (2008). feedback and evaluate to which extent users want to control and customize their news presentations and study how their requirements can be met in a mobile user interface design. [11] Tran, G. and Herder, E. 2015. Detecting Filter Bubbles in Ongoing News Stories. Extended Proc. UMAP 2015. Deep understanding of textual contents together with knowledge (2015). base structures provides a fundament for innovative and intelligent applications. This paper has described one such innovation from the news domain, and how its mobile user [12] Vrandečić, D. and Krötzsch, M. 2014. Wikidata: a free interface allow users to control the composition of news. A collaborative knowledgebase. Communications of the screencast video demonstrating the prototype and its user interface ACM. (2014). is available at: http://vimeo.com/121835936 6. REFERENCES [1] Cantador, I. et al. 2008. News@ hand: A semantic web approach to recommending news. Adaptive hypermedia and adaptive web-based systems. (2008).