refer: a Linked Data based Text Annotation and Recommender System for Wordpress Tabea Tietz1 , Jörg Waitelonis1 , Joscha Jäger2 , and Harald Sack1 1 Hasso-Plattner-Intitute, Prof.-Dr.-Helmert-Str. 2-3, 14482 Potsdam, Germany firstname.lastname@hpi.de, 2 yovisto GmbH, August-Bebel-Str. 25-53, 14482 Potsdam, Germany firstname@yovisto.com 1 Introduction When searching for an arbitrary subject in weblogs or archives, users often don’t get the information they are really looking for, because they are overwhelmed with an overflow of information while sometimes the presented information is too scarce to make any use of it. Without further knowledge about the context or background of the intended subject users are easily frustrated because they either cannot handle the amount of information or they might give up because they cannot make sense of the topic at all. Furthermore, authors of online-platforms often deal with the issue to provide useful recommendations of other articles and to motivate the readers to explore more of the available but often hidden content of their blog or archive. In the demo presentation, we present refer, a semantic annotation and visu- alization system integrated into the Wordpress platform. refer enables content creators to (semi-)automatically annotate their texts with DBpedia resources as part of the original writing process and visualize them automatically. With refer users are encouraged to take an active part in discovering a platform’s informa- tion content interactively. They can discover background information as well as relationships among persons, places, events, and anything related to the subject in current focus and are inspired to navigate the previously hidden information on a platform. Related systems include Pundit [2] and WYSIWYM [1]. The original concept of the presented user interface and a first prototype have already been presented in [4]. 2 refer Components refer consists of the following tools and visualizations, integrated into Wordpress (cf. Fig. 1). An Annotator is implemented as an extension of the Wordpress text editing interface to create semantic text annotations based on DBpedia. To create annotations, the author selects a text fraction in the Wordpress editing environment. The author can further choose between manual and automated an- notation by means of automated Named Entity Linking (NEL). For automated annotation, refer deploys KEA-NEL [6], which implements entity linking with DBpedia entities [5]. Nevertheless, there are still errors that have to be revised manually. The user has to be supported in selecting the correct entity from the large knowledge-base which is a challenge especially for highly ambiguous tex- tual mentions. Further, utilities are applied to rank and organize the candidate lists according to e. g. string similarity with the entity mention, or popularity of the entity [3] to increase the users’ annotation performance. The refer Annotator provides two configurable user interface modes: (1) The Modal Annotator is triggered via a button in the TinyMCE control panel, whereby the suggested entities are displayed in a modal dialog window (cf. Fig. 1.1) in a table-based layout. Entities are divided into the four categories Person (green), Place (blue), Event (yellow) and Thing (purple). The text abstract and entity IRI is displayed on mouseover. The selected annotation is encoded in RDFa markup, which is added to the according text fragment. (2) The Inline Annotator (cf. Fig. 1.2) is triggered automatically via highlighting a text fragment and opens a circu- lar category menu which allows the user to instantly show suggestions from the respective category (Person, Place, Event, Thing). By selecting a category, the suggested entities are displayed. In order to provide more context within the relatively small space, these entities are divided into dynamically retrieved sub-categories. While the Inline Annotator provides fast and simple means of semantic text annotation without losing the context of the written text, the Modal Annotator provides a larger overview and enables a better comparison of entities. Infoboxes are used to visualize annotations in the article view, and anno- tated entities are indicated directly in the article text. On mouseover, an infobox as in Fig. 1.3 is shown right below the annotated text fragment, which contains basic information about the entity, e.g. a label, a thumbnail, and additional data in a table-layout. The infobox allows the user to gather basic facts about an en- tity and relations to other entities. While some basic information can be derived just from the webpage‘s RDFa microdata and is displayed instantly, additional content is asynchronously loaded from the web service. When the text fragment or any of the infobox entities are clicked, the Relation Browser slides down from the top of the page with the selected entity in focus. The Relation Browser and Recommender visualizes relationships be- tween annotations as well as suggestions for further reading. It allows users to navigate and explore relations among entities. It can be opened by the user either via click on the refer icon bar on top of the page or by selecting an entity of inter- est in the article text, (e.g. Jules Verne). The entity Jules Verne then becomes the focus-entity. Its DBpedia abstract and image are displayed on the bottom of the Relation Browser and its background color depends on its category (Person, Place, Event, Thing). Based on the focus-entity, related entities (derived from DBpedia and all annotations available on the platform) are displayed in a four column grid. The pagination bars on the right hand side of each column allow the users to browse through all entities in a category. When hovering one of the displayed entities (e.g. Jacques Cousteau in Fig. 1.4), relations to the focus-entity 2 and to further entities in the grid-view (e.g. Oceanography) are visualized by line connectors. A label (property) indicates the direction and type of connection. If there are more entity-relations than displayed in the first overview, connec- tions to hidden entities are indicated by dotted lines, which can be activated via hovering a small ’plus’ icon inside the entity box. A click on an entity in the grid-view replaces the focus-entity with the selected item and refreshes the related entities in all categories. A ranked list of recommended blogposts for the entity in focus is displayed on the bottom-left. The recommendations comprise blogposts that cover the focus-entity and entities related to the focus entity. The more entities are related with the entity in focus, the higher is the rank of the recommended article in the list. 3 Summary We have presented refer as a tool to help blog authors to enhance their content with the help of Linked Open Data and to engage the readers to further explore the available content as well as to provide background information from DBpedia and Wikipedia. A proof-of-concept is provided as being integrated into a daily weblog3 . The Wordpress-plugin of refer is publicly available for download4 , and a screencast5 demonstrates the annotator interfaces. Acknowledgement: This work has been funded by the German Government, Federal Ministry of Education an Research under project number 03WKCJ4D. References 1. A. Khalili and S. Auer. WYSIWYM – integrated visualization, exploration and authoring of semantically enriched unstructured content. Semantic Web Journal, 1, 2014. 2. C. Morbidoni and A. Piccioli. The Semantic Web: ESWC 2015 Satellite Events, chapter Curating a Document Collection via Crowdsourcing with Pundit 2.0, pages 102–106. Springer, 2015. 3. J. Osterhoff, J. Waitelonis, and H. Sack. Widen the Peepholes! Entity-Based Auto- Suggestion as a rich and yet immediate Starting Point for Exploratory Search. In Proc. of 2nd Workshop Interaction and Visualization in the Web of Data (IVDW 2012), 2012. 4. T. Tietz, J. Waitelonis, J. Jäger, and H. Sack. Smart media navigator: Visualizing recommendations based on linked data. In 13th Int. Semantic Web Conf., 2014. 5. R. Usbeck et al. Gerbil: General entity annotator benchmarking framework. In Proc. of the 24th Int. Conf. on World Wide Web, pages 1133–1143. ACM, 2015. 6. J. Waitelonis and H. Sack. Named entity linking in #tweets with kea. In 6th workshop on ’Making Sense of Microposts’, Named Entity Recognition and Linking (NEEL) Challenge, at the 25th Int. World Wide Web Conf., 2016. 3 http://blog.yovisto.com 4 http://refer.cx/ 5 http://s16a.org/sites/default/files/refer/refer_Screencast.mov 3 Fig. 1. refer components: Modal Annotator (1), Inline Annotator (2), Infobox (3), Relation Browser and Recommender (4) 4