=Paper= {{Paper |id=Vol-2721/paper485 |storemode=property |title=ScholarLensViz: A Visualization Framework for Transparency in Semantic User Profiles |pdfUrl=https://ceur-ws.org/Vol-2721/paper485.pdf |volume=Vol-2721 |authors=Felicitas Löffler,Valentin Wesp,Samira Babalou,Philipp Kahn,René Lachmann,Bahar Sateli,René Witte,Birgitta König-Ries |dblpUrl=https://dblp.org/rec/conf/semweb/LofflerWBKLSWK20 }} ==ScholarLensViz: A Visualization Framework for Transparency in Semantic User Profiles== https://ceur-ws.org/Vol-2721/paper485.pdf
 ScholarLensViz: A Visualization Framework for
    Transparency in Semantic User Profiles

 Felicitas Löffler1 ID , Valentin Wesp1 ID , Samira Babalou1 ID , Philipp Kahn1 ,
               René Lachmann1 ID , Bahar Sateli4 ID , René Witte4 ID ,
                              Birgitta König-Ries1,2,3 ID ?
                 1
                   Heinz Nixdorf Chair for Distributed Information Systems
        Institute for Computer Science, Friedrich Schiller University Jena, Germany
                            {firstname.lastname}@uni-jena.de
      2
         Michael Stifel Center for Data-Driven and Simulation Science, Jena, Germany
    3
        German Center for Integrative Biodiversity Research (iDiv), Halle-Jena-Leipzig,
                                          Germany
             4
               Semantic Software Lab, Concordia University, Montréal, Canada
                            {lastname@semanticsoftware.info}



         Abstract. Personalized applications are a two-edged sword. They are
         convenient and assist users by keeping the focus on relevant topics, but
         they are often black boxes and users typically do not know why cer-
         tain entries appear in their profile. As transparency and provenance are
         essential for researchers, in this paper, we introduce ScholarLensViz, a
         visualization component for scholarly user profiles displaying a scholar’s
         research competences including the provenance. It also provides visual-
         izations to inspect the diversity of a profile and to analyze the semantic
         similarity of the profile entries.


Keywords: Visualization, Provenance, Semantic User Profiles, LOD


1      Introduction

Scholarly user profiles are gaining increasing attention and are becoming more
important for various application scenarios, such as expertise retrieval [10],
search and recommendation of research articles [1] or research network analy-
sis [6]. These intelligent systems support scholars in time-consuming daily tasks,
such as data discovery and data reuse, by providing content tailored to a scholar’s
preferences. In most applications, scholars are permitted to inspect these col-
lected user preferences but do not know where the information comes from.
However, the justification of entries is an important issue in scholarly profiles.
Users need explanations why a certain preference is presented.
?
    Copyright c 2020 for this paper by its authors. Use permitted under Creative Com-
    mons License Attribution 4.0 International (CC BY 4.0).
    In this work, we address these issues and introduce ScholarLensViz 5 , a visu-
alization framework to display scholarly profiles based on a semantic user model.
In earlier work, Bakalov et al. [2] proved the feasibility of a transparent visu-
alization of semantic user profiles. We further developed that idea and present
a new graphical user interface with linkage to the LOD cloud. ScholarLensViz
presents a scholar’s competences and their provenance. We obtain the compe-
tences from publications and store them in an RDF graph, using a workflow
being introduced in our previous work ScholarLens [8]. The system also enables
the inspection of a profile’s diversity and provides visualizations to explore the
semantic similarity of the profile entries.


2     Related Work
Various scholarly applications have emerged that create user profiles out of dif-
ferent resources, for different purposes and with various visualizations. Semantic
Scholar [1], a smart search over research articles, generates user profiles from
publications based on paper heuristics. In contrast, AMiner [10] focuses on ex-
pertise retrieval, a research field that aims at finding experts on a certain topic.
Following a topic modeling approach, data is collected from institutional web-
sites, documents and conferences. The visualization provided includes radar dia-
grams for author’s statistics and stacked area charts for a scholar’s automatically
extracted research interests. Full semantic approaches are VIVO [5] and Scho-
lia [6]. While VIVO provides a framework, an ontology and a graphical user
interface for representing and visualizing scholars and their research context
within an organization, Scholia aims at visualizing scientific bibliographic infor-
mation through Wikidata. Both applications provide visualizations for co-author
networks and research topics a scholar is competent in. In addition, Scholia of-
fers various entry points to explore not only authors and publications but also
organizations, locations or projects and supports numerous display formats such
as timelines, scatter charts, line chart or trees. Provenance information, e.g.,
explanations from which source a topic or research interest has been extracted,
is less considered in current approaches for scholarly user profiles. Our system
attempts to close that gap and to leverage transparency and provenance in the
visualization of semantic user profiles.


3     Architecture
Fig. 1 presents the overall concept. A scholar’s provided or selected publications
are converted from PDF to XML and are added to ScholarLens [8] (orange-
colored) that processes the documents and extracts competences. The user model
comprises three core concepts: scholars, their competence topics and a set of
scholarly documents. All scholars are instances of the User Model Ontology
(UM) [3] User class. Users have CompetenceRecords containing the individual
5
    ScholarLensViz source code, https://github.com/fusion-jena/ScholarLensViz
Competence as a literal. These classes are used from the Competence Manage-
ment Ontology (CM) [4]. In addition, we model a user’s publications with the
PUBO Ontology [9] that provides properties to describe relations between docu-
ments and semantic annotations generated by the pipeline. The final output is an
RDF graph containing the competences and being visualized by ScholarLensViz
(blue-colored). We separated the visualization (client-side) and the calls to the




Fig. 1: Overall concept to construct scholarly profiles with ScholarLens and to
visualize and integrate it in personalized applications with ScholarLensViz.


knowledge base (server-side). Thus, a server handles the SPARQL requests and
provides HTTP methods to the client. This separation minimizes security risks
such as sending direct SPARQL queries from the client to the knowledge base.
Embedding ScholarLensViz into personalized applications, for instance as wid-
get or standalone application, requires additional authentication mechanism to
be handled by the integrating system (green-colored).

4   ScholarLensViz
ScholarLensViz provides three dialogs. (A) The start dialog aims to display a
user’s Top-25 ranked competences and their provenance. The elements of the
pie chart (Fig. 2) represent a user’s competences. The arc of the pie slices de-
notes the competence rank, and the color represents the category obtained from
the Computer Science Ontology (CSO) [7] by matching the competence entries
(which are presented as DBpedia entities) with CSO labels. The comment tab,
becoming visible by clicking on a pie element, provides the competence descrip-
tion obtained from DBpedia. In order to justify the origin of the competences,
we provide a provenance tab displaying the sentences from a user’s publications
and highlighting the relevant phrases that contain the competence and its rank,
the paper title and, if available, the DOI.
    In order to inspect the diversity of a user’s profile and the semantic similar-
ity between the profile entries, ScholarLensViz offers two further charts (Fig. 3).
Fig. 2: The start dialog presents a scholarly user profile in a pie chart and displays
the publications from which the competences have been extracted.



(B) A force-directed graph and (C) a chord chart visualizing the semantic sim-
ilarity and relatedness between the competences computed with Sematch [11].
While semantic similarity is hierarchy-based, the semantic relatedness consid-
ers all relations between concepts [11]. We use the average of both values as
both values influence the diversity of a profile. The more diverse a profile is,
the less strong connections appear in the force-directed graph per competence.
The chord chart allows a better analysis on the accumulated similarity values
over the selected competences. The length of the outer arc reflects the number
of similar connections a competence provides.




Fig. 3: Semantic similarity and relatedness of the profile entries are displayed
in a force-directed graph (left) and a chord chart (right). Dotted lines in the
force-directed graph denote a lesser (<0.4) semantic similarity, a dashed line a
medium (0.4 – 0.7) similarity and solid (>0.7) lines a strong semantic similarity
between the profile entries. The colors in the chord chart correspond to the
colored competence entries.
5     Demonstration
In this demo6 , visitors can inspect the competences from known computer scien-
tists working in the fields of Semantic Web. In addition, we added experts from
other computer science domains as a basis for comparison. Users can select the
competences and can change the thresholds for the semantic similarity.

Acknowledgements This work was supported by the German Academic Ex-
change Service (DAAD) within the scope of the PPP program and the Deutsche
Forschungsgemeinschaft (DFG) within the scope of the GFBio project (229241684).


References
 1. Ammar, W., Groeneveld, D., Bhagavatula, C., Beltagy, I., Crawford, M., Downey,
    D., Dunkelberger, J., Elgohary, A., Feldman, S., Ha, V., Kinney, R., Kohlmeier,
    S., Lo, K., Murray, T., Ooi, H., Peters, M., Power, J., Skjonsberg, S., Wang, L.,
    Wilhelm, C., Yuan, Z., van Zuylen, M., Etzioni, O.: Construction of the literature
    graph in semantic scholar. In: Proceedings of the 2018 Conference of the North
    American Chapter of the Association for Computational Linguistics (2018)
 2. Bakalov, F., Meurs, M.J., König-Ries, B., Sateli, B., Witte, R., Butler, G., Tsang,
    A.: An approach to controlling user models and personalization effects in recom-
    mender systems. In: IUI. ACM, New York, NY, USA (2013)
 3. Jovanovic, J., Gasevic, D., Siadaty, M.: Intelleo user model ontology. http:
    //intelleo.eu/ontologies/user-model/spec/ (March 2012)
 4. Jovanovic, J., Siadaty, M., Gasevic, D., Milikic, N.: Intelleo competences ontology.
    http://www.intelleo.eu/ontologies/competences/spec/ (2011)
 5. Krafft, D., Cappadona, N., Caruso, B., Corson-Rikert, J., Devare, M., Lowe, B.:
    Vivo: Enabling national networking of scientists. In: Proceedings of the Web Sci-
    ence Conference 2010 (2010)
 6. Nielsen, F.Å., Mietchen, D., Willighagen, E.: Scholia, scientometrics and wikidata.
    In: Blomqvist, E., Hose, K., Paulheim, H., Lawrynowicz, A., Ciravegna, F., Hartig,
    O. (eds.) The Semantic Web: ESWC 2017 Satellite Events. pp. 237–259. Springer
    International Publishing, Cham (2017)
 7. Salatino, A.A., Thanapalasingam, T., Mannocci, A., Osborne, F., Motta, E.: The
    computer science ontology: A large-scale taxonomy of research areas. In: Interna-
    tional Semantic Web Conference 2018, Monterey (CA), USA, 2018.
 8. Sateli, B., Löffler, F., König-Ries, B., Witte, R.: Scholarlens: extracting compe-
    tences from research publications for the automatic generation of semantic user
    profiles. PeerJ Computer Science 3, e121 (Jul 2017)
 9. Sateli, B., Witte, R.: Semantic representation of scientific literature: bringing
    claims, contributions and named entities onto the linked open data cloud. PeerJ
    Computer Science 1, e37 (Dec 2015)
10. Wan, H., Zhang, Y., Zhang, J., Tang, J.: Aminer: Search and mining of academic
    social networks. Data Intelligence (2019)
11. Zhu, G., Iglesias, C.A.: Computing semantic similarity of concepts in knowledge
    graphs. IEEE Transactions on Knowledge and Data Engineering 29(1) (2017)

6
    https://dev.gfbio.org/scholar/