<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Look beyond the Surface: A Demo for Explaining Knowledge Graph Embeddings and Entity Similarity</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Huu Tan Mai</string-name>
          <email>huu.mai@telecom-paris.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Youmna Ismaeil</string-name>
          <email>youmna.ismaeil@de.bosch.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Trung-Kien Tran</string-name>
          <email>trungkien.tran@de.bosch.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hendrik Blockeel</string-name>
          <email>hendrik.blockeel@kuleuven.be</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Daria Stepanova</string-name>
          <email>daria.stepanova@de.bosch.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Bosch Center for Artificial Intelligence</institution>
          ,
          <addr-line>Robert Bosch Campus 1, 71272 Renningen</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Computer Science, KU Leuven</institution>
          ,
          <addr-line>Leuven BE 3000</addr-line>
          ,
          <country country="BE">Belgium</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Télécom Paris</institution>
          ,
          <addr-line>Palaiseau</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Knowledge Graph embedding (KGE) methods map entities and relations in a KG into a low-dimensional vector space. KGEs have been effectively used for a variety of tasks such as link prediction, entity classification, and entity similarity. However, these methods are often treated as black boxes, providing users with no insight into the information captured by the embeddings or justification for the computed outcome on a particular task. Recently, FeaBI, a framework for interpreting pre-computed entity embeddings based on entity neighborhoods, has been proposed. In this paper we present a demo for this work. Our intuitive and interactive demo allows users to conveniently exploit the framework for computing embedding-based similarity between KG entities as well as for generating and visualizing explanations for the respective similarities.</p>
      </abstract>
      <kwd-group>
        <kwd>Explainable Entity Similarity</kwd>
        <kwd>Knowledge Graphs</kwd>
        <kwd>Knowledge Graph Embeddings</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-2">
      <title>1. Introduction</title>
      <p>
        Knowledge Graph embeddings (KGEs) (see, e.g., [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]) represent entities and relations in a
low-dimensional vector space. They have been useful in a range of tasks, including link prediction
(e.g., [
        <xref ref-type="bibr" rid="ref2 ref3 ref4">2, 3, 4</xref>
        ]), entity classification (e.g., [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ]) or entity similarity. However, despite their success,
KG embeddings are often regarded as black boxes. The lack of transparency and interpretability of
KGEs limits users’ understanding of their inner mechanisms and undermines trust in these
models. E.g., given an entity, embedding-based suggestions regarding other entities similar to it
might be less convincing if the user cannot examine the reasons behind the similarities.
      </p>
      <p>
        Recently, a framework named FeaBI [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] has been proposed for explaining pre-computed
entity embeddings. More specifically, given a KG and its embedding model, FeaBI employs
embedded feature selection techniques to extract from the KG propositional features in the form
of relations and entities that are important for a given KG embedding model. These features are
treated as KG embedding model explanations. FeaBI can be conveniently used for explaining
similarities between entities.
      </p>
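      <p>To make the idea concrete, the following toy sketch (our own illustration, not FeaBI's actual code; all data, sizes, and feature names are assumptions) mimics the embedded feature selection step: pre-computed entity embeddings are approximated from binary KG features with a linear model, and features are ranked by the magnitude of their learned coefficients.</p>

```python
import numpy as np

# Toy illustration of FeaBI's core idea (all data here is synthetic):
# approximate pre-computed entity embeddings from binary KG features and
# rank the features by how much they contribute to the reconstruction.

rng = np.random.default_rng(0)
n_entities, n_features, dim = 50, 6, 4

# Binary feature matrix: F[i, j] = 1 iff entity i has feature j
# (e.g., a relation such as "directed" or a neighbor such as "Germany").
F = rng.integers(0, 2, size=(n_entities, n_features)).astype(float)

# Synthetic embeddings that depend mostly on features 0 and 3.
W_true = np.zeros((n_features, dim))
W_true[0] = 2.0
W_true[3] = -1.5
E = F @ W_true + 0.01 * rng.normal(size=(n_entities, dim))

# Ridge regression (closed form) maps features to embedding dimensions.
lam = 0.1
W = np.linalg.solve(F.T @ F + lam * np.eye(n_features), F.T @ E)

# Importance of a feature = norm of its coefficient row across dimensions.
importance = np.linalg.norm(W, axis=1)
ranking = np.argsort(-importance)
print(ranking[:2])  # features 0 and 3 should rank highest
```

      <p>In FeaBI itself the features are constructed from entity neighborhoods in the KG and a dedicated embedded feature selection technique is used (see [7]); the sketch only conveys the ranking principle.</p>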
      <p>[Figure: Overview of the FeaBI pipeline: (1) training of embedding models (e.g., TransE, CompGCN, NodePiece, SNoRe) yields entity embeddings; (2) KG feature generation produces initial feature vectors for the entity embeddings; (3) feature selection reconstructs the entity embeddings from the initial feature vectors.]</p>
    </sec>
    <sec id="sec-3">
      <title>2. Demo Overview</title>
      <table-wrap id="tab1">
        <label>Table 1</label>
        <caption>
          <p>Total FeaBI runtime (feature construction and feature selection) for the four supported embedding models; the upper four rows correspond to one of the two KGs available in the demo, the lower four rows to the other.</p>
        </caption>
        <table>
          <thead>
            <tr><th>Model</th><th>FeaBI total runtime (s)</th></tr>
          </thead>
          <tbody>
            <tr><td>TransE</td><td>10.39 ± 0.37</td></tr>
            <tr><td>CompGCN</td><td>9.28 ± 0.31</td></tr>
            <tr><td>NodePiece</td><td>9.96 ± 0.19</td></tr>
            <tr><td>SNoRe</td><td>16.94 ± 0.20</td></tr>
            <tr><td>TransE</td><td>30.89 ± 2.02</td></tr>
            <tr><td>CompGCN</td><td>26.45 ± 1.00</td></tr>
            <tr><td>NodePiece</td><td>30.53 ± 0.82</td></tr>
            <tr><td>SNoRe</td><td>33.01 ± 0.68</td></tr>
          </tbody>
        </table>
      </table-wrap>
      <p>
FeaBI (Backend). For a given KG and its embedding model, FeaBI computes KG embedding
explanations defined as a list of KG features ranked based on their importance for the generation
of the KG embedding. The top most important features are then used to build interpretable
representations of the KG entity embeddings. The main components of FeaBI are KG embedding
training, feature construction and feature selection (see [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] for details). The training of the KG
embedding model is naturally the most time-consuming step, typically taking up to 5
hours (e.g., for CompGCN on the FB15K237 dataset). Therefore, our demo provides a number
of pre-trained embedding models. At the moment we support 4 popular embedding models:
TransE [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], CompGCN [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], NodePiece [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] and SNoRe [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], but other pre-trained embeddings
can also be provided by users, as illustrated in Figure 3.
      </p>
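      <p>The last backend step, building interpretable representations from the top-ranked features, can be sketched as follows (a minimal toy example under assumed feature names and importance scores, not FeaBI's actual output): each entity is represented by its restriction to the top-k selected features.</p>

```python
import numpy as np

# Hypothetical sketch: given importance scores produced by feature selection,
# keep the top-k features and represent each entity by its indicator values
# over exactly those features. Names and numbers below are invented.

feature_names = ["type:City", "locatedIn:Germany", "hasAirport",
                 "type:Town", "locatedIn:Belgium"]

# Full binary feature matrix (entities x features).
F = np.array([
    [1, 1, 1, 0, 0],   # entity 0
    [1, 1, 1, 0, 0],   # entity 1
    [0, 1, 0, 1, 0],   # entity 2
])

# Importance scores as produced by the feature selection step (assumed).
importance = np.array([0.9, 0.7, 0.4, 0.2, 0.1])

k = 3
top = np.argsort(-importance)[:k]          # indices of the top-k features
interpretable = F[:, top]                  # entity x top-k indicator matrix
selected_names = [feature_names[i] for i in top]
print(selected_names)
print(interpretable[0])                    # interpretable vector of entity 0
```
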
      <p>Table 1 shows the running time of the feature construction and feature selection steps of
FeaBI for two popular KGs and the embedding models available in the demo.</p>
      <p>Webservice. The webservice handles the communication between FeaBI and the frontend. In the
frontend, the user first selects the KG and KG embedding model, which are then passed to
FeaBI via the webservice. Subsequently, FeaBI computes the results, which are sent back to the
webservice and presented to the user via the frontend.</p>
      <p>Frontend. The frontend allows users to conveniently explore the model explanations for a
given embedding model, entity embedding explanations, as well as explanations for similarities
between a pair of selected entities retrieved by the webservice.</p>
      <p>The workflow of the demo proceeds as follows. First, the user selects a KG and an
embedding model from the provided list (or uploads custom ones) via the visual interface. Then, a
model explanation (i.e., a list of symbolic features ranked by their importance) is automatically
generated and presented to the user (see Figure 4).</p>
      <p>Additionally, the demo offers the possibility to compare entities in the KG in terms of their
similarity relying on the given embedding model. As shown in Figure 5, for a given entity
provided by the user, similar entities can be retrieved based on the distance metric in the
embedding space (cosine similarity and Euclidean distance are currently supported). The user
can select any pair of entities and use the system to generate explanations for their similarity, i.e.,
a list of selected KG features that the entities share along with their graph-based visualizations.</p>
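      <p>This similarity-and-explanation step can be sketched as follows (a self-contained toy example with invented entities, embeddings, and features, not the demo's actual API): nearest entities are retrieved by cosine similarity, and a pair of entities is explained by the interpretable KG features they share.</p>

```python
import numpy as np

# Toy sketch of the demo's similarity workflow; all names and vectors
# below are assumptions made for illustration only.
entities = ["Berlin", "Munich", "Renningen", "Leuven"]
emb = np.array([
    [0.9, 0.1, 0.8],
    [0.8, 0.2, 0.7],
    [0.1, 0.9, 0.2],
    [0.2, 0.8, 0.1],
])

# Interpretable (selected) KG features per entity, hypothetical.
features = {
    "Berlin":    {"locatedIn:Germany", "type:City", "hasAirport"},
    "Munich":    {"locatedIn:Germany", "type:City", "hasAirport"},
    "Renningen": {"locatedIn:Germany", "type:Town"},
    "Leuven":    {"locatedIn:Belgium", "type:City"},
}

def cosine(u, v):
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

def most_similar(name, k=1):
    """Retrieve the k nearest entities by cosine similarity in embedding space."""
    i = entities.index(name)
    scores = [(cosine(emb[i], emb[j]), entities[j])
              for j in range(len(entities)) if j != i]
    return [n for _, n in sorted(scores, reverse=True)[:k]]

def explain(a, b):
    """Explain a pair's similarity by the selected KG features they share."""
    return sorted(features[a] & features[b])

print(most_similar("Berlin"))       # ['Munich']
print(explain("Berlin", "Munich"))  # the shared features explain the similarity
```

      <p>Euclidean distance, also supported by the demo, could be substituted for the cosine function above; the explanation step stays the same.</p>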
    </sec>
    <sec id="sec-4">
      <title>3. Conclusion</title>
      <p>
        We presented a demo for FeaBI [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], which is a recently proposed framework for explaining
KG embedding models. While the work in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] focuses on the technical details of the method, our
demo system allows users to easily analyse the KG features captured by an embedding model
as well as the reasons behind embedding-based entity similarities. Future directions include the
analysis of explanations for relation embeddings as well as the consideration of ontologies and
KG schemas within the studied framework.
      </p>
      <p>Acknowledgements. This work was partially funded by the grant ANR-20-CHIA-0012-01
(“NoRDF”) and the European project SMARTEDGE (grant number 101092908).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Q.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Mao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <article-title>Knowledge graph embedding: A survey of approaches and applications</article-title>
          ,
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          <volume>29</volume>
          (
          <year>2017</year>
          )
          <fpage>2724</fpage>
          -
          <lpage>2743</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.</given-names>
            <surname>Bordes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Usunier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>García-Durán</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Weston</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Yakhnenko</surname>
          </string-name>
          ,
          <article-title>Translating embeddings for modeling multi-relational data</article-title>
          ,
          <source>in: NeurIPs</source>
          ,
          <year>2013</year>
          , pp.
          <fpage>2787</fpage>
          -
          <lpage>2795</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>T.</given-names>
            <surname>Trouillon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Welbl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Riedel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>É.</given-names>
            <surname>Gaussier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Bouchard</surname>
          </string-name>
          ,
          <article-title>Complex embeddings for simple link prediction</article-title>
          ,
          <source>in: ICML</source>
          ,
          <year>2016</year>
          , pp.
          <fpage>2071</fpage>
          -
          <lpage>2080</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>B.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Yih</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Gao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Deng</surname>
          </string-name>
          ,
          <article-title>Embedding entities and relations for learning and inference in knowledge bases</article-title>
          ,
          <source>in: ICLR</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>P.</given-names>
            <surname>Ristoski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Rosati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. D.</given-names>
            <surname>Noia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. D.</given-names>
            <surname>Leone</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Paulheim</surname>
          </string-name>
          ,
          <article-title>RDF2Vec: RDF graph embeddings and their applications</article-title>
          ,
          <source>Semantic Web</source>
          <volume>10</volume>
          (
          <year>2019</year>
          )
          <fpage>721</fpage>
          -
          <lpage>752</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>T. N.</given-names>
            <surname>Kipf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Welling</surname>
          </string-name>
          ,
          <article-title>Semi-supervised classification with graph convolutional networks</article-title>
          ,
          <source>in: ICLR</source>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Ismaeil</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Stepanova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. K.</given-names>
            <surname>Tran</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Blockeel</surname>
          </string-name>
          ,
          <article-title>FeaBI: A feature selection-based framework for interpreting KG embeddings</article-title>
          ,
          <source>in: ISWC</source>
          ,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>S.</given-names>
            <surname>Vashishth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sanyal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Nitin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. P.</given-names>
            <surname>Talukdar</surname>
          </string-name>
          ,
          <article-title>Composition-based multi-relational graph convolutional networks</article-title>
          ,
          <source>in: ICLR</source>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>M.</given-names>
            <surname>Galkin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. G.</given-names>
            <surname>Denis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. L.</given-names>
            <surname>Hamilton</surname>
          </string-name>
          ,
          <article-title>NodePiece: Compositional and parameter-efficient representations of large knowledge graphs</article-title>
          ,
          <source>in: ICLR</source>
          ,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>S.</given-names>
            <surname>Mežnar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Lavrač</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Škrlj</surname>
          </string-name>
          ,
          <article-title>SNoRe: Scalable unsupervised learning of symbolic node representations</article-title>
          ,
          <source>IEEE Access</source>
          <volume>8</volume>
          (
          <year>2020</year>
          )
          <fpage>212568</fpage>
          -
          <lpage>212588</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>