-

1613-0073

Medical Diagnosis based on an Augmented Knowledge Graph

Niclas Heilig

niclas.heilig@medicalvalues.de 0 2

Jan Kirchhof

jan.kirchhoff@medicalvalues.de 0

Florian Stumpe

florian.stumpe@medicalvalues.de 0

Joan Plepi

joan.plepi@uni-marburg.de 1

Lucie Flek

lucie.flek@uni-marburg.de 1

Heiko Paulheim

heiko@informatik.uni-mannheim.de 2

Medical Diagnosis, Knowledge Graph, Explainable Prediction, RDF2vec

0 MedicalValues GmbH , Germany 1 Philipps-Universität Marburg , Germany 2 University of Mannheim , Germany

Medical diagnosis is the process of making a prediction of the disease a patient is likely to have, given a set of symptoms and observations. This requires extensive expert knowledge, in particular when covering a large variety of diseases. Such knowledge can be coded in a knowledge graph - encompassing diseases, symptoms, and diagnosis paths. Since both the knowledge itself and its encoding can be incomplete, refining the knowledge graph with additional information helps physicians making better predictions. At the same time, for deployment in a hospital, the diagnosis must be explainable and transparent. In this paper, we present an approach using diagnosis paths in a medical knowledge graph. We show that those graphs can be refined using latent representations with RDF2vec, while the final diagnosis is still made in an explainable way. Using both an intrinsic as well as an expert-based evaluation, we show that the embedding-based prediction approach is beneficial for refining the graph with additional valid conditions.

CEUR ceur-ws.org

1. Introduction

Medical diagnosis is defined as the process of predicting the disease a patient is likely to have, given a set of symptoms and observations. This process is often not a one shot endeavor, but can involve diferent steps, such as examinations and the collection of additional evidence (e.g., through blood pictures, X-ray imaging, etc.). We call these processes diagnosis paths. In order to arrive at the correct diagnosis, a physician needs to know those paths, collect the required evidence and combine the information about symptoms and observations.

The information on the diferent symptoms, observations and diseases can be stored in a knowledge graph [ 1 ]. Diagnosis paths can also be formalized in such a knowledge graph, and be used to reason about a patient’s diagnosis, just like a medical expert following that path. The SeWeBMeDA-2022: 5th Workshop on Semantic Web solutions for large-scale biomedical data analytics, May 29, 2022, CEUR Workshop Proceedings medicalvalues knowledge graph encodes such knowledge on more than 380 diseases, together with the corresponding diagnosis paths. Currently, the expert system based on this knowledge graph, ofered by the company medicalvalues, is in use in one large laboratory chain (4,000 employees) and also in diferent pilot stages in leading German university hospitals, including the university hospital in Mannheim (5,000 employees).

In the medical field, transparency, accountability and determinism of medical diagnosis systems are a key requirement for building systems that are accepted by medical experts. The medicalvalues knowledge graph comes with a reasoning system which uses the diagnosis paths to not only provide an expert with suggested diagnoses, but also with an explanation why it came to the conclusion.

The diagnosis paths in the knowledge graph are valid, but usually not complete, and may miss some alternatives and rare side conditions. In this paper, we show how patient data can help training a machine learning model to complete the diagnosis paths in the graph. To that end, we explore the use of knowledge graph embeddings [ 2, 3 ] to predict additional edges in the graph.

While it is also possible to utilize knowledge graph embeddings to directly infer a diagnosis, this would be at odds with the aim of providing explainable diagnoses. In contrast, we follow a knowledge graph refinement approach [4], focusing on augmenting the existing diagnosis paths (which then, in turn, provide explainable diagnoses) instead of predicting diseases directly. The refined diagnosis paths are evaluated for the validity by medical experts.

The rest of this paper is structured as follows. We review related work in section 2. We introduce the medicalvalues knowledge graph in section 3, and outline our refinement approach in section 4, followed by an evaluation in section 5. We conclude with a summary and an outlook on future work.

2. Related Work

The biomedical domain has been one of the earliest and most vivid adopters of semantic web technologies, ontologies, and knowledge graphs. The Linked Open Data cloud1 depicts a large subset of linked open datasets coming from the life sciences domain [5], most notably BioPortal [6] and Bio2RDF [7]. Moreover, a recent survey on domain-specific knowledge graphs showed a wide adoption of knowledge graphs in the healthcare domain. [8]

While knowledge graphs can be used for various purposes in the medical domain, such as the exploration of scientific literature, the focus of our work is on medical diagnosis. Similar approaches are discussed in [9, 10, 11, 12], where knowledge graphs of symptoms and diseases are constructed from electronic medical records (EMR), medical literature, and/or other sources by means of relation extraction. However, while those approaches build a bipartite graph of symptoms and diseases, the medicalvalues knowledge graph uses more complex diagnosis pathways and rules (see below).

Some approaches have been proposed in the recent years which cover particular subfields of medicine [13] or diseases [14]. In contrast to those works, the medicalvalues knowledge graph aims at covering a larger variety of diseases.

When it comes to refining existing medical knowledge graphs, the combination of knowledge graph embeddings and machine learning models, similarly to this paper, has been utilized in the past, e.g., for predicting drug-drug interaction [15, 16], gene-disease interaction [17], or other tasks, such as protein-protein interaction, protein function similarity, protein sequence similarity, and phenotype-based gene similarity [18]. Unlike the work presented in this paper, those approaches mostly use a single in-domain knowledge graph for their predictions, while we present a method using an integrated augmented knowledge graph incorporating numerous types of information.

3. The medicalvalues Knowledge Graph

The medicalvalues GmbH2 is a startup company developing software systems to provide decision support in hospitals. This software helps physicians to assess the risk of sufering from a disease and to decide about further examination steps. All decisions proposed by the software are substantiated with medical guidelines curated by medical experts. The disease information is modeled in a knowledge graph. Medical experts working at medicalvalues use the internal graph editor to encode expert knowledge, extracted from widely-accepted medical sources, in the form of rules for medical diagnosis.

The customers of medicalvalues are hospitals and laboratory providers. Therefore, integrations into clinic information systems (CIS) and laboratory information systems (LIS) are built. As the laboratory parameters are comparatively easy to analyze by machines, the medical focus lies on the evaluation of laboratory properties. Also, the current medical procedures show that it is possible to improve the laboratory parameters analysis.

In preliminary studies with medical experts, diferent representations of knowledge graphs (e.g., RDF graphs, labeled property graphs) were explored. Ultimately, labeled property graphs were found to be the most intuitive and usable and were therefore chosen as a representation mechanism for the medicalvalues knowledge graph. The graph is stored in a relational Postgres database, and the user interface for the medical experts is provided as a web application.

To identify the medical risk factors which can increase the risk to sufer from a disease, the medicalvalues knowledge graph reuses widely accepted coding systems. One of them is the International Classification of Diseases and Related Health Problems (ICD). The ICD system is maintained by the World Health Organization (WHO) and contains codes to exactly identify what a patient sufers from [ 19]. The diseases in the medicalvalues knowledge graph are associated with an ICD code, e.g., for epidemiological research or the billing process in hospitals. Furthermore, medicalvalues uses also the SNOMED coding system [20] to identify diseases, findings, and imaging results. In contrast to ICD, SNOMED is commonly used to describe the medical inputs to draw conclusions on possible diseases. For laboratory parameters, the Logical Observation Identifiers and Codes (LOINC) specialized on the identification of laboratory measurements, provides the identifiers in the graph [ 21]. Using common coding systems in the medicalvalues software is crucial to provide integration with other systems and datasets.

4. Refining Disease Diagnosis Paths with RDF2vec

For our experiments, we first use only the information in the medicalvalues knowledge graph, which contains a set of artificial test patients. We then build an enriched graph with additional patient data, extracted from a real dataset (MIMIC-IV), in order to provide extra evidence for refining the original medicalvalues knowledge graph.

4.1. Extending the medicalvalues Knowledge Graph into an Augmented Knowledge Graph

The patient data is extracted from the Medical Information Mart for Intensive Care IV (MIMICIV) database [22]. MIMIC-IV consists of data about more than 40,000 patients and is part of the PhysioNet repository of freely-available medical research data [23]. The database contains patient data collected at the intensive care units at the Beth Israel Deaconess Medical Center.

We process the data such that we are able to connect patients in MIMIC-IV with diseases of the medicalvalues knowledge graph. Since both MIMIC-IV and the medicalvalues knowledge graph use the same identifiers (i.e., LOINC and ICD-10), the linking can be done directly based on those identifiers. For every patient, the measured laboratory parameters, and the disease, which was recorded at hospital admission, are extracted. Additionally, we store the patients’ age and gender. Moreover, the measured parameters are evaluated, meaning that a numerical value (e.g., a Bilirubin concentration of 0.43mg/dl in a blood sample) is mapped to an interpreted diagnostic finding, usually on a three-point nominal scale, i.e., stating that the value is normal, increased, or decreased.

This categorization is based on the reference ranges of parameters which are defined by the medical experts and coded in the medicalvalues knowledge graph. They usually specify the 0.95-confidence interval of the parameter values of healthy patients. In total, the medicalvalues knowledge graph contains reference ranges for 529 parameters. Figure 2 shows the distribution of the values of parameter Ferritin in the MIMIC-IV dataset, together with the corresponding borders of the reference range.3

After the processing step, we merge the patients and their evaluated parameters with the 3Please note that the distribution is not representative for the mostly healthy overall population, but contains only diseases in the medicalvalues knowledge graph. As this graph is a labeled property graph, a translation into an RDF graph is needed. This RDF graph contains parameters together with their evaluation as nodes. Every evaluated patient is added to the graph as a new node. The measured parameters are then connected to the parameters of the medical rules.

Figure 3 shows an example of a patient (Patient_1) connected to a rule. The patient has a recorded disease (Cholestase_(Ikterus)) and a laboratory parameter (Bilirubin_total_increased). That parameter is automatically augmented by additional diagnoses (Bilirubin_ total_not_decreased, Bilirubin_total_not_normal) which are logic consequences of the parameter and simplify the connection of the patient to diferent diagnostic rules.

The resulting RDF graph has 59,813 nodes and 1,717,685 edges. The average in and out degree of nodes is 28.72.

It is important to point out that the augmented knowledge graph is only used for making predictions, but the patient data are not permanently added to the medicalvalues knowledge graphs. Thereby, privacy issues are avoided.

4.2. Training RDF2vec Embedding Vectors

RDF2vec is a method that computes a continuous vector representation for each node in a graph [24]. This method consists of two steps: extracting sequences from the graph using random walks, and utilizing the word2vec algorithm to generate vector embeddings from those sequences. hospitalized subjects for which the Ferritin value was determined. This explains why the majority of the sample has a value above the upper bound of the reference range.

While alternatives to random walks have been proposed as well, the result with those are not yet very conclusive [25, 26], which is why we decided to stick to random walks as a sequence extraction technique. In particular, we compare two flavors of random walks: • Classic random walks, as in the original RDF2vec implementation. We start with a fixed number of random walks from each node, only following the outgoing edges. As a result, there is the same number of random walks starting in each node. • Mid-Walks, originally introduced for RDF2vec Light [27], which also starts a fixed number of random walks per node, but following incoming and outgoing edges. As a result, the nodes can appear in any position in the walk, and this may not necessarily be in the starting position. Mid-walks are assumed to cover a wider variety of knowledge about nodes when not traversing all paths in the graph.

In both setups, we use a walk length of 4, and 100 walks per node. We use jRDF2vec to extract the walks and train the embedding vectors.4

The generated walks are used as input for word2vec using the skip-gram model [28]. Word2vec is a neural network that estimates the probabilities of words occurring in the context of other words. The walk sequences can be used as inputs to word2vec, as equivalent to sentences. After the training process the weights of the word2vec network are used as vector embeddings. In our experiments, we consider diferent dimensionality for the vectors (i.e., 50, 100, 200, 500, 1000, 2000).

4https://github.com/dwslab/jRDF2Vec 4.3. Predicting Augmentations for Diagnosis Paths

Given the knowledge encoded in the vector representations, we try to predict links in the knowledge graph. To that end, embedding vectors for each pair of a rule and a risk factor are concatenated and passed to a binary classifier to make a decision whether the risk factor should be included in the corresponding diagnosis rule or not. In figure 3, four risk factors (on the right-hand side) and one rule are shown. Therefore, this leads to four possible combinations between the rule and risk factors that can be predicted. The positive samples can be built using the existing relations in the graph. In the example, this is the relation from Bilirubin_total_increased to Rule_Cholestase and the relation from Alkaline_Phosphatase_increased to Rule_Cholestase. So, we take the corresponding vector embeddings of rule and risk factor and concatenate them to create a positive sample.

The prediction itself is then modeled as a binary classification task, using the concatenated vectors of the condition and the rule as input, as described in [ 2 ].

For generating positive samples, the approach of using existing links can be applied. On the other hand, there is no information of negative samples in the medicalvalues knowledge graph. As the graph is currently built and not many disease paths are considered complete, we have to work with the open-world assumption. Therefore, we experiment with three diferent approaches for building the negative samples: • In the first approach, we randomly sample pairs of rules and risk factors which are not present in the graph, and use them as negative samples (random). • We sample random risk factors per rules which are not connected in the graph, and use them as negative samples (per rule). This ensures that the classifier sees positive and negative examples for each rule. • In the last approach, we use explicit negations. For example, if we know that Bilirubin_total_increased is a positive example for the rule Rule_Cholestase, i.e., an increased bilirubin value is a signal for Cholestase, we construct a negative example for the signals relation using Bilirubin_total_decreased and the same rule (explicit). In other words: for all indicators which have an explicit opposite, such as an increased and a decreased value, we use those opposites for creating negative examples.

In our experiments, we use three classifiers, namely Support Vector Machines (SVMs), Logistic Regression, and Random Forests. For all of them, the optimal parameter settings are determined in an internal cross validation in grid search.5 It should be pointed out that although the classifiers, as well as the embeddings, are not interpretable, we aim at using them augmenting an interpretable decision system, which, in total, will provide transparent predictions. Fig. 4 shows the overall workflow of our approach.

5. Evaluation

The evaluation of the approach is done in two phases. In the first phase, we analyze how well the held-out relations can be reconstructed. We use a train/test split, with tuning via 5For SVM: kernel function and C, for Logistic Regression: solver and C, for Random Forests: number of trees, number of features, maximum depth, minimum samples in split and leaf nodes, sampling strategy.

proposed diagnosis path refinement classifier rule vector risk factor vector medivalvalues knowledge graph MIMIC IV dataset linking based on identifiers 10-fold cross-validation on the train partition. Here, we perform a number of ablation studies to investigate the influence of diferent parameters on the model. The code for the experiments is available online.6

In the second phase, in order to analyze the model quality manually, we asked a medical expert to review a sample of the produced predictions from the models.

5.1. Internal Evaluation

For the internal evaluation, we used the RDF2vec model trained on the merged augmented knowledge graph. In order to measure the impact on using the patient data as additional training data, we also trained an RDF2vec model on the plain medicalvalues knowledge graph as a baseline. We removed 25% of the relations of a condition and a rule from the graph as a test set. The target of the evaluation is to predict the presence of a relation, given a condition and a rule.

Table 2 shows the results of the internal evaluation. The F1 score of almost 93% is very encouraging. Moreover, the results show that using the augmented graph, enriched with information from the patient dataset, drastically improves both the precision and the recall of the prediction.

In addition, we tested the influence of individual methodological decisions of the model in order to better understand their impact.

Table 3 shows the influence of the two walk strategies (random walks vs. mid-walks). While mid-walks achieve a slightly better results when combined with Random Forests classifier, we decided to stick with random walks since the results are more stable across diferent classifiers.

The negative sampling strategy has also been evaluated. For the random sampling approach, the exact same amount of negatives as positive examples were generated. In the per rule sampling approach, we used diferent numbers of negatives, leading to diferently skewed datasets. The negatives for the opposite sampling approach, on the other hand, use a negative example for all positive examples for which a defined opposite exists. We can observe that the

6https://gitlab.com/medicalvalues-public/mvrdf2vec

best results are obtained using the opposite sampling strategy, while adding more negatives (and thereby skewing the dataset) leads to worse results for all classifiers. This shows that the incorporation of domain knowledge for generating negative samples improves the results.

In a final ablation study, we explore the embedding dimension, as shown in Table 5. While the original RDF2vec papers typically used 200 or 500 dimensions [24], and studies on other knowledge graph showed good results also for a smaller number of dimensions [27], we observed the best results for 1,000 and 2,000 dimensions. Ultimately, the best results were obtained by using opposites for negative labels, 1,000 dimensions, and a random forest classifier.

Besides the qualitative exploration, we also study the computational performance. All experiments were performed on a standard commodity laptop. The most time-consuming step was the conversion of the MIMIC-IV database and its conversion to a knowledge graph, which took more than 12 hours. On the other hand, training the RDF2vec model and computing the predictions could be performed in under 10 minutes, even for the higher dimensionalities.

5.2. External Evaluation

In order to harden the results in a more realistic setup, we presented a set of predictions to a medical domain expert. We used Random Forests with explicit negative sampling that was the best performing setup, and we collected predictions for pairs of rules and parameters from the MIMIC-IV dataset, where for the latter, we made the restriction that at least five evaluations (i.e., mappings to a nominal interpretation, see above) exist. The rules referred to a set of selected diseases. These diseases were chosen with help of the medical expert with the criterion that the disease paths includes at least one of the extracted and successfully evaluated parameters from MIMIC-IV. In total, 318 predictions were created that way and shown to the expert, who was asked to rate them on a five-point scale, as shown in Table 6.

Overall, we evaluated new relations for six diseases, i.e., iron deficiency anemia (IDA) , Hypothyroidism (Hypo), autoimmune hepatitis (AH), nonautoimmune hemolytic anemias (NHA), Benign neoplasm of pituitary gland (BNPG), and the tumor lysis syndrome (TLS). The results are shown in Table 7. It can be observed that the number of correct (37) and plausible (24) predictions is clearly higher than that of unlikely (11) and wrong (2) ones. The majority, however, are relations for which no statement can be made.

Examining some of the predictions in detail, we observe that only a small amount of contrary predictions (increased and decreased for the same parameter) was made. E.g., for hypothyroidism, only for two parameters (IgA, Cholesterol) a relation for both an increased and a decreased value is predicted; however, the relation between the disease and the value was correctly identified, and an expert can easily discard the wrong condition.

The predictions for iron deficiency anemia also contain an interesting result. The only two predictions marked as wrong by the expert were made for the parameter Ferritin. A decreased Ferritin value is a key indicator for iron deficiency anemia , but the model predicted the opposite. To understand the prediction, we looked at the records of the patients sufering from iron deficiency anemia . In total, 3,529 patients were associated with iron deficiency anemia , and Ferriting was measured only for 1,423 of them. This could hint at the fact that the disease was already known, and the patients were in the hospital for diferent reasons. This indicates that the prediction itself was not necessarily wrong, but based on a wrong assumption, i.e., the model treating the diagnosed disease and pre-known (maybe even not acute) diseases alike.

Summarizing, the evaluation shows two key outcomes: (1) it is possible to produce novel relations to present an expert with, e.g., for augmenting the knowledge graph in a human-inthe-loop setting, and (2) additional evidence and information is required in many cases to make an informed decision, i.e., the prediction alone is not suficient.

6. Conclusion and Outlook

In this paper, we have introduced the medicalvalues knowledge graph, which is used for medical diagnosis using so-called diagnosis paths. Those paths allow for a transparent prediction of a patient’s disease. Since the paths are developed manually, they are notoriously incomplete.

To tackle this incompleteness, we have introduced an approach which first enriches the medicalvalues knowledge graph into a augmented graph, connecting it to a large dataset of patient records. On that augmented graph, we have trained vector embeddings with RDF2vec, which are used to predict completions of the existing diagnosis paths. Both in an internal validation as well as in an expert evaluation, we have shown that the prediction of such extensions is possible with high precision. This methodology of enriching the graph and producing predictions therewith is independent of the task and domain at hand.

One key limitation of the approach is the external data used, which is data gathered from intensive care units. Therefore, diseases which do rarely lead to treatments in intensive care are not well covered. In order to augment diagnosis paths for as versatile diseases as possible, other external datasets should be considered as well. Here, the connectors to clinic information systems (CIS) and laboratory information systems (LIS) may also add large-scale instance data in the future, which can also be exploited with the same methodology.

So far, drugs are not represented in the medicalvalues knowledge graph. In the future, we would like to include them, both as a part of a patient’s medical history (i.e., existing medication), as well as possible treatments once a diagnosis is made. To that end, we plan to augment the graph with existing datasets on drugs and drug interactions.

When contrasting the internal and external evaluation, we have seen that while in both cases, the precision is rather high, there are still diferences, most prominently the high number of predictions which an expert cannot make any statement on. Here, it would be interesting which kind of additional information the expert needs to make a decision. Moreover, we plan for a study with a larger pool of medical experts from diferent medical sub fields in order to better assess the capabilities of the approach in diferent medical fields.

A lightweight augmentation would be pointing the expert to the original patient files as evidence. A more complex approach could incorporate search in scientific databases like PubMed [29] for articles which contain both the disease and the symptom at hand.

In the future, we want to explore the utility of other embedding models beyond RDF2vec [ 2 ]. From a technical perspective, other solutions are also possible. For example, explanation models [30] or symbolic rule learners [31] could be used in addition to the prediction method in order to provide the expert with more fine-grained explanations for the prediction instead of just the indication of a missing condition itself.

Once such explanation models and/or additional clues are available, we plan to have a larger expert evaluation study, which will not only incorporate a larger number of medical experts, but also a contrastive evaluation of which external clues (e.g., patient records, scientific sources, etc.) is considered the most helpful by the medical experts.

In the medical knowledge graph used in this paper, it is also possible to incorporate edge weights (reflecting, e.g., the frequency at which a particular influence of a condition on a disease is observed). Such weights could be incorporated in the embeddings [32] in order to further improve the results (and, in particular, give more emphasis on more prevalent relations). Moreover, numeric values are currently only utilized via discretization, and we will explore more sophisticated approaches in the future.

In summary, we have shown the potential use of knowledge graphs in medical diagnosis systems, and discussed how knowledge graph embeddings can be utilized to refine those knowledge graphs, while still retaining the explainability and accountability of the overall systems. The method of combining the knowledge graph to be augmented with auxiliary data into an augmented graph has been shown to be an efective means of using external data for knowledge graph refinement, which can also be transferred to other domains. [4] H. Paulheim, Knowledge graph refinement: A survey of approaches and evaluation methods, Semantic web 8 (2017) 489–508. [5] M. Schmachtenberg, C. Bizer, H. Paulheim, Adoption of the linked data best practices in diferent topical domains, in: International Semantic Web Conference, Springer, 2014, pp. 245–260. [6] N. F. Noy, et al., Bioportal: ontologies and integrated data resources at the click of a mouse,

Nucleic acids research 37 (2009) W170–W173. [7] F. Belleau, M.-A. Nolin, N. Tourigny, P. Rigault, J. Morissette, Bio2rdf: towards a mashup to build bioinformatics knowledge systems, Journal of biomedical informatics 41 (2008) 706–716. [8] B. Abu-Salih, Domain-specific knowledge graphs: A survey, Journal of Network and

Computer Applications 185 (2021) 103076. [9] I. Y. Chen, M. Agrawal, S. Horng, D. Sontag, Robustly extracting medical knowledge from ehrs: A case study of learning a health knowledge graph, in: Pacific Symposium on Biocomputing, 2019. [10] P. Ernst, C. Meng, A. Siu, G. Weikum, Knowlife: a knowledge graph for health and life sciences, in: International Conference on Data Engineering, 2014, pp. 1254–1257. [11] M. Rotmensch, Y. Halpern, A. Tlimat, S. Horng, D. Sontag, Learning a health knowledge graph from electronic medical records, Scientific reports 7 (2017) 1–11. [12] M. Wang, J. Zhang, J. Liu, W. Hu, S. Wang, X. Li, W. Liu, Pdd graph: Bridging electronic medical records and biomedical knowledge graphs via entity linking, in: International Semantic Web Conference, Springer, 2017, pp. 219–227. [13] P. Liu, et al., Hkdp: A hybrid knowledge graph based pediatric disease prediction system, in: International Conference on Smart Health, Springer, 2016, pp. 78–90. [14] X. Chai, Diagnosis method of thyroid disease combining knowledge graph and deep learning, IEEE Access 8 (2020) 149787–149795. [15] R. Celebi, E. Yasar, H. Uyar, O. Gumus, O. Dikenelli, M. Dumontier, Evaluation of knowledge graph embedding approaches for drug-drug interaction prediction using linked open data (2018). [16] M. R. Karim, M. Cochez, J. B. Jares, M. Uddin, O. Beyan, S. Decker, Drug-drug interaction prediction based on knowledge graph embeddings and convolutional-lstm network, in: International conference on bioinformatics, computational biology and health informatics, 2019, pp. 113–123. [17] S. Nunes, R. T. Sousa, C. Pesquita, Predicting gene-disease associations with knowledge graph embeddings over multiple ontologies, arXiv preprint arXiv:2105.04944 (2021). [18] R. T. Sousa, S. Silva, C. Pesquita, Supervised biomedical semantic similarity, bioRxiv (2021). [19] World Health Organization, The ICD-10 classification of mental and behavioural disorders: clinical descriptions and diagnostic guidelines, 1992. [20] IHTSDO, SNOMED CT Starter Guide, IHTSDO, 2018. [21] A. W. Forrey, et al., Logical observation identifier names and codes (loinc) database: a public use set of codes and names for electronic reporting of clinical laboratory test results., Clinical chemistry 42 (1996) 81–90. [22] A. Johnson, L. Bulgarelli, T. Pollard, S. Horng, L. A. Celi, R. Mark, MIMIC-IV, ???? [23] A. L. Goldberger, et al., PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals., Circulation 101 (2000). [24] P. Ristoski, J. Rosati, T. Di Noia, R. De Leone, H. Paulheim, Rdf2vec: Rdf graph embeddings and their applications, Semantic Web 10 (2019) 721–752. [25] M. Cochez, P. Ristoski, S. P. Ponzetto, H. Paulheim, Biased graph walks for rdf graph embeddings, in: International Conference on Web Intelligence, Mining and Semantics, 2017, pp. 1–12. [26] B. Steenwinckel, et al., Walk extraction strategies for node embeddings with rdf2vec in knowledge graphs, in: International Conference on Database and Expert Systems Applications, Springer, 2021, pp. 70–80. [27] J. Portisch, M. Hladik, H. Paulheim, Rdf2vec light–a lightweight approach for knowledge graph embeddings, in: International Semantic Web Conference (Posters, Demos, and Industry Tracks, 2020. [28] T. Mikolov, K. Chen, G. Corrado, J. Dean, Eficient Estimation of Word Representations in

Vector Space (2013). URL: http://arxiv.org/abs/1301.3781. [29] J. McEntyre, D. Lipman, Pubmed: bridging the information gap, Cmaj 164 (2001) 1317–1319. [30] W. Zhang, B. Paudel, W. Zhang, A. Bernstein, H. Chen, Interaction embeddings for prediction and explanation in knowledge graphs, in: International Conference on Web Search and Data Mining, 2019, pp. 96–104. [31] C. Meilicke, M. W. Chekol, D. Rufinelli, H. Stuckenschmidt, Anytime bottom-up rule learning for knowledge graph completion., in: IJCAI, 2019, pp. 3137–3143. [32] A. A. Taweel, H. Paulheim, Towards exploiting implicit human feedback for improving rdf2vec embeddings, arXiv preprint arXiv:2004.04423 (2020).

[1]

Hogan , et al., Knowledge

graphs

, Synthesis Lectures on Data, Semantics, and Knowledge 12 ( 2021 ) 1 - 257 .

[2]

Portisch ,

Heist ,

Paulheim , Knowledge graph embedding for data mining vs. knowledge graph embedding for link prediction - two sides of the same coin? , Semantic Web 13 ( 2022 ).

[3]

Wang ,

Mao ,

Wang ,

Guo , Knowledge graph embedding: A survey of approaches and applications , IEEE Transactions on Knowledge and Data Engineering 29 ( 2017 ) 2724 - 2743 .