1. Introduction

Meta-reasoning over OWL 2 QL using Datalog

Haya Majid Qureshi

Wolfgang Faber

0 0 University of Klagenfurt , Austria

181 187

There has been increasing interest in enriching ontologies with meta-modeling and meta-querying for the past few years. Unfortunately, the Direct Semantics for OWL2 and SPARQL does not support meta-constructs in a satisfactory way: While meta-axioms can be syntactically expressed using punning, they are not treated as expected semantically. Meta-queries (for example, asking for classes that also occur as individuals) are not defined in SPARQL under the Direct Semantics Entailment Regime. To overcome this, a new semantic flavour for SPARQL, called Metamodeling Semantics Entailment Regime (MSER), has been introduced. In previous work, Cima et al. have proposed a reduction from OWL 2 QL query answering to query answering over Datalog. In this paper, we report on experiments for MSER query answering conducted with various Datalog engines.

eol>Meta-Reasoning Ontology SPARQL Datalog

1. Introduction

2 QL. It provides a reduction from query-answering over OWL 2 QL to Datalog queries and reported experimental results using two Datalog engines, Logicblox and RDFox. This work summarises results obtained in [ 4 ], in which more Datalog engines have been evaluated. Some additional experimental results obtained since the publication of [ 4 ] are also included.

This work aims to reflect the idea of query answering under MSER that addresses the feasibility challenge for OWL 2 QL ontology language with (or without) the distinct flavour of metamodeling in Datalog back-end tools. Also, we investigated the problem of typing constraints by DSER via posing meta-queries to OWL 2 QL theories and evaluating the performance of these queries in Datalog engines.

2. Preliminaries

ment Regime (MSER) from [ 3 ].

In this section, we briefly recall query answering under the Meta-modelling Semantics EntailIn MSER, SPARQL query answering over OWL 2 QL ontologies is reduced to Datalog query ℛ answering. It defines (i) a translation function mapping OWL 2 QL axioms to Datalog facts is summarised Table 1 and (ii) a fixed rule base

that captures inferences in OWL 2 QL reasoning (the full set of rules is available at https://git-ainf.aau.at/Haya.Qureshi/mhf-algo-testing). This representation is closer to a meta-programming representation than other Datalog embeddings that translate each axiom to a rule.

3. Experiments

In this section we briefly describe the experiments that we have conducted, including the tools we used, the ontologies and queries we considered, and report on the outcomes. For a detailed discussion, see [ 4 ]. All material is available at https://git-ainf.aau.at/Haya.Qureshi/ mhf-algo-testing. We have implemented MSER in Java. For the Datalog back-end, we have evaluated nine tools, which stem from diferent paradigms. These tools are: RDFox, LogicBlox, XSB, Clingo, DLV2, DLVHex, HexLite, Alpha and NoHR .

Our experiments are based on the widely used Lehigh University Benchmark (LUBM)1 dataset (with 1 and 9 universities) and Making Open Data Efectively USable (MODEUS) 2 ontologies in four sizes.

The LUBM datasets describe a university domain with information like departments, courses, students, and faculty. This dataset comes with 14 queries with diferent characteristics (low selectivity vs high selectivity, implicit relationships vs explicit relationships, small input vs large input, etc.).

The MODEUS ontologies describe the Italian Public Debt domain with information like ifnancial liability or financial assets to any given contracts [ 5 ]. It comes with 8 queries. These queries are pure meta-queries as they span over several levels of the knowledge base. MODEUS ontologies are meta-modelling ontologies with meta-classes and meta-properties.

We ran experiments on a Linux batch server, running Ubuntu 20.04.3 LTS (GNU/Linux 5.4.088-generic x86_64) on one AMD EPYC 7601 (32-Core CPU), 2.2GHz, Turbo max. 3.2GHz. The machine is equipped with 515GB RAM and a 4TB hard disk. Java applications used OpenJDK 11.0.11 with a maximum heap size of 25GB. For each query, we have limited RAM to 8GB and runtime to 15 minutes. OFT and OFM refer to exceeding the time and memory limits, respectively. 3.1. Results We next report the results of our experiments. All reported times are in seconds and include loading the Datalog program including facts and rules and answering the query. The best performance for each query is highlighted in bold face.

In Tables 2 we report the performance on standard queries over LUBM, respectively. While for the smaller ontology almost all queries could be answered by all systems within the resource limits, performance varies considerably. This situation is exasperated for the larger ontology, for which LogicBlox, NoHR, and Alpha could not answer any of the queries. On the other hand, Clingo and DLV2 exhibit consistently fast performance.

In Table 3, we have considered the meta-queries mq1, mq4, mq5, and mq10 from [ 6 ] as they contain variables in-property positions and are long conjunctive queries. We have also considered two special-case queries sq1 and sq2 from [ 3 ] to exercise the MSER features and identify the new challenges introduced by the additional expressivity over the ABox queries. Basically, in special-case queries, we check the impact of DISJOINTWITH and meta-classes in a query. For this, like in [ 3 ], we have introduced a new class named TypeOfProfessor and make FullProfessor, AssociateProfessor and AssistantProfessor an instance of this new class and also we define FullProfessor, AssociateProfessor and AssistantProfessor to be disjoint from each other. Then, in sq1 we are asking for all those and , where is a professor, is a type of professor and is an instance of . In sq2, we have asked for diferent pairs of professors. 1http://swat.cse.lehigh.edu/projects/lubm/ 2http://www.modeus.uniroma1.it/modeus/node/6 It can be seen in Table 3 that the overall performance of meta-query evaluation is similar to the one in Table 2. Clingo and DLV2 exhibits the regular performance. XSB and RDFox shows the good performance on LUBM(1) but their performance get efected by the size of ontology. On the other hand, LogicBlox, NoHR, Alpha, HexLite and DLVHex shows slower performance but deteriorates with the size of the ontology.

In Table 4 we report the performance on the larger MODEUS queries. It can be seen immediately that many of the systems struggle considerably with these. Some considerations on the causes of this are: The MODEUS dataset consists of meta-layers, which appear to cause many tools to do more inferencing. We also conjecture that the presence of many disjoint axioms causes particularly many inferences.

On the positive side, DLV2 and XSB exhibit acceptable performance for these queries, with DLV2 being the best overall performer. DLV2 exhibits very stable performance with roughly the same execution time for all queries, which is quite remarkable. We assume that the magic set technique implemented in DLV2 has a huge impact here. The time is afected slightly by the size of the dataset, which is expected, though. XSB uses a top-down evaluation and therefore has similar advantages as the magic set technique.

Interestingly, we believe that at least LogicBlox (and perhaps also RDFox) also implements a magic set technique, yet does not seem to be able to take advantage from it. We conjecture that those systems build quite complicated and large datastructures for the Datalog program, for instance various indices. These systems might perform better when huge amounts of memory are available and several queries are posed over the same program without reloading it.

4. Conclusion

In this work, we have tested several Datalog engines on OWL 2 QL MSER query answering without any restriction, as defined in [ 3 ]. While most tools are able to answer queries over smaller ontologies, scalability seems to be an issue for many of them. However, there are some exceptions, notably XSB and DLV2, which also show good performance over large and complex ontologies. Indeed, our experiments show that DLV2 appears to be a promising back-end for meta-querying over OWL 2 QL.

We show that query answering under Datalog reduction of MSER with metamodeling and meta-querying feature is feasible for some tools (or, in our case, just DLV2). At the same time, some sufer from the existence of meta-axioms over several layers. The meta-queries over LUBM do not include meta-axioms. However, most tools could perform well despite the metamodeling capabilities associated with the query language that extracts the information spanning several levels of an ontology. On the other hand, some tools could perform with MSER without the metamodeling feature in ontologies and with standard queries, while others get afected by the size of the ontology.

[1]

Motik , On the properties of metamodeling in OWL , Journal of Logic and Computation 17 ( 2007 ) 617 - 637 . doi: 10 .1093/logcom/exm027.

[2]

Glimm , Using SPARQL with RDFS and OWL Entailment , in: Reasoning Web International Summer School , Springer, 2011 , pp. 137 - 201 .

[3]

Cima , G. De Giacomo,

Lenzerini ,

Poggi , On the SPARQL metamodeling semantics entailment regime for OWL 2 QL ontologies , in: Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics , 2017 , pp. 1 - 6 .

[4]

H. M.

Qureshi ,

Faber , An Evaluation of Meta-reasoning over OWL 2 QL , in: International Joint Conference on Rules and Reasoning , Springer, 2021 , pp. 218 - 233 .

[5]

Lenzerini ,

Lepore ,

Poggi , Metaquerying made practical for OWL 2 QL ontologies , Information Systems 88 ( 2020 ) 101294 .

[6]

Kontchakov ,

Rezk ,

Rodriguez-Muro ,

Xiao ,

Zakharyaschev , Answering SPARQL queries over databases under OWL 2 QL entailment regime , in: International Semantic Web Conference, Springer, 2014 , pp. 552 - 567 .