=Paper=
{{Paper
|id=None
|storemode=property
|title=Modelling Structured Domains Using Description Graphs and Logic Programming
|pdfUrl=https://ceur-ws.org/Vol-846/paper_22.pdf
|volume=Vol-846
|dblpUrl=https://dblp.org/rec/conf/dlog/MagkaMH12
}}
==Modelling Structured Domains Using Description Graphs and Logic Programming==
Modelling Structured Domains Using Description Graphs and Logic Programming? Despoina Magka, Boris Motik, and Ian Horrocks Department of Computer Science, University of Oxford Wolfson Building, Parks Road, OX1 3QD, UK 1 Introduction OWL 21 is commonly used to represent objects with complex structure, such as com- plex assemblies in engineering applications [5], human anatomy [13], or the structure of chemical molecules [7]. In order to ground our discussion, we next present a concrete application of the latter kind; however, the problems and the solution that we identify apply to numerous similar scenarios. The European Bioinformatics Institute (EBI) has developed the ChEBI2 ontology— a public dictionary of molecular entities used to enhance interoperability of applica- tions supporting tasks such as drug discovery. In order to automate the classification of molecular entities, ChEBI descriptions were translated into OWL and then classified us- ing automated reasoning. However, this approach was hindered by the fundamental in- ability of OWL to precisely represent the structure of complex molecular entities, as the tree-model property of OWL prevents one from describing non-tree-like relationships using schema axioms. For example, OWL axioms can state that butane molecules have four carbon atoms, but they cannot state that the four atoms in a cyclobutane molecule are arranged in a ring. Please note that this applies to schema descriptions only: the structure of a particular cyclobutane molecule can be represented using class and prop- erty assertions, but the general definition of all cyclobutane molecules—a problem that terminologies such as ChEBI aim to solve—cannot be fully described in OWL. As we show in Section 3, an ontology may therefore fail to entail certain desired consequences. A common solution to this problem is to extend OWL 2 with a rule-based formalism such as SWRL;3 however, this either results in undecidability [9] or requires restrictions in the shape of the rules [8], which typically prevent the rules from axiomatising the re- quired structures. An alternative approach suggests a combination of OWL 2, rules, and description graphs (DGs) [12]—a graphical notation for describing non-tree-like struc- tures. Decidability of reasoning is ensured via a property separation condition and by requiring DGs to be acyclic. Intuitively, the latter means that DGs can describe struc- tures of arbitrary shape, but bounded in size, while the former limits the interaction between the OWL and DG parts, thus preventing multiple DG structures from merg- ing into one structure of (potentially) unbounded size. As reported in [7], DGs solved ? Work supported by EU FP7 SEALS and EPSRC projects ConDOR, ExODA, and LogMap. 1 http://www.w3.org/TR/owl2-overview/ 2 http://www.ebi.ac.uk/chebi/ 3 http://www.w3.org/Submission/SWRL/ only some of the problems related to the representation of structured objects, and our subsequent discussions with EBI researchers revealed the following drawbacks. First, the DG approach does not allow one to define structures based on the absence of certain characteristics. For example, an inorganic molecule is commonly described as ‘a molecule not containing a carbon atom’, which can then be used to classify water as an inorganic molecule. Designing an axiomatisation that produces the desired entail- ment is very cumbersome with the DG approach: apart from stating that ‘each water molecule consists of one oxygen and two hydrogen atoms’, one must additionally state that ‘these three atoms are the only atoms in a water molecule’ and that ‘neither hydro- gen nor oxygen atoms are carbon atoms’. Second, the separation conditions governing the interaction of the OWL 2 and DG components makes the combined language rather difficult to use, as no role can be used in both components. Third, the acyclicity condi- tion from [12] is rather cumbersome: a modeller must add a number of negative class assertions to DGs so as to make any ontology with cyclic implications between DGs unsatisfiable. This solution fails to cleanly separate the semantic consequences of an ontology from the acyclicity check. In response to this critique, in this paper we present a radically different approach to modelling complex objects via a novel formalism that we call Description Graph Logic Programs (DGLP). At the syntactic level, our approach combines DGs, rules, and OWL 2 RL axioms.4 In order to overcome the first problem, we give semantics to our formalism via a translation into logic programs interpreted under stable model seman- tics. As we show in Section 4, the resulting formalism can capture conditions based on the absence of information. Moreover, we address the second problem by ensuring de- cidability without the need for complex property separation conditions. To address the third problem, in Section 5 we discuss existing syntactic acyclicity conditions and argue that they unnecessarily rule out some very simple and intuitively reasonable ontologies. As a remedy, we present a novel semantic acyclicity condition. Roughly speaking, a precedence relation describing allowed implications between DGs is specified by the modeller; a cyclic ontology that is not compatible with this precedence relation entails a special propositional symbol. A cyclic ontology can still entail useful consequences, but termination of reasoning can no longer be guaranteed. In Section 6 we consider the problem of reasoning with negation-free ontologies and ontologies with stratified negation. We show that the standard bottom-up evaluation of logic programs can de- cide the relevant reasoning problems for semantically acyclic ontologies, and that it can also decide whether an ontology is semantically acyclic. In Section 7 we briefly discuss a preliminary evaluation of our formalism which indicates that reasoning with DGLP ontologies is practically feasible. Proofs of the technical results can be found online.5 2 Preliminaries We assume the reader to be familiar with OWL and description logics; for brevity, we write OWL axioms using the DL notation. Let Σ = (ΣC , ΣF , ΣP ) be a first-order logic signature, where ΣC , ΣF , and ΣP are countably infinite sets of constant, function, and 4 http://www.w3.org/TR/owl-profiles/ 5 http://www.cs.ox.ac.uk/isg/people/despoina.magka/pubs/reports/DGLPTechnicalReport.pdf predicate symbols, respectively, and where ΣP contains the 0-ary predicate ⊥. The arity of a predicate A is given by ar (A). A vector t1 , . . . , tn of first-order terms is often abbreviated as ~t. An atom is a first-order formula of the form A(~t), where A ∈ ΣP and ~t is a vector of the terms t1 , . . . , tar (A) . A rule r is an implication of the form B1 ∧ . . . ∧ Bn ∧ not Bn+1 ∧ . . . ∧ not Bm → H1 ∧ . . . ∧ H` (1) where H1 , . . . , H` are atoms, B1 , . . . , Bm are atoms different from ⊥, m ≥ 0, and ` > 0. Let head (r) = {Hi }1≤i≤` , body + (r) = {Bi }1≤i≤n , body − (r) = {Bi }n