<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Distribution Semantics for Probabilistic Ontologies</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Elena Bellodi</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Evelina Lamma</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fabrizio Riguzzi</string-name>
          <email>fabrizio.riguzzig@unife.it</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Simone Albani</string-name>
          <email>simone.albani@student.unife.it</email>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Ferrara</institution>
          ,
          <addr-line>Via Saragat 1, I-44122, Ferrara</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>We present DISPONTE, a semantics for probabilistic ontologies that is based on the distribution semantics for probabilistic logic programs. In DISPONTE each axiom of a probabilistic ontology is annotated with a probability. The probabilistic theory de nes thus a distribution over normal theories (called worlds) obtained by including an axiom in a world with a probability given by the annotation. The probability of a query is computed from this distribution with marginalization. We also present the system BUNDLE for reasoning over probabilistic OWL DL ontologies according to the DISPONTE semantics. BUNDLE is based on Pellet and uses its capability of returning explanations for a query. The explanations are encoded in a Binary Decision Diagram from which the probability of the query is computed.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Representing probabilistic knowledge and reasoning with it is fundamental in
order to realize the full vision of the Semantic Web, due to the ubiquity of
uncertainty in the real world and on the Web [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ]. Various authors have advocated
the use of probabilistic ontologies, see e.g. [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], and many proposals have been
put forward for allowing ontology languages, and OWL in particular, to represent
uncertainty.
      </p>
      <p>
        Similarly, in the eld of logic programming, there has been much work on
introducing uncertainty in the programs. Among the various proposals, the
distribution semantics [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] has emerged as one of the most e ective approaches and
it underlies many languages such as PRISM [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ], ICL [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ], Logic Programs with
Annotated Disjunctions [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ] and ProbLog [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. In this semantics a probabilistic
logic program de nes a probability distribution over a set of normal logic
programs (called worlds). The distribution is extended to a joint distribution over
worlds and queries; the probability of a query is obtained from this distribution
by marginalization. In general, the problem of integrating logic and probability
has been much studied lately, with proposals such as Markov Logic [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ], Multi
Entity Bayesian Networks [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] and Probabilistic Relational Models [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>In this paper we propose to apply this approach to ontology languages and,
in particular, to the OWL DL fragment, that is based on the description logic
SHOIN (D). However, the approach is applicable in principle to any description
logic. We called the approach DISPONTE for \DIstribution Semantics for
Probabilistic ONTologiEs" (Spanish for \get ready"). The idea is to annotate each
axiom of a theory with a probability and assume that each axiom is
independent of the others. A probabilistic theory de nes thus a distribution over normal
theories (worlds) obtained by including an axiom in a world with a probability
given by the annotation. The probability of a query is again computed from this
distribution with marginalization.</p>
      <p>
        We also present the system BUNDLE for \Binary decision diagrams for
Uncertain reasoNing on Description Logic thEories" that performs inference over
probabilistic OWL DL ontologies. BUNDLE uses the inference techniques
developed for probabilistic logic programs under the distribution semantics [
        <xref ref-type="bibr" rid="ref21 ref8">8,21</xref>
        ]
and, in particular, the use of Binary Decision Diagrams (BDDs) for encoding
explanations to queries and for computing their probability.
      </p>
      <p>
        BUNDLE is based on the Pellet reasoner [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ] for OWL DL and exploits its
capability of returning explanations for queries in the form of a set of sets of
axioms from which BUNDLE builds a BDD for computing the probability. In
this way we provide an e ective reasoning system for DISPONTE.
      </p>
      <p>The paper is organized as follows. Section 2 describes the distribution
semantics for logic programs while Section 3 presents DISPONTE. Section 4 illustrates
BUNDLE and Section 5 discusses current limitations of DISPONTE and
BUNDLE. Section 6 describes related works while Section 7 concludes the paper.
2</p>
    </sec>
    <sec id="sec-2">
      <title>The Distribution Semantics in Probabilistic Logic</title>
    </sec>
    <sec id="sec-3">
      <title>Programming</title>
      <p>
        The probabilistic logic programming languages based on the distribution
semantics di er in the way they de ne the distribution over logic programs. Each
language allows probabilistic choices among atoms in clauses. Let us consider
ProbLog [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] which is the language with the simplest syntax. A ProbLog
program T is composed of a normal logic program TC and a set of probabilistic
facts TP . Each probabilistic fact is of the form pi :: Fi: where pi is a probability
(i.e. pi 2 [0; 1]) and Fi is a atom. This means that every grounding of Fi is a
Boolean random variable that assumes true value with probability pi and false
with probability 1 pi.
      </p>
      <p>Let us call TF the set of atoms obtained by removing the probabilistic
annotation from the probabilistic facts. Let us consider the case in which TC [ TF
does not contain function symbols so that its Herbrand base is nite. Let us call
ground(T ) the grounding of a normal program T . Since there are no function
symbols, ground(TC [ TF ) is nite and so is the grounding ground(TF ) obtained
by grounding the probabilistic atoms with constants from the Herbrand universe
of TC [ TF . So each probabilistic fact Fi has a nite set of groundings.</p>
      <p>A substitution is a set of couples V =c where V is a variable and c is a constant.
A substitution j is applied to a logic atom F , indicated with F j , by replacing
the variables in the substitution with constants. A substitution j is grounding
for logic atom F if F j is ground. Suppose that a grounding is obtained with
the substitution j : Fi j corresponds to a Boolean random variable Xij that is
independent of the others.</p>
      <p>Example 1. The following ProbLog program T encodes a very simple model of
the development of an epidemic or pandemic:</p>
      <p>C1 = epidemic : f lu(X); epid(X); cold:
C2 = pandemic : f lu(X); n + epid(X); pand(X); cold:
C3 = f lu(david):
C4 = f lu(robert):
F1 = 0:7 :: cold:
F2 = 0:6 :: epid(X):
F3 = 0:3 :: pand(X):</p>
      <p>This program models the fact that if somebody has the u and the climate
is cold there is the possibility that an epidemic or a pandemic arises. We are
uncertain whether the climate is cold but we know for sure that David and
Robert have the u. epid(X) and pand(X) can be considered as "probabilistic
activators" of the e ects in the head given that the causes (f lu(X) and cold)
are present. n + epid(X) means the negation of epid(X).</p>
      <p>Fact F1 has only one grounding so there is a single Boolean variable X11. Fact
F2 has two groundings, epid(david) and epid(robert) so there are two Boolean
random variables X21 and X22. F3 also has two groundings so there are two
Boolean random variables X31 and X32.</p>
      <p>In order to present the distribution semantics, let us rst give some de nitions.
An atomic choice is a selection of a value for a grounding of a probabilistic
fact F and is represented by the triple (Fi; j ; k) where j is a substitution
grounding Fi and k 2 f0; 1g. A set of atomic choices is consistent if (Fi; j ; k) 2
; (Fi; j ; m) 2 ) k = m, i.e., only one truth value is selected for a ground
fact. A composite choice is a consistent set of atomic choices. The probability
of composite choice is P ( ) = Q(Fi; j;1)2 pi Q(Fi; j;0)2 (1 pi). A selection
is a total composite choice (one atomic choice for every grounding of every
probabilistic fact). A selection identi es a normal logic program w called a
world in this way: w = TC [ fFi j j(Fi; j ; 1) 2 g. The probability of w is
P (w ) = P ( ) = Q(Fi; j;1)2 pi Q(Fi; j;0)2 (1 pi). Since ground(TF ) is nite
the set of worlds is nite: WT = fw1; : : : ; wmg and P (w) is a distribution over
worlds: Pw2WT P (w) = 1. A world w is compatible with a composite choice
if</p>
      <p>We can de ne the conditional probability of a query Q given a world as
P (Qjw) = 1 if w j= Q and 0 otherwise. This allows to de ne a joint distribution
of the query and the worlds P (Q; w) by using the product rule of the theory of
probability: P (Q; W ) = P (Qjw)P (w). The probability of Q can then be obtained
from the joint distribution by the sum rule (marginalization over Q):
P (Q) =</p>
      <p>X P (Q; w) =
w2WT</p>
      <p>X P (Qjw)P (w) =
w2WT</p>
      <p>X
w2WT :wj=Q</p>
      <p>P (w)
(1)
In Example 1, T has 5 Boolean random variables and thus 32 worlds. The query
epidemic is true in 5 of them and its probability is P (epidemic) = 0:588.</p>
      <p>
        It is often unfeasible to nd all the worlds where the query is true so inference
algorithms nd instead explanations for the query [
        <xref ref-type="bibr" rid="ref21 ref8">8,21</xref>
        ] , i.e. composite choices
such that the query is true in all the worlds that are compatible with them . For
example, 1 = f(F2; fX=davidg; 1); (F1; fg; 1)g is an explanation for the query
epidemic and so is 2 = f(F2; fX=robertg; 1); (F1; fg; 1)g.
      </p>
      <p>Each explanation identi es a set of worlds, those that are compatible with
it, and a set of explanations K identi es the set !K of worlds compatible with
one of its explanations (!K = fw j 2 K; g). A set of explanations K is
covering for a query Q if every world in which Q is true is in !K . For example,
K = f 1; 2g is covering for the query epidemic.</p>
      <p>The probability of a query can thus be computed from a covering set of
explanations for the query by computing the probability of the Boolean formula
B(Q) = _</p>
      <p>^
2K (Fi; j;1)2</p>
      <p>Xij</p>
      <p>^
(Fi; j;0)2
:Xij
(2)
For Example 1, the formula is B(epidemic) = X11 ^ X21 _ X11 ^ X22.</p>
      <p>
        Explanations however, di erently from possible worlds, are not necessarily
mutually exclusive with respect to each other, so the probability of the query can
not be computed by a summation as in (1). In fact computing the probability
of a DNF formula of independent Boolean random variables is a #P-complete
problem [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ]. The method that was found to be the most e cient up to now
consists in building a Binary Decision Diagram for the formula and using a
dynamic programming algorithm on the BDD [
        <xref ref-type="bibr" rid="ref21 ref8">8,21</xref>
        ]. A BDD is a rooted graph
that has one level for each variable. Each node n has two children, a 0-child and
a 1-child. The leaves store either 0 or 1. Given values for all the variables, a
BDD can be used to compute the value of the formula by traversing the graph
starting from the root, following the edges corresponding to the variables values
and returning the value associated to the leaf that is reached. The BDD for
Example 1 is shown in Figure 1.
      </p>
      <p>X21
X22
X11
n3
1
n1
n2
0</p>
      <p>A BDD performs a Shannon expansion of the Boolean formula f (X), so
that if X is the variable associated to the root level of a BDD, the formula
f (X) can be represented as f (X) = X ^ f X (X) _ :X ^ f :X (X) where f X (X)
(f :X (X)) is the formula obtained by f (X) by setting X to 1 (0). Now the two
disjuncts are mutually exclusive and the probability of f (X) can be computed as
P (f (X)) = P (X)P (f X (X))+(1 P (X))P (f :X (X)) Figure 2 shows the function
Prob that implements the dynamic programming algorithm for computing the
probability of a formula encoded as a BDD.</p>
      <p>
        Languages with non-binary choices such as Logic Programs with Annotated
Disjunctions can be handled by encoding the choices with binary variables [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ].
3
      </p>
    </sec>
    <sec id="sec-4">
      <title>The DISPONTE Semantics for Probabilistic Ontologies</title>
      <p>DISPONTE assigns a semantics to probabilistic ontologies following the
approach of the distribution semantics for probabilistic logic programs. It de nes
a probability distribution over non-probabilistic ontologies called worlds. This
probability distribution is extended to a joint distribution of the worlds and a
query and the probability of the query is obtained by marginalization.</p>
      <p>The probabilistic ontologies we consider associate to each axiom of the
ontology a Boolean random variable that indicates whether the axiom is present
in a world. A probabilistic ontology is thus a set of annotated axioms of the form
pi :: Ai
(3)
or of unannotated axioms of the form Ai, for i = 1; : : : ; n, where pi is the
probability with which axiom Ai is included in a world. Let us call OA the set
fA1; : : : ; Ang and Xi the Boolean random variable associated to axiom Ai. Each
Xi is independent of every Xj with i 6= j. The probability of each Xi of being
true is pi. If the pi :: annotation is omitted for an axiom, we assume that the
axiom is certain, i.e., that it has probability 1.</p>
      <p>A world w is obtained by sampling a value for Xi for every axiom Ai of OA
and by including Ai in w if Xi = 1. Since the random variables for the di erent
axioms are independent, the probability P (w) of w is obtained as:
P (w) = Y pi</p>
      <p>Y
(1</p>
      <p>pj )
Ai2w</p>
      <p>Aj2OAnw
Given a query Q to O, we can de ne its conditional probability of being true
given a world P (Qjw) in the following intuitive way: P (Qjw) = 1 if w j= Q and
P (Qjw) = 0 if w 6j= Q.</p>
      <p>The probability P (Q) can be obtained from the joint distribution of the query
and the worlds by the sum rule:</p>
      <p>P (Q) = X P (Q; w) = X P (Qjw)P (w) =
w w</p>
      <p>X P (w)
w:wj=Q
Similarly to the case of probabilistic logic programming, the probability of a
query Q given a probabilistic ontology O can be computed by rst nding the
explanations for Q in O. An explanation in this context is a subset of axioms of
O that is su cient for entailing Q. Typically minimal explanations are sought
for e ciency reasons. All the explanations for Q must be found, corresponding
to all ways of proving Q. Let EQ be set of explanations and e be an explanation
from EQ. The probability of Q can be obtained by computing the probability of
the DNF formula</p>
      <p>F (Q) = _</p>
      <p>
        ^ pi
e2EQ Ai2e
Example 2. This example is inspired by Examples 3.1, 4.1, 4.2 and 4.3 of [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] that
describe a probabilistic ontology about cars. We know for sure that a SportCar
is a Car to which a max speed greater than 245Km/h is associated:
      </p>
      <p>SportsCar v Car u 9max speed: 245Km=h
We also know that a Car is a subset of the class of vehicles HasF ourW heels
with probability 0.9:</p>
      <p>0:9 :: Car v HasF ourW heels
Please note that this does not mean that a member of the class Car is a member
of HasF ourW heels with probability 0.9, see Section 5. johns car is an instance
of SportsCar with probability 0.8:</p>
      <p>0:8 :: johns car : SportsCar
We want to know what is the probability P (Q1) of axiom Q1 = johnsCar :
HasF ourW heels being true. Q1 has a single explanation containing the axioms
(4), (5) and (6). Since (4) is certain, P (Q1) is 0:8 0:9 = 0:72.</p>
      <p>
        Example 3. Let us consider another example, inspired by the people+pets
ontology proposed in [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. We know that kevin is a DogOwner with probability
0.6 and a CatOwner with probability 0.6:
0:6 :: kevin : DogOwner;
0:6 :: kevin : CatOwner:
(4)
(5)
(6)
(7)
Moreover we know for sure that DogOwner and CatOwner are subclasses of
P etOwner
      </p>
      <sec id="sec-4-1">
        <title>DogOwner v P etOwner</title>
      </sec>
      <sec id="sec-4-2">
        <title>CatOwner v P etOwner</title>
        <p>(9)
(10)
Then the query axiom Q2 = kevin : P etOwner has two explanations, one
composed of the axioms (7) and (9) and the other composed of the axioms (8) and
(10). Since (9) is certain, the probability of the rst explanation is 0.6.
Similarly, the probability of the second explanation is again 0.6. If we associate the
Boolean random variable X1 to (7) and X2 to (8), the query axiom is true if
the formula X1 _ X2 is true. Thus, P (Q2) = P (X1 _ X2). Since X1 and X2
are independent, we get P (Q2) = 0:6 + 0:6 0:6 0:6 = 0:84: As you can see,
the fact that kevin is an instance of both DogOwner and CatOwner increases
the probability that he is an instance of P etOwner: if he were an instance of
DogOwner only, its probability of being a P etOwner would be 0.6 and similarly
if he were an instance of CatOwner only.</p>
        <p>Now suppose that we known that P etOwner is a subclass of Ecologist with
probability 0.7:
0:7 :: P etOwner v Ecologist
(11)
The query axiom Q3 = kevin : Ecologist has again two explanations, one
composed of axioms (7), (9) and (11) and the other composed of the axioms
(8), (10) and (11). Since (9) is certain, the probability of the rst
explanation is 0:4 0:6 = 0:24. Similarly, the probability of the second explanation is
0:5 0:6 = 0:3. If we associate the Boolean random variable X3 to (11), Q3 is
a consequence of the theory if X1 ^ X3 _ X2 ^ X3 is true. A BDD that can be
built for this formula is the one shown in Figure 1 after replacing variable X21
with X1, variable X22 with X2 and variable X11 with X3.</p>
        <p>The probability of node n3 computed by Prob is 0:7 1 + 0:3 0 = 0:7. The
probability of node n2 is 0:6 0:7 + 0:4 0 = 0:42 and the probability of node
n1 (and of Q3) is 0:6 0:7 + 0:4 0:42 = 0:588.
4</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>The BUNDLE System</title>
      <p>
        BUNDLE computes the probability of a query Q given a probabilistic ontology
O that follows the DISPONTE semantics. BUNDLE exploits an underlying
ontology reasoner that is able to return all explanations for a query. One of these
system is Pellet [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ] that is a complete OWL-DL reasoner. Pellet takes as input
an OWL ontology in various formats, including the RDFXML language.
      </p>
      <p>
        In order to assign probabilities to axioms, we exploit the possibility given by
OWL1.1 of declaring an annotation property for axioms. We thus annotate the
axioms with the XML tag bundle:probability whose value should be a real
number in [
        <xref ref-type="bibr" rid="ref1">0,1</xref>
        ].
      </p>
      <p>BUNDLE takes as input two RDFXML les, one containing the ontology
and one containing the annotations. For Example 3, the ontology le contains
the following de nition of P etOwner:
&lt;owl:Class rdf:about="#PetOwner"&gt;
&lt;rdfs:subClassOf&gt;</p>
      <p>&lt;owl:Class rdf:about="#Ecologist" /&gt;
&lt;/rdfs:subClassOf&gt;
&lt;/owl:Class&gt;
The annotation le contains the annotation for the above axiom in the following
form:
&lt;owl11:Axiom&gt;
&lt;rdf:subject rdf:resource="#PetOwner"/&gt;
&lt;rdf:predicate rdf:resource="&amp;rdfs;subClassOf"/&gt;
&lt;rdf:object rdf:resource="#Ecologist"/&gt;
&lt;bundle:probability&gt;0.6&lt;/bundle:probability&gt;
&lt;/owl11:Axiom&gt;
BUNDLE rst uses the annotation le for building a data structure P M ap
that associates axioms with their probability. In order to do so, axioms are
rst converted to strings. We use the Manchester syntax to obtain a string
representation of an axiom.</p>
      <p>Then BUNDLE uses the Explain function of Pellet to compute explanations
for a query axiom. BUNDLE thus accepts all the forms of query axioms that
are accepted by Pellet's Explain function, namely subclass, instance, property
value, theory inconsistency and class unsatis ability.</p>
      <p>Pellet returns the explanations for the query in the form of a set of sets of
axioms. Then BUNDLE performs a double loop over the set of explanations and
over the set of axioms in each explanation in which it builds a BDD representing
the set of explanations. To manipulate BDDs we used the JavaBDD library1
that provides a Java interface to the major BDD libraries such as CUDD2.</p>
      <p>Outside the outer loop, two data structures are initialized: V arAxAnn is an
array that maintains the association between Boolean random variables (whose
index is the array index) and axioms together with their probability, and BDD
represents the set of explanations. BDD is initialized to the BDD representing
the zero Boolean function. Then the outer loop is entered in which BDDE is
initialized to the BDD representing the one Boolean function. In the inner loop
the axioms of an explanation are considered one by one. Each axiom is rst
looked up in P M ap to get its probability. If NULL is returned this means that
this is a certain axiom and it does not need to be considered anymore. Then
the axiom is searched for in V arAxAnn to see if it has already been assigned
a random variable. If not, a cell is added to V arAxAnn to store the axiom
with its probability. At this point we know the axiom's position i in V arAxAnn
1 http://javabdd.sourceforge.net/
2 http://vlsi.colorado.edu/~fabio/CUDD/
and so the index of its Boolean variable Xi. We obtain a BDD representing
Xi = 1 and we conjoin it with BDDE. At the end of the inner loop the BDD for
the current explanation, BDDE, is disjoined with BDD. After the two cycles,
function Prob of Figure 2 is called over BDD and its result is returned to the
user.</p>
      <p>BUNDLE has been implemented in Java and will be available for download
from http://sites.unife.it/bundle. It has been successfully tested on
various examples, including those of Section 3.
5</p>
    </sec>
    <sec id="sec-6">
      <title>Discussion</title>
      <p>The probabilistic knowledge that can be expressed with the DISPONTE
semantics is epistemic by nature, namely it represents degrees of belief in the axioms
rather that statistical information. While this is reasonable for many axioms, for
subclass and subproperty axioms one may want to express statistical
information, for example with a probabilistic subclass axiom p :: A v B one may want
to express the fact that a random individual of A has probability p of belonging
to B. The DISPONTE semantics, instead, interpret the axioms as stating that
A v B is true with probability p. The di erence is that, if two individuals i and j
belong to class A, the probability that they both belong to B in the DISPONTE
semantics is p while with a statistical interpretation is p p. Thus statistical
information can be used to de ne a degree of partial overlap between classes.
Extending DISPONTE to take account of this case is possible, it requires to
de ne a probability distribution over models rather than over theories.</p>
      <p>However, to reason with such knowledge, the inference engine must be
modied in its inference procedure and cannot be used as a black box as in BUNDLE.
In fact, BUNDLE assigns a single Boolean random variable to the axiom A v B,
while with a statistical interpretation a di erent Boolean random variable must
be assigned to each assertion that an individual of class A belongs to class B.
We leave this extension for future work.</p>
      <p>Another limitation of BUNDLE is the use of the OWL 1.1 Axiom construct
to specify probabilities. This seems to restrict the kind of axioms on which
probabilities can be placed, since the object of the RDF triple does not allow
complex class expressions. However this limitation can be overcome by de ning a
new class which is equivalent to the complex class expression and using the new
class name in the RDF triple. In the future we plan to investigate the possibility
of annotating the axioms directly in the ontology le.</p>
      <p>As regards the complexity of reasoning on DISPONTE, it is equal to the
complexity of the underlying description logic plus the #P complexity of computing
the probability of a DNF formula of independent Boolean random variables,
assuming the cost of keeping track of explanations during inference is negligible.
Thus, the problem of inference in DISPONTE remains decidable if it was so in
the underlying description logic.</p>
    </sec>
    <sec id="sec-7">
      <title>Related Work</title>
      <p>
        Our work di ers from previous work in many respects. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] proposed an
extension of the description logic ALC that is able to express statistical information
on the terminological knowledge such as partial concept overlapping. Similarly,
[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] presents a probabilistic description logic based on Bayesian networks that
deals with statistical terminological knowledge. As illustrated in Section 5,
currently we are not able to express statistical terminological knowledge but it is
possible to extend the semantics to do so. Di erently from us, [
        <xref ref-type="bibr" rid="ref11 ref6">6,11</xref>
        ] do not
allow probabilistic assertional knowledge about concept and role instances. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]
allows assertional knowledge about concept and role instances together with
statistical terminological knowledge and combines the resulting probability
distributions using cross-entropy minimization. In the future we plan to compare the
DISPONTE semantics extended with statistical information with this approach.
      </p>
      <p>
        [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] proposed a probabilistic extension of OWL that admits a translation into
Bayesian networks. The semantics that is proposed assigns a probability
distribution P (i) over individuals, i.e. Pi P (i) = 1, and assigns a probability to a class
C as P (C) = Pi2C P (i), while we assign a probability distribution over
theories. PR-OWL [
        <xref ref-type="bibr" rid="ref1 ref2">2,1</xref>
        ] is an upper ontology that provides a framework for building
probabilistic ontologies. It allows to use the rst-order probabilistic logic MEBN
[
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] for representing uncertainty in ontologies. The use of a full edged rst-order
probabilistic logic distringuishes this work from ours, where we tried to provide
a minimal extension to description logics.
      </p>
      <p>
        A di erent approach to the combination of description logic with probability
is taken by [
        <xref ref-type="bibr" rid="ref13 ref14 ref5">5,13,14</xref>
        ] where the authors use probabilistic lexicographic entailment
from probabilistic default reasoning. The logics proposed in these papers allow
both terminological probabilistic knowledge as well as assertional probabilistic
knowledge about instances of concepts and roles. PRONTO [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] is one of the
systems that allows to perform inference in this semantics.
      </p>
      <p>
        Similary to [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], the terminological knowledge is interpreted statistically while
the assertional knowledge is interpreted epistemically by assigning degrees of
beliefs to assertions, thus di ering from our current treatment of terminological
knowledge. Moreover it also allows to express default knowledge about
concepts that can be overridden in subconcepts and whose semantics is given by
Lehmann's lexicographic default entailment.
      </p>
      <p>
        These works are based on Nilsson's probabilistic logic [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] where a
probabilistic interpretation P r de nes a probability distribution over the set of
interpretations I. The probability of a logic formula according to P r, denoted
P r( ), is the sum of all P r(I) such that I 2 I and I j= .
      </p>
      <p>A probabilistic knowledge base K is a set of probabilistic formulas of the
form p. A probabilistic interpretation P r satis es p i P r( ) p. P r
satis es K, or P r is a model of K, i P r satis es all F 2 K. We say p is a
tight logical consequence of K i p is the in mum of P r( ) subject to all models
P r of K. Computing tight logical consequences from probabilistic knowledge
bases can be done by solving a linear optimization problem.</p>
      <p>Nilsson's probabilistic logic di ers from the distribution semantics: while a
probabilistic knowledge base in Nilsson's logic may have multiple models that
are probabilistic interpretations, a probabilistic program under the distribution
semantics has a single model that de nes a single distribution over
interpretations. Also, while in Nilsson's logic we want to compute the lowest p such that
P r( ) p holds for all P r, in the distribution semantics we want to compute p
such that P ( ) = p. Nilsson's logic complexity is lower than the #P complexity
of the distribution semantics.</p>
      <p>In fact Nilsson's logic allows weaker conclusions than the distribution
semantics. For example, consider a probabilistic program composed of 0:4 :: a: and
0:5 :: b: and a probabilistic knowledge base composed of a 0:4 and b 0:5. The
distribution semantics allows to say that P (a_b) = 0:7, while with Nilsson's logic
the lowest p such that P r(a _ b) p holds is 0.5. This is due to the fact that in
the distribution semantics the probabilistic atoms are considered independent,
which allows to make stronger conclusions. However, note that this does not
restrict expressiveness as you can specify with the distribution semantics any
joint probability distribution over the atoms of the Herbrand base interpreted
as Boolean random variables, possibly introducing new random facts if needed.</p>
      <p>
        Alternative approaches to modeling imperfect and incomplete knowledge in
ontologies are based on fuzzy logic. A good survey of these approaches is
presented in [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ].
7
      </p>
    </sec>
    <sec id="sec-8">
      <title>Conclusions</title>
      <p>We have presented the semantics DISPONTE for probabilistic ontologies that
is inspired by the distribution semantics of probabilistic logic programming. We
have also presented the system BUNDLE that is able to compute the probability
of queries from an uncertain OWL DL ontology.</p>
      <p>In the future, we plan to extend DISPONTE to take into account statistical
terminological knowledge and improve the way in which the input to BUNDLE
is speci ed.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Carvalho</surname>
            ,
            <given-names>R.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Laskey</surname>
            ,
            <given-names>K.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Costa</surname>
            ,
            <given-names>P.C.</given-names>
          </string-name>
          :
          <article-title>PR-OWL 2.0 - bridging the gap to OWL semantics</article-title>
          . In:
          <article-title>International Workshops on Uncertainty Reasoning for the Semantic Web (</article-title>
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Costa</surname>
            ,
            <given-names>P.C.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Laskey</surname>
            ,
            <given-names>K.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Laskey</surname>
            ,
            <given-names>K.J.</given-names>
          </string-name>
          :
          <article-title>Pr-owl: A bayesian ontology language for the semantic web</article-title>
          . In:
          <article-title>International Workshops on Uncertainty Reasoning for the Semantic Web</article-title>
          . vol.
          <volume>5327</volume>
          , pp.
          <volume>88</volume>
          {
          <fpage>107</fpage>
          . Springer (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>De Raedt</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kimmig</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Toivonen</surname>
          </string-name>
          , H.:
          <article-title>ProbLog: A probabilistic Prolog and its application in link discovery</article-title>
          .
          <source>In: International Joint Conference on Arti cial Intelligence</source>
          . pp.
          <volume>2462</volume>
          {
          <issue>2467</issue>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Ding</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peng</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>A probabilistic extension to ontology language OWL</article-title>
          .
          <source>In: Hawaii International Conference On System Sciences. IEEE</source>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Giugno</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lukasiewicz</surname>
          </string-name>
          , T.:
          <string-name>
            <surname>P-SHOQ(D):</surname>
          </string-name>
          <article-title>A probabilistic extension of SHOQ(D) for probabilistic ontologies in the semantic web</article-title>
          .
          <source>In: European Conference on Logics in Arti cial Intelligence. LNCS</source>
          , vol.
          <volume>2424</volume>
          , pp.
          <volume>86</volume>
          {
          <fpage>97</fpage>
          . Springer (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Heinsohn</surname>
          </string-name>
          , J.:
          <article-title>Probabilistic description logics</article-title>
          .
          <source>In: Conference on Uncertainty in Arti cial Intelligence</source>
          . pp.
          <volume>311</volume>
          {
          <fpage>318</fpage>
          . Morgan Kaufmann (
          <year>1994</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Jaeger</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Probabilistic reasoning in terminological logics</article-title>
          .
          <source>In: International Conference on Principles of Knowledge Representation and Reasoning</source>
          . pp.
          <volume>305</volume>
          {
          <issue>316</issue>
          (
          <year>1994</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Kimmig</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Demoen</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Raedt</surname>
            ,
            <given-names>L.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Costa</surname>
            ,
            <given-names>V.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rocha</surname>
          </string-name>
          , R.:
          <article-title>On the implementation of the probabilistic logic programming language problog</article-title>
          .
          <source>Theory Pract</source>
          . Log. Program.
          <volume>11</volume>
          (
          <issue>2-3</issue>
          ),
          <volume>235</volume>
          {
          <fpage>262</fpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Klinov</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Pronto: A non-monotonic probabilistic description logic reasoner</article-title>
          .
          <source>In: European Semantic Web Conference. LNCS</source>
          , vol.
          <volume>5021</volume>
          , pp.
          <volume>822</volume>
          {
          <fpage>826</fpage>
          . Springer (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Koller</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Probabilistic relational models</article-title>
          .
          <source>In: International Workshop on Inductive Logic Programming. LNCS</source>
          , vol.
          <volume>1634</volume>
          , pp.
          <volume>3</volume>
          {
          <fpage>13</fpage>
          . Springer (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Koller</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Levy</surname>
            ,
            <given-names>A.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pfe</surname>
            <given-names>er</given-names>
          </string-name>
          , A.:
          <article-title>P-classic: A tractable probablistic description logic</article-title>
          .
          <source>In: National Conference on Arti cial Intelligence</source>
          . pp.
          <volume>390</volume>
          {
          <issue>397</issue>
          (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Laskey</surname>
          </string-name>
          , K.B.,
          <string-name>
            <surname>da</surname>
            <given-names>Costa</given-names>
          </string-name>
          , P.C.G.:
          <article-title>Of starships and klingons: Bayesian logic for the 23rd century</article-title>
          .
          <source>In: Conference in Uncertainty in Arti cial Intelligence</source>
          . pp.
          <volume>346</volume>
          {
          <fpage>353</fpage>
          . AUAI Press (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Lukasiewicz</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Probabilistic default reasoning with conditional constraints</article-title>
          . Ann. Math. Artif. Intell.
          <volume>34</volume>
          (
          <issue>1-3</issue>
          ),
          <volume>35</volume>
          {
          <fpage>88</fpage>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Lukasiewicz</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Expressive probabilistic description logics</article-title>
          .
          <source>Artif. Intell</source>
          .
          <volume>172</volume>
          (
          <issue>6-7</issue>
          ),
          <volume>852</volume>
          {
          <fpage>883</fpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Lukasiewicz</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Straccia</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Managing uncertainty and vagueness in description logics for the semantic web</article-title>
          .
          <source>J. Web Sem</source>
          .
          <volume>6</volume>
          (
          <issue>4</issue>
          ),
          <volume>291</volume>
          {
          <fpage>308</fpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Nilsson</surname>
            ,
            <given-names>N.J.:</given-names>
          </string-name>
          <article-title>Probabilistic logic</article-title>
          .
          <source>Artif. Intell</source>
          .
          <volume>28</volume>
          (
          <issue>1</issue>
          ),
          <volume>71</volume>
          {
          <fpage>87</fpage>
          (
          <year>1986</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Obrst</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McCandless</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stoutenburg</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fox</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nichols</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Prausa</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sward</surname>
          </string-name>
          , R.:
          <article-title>Evolving use of distributed semantics to achieve net-centricity</article-title>
          .
          <source>In: AAAI Fall Symposium</source>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Patel-Schneider</surname>
            ,
            <given-names>P.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bechhofer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          : Tutorial on
          <string-name>
            <surname>OWL</surname>
          </string-name>
          (
          <year>2003</year>
          ), http: //www.cs.man.ac.uk/~horrocks/ISWC2003/Tutorial/
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Poole</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Abducing through negation as failure: stable models within the independent choice logic</article-title>
          .
          <source>J. of Log. Program</source>
          .
          <volume>44</volume>
          (
          <issue>1-3</issue>
          ),
          <volume>5</volume>
          {
          <fpage>35</fpage>
          (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Richardson</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Domingos</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Markov logic networks</article-title>
          .
          <source>Machine Learning</source>
          <volume>62</volume>
          (
          <issue>1-2</issue>
          ),
          <volume>107</volume>
          {
          <fpage>136</fpage>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Riguzzi</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Extended semantics and inference for the Independent Choice Logic</article-title>
          .
          <source>Log. J. IGPL</source>
          <volume>17</volume>
          (
          <issue>6</issue>
          ),
          <volume>589</volume>
          {
          <fpage>629</fpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Sato</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>A statistical learning method for logic programs with distribution semantics</article-title>
          .
          <source>In: International Conference on Logic Programming</source>
          . pp.
          <volume>715</volume>
          {
          <fpage>729</fpage>
          . MIT Press (
          <year>1995</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Sirin</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grau</surname>
            ,
            <given-names>B.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kalyanpur</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Katz</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Pellet: A practical OWL-DL reasoner</article-title>
          .
          <source>J. Web Sem</source>
          .
          <volume>5</volume>
          (
          <issue>2</issue>
          ),
          <volume>51</volume>
          {
          <fpage>53</fpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <fpage>URW3</fpage>
          -XG:
          <article-title>Uncertainty reasoning for the World Wide Web, nal report</article-title>
          , http: //www.w3.org/2005/Incubator/urw3/XGR-urw3/
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Valiant</surname>
            ,
            <given-names>L.G.</given-names>
          </string-name>
          :
          <article-title>The complexity of enumeration and reliability problems</article-title>
          .
          <source>SIAM J. Comp</source>
          .
          <volume>8</volume>
          (
          <issue>3</issue>
          ),
          <volume>410</volume>
          {
          <fpage>421</fpage>
          (
          <year>1979</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Vennekens</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Verbaeten</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bruynooghe</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Logic programs with annotated disjunctions</article-title>
          .
          <source>In: International Conference on Logic Programming. LNCS</source>
          , vol.
          <volume>3131</volume>
          , pp.
          <volume>195</volume>
          {
          <fpage>209</fpage>
          . Springer (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>