<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>On Correspondences between Probabilistic First-Order and Description Logics</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Pavel Klinov</string-name>
          <email>pklinov@cs.man.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bijan Parsia</string-name>
          <email>bparsia@cs.man.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ulrike Sattler</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>School of Computer Science, University of Manchester</institution>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper analyzes the probabilistic description logic PSHIQ by looking at it as a fragment of probabilistic rst-order logic with semantics based on possible worlds. We argue that this is an appropriate way of investigating its properties and developing extensions. We show how the previously made arguments about di erent types of rst-order probabilistic semantics apply to P-SHIQ. This approach has advantages for the future of both P-SHIQ, which can further evolve by incorporating semantic theories developed for the full rst-order case, and the rst-order logic, for which very few interesting decidable fragments are currently known. The paper also presents a probabilistic logic P-SHIQ+ which addresses some of the identi ed limitations of P-SHIQ.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>A common complaint about Description Logics (DLs), especially as the basis
for ontology languages, is that they fail to support non-classical representations
of uncertainty. While DLs typically can represent and reason with some forms
of uncertainty (e.g., with existential or disjunctive information) in a variety of
ways , until recently they have not handled probability.</p>
      <p>
        One answer to this complaint is the P-SH family of logics which allow for the
incorporation of probabilistic formulae as an extension of the familiar and widely
used SH DLs. Unlike Bayesian extensions to DLs (e.g., [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]), the P-SH family
consists of purely logical extensions to the semantics and inference services.
These logics are also decidable, generally of the same worst case complexity as
the base logic, and can be implemented on top of existing DL reasoners [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>However, there are several issues with the P-SH family both from an
expressivity and from a theoretical point of view. First, it has not been fully clear
whether it properly combines statistical and subjective probabilities. Second,
probabilistic ABoxes have a number of strong and strange restrictions (including
no probabilistic roles and only one probabilistic individual per ABox). Finally,
the semantics of the P-SH family (in terms of possible worlds) is not particularly
familiar in the DL setting.</p>
      <p>
        It is standard to analyze DLs by considering them as fragments of
rstorder logic. In this paper, we apply this methodology to the analysis of the
P-SH family of probabilistic DLs by considering them as fragments of FOPL2
(FOL with possible world based semantics [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]). This analysis yields insight into
the semantics of P-SHIQ and into principled ways of extending it to address
troublesome limitations.
1.1
      </p>
      <p>
        P-SHIQ
P-SHIQ [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] is a probabilistic generalization of the DL SHIQ. It has a number
of properties that make it an attractive formalism for managing uncertainty
in OWL ontologies. First, it is very expressive as it supports arbitrary SHIQ
concepts in probabilistic axioms. Second, any SHIQ ontology can be used as
a basis for a P-SHIQ ontology which facilitates transition from classical to
probabilistic ontological models [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Thirdly, its reasoning tasks are decidable.
Syntax of P-SHIQ The syntax of P-SHIQ is an extension of the syntax of
SHIQ with conditional constraints [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Conditional constraints are expressions
of the form (DjC)[l; u] where C and D are SHIQ concepts1. Conditional
constraints can be used for representing uncertainty in both terminological (TBox)
and assertional (ABox) knowledge. A probabilistic TBox (PTBox) is a 2-tuple
P T = (T ; P) where T is a SHIQ TBox and P is a nite set of default2
conditional constraints. Informally, a PTBox axiom (DjC)[l; u] means that \generally,
if a randomly chosen individual belongs to C, its probability of belonging to D
is between l and u". A probabilistic ABox (PABox) is a nite set of strict
conditional constraints pertaining to a single probabilistic individual o. All constraints
in a PABox are of the restricted form (Dj&gt;)[l; u]. Informally, they mean that \o
is a member of D with probability between l and u" [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        Semantics and Reasoning with P-SHIQ Although the semantics of
PSHIQ is standardly given in terms of possible worlds [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], we present a slightly
di erent formulation based on a closely related notion of types [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. This will
help to ensure uniform use of terminology in the later sections. Two preliminary
notions are needed:
De nition 1 (Closure, Concept Type). For an ontology O = (T ; A), where
T is a TBox and A is an ABox, the concept closure of T , denoted as cl(T ), is
the smallest set of concepts such that:
{ &gt; is in cl(T ),
{ For all C v D 2 T , C and D are in cl(T ),
{ If C 2 cl(T ) then its subconcepts are in cl(T ),
{ If C 2 cl(T ) then :_C (negation in the negation normal form) is in cl(T ).
A concept type Ct of an ontology O is a subset of cl(T ) such that for all C 2 Ct,
C 2 Ct i :_C 2= Ct. Given an interpretation ( I ; I ) j= T , a domain element
e 2 I is an instance of type Ct i e 2 CI for all C 2 Ct.
1 Without loss of generality we will further assume that C; D are atomic concepts.
2 Default aspects of P-SHIQ are immaterial in the context of this paper.
The relation ctype(e) = fCtje is instance of Ctg is a well-de ned function. The
set fCt cl(T )j dCi2Ct Ci is satis able w.r.t. T g is denoted T ypes(O).
De nition 2 (Type Consistency). A type Ct is said to be consistent with a
concept C (Ct a` C) if C 2 Ct.
      </p>
      <p>It is convenient to de ne probabilistic models using types. A probabilistic
interpretation P r is a function P r : T ypes(O ! [0; 1]) such that
PT 2T ypes(O) P r(T ) = 1. The probability of a concept C, denoted as P r(C), is
de ned as PT a`C P r(T ). P r(DjC) is used as an abbreviation for
P r(C u D)=P r(C) given P r(C) &gt; 0. A probabilistic interpretation P r
satises a conditional constraint (DjC)[l; u] (or P r is a model of (DjC)[l; u]) denoted
as P r j= (DjC)[l; u] i P r(C) = 0 or P r(DjC) 2 [l; u]. Finally, P r satis es a set
of conditional constraints P i it satis es each of the constraints.</p>
      <p>
        The following are the core reasoning problems of P-SHIQ [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]:
{ Probabilistic Satis ability (PSAT). PSAT is the problem of deciding whether
there exists a probabilistic interpretation that satis es given PTBox.
{ Tight Logical Entailment (TLogEnt). TLogEnt is the problem of computing
the tightest probability intervals for logical consequences.
1.2
      </p>
    </sec>
    <sec id="sec-2">
      <title>First-Order Probabilistic Logic</title>
      <p>
        FOPL2 is a probabilistic generalization of rst-order logic aimed at capturing
belief statements (the subscript 2 stands for the Type 2 semantics [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]), like
\the probability that Tweety (a particular bird) ies is over 90%" [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. It is very
expressive allowing to attach probabilities to arbitrary rst-order formulas (both
closed and open). Its representational and computational properties have been
well investigated, and the results are applicable to its fragments.
Syntax of FOPL2 The syntax of FOPL2 is de ned as follows [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]: assume a
rst-order alphabet of function and predicate names, and a countable set of
object variables Xo. Object terms are formed by closing Xo o under function
application as usual. The language also contains eld terms, which range over
reals (with 0 and 1 being distinguished constants), and probability terms of the
form w( ), where is a rst-order formula. Field terms are closed o under
applications of functions ; + on reals (e.g., t1 + t2 is a eld term i t1; t2 are).
Then FOPL2 formulas are de ned as follows:
{ If P is an n-ary predicate name in and t1; : : : ; tn are object terms, then
      </p>
      <p>P (t1; : : : ; tn) is an atomic formula.
{ If t1; t2 are eld terms, then t1 t2, t1 t2, t1 &lt; t2, t1 &gt; t2, t1 = t2 are
atomic formulas. Standard relationships between (in)equality relations are
assumed to hold.
{ If ; are formulas and x 2 Xo, then ^ , _ , 8(x) , 9(x) , : are
formulas. Standard relationships between logical connectives and quanti ers
are assumed to hold.</p>
      <p>Finally, the denotation w( j )
t is the abbreviation of w( ^
)
t
w( ).</p>
      <p>
        Semantics of FOPL2 A probabilistic interpretation (Type 2 probability
structure in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]) M is a tuple (D; S; ; ), where D is a domain, S is a set of states
(sometimes called possible worlds ), is a function S ! D (where D is a
set of predicates and functions over D) which preserves arity, and is a
probability distribution over S. Let v be valuation function that maps each object
variable to an element of D. Then M together with a state s and a valuation v
associates each object term t with an element of D ([t]M;s;v 2 D) and each eld
term t with a real number. (M; s; v) also associate formulas with truth values as
follows (we write (M; s; v) j= i is true in M ):
      </p>
      <p>Field terms of the form w( ) are associated as follows: [w( )]M;v = fs 2
Sj(M; s; v) j= g.</p>
      <p>{ (M; s; v) j= P (x) if v(x) 2 (s; P ).
{ (M; s; v) j= t1 &lt; t2 if [t1]M;s;v &lt; [t2]M;s;v.</p>
      <p>{ (M; s; v) j= 8(x) if (M; s; v[x=d]) j= for all d 2 D.</p>
      <p>Other formulas, e.g. ^ , : , t1 = t2, etc. are de ned as usual. It remains
to de ne the mapping for the eld terms of the form w( ): [w( )]M;v = fs0 2
Sj(M; s0; v) j= g (note that the association does not depend on a state here).</p>
      <p>
        As usual, a FOPL2 formula is called satis able if there exists an interpretation
and a valuation in which the formula is true. It is called valid (denoted j= ) i
it is satis ed by all interpretations and valuations. More detailed exposition of
FOPL2 2 can be found in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
2
      </p>
      <sec id="sec-2-1">
        <title>Correspondences between P-S HI Q and FOPL2</title>
        <p>This section presents a translation between P-ALCI and the FOPL2. For the
sake of brevity we will limit our attention to ALCI concepts as the translation
can be extended to SHIQ concepts in a straightforward way (inverse roles are
included because they will be important in the following section). We will show
that the translation preserves satis ability of formulas so that P-SHIQ can be
viewed as a fragment of FOPL2.
2.1</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Syntactic Translation</title>
      <p>
        We rst de ne the injective function to be the mapping of P-ALCI
formulas to FOPL2. It is a superset of the standard translation of ALCI formulas
into the formulas of FOL [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] (in the Table 2.1 A; B stand for concept names,
R for role names, C; D for concept expressions, P for role names or inverses,
var 2 fx; yg; var0 = x if var = y and y if var = x).
      </p>
      <p>This function transforms a P-ALCI ontology into the collection of FOPL2
theories. More precisely, a PTBox is transformed into a set of formulas while each
PABox is translated into its own FOPL2 theory. This is because PABox axioms
in P-SHIQ are treated as PTBox axioms (just labeled with the individual name)
so they do not correspond to ground formulas in FOPL2. Interestingly, in our
extension of P-SHIQ (see Section 3) they do (as they should).</p>
      <p>Proof. The proof will proceed by construction, i.e. we will show that a satisfying
probabilistic model in P-SHIQ can be transformed into a satisfying probabilistic
interpretation in FOPL2 and vice versa.</p>
      <p>
        ()): Assume there exists P r : T ypes(O) ! [0; 1] s.t. P r j= T and P r j= P.
We will construct M = (D; S; ; ) s.t. M j= ( ) for all 2 T [ P. For
that we rst observe that the existence of P r implies the existence of a SHIQ
interpretation: ( I ; I ) j= T [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Thus we can de ne D and as follows: D = I ;
(t) = ( 1(t))I for all unary and binary predicates t. Note that interprets all
predicate names independently of a state.
      </p>
      <p>The key is to construct the set S and ensure the following property:
P r(fCt 2 T ypes(O)jCt a` Cg) = (fs 2 Sj(M; s; v) j= (C)g)
for any concept C and some valuation v
(1)</p>
      <p>Property (1) guarantees that P r(C) = ( (C)) for all (C 2 cl(T ) (hence
P r j= (DjC)[l; u] implies (M; v) j= l w( (D; x)j (C; x)) u for some v). It can
be achieved by de ning a straightforward bijection between types and states
using . For each type Ct = fC1; : : : ; Ckg we de ne a state
s = f (C1; x); : : : ; (Ck; x)g (we use s = (Ct) as a shorthand notation). It
is not hard to see that, if di Ci is satis able w.r.t. T then Vi (Ci; x) is satis
able w.r.t. (T ).</p>
      <p>Similarly to P-SHIQ we use the notation s a` where is a translation of
some ALCI concept expression into FOPL2 i 2 s. We leave out the
demonstration that Ct a` C implies (Ct) a` (C; x).</p>
      <p>Finally, we de ne the probability function: (s) = P r( 1(s)) for all s 2 S.</p>
      <p>It remains to show that M = (D; S; ; ) satis es the corresponding formula
in FOPL2 and some valuation v. For the sake of brevity we will only show this for
probabilistic axioms. Consider a PTBox axiom (BjA)[l; u]. It is translated into
the formula l w( (B; x)j (A; x)) u. Now PPCCtat`aB`AuAPP(C( Ct)t) 2 [l; u], therefore
Psa` (B;x)^ (A;x) (s) 2 [l; u] (by de nition of S and ). In order to show that</p>
      <p>Psa` (A;x) (s)
this is equivalent to fs2S:(M;s;v)j=B(x)^A(x))g 2 [l; u] it su ces to prove that, if
fs2S:(M;s;v)j=A(x)g
there exists a state s0 2 S in M s.t. s0 a` c(x) (c is a unary predicate) then
there exists a valuation v s.t. (M; s0; v) j= c(x). Take some s0 2 S. Then there
exists v : (M; s0; v) j= c1(a) ^ ^ ck(a) ^ (T ) where a is a fresh constant (this
follows from consistency of concept types). Since c = ci 2 s0 for some i it follows
that (M; s0; v[a=x]) j= c(x).</p>
      <p>(() can be shown completely analogously, so we skip the details.
2
2.3</p>
    </sec>
    <sec id="sec-4">
      <title>Why the Translation Matters?</title>
      <p>This translation provides a new insight into both P-SHIQ and FOPL2 being
valuable for the following reasons:</p>
      <p>
        Firstly, while it has been claimed that P-SHIQ can represent and reason
about both statistical and subjective (i.e. degrees of belief) knowledge [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], it is
vulnerable to all arguments regarding the issues with the possible world
semantics of FOPL2 and its inappropriateness for handling statistics [
        <xref ref-type="bibr" rid="ref10 ref11 ref3">10, 11, 3</xref>
        ]. The
most serious issue is that generic probabilistic statements that are aimed at
representing statistics do not ful ll this role. The proper statistical interpretation of
a statement (DjC)[l; u] would be: \if a randomly chosen domain element belongs
to C, its probability of belonging to D is in [l; u]. However, in P-SHIQ semantics
it instead quanti es over randomly chosen named individuals. The di erence is
illustrated on the following example:
Example 1 (Inadequate Handling of Statistics). Consider the PTBox: T = fA
(= 1R:B); B (= 1R :A)g; P = f(Aj&gt;)[0:5; 0; 5]g. The TBox component
implies a bijection between the interpretations of A and B so it seems that if the
probability that a random object is in A is 0:5 then the probability of being in
B should also be 0:5 (because there are as many instances of B in the domain
as there are of A). However, the result is [0; 1] because this relationship between
the interpretations is not captured in the possible world based semantics.
      </p>
      <p>Secondly, the translation helps to understand the limitations of P-SHIQ,
namely, the inability to combine probabilistic knowledge about multiple
individuals in a single ABox, unnatural separation of classical and probabilistic
individuals, inability to represent probabilistic relationships between roles, etc. The
common reason behind all those issues is that the set of states S is quite
coarsegrained and contains only information about concepts, but not roles or named
individuals (constants). We will take it up in Section 3 in which extensions to
P-SHIQ are proposed.</p>
      <p>Finally, the translation is also valuable for the analysis of FOPL2. Since it
is a highly undecidable logic it is natural to analyze its interesting decidable
fragments and P-SHIQ is one of such fragments. It is known that all fragments
of FOPL2 that have bounded model property are decidable. Interestingly
PSHIQ does not have such property (because SHIQ does not have nite model
property) but it is decidable which suggests that there might be other criteria for
decidability. In particular, it is known that given the unrestricted use of variables
it is undecidable even with a single unary predicate.
3</p>
      <sec id="sec-4-1">
        <title>Possible extensions to P-SHI Q</title>
        <p>P-SHIQ has a number of limitations which are caused by two main reasons.
First, it does not make use of all features that can be supported by the FOPL2
semantics. Second, as brie y explained in the previous section, possible world
semantics is not the best ground for representing and reasoning with statistical
knowledge. In this paper we attempt to address the rst type of limitations (the
resulting logic will be called P-SHIQ+). The important issues, which will be
addressed in P-SHIQ+, are the following:</p>
        <p>Separation between classical and probabilistic individuals. In
PSHIQ all named individuals are split onto those for which there is some
probabilistic knowledge (i.e. PABox axioms) and those for which there is not. As
a result, there is no straightforward way to combine classical and probabilistic
reasoning for a single individual although that seems a very natural thing to do.
For example, consider the following knowledge base:
Example 2. fBig u Dense v Heavy; ball : Big; (ball : Dense)[0:9; 1]g
Intuitively the answer to the query ball : (Heavy)[?; ?] should be [0:9; 1].
However, due to the separation, the two balls are regarded as distinct individuals so
such entailment is not supported. There are, of course, workarounds, for
example, it is possible to do some sort of \punning" between the two balls and obtain
the desired result.</p>
        <p>
          Isolated PABoxes. P-SHIQ does not allow for the representation of
probabilistic role assertions between probabilistic individuals. In other words, while
it is possible to state that (jim : P arent)[0:9; 0:95] it is not possible to state that
((jim; pete) : parentOf )[0:9; 0:95]. There is a way to represent probabilistic role
assertions in P-SHOIN but only between a classical and a probabilistic
individual [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. Thus it is not possible to assert a probabilistic role between individuals
for whom we store other probabilistic knowledge. Consider the example:
Example 3. fjohn : :T all[0:1; 0:2]; pete : T all[0:9; 1];
(john; pete) : f riendOf [0:8; 1]g
In P-SHIQ either john or pete will have to be a classical individual so we will
not be able to use all the available probabilistic knowledge to answer the query
john : (T all u 9f riendOf::T all)[?; ?] (the probability that John is tall but has
a short friend).
        </p>
        <p>No support of probabilistic role hierarchies. P-SHIQ only supports
generic probabilistic relationships between concepts but not roles.</p>
        <p>We next proceed to the syntax and semantics of P-SHIQ+.
Syntactically, P-SHIQ+ di ers from P-SHIQ in two main aspects. Firstly, the
notion of a conditional constraint is extended. Secondly, instead of a collection
of independent PABoxes in P-SHIQ, a P-SHIQ+ ontology contains only a
single probabilistic component, namely, a collection of conditional constraints.
One preliminary de nition is needed.</p>
        <p>De nition 3 (Entity). An entity in P-SHIQ+ is one of: role, concept, concept
assertion, role assertion.</p>
        <p>Intuitively, entities are the things that can appear in conditional constraints.
Now we can de ne the extended syntax of conditional constraints.
De nition 4 (Conditional Constraint). A conditional constraint in
P-SHIQ+ is an expression of the form: ( j )[l; u] where ; are entities.
Note that every conditional constraint in P-SHIQ is a well-formed conditional
constraint in P-SHIQ+. No other probabilistic formulas are allowed. A
probabilistic ontology in P-SHIQ+ has a simpler structure than in P-SHIQ:
De nition 5 (Probabilistic Ontology). A probabilistic ontology in P-SHIQ+
is a tuple P O = (O; P) where O = (T ; A) is a SHIQ ontology3 and P is a nite
collection of conditional constraints.</p>
        <p>Any P-SHIQ ontology can be represented as a P-SHIQ+ ontology, however,
the representation of PABox constraints is a bit di erent. Instead of implicitly
attributing statements of the form (Cj&gt;)[l; u] to a particular individual, it is
represented as (C(a)j&gt;)[l; u] and becomes an element of P. In other words,
PABox constraints in P-SHIQ+ correspond to ground probabilistic formulas in
FOPL2 while in P-SHIQ they do not.
From an analysis of P-SHIQ alone, it may not be immediately clear that its
drawbacks can be addressed. However, looking at the corresponding fragment of
FOPL2 immediately reveals that none of the limitations is fundamental. FOPL2
does not require any separation between constants and neither does it prohibit
attaching probabilities to binary predicates. In fact, what is necessary from the
FOPL2 point of view is that the speci cation of the states provides enough
3 Role hierarchies are assumed to be a part of the TBox.
information to determine if a particular formula is true at the given state (to be
able to assign probabilities to weighted formulas). P-SHIQ speci cation only
allows that for unary predicates and their combinations. We are going to extend
it to binary predicates as well. It is worth noting that FOPL2 does not impose
restrictions on the set of states, so one is free to extend it in an arbitrary way.</p>
        <p>Recall from Section 1 that states in P-SHIQ are types, that is, subsets of
the closure of relevant concepts. Unfortunately types do not provide enough
information about relationships between domain objects. Our idea is to extend
single type structures to two type structures, which we call links. Informally, a
link is a speci cation of an ordered pair of domain elements which includes types
of both as well as relationships between them.</p>
        <p>De nition 6 (Extended Concept Type). Concept types in P-SHIQ+ are
extensions of P-SHIQ concept types which may contain individual names. They
have to satisfy one additional requirement, namely, if a 2 Ct and a : C 2 O then
C 2 Ct.
ctype is also extended: ctype(e) = Ct if additionally e = aI for all a 2 Ct.
De nition 7 (Role type). Let R be the set of roles in an ontology O = (T ; A)
(i.e. role names and their inverses) that can participate in probabilistic
statements. Then a role type Rt is a subset of R that satis es the following condition:
if R 2 Rt and R v S 2 O or Inv(R) v Inv(S) 2 O then S 2 Rt.</p>
        <p>Given an interpretation ( I ; I ) j= O, a pair of domain elements (e; f ) is an
instance of a role type Rt if (e; f ) 2 RI for all R 2 Rt.</p>
        <p>Analogously to ctype, the function rtype(e; f ) = fRtj(e; f ) is instance of Rtg.
Observe that it is also a well-de ned, total function.</p>
        <p>De nition 8 (Link). A link L is a tuple (Ct1; Rt; Ct2) where Ct1; Ct2 are in
T ypes(O) and Rt is a role type such that:
{ If 8R:C 2 Ct1 and R 2 Rt then C 2 Ct2.
{ If 8R:C 2 Ct2 and Inv(R) 2 Rt then C 2 Ct1.
{ If 0R:C 2 Ct1 and R 2 Rt then :_ C 2 Ct2.</p>
        <p>{ If 0R:C 2 Ct2 and Inv(R) 2 Rt then :_C 2 Ct1.</p>
        <p>De nition 9 (Instantiation function). Given a model I = ( I ; I ) j= O
the function link(e; f ) on the set of pairs of domain elements is de ned as
(ctype(e); rtype(e; f ); ctype(f )) where ctype is the concept type function and
rtype is the role type function.</p>
        <p>Observe that link is a well-de ned function since ctype and rtype are
wellde ned. It is important to emphasize that links inherit the key property of types
that any ordered pair of domain elements is an instance of one and only one
link. We use Links(O) to denote the set of all links L such that there exists an
interpretation I = ( I ; I ) of O and e; f 2 I with link(e; f ) = L.</p>
        <p>Now we can de ne the notion of consistency between a link and an entity.
De nition 10 (Consistency with an Entity). A link (Ct1; Rt; Ct2) is
consistent with an entity w.r.t. the ontology O (denoted as L a` ) if:
{ If is a role R and R 2 Rt,
{ If is a concept C and C 2 Ct1,
{ If is a concept assertion C(e) and fe; Cg 2 Ct1,
{ If is a role assertion R(e; f ) and (e 2 Ct1 and R 2 Rt and f 2 Ct2) or
(f 2 Ct1 and R 2 Rt and e 2 Ct2).</p>
        <p>We now proceed to probabilistic models in P-SHIQ+.</p>
        <p>De nition 11 (Probabilistic Interpretation, Probability of an Entity).
A probabilistic interpretation P r of a probabilistic ontology P O = (O; P) is
a probability function on Links(O) (a mapping P r : Links(O) ! [0; 1] s.t.
PL2Links(O) P r(L) = 1.</p>
        <p>The probability of an entity denoted as P r( ) is the
sum PL2Link(T );La` P r(L). Probability of the conjunction of two entities
P r( ^ ) is the sum PL2Links(O);La` ;La` P r(L). Conditional probability
P r( j ) is an abbreviation of P r( ^ )</p>
        <p>P r( ) .</p>
        <p>As long as any xed ordered pair of domains elements is an instance of
one and only one link, we can compute the probability of an entity as a sum
of probabilities of all links which are consistent with it (intuitively, links are
disjoint, that is, do not share elements, and exhaustive i.e., cover the space of
all pairs). For example, the probability of a role is a total probability of all links
in which the ordered pair of individuals is related via this role (informally this
means \the probability that a randomly selected pair is related via the role").
De nition 12 (Probabilistic Satis ability, Logical Entailment). A
probabilistic interpretation P r satis es (or is a model of ) a P-SHIQ+ formula
( j )[l; u] (denoted as P r j= ) if P r( j ) 2 [l; u]. P r satis es a collection
of P-SHIQ+ formulas if it satis es all of them. P r satis es an ontology (O; P)
if P r j= P.</p>
        <p>A P-SHIQ+ formula ( j )[l; u] is a logical consequence of a P-SHIQ+
ontology P O if it is satis ed by all probabilistic models of P O. It is a tight logical
consequence of P O if l (resp. u) is the minimum (resp. the maximum) over all
P r j= P O such that P r( ) &gt; 0.</p>
        <p>
          Problems of deciding probabilistic satis ability (PSAT) and computing tight
logical entailment (TLogEnt) are stated equivalently to P-SHIQ (see [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]).
        </p>
        <p>One can verify that P-SHIQ+ is a fragment of FOPL2 by adapting the
translation of P-SHIQ to FOPL2. All that has to be changed in the proof of
Theorem 1 is the construction of states so that they correspond to links. Finally
we present a few examples of the P-SHIQ+ advantages over P-SHIQ:
Example 4 (Combining classical and probabilistic individuals). Consider the
ontology from the Example 2. Now both occurrences of ball denote the same
individual. The probability of Heavy(ball) is de ned as: P r(Heavy(ball)) =
PIj=Heavy(ball) P r(I). Consider the set of links that are consistent with ball. All
of them have Big in the rst concept type and at least 90% also have Dense.
This means that at least 90% of them have Heavy in the rst concept type (by
the de nition of types), therefore the result is [0:9:1] as it should be.
Example 5 (Probabilistic role assertions). Ontology from the Example 3 is a
wellformed P-SHIQ+ ontology. The answer to the query
(john : (T all u 9f riendOf::T all)[?; ?] is [0:5; 0:9].</p>
        <p>Example 6 (Probabilistic role hierarchies). It is possible to assert in P-SHIQ+
that at least 60% of classmates are friends: (f riendOf jclassmateOf )[0:60; 1].
3.3</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Possible Further Extensions</title>
      <p>P-SHIQ+ does address some important limitations of P-SHIQ but it is still
not a fully appropriate formalism for representing and reasoning with
statistical knowledge. Its semantics is based on possible worlds (links) which results
in counterintuitive inference in some situations (see Example 1). No further
extension of the possible world semantics can solve this issue, so it is natural to
investigate the possibilities of using domain probability distributions.</p>
      <p>
        At the moment it is not entirely clear whether the improper handling of
statistics turns out to be critical in practical modeling or not. To be visible the
problem requires some speci c interaction between interpretations of concepts
and probabilistic axioms. However, designers of probabilistic ontologies must be
aware of it. We are planning to investigate this issue by extending the previously
created BRCA ontology [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. If the issue happens to be critical then there might
several ways to rectify it. The two main alternatives are Bacchus' Lp logic with
domain based semantics followed by applying belief functions to adjust subjective
beliefs basing on the general statistics [
        <xref ref-type="bibr" rid="ref10 ref11">10, 11</xref>
        ], and Halpern's Type 3 semantics
which combines features of domain based and prossible world based models [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
4
      </p>
      <p>Conclusion
In this paper we presented a new look at the probabilistic description logic
PSHIQ as a fragment of probabilistic rst-order logic. We gave a formal syntactic
translation of P-SHIQ knowledges bases into FOPL2 theories and proved its
faithfulness. This brought an extra insight into P-SHIQ, most importantly,
into its limitations, both those that can be addressed by extending the existing
possible world based semantics and those which cannot.</p>
      <p>
        We then proceeded to address the rst type of limitations and presented an
extended logic P-SHIQ+ which adds a few important capabilities to the
existing arsenal. Namely, it enables full probabilistic role assertions, probabilistic role
hierarchies and eliminates the unnatural separation between classical and
probabilistic individuals. The logic is still decidable and even has the same worst case
complexity as P-SHIQ. Our next goal is to implement P-SHIQ+ by applying
techniques that proved useful for P-SHIQ [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Koller</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Levy</surname>
            ,
            <given-names>A.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pfe</surname>
            <given-names>er</given-names>
          </string-name>
          , A.:
          <string-name>
            <surname>P-CLASSIC</surname>
          </string-name>
          :
          <article-title>A tractable probabilistic description logic</article-title>
          .
          <source>In: Advances in Arti cial Intelligence Conference</source>
          . (
          <year>1997</year>
          )
          <volume>390</volume>
          {
          <fpage>397</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Klinov</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Pronto: A non-monotonic probabilistic description logic reasoner</article-title>
          .
          <source>In: European Semantic Web Conference (ESWC</source>
          <year>2008</year>
          ), Posters &amp; Demos. (
          <year>2008</year>
          )
          <volume>822</volume>
          {
          <fpage>826</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Halpern</surname>
            ,
            <given-names>J.Y.</given-names>
          </string-name>
          :
          <article-title>An analysis of rst-order logics of probability</article-title>
          .
          <source>Arti cial Intelligence</source>
          <volume>46</volume>
          (
          <year>1990</year>
          )
          <volume>311</volume>
          {
          <fpage>350</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Lukasiewicz</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Expressive probabilistic description logics</article-title>
          .
          <source>Arti cial Intelligence</source>
          <volume>172</volume>
          (
          <issue>6-7</issue>
          ) (
          <year>2008</year>
          )
          <volume>852</volume>
          {
          <fpage>883</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Klinov</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Probabilistic modeling and OWL: A user oriented introduction into P-SHIQ(D)</article-title>
          .
          <source>In: OWL: Experiences and Directions</source>
          . (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Lukasiewicz</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Probabilistic logic programming with conditional constraints</article-title>
          .
          <source>ACM Transactions on Computational Logic</source>
          <volume>2</volume>
          (
          <issue>3</issue>
          ) (
          <year>2001</year>
          )
          <volume>289</volume>
          {
          <fpage>339</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Lutz</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tendera</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>The complexity of nite model reasoning in description logics</article-title>
          .
          <source>Information and Computation</source>
          <volume>199</volume>
          (
          <issue>1</issue>
          {2) (
          <year>2005</year>
          )
          <volume>132</volume>
          {
          <fpage>171</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Lukasiewicz</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Probabilistic default reasoning with conditional constraints</article-title>
          .
          <source>Annals of Mathematics and Arti cial Intelligence</source>
          <volume>34</volume>
          (
          <issue>1-3</issue>
          ) (
          <year>2002</year>
          )
          <volume>35</volume>
          {
          <fpage>88</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Calvanese</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McGuiness</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nardi</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Patel-Schneider</surname>
            ,
            <given-names>P.F.</given-names>
          </string-name>
          :
          <article-title>Description Logic Handbook</article-title>
          . Cambridge University Press (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Bacchus</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Lp, a logic for representing and reasoning with statistical knowledge</article-title>
          .
          <source>Computational Intelligence</source>
          <volume>6</volume>
          (
          <year>1990</year>
          )
          <volume>209</volume>
          {
          <fpage>231</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Bacchus</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Representing and reasoning with probabilistic knowledge</article-title>
          . MIT Press (
          <year>1990</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Klinov</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>On improving the scalability of checking satis ability in probabilistic description logics. Submitted to the Scalable Uncertainty Management Conference (</article-title>
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>