<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Probabilistic Description Logics: Reasoning and Learning</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Riccardo Zese</string-name>
          <email>riccardo.zese@unife.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Dipartimento di Ingegneria</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Ferrara Via Saragat</institution>
          <addr-line>1, I-44122, Ferrara</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>The last decade has seen an exponential increase in the popularity of the
Semantic Web. However, given the nature of the domains usually modeled in
such scenario and the origin of available data, the interest for the development
of methods for combining probability with Description Logics (DLs) has been
exponentially increased as well.</p>
      <p>
        A possible probabilistic semantics for DLs is DISPONTE [
        <xref ref-type="bibr" rid="ref3 ref5">3, 5</xref>
        ], which applies
to them the distribution semantics, one of the most prominent semantics in
probabilistic logic programming. DISPONTE allows to annotate axioms with
a probability, interpreted as epistemic probability, indicating the degree of our
belief in the truth of the corresponding axiom.
      </p>
      <p>Prob-ALC considers only epistemic probabilities, while crALC extends ALC
by allowing only statistical probabilities. In both these approaches the
probability can be assigned to a limited set of axioms, di erently from DISPONTE
where every axiom can be probabilistic. P-SHIQ(D) uses probabilistic
lexicographic entailment from probabilistic default reasoning and allows to annotate
with a probabilistic interval both assertional and terminological axioms. BE L
exploits Bayesian networks to extend the E L DL, while Probabilistic Datalog
uses Markov networks.</p>
      <p>
        Several algorithms have been proposed for supporting the development of the
Semantic Web. E cient DL reasoners are able to extract implicit information
from the modeled ontologies. Despite the availability of many DL reasoners, the
number of probabilistic reasoners is quite small. BUNDLE [
        <xref ref-type="bibr" rid="ref3 ref5">3, 5</xref>
        ] is a reasoner able
to compute the probability of queries w.r.t. DISPONTE DL KBs. It implements
the tableau algorithms and returns the set of all explanations for the query, then
represented with a Binary Decision Diagram (BDD), i.e., a tree representing a
boolean formula, used for computing the probability.
      </p>
      <p>
        However, some tableau expansion rules are non-deterministic forcing to
explore all the non-deterministic choices to compute the set of all explanations
for the query. This non-determinism can be managed with Prolog language.
Thus, we developed TRILL [
        <xref ref-type="bibr" rid="ref5 ref6">6, 5</xref>
        ] which implements the tableau algorithm in
Prolog to perform inference over DISPONTE DLs. We also developed TRILLP
[
        <xref ref-type="bibr" rid="ref5 ref6">6, 5</xref>
        ], which builds a monotone Boolean formula, called \pinpointing formula",
instead of the set of explanations, which compactly represents them and can
be directly translated into a BDD. Finally, TORNADO builds BDDs instead of
pinpointing formulas during the inference process. TRILL, TRILLP and
TORNADO are available at http://trill.ml.unife.it in the web service \TRILL
on SWISH".
      </p>
      <p>Other examples are PRONTO, which follows P-SHIQ(D) semantics and
BORN following BE L semantics. A completely di erent approach addresses
reasoning for Datalog ontologies with an Abductive Logic Programming
framework named SCIFF, with existential and universal variables, and Constraint
Logic Programming constraints in rule heads.</p>
      <p>
        The correct values of the axioms' probabilities are unfortunately di cult to
set, since they depend on many di erent factors. Therefore, it is necessary to
develop systems able to automatically learn such values. Moreover, often KBs
are incomplete or poorly structured, requiring systems able to correct erroneous
information and learn new de nitions. We developed EDGE [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] that learns the
parameters of a DISPONTE KB from the information available in the domain.
It exploits BUNDLE for building the BDDs representing explanations for the
input examples and an Expectation Maximization algorithm to de ne probability
values. We also developed LEAP [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], which combines EDGE with the learning
system CELOE, in order to learn the structure of a DISPONTE KB by
building new axioms. EDGE is used to learn the parameters of the KB. A di erent
approach is used in Goldminer where association rules are exploited to de ne
probabilistic terminological axioms.
      </p>
      <p>
        However, nowadays most of the KBs are de ned following the vision of Big
Data and Linked Open Data. Thus, they require the implementation of
algorithms exploiting parallelization and cloud computing to handle such big amount
of data. Therefore, we extended EDGE and LEAP by developing EDGEMR [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]
and LEAPMR [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], which distribute the work load.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Cota</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zese</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bellodi</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lamma</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Riguzzi</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Structure learning with distributed parameter learning for probabilistic ontologies</article-title>
          .
          <source>In: Doctoral Consortium of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD</source>
          <year>2015</year>
          ). pp.
          <volume>75</volume>
          {
          <issue>84</issue>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Cota</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zese</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bellodi</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Riguzzi</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lamma</surname>
          </string-name>
          , E.:
          <article-title>Distributed parameter learning for probabilistic ontologies</article-title>
          .
          <source>In: 25th International Conference on Inductive Logic Programming (ILP</source>
          <year>2015</year>
          )
          <article-title>(</article-title>
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Riguzzi</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bellodi</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lamma</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zese</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Probabilistic description logics under the distribution semantics</article-title>
          .
          <source>Semantic Web</source>
          <volume>6</volume>
          (
          <issue>5</issue>
          ),
          <volume>447</volume>
          {
          <fpage>501</fpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Riguzzi</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bellodi</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lamma</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zese</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cota</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>Learning probabilistic description logics</article-title>
          .
          <source>In: Uncertainty Reasoning for the Semantic Web III</source>
          , pp.
          <volume>63</volume>
          {
          <fpage>78</fpage>
          . LNCS, Springer International Publishing, Berlng, Heidelberg (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Zese</surname>
          </string-name>
          , R.:
          <article-title>Probabilistic Semantic Web, Studies on the Semantic Web</article-title>
          , vol.
          <volume>28</volume>
          . IOS Press, Amsterdam (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Zese</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bellodi</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Riguzzi</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cota</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lamma</surname>
          </string-name>
          , E.:
          <article-title>Tableau reasoning for description logics and its extension to probabilities</article-title>
          . Ann. Math. Artif. Intell. pp.
          <volume>1</volume>
          {
          <issue>30</issue>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>