<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Study of MEBN Learning for Relational Model</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Cheol Young Park</string-name>
          <email>cparkf@gmu.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kathryn Blackmond Laskey</string-name>
          <email>klaskey@gmu.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Paulo Costa</string-name>
          <email>pcosta@gmu.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Shou Matsumoto</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Getoor</institution>
          ,
          <addr-line>L., Tasker, B.: Introduction to statistical relational learning. MIT Press, Cambridge, 2007</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Nevile</institution>
          ,
          <addr-line>J., Jensen, D.: Relational dependency networks. In: An Introduction to Statistical Relational Learning</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Oliver Schulte</institution>
          ,
          <addr-line>Hassan Khosravi, Flavia Moser</addr-line>
          ,
          <institution>and Martin Ester. Join bayes nets: A new type of bayes net for relational data. Technical Report 2008-17, Simon Fraser University</institution>
          ,
          <addr-line>2008. also in CS-Learning Preprint Archive</addr-line>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Volgenau School of Engineering George Mason University Fairfax, VA</institution>
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>1970</year>
      </pub-date>
      <abstract>
        <p>- In the past decade, Statistical Relational Learning (SRL) has emerged as a new branch of machine learning for representing and learning a joint probability distribution over relational data. Relational representations have the necessary expressive power for important real-world problems, but until recently have not supported uncertainty. Statistical relational models fill this gap. Among the languages recently developed for statistical relational representations is Multi-Entity Bayesian Networks (MEBN). MEBN is the logical basis for Probabilistic OWL (PR-OWL), a language for uncertainty reasoning in the Semantic Web. However, until now there has been no implementation of MEBN learning. This paper describes the first implementation of MEBN learning. The algorithm learns a MEBN theory for a domain from data stored in a relational Database. Several issues are addressed such as aggregating influences, optimization problem, and so on. In this paper, as our contributions, we will provide a MEBN-RM (Relational Model) Model which is a bridge between MEBN and RM, and suggest a basic structure learning algorithm for MEBN. And the method was applied to a test case of a maritime domain in order to prove our basic method.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Keywords: Probabilistic ontology, Multi-Entity Bayesian
networks, PR-OWL, Relational Model/Database, Machine
Learning, Statistical Relational Learning</p>
    </sec>
    <sec id="sec-2">
      <title>I. INTRODUCTION</title>
      <p>Statistical Relational Learning (SRL) is a new branch of
machine learning for representing and learning a joint
distribution over relational data [1, 2]. As its name suggests, it
combines statistical and relational knowledge representations.
A relational model represents a domain as a collection of
objects that may have attributes and can participate in
relationships with other objects. Relational representations are
expressive enough for important real-world problems, but until
recently have not supported uncertainty. This gap has been
filled by SRL methods. Statistical relational knowledge
representations combine statistical and relational approaches,
allowing representation of a probability distribution over a
relational model of a domain. SRL methods allow such
representations to be learned from data.</p>
      <p>Examples of representation languages for SRL include
Probabilistic Relational Models (PRMs), Markov Logic
Networks (MLNs), Relational Dependency Networks (RDNs),
Bayesian Logic Programs (BLPs), Join Bayes Net (JBN), and
Multi-Entity Bayesian Networks (MEBN) [2, 3, 4, 5, 6, and 7].</p>
      <p>A comparison of some of the above models is given in [1].
Typically, SRL models provide a representation for relational
knowledge, along with methods for both induction and
deduction. Relational representations provide both class and
instance models. A class model describes statistical information
that applies to classes of objects. For example, a class model
might describe the false positive and false negative rates for a
class of sensor. The instance model is generated from the class
model by a deduction method. For example, the instance model
would be used to infer the probability that a given detection is a
false positive. An induction method learns structure and
parameters of a domain theory from observations. For example,
induction would be used to learn the false positive and false
negative rates from a data set annotated with ground truth.</p>
      <p>SRLs have been applied to problems such as Object
Classification, Object Type Prediction, Link Type Prediction,
Predicting Link Existence, Link Cardinality Estimation, Entity
Resolution, Group Detection, Sub-graph Discovery, Metadata
Mining, and so on [2].</p>
      <p>This paper is concerned with the Multi-Entity Bayesian
Networks (MEBN), a relational language that forms the logical
basis of Probabilistic OWL (PR-OWL), a language for
uncertainty reasoning in the Semantic Web [7, 8]. PR-OWL
has been extended to PR-OWL 2, which provides a tighter link
between the deterministic and probabilistic aspects of the
ontology [9]. MEBN extends Bayesian networks to a relational
representation. A MEBN Theory, or MTheory, consists of a set
of Bayesian network fragments, or MFrags, that together
represent a joint distribution over instances of the random
variables represented in the MTheory [7].</p>
      <p>However, until now there has been no implementation of
induction or learning for MEBN or PR-OWL. This paper
describes such an implementation. We follow an approach used
by other SRL models [1] and use Relational Database (RDB) to
store the observations from which the representation is learned.</p>
      <p>This paper focuses a basic learning algorithm that addresses
the following issues:
1.
2.</p>
    </sec>
    <sec id="sec-3">
      <title>Developing a bridge of MEBN and RDB;</title>
      <p>Developing basic structure and parameter learning for
MEBN.</p>
      <p>Ultimately, a relational learning algorithm should address
issues such as aggregation of data, reference uncertainty, type
uncertainty, and continuous variable learning. These issues will
be considered for future research.</p>
      <p>Our learning method is exact, and assumes discrete random
variables, and complete data. It will be evaluated by the
inference accuracy test.</p>
      <p>In Section 2, we give a brief definition of MEBN and RM
as background. In the Section 3, we introduce the MEBN-RM
Model. In Section 4, we present the basic structure learning
algorithm. The application of the algorithm is described in the
Section 5.</p>
      <p>II. MULTI-ENTITY BAYESIAN NETWORKS (MEBN) AND</p>
      <p>RELATIONAL MODEL (RM)</p>
      <sec id="sec-3-1">
        <title>A. Multi-Entity Bayesian Networks (MEBN)</title>
        <p>MEBN extends Bayesian Networks (BNs) to represent
relational information. BNs have been very successful as an
approach to representing uncertainty about many interrelated
variables. However, BNs are not expressive enough for
relational domains. MEBN extends Bayesian networks to
represent the repeated structure of relational domains.</p>
        <p>MEBN represents knowledge about a domain as a
collection of MFrags, an MFrag is a fragment of a graphical
model that is a template of probabilistic relationships among
instances of its random variables. Random variables in an
MFrag can contain ordinary variables which can be filled in
with domain entities. And MFrag includes context, input, and
resident node for restriction of entity, reference of node, and
random variable respectively. We can think of an MFrag as a
class which can generate instances of BN fragments, which can
then be assembled into a Bayesian network [7].</p>
      </sec>
      <sec id="sec-3-2">
        <title>B. Relational Model (RM)</title>
        <p>In 1969, Edgar F. Codd proposed RM as a database model
based on first-order predicate logic [10]. RM is composed of
Relation, Attribute, Key, Tuple, Instance, and Cell. Relational
database which is the most popular database is based on RM.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>III. MEBN-RM MODEL</title>
      <p>As a bridge of MEBN and RM, we suggest MEBN-RM
Model which provides a specification for how to match
elements of MEBN to elements of RM. Key nodes in MEBN
are the context and resident node. To understand this easily, we
use the following example of the university relational model.</p>
      <p>Course Registration Student
Key Difficulty Course Key Student Key Grade Key Advisor
c1 low c1 s1 low s1 p4
c2 high c1 s2 high s2 p2
c3 high c2 s2 high s3 p3
c4 low c2 s4 low s4 p1
c5 med c3 s5 med s5 p5
c6 low c4 s6 low s6 null
Professor
Key Major
p1 SYST
p2 OR
p3 OR
p4 CS
p5 SYST
p6 OR</p>
      <sec id="sec-4-1">
        <title>A. Context Node</title>
        <p>In MFrags, context terms (or nodes) are used to specify
constraints under which the local distributions apply. Thus, it
determines specific entities on an arbitrary situation of a
context. In MEBN-RM model, we define four types of data
structure corresponding to context nodes: Isa, Slot-filler,
Value-Constraint, and Entity-Constraint type.</p>
        <p>Name</p>
        <p>Isa
Value-Constraint</p>
        <p>Slot-Filler
Entity-Constraint</p>
        <p>Example
Isa( Person, P ), Isa( Car, C )</p>
        <p>Height( P ) = high
P = OwnerOf( C )</p>
        <p>Friend( A, B )
1) Isa</p>
        <p>In MEBN, the Isa random variable represents the type of an
entity. In a RM, an entity table represents a collection of
entities of a given type. Thus, an entity table corresponds to an
Isa random variable in MEBN. Note that a relationship table
whose primary key is composed of foreign keys does not
correspond to an Isa RV. A relationship table will correspond
to the Entity-Constraint type of Context Node.</p>
      </sec>
      <sec id="sec-4-2">
        <title>2) Value-Constraint</title>
        <p>In a case, a value of attribute can limit keys which are
related with only the value. For example, Consider Table 1, in
which we have the course table with the difficulty attribute. (In
our definition, Attribute is descriptive Attribute and Key is
Primary Key)</p>
        <p>The course table has instances of the key (e.g., c1, c2, c3,
c4, c5, and c6). And if we want to focus on a case of the entity
with “high” value of the attribute, it will be {c2, c3}. In this
case, for the entity, any group of elements related with any
attributes can be derived. We encode this into “Difficulty
(Course) = high” in MEBN.</p>
      </sec>
      <sec id="sec-4-3">
        <title>3) Slot-Filler</title>
        <p>In the table 1, the professor key is used on the student table
by a foreign key, Advisor. The foreign key is not primary key
in the student table. In this case, the connection will be
expressed by “Professor = Advisor (Student)” in MEBN. And
its instance will be that s1’s advisor is p4 and so on.</p>
      </sec>
      <sec id="sec-4-4">
        <title>4) Entity-Constraint</title>
        <p>The registration table is a relationship table which is a
bridge between the course and student entity. In this case,
obviously, the registration table will be an intersection group.
And this is described as “Registration (Course, Student)” in
MEBN.</p>
      </sec>
      <sec id="sec-4-5">
        <title>B. Resident Node</title>
        <p>In MFrags, Resident Node can be described as Function,
Predicate, and Formula of FOL with a probability distribution.
FOL Function consists of arguments and an output, while FOL
Predicate consists of arguments and no output, but Boolean
output. We define the following relationship between elements
of RM and MEBN.</p>
        <p>RM
Attribute</p>
        <p>Key
Cell of Attribute</p>
        <p>Resident Node
Function/ Predicate</p>
        <p>Arguments</p>
        <p>Output</p>
        <p>For example, in the table 1, the grade of the registration
table is the function having the course and student keys as
arguments. Its output will be the cell of the grade such as low,
med, and high. On the other hand, if the domain type of the
grade is Boolean, it will be the predicate in MEBN.</p>
        <p>IV. THE BASIC STRUCTURE LEARNING FOR MEBN
To address the issues in Section 1, we suggest a basic
structure learning algorithm for MEBN. The initial ingredients
of the algorithm are a dataset of RM, a Bayesian Network
Structure searching algorithm, and a size of chain. For the
parameter learning, we only use Maximum Likelihood
Estimation (MLE). The algorithm focuses on discrete variables
with complete data. We utilize a standard Bayesian Network
Structure searching algorithm to generate a local BN from the
joined dataset of RM. To avoid infinite loops, we employed the
size of chain. Thus, the process of searching structure will
finish in the size of chain.</p>
        <p>Firstly, the algorithm creates the default MTheory. All keys
of DB are defined as entities of MEBN theory. One default
reference MFrag is created. For the all of tables of DB, the
dataset for each table is retrieved and, by using the BN
structure searching algorithm, a graph is generated from the
dataset. If the graph has a cycle and undirected edge, a
knowledge expert for the domain sets the arc direction. Based
on the revised graph, an MFrag is created. Until the size of
chain is reached, the joined datasets which are derived by
“Join” command in SQL are retrieved. The graphs related to
the joined datasets are generated in the same way as the above.
If any nodes of the new generated graph are not used in any
MFrags, create the resident node having the name of the dataset
of the graph on the default reference MFrag and the new
MFrag for the dataset. If not, only make edges between resident
nodes in the different MFrags. Lastly, for all resident nodes in
the MTheory, LPDs are generated by MLE.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>V. CASE STUDY</title>
      <p>
        To evaluate the algorithm, we used a dataset which came
from the PROGNOS (Probabilistic OntoloGies for Net-centric
Operation Systems) [
        <xref ref-type="bibr" rid="ref4">11, 12</xref>
        ]. The purpose of the system is to
provide higher-level knowledge representation, fusion, and
reasoning in the maritime domain.
      </p>
      <p>The PROGNOS includes a simulation which provides the
ground truth information for the system. The simulation uses a
given single entity Bayesian Network (we use this term to
discriminate the SSBN from Multi Entity Bayesian Networks)
in order for sampling data. The simulation generates 85000
persons, 10000 ships, and 1000 organization entities with
various values of attributes. The data for these entities are
stored in the relational database.</p>
      <p>For the evaluation of the model, the training and test dataset
was generated by the simulation. Using the basic structure
learning for MEBN, the PROGNOS MTheory was derived as
shown in Figure 2. In the model, a total of four MFrags were
generated such as the default reference, org_members, person,
and ship MFrag.</p>
      <p>To generate a SSBN from this MTheory, we assume that
we have one person, ship, and organization. They are related as
ship_crews (Ship S, Person P) and org_members( Organization
O, Person P). We queried the isShipOfInterest node with the
several evidence nodes located in the leaf nodes. Figure 3
presents the result SSBN in which the nodes of the ship and
person entity are connected each other.</p>
      <p>To compare the accuracies of the results, we used the single
entity Bayesian Network which was used for the sampling.
Thus, the single network provided another query result with the
same evidence. Figure 1 shows the Receiver Operating
Characteristic (ROC) Curve which describes accuracy of the
result of the learned MTheory and single entity Bayesian
Network. The areas under curves are shown in Table 4.</p>
      <p>Model</p>
      <p>Learned MTheory
Single Entity Bayesian Network</p>
      <p>AUC
0.874479929
0.87323784</p>
      <p>As we can see from Figure 1 and Table 4, the results of
accuracy of the learned MTheory and the single entity
Bayesian Network are almost the same. This means that the
learned MTheory well reflected the data of the relational
database which was sampled using the single entity Bayesian
Network.</p>
      <p>In this paper, we only compared the learned MTheory to the
true model which was the single entity Bayesian Network. This
result proves that our approach reflects the true model correctly.
However, the result of this paper is only the beginning and
baseline for a full MEBN Learning method, because we didn’t
address the aggregating influence problem which is the
important issue in SRL models.</p>
    </sec>
    <sec id="sec-6">
      <title>VI. DISCUSSION AND FUTURE WORK</title>
      <p>Because of a flood of complex and huge data, efficient and
accurate methods are needed for learning expressive models
incorporating uncertainty. In this paper, we have introduced a
learning approach for MEBN. As a bridge between MEBN and
RM, MEBN-RM Model was introduced. For induction, the
Basic Structure Learning for MEBN was suggested.</p>
      <p>Recently, we are studying about a heuristic approach which
called as the Framework of Function Searching for LPD
(FFSLPD) to address the aggregating influence problem. We plan to
expand the learning algorithm in order to include continuous
random variables.</p>
      <p>REFERENCES
Kersting, K., de Raedt, L.: Bayesian logic programming: Theory and
tool. In: Introduction to Statistical Relational Learning</p>
      <p>Laskey, K. B.,MEBN: A Language for First-Order Bayesian Knowledge
Bases. Artificial Intelligence, 172(2-3), 2008
Rommel N. Carvalho, Probabilistic Ontology: Representation and
Modeling Methodology, PhD Dissertation, George Mason University,
July 2011.
[11] P.C.G. Costa, K.B. Laskey, and KC Chang, “PROGNOS: Applying
Probabilistic Ontologies To Distributed Predictive Situation Assessment
In Naval Operations.” Proceedings of the 14th Int. Command And
Control Research and Technology Symposium, Washington, D.C., USA,
2009.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Hassan</given-names>
            <surname>Khosravi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Bahareh</given-names>
            <surname>Bina</surname>
          </string-name>
          .
          <article-title>A Survey on Statistical Relational Learning</article-title>
          .
          <source>In Proceedings of Canadian Conference on AI</source>
          '
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          pp.
          <volume>256</volume>
          ~
          <fpage>268</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Domingos</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Richardson</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>Markov logic: A unifying framework for statistical relational learning</article-title>
          . In: Introduction to Statistical Relational Learning,
          <source>ch. 12</source>
          , pp.
          <fpage>339</fpage>
          -
          <lpage>367</lpage>
          ,
          <year>2007</year>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>R. N.</given-names>
            <surname>Carvalho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. C. G.</given-names>
            <surname>Costa</surname>
          </string-name>
          ,
          <string-name>
            <surname>K. B. Laskey</surname>
            , and
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Chang</surname>
          </string-name>
          , “
          <article-title>PROGNOS: predictive situational awareness with probabilistic ontologies,”</article-title>
          <source>in Proceedings of the 13th International Conference on Information Fusion</source>
          , Edinburgh,
          <string-name>
            <surname>UK</surname>
          </string-name>
          , Jul.
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>