<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Lifted Representation of Relational Causal Models Revisited: Implications for Reasoning and Structure Learning</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Sanghack Lee and Vasant Honavar Artificial Intelligence Research Laboratory College of Information Sciences and Technology Pennsylvania State University University</institution>
          <addr-line>Park, PA 16802</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Maier et al. (2010) introduced the relational causal model (RCM) for representing and inferring causal relationships in relational data. A lifted representation, called abstract ground graph (AGG), plays a central role in reasoning with and learning of RCM. The correctness of the algorithm proposed by Maier et al. (2013a) for learning RCM from data relies on the soundness and completeness of AGG for relational dseparation to reduce the learning of an RCM to learning of an AGG. We revisit the definition of AGG and show that AGG, as defined in Maier et al. (2013b), does not correctly abstract all ground graphs. We revise the definition of AGG to ensure that it correctly abstracts all ground graphs. We further show that AGG representation is not complete for relational d-separation, that is, there can exist conditional independence relations in an RCM that are not entailed by AGG. A careful examination of the relationship between the lack of completeness of AGG for relational d-separation and faithfulness conditions suggests that weaker notions of completeness, namely adjacency faithfulness and orientation faithfulness between an RCM and its AGG, can be used to learn an RCM from data.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        Discovery of causal relationships from observational and
experimental data is a central problem with applications
across multiple areas of scientific endeavor. There has been
considerable progress over the past decades on algorithms
for eliciting causal relationships from data under a broad
range of assumptions
        <xref ref-type="bibr" rid="ref13 ref17 ref18">(Pearl, 2000; Spirtes et al., 2000;
Shimizu et al., 2006)</xref>
        . Most algorithms for causal discovery
assume propositional data where instances are independent
and identically distributed. However, in many real world
applications, these assumptions are violated because the
underlying data has a relational structure of the sort that is
modeled in practice by an entity-relationship model
        <xref ref-type="bibr" rid="ref2">(Chen,
1976)</xref>
        . There has been considerable work on learning
predictive models from relational data
        <xref ref-type="bibr" rid="ref3 ref4">(Getoor and Taskar,
2007)</xref>
        . Furthermore, researchers from different disciplines
have studied causal relationships and resulting phenomena
on relational world, e.g., peer effects
        <xref ref-type="bibr" rid="ref12 ref15">(Sacerdote, 2000;
Ogburn and VanderWeele, 2014)</xref>
        , social contagion
        <xref ref-type="bibr" rid="ref16 ref3 ref4">(Christakis
and Fowler, 2007; Shalizi and Thomas, 2011)</xref>
        , viral
marketing
        <xref ref-type="bibr" rid="ref6">(Leskovec et al., 2007)</xref>
        , and information diffusion
        <xref ref-type="bibr" rid="ref5">(Gruhl et al., 2004)</xref>
        .
      </p>
      <p>
        Motivated by the limitations of traditional approaches to
learning causal relationships from relational data, Maier
and his colleagues introduced the relational causal model
(RCM)
        <xref ref-type="bibr" rid="ref10">(Maier et al., 2010)</xref>
        and provided a sound and
complete causal structure learning algorithm, called the
relational causal discovery (RCD) algorithm
        <xref ref-type="bibr" rid="ref8 ref9">(Maier et al.,
2013a)</xref>
        , for inferring causal relationships from relational
data. The key idea behind RCM is that a cause and its
effects are in a direct or indirect relationship that is reflected
in the relational data. Traditional approaches for
reasoning on and learning of a causal model cannot be trivially
applied for relational causal model
        <xref ref-type="bibr" rid="ref8 ref9">(Maier et al., 2013a)</xref>
        .
Reasoning on an RCM to infer a relational version of
conditional independence (CI) makes use of a lifted
representation, called abstract ground graphs (AGGs), in which
traditional graphical criteria can be used to answer relational CI
queries. The lifted representation is employed as an internal
learning structure in RCD to reflect the inferred CI results
among relational version of variables. RCD makes use of a
new orientation rule designed specifically for RCM.
Motivation and Contributions RCM
        <xref ref-type="bibr" rid="ref10">(Maier et al.,
2010)</xref>
        offer an attractive model for representing,
reasoning about, and learning causal relationships implicit in
relational data.
        <xref ref-type="bibr" rid="ref1">Arbour et al. (2014)</xref>
        proposed a relational
version of propensity score matching method to infer
(relational) causal effects from observational data.
        <xref ref-type="bibr" rid="ref11">Marazopoulou et al. (2015)</xref>
        extended RCM to cope with
temporal relational data. They generalized both RCM and RCD to
Temporal RCM and Temporal RCD, respectively. A lifted
representation, called abstract ground graph (AGG), plays
a central role in reasoning with and learning of RCM.
The correctness of the algorithms proposed by Maier et al.
(2013a) for learning RCM and
        <xref ref-type="bibr" rid="ref11">Marazopoulou et al. (2015)</xref>
        for Temporal RCM, respectively, from observational data
rely on the soundness and completeness of AGG for
relational d-separation to reduce the learning of an RCM to
learning of an AGG. The main contributions of this
paper are as follows: (i) We show that AGG, as defined in
Maier et al. (2013b) does not correctly abstract all ground
graphs; (ii) We revise the definition of AGG to ensure that
it correctly abstracts all ground graphs; (iii) We further
show that AGG representation is not complete for
relational d-separation, that is, there can exist conditional
independence relations in an RCM that are not entailed by
AGG; and (iv) Based on a careful examination of the
relationship between the lack of completeness of AGG for
relational d-separation and faithfulness conditions suggests
that weaker notions of completeness, namely adjacency
faithfulness and orientation faithfulness between an RCM
and its AGG, can be used to learn an RCM from data.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>PRELIMINARIES</title>
      <p>
        We follow notational conventions introduced in
        <xref ref-type="bibr" rid="ref7 ref8 ref9">(Maier
et al., 2013a; ?,b; Maier, 2014)</xref>
        . An entity-relationship
model
        <xref ref-type="bibr" rid="ref2">(Chen, 1976)</xref>
        abstracts the entities (e.g., employee,
product) and relationships (e.g., develops) between entities
in a domain using a relational schema. The instantiation of
the schema is called a skeleton where entities form a
network of relationships (e.g., Quinn-develops-Laptop,
Rogerdevelops-Laptop). Entities and relationships have attributes
(e.g., salary of employees, success of products).
Cardinality constraints specify the cardinality of relationships that
an entity can participate in (e.g., many employees can
develop a product.).1 The following definitions are taken from
        <xref ref-type="bibr" rid="ref7">Maier (2014)</xref>
        :
Definition 1. A relational schema S is a tuple
hE , R, A, cardi: a set of entity classes E ; a set of
relationship classes R where Ri = hEji ijn=1 and n = |Ri| is arity
for Ri; attribute classes A where A (I ) is a set of attribute
classes of I 2 E [ R ; and cardinalities card : R ⇥ E !
{one, many}.
      </p>
      <p>
        Every relationship class Ri have two or more distinct
entity classes.2 We denote by I all item classes E [ R . We
denote by IX an item class that has an attribute class X
assuming, without loss of generality, that the attributes of
different item classes are disjoint. Participation of an entity
class Ej in a relationship class Ri is denoted by Ej 2Ri
1The examples are taken from
        <xref ref-type="bibr" rid="ref7">Maier (2014)</xref>
        .
      </p>
      <p>2In general, the same entity class can participate in a
relationship class in two or more different roles. For simplicity, we only
consider relationship classes only with distinct entity classes.
[Prod, Dev, Emp] .competence ! [Prod] .success</p>
      <p>[Emp] .competence ! [Emp] .salary
salary
competence
employee
develops
success
product</p>
      <p>
        Definition 2. A relational skeleton is an instantiation of
relational schema S , represented by a graph of entities and
relationships. Let (I ) denote a set of items of item class
I 2 Iin . Let ij , ik 2 such that ij 2 (Ij ), ik 2 (Ik),
and Ij , Ik 2 I, then we denote ij ⇠ ik if there exists an
edge between ij and ik in .
Relational causal model
        <xref ref-type="bibr" rid="ref10">(RCM, Maier et al., 2010)</xref>
        is a
causal model where causes and their effects are related
given an underlying relational schema. For example, the
success of a product depends on the competence of
employees who develop the product (see Figure 1). An RCM
models relational dependencies; each relational dependency
has a cause and its effect, which are represented by
relational variables; a relational variable is a pair consisting of
a relational path and an attribute.
      </p>
      <p>Definition 3. A relational path P = [Ij , . . . , Ik] is an
alternating sequence of entity class E 2 Eand relationship
class R 2 R. An item class Ij is called base class or
perspective and Ik is called a terminal class. A relational path
should satisfy:
1. for every [E, R] or [R, E], E 2R;
2. for every [E, R, E0], E 6= E0; and
3. for every [R, E, R0], if R = R0, then card (R, E) =
many.</p>
      <p>All valid relational paths on the given schema S are
debnyotPedi:bjy=P[SP.kW]jke=ideonr oPtei:th=e [lPenkg]|ktPh=|ioffoPr 1b y |Pi |, ajs u|bpPat|h,
and the reversed path by P˜ = [P|P |, . . . , P2, P1]. Note that
all subpaths of a relational path as well as the
corresponding reverse paths are valid. A relational variable P.X is
a pair of a relational path P and an attribute class X for
the terminal class of P . A relational variable is said to be
canonical if its relational path has a length equal to 1. A
relational dependency is of the form [Ij , . . . , Ik].Y ! [Ij ].X
such that its cause and effect share the same base class and
its effect is canonical.</p>
      <p>Given a relational schema S , a relational (causal) model
M⇥ is a pair of a structure M = hS , Di, where D is the
set of relational dependencies, and ⇥ is a set of
parameters. We assume acyclicity of the model so that the attribute
classes can be partially ordered based on D. The
parameters ⇥ define conditional distributions, p([I ].X |Pa([I ].X )),
for each pair (I , X ) where I 2 I, X 2 A(I ), and
Pa([I ].X ) is a set of causes of [I ].X , i.e., {P.Y |P.Y !
[I ].X 2D}. This paper focuses on the structure of RCM.
Hence we often omit parameters ⇥ from M.</p>
      <p>Terminal Set and Ground Graph Because a skeleton is
an instantiation of an underlying schema, a ground graph
is an instantiation of the underlying RCM given a skeleton
translating relational dependencies to every entity and
relationship in the skeleton. It is obtained by interpreting the
dependencies defined by the RCM on the skeleton using
the terminal sets of each of the instances in the skeleton.
Given a relational skeleton , the terminal set of a
relational path P given a base b 2 (P1), denoted by P |b, is
the set of terminal items reachable from b when we
traverse the skeleton along P . Formally, a terminal set P |b is
defined recursively, P 1:1|b = {b} and
P 1:`|b = {i 2</p>
      <p>S
(P`) | j 2P 1:` 1|b, i ⇠ j}\ 1 k&lt;`P 1:k|b.</p>
      <p>
        This implies that P 1:`|b and P |b will be disjoint for 1 
` &lt; |P |. Restricting the traversals so as not to revisit any
previously visited items corresponds to the bridge
burning semantics (hereinafter, BBS)
        <xref ref-type="bibr" rid="ref8 ref9">(Maier et al., 2013b)</xref>
        .
The instantiation of an RCM M for a skeleton yields
a ground graph which we denote by GGM . The vertices
of GGM are labeled by pairs of items and its attribute,
{i.X | I 2 I, i 2 (I ), X 2 A(I )}. There exists an edge
ij .X ! ik.Y in GGM such that ij 2 (Ij ), ik 2 (Ik),
Y 2 A(Ik), and X 2 A(Ij ) if and only if there exists a
dependency P.X ! [Ik].Y 2D such that ik 2P |ij .
In essence, RCM models dependencies on relational
domain as follows: Causal relationships are described from
the perspective of each item class; and are interpreted for
each items to determine its causes in a skeleton yielding a
ground graph. Since an RCM is defined on a given schema,
RCM is interpreted on a skeleton so that every ground
graph is an instantiation of the RCM.
      </p>
      <p>Throughout this paper, unless specified otherwise, we
assume a relational schema S , a set of relational
dependencies D, and an RCM M = hS , Di.
3</p>
    </sec>
    <sec id="sec-3">
      <title>REASONING</title>
    </sec>
    <sec id="sec-4">
      <title>WITH AN RCM</title>
      <p>
        An RCM can be seen as a meta causal model or a
template whose instantiation, a ground graph, corresponds to a
traditional causal model (e.g., a causal Bayesian network).
Reasoning with causal models relies on conditional
independence (CI) relations among variables. Graphical
criteria such as d-separation
        <xref ref-type="bibr" rid="ref13">(Pearl, 2000)</xref>
        are often exploited
to test CI given a model. Hence, the traditional definitions
and methods for reasoning with causal models need to be
“lifted” to the relational setting in order to be applicable to
relational causal models.
      </p>
      <p>
        Definition 4 (Relational d-separation
        <xref ref-type="bibr" rid="ref7">(Maier, 2014)</xref>
        ). Let
U, V, and W be three disjoint sets of relational variables
with the same perspective B 2 I defined over relational
schema S . Then, for relational model structure M, U and
V are d-separated by W if and only if, for all skeletons
2 ⌃ S , U|b and V|b are d-separated by W|b in ground
graph GGM for all b 2 (B).
      </p>
      <p>There are two things implicit in this definition: (i)
allground-graphs semantics which implies that d-separation
must be hold over all instantiations of the model; (ii) the
terminal set items of two different relational variables may
overlap (which we refer to as intersectability). In other
words, two relational variables U = P.X and V = P 0.X
of the same perspective B and the same attribute, are said
to be intersectable if and only if:
9 2 ⌃ S 9 b2 (B)P |b \ P 0|b 6= ; .
(1)
In order to allow testing of conditional independence on all
ground graphs, Maier et al. (2013a) introduced an abstract
ground graph (AGG), which abstracts all ground graphs
and is able to cope with the intersectability of relational
variables. We first recapitulate the original definition of
AGGs.
3.1</p>
      <sec id="sec-4-1">
        <title>ORIGINAL ABSTRACT GROUND GRAPHS</title>
        <p>
          An abstract ground graph AGGMB is defined for a given
relational model M and a perspective B 2 I
          <xref ref-type="bibr" rid="ref8 ref9">(Maier et al.,
2013a)</xref>
          , Since we fix the model, we omit the subscript M
and denote the abstract ground graph for perspective B by
AGGB . The resulting graph consists of two types of
vertices: RVB and IVB ; and two types of edges: RVEB and
IVEB .
        </p>
        <p>We denote by RVB the set of all relational variables (RV)
whose paths originate in B. We denote by RVEB the
set of all edges between the relational variables in RVB .
A relational variable edge (RVE) implies direct influence
arising from one or more dependencies in D. There is an
RVE P.X ! Q.Y if there exists a dependency R.X !
[IY ].Y 2 D that can be interpreted as a direct influence
from P.X to Q.Y from perspective B. Such an
interpretation is implemented by an extend function, which takes two
relational paths and produces a set of relational paths: If
P 2extend(Q, R), then there exists an RVE P.X ! Q.Y
P no1 Q
E1 Ra E2 Rb E3 Rb E2 Rc E4</p>
        <p>P</p>
        <p>Q
E1 Ra E2 Rc E4</p>
        <p>P</p>
        <p>
          Rb Q
E3
We denote by IVB the set of intersection variables (IVs),
i.e., unordered pairs of intersectable relational variables in
RVB. Given two RVs P.X and P 0.X that are intersectable
with each other, we denote the resulting intersection
variable by P.X \ P 0.X (Here, the intersection symbol ‘\ ’
denotes intersectability of the two relational variables). By
the definition
          <xref ref-type="bibr" rid="ref8 ref9">(Maier et al., 2013b)</xref>
          , if there exists an RVE
P.X ! Q.Y , then there exist edges P.X \ P 0.X ! Q.Y
and P.X ! Q.Y \ Q0.Y for every P 0 and Q0 intersectable
with P and Q, respectively. The IVs and the edges that
connect them with RVs (IVEs) correspond to indirect
influences (arising from intersectability) as opposed to direct
influence due to dependencies (which are covered by RVs
and RVEs). We denote by IVEB the set of all such edges
that connect RVs with IVs.
        </p>
        <p>Two AGGs with different perspectives share no vertices
nor edges. Hence, we view all AGGs, {AGGB}B2I , as
a collection or a single multi-component graph AGG =
SB2I AGGB. We similarly define RV, IV, RVE, and
IVE as the unions of their perspective-based counterparts.
For any mutually disjoint sets of relational variables U, V,
and W, one can test U ? V | W, conditional
independence admitted by the underlying probability
distribution, by checking U¯? ? V¯ | W¯ (traditional) d-separation
on an AGG3, where V¯ includes V and their related IVs,
V¯ = V [ { V \ T 2IV | V 2V}. Figure 3 illustrates
relational d-separation on an AGG.</p>
        <p>We later show that the preceding definition of AGG does
3We denote conditional independence by ‘? ’ in general. We
use ? ‘? ’ to represent (traditional) d-separation on a directed
acyclic graph., e.g., AGGM or GGM . Furthermore, we
parenthesize conditional independence and use a subscript to specify
the scope of the conditional independence, if necessary.</p>
        <sec id="sec-4-1-1">
          <title>U[Emp, Dev, Prod, Fund, Biz, Fund, Prod].succ</title>
          <p>[Emp].comp</p>
        </sec>
        <sec id="sec-4-1-2">
          <title>V [Emp, Dev, Prod].succ</title>
        </sec>
        <sec id="sec-4-1-3">
          <title>W[Emp, Dev, Prod, Fund, Biz].rev</title>
        </sec>
        <sec id="sec-4-1-4">
          <title>X[Emp, Dev, Prod, Dev, Emp].comp</title>
          <p>Y [[EEmmpp,,DDeevv,,PPrroodd,,FDuenvd\,,EBmizp,,FDuenvd,,PPrroodd]]..ssuucccc</p>
        </sec>
        <sec id="sec-4-1-5">
          <title>Z[Emp, Dev, Prod, Dev, Emp, Dev, Prod].succ</title>
          <p>
            not properly abstract all ground graphs; nor does it
guarantee the correctness of reasoning about relational
dseparation in an RCM. We revise the definition of AGG
(Section 3.2) so as to ensure that the resulting AGG
abstracts all ground graphs. However, we find that even with
the revised definition of AGG, the AGG representation is
not complete for relational d-separation, that is, there can
exist conditional independence relations in an RCM that
are not entailed by AGG (Section 4.1). A careful
examination of the lack of completeness of AGG for relational
d-separation with respect to causal faithfulness yields
useful insights that allow us to make use of weaker notions of
faithfulness to learn RCM from data (Section 4.2).
Because of the importance of IV and IVE in AGG in
reasoning about relational d-separation, it is possible that
errors in abstracting all ground graphs could lead to errors
in CI relations inferred from an AGG. We proceed to show
that 1) the criteria for determining intersectability
            <xref ref-type="bibr" rid="ref7">(Maier,
2014)</xref>
            are not sufficient, and 2) the definition of IVE, as
it stands, does not guarantee the soundness of AGG as an
abstract representation of the all ground graphs of an RCM.
We provide the necessary and sufficient criteria for
determining IVs and a sound definition for IVEs.
          </p>
        </sec>
      </sec>
      <sec id="sec-4-2">
        <title>3.2.1 Intersectability and IV</title>
        <p>
          The declarative characterization of intersectability (Eq. 1)
does not offer practical procedural criteria to determine
intersectability. Based on the criteria
          <xref ref-type="bibr" rid="ref7">(Maier, 2014)</xref>
          , two
different relational paths P and Q are intersectable if and only
if 1) they share the same perspective, say B 2 I, and 2)
they share the common terminal class, and 3) one path is
        </p>
        <p>R|ij \ P |b
6= ;
2) 9 2 ⌃ S 9 b2 (B)
P |b \ P 0|b 6= ;
3) 9 2 ⌃ S 9 b2 (B)9 ij2 Q|b</p>
        <p>R|ij \ P |b \ P 0|b 6= ;
not a prefix of the other. We will prove that the preceding
criteria are not sufficient. In essence, we will show the
conditions under which non-emptiness of P |b \ Q|b for any
b 2 (B) in any skeleton always contradicts the BBS.
For the proof, we define LLRSP(P, Q) (the length of the
longest required shared path) for two relational paths P
and Q of the common perspective as</p>
        <p>
          max{` | P 1:` = Q1:`, 8 2 ⌃ S 8 b2 (B) P 1:`|b = 1}.
LLRSP(P, Q) is computed as follows. Initially set ` = 1
since P1 = Q1. Repeat incrementing ` by 1 if P`+1 = Q`+1
and either P` 2 Ror P` 2 Ewith card(P`, P`+1) = one.
Lemma 1. Given a relational schema S, let P and Q
be two different relational paths satisfying the (necessary)
criteria of
          <xref ref-type="bibr" rid="ref7">Maier (2014)</xref>
          and |Q|  | P |. Let m and n
be LLRSP(P, Q) and LLRSP(P˜, Q˜), respectively. Then, P
and Q are intersectable if and only if m + n  | Q|.
Proof. See Appendix.
        </p>
        <p>
          The lemma demonstrates the criteria by
          <xref ref-type="bibr" rid="ref7">Maier (2014)</xref>
          do
not rule out the case of m + n &gt; |Q| where P and Q cannot
be intersectable.
3.2.2
        </p>
      </sec>
      <sec id="sec-4-3">
        <title>Co-intersectability and IVE</title>
        <p>
          Based on the definition
          <xref ref-type="bibr" rid="ref7">(Maier, 2014)</xref>
          , an IVE exists
between an IV, U \ V , and an RV, W , if and only if there
exists an RVE between U and W or V and W . It would
indeed be appealing to define IV, U\ V , such that it inherits
properties of the corresponding RVs, U and V . However,
the abstract ground graph resulting from such a definition
turns out to be not a sound representation of the underlying
ground graphs. We proceed to prove this result.
Definition 5 (Co-intersectability). Given a relational
schema S, let Q, R, P , and P 0 be valid relational paths
of the same perspective B where P 2Q on R and P and P 0
are intersectable. Then, a tuple hQ, R, P, P 0i is said to be
co-intersectable if and only if
We relate co-intersectability with the definition of IVE. Let
an RVE P.X ! Q.Y is due to some dependencies R.X !
[IY ].Y 2D where P 2Q on R. This implies
9 2 ⌃ S 9 b2 (B)9 ij2 Q|b R|ij \ P |b 6= ; ,
(4)
and there are edges from X of R|ij \ P |b to Y of Q|b in
GG . In order for the intersectability of P 0 with P
translates into an influence between P and Q, it is necessary that
there exists a skeleton that admits such influence. However,
we can construct a counterexample that satisfies the
necessary conditions for the existence of an RVE and the
conditions for intersectability but does not satisfy the
conditions for co-intersectability (see Figure 4 for a comparison
of Eq. 4, 1, and 3).
        </p>
        <p>Example 1. Let S be a relational schema where E =
{Ij , Ik, B, E1, E2, E3}, R = {Ri}i5=1 such that R1 =
hB, E1i, R2 = hE1, E3i, R3 = hE1, E2i, R4 =
hE2, E3, Iki, R5 = hIk, Ij i with the cardinality of each
relationship and each entity in the relationship being one.
Let
• Q = [B, R1, E1, R2, E3, R4, Ik, R5, Ij ],
• R = [Ij , R5, Ik, R4, E3, R2, E1, R3, E2, R4, Ik],
• P = [B, R1, E1, R3, E2, R4, Ik], and
• P 0 = [B, R1, E1, R2, E3, R4, Ik].</p>
        <p>Observe that
1. P 2extend (Q, R);
2. P 0 and P are intersectable; and
3. P 0 is a subpath of Q.</p>
        <p>This example satisfies Eq. 1 and Eq. 4. Assume for
contradiction that there exists a skeleton satisfying Eq. 3. Since,
in this example, the cardinality of each relationship and
each entity in the relationship is one, for each b 2 (B),
there exists only one ij 2Q|b and only one ik 2P |b. By
the assumption, P 0|b = {ik}. Since P 0 is a subpath of Q,
P 0|b will end at i0k = R1:3|ij (see Figure 5). Due to BBS,
R|ij \ R1:3|ij = ; , that is, {ik}\{ i0k} = ; . This contradicts
the assumption that ik = i0k.</p>
        <p>This counterexample clearly represents there is an
interdependency between intersection variables and RVEs.
Therefore, we revise the definition of IVE accompanying
co-intersectability.</p>
        <p>Definition 6 (IVE). There exists an IVE edge, P.X \
P 0.X ! Q.Y (or P.X ! Q.Y \ Q0.Y ), if and only if there
exists a relational path R such that R.X ! [IY ].Y 2 D,
P 2 Q on R, and hQ, R, P, P 0i (or hP, R˜, Q, Q0i) is
cointersectable.</p>
        <p>To determine IVEs, co-intersectability of a tuple can be
computed by solving a constraint satisfaction problem
involving four paths in the tuple.</p>
        <p>
          Implications of Co-intersectability We investigated the
necessary and sufficient criteria for intersectability and
revised the definition of IVE so as to guarantee that AGG
correctly abstracts all ground graphs as asserted (although
incorrectly) by Theorem 4.5.2
          <xref ref-type="bibr" rid="ref7">(Maier, 2014)</xref>
          . The new
criterion, called co-intersectability, is especially interesting
since it describes the interdependency between
intersection variables and related relational variable edges.
Several of the key results (e.g., soundness and completeness of
AGG for relational d-separation, Theorem 4.5.2) and
concepts (e.g., (B,h)-reachability) of
          <xref ref-type="bibr" rid="ref7">Maier (2014)</xref>
          are based
on independence between intersection variables and related
relational variable edges. Hence, it is useful to carefully
scrutinize the relationship between AGG and relational
dseparation.
4
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>NON-COMPLETENESS OF AGG FOR</title>
    </sec>
    <sec id="sec-6">
      <title>RELATIONAL D-SEPARATION</title>
      <p>We first revisit the definition of relational d-separation.
Given three disjoint sets of relational variables U, V, and
W of a common perspective B 2 I, U and V are
relational d-separated given W, denoted by (U ? V | W) ,
M
if and only if
8 2 ⌃ S 8 b2 (B) (U|b? ?</p>
      <p>V|b | W|b)GGM .</p>
      <p>
        From Theorem 4.5.4 of
        <xref ref-type="bibr" rid="ref7">(Maier, 2014)</xref>
        , the lifted
representation AGGM is said to be sound (or complete) for
relational d-separation of M if (traditional) d-separation
holds on the AGGM with a modified CI query only when
(or whenever) relational d-separation holds true. Then, the
completeness of AGG for relational d-separation can be
represented as
(U ?
      </p>
      <p>V | W)M )
¯
U? ?</p>
      <p>The completeness can be proved by the construction
of a skeleton 2 ⌃ S demonstrating d-connection
(U|b? 6? V|b | W|b)GGM for some b 2 (B) if (U¯? 6?
V¯ | W¯ )AGGM . In other words, we might disprove the
completeness by showing
iy
( U¯? 6?</p>
      <p>V | W¯ )AGGM ^
¯
8 2 ⌃ S 8 b2 (B) (U|? b ?</p>
      <p>V|b | W|b)GGM .
4.1</p>
      <sec id="sec-6-1">
        <title>A COUNTEREXAMPLE</title>
        <p>The following counterexample shows that AGG is not
complete for relational d-separation.</p>
        <p>Example. Let S = hE , R, A, cardi be a relational
schema such that: E = {Ei}i5=1; R = {Rj }j3=1 with
R1 = hE1, E2, E4i, R2 = hE2, E3i, and R3 =
hE3, E4, E5i; A = {E2 : {Y } , E3 : {X} , E5 : {Z}};
and 8 R2R 8 E2 Rcard (R, E) = one. Let M = hS, Di be
a relational model with</p>
        <p>D = {D1.X ! [IY ].Y, D2.Z ! [IY ].Y }
such that D1 = [E2, R2, E3, R3, E4, R1, E2, R2, E3] and
D2 = [E2, R2, E3, R3, E5]. Let P.X, Q.Y , S.Z, and S0.Z
be four relational variables of the same perspective B = E1
where their relational paths are distinct where
• P = [E1, R1, E2, R2, E3],
• Q = [E1, R1, E4, R3, E3, R2, E2],
• S = [E1, R1, E4, R3, E5], and
• S0 = [E1, R1, E2, R2, E3, R3, E5].</p>
        <p>Given the above example, we can make two claims.
Claim 1. P.X? 6? S0.Z | Q.Y .</p>
        <p>AGGM
Proof. See Appendix.</p>
        <p>Assuming that AGG is complete for relational
dseparation, we can infer (P.X 6? S0.Z | Q.Y )M and there
must exist a pair of a skeleton and a base b 2 (B)
that satisfies (P.X|b? 6? S0.Z|b | Q.Y |b)GGM . However,
we claim that such a skeleton and base may not exist.
iy
ix</p>
        <p>The counterexample demonstrates that a d-connection path
captured in an AGGM might not have a corresponding
d-connection path in any ground graph.</p>
        <p>Corollary 1. The revised (as well as the original) abstract
ground graph for an RCM is not complete for relational
d-separation.</p>
        <p>It is possible that an additional test can be utilized to check
whether there exists such a ground graph that can
represent a d-connection path captured in AGGM. However,
the efficiency of such an additional test is unknown and
designing such a test is beyond the scope of this paper.
4.2</p>
      </sec>
      <sec id="sec-6-2">
        <title>RELATING NON-COMPLETENESS WITH</title>
      </sec>
      <sec id="sec-6-3">
        <title>FAITHFULNESS</title>
        <p>In light of the preceding result that AGG is not complete
for relational d-separation, we proceed to examine the
relationship between an RCM and its lifted representation in
terms of the sets of conditional independence relationships
that they admit. In RCM, there are several levels of
relationship regarding the sets of conditional independence:
between the underlying probability distributions and the
ground graphs, between the ground graphs of an RCM and
the RCM, and between the RCM and its AGG:</p>
        <p>{p $ GGM⇥ } 2 ⌃ S $ M ⇥ $ AGGM
In RCM, the causal Markov condition and causal
faithfulness condition (see below) can be applied between a
ground graph GGM and its underlying probability
distribution p. Both conditions are assumed for learning an RCM
from relational data. Relational d-separation requires a set
of conditional independence of M⇥ using those deduced
from every ground graph GGM⇥ for every 2 ⌃ S . In
light of the lack of completeness of AGG for relational
d-separation, the set of conditional independence relations
admitted by M⇥ and its lifted representation AGGM are
not necessarily equivalent (see Corollary 1).</p>
        <p>
          We will relate M and AGGM using an analogy of causal
Markov condition and faithfulness
          <xref ref-type="bibr" rid="ref14 ref18">(Spirtes et al., 2000;
Ramsey and Spirtes, 2006)</xref>
          interpreting AGGM and M
as a DAG G and a distribution p, respectively. We first
recapitulate the definitions for causal Markov condition and
faithfulness.
        </p>
        <p>
          Definition 7 (Causal Markov Condition
          <xref ref-type="bibr" rid="ref14">(Ramsey and
Spirtes, 2006)</xref>
          ). Given a set of variables whose causal
structure can be represented by a DAG G, every
variable is probabilistically independent of its non-effects
(nondescendants in G) conditional on its direct causes (parents
in G).
        </p>
        <p>The causal Markov condition (i.e., local Markov
condition) is not directly translated into the relationship
between AGGM and M since they refer to different
variables. However, the soundness of AGGM for relational
d-separation of M (i.e., global Markov condition) would
be sufficient to interpret causal Markov condition between
AGGM and M. That is,</p>
        <p>¯
8 U,V,W 2 RV U? ?</p>
        <p>V | W¯
¯</p>
        <p>AGGM )
(U ? V | W )M
where U , V , and W are distinct relational variables sharing
a common perspective.</p>
        <p>
          Definition 8 (Causal Faithfulness Condition
          <xref ref-type="bibr" rid="ref14">(Ramsey and
Spirtes, 2006)</xref>
          ). Given a set of variables whose causal
structure can be represented by a DAG, no conditional
independence holds unless entailed by the causal Markov
condition.
        </p>
        <p>By the counterexample above, M is not strictly faithful to
AGGM, because more conditional independences hold in
M than those entailed by AGGM.
4.2.1</p>
      </sec>
      <sec id="sec-6-4">
        <title>Weaker Faithfulness Conditions</title>
        <p>
          <xref ref-type="bibr" rid="ref14">Ramsey and Spirtes (2006)</xref>
          showed that the two
weaker types of faithfulness – adjacency-faithfulness
and orientation-faithfulness – are sufficient to retrieve a
maximally-oriented causal structure from a data under the
causal Markov condition. What we have showed is that
there are more conditional independence hold in M than
those entailed by its corresponding AGGM. However, the
two weaker faithfulness conditions hold true (if they are
appropriately interpreted in an RCM and its lifted
representation).
        </p>
      </sec>
      <sec id="sec-6-5">
        <title>Adjacency-Faithfulness</title>
        <p>
          Definition 9 (Adjacency-Faithfulness
          <xref ref-type="bibr" rid="ref14">(Ramsey and Spirtes,
2006)</xref>
          ). Given a set of variables V whose causal structure
can be represented by a DAG G, if two variables X, Y are
adjacent in G, then they are dependent conditional on any
subset of V \ {X, Y }.
        </p>
        <p>Let U , V be two distinct relational variables of the same
perspective B. We limit U and V to be non-intersectable
to each other. Otherwise, they must not be adjacent to each
other by the definition of RCM since an edge between
intersectable relational variables yields a feedback in a ground
graph. If there is an edge U ! V in AGGM,
8 W✓ RVB\{U,V } (U 6?</p>
        <p>
          V | W)M
We can construct a skeleton 2⌃ S where its
corresponding ground graph GGM satisfies that U |b and V |b are
singletons and U |b [ V |b are disjoint to (RVB \ {U, V }) |b
for b 2 (B). Lemma 4.4.1 by
          <xref ref-type="bibr" rid="ref7">Maier (2014)</xref>
          describes
a method to construct a minimal skeleton to represent U
and V with a single b 2 (B). It guarantees that U |b
and V |b are singletons and every relational variable W 2
RVB \ {U, V } satisfies W |b \ U |b = ; and W |b \ V |b = ; .
        </p>
      </sec>
      <sec id="sec-6-6">
        <title>Orientation-Faithfulness</title>
        <p>
          Definition 10 (Orientation-Faithfulness
          <xref ref-type="bibr" rid="ref14">(Ramsey and
Spirtes, 2006)</xref>
          ). Given a set of variables V whose causal
structure can be represented by a DAG G, let hX, Y, Zi be
any unshielded triple in G.
(O1) if X ! Y Z, then X and Z are dependent given
any subset of V \ {X, Z} that contains Y ;
(O2) otherwise, X and Z are dependent conditional on any
subset of V \ {X, Z} that does not contain Y .
        </p>
        <p>Let U , V , and W be three distinct relational variables of
the same perspective B forming an unshielded triple in
AGGM. Similarly, V is not intersectable to both U and
W . The condition (O1) can be written as
8 T✓ RVB\{U,W }(U 6?</p>
        <p>W | T [ { V })M
if edges are oriented as U !
wise,
V</p>
        <p>W in AGGM.
Other8 T✓ RVB\{U,W }(U 6?</p>
        <p>W | T \ {V })M
for the condition (O2). Again, constructing a minimal
skeleton for U , V , and W guarantees that no T 2RVB \
{U, V, W } can represent any item in {U |b, V |b, W |b}.
Thus, the existence of V in the conditional determines
(in)dependence in the ground graph induced from the
minimal skeleton.</p>
        <p>Learning RCM with Non-complete AGG RCD
(Relational Causal Discovery, Maier et al. (2013a)) is an
algorithm for learning the structure of an RCM from relational
data. In learning RCM, AGG plays a key role: AGG is
constructed using CI tests to obtain the relational dependencies
of an RCM. The lack of completeness of AGG for
relational d-separation in RCM raises questions about the
correctness of RCD. A careful examination of AGG through
the lens of faithfulness suggests that adjacency-faithful and
orientation-faithful conditions can be applied to AGGM
to recover correct partially-oriented dependencies for an
RCM. However, it is still unclear whether RCD
recovers maximally-oriented dependencies with the acyclicity of
AGG (i.e., relational variables) not the acyclicity of RCM
(i.e., attribute classes). This raises the possibility of an
algorithm for learning the structure of an RCM from
relational data that does not require the intermediate step of
constructing a lifted representation.
5</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>CONCLUDING REMARKS</title>
      <p>
        There is a growing interest in relational causal models
        <xref ref-type="bibr" rid="ref1 ref10 ref11 ref7 ref8 ref9">(Maier et al., 2010, 2013a; ?; Maier, 2014; Arbour et al.,
2014; Marazopoulou et al., 2015)</xref>
        . A lifted representation,
called abstract ground graph (AGG), plays a central role
in reasoning with and learning of RCM. The correctness of
the algorithm proposed by Maier et al. (2013a) for learning
RCM from data relies on the soundness and completeness
of AGG for relational d-separation to reduce the learning
of an RCM to learning of an AGG. We showed that AGG,
as defined in
        <xref ref-type="bibr" rid="ref8 ref9">(Maier et al., 2013a)</xref>
        , does not correctly
abstract all ground graphs. We revised the definition of AGG
to ensure that it correctly abstracts all ground graphs. We
further showed that AGG representation is not complete for
relational d-separation, that is, there can exist conditional
independence relations in an RCM that are not entailed by
AGG. Our examination of the relationship between the lack
of completeness of AGG for relational d-separation and
faithfulness suggests that weaker notions of completeness,
namely adjacency faithfulness and orientation faithfulness
between an RCM and its AGG can be used to learn an
RCM from data. Work in progress is aimed at: 1)
identifying the necessary and sufficient criteria for guaranteeing the
completeness of AGG for relational d-separation; 2)
establishing whether the RCD algorithm outputs a
maximallyoriented RCM even when the completeness of AGG for
relational d-separation does not hold; and 3) devising a
structure learning algorithm that does not rely on a lifted
representation.
      </p>
    </sec>
    <sec id="sec-8">
      <title>APPENDIX</title>
      <p>We first prove Lemma 1 in Section 3.2.1.</p>
      <p>
        Lemma. Given a relational schema S , let P and Q be
two different relational paths satisfying the (necessary)
criteria of
        <xref ref-type="bibr" rid="ref7">Maier (2014)</xref>
        and |Q|  | P |. Let m and n be
LLRSP(P, Q) and LLRSP(P˜, Q˜), respectively. Then, P
and Q are intersectable if and only if m + n  | Q|.
Proof. (If part) If m + n  | Q|, then we can construct a
skeleton such that P |b \ Q|b 6= ; for some b 2 (P1)
by adding unique items for Q and for P m+1:|P | n and
complete the skeleton in the same manner as shown in
Lemma 3.4.1
        <xref ref-type="bibr" rid="ref7">(Maier, 2014)</xref>
        . Note that if m + n = |Q|,
then |P | | Q| + 2 since P 6= Q and a relational path is an
alternating sequence. This guarantees that there are at least
two items for P m+1:|P | n.
(Only if part) Let c be in P |b \ Q|b for some arbitrary
skeleton 2⌃ S and b 2 (P1). Then, there should be two lists
of items corresponding to P and Q sharing the first m and
the last n. The condition m + n &gt; |Q| implies Q|b is a
singleton set. We define
      </p>
      <p>p = hp1, . . . , pm, p|P | n+1, . . . , p|P |i
and</p>
      <p>q = hq1, . . . , q|Q|i,
where {q`} = Q1:`|b and {p`} = P 1:`|b for 1  `  m,
and p|P | l+1 2 P˜1:l|c for 1  l  n. We can see that
p1 = q1 = b and p|P | = q|Q| = c. Moreover,</p>
      <p>pm = qm = q|Q| (|Q| m) = p|P | (|Q| m)
by the definition of LLRSP. If |Q| &lt; |P |, then m 6= |P |
|Q| + m and mth item for P is repeated at |P | (|Q|
m)th, which violates the BBS. Otherwise, it is not the case,
since |P | = |Q| implies p = q and, hence, P = Q by the
definition of LLRSP, which contradicts the assumption that
P and Q are different relational paths.</p>
      <p>We provide proofs for two claims regarding the
counterexample in Section 4.1.</p>
      <p>Claim. P.X? 6?</p>
      <p>S0.Z | Q.Y</p>
      <p>AGGM
.</p>
      <p>Proof. By the definition of RVE, there are RVEs P.X !
Q.Y and Q.Y S.Z in AGGM since P = Q on6
D1 and S 2 Q on4 D2. Moreover, there is an IVE
Q.Y S.Z \ S0.Z in AGGM since 1) S and S0
are intersectable, 2) there is an RVE Q.Y S.Z,
and 3) hQ, D2, S, S0i is co-intersectable (see Figure 6).4
Since P.X ! Q.Y S.Z \ S0.Z and S.Z \
S0.Z 2 S0.Z, we derive P.X? 6? S0.Z | Q.Y ,
which implies P.X? 6? S0.Z | Q.Y AGGM
conditioning on Q.Y , compared to Q.Y , does not block
any possible d-connection paths between P.X to S0.Z
since there are only incoming edges to Q.Y . Finally,
P.X? 6? S0.Z | Q.Y holds.</p>
      <p>AGGM
. Furthermore,</p>
      <p>AGGM
Claim. There is no</p>
      <p>Proof. Suppose that there exist such a skeleton
b 2 (B) satisfying (P.X|b? 6?
and base
S0.Z|b | Q.Y |b)GGM .</p>
      <p>4Note that the original definition of AGGM does not check
co-intersectability and Q.Y S.Z \ S0.Z is granted.
Every terminal set for P , Q, and S0 given the base must
not be empty because of the definition of d-separation and
the fact that attribute classes X and Z are connected only
through Y (i.e., Y is a collider). Since every cardinality is
one, terminal sets must be singletons. Let {ix} = P.X|b,
{iy} = Q.Y |b, and {iz} = S0.Y |b. Furthermore, since ix
and iz must be d-connected given iy, GGM must have
two edges ix ! iy iz, which requires ix 2 D1|iy
and iz 2D2|iy . However, due to BBS and cardinality
constraints (i.e., one), there exists only one possible structure
(see Figure 7) where ix and iz are the cause of iy while
satisfying all previously mentioned conditions except {iz} =
S0.Y |b. In other words, the constraint {iz} = S0.Y |b
violates with the set of the rest of conditions. Hence, there
exists no such skeleton and base.</p>
      <sec id="sec-8-1">
        <title>Acknowledgments</title>
        <p>This research was supported in part by the Edward
Frymoyer Endowed Professorship held by Vasant Honavar and
in part by the Center for Big Data Analytics and Discovery
Informatics which is co-sponsored by the Institute for
Cyberscience, the Huck Institutes of the Life Sciences, and the
Social Science Research Institute at the Pennsylvania State
University.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Arbour</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marazopoulou</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garant</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Jensen</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>Propensity score matching for causal inference with relational data</article-title>
          .
          <source>In Proceedings of the UAI 2014 Workshop Causal Inference: Learning and Prediction</source>
          , pages
          <fpage>25</fpage>
          -
          <lpage>34</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>P. P.-S.</given-names>
          </string-name>
          (
          <year>1976</year>
          ).
          <article-title>The entity-relationship model - toward a unified view of data</article-title>
          .
          <source>ACM Transactions on Database Systems (TODS)</source>
          ,
          <volume>1</volume>
          (
          <issue>1</issue>
          ):
          <fpage>9</fpage>
          -
          <lpage>36</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Christakis</surname>
            ,
            <given-names>N. A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Fowler</surname>
            ,
            <given-names>J. H.</given-names>
          </string-name>
          (
          <year>2007</year>
          ).
          <article-title>The spread of obesity in a large social network over 32 years</article-title>
          . New
          <source>England journal of medicine</source>
          ,
          <volume>357</volume>
          (
          <issue>4</issue>
          ):
          <fpage>370</fpage>
          -
          <lpage>379</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Getoor</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Taskar</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          (
          <year>2007</year>
          ).
          <article-title>Introduction to statistical relational learning</article-title>
          . MIT press.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Gruhl</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guha</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liben-Nowell</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Tomkins</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2004</year>
          ).
          <article-title>Information diffusion through blogspace</article-title>
          .
          <source>In Proceedings of the 13th international conference on World Wide Web</source>
          , pages
          <fpage>491</fpage>
          -
          <lpage>501</lpage>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Leskovec</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Adamic</surname>
            ,
            <given-names>L. A.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Huberman</surname>
            ,
            <given-names>B. A.</given-names>
          </string-name>
          (
          <year>2007</year>
          ).
          <article-title>The dynamics of viral marketing</article-title>
          .
          <source>ACM Transactions on the Web (TWEB)</source>
          ,
          <volume>1</volume>
          (
          <issue>1</issue>
          ):
          <fpage>5</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Maier</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>Causal Discovery for Relational Domains: Representation, Reasoning, and Learning</article-title>
          .
          <source>PhD thesis</source>
          , University of Massachusetts Amherst.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Maier</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marazopoulou</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Arbour</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Jensen</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2013a</year>
          ).
          <article-title>A sound and complete algorithm for learning causal models from relational data</article-title>
          .
          <source>In Proceedings of the Twenty-ninth Conference on Uncertainty in Artificial Intelligence</source>
          , pages
          <fpage>371</fpage>
          -
          <lpage>380</lpage>
          , Bellevue, WA. AUAI Press.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Maier</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marazopoulou</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Jensen</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2013b</year>
          ).
          <article-title>Reasoning about independence in probabilistic models of relational data</article-title>
          . Approaches to Causal Structure Learning Workshop,
          <string-name>
            <surname>UAI</surname>
          </string-name>
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Maier</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taylor</surname>
          </string-name>
          , B.,
          <string-name>
            <surname>Oktay</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Jensen</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>Learning causal models of relational domains</article-title>
          .
          <source>In Proceedings of the Twenty-Fourth National Conference on Artificial Intelligence</source>
          , pages
          <fpage>531</fpage>
          -
          <lpage>538</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Marazopoulou</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Maier</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Jensen</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>Learning the structure of causal models with relational and temporal dependence</article-title>
          .
          <source>In Proceedings of the ThirtyFirst Conference on Uncertainty in Artificial Intelligence.</source>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Ogburn</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          and VanderWeele,
          <string-name>
            <surname>T.</surname>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>Causal diagrams for interference</article-title>
          .
          <source>Statistical Science</source>
          ,
          <volume>29</volume>
          (
          <issue>4</issue>
          ):
          <fpage>559</fpage>
          -
          <lpage>578</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Pearl</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2000</year>
          ).
          <article-title>Causality: models, reasoning and inference</article-title>
          , volume
          <volume>29</volume>
          . Cambridge Univ Press.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Ramsey</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Spirtes</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>Adjacency-faithfulness and conservative causal inference</article-title>
          .
          <source>In Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI-06)</source>
          , pages
          <fpage>401</fpage>
          -
          <lpage>408</lpage>
          . AUAI Press.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Sacerdote</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          (
          <year>2000</year>
          ).
          <article-title>Peer effects with random assignment: Results for Dartmouth roommates</article-title>
          .
          <source>Technical report, National bureau of economic research.</source>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Shalizi</surname>
            ,
            <given-names>C. R.</given-names>
          </string-name>
          <article-title>and</article-title>
          <string-name>
            <surname>Thomas</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>Homophily and contagion are generically confounded in observational social network studies</article-title>
          .
          <source>Sociological Methods Research</source>
          ,
          <volume>40</volume>
          (
          <issue>2</issue>
          ):
          <fpage>211</fpage>
          -
          <lpage>239</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Shimizu</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hoyer</surname>
            ,
            <given-names>P. O.</given-names>
          </string-name>
          , Hyva¨rinen, A., and
          <string-name>
            <surname>Kerminen</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>A linear non-Gaussian acyclic model for causal discovery</article-title>
          .
          <source>The Journal of Machine Learning Research</source>
          ,
          <volume>7</volume>
          :
          <fpage>2003</fpage>
          -
          <lpage>2030</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <surname>Spirtes</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Glymour</surname>
            ,
            <given-names>C. N.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Scheines</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          (
          <year>2000</year>
          ).
          <article-title>Causation, prediction, and search</article-title>
          , volume
          <volume>81</volume>
          . MIT press.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>