=Paper=
{{Paper
|id=Vol-2123/paper4
|storemode=property
|title=Mining Correlated Association Rules from Multi-Relational Data using Interval Patterns

|pdfUrl=https://ceur-ws.org/Vol-2123/paper4.pdf
|volume=Vol-2123
|authors=Hirohisa Seki,Masahiro Nagao
|dblpUrl=https://dblp.org/rec/conf/cla/SekiN18
}}
==Mining Correlated Association Rules from Multi-Relational Data using Interval Patterns
==
<pdf width="1500px">https://ceur-ws.org/Vol-2123/paper4.pdf</pdf>
<pre>
       Mining Correlated Association Rules from
      Multi-Relational Data using Interval Patterns

                           Hirohisa Seki and Masahiro Nagao?

                Dept. of Computer Science, Nagoya Institute of Technology
                      Gokiso-cho, Showa-ku, Nagoya 466-8555, Japan
                     seki@nitech.ac.jp, m.nagao.738@nitech.jp


          Abstract. In this paper, we consider the problem of mining numerical
          association rules (ARs) from a multi-relational database (MRDB). More
          specifically, we examine the effectiveness of numerical ARs with inter-
          val patterns (IPs) proposed by Kaytoue et al. in FCA (Formal Concept
          Analysis), and show that the MinIntChange algorithm by Kaytoue et
          al. can be readily extended to mine correlated interval-based ARs with
          the maximal significance in terms of the χ2 measure, by incorporating
          into the algorithm a pruning technique by Morishita et al. Moreover,
          since the search space for computing closed IPs becomes larger as the
          number of numerical attributes increases, we utilize Super CWC , an off
          the shelf feature selection algorithm to reduce the number of attributes
          to use. Our approach is experimentally evaluated and compared with
          the conventional methods such as a discretization-based approach or an
          optimization-based approach.


  1     Introduction

  Numerical data arise prevalently in databases, including business and scientific
  databases. Handling numerical (or quantitative) data in data mining has at-
  tracted much attention since the work on mining quantitative association rules
  by Srikant and Agrawal [16]. Conventionally, data discretization is commonly
  used to handle numerical data; for a quantitative attribute which can have con-
  tinuous values, it reduces the number of values by dividing the range of the
  attribute into intervals. The other approaches to handling numerical data have
  also been proposed, including a statistical distribution-based approach and an
  optimization-based approach (see the survey in [14]).
      Kaytoue et al. [8] proposed an FCA-based approach to handling quantitative
  attributes, and introduced the notions of closed interval patterns (CIPs) as well
  as generators. The notion of IPs is an instance of the general framework of pattern
  structures studied by Ganter and Kuznetsov [5]. Recently, some methods have
  been proposed to use CIPs for mining association rules [6,13]. In particular, the
  approach in [13] handles multi-relational data mining (MRDM); it uses CIPs
  for mining relational quantitative association rules of the form A → C, where
  ?
      Current Affiliation: Fujitsu Limited, Japan.


c paper author(s), 2018. Proceedings volume published and copyrighted by its editors.
  Paper published in Dmitry I. Ignatov, Lhouari Nourine (Eds.): CLA 2018, pp. 47–58,
  Department of Computer Science, Palacký University Olomouc, 2018. Copying
  permitted only for private and academic purposes.
48     Hirohisa Seki and Masahiro Nagao


A and C are relational patterns (i.e., logical conjunctions), and they consist of
categorical attributes and quantitative ones from a given multi-relational data.
In both work, although closed interval patterns allow us to represent intervals
concisely, the numbers of generated patterns (i.e., rules) are still large, which
makes the computation expensive and imposes a significant burden on the user’s
understanding.
     In this paper, we study the problem of mining optimal relational association
rules that have the maximum χ2 value between the assumption and the conclu-
sion of the rule. To find such rules, we use the original MinIntChange algorithm
by Kaytoue et al., and incorporate into it a pruning technique by Morishita et
al. [11]. Moreover, since the search space for computing closed IPs becomes larger
as the number of numerical attributes increases, we utilize Super CWC [15], an
off the shelf feature selection algorithm to reduce the number of attributes to use.
We give some experimental results, which show the effectiveness of the proposed
method.
     The organization of the rest of this paper is as follows. We first summarize
some basic notations and definitions of relational association rule mining and in-
terval patterns in Sect. 2. We then explain our approach to mining quantitative
association rules from multi-relational data in Sect. 3, and show some experi-
mental results in Sect. 4. Finally, we give a summary of this work in Sect. 5.


2     Relational Association Rules with Quantitative
      Attributes
2.1   Relational Pattern Mining and Interval Patterns
We use some basic notions of MRDM in [3]. To represent data and patterns, we
use a class of first-order logical formulas. An atom is an expression of the form
p(t1 , . . . .tn ), where p is a predicate and each ti is a term (i.e., a constant or a
variable). A substitution θ = {X1 /t1 , . . . , Xn /tn } is an assignment of terms to
variables. The result of applying a substitution θ to a formula (i.e., an atom or a
conjunction in this case) F is the formula F θ, where all occurrences of variables
Vi have been simultaneously replaced by the corresponding terms ti in θ. The
set of variables occurring in a formula F is denoted by Var (F ). A pattern is
expressed as a conjunction l1 ∧ · · · ∧ ln of atoms, denoted simply by l1 , . . . , ln .
    A database DB is a set of ground atoms. For a pattern C, let answerset(C; DB )
be the set of substitutions θ such that Cθ is logically entailed by a database DB ,
denoted by DB |= Cθ.
    In MRDM, we often specify one of the predicates as a key (e.g., [2,1]), which
determines the entities of interest and what is to be counted. The key (target)
is thus to be present in all patterns considered. Given a database DB and a
conjunction C containing a key atom key(X), the support (or frequency) of C,
denoted by supp(C), is defined to be the number of different keys that answer
C divided by the total number of keys. C is said to be frequent, if supp(C) is no
less than some user defined threshold minsup.
                  Mining Relational Correlation Rules using Interval Patterns       49


              customerId (c1 ).    marriedTo(c1 , c2 ).    income(c1 , 200 ).
              customerId (c2 ).    marriedTo(c2 , c1 ).    income(c2 , 120 ).
              customerId (c3 ).    marriedTo(c3 , c4 ).    income(c3 , 50 ).
              ...
              age(c1 , 30 ).       bigSpender (c1 ).
              age(c2 , 25 ).       bigSpender (c2 ).
              age(c3 , 55 ).
              ...


  Fig. 1. An Example Database DB with customerId as a key: adapted from [3].


    An association rule we consider in this paper is an existentially quantified
implication of the form: A → C, where A (resp., C) is a conjunction of the
form: a1 , . . . , am (resp., a single atom) (m ≥ 1). We call A (C) the antecedent
(conclusion) of the rule, respectively. The support of a (relational) association
rule is defined as the support of A ∧ C, while the confidence of an association
rule is defined as the support of C divided by the support of the antecedent
A. Following [7], we call a rule strong, if it satisfies both a minimum support
threshold (minsup) and a minimum confidence (minconf ).

Example 1. Consider a toy example of a multi-relational database DB in Fig. 1,
which is adapted and simplified from [3]. Predicate customerId is assumed to be a
key. Let P be a pattern of the form: customerId (X), age(X, Q1 ), marriedTo(X, Y ),
income(Y, Q2 ), whose meaning is obvious. Then, answerset(P ; DB ) contains sub-
stitutions {X/c1 , Q1 /30, Y /c2 , Q2 /120} and {X/c2 , Q1 /25, Y /c1 , Q2 /200}, for ex-
ample.
    The following rule is an example of association rules:

customerId (X), age(X, Q1 ), marriedTo(X, Y ), income(Y, Q2 ) → bigSpender (X).
                                                                            (1)

In the above, Q1 and Q2 are quantitative attributes, while the others are con-
sidered to be categorical ones.                                             2

    We call a variable corresponding to a quantitative (resp., categorical) at-
tribute a quantitative (resp., categorical ) variable. We also call a variable occur-
ring in a key predicate a key variable.


Relational Association Rules with Interval Patterns We use interval
patterns to specify constraints on quantitative variables in an association rule.
In the aforementioned association rule (1), for example, we consider the following
association rule with constraints consisting of interval patterns:

         customerId (X), age(X, Q1 ), marriedTo(X, Y ), income(Y, Q2 ),
            hQ1 , Q2 i ∈ h[l1 , u1 ], [l2 , u2 ]i → bigSpender (X).                 (2)
50        Hirohisa Seki and Masahiro Nagao


where li (ui ) is a value of the domain of the attribute Qi (i = 1, 2), respectively.
     Formally, let A be a conjunction such that A contains quantitative variables
Q1 , . . . , Qk (k ≥ 1). Then, we call an expression c of the form “hQ1 , . . . , Qk i ∈
hI1 , . . . , Ik i” an interval constraint of A, where Ii (1 ≤ i ≤ k) is an interval
pattern for Qi .
     Let θ be a substitution for Var (A) and Ii = [ei , fi ] for some ei and fi . Then,
DB |= (A, c)θ iff DB |= Aθ and Qi θ ∈ [ei , fi ] for i = 1, . . . , k.
     For simplicity, we write simply “A, I” instead of A, hQ1 , . . . , Qk i ∈ hI1 , . . . , Ik i,
where I = hI1 , . . . , Ik i and we call I an interval pattern of A.
     For a conjunction A which has no categorical variables except a key variable,
we can define a closed pattern in the same way as [8]; let S = {θ1 , · · · , θn } (n ≥ 0)
be a set of substitutions for variables in A and let Q1 , . . . , Qk be quantitative
variables in A. Then, we define a mapping δ(·) as follows: for a substitution θ ∈ S
such that θ ⊇ {Q1 /a1 , . . . , Qk /ak }, δ(θ) = h[a1 , a1 ], . . . , [ak , ak ]i. Namely, the
mapping δ maps θ into an k-dimensional interval pattern h[a1 , a1 ], . . . , [ak , ak ]i.
Definition 1 (closed pattern).
    For a conjunction A such that A has no categorical variables except a key
variable, let S = {θ1 , . . . , θn } (n ≥ 0) be a set of substitutions for variables in
A, and I an interval pattern of A. We consider the following two operators (·)2 :
    I 2 = {θ | θ ∈ answerset(A, I; DB )},
    S 2 = hδ(θ1 ) u · · · u δ(θn )i.1 .
    Let I and J be an interval pattern of A. Then, I and J are equivalent if
I 2 = J 2 and we write it by I ≡ J. We call I closed if there does not exist any
other interval pattern J such that I ≡ J and I v J.                                  2

2.2      Correlation Measures
Since the framework using the support/confidence only generates too many rules,
we usually use another measure to find “interesting” ones among the generated
rules. χ2 -value is such a measure to find correlated rules; it is defined as a
normalized derivation of observation from expectation. Given a contingency table
in Table 1, where given m and n are both assumed to be constants, χ2 values
are determined by x and y, and we thus denote it by χ2 (x, y). The following
property of χ2 -value is shown by Morishita et al. [11].
Lemma 1 (Morishita et al.). [11] Let r(I0 ) be a rule with an interval pat-
tern I0 , and let a (b, resp.) be the number of (positive) tuples that satisfy the
antecedent of r(I0 ). Let r(I) be a rule with an interval pattern I, and let p (q,
resp.) be the number of (positive) tuples that satisfy the antecedent of r(I) such
that 0 ≤ p ≤ a, 0 ≤ q ≤ b, q ≤ p and (p − q) ≤ (a − b). Then, we have
                           χ2 (p, q) ≤ max{χ2 (b, b), χ2 (a − b, 0)}.                         (3)
                                                                                               2
1
     For I1 = h[ai , bi ]ii∈{1,,̇k} and I2 = h[ei , fi ]ii∈{1,,̇k} , u is the infimum operator de-
     fined by I1 u I2 = h[min(ai , ei ), max(bi , fi )]ii∈{1,...,k} , and I2 v I1 ⇐⇒ [ei , fi ] ⊆
     [ai , bi ], ∀i ∈ {1, . . . , k}.
                   Mining Relational Correlation Rules using Interval Patterns         51


   The right-hand side of (3) gives an upper bound of χ2 (p, q), and we thus
denote it by ub(r(I)).


                  Table 1. Contingency Table for Rule r=A → C.


                         C is true     C is false           Sumrow
            A is true    sup(r)=y      x−y                  sup(A)=x
            A is false   m−y           n − x − (m − y)      sup(¬A)= n − x
            Sumcol       sup(C)=m      sup(¬C)= n − m       n


     QuantMiner [14], a GA-based algorithm, searches rules with high fitness func-
tion rules. The fitness function Fitness(·) is an evaluation measure for a rule, and
it is based on the Gain measure proposed in [4]: Gain(A → C) = supp(A ∧ C) −
min conf · supp(A). The Gain value is a measure giving a trade-off between sup-
port and confidence. Using x and y in Table 1, we write Gain(A → B) = G(x, y),
and G(x, y) is also a convex function.


3    Mining Quantitative ARs with IPs from a MRDB


Algorithm 1 shows the outline of our algorithm for mining correlated ARs with
interval patterns from a MRDB.
    Given a MRDB DB , the user first specifies a rule template of the form:
A → C; it specifies conjunctions occurring in the left-hand side and the right-
hand side, and the right-hand side contains a single target atom C. The user
also specifies values of categorical variables occurring in A and C. In case the
values of the categorical variables in the rule template are not given, its possible
values will be computed in the algorithm so that each of the categorical variable
is instantiated to some value in its domain.
    Next, we compute the answersets of A and A ∧ C, and we make the initial as-
sociation rule rinit of the form: A, I ⊥ → C, where I ⊥ is the minimal interval con-
straint of A. In the aforementioned rule (2), for example, I ⊥ = h[l1 , u1 ], [l2 , u2 ]i,
where li (ui ) (i = 1, 2) is the minimum (maximum) value of the domain of the
attribute Qi , respectively.
    If rinit is infrequent (i.e., its support supp(A ∧ C) is less than minsupp), then
exit. Otherwise, we compute the set R of strong rules with the best χ2 value on
                                         2
DB , by calling a function MIC+p(χ ) (rinit , 0).
52      Hirohisa Seki and Masahiro Nagao


  Algorithm 1: Correlated AR Mining from a MRDB
   input : a MRDB DB , minsupp, minconf .
   output: a set R of rules rb with the best χ2 value on DB .
 1 A rule template of the form: A → C is specified by the user; // initial
     step
 2 Compute answersets answerset(A; DB ) and answerset(A ∧ C; DB );
     // mining step for categorical attributes
                                                           ⊥
 3 Make an initial association rule rinit of the form: A, I → C;
 4 if A ∧ C is infrequent then return;
 5 Initialize R ← ∅; τ ← −∞, and compute correlated ARs by calling
               2
    MIC+p(χ ) (rinit , 0) ;
 6 return R
                           2
 7   Function MIC+p(χ ) (a rule r(I) : A, I → C, an integer j) : a set R of
      rules with the best χ2 value on Database DB is
 8      A ← {µi,α | µi,α is applicable to I for some i ≥ j, α ∈ {l, r}};
 9      foreach µi,α ∈ A do
10         I 0 ← µi,α (I);
11         if sup(r(I 0 )) < minsup or ub(r(I 0 )) < τ then continue
                   22
12         I1 ← I 0 ;
13         if I1 fails the canonicity test then continue
14         τ1 ← χ2 value of r(I1 );
15         if τ1 > τ and conf (r(I 0 )) ≥ minconf then τ ← τ1 ; R ← {r(I1 )}
16         else if τ1 = τ and conf (r(I 0 )) ≥ minconf then R ← R ∪ {r(I1 )};
                            2
17       call MIC+p(χ ) (r(I1 ), i);
18    end
19 end

                                2
    The function MIC+p(χ ) (r(I), j) is essentially the same as the MinIntChange
algorithm by Kaytoue et al. [8]; the enumeration of closed IPs is done in the
same way as the original MinIntChange. Namely, the algorithm generates its
direct subsumers whose supports are strictly lower than its support. New interval
patterns are generated by applying minimal changes to a given interval pattern
(line 10). Since a closed interval pattern may be generated several times, we
employ the canonicity test due to CloseByOne [10] (line 13).


Definition 2 (minimal change). [8]
    Let I be an interval pattern of a conjunction A, where I = h[a1 , b1 ], . . . , [ak , bk ]i
for some k ≥ 1.
    A right minimal change µi,r (I) (1 ≤ i ≤ k) is defined as I 0 , where I 0 is I
with its i-th interval replaced by [ai , v] such that v = max{x ∈ Vi | x < bi } and
Vi is the set of values which the quantitative variable Qi will take in a given
database. A left minimal change µi,l (I) is defined dually.
                  Mining Relational Correlation Rules using Interval Patterns     53


    A right minimal change µi,r is applicable to I if the resulting interval [ai , v]
does not collapse, i.e., v − ai > 0. An applicable left minimal change µi,l (I) is
defined dually.                                                                    2
             2
    MIC+p(χ ) incorporates into the original MinIntChange a pruning mechanism
(line 11) based on Lemma 1 and a mechanism storing rules with currently best
χ2 value (line 15–16). We then have the following properties:

Theorem 1 (Correctness of Algorithm 1). Let minsup be a given minimum
support and minconf a given minimum confidence. Let DB be a given database.
Then,

[Soundness] All output rules in R of Algorithm 1 give the best χ2 value on
   DB , and satisfy both minsup and minconf .
[Completeness] Let r(I) be an association rule of the form: A, I → C for some
   interval I. Then, if r gives the best χ2 value on DB and satisfies minsup and
   minconf , then Algorithm 1 outputs a rule r1 in R of the form: A, I1 → C
   such that I1 = I 22 .

Proof. Since the soundness is rather obvious, we omit its proof.
    For the completeness, we have from the assumption and the completeness of
MinIntChange that there exists a sequence s of minimal changes from the root
to I1 = (I)22 such that I1 passes the canonicity test, i.e., it is not pruned in
line 13. Furthermore, since the closure operator I 022 (line 12) does not change
its support, the computation corresponding to s is not pranced in line 11, either.
Therefore, we have that r(I1 ) ∈ R.                                             2

    We note that Algorithm 1 is generic in a sense that it works for another
                                              2
correlation measure, m, by replacing MIC+p(χ ) by MIC+p(m) , provided that
the correlation measure m has a property such as convexity so that it allows
us to compute an upper bound ub(r(I)) (line 11). Such correlation measures
include information gain, gini index and Gain, to mention a few.
    Although the proposed pruning method makes the IP search space smaller,
the search space becomes larger, when a given database has many quantitative
attributes. To handle such cases, we utilize Super CWC [15], an off the shelf
feature selection algorithm to reduce the number of attributes to use. We will
show some experimental results in the following section.


4   Experimental Results

We show in Table 2 some datasets used in our experiments; one is from the
UCI Machine Learning 2 and the others are from the CTU Prague Relational
Learning Repository [12]3 . We also show in Table 3 rule templates for those
datasets used to compute correlated association rules, where Qi (i = 1, 2) are
quantitative variables, while the other variables are categorical ones.
54        Hirohisa Seki and Masahiro Nagao


Table 2. Example Databases: † from the UCI Machine Learning Repository and the
others from the CTU Prague Repository [12]. #Relations: the number of tables in the
database. #Instances: the number of rows in the target table. Size: size in MB.

                  Database #Relations #Instances Size (MB) Domain
                 Mutagenesis        3           188           0.9     Medicine
                  Financial          8          682          94.1     Finance
                  Mondial           33          454           3.3    Geography
                   Heart†            1          270         0.016     Medicine


                   Table 3. Rule Templates for the Datasets in Table 2.

      Database                                  Rule Template
     Mutagenesis mol Id(X), ind 1 (A, C), eLumo(X, Q1 ), logP(X, Q2 ) → active(X)
      Financial loan(X), amount(X, Q1 ), duration(X, 60), avg salary(X, Y, Q2 ) → status(X, C)
      Mondial country(X), continent(X, Europe), agri(X, Q1 ), serv (X, Q2 ) → christian(X)
        heart    id(X), sc(X, Q1 ), max hra(X, Q2 ), cp t(X, T ) → disease(X, C)


   We have implemented our proposed method by using Java 8 on a PC with
an Intel Core i7 processor running at 2.30GHz, 8GB of main memory, working
under Windows 7 (64 bit). We have performed the following experiments varying
the thresholds min sup at fixed min conf = 0.6.

Effects of the Pruning Method To see the effects of the pruning method, we
present some results for the two datasets (mutagenesis and financial) in Figure 2.
The figures (left) show the numbers of strong rules generated in computing cor-
related rules for the rule templates in Table 3, where those categorical attributes
in each rule template take some values in their domains. The figures (right) show
the corresponding execution times in milliseconds.
    We have observed that the pruning method based on the branch-and-bound
heuristics enables us to generate much less rules compared with the naive ap-
proach (i.e., without pruning). The execution time of both cases are also reduced
accordingly.

Effects of the Number of Quantitative Attributes Next, to see the ef-
fects of the number of quantitative variables in mining correlated rules, we
consider two more rule templates from the rule template, R2 , for the Mondial
dataset in Table 3, by varying the number of quantitative variables from 2 to
4; namely, one is a rule template R3 obtained by adding industry(X, Q3 ) to the
antecedent of R2 , while the other rule template, R4 , is obtained similarly by
adding inflation(X, Q4 ) to the antecedent of R3 .
2
     http://archive.ics.uci.edu/ml/datasets/statlog+(heart).
3
     https://relational.fit.cvut.cz/.
                                      Mining Relational Correlation Rules using Interval Patterns                                             55


                    400                                      Naive                           1200                                   Naive
                                                             Pruning                                                                Pruning
                                                                                             1000


                                                                          #Exec. Time [ms]
#patterns [x10^3]
                    300
                                                                                              800
                    200                                                                       600
                                                                                              400
                    100
                                                                                              200
                      0                                                                         0
                          0       5       10            15        20                                0        5       10        15        20
                                      min_sup [%]                                                                min_sup [%]

                     800                                                                   1500
                                                             Naive                                                                  Naive
                                                             Pruning                       1250                                     Pruning
 #patterns [x10^3]


                                                                        #Exec. Time [ms]
                     600
                                                                                           1000
                     400                                                                     750
                                                                                             500
                     200
                                                                                             250
                       0                                                                       0
                              2       4        6         8        10                                     2       4        6     8        10
                                      min_sup [%]                                                                min_sup [%]

Fig. 2. #(Generated Strong Rules) and Execution Time for Computing Correlated
Rules for the Mutagenesis (above) and the Financial (below) Dataset. Rule templates
in Table 3 are used.


   The number of generated strong rules and the execution time of computing
the association rules are shown in Fig. 3. The number of different values in the
domain of each quantitative variable Qi are shown in Table 4. The numbers of
possible interval patternss made from the values of Q1 , Q2 , Q3 and Q4 thus could
become very large. However, the figure shows that the numbers of generated rules
using the pruning method only moderately increase. The pruning method thus
works well also in this case.


Table 4. Some Statistics of the Mondial Dataset in Table 2. #Values: # of different
values in dom(Qi ) (1 ≤ i ≤ 4).

                                            Q1 (agri.) Q2 (service) Q3 (industry) Q4 (inflation)
                                  #Values          27              29                               30               29
56                        Hirohisa Seki and Masahiro Nagao


                                                      Naive2                          2000                            Naive2
                                                      Naive3                                                          Naive3
                    600
#patterns [x10^3]


                                                                   #Exec. Time [ms]
                                                      Naive4                                                          Naive4
                                                      Pruning2                        1500                            Pruning2
                    400                               Pruning3                                                        Pruning3
                                                      Pruning4                        1000                            Pruning4

                    200                                                                500

                      0                                                                  0
                             2       4      6        8      10                                2      4     6       8        10
                                     min_sup [%]                                                     min_sup [%]

Fig. 3. #(Generated Strong Rules) and Execution Time for Computing Correlated
Rules for the Mondial Dataset. N -i (resp., P -i): the naive (resp., pruning) method for
rule Ri with i quantitative attributes (i = 2, 3, 4).


    The heart dataset in Table 2 contains 13 attributes; among them, 6 attributes
take numerical values. In this case, the MinIntChange algorithm generates a large
number of CIPs. In fact, we could not obtain outputs of our algorithm within a
reasonable time when applying it directly to the complete dataset. We alleviate
this problem by first choosing some of the numerical attributes from the dataset
by using Super CWC mentioned in Sect. 3, and then applying our algorithm to
this reduced dataset. Figure 4 shows the numbers of strong rules generated and
the execution time in computing correlated rules for an initial association rule.
   Table 5 shows some rules with the best χ2 values obtained by our algorithm
as well as the other approaches, i.e., QuantMiner [14], an optimization-based
approach, and CAIM [9], a data discretization approach. We notice that the rule
obtained by our algorithm has the best χ2 value in this case.


Table 5. Some Obtained Rules: An Example of the Heart Dataset. The Initial As-
sociation Rule: id (X), sc(X, Q1 ), max hra(X, Q2 ), cp t(X, 4), hQ1 , Q2 i ∈ hI1 , I2 i →
disease(X, 2). minsupp = 0.1, minconf = 0.6.

                                                 Interval Patterns hI1 , I2 i                (supp, conf ) χ2 value
                                 Closed IP h[164.0, 409.0], [71.0, 177.0]i (0.33, 0.75)                     81.7

                             QuantMiner h[234.0, 326.0], [122.0, 147.0]i (0.09, 0.85)                       20.2

                                  CAIM          h[126.0, 407.0], [71.0, 195.0]i (0.33, 0.70)                66.0
                               Mining Relational Correlation Rules using Interval Patterns                           57


                    1750                         Naive
                    1500                         Pruning
                                                                              3000                         Naive
#patterns [x10^3]
                                                                                                           Pruning
                    1250                                                      2500


                                                           #Exec. Time [ms]
                    1000                                                      2000
                     750                                                      1500
                     500                                                      1000
                     250                                                       500
                           2   4     6       8        10                             2   4     6       8        10
                               min_sup [%]                                               min_sup [%]

 Fig. 4. #(Generated Strong Rules) and Execution Time for Computing Corre-
 lated Rules for the Heart Dataset with Initial Association Rule: id (X), sc(X, Q1 ),
 max hra(X, Q2 ), cp t(X, 4), hQ1 , Q2 i ∈ hI1 , I2 i → disease(X, 2). minsupp = 0.1,
 minconf = 0.6.


 5                    Concluding Remarks

 In this paper, we have considered the problem of mining relational association
 rules, especially focusing on the use of closed interval patterns (CIPs) for find-
 ing correlated rules with the best χ2 values. Since the number of mined CIPs
 increases as the number of attributes and values in the domain of each attribute
 increases, we have examined the effectiveness of the original MinIntChange al-
 gorithm [8] with the pruning technique by Morishita et al. on the problem. We
 have also examined the effectiveness of the use of a feature selection algorithm,
 Super CWC , to reduce the search space of IPs.
     Most of the work in the field of MRDM have handled numerical data by using
 data discretization. To the best of our knowledge, there has been no approach
 which uses closed interval patterns for mining correlated association rules in
 multi-relational data.
     For future work, we will examine another correlation measure m in MIC+p(m) ,
 since our algorithm is generic and will work for another measure. Since the search
 space for CIPs is still large in general and the computation of CIPs is costly,
 we will need some method for reducing the computational time and space to a
 manageable size.


 Acknowledgment The authors would like to thank anonymous reviewers for
 their useful comments on the previous version of the paper. This work was
 partially supported by JSPS Grant-in-Aid for Scientific Research (C) 15K00305
 and 18K11432.
58     Hirohisa Seki and Masahiro Nagao


References
 1. De Raedt, L., Ramon, J.: Condensed representations for Inductive Logic Program-
    ming. In: Proc. KR’04, pp. 438–446 (2004)
 2. Dehaspe, L.: Frequent pattern discovery in first-order logic. Ph.D. thesis,
    Dept. Computer Science, Katholieke Universiteit Leuven (1998)
 3. Dzeroski, S.: Multi-Relational Data Mining: An Introduction. SIGKDD Explo-
    rations Newsletter 5(1), 1–16 (2003)
 4. Fukuda, T., Morimoto, Y., Morishita, S., Tokuyama, T.: Data mining using two-
    dimensional optimized association rules: Scheme, algorithms, and visualization. In:
    Proc. of ACM SIGMOD ’96. pp. 13–23 (1996)
 5. Ganter, B., Kuznetsov, S.: Pattern Structures and Their Projections. In: ICCS-01,
    LNCS, vol. 2120, pp. 129–142. Springer (2001)
 6. Guyet, T., Quiniou, R., Masson, V.: Mining relevant interval rules. In: Supplemen-
    tary Proceedings of ICFCA. pp. 79–82 (2017)
 7. Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann
    Publishers Inc., San Francisco, CA, USA (2006)
 8. Kaytoue, M., Kuznetsov, S.O., Napoli, A.: Revisiting numerical pattern mining
    with formal concept analysis. In: International Conference on Artificial Intelligence
    (IJCAI 2011). pp. 1342–1347 (2011)
 9. Kurgan, L.A., Cios, K.J.: CAIM discretization algorithm. IEEE Transactions on
    Knowledge and Data Engineering 16(2), 145–153 (Feb 2004)
10. Kuznetsov, S.O., Obiedkov, S.A.: Comparing performance of algorithms for gen-
    erating concept lattices. J. Exp. Theor. Artif. Intell. 14(2-3), 189–216 (2002)
11. Morishita, S., Nakaya, A.: Parallel branch-and-bound graph search for correlated
    association rules. In: Proc. of ACM SIGKDD Workshop on Large-Scale Parallel
    KDD Systems. pp. 265–276 (1999)
12. Motl, J., Schulte, O.: The CTU Prague Relational Learning Repository. ArXiv
    e-prints (Nov 2015)
13. Nagao, M., Seki, H.: On mining quantitative association rules from multi-relational
    data with FCA. In: Proc. IEEE IWCIA2016. pp. 81–86 (2016)
14. Salleb-Aouissi, A., Vrain, C., Nortet, C.: QuantMiner: A genetic algorithm for
    mining quantitative association rules. In: IJCAI. pp. 1035–1040 (2007)
15. Shin, K., Kuboyama, T., Hashimoto, T., Shepard, D.: Super-cwc and super-lcc:
    Super fast feature selection algorithms. In: proceedings of Big Data. pp. 61–67
    (2015)
16. Srikant, R., Agrawal, R.: Mining quantitative association rules in large relational
    tables. In: Proc. of ACM SIGMOD ’96. pp. 1–12 (1996)

</pre>