<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Boolean factors as a means of clustering of interestingness measures of association rules ?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Radim Belohlavek</string-name>
          <email>radim.belohlavek@acm.org</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dhouha Grissa</string-name>
          <email>dgrissa@isima.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sylvie Guillaume</string-name>
          <email>guillaum@isima.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Engelbert Mephu Nguifo</string-name>
          <email>mephu@isima.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jan Outrata</string-name>
          <email>jan.outrata@upol.cz</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>CNRS, UMR 6158, LIMOS</institution>
          ,
          <addr-line>F-63173 Aubiere</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Clermont Universite, Universite Blaise Pascal</institution>
          ,
          <addr-line>LIMOS, BP 10448, F-63000 Clermont-Ferrand</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Clermont Universite, Universite d'Auvergne</institution>
          ,
          <addr-line>LIMOS, BP 10448, F-63000 Clermont-Ferrand</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Data Analysis and Modeling Lab Department of Computer Science , Palacky University</institution>
          ,
          <addr-line>Olomouc 17. listopadu 12, CZ-77146 Olomouc</addr-line>
          ,
          <country country="CZ">Czech Republic</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>URPAH, Departement d'Informatique, Faculte des Sciences de Tunis</institution>
          ,
          <addr-line>Campus Universitaire, 1060 Tunis, Tunisie</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>Measures of interestingness play a crucial role in association rule mining. An important methodological problem is to provide a reasonable classi cation of the measures. Several papers appeared on this topic. In this paper, we explore Boolean factor analysis, which uses formal concepts corresponding to classes of measures as factors, for the purpose of classi cation and compare the results to the previous approaches.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        An important problem in extracting association rules, well known since the early
stage of association rule mining [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ], is the possibly huge number of rules
extracted from data. A general way of dealing with this problem is to de ne the
concept of rule interestingness: only association rules that are considered
interesting according to some measure are presented to the user. The most widely
used measures of interestingness are based on the concept of support and
condence. However, the suitability of these measures to extract interesting rules
was challenged by several studies, see e.g. [
        <xref ref-type="bibr" rid="ref34">34</xref>
        ]. Consequently, several other
interestingness measures of association rules were proposed, see e.g. [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ], [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ], [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ],
[
        <xref ref-type="bibr" rid="ref38">38</xref>
        ]. With the many existing measures of interestingness arises the problem of
selecting an appropriate one.
      </p>
      <p>
        To understand better the behavior of various measures, several studies of
the properties of measures of interestingness appeared, see e.g. [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ], [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ],
[
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. Those studies explore various properties of the measures that are considered
important. For example, Vaillant et al. [
        <xref ref-type="bibr" rid="ref37">37</xref>
        ] evaluated twenty interestingness
measures according to eight properties. To facilitate the choice of the user-adapted
interestingness measure, the authors applied the clustering methods on the
decision matrix and obtained ve clusters. Tan et al. [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ] studied twenty-one
interestingness measures through eight properties and showed that no measure is
adapted to all cases. To select the best interestingness measure, they use both a
support-based pruning and standardization methods. By applying a new
clustering approach, Huynh et al. [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] classifyed thirty-four interestingness measures
with a correlation analysis. Geng and Hamilton [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] made a survey of
thirtyeight interestingness measures for rules and summaries with eleven properties
and gived strategies to select the appropriate measures. D. R. Feno [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]
evaluated fteen interestingness measures with thirteen properties to describe their
behaviour. Delgato et al. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] provided a new study of the interestingness measures
by means of the logical model. In addition, the authors proposed and justi ed
the addition of two new principles to the three proposed by Piatetsky-Shapiro
[
        <xref ref-type="bibr" rid="ref32">32</xref>
        ]. Finally, Heravi and Zaiane [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] studied fty-three objective measures for
associative classi cation rules according to sixteen properties and explained that
no single measure can be introduced as an obvious winner.
      </p>
      <p>
        The assessment of measures according to their properties results in a
measureproperty binary matrix. Two studies of this matrix were conducted. Namely, [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]
describes how FCA can highlight interestingness measures with similar
behavior in order to help the user during his choice. [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] and [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] attempted to nd
natural clusters of measures using widely used clustering methods, the
agglomerative hierarchical method (AHC) and the K-means method. A common feature
of these methods is that they only produce disjoint clusters of measures. On
the other hand, one could naturally expect overlapping clusters. The aim of this
paper is to explore the possibility of obtaining overlapping clusters of measures
using factor analysis of binary data and to compare the results with the results
of other studies. In particular, we use the recently developed method from [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]
and take the discovered factors for clusters. The method uses formal concepts
as factors that makes it possible to interpret the factors easily.
2
2.1
      </p>
    </sec>
    <sec id="sec-2">
      <title>Preliminaries</title>
      <sec id="sec-2-1">
        <title>Binary (Boolean) data</title>
        <p>Let X be a set of objects (such as a set of customers, a set of functions or the
like) and Y be a set of attributes (such as a set of products that customers may
buy, a set of properties of functions). The information about which objects have
which attributes may formally be represented by a binary relation I between
X and Y , i.e. I X Y , and may be visualized by a table (matrix) that
contains 1s and 0s, according to whether the object corresponding to a row has
the attribute corresponding to a column (for this we suppose some orders of
objects and attributes are xed). We denote the entries of such matrix by Ixy.
A data of this type is called binary data (or Boolean data). The triplet hX; Y; Ii
is called a formal context in FCA but other terms are used in other areas.</p>
        <p>Such type of data appears in two roles in our paper. First, association rules,
whose interestingness measures we analyze, are certain dependencies over the
binary data. Second, the information we have about the interestingness measures
of association rules is in the form of binary data: the objects are interestingness
measures and the attributes are their properties.
2.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Association rules</title>
        <p>
          An association rule [
          <xref ref-type="bibr" rid="ref36">36</xref>
          ] over a set Y of attributes is a formula
        </p>
        <p>
          A ) B
(1)
where A and B are sets of attributes from Y , i.e. A; B Y . Let hX; Y; Ii be
a formal context. A natural measure of interestingness of association rules is
based on the notions of con dence and support. The con dence and support of
an association rule A ) B in hX; Y; Ii is de ned by
conf(A ) B) = jA# \ B#j
jA#j
and supp(A ) B) = jA# \ B#j ;
jXj
where C# for C Y is de ned by C# = fx 2 X j for each y 2 C : hx; yi 2 Ig.
An association rule is considered interesting if its con dence and support
exceed some user-speci ed thresholds. However, the support-con dence approach
reveals some weaknesses. Often, this approach as well as algorithms based on it
lead to the extraction of an exponential number of rules. Therefore, it is
impossible to validate it by an expert. In addition, the disadvantage of the support is
that sometimes many rules that are potentially interesting, have a lower support
value and therefore can be eliminated by the pruning threshold minsupp. To
address this problem, many other measures of interestingness have been proposed
in the literature [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ], mainly because they are e ective for mining potentially
interesting rules and capture some aspects of user interest. The most important
of those measures are subject to our analysis and are surveyed in Section 3.1.
Note that association rules are attributed to [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ]. However, the concept of
association rule itself as well as various measures of interestingness are particular cases
of what is investigated in depth in [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ], a book that develops logico-statistical
foundations of the GUHA method [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ].
2.3
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>Factor analysis of binary (Boolean) data</title>
        <p>Let I be an n
a decomposition
m binary matrix. The aim in Boolean factor analysis is to nd</p>
        <p>
          I = A
The inner dimension, k, in the decomposition may be interpreted as the number
of factors that may be used to describe the original data. Namely, Ail = 1 if
and only if the lth factor applies to the ith object and Blj = 1 if and only if
the jth attribute is one of the manifestations of the lth factor. The factor model
behind (2) has therefore the following meaning: The object i has the attribute j
if and only if there exists a factor l that applies to i and for which j is one of its
particular manifestations. We refer to [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] for further information and references
to papers that deal with the problem of factor analysis and decompositions of
binary matrices.
        </p>
        <p>
          In [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ], the following method for nding decompositions (2) with the number
k of factors as small as possible has been presented. The method utilizes formal
concepts of the formal context hX; Y; Ii as factors, where X = f1; : : : ; ng, Y =
f1; : : : ; mg (objects and attributes correspond to the rows and columns of I).
Let
        </p>
        <p>
          F = fhC1; D1i; : : : ; hCk; Dkig
be a set of formal concepts of hX; Y; Ii, i.e. hCl; Dli are elements of the concept
lattice B(X; Y; I) [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]. Consider the n k binary matrix AF and a k m binary
matrix BF de ned by
(AF )il = 1 iff i 2 Cl
and
(BF )lj = 1 iff j 2 Dl:
(3)
Denote by (I) the smallest number k, so-called Schein rank of I, such that a
decomposition of I exists with k factors. The following theorem shows that using
formal concepts as factors as in (3) enables us to reach the Schein rank, i.e. is
optimal [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]:
        </p>
        <sec id="sec-2-3-1">
          <title>Theorem 1. For every binary matrix I, there exists F</title>
          <p>I = AF BF and jF j = (I).</p>
        </sec>
        <sec id="sec-2-3-2">
          <title>B(X; Y; I) such that</title>
          <p>
            As has been demonstrated in [
            <xref ref-type="bibr" rid="ref3">3</xref>
            ], a useful feature of using formal concepts as
factors is the fact that formal concepts may easily be interpreted. Namely, every
factor, i.e. a formal concept hCl; Dli, consists of a set Cl of objects (objects are
measures of interestingness in our case) and a set Dl of attributes (properties of
measures in our case). Cl contains just the objects to which all the attributes
from Dl apply and Dl contains all attributes shared by all objects from Cl. From
a clustering point of view, the factors hCl; Dli may thus be seen as clusters Cl
with their descriptions by attributes from Dl. The factors thus have a natural,
easy to understand meaning. Since the problem of computing the smallest set
of factors is NP-hard, a greedy approximation algorithm was proposed in [3,
Algorithm 2]. This algorithm is utilized below in our paper.
          </p>
          <p>Clustering interestingness measures using Boolean
factors
3.1</p>
        </sec>
      </sec>
      <sec id="sec-2-4">
        <title>Measures of interestingness</title>
        <p>In the following, we present the interestingness measures reported in the
literature and recall nineteen of their most important properties that were proposed
in the literature.</p>
        <p>
          To identify interesting association rules and to enable the user to focus on
what is interesting for him, about sixty interestingness measures [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ], [
          <xref ref-type="bibr" rid="ref35">35</xref>
          ], [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]
were proposed in the literature. All of them are de ned using the following
parameters: p(XY ), p(XY ), p(XY ) and p(XY ), where p(XY ) = nXY represents
n
the number of objects satisfying XY (the intersection of X and Y ), and X is the
negation of X. The following are important examples of interestingness measures:
Lift [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]: Given a rule X ! Y , lift is the ratio of the probability that X and Y
occur together to the multiple of the two individual probabilities for X and Y ,
i.e.,
        </p>
        <p>
          Lift (X ! Y ) = p( Xp()XYp()Y ) :
If this value is 1, then X and Y are independent. The higher this value, the
more likely that the existence of X and Y together in a transaction is not just
a random occurrence, but because of some relationship between them.
Correlation coe cient [
          <xref ref-type="bibr" rid="ref31">31</xref>
          ]: Correlation is a symmetric measure evaluating
the strength of the itemsets' connection. It is de ned by
        </p>
        <p>p(XY ) p(X)p(Y ) :</p>
        <p>Correlation = pp(X)p(Y )p(X)p(Y )
A correlation around 0 indicates that X and Y are not correlated. The lower is
its value, the more negatively correlated X and Y are. The higher is its value,
the more positively correlated they are.</p>
        <p>
          Conviction [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]: Conviction is one of the measures that favor counter-examples.
It is de ned by
        </p>
        <p>
          Conviction = p(X)p(Y )
p(XY )
Conviction which is not a symmetric measure, is used to quatify the deviation
from independence. If its value is 1, then X and Y are independent.
MGK [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]: MGK is an interesting measure, which allows the extraction of
negative rules.
        </p>
        <p>MGK = p(Y =X) p(Y ) ; if X favorise Y</p>
        <p>1 p(Y )</p>
        <p>MGK = p(Y =pX(Y) )p(Y ) ; if X defavorise Y
It takes into account several situations of references: in the case where the rule
is situated in the attractive zone (i.e. p(Y =X) &gt; p(Y )), this measure evaluates
the distance between independence and logical implication. Thus, the higher the
value of MGK is close to 1, the more the rule is close to the logical implication
and the higher the value of MGK is close to 0, the more the rule is close to the
independence. In the case where the rule is located in the repulsive zone (i.e.
p(Y =X) &lt; p(Y )), MGK evaluates this time a distance between the independence
and the incompatibility. Thus, the closer the value of MGK is to 1, the more
similar to incompatibility the rule is; and the closer the value of MGK is to 0,
the closer to the independence the rule is.</p>
        <p>
          As was mentioned above, several studies [
          <xref ref-type="bibr" rid="ref35">35</xref>
          ], [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ], [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ], [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] were reported
in the literature on the various properties of interestingess measures to be able
to characterize and evaluate the interestingness measures. The main goal of
researchers in the domain is then to provide a user assistance in choosing the
best interestingness measure meeting his needs. For that, formal properties have
been developed [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ], [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ], [
          <xref ref-type="bibr" rid="ref35">35</xref>
          ], [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ], [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] in order to evaluate the interestingness
measures and to help users understanding their behavior. In the following, we
present nineteen properties reported in the literature.
3.2
        </p>
      </sec>
      <sec id="sec-2-5">
        <title>Properties of the measures</title>
        <p>
          The measure-property matrix describing interestingness measures by their
properties is depicted in Figure 2. It consists of 62 measures (61 measures from [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]
plus one more that has been studied recently) described by 21 properties
because the three-valued property P14 is represented by three yes-no properties
No. Property Ref.
P1 Intelligibility or comprehensibility of measure [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ]
P2 Easiness to x a threshold to the rule [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]
P3 Asymmetric measure. [
          <xref ref-type="bibr" rid="ref35">35</xref>
          ], [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]
P4 Asymmetric measure in the sense of the conclusion negation. [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ], [
          <xref ref-type="bibr" rid="ref35">35</xref>
          ]
P5 Measure assessing in the same way X ! Y and Y ! X in the logical [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]
implication case.
        </p>
        <p>
          P6 Measure increasing function the number of examples or decreasing func- [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ], [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]
tion the number of counter-examples.
        </p>
        <p>
          P7 Measure increasing function the data size. [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ], [
          <xref ref-type="bibr" rid="ref35">35</xref>
          ]
P8 Measure decreasing function the consequent/antecedent size. [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ], [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ]
P9 Fixed value a in the independence case. [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ], [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ]
P10 Fixed value b in the logical implication case. [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]
P11 Fixed value c in the equilibrium case. [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]
P12 Identi ed values in the attraction case between X and Y . [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ]
P13 Identi ed values in the repulsion case between X and Y . [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ]
P14 Tolerance to the rst counter-example. [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ], [
          <xref ref-type="bibr" rid="ref38">38</xref>
          ]
P15 Invariance in case of expansion of certain quantities. [
          <xref ref-type="bibr" rid="ref35">35</xref>
          ]
PP1167 DDeessiirreedd rreellaattiioonnsshhiipp bbeettwweeeenn XX !! YY aanndd XX !! YY raunlteisn.omic rules. [[3355]]
P18 Desired relationship between X ! Y and X ! Y rules. [
          <xref ref-type="bibr" rid="ref35">35</xref>
          ]
P19 Antecedent size is xed or random. [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]
P20 Descriptive or statistical measure. [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]
P21 Discriminant measure. [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]
        </p>
        <p>
          P14:1, P14:2, and P14:3. We computed the decomposition of the matrix using
Algorithm 2 from [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] and obtained 28 factors (as in the case below, several of them
may be disregarded as not very important; we leave the details for a full version
of this paper). In addition, we extended the original 62 21 binary matrix by
adding for every property its negation, and obtained a 62 42 binary matrix. The
reason for adding negated properties is due to our goal to compare the results
with the two clustering methods mentioned above and the particular role of the
properties and their negations in these clustering methods. From the 62 42
matrix, we obtained 38 factors, denoted F1; : : : ; F38. The factors are presented
in Figures 3 and 4. Figure 3 depicts the object-factor matrix describing the
interestingness measures by factors, Figure 4 depicts the factor-property matrix
explaining factors by properties of measures. Factors are sorted from the most
important to the least important, where the importance is determined by the
number of 1s in the input measure-property matrix covered by the factor [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ].
The rst factors cover a large part of the matrix, while the last ones cover only
a small part and may thus be omitted [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ], see the graph of cumulative cover of
the matrix by the factors in Figure 5.
4
        </p>
        <p>
          Interpretation and comparison to other approaches
The aim of this section is to provide an interpretation of the results described
in the previous section and compare them to the results already reported in the
literature, focusing mainly on [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. As was described in the previous section, 38
factors were obtained. The rst 21 of them cover 94 % of the input
measureproperty matrix (1s in the matrix), the rst nine cover 72 %, and the rst ve
Fig.2. Input binary matrix describing interestingness measures by their properties.
Fig. 3. Interestingness measures described by factors obtained by decomposition of the
input matrix from Figure 2 extended by negated properties.
Fig. 4. Factors obtained by decomposition of the input matrix from Figure 2 extended
by negated properties. The factors are described in terms of the original and negated
properties.
0
cover 52.4 %. Another remark is that the rst ten factors cover the whole set of
measures.
        </p>
        <p>
          Note rst that the Boolean factors represent overlapping clusters, contrary
to the clustering using the agglomerative hierarchical method and the K-means
method performed in [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. Namely, the clusterings are depicted in Figure 6
describing the Venn diagram of the rst ve Boolean factors (plus the eighth and
part of the sixth and tenth to cover the whole set of measures) and Figure 7,
which is borrowed from [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ], describing the consensus on the classi cation
obtained by the hierarchical and K-means clusterings. This consensus refunds the
classes C1 to C7 of the extracted measures, which are common to both
techniques.
        </p>
        <p>Due to lack of space, we focus on the rst four factors since they cover nearly
half of the matrix (45.1 %), and also because most of the measures appear at
least once in the four factors.</p>
        <p>Factor 1. The rst factor F1 applies to 20 measures, see Figure 3, namely:
correlation, Cohen, Pavillon, conviction, Bayes factor, Loevinger, collective strength,
information gain, Goodman, interest, Klosgen, Mgk, YuleQ, relative risk, one
way support, two way support, YuleY, Zhang, novelty, and odds ratio. These
measures share the following 9 properties: P4, P7, P9, not P11, P12, P13, not
P19, not P20, P21, see Figure 4.</p>
        <p>Interpretation. The factor applies to measures whose evolutionary curve
increases w.r.t the number of examples and have a xed point in the case of
independence (this allows to identify the attractive and repulsive area of a rule).
The factor also applies only to descriptive and discriminant measures that are
not based on a probabilistic model.</p>
        <p>
          Comparison. When looking at the classi cation results reported in [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ], F1
covers two classes from [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]: C6 and C7, which together contain 15 measures.
Those classes are closely related within the dendrogram obtained with the
agglomerative hierarchical clustering method used in [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. The 5 missing measures
form a class obtained with K-means method in [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] with Euclidian distance.
        </p>
        <p>Factor 2. F2 applies to 18 measures, namely: con dence, causal con dence,
Ganascia, causal con rmation, descriptive con rmation, cosine, causal
dependency, Laplace, least contradiction, precision, recall, support, causal con rmed
con dence, Czekanowski, negative reliability, Leverage, speci city, and causal
support. These measures share the following 11 properties: P4, P6, not P9, not
P12, not P13, P14.2, not P15, not P16, not P19, not P20, P21.</p>
        <p>Interpretation. The factor applies to measures whose evolutionary curve
increases w.r.t. the number of examples and has a variable point in the case of
independence, which implies that the attractive and repulsive areas of a rule
are not identi able. The factor also applies only to measures that are not
discriminant, are indi erent to the rst counter-examples, and are not based on a
probabilistic model.</p>
        <p>
          Comparison. F2 corresponds to two classes, C4 and C5 reported in [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ].
C4 [ C5 contains 22 measures. The missing measures are: Jaccard,
Kulczynski, examples and counter-examples rate and Sebag. Those measures are not
covered by F2 since they are not indi erent to the rst counter-examples.
        </p>
        <p>Factor 3. F3 applies to 10 measures, namely: coverage, dependency, weighted
dependency, implication index, Jmeasure, Pearl, prevalence, Gini, variation
support, and mutual information. These measures share the following 10 properties:
not P6, not P8, not P10, not P11, not P13, not P14.1, not P15, not P16, not
P17, not P19.</p>
        <p>Interpretation. The factor applies to measures whose evolutionary curve does
not increase w.r.t. the number of examples.</p>
        <p>
          Comparison. F3 corresponds to class C3 reported in [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ], which contains
8 measures. The two missing measures, variation support and Pearl, belong
to the same classes obtained by both K-means and the hierarchical method.
Moreover, these two missing measures are similar to those from C3 obtained
by the hierarchical method since they merge with the measures in C3 at the
next level of the generated dendrogram. Here, there is a strong correspondence
between results obtained using Boolean factors and the ones reported in [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ].
        </p>
        <p>Factor 4. F4 applies to 9 measures, namely: con dence, Ganascia, descriptive
con rmation, IPEE, IP3E, Laplace, least contradiction, Sebag, and examples and
counter-examples rate. These measures share the following 12 properties: P3, P4,
P6, P11, not P7, not P8, not P9, not P12, not P13, not P15, not P16, not P18.</p>
        <p>Interpretation. The factor applies to measures whose evolutionary curve
increases w.r.t. the number of examples and has a xed value in the equilibrium
case. As there is no xed value in the independence case, we can not get an
identi able area in the case of attraction or repulsion.</p>
        <p>
          Comparison. F4 mainly applies to measures of class C5 obtained in [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. The
two missing measures, IPEE et IP3E, belong to a di erent class.
5
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Conclusions and further issues</title>
      <p>We demonstrated that Boolean factors provide us with clearly interpretable
meaningful clusters of measures among which the rst ones are highly similar
to other clusters of measures reported in the literature. Contrary to other
clustering methods, Boolean factors represent overlapping clusters. We consider this
an advantage because overlapping clusters are a natural phenomenon in human
classi cation. We presented preliminary results on clustering the measures using
Boolean factors. Due to limited scope, we presented only parts of the results
obtained and leave other results for a full version of this paper.</p>
      <p>An interesting feature of the presented method, to be explored in the future,
is that the method need not start from scratch. Rather, one or more clusters,
that are considered important classes of measures, may be supplied at the start
and the method may be asked to complete the clustering. Another issue left
for future research is the bene t of the clustering of measures for a user who
is interested in selecting a type of measure, rather than a particular measure
of interestingness of association rules. In the intended scenario, a user may use
various interestingness measures that belong to di erent classes of measures.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Agrawal</surname>
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Imielinski</surname>
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Swami</surname>
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Mining association rules between sets of items in large databases</article-title>
          .
          <source>Proc. ACM SIGMOD</source>
          <year>1993</year>
          ,
          <volume>207</volume>
          {
          <fpage>216</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Agrawal</surname>
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Srikant</surname>
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Fast algorithms for mining association rules</article-title>
          .
          <source>Proc. VLDB Conf</source>
          .
          <year>1994</year>
          ,
          <volume>478</volume>
          {
          <fpage>499</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Belohlavek</surname>
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vychodil</surname>
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Discovery of optimal factors in binary data via a novel method of matrix decomposition</article-title>
          .
          <source>J. of Computer and System Sciences</source>
          <volume>76</volume>
          (
          <issue>1</issue>
          )(
          <year>2010</year>
          ),
          <volume>3</volume>
          {
          <fpage>20</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Blanchard</surname>
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guillet</surname>
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Briand</surname>
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gras</surname>
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Assessing rule with a probabilistic measure of deviation from equilbrium</article-title>
          .
          <source>In Proc. Of 11th International Symposium on Applied Stochastic Models and Data Analysis ASMDA</source>
          <year>2005</year>
          , Brest, France,
          <volume>191</volume>
          {
          <fpage>200</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Blanchard</surname>
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guillet</surname>
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Briand</surname>
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gras</surname>
            <given-names>R.</given-names>
          </string-name>
          : IPEE:
          <article-title>Indice Probabiliste d'Ecart a l'Equilibre pour l'evaluation de la qualite des regles</article-title>
          .
          <source>Dans l'Atelier Qualite des Donnees et des Connaissances</source>
          <year>2005</year>
          ,
          <volume>26</volume>
          {
          <fpage>34</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Brin</surname>
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motwani</surname>
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Silverstein</surname>
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Beyond Market Baskets: Generalizing Association Rules to Correlations</article-title>
          .
          <source>In Proc. of the ACM SIGMOD Conference</source>
          , Tucson, Arizona,
          <year>1997</year>
          ,
          <volume>265</volume>
          {
          <fpage>276</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Carpineto</surname>
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Romano</surname>
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>Concept Data Analysis</article-title>
          .
          <article-title>Theory and Applications</article-title>
          . J. Wiley,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Davey</surname>
            <given-names>B. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Priestley</surname>
            <given-names>H.</given-names>
          </string-name>
          :
          <article-title>Introduction to Lattices and Order</article-title>
          . Cambridge University Press, Oxford,
          <year>1990</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Delgado</surname>
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ruiz D</surname>
          </string-name>
          .-L.,
          <string-name>
            <surname>Sanchez</surname>
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Studying Interest measures for association rules through a logical model</article-title>
          .
          <source>International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 18(1)</source>
          (
          <year>2010</year>
          ), World Scienti c,
          <volume>87</volume>
          {
          <fpage>106</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Feno</surname>
            <given-names>D.R.</given-names>
          </string-name>
          :
          <article-title>Mesures de qualite des regles d'association: normalisation et caracterisation des bases</article-title>
          .
          <source>PhD thesis</source>
          , Universite de La Reunion,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Ganter</surname>
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wille</surname>
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Formal Concept Analysis</article-title>
          .
          <source>Mathematical Foundations</source>
          . Springer, Berlin,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Geng</surname>
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hamilton H.J.:</surname>
          </string-name>
          <article-title>Choosing the Right Lens: Finding What is Interesting in Data Mining</article-title>
          .
          <source>Quality Measures in Data Mining</source>
          <year>2007</year>
          ,
          <source>ISBN 978-3-540-44911-9</source>
          ,
          <issue>3</issue>
          {
          <fpage>24</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Geng</surname>
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hamilton</surname>
            <given-names>H. J.:</given-names>
          </string-name>
          <article-title>Interestingness measures for data mining: A Survey</article-title>
          .
          <source>ACM Comput. Surveys</source>
          <volume>38</volume>
          (
          <issue>3</issue>
          )(
          <year>2006</year>
          ),
          <volume>1</volume>
          {
          <fpage>31</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Guillaume</surname>
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grissa</surname>
            <given-names>D.</given-names>
          </string-name>
          , Mephu Nguifo E.:
          <article-title>Categorisation des mesures d'inter^et pour l'extraction des connaissances</article-title>
          .
          <source>Revue des Nouvelles Technologies de l'Information</source>
          ,
          <year>2011</year>
          , to appear (previously available as
          <source>Technical Report RR-10-14</source>
          , LIMOS, ISIMA,
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Guillaume</surname>
            <given-names>S.:</given-names>
          </string-name>
          <article-title>Traitement des donnees volumineuses. Mesures et algorithmes d'extraction des regles d'association et regles ordinales</article-title>
          .
          <source>PhD thesis</source>
          . Universite de Nantes, France,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Guillaume</surname>
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grissa</surname>
            <given-names>D.</given-names>
          </string-name>
          , Mephu Nguifo E.:
          <article-title>Proprietes des mesures d'inter^et pour l'extraction des regles</article-title>
          .
          <source>Dans l'Atelier Qualite des Donnees et des Connaissances</source>
          , EGC'
          <year>2010</year>
          ,
          <year>2010</year>
          , Hammamet-Tunisie, http://qdc2010.lri.fr/fr/actes.php,
          <volume>15</volume>
          {
          <fpage>28</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Grissa</surname>
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guillaume</surname>
            <given-names>S.</given-names>
          </string-name>
          , Mephu Nguifo E.:
          <article-title>Combining Clustering techniques and Formal Concept Analysis to characterize Interestingness Measures</article-title>
          .
          <source>CoRR abs/1008.3629</source>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Hajek</surname>
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Havranek</surname>
            <given-names>T.</given-names>
          </string-name>
          : Mechanizing Hypotheses Formation. Springer,
          <year>1978</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Hajek</surname>
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Holena</surname>
            ,
            <given-names>Rauch J.:</given-names>
          </string-name>
          <article-title>The GUHA method and its meaning for data mining</article-title>
          .
          <source>J. Computer and System Sciences</source>
          <volume>76</volume>
          (
          <year>2010</year>
          ),
          <volume>34</volume>
          {
          <fpage>48</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Hilderman</surname>
            <given-names>R. J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hamilton</surname>
            <given-names>H. J.</given-names>
          </string-name>
          :
          <source>Knowledge Discovery and Measures of Interest</source>
          , Volume
          <volume>638</volume>
          of The International Series in Engineering and Computer Science
          <volume>81</volume>
          (
          <issue>2</issue>
          )(
          <year>2001</year>
          ), Kluwer.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Huynh X.-H.</surname>
          </string-name>
          ,
          <string-name>
            <surname>Guillet</surname>
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Briand</surname>
            <given-names>H.</given-names>
          </string-name>
          :
          <article-title>Clustering Interestingness Measures with Positive Correaltion</article-title>
          .
          <source>ICEIS (2)</source>
          (
          <year>2005</year>
          ),
          <volume>248</volume>
          {
          <fpage>253</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Heravi</surname>
            <given-names>M. J.</given-names>
          </string-name>
          , Zaane
          <string-name>
            <surname>O. R.:</surname>
          </string-name>
          <article-title>A study on interestingness measures for associative classi ers</article-title>
          .
          <source>SAC</source>
          (
          <year>2010</year>
          ),
          <volume>1039</volume>
          {
          <fpage>1046</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Lallich</surname>
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Teytaud</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>Evaluation et validation de mesures d'inter^et des regles d'association</article-title>
          . RNTI-E-
          <volume>1</volume>
          , numero special
          <year>2004</year>
          ,
          <volume>193</volume>
          {
          <fpage>217</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Lenca</surname>
            <given-names>P</given-names>
          </string-name>
          , Meyer P.,
          <string-name>
            <surname>Picouet</surname>
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vaillant</surname>
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lallich</surname>
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Criteres d'evaluation des mesures de qualite en ecd</article-title>
          . Revue des Nouvelles Technologies de l'
          <string-name>
            <surname>Information (Entreposage</surname>
          </string-name>
          et Fouille de donnees) (
          <volume>1</volume>
          )(
          <year>2003</year>
          ),
          <volume>123</volume>
          {
          <fpage>134</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25. Lenca P., Meyer P.,
          <string-name>
            <surname>Vaillant</surname>
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lallich</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>A multicriteria decision aid for interestingness measure selection</article-title>
          .
          <source>Technical Report LUSSI-TR-2004-01-EN</source>
          , Dpt. LUSSI,
          <string-name>
            <surname>ENST</surname>
          </string-name>
          <article-title>Bretagne 2004 (chapter 1).</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Liu</surname>
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mi</surname>
            <given-names>J.-S.:</given-names>
          </string-name>
          <article-title>A novel approach to attribute reduction in formal concept lattices</article-title>
          .
          <source>RSKT 2006, Lecture Notes in Arti cial Intelligence</source>
          <volume>4062</volume>
          (
          <year>2006</year>
          ),
          <volume>522</volume>
          {
          <fpage>529</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Maddouri</surname>
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gammoudi</surname>
            <given-names>J</given-names>
          </string-name>
          .:
          <source>On Semantic Properties of Interestingness Measures for Extracting Rules from Data. Lecture Notes in Computer Science</source>
          <volume>4431</volume>
          (
          <year>2007</year>
          ),
          <volume>148</volume>
          {
          <fpage>158</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Maier</surname>
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>The Theory of Relational Databases</article-title>
          . Computer Science Press, Rockville,
          <year>1983</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29. Pawlak
          <string-name>
            <surname>Z.</surname>
          </string-name>
          :
          <article-title>Rough sets</article-title>
          .
          <source>Int. J. Information and Computer Sciences</source>
          <volume>11</volume>
          (
          <issue>5</issue>
          )(
          <year>1982</year>
          ),
          <volume>341</volume>
          {
          <fpage>356</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30. Pawlak
          <string-name>
            <surname>Z.</surname>
          </string-name>
          :
          <article-title>Rough Sets: Theoretical Aspcets of Reasoning About Data</article-title>
          . Kluwer, Dordrecht,
          <year>1991</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31.
          <string-name>
            <surname>Pearson</surname>
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Mathematical contributions to the theory of evolution, regression, heredity and panmixia</article-title>
          .
          <source>Philosophical Trans. of the Royal Society A</source>
          (
          <year>1896</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          32.
          <string-name>
            <surname>Piatetsky-Shapiro</surname>
            <given-names>G</given-names>
          </string-name>
          .:
          <article-title>Discovery, Analysis and Presentation of Strong Rules</article-title>
          . In G.
          <string-name>
            <surname>Piatetsky-Shapiro</surname>
          </string-name>
          &amp; W.J. Frawley, editors: Knowledge Discovery in Databases. AAAI Press,
          <year>1991</year>
          ,
          <volume>229</volume>
          {
          <fpage>248</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          33. Polkowski L.:
          <source>Rough Sets: Mathematical Foundations</source>
          . Springer,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          34.
          <string-name>
            <surname>Sese</surname>
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morishita</surname>
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Answering the most correlated n association rules e ciently</article-title>
          .
          <source>In Proceedings of the 6th European Conf on Principles of Data Mining and Knowledge Discovery</source>
          <year>2002</year>
          , Springer-Verlag,
          <volume>410</volume>
          {
          <fpage>422</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          35.
          <string-name>
            <surname>Tan P.-N</surname>
          </string-name>
          .,
          <string-name>
            <surname>Kumar</surname>
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Srivastava</surname>
            <given-names>J</given-names>
          </string-name>
          .:
          <article-title>Selecting the right objective measure for association analysis</article-title>
          .
          <source>Information Systems</source>
          <volume>29</volume>
          (
          <issue>4</issue>
          )(
          <year>2004</year>
          ),
          <volume>293</volume>
          {
          <fpage>313</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          36.
          <string-name>
            <surname>Tan P.-N</surname>
          </string-name>
          .,
          <string-name>
            <surname>Steinbach</surname>
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kumar</surname>
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Introduction to Data Mining</article-title>
          . Addison-Wesley,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          37.
          <string-name>
            <surname>Vaillant</surname>
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lenca</surname>
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lallich</surname>
            <given-names>S.:</given-names>
          </string-name>
          <article-title>A Clustering of Interestingness Measures</article-title>
          .
          <source>DS'04, the 7th International Conference on Discovery Science LNAI</source>
          <volume>3245</volume>
          (
          <year>2004</year>
          ),
          <volume>290</volume>
          {
          <fpage>297</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          38.
          <string-name>
            <surname>Vaillant</surname>
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Mesurer la qualite des regles d'association: etudes formelles et experimentales</article-title>
          .
          <source>PhD thesis</source>
          , ENST Bretagne,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          39.
          <string-name>
            <surname>Wang</surname>
            <given-names>X.</given-names>
          </string-name>
          , Ma J.:
          <article-title>A novel approach to attribute reduction in concept lattices</article-title>
          .
          <source>RSKT 2006, Lecture Notes in Arti cial Intelligence</source>
          <volume>4062</volume>
          (
          <year>2006</year>
          ),
          <volume>522</volume>
          {
          <fpage>529</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          40.
          <string-name>
            <surname>Wille</surname>
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Restructuring lattice theory: an approach based on hierarchies of concepts</article-title>
          .
          <source>In: Rival I.: Ordered Sets. Reidel</source>
          , Dordrecht, Boston,
          <year>1982</year>
          ,
          <volume>445</volume>
          {
          <fpage>470</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          41. Zhang W.-X.,
          <string-name>
            <surname>Wie</surname>
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Qi J.-</surname>
          </string-name>
          J.:
          <article-title>Attribute reduction in concept lattices based on discernibility matrix</article-title>
          .
          <source>RSFDGrC 2005, Lecture Notes in Arti cial Intelligence</source>
          <volume>3642</volume>
          (
          <year>2005</year>
          ),
          <volume>157</volume>
          {
          <fpage>165</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>