<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>VVIIEE MMGGBB:: AA VViissuuaall IInntteerraaccttiivvee EExxpplloorraattiioonn ooff MMiinniimmaall GGeenneerriicc BBaassiiss ooff AAssssooppcciiaattiioonn RRuulleess</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Chiraz Latiri Cherif</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Wissem Bellegha</string-name>
          <email>wissembellegha@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sadok Ben Yahia</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Chiraz Latiri Cherif</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>smissi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sadok Ben Yahia</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ghada Guesmi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Campus UniverRseitsaeiarrechELTeMa manUarR</institution>
          ,
          <addr-line>P10A6H0 Tunis</addr-line>
          ,
          <country>Tunisia Campus</country>
        </aff>
      </contrib-group>
      <fpage>179</fpage>
      <lpage>196</lpage>
      <abstract>
        <p>Mining association rules is an important task, even though the number of rules discovered is often huge. A possible solution to this problem, is to use the Formal Concept Analysis (FCA) mathematical settings to restrict rules extraction to a generic basis of association rules. This one is considered as a reduced set to which we can apply appropriate inference mechanisms to derive redundant rules. In this paper, we introduce a new minimal generic basis MGB of non-redundant association rules based on the augmented Iceberg Galois lattice. The proposed approach involves the inference mechanisms used and a set of experiments applied to several real and synthetic databases. Carried out experiments showed important benefits in terms of reduction in the number of generic rules extracted. We present also a new framework for generating and visually exploring the minimal generic basis MGB.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        Discovering association rules in large datasets is an interesting problem of
research in data mining. Its principal objective is to find out correlations between
frequents itemsets. An association rule is a strong one when its support (i.e.,
frequency of the represented pattern) and confidence (i.e., strength of the
dependency between the premise and the conclusion) are higher than minimum
thresholds fixed by the user (minsup and minconf ). However the number of
association rules generated is often huge because many of them are redundant.
So, a possible solution to this problem, is to restrict rules extraction to a reduced
set containing only non redundant ones which are strictly related to the user’s
need [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>This reduced set is known as generic basis for association rules, to which
we can apply appropriate inference mechanisms to derive all strong redundant
association rules. A generic basis can be evaluated by three properties such as
informativity, compacity and inference mechanism which will be detailed later.</p>
      <p>
        This paper is organized as follows. Section 2 presents the mathematical
background of Formal Concept Analysis (FCA) [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and describes the association rules
mining task. Section 3 deals with the problem of eliminating redundant rules by
surveying some of the proposed approaches for constructing generic bases of
association rules. In section 4, we present a new minimal generic basis of
association rules denoted as MGB. Section 5 consolidates the proposed approach by
a set of experiments applied to several real synthetic databases. Finally, we will
present VIE MGB as a new framework for visually generating and exploring
MGB and the Iceberg Galois lattice. We were inspired by MIRAGE [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] which
allows interactive graphical visualization and exploration of minimal association
rules.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Mathematical background</title>
      <p>In this section, we introduce the mathematical notions based on the FCA.
2.1</p>
      <sec id="sec-2-1">
        <title>Basic notions</title>
        <p>Formal context: A formal context is a triplet K = (O, A, R), where O
represents a finite set of objects, A a finite set of attributes, and R a binary
relation(i.e., R ⊆ O × A). Each couple (o, a) ∈ R expresses that the transaction
o ∈ O contains the attribute a ∈ A.</p>
        <p>
          We define two functions, summarizing links between subsets of objects and
subsets of attributes induced by R, that map sets of objects to sets of attributes
and vice versa. Thus, for a set O ⊆ O, we define
φ(O) = {a | ∀o, o ∈ O =⇒ (o, a) ∈ R} ; and for A ⊆ A, ψ(A) = {o | ∀a, a ∈
A =⇒ (o, a) ∈ R}. Both functions φ and ψ form a Galois connection between
the sets P(A) and P(O) [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. Consequently, both compound operators of φ and
ψ are closure operators, in particular, ω = φ ◦ ψ is a closure operator.
        </p>
        <p>Frequent closed itemset: An itemset A ⊆ A is said to be closed if
A = ω(A), and is said to be frequent with respect to minsup threshold if
support(A) = |ψ(A)| ≥ minsup.</p>
        <p>|O|</p>
        <p>Formal concept: A formal concept is a couple c = (O; A), where O is called
extent, and A is a closed itemset, called intent. Furthermore, both O and A are
related through the Galois connection, i.e., φ(O) = A and ψ(A) = O.</p>
        <p>
          Minimal generator: An itemset g ⊆ A is called minimal generator of a
closed itemset A, if and only if ω(g) = A and g ⊆ g such that ω(g ) = A [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ].
        </p>
        <p>
          Galois lattice: Given a formal context K, the set of formal concepts CK
is a complete lattice Lc = (C, ≤), called the Galois (concept) lattice, when CK
is considered with inclusion between itemsets [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. A partial order on formal
concepts is defined as follows ∀c1, c2 ∈ CK, c1 ≤ c2 iff intent(c2) ⊆ intent(c1),
or equivalently extent(c1) ⊆ extent(c2). The partial order is used to generate
the lattice graph, called Hasse diagram, in the following manner: there is an
arc (c1, c2), if c1 c2 where is the transitive reduction of ≤, i.e., ∀c3 ∈ CK,
c1 ≤ c3 ≤ c2 implies either c1 = c3 or c2 = c3.
        </p>
        <p>Property 1 Lattice operator join provides the least upper bound (LUB) in the
concept lattice, and it is defined as follows:</p>
        <p>Let S ⊆ Lc</p>
        <p>J oin(S) = ω
∪ c
c∈S</p>
        <p>Iceberg Galois lattice: When only frequent closed itemsets are considered
with set inclusion, the resulting structure (Lc, ⊆) only preserves the LUBS , i.e.,
the join operator. This is called a join semi-lattice or upper semi-lattice. In the
remaining of the paper, such structure is referred to as ”Iceberg Galois Lattice”.
Example 1 Let us consider the extraction context given by table 1. The
associated Iceberg Galois lattice, for minsup = 12 , is depicted by figure 1. Each node
in the Iceberg is represented as couple (closed itemset,support) and is decorated
with its associated minimal generators list.
2.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Association rules</title>
        <p>
          An association rule represents an implication between frequent itemsets. It is
an expression of the form: R : X =⇒ Y [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ], in which X and X Y are frequent
itemsets, and X ⊂ X Y . Itemsets X and Y are called, respectively, premise and
conclusion of the rule R. The support of R denoted supp(R), is equal to the
support of X Y , and its confidence denoted by conf (R) is determined as follows:
R A C D T W
1 × × × ×
2 × × ×
3 × × × ×
4 × × × ×
5 × × × × ×
6 × × ×
conf (R) = ssuupppp((XXY)) . If conf (R) = 1, then R is an exact association rule
(ER), otherwise it is called approximate association rule (AR).
        </p>
        <p>
          The process of association rules extraction can be split into two steps [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]:
1. Finding out all frequent itemsets from a given context extraction k for a
specified minsup value.
2. Generating all association rules from the frequent itemsets obtained and
restricting extraction to rules which confidence satisfies the minconf value.
3
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Extraction of non redundant association rules</title>
      <p>
        The problem of the relevance and usefulness of extracted association rules is of
primary importance. Indeed, in most real life databases, the set of association
rules can rapidly grow to be unwieldy, especially as we lower the minsup value,
and among which many are redundant. So, a possible solution to this problem
is to restrict extraction of rules to a generic basis of non redundant association
rules. Once this generic basis is obtained, all the remaining (redundant) rules
can be derived using an appropriate inference mechanism. In an ideal case a
generic basis should satisfy the following conditions [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]:
- Informativity: to exactly determine the support and confidence of redundant
association rules.
- Deriving redundant rules: the derivation axioms should be lossless (should
enable derivation of all strong rules) and sound (should forbid derivation of
rules that are not strong).
- Compacity: to have the most reduced set of generic rules allowing the
derivation of all strong remaining ones(redundant rules).
      </p>
      <p>In this section, we will overview some representations of strong association
rules.
3.1</p>
      <sec id="sec-3-1">
        <title>Minimal non redundant association rules (MNR)</title>
        <p>
          In [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], Bastide et al. present a couple of generic bases, one for the exact rules
denoted GB, and the second for approximate ones denoted IB.
        </p>
        <p>Bastide et al. consider the following rule-redundancy definition:
Definition 1 An association rule R : X =⇒ Y is a minimal non redundant
association rule iff an association rule R1 : X1 =⇒ Y1 fulfilling the following
constraints:
1. supp(R) = supp(R1) and conf (R) = conf (R1)
2. X1 ⊂ X and Y ⊂ Y1</p>
        <p>
          Exact association rules, of the form R : X =⇒ Y , are implications between
two itemsets X and X Y whose closures are identical, i.e., ω(X ) = ω(X Y ).
Indeed, from the aforementioned equality, we deduce that X and X Y belong
to the same class of the equivalence relation induced by the closure operator
ω on P (A) and then supp(X ) = supp(X Y ) (i.e., conf (R) = 1). Bastide et
al. characterized what they called “the generic basis for exact association rules”
(adapting the global implications base of Duquenne and Guigues [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]). The generic
basis for exact association rules, that has been proved to be lossless information,
is defined as follows:
Definition 2 Let F C be the set of frequent closed itemsets extracted from a
context and, for each frequent closed itemset c, let us denote Gc the set of minimal
generators of c. The generic basis for exact association rules is as follows:
        </p>
        <sec id="sec-3-1-1">
          <title>Association rules Support Conf idence</title>
          <p>D =⇒ C 32 1
T =⇒ C 32 1
W =⇒ C 65 1
A =⇒ CW 32 1
DW =⇒ C 12 1
T W =⇒ AC 21 1
AT =⇒ CW 12 1</p>
        </sec>
        <sec id="sec-3-1-2">
          <title>Association rules Support Conf idence</title>
          <p>TADCC==C=⇒⇒==⇒=⇒⇒⇒ACCCWTTDWWW 111222263523 434334632523</p>
          <p>GB = {R : g =⇒ c − g | c ∈ F C ∧ g ∈ Gc ∧ g = c}.</p>
          <p>
            The authors also characterized the informative basis for approximate
association rules based on extending the family of bases of partial implications of
Luxemburger [
            <xref ref-type="bibr" rid="ref8">8</xref>
            ], e.g., cover, structural and arborescent bases. The informative
basis is defined as follows:
Definition 3 The informative basis for approximate association rules is given
by:
          </p>
          <p>IB = {R : g =⇒ c − g, c ∈ F C ∧ g ∈ G ∧ ω(g) ⊂ c}.</p>
          <p>With respect to Definitions 2 and 3, we consider that given an Iceberg Galois
lattice, representing precedence-relation-based ordered closed itemsets, generic
bases of association rules can be derived in a straightforward manner. We assume
that in such structure, each closed itemset is ”decorated” with its associated list
of minimal generators. Hence, AR represent ”inter-node” implications, assorted
with a statistical information, i.e., the confidence, from a sub-closed-itemset to
a super-closed-itemset while starting from a given node in an ordered structure.
Inversely, ER are ”intra-node” implications extracted from each node in the
ordered structure.</p>
          <p>Example 2 Generic bases of exact and approximate association rules that can
be drawn from the context k, are respectively depicted in table 2.</p>
          <p>
            As shown in [
            <xref ref-type="bibr" rid="ref7">7</xref>
            ] the couple (GB,IB) form a sound and informative generic
basis. However, it produces a very important number of association rules, so this
generic basis is not compact.
In [
            <xref ref-type="bibr" rid="ref9">9</xref>
            ], the author proposes a new generic basis for association rules denoted
RBP han, and presents its associated inference mechanism. This approach takes
as input all frequent itemsets obtained from a given context extraction k by
applying an appropriate algorithm such as Apriori [
            <xref ref-type="bibr" rid="ref10">10</xref>
            ]. Phan defines the
ruleredundancy as follows:
Definition 4 Let E be the set of all strong association rules extracted from a
context extraction k. The rule r : l1 =⇒ l2 ∈ E is a redundant one if and
only if there is a rule r : l1 =⇒ l2 ∈ E, and the two following conditions are
satisfied:
          </p>
          <p>So the representative basis RBP han is defined as follows:
Definition 5 Let minsup and minconf be the support and confidence thresholds
for interesting association rules.</p>
          <p>
            RBP han = r : I =⇒ J | I ⊂ J ∧ r : I =⇒ J | I ⊆ I ∧ J ⊆ J
Remark 1. RBP han consists in a redefinition of representative rules denoted as
RR and defined in [
            <xref ref-type="bibr" rid="ref7">7</xref>
            ]. However, the difference between them is that for RBP han
the premise and the conclusion of a given rule are not necessarily disjoint.
Example 3 Given the context extraction k depicted by table 1, we present the
representative basis extracted for minsup = 12 and minconf = 23 in table 3.
Once the representative basis is generated, we can derive all strong redundant
association rules by applying the inference mechanism defined in [
            <xref ref-type="bibr" rid="ref9">9</xref>
            ] as follows:
Definition 6 Let I, J , K be frequent itemsets and having RBP han, we consider
two derivation axioms:
1. Left augmentation: if I =⇒ J K is interesting, then IJ =⇒ J K is
interesting.
2. Decomposition: if I =⇒ I1I2 is interesting, then I =⇒ I1 and I =⇒ I2
are interesting.
          </p>
          <p>Moreover, Phan showed that the representative basis is the most reduced set of
association rules, from which all remaining rules (redundant) can be derived, so
the property of compacity is satisfied. RBP han is sound and lossless, therefore,
this approach presents some drawbacks such as:
1. The representative basis is not informative. The support and confidence of
derived rules can not be found.
2. For each rule R : X =⇒ Y , we must verify if there is no rule R : X =⇒ Y
such as X ⊆ X ∧Y ⊆ Y .
3.3</p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>Informative Generic Basis (IGB)</title>
        <p>
          In [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ], the authors propose a new informative generic basis for association rules
denoted IGB, which takes as input all the frequent closed itemsets. IGB is
defined in [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] as follows:
Definition 7 Let F C be the set of frequent closed itemsets and Gc, the list of
minimal generators for a given frequent closed itemset c.
        </p>
        <p>IGB={R : gs =⇒ I − gs | I ∈ F C ∧ gs ∈ Gc, c ∈ F C ∧ c ⊆ I
∧ conf (R)≥minconf ∧ g | g ⊂ gs ∧ conf (g =⇒ A − g ) ≥ minconf }
Example 4 Given the context extraction k of table 1, we present the basis IGB
extracted for minsup = 12 and minconf = 23 in table 4.</p>
        <p>
          In [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ], the authors prove that IGB is informative. In fact, IGB contains all
frequent closed itemsets, and the support of a frequent itemset is equal to the
support of the smallest frequent closed itemset which contains it. So, the support
and the confidence of redundant rules can easily be determined.
        </p>
        <p>
          In other way, IGB is sound and lossless, and the inference mechanism is
composed by three derivation axioms which are [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]:
        </p>
        <sec id="sec-3-2-1">
          <title>Association rules Support Confidence</title>
          <p>∅TAD∅∅∅======⇒=⇒⇒⇒⇒⇒⇒ACACCCCCTCWTDWWWW 22111223232356 33434432323256</p>
          <p>∅ =⇒ C 1 1</p>
          <p>Certainly, IGB is informative, therefore this property is satisfied by the presence
of all frequent closed itemsets in the generic basis. So, one can imagine the
number of frequent closed itemsets extracted from real databases and consequently
the number of generic rules that can be extracted.
4</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>MGB: A new Minimal Generic Basis</title>
      <p>In the previous section, we have studied the properties satisfied by each approach
presented, and so, we deduce that, the problem of the construction of generic
bases is to find a compromise among the compacity and the informativity of
such basis. Then, we propose a new minimal generic basis of association rules
that guarantees a partial informativity and which means that we can exactly
determine the support and confidence of some derived rules, and for others, this
approach allows to delimit this measures in an interval.</p>
      <p>MGB is defined as follows:</p>
      <sec id="sec-4-1">
        <title>Definition 8 Given</title>
        <p>1. Lc: Iceberg Galois lattice decorated by minimal generators and their supports.
2. ci: frequent closed itemset.
3. S: set of immediate successors of a given concept c.
4. Gci : list of minimal generators of a concept ci.</p>
        <p>MGB=</p>
        <p>R : g =⇒ ci − g | g ∈ Gci ∧ ci ∈ Lc ∧ conf (R) ≥ minconf</p>
        <p>∧ s ∈ S | ssuuppppoorrtt((gs)) ≥ minconf</p>
      </sec>
      <sec id="sec-4-2">
        <title>Algorithm for discovery MGB from frequent closed itemsets</title>
        <p>In this section, we present the algorithm that extracts the MGB basis directly
from Iceberg Galois lattice decorated with its associated list of minimal
generators and their supports. It is assumed that for every frequent closed itemset we
determine the set of its immediate successors and the list of generators satisfying
a minconf constraint.</p>
      </sec>
      <sec id="sec-4-3">
        <title>Algorithm 1 MGB</title>
        <p>Input: Lc: Iceberg Galois lattice decorated by minimal generators and their
supports.</p>
        <p>Output: MGB
begin
for each (ci ∈ F C | |ci| &gt; 1) do
Gci = {minimal generators of concept c | c ⊆ c∧ ssuuppppoorrtt((cci)) ≥ minconf }
for each generator g ∈ Gci do</p>
        <p>support(s)
if ( s ∈ Sci | support(g) ≥ minconf ) then</p>
        <p>MGB =MGB ∪R : g → ci − g
end</p>
        <p>Gci = Gci − {g | g ⊆ g }
else</p>
        <p>
          Gci = Gci − {g}
Example 5 Given the context extraction shown by table 1, we will illustrate in
table 5 the MGB generic basis for minsup = 12 and minconf = 23 .
As we have shown, MGB requires to determine the list of all immediate
successors of a given concept c, thus, we will present how to construct this list. So,
we will explore the Iceberg Galois lattice considering the set inclusion and the
partial order defined between the different frequent closed itemsets. In fact, this
process is composed of two steps, which are:
1. Generating the list of all successors of a given concept c.
2. Using this list, we determine the immediate successors of c.
For deriving all strong redundant association rules, we propose to adopt two
derivation axioms of the inference mechanism defined in [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ].
1. Augmentation: if X =⇒ Y ∈ MGB, then X ∪ Z =⇒ Y − {Z} is a strong
association rule, Z ⊂ Y .
2. Conditional Decomposition: if R : X =⇒ Y ∈ MGB, then X =⇒ Z is
a strong association rule, Z ⊂ Y .
        </p>
        <p>
          This inference mechanism has been proved in [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] to be sound and complete.
Remark 2. It is important to mention that for deriving all approximate
association rules, the order of applying the inference axioms is important. Indeed,
we have to apply first the Augmentation axiom and next apply the Conditional
Decomposition axiom on the resulting set of association rules, i.e., set of
approximate generic rules and those derived by the Augmentation axiom.
        </p>
        <p>In fact, MGB is not informative. The support and confidence of derived
rules cannot be found. For example we cannot find the confidence of the rule
AT =⇒ CW derived from A =⇒ CT W because the support of AT is unknown.</p>
        <p>Nevertheless, the non-informativity problem can be partially resolved by
introducing a new concept denoted ”Partial informativity”. In fact, it means that
we can exactly determine the support and confidence of some derived rules, and
for others, this approach allows to delimit this measures in an interval.
Proposition 1. Let R : X =⇒ Y ∈MGB, A and B two integers, if we apply
the inference mechanism, we obtain rules of the following forms:
1. Augmentation: R1 : XZ =⇒ Y − Z</p>
        <p>A=B A=B
Augmentation Augmentation</p>
        <p>R1.supp=R.supp R1.supp=R.supp</p>
        <p>R1.conf=R.supp/supp(XZ) R1.conf∈[R.conf,1]
Conditional Decomposition Conditional Decomposition</p>
        <p>R2.supp=supp(XZ) B ≤ R2 .supp ≤ A</p>
        <p>R2.conf=supp(XZ)/supp(X) R2.conf∈ [R.conf,1]</p>
        <p>Table 6. Support an confidence of derived rules
2. Conditional Decomposition: R2 : X =⇒ Z
(a) support(XZ) ≤ min {support(i1), support(i2), ..., support(ik) | ii ⊂ XZ},</p>
        <p>A, where ii is a frequent subset of XZ.
(b) support(XZ) ≥ max{support(I1), support(I2), ..., support(Ik)}, B, where</p>
        <p>Ii is a frequent closed itemset containing XZ and existing in MGB.
Remark 3. if A = B, then we have supp(XZ) = A.</p>
        <p>So, table 6 illustrates how we determine support and confidence of these rules.
Proof. a: Since the support of a frequent itemset is less than the support of all
its subsets, then we deduce that support(XZ) is less than the smallest support
of all its subsets.</p>
        <p>b: By same, the support of a frequent itemset is higher than the support
of all its supersets, so support(XZ) is higher than the support of the smallest
frequent closed itemset which contains it and existing in MGB.</p>
        <p>c: If we apply the augmentation axiom, the generic rule’s base and the
redundant rule’s base are similar, so support(R1) = support(R).</p>
        <p>d : If we apply the conditional decomposition axiom, then the derived rule’s
base is XZ, and so support(R2) = support(XZ)
Remark 4. Before deriving the redundant rules, we determine the support of all
frequent itemsets which are the antecedent of generic rules.</p>
        <p>Example 6 Given the generic rule R : C =⇒ AW , supp(R) = 23 and conf (R) =
23 , so, supp(C) = csounpfp((RR)) = 1.</p>
        <p>In table 7, we summarize the properties of the reviewed association rules
representations. These ones contain only rules whose bases are frequent closed
itemsets and whose antecedent are frequent generators. Let us consider the
following notations:
Generic basis Intended derivable rules Infer. mechanism Lossless Sound Informative
RBP han AR LA + D yes yes no</p>
        <p>IGB AR CR + A + CD yes yes yes
(GB,IB) certainAR+approxAR A + CD yes yes yes
scMGB AR A + CD yes yes no</p>
        <p>Table 7. Summary of Association Rules Representations</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Visual implementation and experimental evaluation</title>
      <p>All experiments described below were performed on a 3,06GHz PC with 512MB
of memory, running Windows XP. The algorithm MGB was coded in JAVA.</p>
      <p>
        Table 8 shows characteristics of real and synthetic datasets used in our
evaluation.
We compare MGB’results with those concerning RBP han, IGB and (GB,IB)
which are detailed in [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
Mushrooms 30%
– For some minconf values, MGB is the most reduced set of generic
association rules:
- In fact, concerning the representative basis RBP han, for each frequent
closed itemset c, if support(c) ≥ minconf , then we have a factual rule
of the form ∅ =⇒ c.
- However, for MGB and for the same minconf value, if the set of generators
of c is empty, the algorithm do not generate any rule.
– If we consider the database Retail, one can observe that for minconf = 1
the number of association rules is equal to 0, so we can say that all the
frequent closed itemsets have only one minimal generator which overlaps the
associated closed itemset.
– By setting minsup = minconf
- For IGB, the number of association rules is equal to the number of frequent
closed itemsets. In fact, the support of all the frequent closed itemsets is
higher or equal than the minconf value.
- The number of association rules in RBP han is equal to the number of
maximal frequent closed itemsets,
– For Mushrooms dataset we can observe that the bases IGB and RBP han
have one more association rule than MGB and (GB,IB), because of the
presence of the item ”85” in all transactions which produces a factual rule
in IGB and RBP han of the form ∅ =⇒ 85.
5.2
      </p>
      <sec id="sec-5-1">
        <title>VIE MGB: Visual Interactive Exploration of Minimal Generic</title>
      </sec>
      <sec id="sec-5-2">
        <title>Basis</title>
        <p>In this section, we present VIE MGB, a new framework for an interactive
graphical visualization and exploration of MGB and the Iceberg Galois lattice.
VIE MGB allows to display generic rules and to derive all strong redundant
association rules with their supports and confidences. The database of
interest has previously been mined, using an efficient algorithm for mining frequent
closed itemsets and their minimal generators. Once these closed itemsets are
mined, they are taken as input.</p>
        <p>In fact, we have two independent processes executed by VIE MGB, which
are:
1. Generating MGB.
2. Constructing the Iceberg Galois lattice and to visualize redundant rules.
The visual interactive framework for exploring and visualizing minimal generic
basis is depicted in figure 2.</p>
      </sec>
      <sec id="sec-5-3">
        <title>Interactive lattice and rule exploration In other way, from the closed item</title>
        <p>sets, VIE MGB creates the Iceberg Galois lattice as shown in figure 2. Each
closed itemset is represented by a node, and there is a link connecting two nodes
if they are related by a set inclusion relation and there is no intermediate set
between them. Once the lattice is displayed on the panel, one can generate the
redundant association rules as follows:</p>
        <p>The user can click on a chosen node on the lattice, then VIE MGB
displays on the text area the generic rules corresponding to this closed itemset and
all strong redundant association rules derived from the generic ones with their
supports and confidences.
Remark 5. When generating the redundant association rules, each frequent
itemset is mapped to the smallest frequent closed itemset which contains it to
determine the support of the derived rule and its confidence. So, the problem of the
informativity of the generic basis is completely resolved.
6</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>In this paper, we have presented theoretical foundations of generic bases of
association rules and we have pointed out the properties satisfied by some approaches.
We have proposed MGB which is a new minimal generic basis of associations
rules and its associated inference mechanism. We have also introduced the
concept of ”partial informativity”.</p>
      <p>This new approach is consolidated by a set of experiments using real and
synthetic databases showing important benefits in terms of reduction in the
number of association rules presented to the user. Finally, we have proposed a
new framework denoted VIE MGB allowing a visual interactive exploration of
MGB.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Stumme</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taouil</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bastide</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pasquier</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lakhal</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Intelligent structuring and reducing of association rules with formal concept analysis</article-title>
          .
          <source>In: Proceedings of KI'2001 conference, Lecture Notes in Artificial Intelligence 2174</source>
          , Springer-verlag. (
          <year>2001</year>
          )
          <fpage>335</fpage>
          -
          <lpage>350</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Ganter</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wille</surname>
          </string-name>
          , R.:
          <source>Formal Concept Analysis</source>
          . Springer-Verlag, Heidelberg (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Zaki</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Phoophakdee</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>MIRAGE: A framework for mining, exploring and visualizing minimal association rules</article-title>
          .
          <source>Rpi cs dept technical report</source>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Barbut</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Monjardet</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Ordre et classification</article-title>
          .
          <source>Alg`ebre et Combinatoire. Hachette</source>
          ,
          <string-name>
            <surname>Tome II</surname>
          </string-name>
          (
          <year>1970</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Bastide</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pasquier</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taouil</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lakhal</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stumme</surname>
          </string-name>
          , G.:
          <article-title>Mining minimal non-redundant association rules using frequent closed itemsets</article-title>
          .
          <source>In: Proceedings of the International Conference DOOD'2000, Lecture Notes in Computer Sciences</source>
          , Springer-verlag. (
          <year>2000</year>
          )
          <fpage>972</fpage>
          -
          <lpage>986</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Agrawal</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Imielinski</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Swami</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Mining Association Rules between sets of items in large Databases</article-title>
          .
          <source>ACM SIGMOD Records</source>
          (
          <year>1993</year>
          )
          <fpage>207</fpage>
          -
          <lpage>216</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Kryszkiewicz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Concise representations of association rules</article-title>
          .
          <source>In: Proceedings of Pattern Detection and Discovery</source>
          , ESF Exploratory Workshop, London, UK. (
          <year>2002</year>
          )
          <fpage>92</fpage>
          -
          <lpage>109</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Duquenne</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guigues</surname>
          </string-name>
          , J.:
          <article-title>Famille minimale d'implications informatives r´esultant d'un tableau de donn´ees binaires</article-title>
          .
          <source>Math´ematiques et Sciences Humaines</source>
          <volume>95</volume>
          (
          <year>1986</year>
          )
          <fpage>5</fpage>
          -
          <lpage>18</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Luong</surname>
            ,
            <given-names>V.P.</given-names>
          </string-name>
          :
          <article-title>Raisonnement sur les rgles d'association</article-title>
          . In: In Proceedings 17me Journe
          <string-name>
            <surname>Bases de Donnes Avances</surname>
            <given-names>BDA</given-names>
          </string-name>
          '
          <year>2001</year>
          ,
          <string-name>
            <surname>Agadir</surname>
          </string-name>
          (Maroc),
          <source>Cpadus Edition</source>
          . (
          <year>2001</year>
          )
          <fpage>299</fpage>
          -
          <lpage>310</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Agrawal</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Skirant</surname>
          </string-name>
          , R.:
          <article-title>Fast algorithms for mining association rules</article-title>
          .
          <source>In: Proceedings of the 20th International Conference on Very Large Databases</source>
          . (
          <year>1994</year>
          )
          <fpage>478</fpage>
          -
          <lpage>499</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Gasmi</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , BenYahia,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Nguifo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.M.</given-names>
            ,
            <surname>Slimani</surname>
          </string-name>
          ,
          <string-name>
            <surname>Y.</surname>
          </string-name>
          :
          <article-title>A new informative generic base of association rules</article-title>
          .
          <source>In: In Proceedings of the Ninth Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD-05)</source>
          ,Hanoi,
          <string-name>
            <surname>Vietnam.</surname>
          </string-name>
          (
          <year>2005</year>
          )
          <fpage>81</fpage>
          -
          <lpage>90</lpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>