Emulating
 Emulating    a cooperative behavior
           a Cooperative     Behaviorin in
                                         a generic
                                            a Generic
         association rule visualization tool
      Association Rule Visualization Tool

                          I. Nsir1 , S. Ben Yahia1,2 , and E. Mephu Nguifo2
                       I. Nsir1 , S. Ben Yahia1;2 , and E. Mephu Nguifo2
                              1
                              Départment des Sciences de l’Informatique
        1
            Départment des Sciences    de l’Informatique,
                                     Faculté des Sciences de Faculté
                                                              Tunis des Sciences de Tunis
                          Campus
                             CampusUniversitaire,      1060Tunis,
                                       Universitaire, 1060  Tunis,Tunisie.
                                                                    Tunisie.
                                  sadok.benyahia@fst.rnu.tn
                               sadok.benyahia,@fst.rnu.tn
                  2 2
                   Centre de de
                      Centre  Recherche
                                RechercheenenInformatique      deLens-IUT
                                                 Informatique de  Lens-IUT dede Lens
                                                                              Lens
                         Rue
                           Ruededel’Université
                                    l’Université SP  16,62307
                                                  SP 16,  62307Lens
                                                                 LensCedex
                                                                       Cedex
                                  mephu@cril.univ-artois.fr
                                mephu@cril.univ-artois.fr


            Abstract. Traditional framework for mining association rules has pointed out the
            derivation of many redundant rules. In order to be reliable in a decision making
            process, such discovered rules have to be both concise and easily understandable
            for users, and/or an input to visualization tools. In this paper, we present a 3
            graphical visualization prototype for handling generic bases of association rules.
            We discuss also the most adequate graphical visualization technique depending
            on the intrinsic structure of the generic bases of association rules. An interesting
            feature of the prototype is that it provides a ”contextual” exploration of such rule
            set. Such additional displayed knowledge, based on the discovery of fuzzy meta-
            rules, enhances man-machine interaction by emulating a cooperative behavior.


1 Introduction
Modern hardware and database technology has made it possible to store gigabytes of in-
formation in databases. However, this rapid digitalization has pointed out an important
need for tools and/or techniques to delve and efficiently discover valuable, non-obvious
information from large databases. Data mining has been proposed and studied to help
users better understand and analyze the information. Much research in data mining from
large databases has focused on the discovery of association rules [1–3]. Association
rule generation is achieved from a set F of frequent itemsets in an extraction context
D, for a minimal support minsup. An association rule r is a relation between itemsets
of the form r : X ⇒ (Y −X), in which X and Y are frequent itemsets, and X ⊂ Y .
Itemsets X and (Y − X) are called, respectively, premise and conclusion of the rule
r. The valid association rules are those of which the measure of confidence Conf(r)=
 support(Y ) 3
support(X) is greater than or equal to a minimal threshold of confidence, named min-
conf. If Conf (r) = 1 then r is called exact association rule (ER), otherwise it is called
approximative association rule (AR).
    The problem of the relevance and usefulness of extracted association rules is of
primary importance. Indeed, in most real life databases, thousands and even millions of
high-confidence rules are generated, among which many are redundant.
 3
     The number of transactions of D containing Y , i.e., support(Y)=|{t ∈ D | Y ⊆ t}|.


c V. Snášel, R. Bělohlávek (Eds.): CLA 2004, pp. 34–46, ISBN 80-248-0597-9.
  VŠB – Technical University of Ostrava, Dept. of Computer Science, 2004.
  Emulating a Cooperative Behavior in a Generic Association Rule Vis. Tool                35


    Various techniques are used to limit the number of reported rules, starting by basic
pruning techniques based on thresholds for both the frequency of the represented pattern
and the strength of the dependency between antecedent and conclusion. This pruning
can be based on patterns defined by the user (user-defined templates), on boolean op-
erators [4–6]. The number of rules can be reduced through pruning based on additional
information such as a taxonomy on items [7] or a metric of specific interest [8] (e.g.,
Pearson’s correlation or χ2 -test).

    More advanced techniques that produce only a limited number of the entire set
of rules rely on closures and Galois connections [9], which are in turn derived from
Galois lattice theory and formal concept analysis (FCA) [10]. Finally, works on FCA
have yielded a row of results on compact representations of closed set families, also
called bases, whose impact on association rule reduction is currently under intensive
investigation within the community [9].

    In this paper, we are interested in the most used kind of of visualization categories
in data mining, i.e., use visualization techniques to present the information catched out
from the mining process. Visualization tools became more appealing when handling
large data sets with complex relationships, since information presented in the form of
images is more direct and easily understood by humans. Visualization tools allow users
to work in an interactive environment with ease in understanding rules. In a based-
tabular view of association rules, all strong rules are represented as in a tabular repre-
sentation format (rule table), in which each entry corresponds to a rule. All rules can
be displayed in different order, such as order by premise, conclusion, support or confi-
dence. This helps users to have a clearer view of the rules and locate a particular rule
more easily. The tabular view is advocated for representing a large number of rules with
varied length. As a drawback, tabular-based technique draws heavily on ”boring” log-
ical inference that a user should perform, and is not suitable for visualizing rules from
different aspects. For example, if a user is interested in a comprehensive view of the re-
lationship between rules and items. In this case, the tabular view is not very convenient,
since an item repetitively appears in the rule table as long as it is contained by a rule.
These facts underline the importance of graphical rule visualization tools, permitting
a clearer and more user-friendly view of rules and items. Indeed, visual representation
has the capability of shifting load from the user’s cognitive system to the perceptual
system [11]. However, as pointed out in the dedicated literature, the graphical based
techniques (e.g., 3D Histograms or 2D matrix) are actually interesting if only a small
size of rules are handled.

    In this paper, we are interested in presenting a graphical-based visualization pro-
totype for handling generic bases of association rules. We try to find an answer to the
following question : ”Which is the most adequate visualisation technique depending on
the intrinsic structure of generic bases of association rules, specially as their sizes is by
far lower than the set of all (redundant) association rules”? As we will show later, 3D
histograms-based technique is particularly advocated for visualizing generic bases ex-
tracted from sparse contexts. While, 2D matrix-based technique is indicated to visualize
generic bases extracted from dense contexts.
36      I. Nsir, S. Ben Yahia, E. Mephu Nguifo


    An interesting feature of the aforementioned prototype is that it provides a ”contex-
tual” exploration of such rule set. Indeed, additional information are provided to a user
selecting a given displayed rule :
 1. All the derivable rule are displayed. To derive such rules, we use the set of inference
    axioms, that we introduced in [12].
 2. All the ”connected” rules are displayed to the user in an interactive manner. The
    contextual interaction is performed through the construction of fuzzy meta-rules,
    i.e., rules where both premises and conclusions are composed of rules.

Interestingly, such additional displayed knowledge allows improved man-machine in-
teraction by emulating a cooperative behavior.
    The remainder of this paper is organized as follows. Section 2 sketches briefly the
mathematical background for the extraction of generic bases of association rules. In
section 3, we present the process of the extraction of fuzzy association meta-rules. Then
we discuss in depth in section 4 the opportunity of the rule set contextual exploration.
Section 5 concludes this paper and points out future research directions.


2 Extracting generic bases of association rules
In the following, we recall some key results from the Galois lattice-based paradigm
in FCA and its applications to association rules mining. A formal context is a triplet
K = (O, A, R), where O represents a finite set of objects (or transactions), A is a finite
set of attributes and R is a binary relation (i.e., R ⊆ D × T ). Each couple (o, a) ∈ R
expresses that the transaction o ∈ O contains the attribute a ∈ A. We define two
functions that map sets of objects to sets of attributes and vice versa. Thus, for a set
O ⊆ O, we define φ(O) = {a | ∀o, o ∈ O ⇒ (o, a) ∈ R}; and for A ⊆ A,
ψ(A) = {o | ∀a, a ∈ A ⇒ (o, a) ∈ R}. Both functions φ and ψ form a Galois
connection between the respective power sets P(A) and P(O) [13]. Consequently, both
compound operators of φ and ψ are closure operators, in particular ω = φ ◦ ψ.
    Frequent closed itemset: An itemset A ⊆ A is said to be closed if A = ω(A),
and is said to be frequent with respect to minsup threshold if support(A) = |ψ(A)|  |O| ≥
minsup. An itemset g ⊆ A is called minimal generator of a closed itemset A, if and
only if ω(g) = A and @g 0 ⊆ g such that ω(g 0 ) = A.
    Iceberg Galois lattice: When only frequent closed itemsets are considered with
set inclusion, the resulting structure (L̂, ⊆) only preserves the joint operator. In the
remaining of the paper, such structure is referred to ”Iceberg Galois Lattice”.
    With respect to [14] and [15], given an Iceberg Galois lattice, representing prece-
dence relation-based ordered closed itemsets, generic bases of association rules can be
derived in a straightforward manner. We assume that, in such structure, each closed
itemset is ”decorated” with its associated list of minimal generators. Hence, generic
approximative association rules (GAR) represent ”inter-node” implications, assorted
with a statistical information, i.e., the confidence, from a sub-closed-itemset to a super-
closed-itemset while starting from a given node in an ordered structure. Inversely,
generic exact association rules (GER) are ”intra-node” implications extracted from each
node in the ordered structure.
  Emulating a Cooperative Behavior in a Generic Association Rule Vis. Tool           37


    For example, let us consider the transaction database given by Table 1 (Left). The
set of extracted closed itemsets, with their associated minimal generators, is depicted
by Table 1 (Right). Therefore, Table 2 yields, respectively, the set of generic exact
association rules and the set of approximative association rules that it may be possible
to derive for minsup=1.


               Table 1. Left: Transaction database Right: Closed itemsets list


                                              Gen. extent          intent
                TID items                     a    {1, 2}          aek
                1 aek                         c    {3, 4}          cg
                2 aek                         e    {1, 2, 4, 6} ek
                3 cg                          g    {3, 4, 5, 7} g
                4 cegk                        k    {1, 2, 4, 6, 7} k
                5 g                           ce {4}               cegk
                6 ek                          ck {4}               cegk
                7 gk                          eg {4}               cegk
                                              gk {4}               cegk


           Table 2. Generic association rule set Left:Exact. Right:Approximative


                                              # ”⇒” Conf.
                # ”⇒”
                                              R8 g⇒c 0.5
                R1 a⇒ek
                                              R9 g⇒cek 0.25
                R2 c⇒g
                                              R10 c⇒egk 0.5
                R3 e⇒k
                                              R11 k⇒e 0.8
                R4 ce⇒gk
                                              R12 k⇒ae 0.4
                R5 ck⇒eg
                                              R13 e⇒ak 0.5
                R6 eg⇒ck
                                              R14 k⇒ceg 0.5
                R7 gk⇒ec
                                              R15 e⇒cgk 0.25


3 Extracting fuzzy association rules
In the following, we present the basic fuzzy constructs and our proper notations.

3.1 Basic Fuzzy constructs
Fuzzy Sets : Let U be a finite classical set of objects, called universe of discourse. A
fuzzy set Fe, in a universe of discourse U , is characterized by a membership function
38      I. Nsir, S. Ben Yahia, E. Mephu Nguifo


µFe : U → [0, 1], where µFe (u) denotes the degree of membership of u in the fuzzy set
                                                   µ (u ) µ (u )        µ (u )
Fe . Hence, the fuzzy set Fe is denoted by : Fe = { u1 , u2 , . . . , un }.
                                                     Fe   1  Fe 2         Fe  n


     In the following, the notion of fuzzy extraction context, made up of objects and
attributes, related under a fuzzy type-1 relation, is formalized.
     Fuzzy extraction context :A fuzzy extraction context is a triple D=(O,  e     e R)
                                                                                   I, e de-
                                                        e
scribing a finite set O of objects, a fuzzy finite set I of database items (or descriptors)
                                                                     α
and a fuzzy binary relation R  e (i.e.,R
                                       e ⊆ O × I).
                                                 e Each couple (o, i) ∈ R,   e denotes that
                                      e
the object o ∈ O, has the item i ∈ I, at least with a degree α.
     Fuzzy Galois connection: Let D     e = (O, I,
                                                 e R)e be a fuzzy extraction context. For
X ⊆ O and Ye ⊆ I,    e the functions : φe : P (O) → P(I)    e and ψe : P(I) e → P (O) are
defined as follows [16]:
            α
 e
 φ(X)   = { b | α = min{µRe (a, b) | a ∈ X}},
 e Ye ) = {a ∈ O | ∀b, b ∈ B
 ψ(                           e ⇒ µ e (a, b) ≥ µ e (b)}
                                      R           Y
                          e
     The fuzzy operator φ is applied on a crisp set of objects and determines to which
degree each property is satisfied by all objects, according to their respective degrees.
Note that φ,e as defined formerly, presents a desired abstraction vocation. Indeed, the
retrieved fuzzy set is the least generalization, through the ”min” function, of all fuzzy
sets (or descriptions) associated respectively to the input set objects.
     Similarly, the fuzzy operator ψe is applied on a set of properties, represented by a
fuzzy set, and determines to which degree each object satisfies all of them. The implicit
use of the Rescher-Gaines fuzzy implication permits to obtain the desired interpretation
effect. Hence, the operator ”≥” permits to filter only objects fulfilling the constraint that
the associated descriptions are more general than the input description.

Remark 1. It is noteworthy that the proposed definition of fuzzy Galois connection -
without context transformation- can not be unique. This constatation is due to the ex-
istence of multitude of semantic/syntactic parameters to take into account in any ex-
tension to the fuzzy context (e.g., fuzzy implication choice). For instance, we men-
tion based-Lukasiewicz-implication propositions of Belohlavek [17] and Pollandt [18],
where a fuzzy formal concept is defined by a pair of two fuzzy sets X  e and Ye , where
 e
X =ω      e        e      e  e
       e (Y ) and Y = φ(X). The reader is referred to [16] for a critical discussion on
these propositions, based on semantic interpretations attached to membership degrees
in a fuzzy set (i.e.,fulfillment, preference,interpretation).


3.2 The FARD algorithm

The FARD [19] algorithm falls in the ”test-and-generate” characterization for min-
ing frequent (closed) patterns algorithms. It handles as input an extended transaction
database, e.g, a database in which quantities of the purchased items are available. The
peculiarity of the FARD algorithm stands in the fact that it is dedicated. Indeed, it tack-
les the original context without binarizing it, since it takes advantage of the particular
properties of the fuzzy descriptions to limit the computation effort.
    The algorithm traverses iteratively the search space in a level-wise manner. During
each iteration corresponding to a level, a set of candidate patterns is created by joining
  Emulating a Cooperative Behavior in a Generic Association Rule Vis. Tool               39


frequent patterns discovered during the previous iteration, the supports of all candi-
date patterns are counted and infrequent ones are discarded. Fuzzy association rules are
generated in two successive steps:
 1. Discovering all frequent fuzzy closed itemsets (FuFCs),
 2. From the discovered frequent fuzzy closed itemsets, generate all fuzzy association
    rules that it is possible to derive.
    As input the FARD algorithm takes a fuzzy extraction context D,  e minsup and min-
conf. In this paper, we consider that the fuzzy extraction context D  e is constituted as
follows: De = (O, RS, R),
                        b where O represents the finite set of objects, RS is a finite
set of GER and GAR and R   b is a fuzzy binary relation, where µ b (o, R) = conf (R), ∀
                                                                 R
o ∈ O and R ∈ {GER ∪ GAR}.
    Following the general principle of a level-wise algorithm, the FARD algorithm per-
forms in each iteration the following two steps (assuming that items are sorted in a
lexicographic order):
 1. Construction step: The G EN - CLOSED function, is applied to each generator in
    CFuFCi , determining its support and possibly its closure (if it is frequent enough).
    This set is pruned with respect the anti-monotonous constraint, through the minsup
    threshold.
 2. Pruning step : The set of generators to be utilized in the next iteration, i.e. CFuFCi+1 ,
    is computed by applying the G EN - NEXT function to the set CFuFCi .
The algorithm terminates when there are no more generators to process, i.e. CFuFCi .gen-
list, is empty. FARD algorithm pseud-code is given in Algorithm 1.1. Note that sub-
routines pseudo-code is not given due to lack of available space.

   Algorithm 1.1: FARD
          e : fuzzy extraction context, S:items
   Input: D                             e       user-constraints, minsup
   Output: FuFC= ∪i FuFCi
   begin
       CFuFC1 .gen-list={1-fuzzy itemsets};
       for (i = 1;CFuFCi .gen-list6= ∅;i++) do
            CFuFCi .clos= ∅;
            FuFCi =G EN - CLOSED(CFuFCi );
            CFuFCi+1 =G EN - NEXT(FuFCi );
       return FuFC=∪i FuFCi
   end


Example 1. Let us consider the fuzzy extraction context depicted by Table 3. In this
               α
context, each (Ri , j) couple means that the object j is covered by the rule Ri with
a confidence equal to α. From such context, by applying the FARD algorithm, it is
possible to extract the fuzzy closed itemsets from which we derive fuzzy generic exact
fuzzy association rules, which are depicted by Table 4.
40      I. Nsir, S. Ben Yahia, E. Mephu Nguifo


                R1 R2 R3 R4 R5 R6 R7 R8 R9 R10 R11 R12 R13 R14 R15
              11      1                           .8 .4     .5
              21      1                           .8 .4     .5
              3       1                .5
              4    1 1 1 1 1 1 .5 .25 .5 .8              .2    .25
              5
              6       1                           .8
              7
                             Table 3. Fuzzy meta-context


                      #     ”⇒”
                                        .8 .4 .5
                      MR1 R11 ⇒R31 R11    R12 R14
                              1     1
                      MR2 R2 ⇒R8
                                        .8 .4 .5
                      MR3 R31 ⇒R11 R11    R12 R14
                              1       1 1 1 1 .25 .5 .8 .2 .25
                      MR4 R4 ⇒ R5 R6 R7 R8 R9 R10 R11 R13 R15
                                                       .5 .8 .2 .25
                      MR5 R51 ⇒R41 R61 R71 R81 R9.25 R10  R11 R13 R15
                              1       1 1 1 1 .25 .5 .8 .2 .25
                      MR6 R6 ⇒ R4 R5 R7 R8 R9 R10 R11 R13 R15
                                                       .5 .8 .2 .25
                      MR7 R71 ⇒R41 R51 R61 R81 R9.25 R10  R11 R13 R15
                              1      1
                      MR8 R8 ⇒R2
                      MR9 R9.25 ⇒ R41 R51 R61 R71 R81 R10
                                                        .5 .8 .2 .25
                                                          R11 R13 R15
                              .5       1 1 1 1 1 .25 .8 .2 .25
                      MR10 R10 ⇒ R4 R5 R6 R7 R8 R9 R11 R13 R15
                              .8
                      MR11 R11    ⇒ R31
                              .4             .8 .5
                      MR12 R12    ⇒ R11 R31 R11 R14
                              .2       1 1 1 1 1 .25 .5 .8 .25
                      MR13 R13 ⇒ R4 R5 R6 R7 R8 R9 R10 R11 R15
                              .5             .8 .4
                      MR14 R14    ⇒ R11 R31 R11 R12
                              .25       1 1 1 1 1 .25 .5 .8 .2
                      MR15 R15 ⇒ R4 R5 R6 R7 R8 R9 R10 R11 R13
                      Table 4. Exact fuzzy meta-association Rules set


4 Graphical visualization and cooperative exploration
Let us keep in mind that given a generic association rule set, we aim to set up a graphical
visualization prototype permitting to enhance the man-machine interaction by providing
a contextual ”knowledge”, minimizing a boring large amount of knowledge exploration.
To do so, we construct a set of fuzzy meta-rules, where both premise and conclusion
rule’s are made up of classical association rules. The role of such fuzzy meta-rules is
to highlight ”connections” between association rules without loosing rule’s confidence
information.
    Here we come to a turning point: Why is it interesting to try to understand why a
user and/or a knowledge expert may be interested in a particular rule, and to determine
what interesting information or knowledge, not explicitly requested, we could provide
him, in addition to the proper answer? Indeed, improving man-machine interaction by
emulating a cooperative behavior has been proposed by some researchers through var-
ious techniques [20]. In [21], the author states: ”requests for data can be classified
roughly into two kinds : specific requests and goals. A specific request establishes a
rigid qualification, and is concerned only with data that matches it precisely. A goal,
     Emulating a Cooperative Behavior in a Generic Association Rule Vis. Tool          41


on the other hand, establishes a target qualification and is concerned with data which
is close to the target”. For Cuppens and Demolombe [22], ”the basic idea is that when
a person asks a question, he is not interested to know the answer just to increase his
knowledge, but he has the intention to realize some action, and the answer contains
necessary, or useful information to realize this action”. In our context, when such addi-
tional knowledge is not supplied, this forces the user to retry a tedious rule exploration
repeatedly, until obtaining a satisfactory ”matching”.
    The visualization prototype takes as input an XML file complying to the DTD de-
picted by Figure 1. This storing format is argued by the fact that XML, the eXtensible
Markup Language, has recently emerged as a new standard for data representation and
exchange on the Internet. As output, the set of selected rules can be saved in a file with
HTML or TXT format.

<?xml version="1.0" encoding="ISO-8859-1"?>
<!DOCTYPE ruleSet[
<!ELEMENT ruleSet (rule+)>
<!ELEMENT rule (premise, conclusion,listObjects?)>
<!ELEMENT premise (item+)>
<!ELEMENT conclusion (item+)>
<!ELEMENT listObjects #PCDATA>
<!ATTLIST rule id ID #REQUIRED>
<!ATTLIST rule conf CDATA #REQUIRED >
<!ATTLIST premise id ID #REQUIRED>
<!ATTLIST premise deg CDATA #REQUIRED >
]>


                        Fig. 1. Associated DTD of accepted input XML file


    In the prototype interface, c.f., Figure 2(Up), the user can select items that can
both appear in the premise and/or the conclusion part. Once the minsup and minconf
thresholds are fixed, the user can generate the desired rules. Both textual and graphical
representation are provided. The user can further filter specific rules from the generated
ones.


The suitable visualization technique: In the 3D histograms based visualization tech-
nique, c.f., the screenshot depicted by Figure 2(Middle), matrix floor rows represent
items and columns represent item associations. The red and blue blocks of each column
(rule) represent the premise and the conclusion, respectively. Item identities are shown
along the right side of the matrix. The associated confidence and support are represented
by a scaled histogram. While on 2D matrix-based visualization technique, rule premise
items are on one axis, and the rule conclusion items are on the other axis. We use the
solution introduced by MineSet software 4 allowing multiple items in rule premise
 4
     Available at http://www.sgi.com/software/mineset/index.html.
42       I. Nsir, S. Ben Yahia, E. Mephu Nguifo


Fig. 2. (U P ) Parameters settings for association rule selection. (M IDDLE ) 3D histogram visual-
ization of selected rules. (D OWN )2D matrix visualization of selected rules.
     Emulating a Cooperative Behavior in a Generic Association Rule Vis. Tool                  43


or conclusion, by grouping each of the items combinations in rule premise or rule con-
clusion as one unit. The confidence value is indicated by different colors. The higher
the confidence value is, the darker the color. Currently, the user has to indicate which
visualization technique he (she) prefers. However, we are trying to set up an automatic
indicator which can choose the most adequate visualization technique depending on
an assessment of the density of the extraction context. In fact, as indicated by experi-
mental experiments on real-life and synthetic extraction contexts whose parameters are
summarized by Table 5, when the extraction context is dense, generic association rules
drawn from such context tend to present large conclusions (depending on the number of
items). Indeed in Table 6, we tried to assess, for multiple minsup values, the difference
between the size of the minimal generator and the average size of its associated frequent
closed itemsets. the larger the difference is, the larger the generic conclusion’s associ-
ation rule part. From the entries dedicated to dense extraction contexts in Table 6, we
remark that this difference is important comparatively to that pointed out in by sparse
contexts. Therefore, when we consider sparse extraction contexts, the length of premise
and the length of the conclusion of generic association rule tend to be equal. Then given
that it is known, 3D histograms visualization technique is adapted to visualize rules with
a small number of items in the conclusion part, we can conclude it is better to use them
for generic rules extracted from sparse contexts.


                       Base      Type |A| |largest itemset | |O|       Size
                   T10I4D100 Sparse 1000            29        100000 4.05MB
                   T10I10D100 Sparse 1000           77        100000 15MB
                    ROOMOUT Dense 120               23          8124 580KB
                    C ONNECT Dense 130              44         49840 8.82MB
                      C HESS    Dense 76            37          3196 334KB
                             Table 5. Extraction contexts parameters


Displaying additional information

 – Displaying derivable rules: Once a user is interested in the graphical visualiza-
   tion window (c.f., for example Figure 2 Down), then by activating the contextual
   menu, two options are displayed ”Syntactic Rule derivation” and ”Rule derivation”.
   If the user selects the first option, then the system provides all syntactically deriv-
   able rules 5 in a new window. Indeed, for these derivable rules the only available
   information is that their support and confidence values are at least equal, respec-
   tively, to those of the generic association rule used to derive them. For example in
   Figure 3, five association rule are presented to the user once he selected the generic
   association rule: k ⇒ae. On the other hand, if the user selects the ”Rule derivation”
 5
     The syntactic derivation is based on the Cover operator introduced in [23], i.e., Cover(X ⇒Y)
     = {X ∪ Z ⇒ V | Z, V ⊆ Y ∧ Z ∩ V = ∅ ∧ V 6= ∅}, with |Cover(X ⇒Y)|=3m -2m where
     |Y|=m.
44      I. Nsir, S. Ben Yahia, E. Mephu Nguifo


Base       support (%) |FCI| size of minimal generator : Average size of closed itemset
               0.5     1073 1:1- 2:2- 3:3-4:4-5:5
                1      385 1:1- 2:2- 3:3
T10I4D100
                2      155 1:1
                5      10    1:1
                5      316 1:1- 2:2
T10I10D100     10      82    1:1
               15      19    1:1
               10      4897 1 : 3.767 - 2 : 5.840 - 3 : 7.138 - 4 : 8.364 - 5 : 8.998 - 6 : 9.803
ROOMOUT                      - 7 : 11.179
               20      1197 1 : 3.418 - 2 : 4.621 - 3 : 6.358 - 4 : 7.332 - 5 : 8.218 - 6 : 9.169
                             - 7 : 11
               30      427 1 : 2.5 - 2 : 4.041 - 3 : 5.052 - 4 : 6.209 - 5 : 7.333 - 6 : 8.5
               89      9017 1: 2.045 -2 :3.796 - 3: 5.322- 4 :6.7 -5:7.962 -6 :9.151 -7 :
C ONNECT                     10.272 -8 : 11.341
               90      7475 1 : 1.952 -2: 3.691 -3 : 5.174 -4 : 6.512 -5 : 7.751 -6 : 8.911 -7 :
                             10.006 -8 : 11.032
               91      6126 1 : 1.952 -2 : 3.596 -3 : 5.017 -4 : 6.323 -5 : 7.539 -6 : 8.671 -7
                             : 9.732 -8 : 10.71
               78      7111 1 :1.227 -2 :2.417 -3: 3.543 -4: 4.61 -5: 5.612 -6: 6.573 -7: 7.525
C HESS                       -8: 8.453 -9: 9.319 -10: 10
               80      5083 11 : 1.21 -2: 2.401 -3: 3.537 -4: 4.564 -5: 5.546 -6: 6.518 -7:
                             7.472 -8: 8.369 -9: 9.181
               85      1885 1 : 1.25 -2: 2.366 -3: 3.406 -4: 4.414 -5: 5.394 -6: 6.322-7: 7.185
                             -8: 8
                                Table 6. Experimental results


   option, then the system provides only all the derivable rules that have exactly the
   same support and confidence values as those of the generic association rule used
   to derive them. As depicted in Figure 3, only one rule, k⇒a, is derivable and has
   exactly the same support and confidence as k⇒ae.
 – Displaying connected rules: This option is currently under implementation and
   we will illustrate it through an example. Suppose that a user may be interested in
   rule the R1 . Then, when the user clicks on R1 , the visualization interface provides
   all rules that are connected to R1 in a displaying area. From the meta-rule MR1
   depicted in Table 4, we can check that the following rules are connected to R1 : R3
   : e⇒k (conf.=1), R11 : k⇒e (conf.=0.8), R12 : k⇒ae (conf.=0.4) and R14 : k⇒ceg
   (conf.=0.5).


5 Conclusion

In this paper, we presented an approach for providing a cooperative exploration of
generic bases of association rules. This additional knowledge highlighting connected
rules to a user-select rules allows an improvement of man-machine interaction. To do
  Emulating a Cooperative Behavior in a Generic Association Rule Vis. Tool                  45


                          Fig. 3. Displaying derived association rule.


so, we constructed a set of fuzzy meta-rules extracted from the discovered fuzzy closed
itemsets.
    The visualization prototype is currently under implementation and in the near fu-
ture, we plan also to include recommended visualization tasks, e.g., history and ex-
tract [24], and to lead extensive experimental results to assess their satisfaction vs the
user graphical interface.

Acknowledgments The authors would like to thank Mrs L EGROY Audrey for its help-
ful efforts in the implementation of visualization prototype.


References
 1. Agrawal, R., Skirant, R.: Fast algorithms for mining association rules. In: Proceedings of
    the 20th International Conference on Very Large Databases. (1994) 478–499
 2. Brin, S., Motwani, R., Ullman, J.: Dynamic itemset counting and implication rules. In: Pro-
    ceedings ACM SIGMOD, International conference on Management of Data ,Tucson, Ari-
    zona, USA. (1997) 255–264
 3. Manilla, H., Toinoven, H., Verkamo, I.: Efficient algorithms for discovering association
    rules. In: AAAI Worshop on Knowledge Discovery in Databases. (1994) 181–192
 4. Meo, R., Psaila, G., Ceri, S.: A new sql-like operator for mining association rules. In:
    Proceedings of the VLDB Conference. (1996) 122–133
46       I. Nsir, S. Ben Yahia, E. Mephu Nguifo


 5. Ng, R.T., Lakshmanan, V.S., han, J., Pang, A.: Exploratory mining and pruning optimizations
    of constrained association rules. In: Proceedings of the SIGMOD Conference. (1998) 13–24
 6. Srikant, R., Vu, Q., Agrawal, R.: Mining association rules with item constraints. In: Proc. of
    the 3rd Intl. Conference on Knowledge Discovery in Databases and Data Mining, Newport
    Beach, California. (1997) 67–73
 7. Han, J., Fu, Y.: Discovery of multiple-level association rules from large databases. In:
    Proceedings of the VLDB Conference. (1995) 420–431
 8. Brin, S., Motawni, R., Silverstein, C.: Beyond market baskets : Generalizing association
    rules to correlation. In: Proceedings of the SIGMOD, Tucson, Arizona (USA). (1997) 265–
    276
 9. Bastide, Y., Pasquier, N., Taouil, R., Lakhal, L., Stumme, G.: Mining minimal non-redundant
    association rules using frequent closed itemsets. In: Proceedings of the International Confer-
    ence DOOD’2000, Lecture Notes in Computer Sciences, Springer-verlag. (2000) 972–986
10. Ganter, B., Wille, R.: Formal Concept Analysis. Springer-Verlag, Heidelberg (1999)
11. Buono, P., Costabile, M.F., Lisi, F.A.: Supporting data analysis through visualizations. In:
    Proceedings of Workshop on Visual Data Mining, Freiburg, Germany. (2001) 67–78
12. BenYahia, S., Nguifo, E.M.: Revisiting generic bases of association rules. In: Proceedings
    of 6th International Conference on Data Warehousing and Knowledge Discovery (DaWaK
    2004) (Springer-Verlag) to appear, Zaragoza, Spain. (2004)
13. Barbut, M., Monjardet, B.: Ordre et classification. Algèbre et Combinatoire. Hachette, Tome
    II (1970)
14. Luxenburger, M.: Implication partielles dans un contexte. Mathématiques et Sciences Hu-
    maines 29 (1991) 35–55
15. Guigues, J., Duquenne, V.: Familles minimales d’implications informatives résultant d’un
    tableau de données binaires. Mathématiques et Sciences Humaines (1986) 5–18
16. Jaoua, A., Elloumi, S., BenYahia, S., Alvi, F.: Galois connection in fuzzy binary relations:
    applications for discovering association rules and decision making. Methodos Publisher
    (2002)
17. Belohlavek, R.: Fuzzy Galois connections. Math. Logic 45 (1999) 497–504
18. Pollandt, S.: Fuzzy-Begriffe: Formale Begriffsanalyse unscharfer Daten. Springer-verlag
    (1997)
19. BenYahia, S., Jaoua, A.: Discovering knowledge from fuzzy concept lattice. In: Data mining
    and Computational Intelligence. Studies in Fuzziness and Soft Computing, Vol. 68. Physica
    Verlag, Heidelberg (2001) 167–190
20. Frantz, V., Shapiro, J.: Algorithms for automatic construction of query formulations in
    boolean form. Journal of the American Society for Information Science 1 (1991) 16–26
21. Motro, A.: Extending the relational Databases Model to support Goal queries. In Lerschberg,
    L., ed.: Expert Database Systems. The Benjamin/Cummings Publishing Company (1987)
    129–150
22. Cuppens, F., Demolombe, R.: How to recognize interesting topics to provide cooperative
    answering. Information Systems 14 (1989) 163–173
23. Kryszkiewicz, M.: Concise representations of association rules. In Hand, D.J., Adams, N.,
    Bolton, R., eds.: Proceedings of Pattern Detection and Discovery, ESF Exploratory Work-
    shop, London, UK. Volume 2447 of Lecture Notes in Computer Science., Springer (2002)
    92–109
24. Shneiderman, B.: The eyes have it: A task by data type taxonomy for information visual-
    ization. In Press, I.C.S., ed.: Proceedings IEEE Symposium on Visual Languages, Boulder,
    Colorado, 336–343 (1996)