<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Three Related FCA Methods for Mining Biclusters of Similar Values on Columns</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mehdi Kaytoue</string-name>
          <email>mehdi.kaytoue@insa-lyon.fr</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Victor Codocedo</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jaume Baixieres</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Amedeo Napoli</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>LORIA (CNRS - Inria Nancy Grand Est - Universite de Lorraine)</institution>
          ,
          <addr-line>B.P. 239, F-54506, Vand uvre-les-Nancy</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Universitat Politecnica de Catalunya.</institution>
          <addr-line>08032, Barcelona. Catalonia</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Universite de Lyon. CNRS</institution>
          ,
          <addr-line>INSA-Lyon, LIRIS. UMR5205, F-69621</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Biclustering numerical data tables consists in detecting particular and strong associations between both subsets of objects and attributes. Such biclusters are interesting since they model the data as local patterns. Whereas there exists several de nitions of biclusters, depending on the constraints they should respect, we focus in this paper on biclusters of similar values on columns. There are several ad hoc methods for mining such biclusters in the literature. We focus here on two aspects: genericity and e ciency. We show that Formal Concept Analysis provides a mathematical framework to characterize them in several ways, but also to compute them with existing and e cient algorithms. The proposed methods, which rely on pattern structures and triadic concept analysis, are experimented and compared on two di erent datasets.</p>
      </abstract>
      <kwd-group>
        <kwd>biclustering</kwd>
        <kwd>triadic concept analysis</kwd>
        <kwd>pattern structure</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Biclustering has attracted a lot of attention for many years now, as it was used in
an extensive way for mining biological data [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Given a data-table with objects
as rows and attributes as columns, the goal is to nd \sub-tables", or pairs of
both subsets of objects and attributes, such that the values in the subtables
respect well-de ned constraints or maximize a given measure [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
      </p>
      <p>
        There exist several types of biclusters depending on the relation the values
should respect. For example, constant biclusters are subtables with equal
values [
        <xref ref-type="bibr" rid="ref12 ref17 ref6">12, 6, 17</xref>
        ]. Biclusters with similar values on columns (BSVC) are subtables
where all values are pairwise similar for each column [
        <xref ref-type="bibr" rid="ref17 ref4">4, 17</xref>
        ]. The latter can also
be generalized to biclusters of similar values (BSV): any two values in the
subtable are similar [
        <xref ref-type="bibr" rid="ref12 ref2 ref21 ref3">2, 3, 12, 21</xref>
        ]. Dozens of algorithms, mostly ad hoc, have been
proposed for computing the di erent types of biclusters. In this paper, we are
interested in possible extensions of the Formal Concept Analysis (FCA)
formalism for achieving the problem of biclustering. This comes with two goals:
(i) formalizing and understanding biclusters formation and structure, and (ii)
reusing existing algorithms for genericity purposes.
      </p>
      <p>
        Actually, the present paper is in continuation with the work of the authors
on the use of pattern structures {an extension of FCA for mining complex data
[
        <xref ref-type="bibr" rid="ref12 ref8">8, 12</xref>
        ]{ for discovering functional dependencies in a crisp and a fuzzy settings
[
        <xref ref-type="bibr" rid="ref1 ref22">1</xref>
        ], and as well on the adaptation of pattern structures to a speci c biclustering
task: the discovery of biclusters of type BSV [
        <xref ref-type="bibr" rid="ref11 ref6">6, 11</xref>
        ]. Moreover, the biclustering
task is usually considered as a \`two-dimensional" (2D) process where biclusters
are rectangles in a table verifying some prior constraints. It was one main idea
of [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] to transpose the problem in a \three-dimensional" setting by using and
adapting triadic concept analysis [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] to the biclustering task.
      </p>
      <p>
        Here we follow the same line and we propose a new approach for discovering
biclusters in a numerical dataset where biclusters have \similar values" w.r.t.
their columns (type BSVC). This works is a new attempt to extend the
capabilities of FCA and of pattern structures, in dealing with the important problem of
biclustering. Actually, biclustering can be also considered in a (pure) numerical
setting, where it is sometimes called coclustering [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] and where kernel or
spectral methods are often used for achieving the task. Here we keep the discrete
setting and more precisely an FCA-based setting.
      </p>
      <p>The rest of this paper is organized as follows. In Section 2 we formally
introduce the biclustering problem. Then, we recall in Section 3 the FCA basics
that are necessary for developing our three methods in Section 4. We experiment
with these methods and compare them by processing two real-world datasets in
Section 5 before concluding.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Problem De nition</title>
      <p>We introduce the problem of mining biclusters of similar values on columns, or
simply biclusters when no confusion can be made. A numerical dataset is de ned
as a many-valued context in which biclusters are denoted as pairs of object and
attribute subsets for which a particular similarity constraint holds.</p>
      <sec id="sec-2-1">
        <title>De nition 1 (Many-valued context and numerical dataset). A many</title>
        <p>valued context consists in a quadruple (G; M; W; I) where G is a set of objects,</p>
        <sec id="sec-2-1-1">
          <title>M a set of attributes, W a set of attribute values, and I G M W a ternary</title>
          <p>relation. An element (g; m; w) 2 I, also written m(g) = w or g(m) = w, can
be interpreted as: w is the value taken by the attribute m for the object g. The
relation I is such that g(m) = w and g(m) = v implies w = v.</p>
          <p>In the present work, W is a set of numbers and Knum = (G; M; W; I) denotes
a numerical dataset, i.e. a many-valued context where W is a set of numbers.
Example. A tabular representation of a
numerical dataset is given in Table 1: objects G =
fg1; g2; g3; g4; g5g are represented by rows while
attributes M = fm1; m2, m3; m4g are represented by
columns. W = f0; 1; 2; 6; 7; 8; 9g and we have for
example g2(m4) = 9.</p>
          <p>m1 m2 m3 m4
g1 1 2 2 8
g2 2 1 2 9
g3 2 1 1 2
g4 1 0 7 6
g5 6 6 6 7
Fig. 1. A numerical dataset
De nition 2 (Biclusters with similar values on columns). Given a
numerical dataset (G; M; W; I), a pair (A; B) (where A G; B M ) is called a
bicluster of similar values on columns when the following statement holds:
8g; h 2 A; 8m 2 B; m(g) '
m(h)
where ' is a similarity relation: 8w1; w2 2 W; 2 [0; max(W ) min(W )],
w1 ' w2 () jw1 w2j . A bicluster (A; B) is maximal if @g 2 GnA such
that (A [ fgg; B) is a bicluster, and @m 2 M nB such that (A; B [ fmg) is a
bicluster.</p>
          <p>Example. In Table 1, with = 1, we have that (A; B) = (fg1; g2g; fm1; m2; m3g)
is a bicluster. Indeed, consider each attribute of B separately: the values taken
by the objects A are pairwise similar. However, (A; B) is not maximal, since
we have that both (A [ fg3g; B) and (A; B [ fm4g) are also biclusters. Then,
(fg1; g2; g3g; fm1; m2; m3g) and (fg1; g2g; fm1; m2; m3; m4g) are both maximal.
Problem (Biclustering). Given a numerical dataset (G; M; W; I) and a
similarity parameter , the goal of biclustering is to extract the set of all maximal
biclusters (A; B) respecting the similarity constraint.</p>
          <p>Remark. It should be noticed that in the formal de nition, the similarity
parameter is the same for all attributes. It is possible however to use a di erent
parameter for each attribute without changing neither the problem de nition or
its resolution. For real-world datasets, one can choose di erent similarity
parameters m (8m 2 M ), but also can normalize/scale the attribute domains and use
a single similarity parameter .
3</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Basics on Formal Concept Analysis</title>
      <p>
        In this paper, we show how our biclustering problem can be formalized and
answered in FCA in di erent ways: (i) using standard FCA [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], (ii) using pattern
structures [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], and (iii) using triadic concept analysis [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. We recall below the
basics of each approach.
      </p>
      <p>Dyadic Concept Analysis. Let G be a set of objects, M a set of attributes
and I G M be a binary relation. The fact (g; m) 2 I is interpreted as \g
has attribute m". The two following derivation operators ( )0 are de ned:
A0 = fm 2 M j 8g 2 A : gImg
B0 = fg 2 G j 8m 2 B : gImg
f or A
f or B</p>
      <p>
        G;
M
which de ne a Galois connection between the powersets of G and M . For A G,
B M , a pair (A; B) such that A0 = B and B0 = A, is called a (formal) concept.
Concepts are partially ordered by (A1; B1) (A2; B2) , A1 A2 (, B2 B1).
With respect to this partial order, the set of all formal concepts forms a complete
lattice called the concept lattice of the formal context (G; M; I). For a concept
(A; B) the set A is called the extent and the set B the intent of the concept.
Triadic Concept Analysis. A triadic context is given by (G; M; B; Y ) where
G, M , and B are respectively called sets of objects, attributes and conditions,
and Y G M B. The fact (g; m; b) 2 Y is interpreted as the statement
\Object g has the attribute m under condition b". A (triadic) concept of (G; M; B; Y )
is a triple (A1; A2; A3) with A1 G, A2 M and A3 B satisfying the two
following statements: (i) A1 A2 A3 Y , X1 X2 X3 Y and (ii) A1 X1,
A2 X2 and A3 X3 implies A1 = X1, A2 = X2 and A3 = X3. If (G; M; B; Y )
is represented by a three dimensional table, (i) means that a concept stands for
a 3-dimensional rectangle full of crosses while (ii) characterizes component-wise
maximality of concepts. For a triadic concept (A1; A2; A3), A1 is called the
extent, A2 the intent and A3 the modus. To derive triadic concepts, two pairs of
derivation operators are de ned. The reader can refer to [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] for their de nitions
which are not necessary for the understanding of the present work.
Pattern Structures. Let G be a set of objects, let (D; u) be a
meet-semilattice of potential object descriptions and let : G ! D be a mapping. Then
(G; (D; u); ) is called a pattern structure. Elements of D are called patterns
and are ordered by a subsumption relation v such that given c; d 2 D one has
c v d () cud = c. Within the pattern structure (G; (D; u); ) we can de ne the
following derivation operators ( ) , given A G and a description d 2 (D; u):
A = l (g)
g2A
d
= fg 2 Gjd v (g)g
These operators form a Galois connection between (}(G); ) and (D; v).
(Pattern) concepts of (G; (D; u); ) are pairs of the form (A; d), A G, d 2 (D; u),
such that A = d and A = d . For a pattern concept (A; d), d is called a pattern
intent and is the common description of all objects in A, called pattern extent.
When partially ordered by (A1; d1) (A2; d2) , A1 A2 (, d2 v d1), the set
of all concepts forms a complete lattice called a (pattern) concept lattice.
      </p>
      <sec id="sec-3-1">
        <title>Computing Concepts and Concept Lattices. Processing a formal context</title>
        <p>
          in order to generate its set of concepts can be achieved by various algorithms
(see [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] for a survey and a comparison, see also itemset mining [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]). For
processing pattern structures, such algorithms generally need minor adaptations.
Basically, one needs to override the code for (i) computing the intersection of
any two arbitrary descriptions, and (ii) test the ordering between two
descriptions. Processing a triadic context is however not so direct and can be done with
nested FCA algorithms [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] or dedicated data-mining algorithm [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ].
Similarity relations in FCA. The notion of similarity can be formalized by a
tolerance relation: a symmetric, re exive but not necessarily transitive relation.
The similarity relation ' used for de ning biclusters of similar values is a
tolerance. Given W a set of numbers, any maximal subset of pairwise similar values
is called a block of tolerance.
        </p>
        <sec id="sec-3-1-1">
          <title>De nition 3. A binary relation T W</title>
          <p>(i) 8x 2 W xT x (re exivity)
(ii) 8x; y 2 W xT y ! yT x (symmetry)</p>
        </sec>
        <sec id="sec-3-1-2">
          <title>W is called a tolerance relation if:</title>
        </sec>
        <sec id="sec-3-1-3">
          <title>De nition 4. Given a set W , a subset K</title>
          <p>
            W , K is a block of tolerance if:
(i) 8x; y 2 K xT y (pairwise similarity)
(ii) 8z 62 K; 9u 2 K :(zT u) (maximality)
It is shown that tolerance blocks can be obtained from the formal context of a
tolerance relation [
            <xref ref-type="bibr" rid="ref14">14</xref>
            ]. In the context (W; W; ' ), one can characterize all blocks
of tolerance K (and only them) as formal concepts (K; K).
4
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Mining biclusters of similar values on columns in FCA</title>
      <p>
        The basic notions of FCA of the previous section allow us now to answer our
biclustering problem in various ways with: (i) an original method using
interval pattern structure, (ii) a recently introduced method using partition pattern
structures [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], and (iii) an original method relying on triadic concept analysis.
We emphasize the genericity of FCA to answer a data mining problem.
4.1
      </p>
      <sec id="sec-4-1">
        <title>Interval Pattern Structure Approach</title>
        <p>
          For a dataset Knum = (G; M; W; I), an interval pattern structure (G; (D; u); )
is de ned as follows [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]: the objects from G are described by vectors of intervals,
where each dimension gives a range of values for an attribute m 2 M (following
a canonical ordering of the dimensions, i.e. dimension i corresponds to attribute
mi 2 M ). Then, for m 2 M , the semi-lattice of intervals (Dm; um) is given by:
        </p>
        <p>Dm = f[w1; w2] j 9g; h 2 G s:t: m(g) = w1 and m(h) = w2g
[a; b] um [c; d]
=</p>
        <p>[min(a; c); max(b; d)]
c um d = c () c vm d
[a; b] vm [c; d] () [c; d]</p>
        <p>[a; b]
The description space (D; u) of the interval pattern structure is a product of
meet-semi-lattices (D; u) = m2M (Dm; um) which is a semi-lattice.
Examples. In Table 1, (fg1; g2; g3g; h[1; 2]; [1; 2]; [1; 2]; [2; 9]i) is a pattern concept:
(g1) = h[1; 1]; [2; 2]; [2; 2]; [8; 8]i
fg1; g2; g3g
= (g1) u (g2) u (g3) = h[1; 2]; [1; 2]; [1; 2]; [2; 9]i
h[1; 2]; [1; 2]; [1; 2]; [8; 9]i v h[1; 2]; [1; 2]; [1; 2]; [2; 9]i
fg1; g2; g3g
= fg1; g2; g3g
We now give the intuitive idea on how the interval pattern concept lattice can
be used to characterize the biclusters. Consider rst the concept (A1; d1) =
(fg1; g2g; h[1; 2]; [1; 2]; [1; 2]; [8; 9]i). Consider also a function attr : D ! M which
returns for an interval pattern the set of attributes whose interval is not larger
than the parameter, for d = h[ai; bi]i, i 2 [1; jM j]: attr(d) = fmi 2 M jai '
big. (A1; attr(d1)) = (fg1; g2g; fm1; m2; m3; m4g) is a maximal bicluster.
Consider the interval pattern concept (A2; d2) = (fg1; g2; g3g; h[1; 2]; [1; 2]; [1; 2]; [2; 9]i):
(A2; attr(d2)) = (fg1; g2; g3g; fm1; m2; m3g) is a maximal bicluster (with = 1).
This means that biclusters can be characterized thanks to pattern concepts.</p>
        <sec id="sec-4-1-1">
          <title>Proposition 1. Consider a numerical dataset (G; M; W; I) as an interval pat</title>
          <p>tern structure (G; (D; u); ). For any maximal bicluster (A; B), there exists a
pattern concept (A; d) such that (A; B) = (A; attr(d)).</p>
          <p>Proof. To ease reading, the proof is given in an appendix.
tu
4.2</p>
        </sec>
      </sec>
      <sec id="sec-4-2">
        <title>Partition pattern structure approach</title>
        <p>
          A partition pattern structure is a pattern structure instance where the
description space is given by a semi-lattice of partitions over a set X [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. Formally,
we have (G; (D; u); ) where: D = P art(X) and d1 u d2 = S pi \ pj where
pi; pj X, pi 2 d1, pj 2 d2. The semi-lattice is actually a complete lattice of
set partitions in which the bottom element is not considered. In [
          <xref ref-type="bibr" rid="ref1 ref22">1</xref>
          ], we showed
that the de nition of u, and equivalently v, needs a slight modi cation when
D = 22K , i.e. a description d 2 D is a set of subsets of X, and they do cover X
(possibly with overlapping). In that case, we have that d1 u d2 = max(S pi \ pj )
where pi; pj X, pi 2 d1, pj 2 d2 and max(:) returns the maximal sets w.r.t.
inclusion.
        </p>
        <p>Now we show that such a pattern structure can be constructed from a
numerical dataset, and that the corresponding concepts allow to generate all
maximal biclusters. From a numerical dataset (G; M; W; I), we build the structure
(M; (D; u); ) where D = 22G . The description of an object4 m 2 M is given by:
(m) = fp1; p2; :::g where p1; p2; :: G and:
m(g1) '</p>
        <p>m(g2); 8g1; g2 2 pi (similarity)
m(gk); 8gk 2 pi (maximality)
[ pi = G
i
(covering)
In other words, each original attribute m 2 M is described by a family of subsets
of G, where each one corresponds to a block of tolerance w.r.t. the values of
attribute m. Let (A; d = fpig) be a partition pattern concept, it is easy to see
how the pairs bici = (pi; A) are biclusters with rows g 2 pi and columns m 2 A5.
While any bici = (pi; A) is a bicluster, it is not necessarily a maximal bicluster.
Nevertheless, maximal biclusters can be identi ed using the concept lattice.
Proposition 2. Consider a pattern concept (A; d = fpig). The bicluster bici =
(pi; A) is maximal if there is no pattern concept (C; fpi; :::g) with A C.
Proof. The proof to this proposition is very intuitive. Recall from Section 2 that
the bicluster (pi; A) is maximal if two conditions are met, namely @g 2 Gnpi
such that (pi [ fgg; A) is a bicluster and @m 2 M nA such that (pi; A [ fmg) is
4 Object in the pattern structure; attribute in the numerical dataset.
5 In order to keep consistency with the previous notation, biclusters are written
inversely as partition pattern concepts.
a bicluster, The rst condition holds for bici given the maximality condition of
the tolerance block pi; The second follows from the proposition declaration. tu
Example. The numerical dataset (G; M; W; I) given in Table 1 can be turned
into a pattern structure as follows with = 1:
(m1) = ffg1; g2; g3; g4gfg5gg
(m3) = ffg1; g2; g3gfg4; g5gg
(m2) = ffg2; g3; g4gfg1; g2; g3gfg5gg
(m4) = ffg4; g5gfg1; g5gfg1; g2gfg3gg</p>
        <p>Indeed, each component of a description is a maximal set of objects
having pairwise similar values for a given attribute. The pattern concept lattice is
given in Figure 2. We remark that (i) any concept corresponds to a
bicluster, (ii) some of them correspond to a maximal bicluster, and most
importantly, (iii) any maximal bicluster can be found as a concept. For example,
from the concept (A1; d1) = (fm3; m4g; ffg1; g2g; fg4; g5g; fg3gg) we obtain the
following biclusters: bic1 = (fg1; g2g; fm3; m4g) and bic2 = (fg4; g5g; fm3; m4g).
Whereas bic2 is a maximal bicluster bic1 is not since we have that (A2; d2) =
(fm1; m2; m3; m4g; ffg1; g2g; fg3g; fg4g; fg5gg) with (A2; d2) (A1; d1). In turn,
bic3 = (fg1; g2g; fm1; m2; m3; m4g) is a maximal bicluster.</p>
        <p>
          Remark. It is noticeable that an equivalent formal context can be built. By
equivalent, we mean that the concept lattices produced by both structures are
isomorphic. To obtain this formal context, we use a slight modi cation of the data
transformation of [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] (pp. 92): (M; B2(G); I) st. (m; (g; h)) 2 I () m(g) '
m(h). The concept lattice is equivalent to the pattern concept lattice [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ], and
thus it can be used in the same way to get maximal biclusters. In our running
example, such context is given in Table 1, and its associated concept lattice is
given in Figure 2 (right), a lattice isomorphic to the one raised from the pattern
structure (left). The proof can be done in a similar manner as it is done in [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ].
        </p>
        <p>(g1; g2) (g1; g3) (g1; g4) (g1; g5) (g2; g3) (g2; g4) (g2; g5) (g3; g4) (g3; g5) (g4; g5)
m1
m2
m3
m4
We present another original result: any maximal bicluster of similar values is
characterized as a triadic concept. The triadic context is derived from the
numerical dataset by encoding the tolerance relation between the values.</p>
        <sec id="sec-4-2-1">
          <title>Proposition 3. Given a numerical dataset (G; M; W; I), consider the derived</title>
          <p>triadic context given by (M; G; G; Y ) s.t. (m; g1; g2) 2 Y () m(g1) ' m(g2).</p>
        </sec>
        <sec id="sec-4-2-2">
          <title>There is a one-to-one correspondence between the set of all maximal biclusters (A; B), the set of all triadic concepts (B; A; A) of the derived context.</title>
          <p>Proof. Consider a maximal bicluster (A; B). We have that 8g; h 2 A : m(g) '
m(h) () m 2 B, if and only if (by the de nition of Y ) (B; A; A) Y . We now
take (B0; A0; A0) Y such that B B0 and A A0. Since (A; B) is a maximal
bicluster, we have that for any pair of objects g; h 2 A0 and m 2 B0 such that
g(m) ' h(m), implies that g; h 2 A and m 2 B. Let (B; A; A) be a triadic
concept. We have that for any pair of objects g; h 2 A and m 2 B we have that
g(m) ' h(m), this is, that 8g; h 2 A : g(m) ' h(m) () m 2 B, which is
the alternative de nition of maximal bicluster. tu
Example. Taking again = 1, the triadic context derived from the numerical
dataset from Table 1 is given in Table 2. An example of triadic concept is:
(fm3; m2; m1g; fg1; g3; g2g; fg1; g2; g3g) which is in turn the maximal bicluster
(fg1; g3; g2g; fm3; m2; m1g).
5</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Experiments</title>
      <p>We experiment with the di erent FCA methods introduced in the previous
section. We report preliminary results in two aspects: e ciency (running time) and
compactness (number of concepts) to discuss the strengths and weaknesses of
the di erent methods.</p>
      <p>m1 g1 g2 g3 g4 g5 m2 g1 g2 g3 g4 g5 m3 g1 g2 g3 g4 g5 m4 g1 g2 g3 g4 g5
g1 g1 g1 g1
g2 g2 g2 g2
g3 g3 g3 g3
g4 g4 g4 g4
g5 g5 g5 g5</p>
      <p>Data and experimental settings. The rst dataset, \Diagnosis"6, contains
120 objects with 8 attributes. The rst attribute provides temperature
information of a given patient with a range [35:5; 41:5] (numerical). For this attribute
we used = 0:1 and then = 0:3. The other 7 attributes are binary ( = 0).
The second dataset, \dataSample 1.txt", is provided with the BiCat software7.
It contains 420 objects and 70 numerical attributes with range [ 5:9; 6:7]. We
used = 0:05 for all attributes. We provide results in Table 3 for the three
different FCA methods discussed in this article, namely interval pattern structure
(IPS), tolerance blocks/partition pattern structures (TBPS) and triadic concept
analysis (TCA). We also report on the use of standard FCA using the
discretization technique discussed at the end of Section 4.2 (FCA). We also discuss the
computing of clari ed contexts, given that it can dramatically reduce the size
of the context while keeping the same concept lattice (FCA-CL). A context is
clari ed when there exists neither two objects with the same description, or two
attributes shared by the same set of objects.</p>
      <p>
        For the methods based on FCA and pattern structures (IPS, TBPS), we used
a C++ version of the AddIntent algorithm [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]8. No restrictions were imposed
over the size of the biclusters. The TCA method was implemented using
DataPeeler [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. All the experiments were performed using a Linux machine with
Intel Xeon E7 running at 2.67GHz with 1TB of RAM.
      </p>
      <p>Discussion. Results in Table 3 show that for the Diagnosis dataset, the
clari ed context using standard FCA (FCA-CL) is the best of the ve methods
w.r.t. execution time while for the BicAt sample 1, the best is TCA. Times are
expressed as the sum of the time required to create the input representation
of the dataset for the corresponding technique and its execution. In the case
of FCA and FCA-CL, the pre-processing can be as high as the time required
for applying the AddIntent algorithm. However, for large datasets such as the
BicAt example, this times can be ignored. It is also worth noticing that the
pre-processing depends on the chosen value, hence for each di erent con
guration, a new pre-processing task has to be executed. This is not the case for
interval and partition pattern structures the pre-processing of which is linear
6 http://archive.ics.uci.edu/ml/datasets/Contraceptive+Method+Choice
7 http://www.tik.ee.ethz.ch/sop/bicat/
8 https://code.google.com/p/sephirot/
Technique
FCA
FCA-CL
TCA
IPS
TBPS</p>
      <p>Time [s]
Preproc + Exec.</p>
      <p>0.11 + 0.335
0.11 + 0.02
0.04 + 33.3
0.011 + 0.303
0.011 + 1.76
w.r.t. the number of objects (it is actually, just a change of format). We can
also appreciate a more compact representation of the biclusters by the use of
partition pattern structures (TBPS) and its formal context versions (FCA and
FCA-CL). While TBPS is the slowest of the ve methods, it is also the cheapest
one in terms of the use of machine resources, more speci cally RAM. TCA is the
more expensive method in terms of machine resources and data representation,
however this yields results faster. Interval pattern structures are in the middle
as a good trade-o of compactness and execution time.</p>
      <p>
        For this initial experimentation we have not reported the number of maximal
biclusters nor the bicluster extraction algorithms that can be implemented for
each di erent technique, but only in the FCA techniques themselves. Regarding
the number of maximal biclusters, this is the same for each technique since
all of them are bicluster enumeration techniques, i.e. all possible biclusters are
extracted. Hence, the di erence among techniques is not given by the number
of maximal biclusters extracted, but by the number of formal concepts found
and their post-processing complexity to extract the maximal biclusters from
them. In general, it is easy to observe from Propositions 1, 2 and 3 that the
post-processing of TCA is linear w.r.t. the number of triadic concepts found,
while for TPS is linear w.r.t. the number of interval pattern concepts times the
number of columns of the numerical dataset squared and for TBPS is linear
w.r.t. the number of super-sub concept relations in the tolerance block pattern
concept lattice. Nevertheless, di erent strategies for bicluster extraction can be
implemented for each technique rendering the comparison unfair. For example,
in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] an optimization is proposed regarding biclustering using partition pattern
structures (which can be easily adapted to TBPS) which cuts in half its execution
time by breaking the structure of the lattice. Similar strategies for IPS and TCA
could also be implemented but are still a matter of research.
6
      </p>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>Biclustering is an important data analysis task that is used in several
applications such as transcriptome analysis in biology and for the design of
recommender systems. Biclustering methods produce a collection of local patterns that
are easier to interpret than a global model. There are several types of
biclusters and corresponding algorithms, ad hoc most of the time. In this paper, our
main contribution shows how the biclusters of similar values on columns can be
characterized or generated from formal concepts, pattern concepts and triadic
concepts. Bringing back this problem of biclustering into formal concept
analysis settings allows the usage of existing and e cient algorithms without any
modi cations. However, and this is among the perspectives of research, several
optimizations can be made. For example, with the triadic method, one should
not generate both concepts (A; B; C) and (A; C; B): they are redundant since
only concepts with B = C correspond to maximal biclusters.
7</p>
    </sec>
    <sec id="sec-7">
      <title>Appendix: Proof of proposition 1</title>
      <p>We introduce notations, before to recall and prove Proposition 1 that relates
maximal biclusters to interval pattern concepts of a pattern structure. The
intuition lies in the relation between the set of attributes M of (G; M; W; I)) in an
interval pattern structure (G; (D; u); ). Let d = h[a1; b1]; [a2; b2]; : : : ; [an; bn]i 2
D be a pattern interval in an interval pattern structure (G; (D; u); ), where
jM j = n. For any mi 2 M , we de ne: d(mi) = [ai; bi]. and jd(mi)j = jai bij.
De nition 5. Let d be a pattern in an interval pattern structure (G; (D; u); ).
The function attr : D 7! M is de ned as: attr(d) = fm 2 M j jd(m)j g.
De nition 6. Let A G be a set of objects and m 2 M an attribute. We
de ne: A(m) = fg(m) j g 2 Bg. For instance, in Table 1, if A = fg1; g2; g3g,
then, A(m4) = f2; 8; 9g.</p>
      <sec id="sec-7-1">
        <title>Proposition 4. For A</title>
        <p>G, we have that, for all mi 2 M :
A</p>
        <p>= h[min(A(m1)); max(A(m1))]; : : : ; [min(A(mn)); max(A(mn))]i
Proof. Since the operation u is associative and commutative, we have that
A = l gi = h[min(A(m1)); max(A(m1))]; : : : ; [min(A(mn)); max(A(mn))]i</p>
        <sec id="sec-7-1-1">
          <title>Proposition 5. Consider a numerical dataset (G; M; W; I) as an interval pat</title>
          <p>tern structure (G; (D; u); ). For any maximal bicluster (A; B), we de ne: d =
A . Then: 1. B = attr(d) and 2. (A; D) is a pattern concept in (G; (D; u); ).</p>
        </sec>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>J.</given-names>
            <surname>Baixeries</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kaytoue</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Napoli</surname>
          </string-name>
          .
          <article-title>Computing similarity dependencies with pattern structures</article-title>
          . In M.
          <article-title>Ojeda-Aciego and</article-title>
          J. Outrata, editors,
          <source>CLA</source>
          , volume
          <volume>1062</volume>
          <source>of CEUR Workshop Proceedings</source>
          , pages
          <volume>33</volume>
          {
          <fpage>44</fpage>
          . CEUR-WS.org,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>J.</given-names>
            <surname>Baixeries</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kaytoue</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Napoli</surname>
          </string-name>
          .
          <article-title>Characterizing Functional Dependencies in Formal Concept Analysis with Pattern Structures</article-title>
          .
          <source>Annals of Mathematics and Arti cial Intelligence</source>
          , pages
          <fpage>1</fpage>
          {
          <fpage>21</fpage>
          ,
          <string-name>
            <surname>Jan</surname>
          </string-name>
          .
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>J.</given-names>
            <surname>Besson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Robardet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. D.</given-names>
            <surname>Raedt</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.-F.</given-names>
            <surname>Boulicaut</surname>
          </string-name>
          .
          <article-title>Mining bi-sets in numerical data</article-title>
          . In S. Dzeroski and J. Struyf, editors,
          <source>KDID</source>
          , volume
          <volume>4747</volume>
          of Lecture Notes in Computer Science, pages
          <volume>11</volume>
          {
          <fpage>23</fpage>
          . Springer,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>A.</given-names>
            <surname>Califano</surname>
          </string-name>
          , G. Stolovitzky, and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Tu</surname>
          </string-name>
          .
          <article-title>Analysis of gene expression microarrays for phenotype classi cation</article-title>
          . In P. E. Bourne,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gribskov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. B.</given-names>
            <surname>Altman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Jensen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. A.</given-names>
            <surname>Hope</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Lengauer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. C.</given-names>
            <surname>Mitchell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. D.</given-names>
            <surname>Schee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Strande</surname>
          </string-name>
          , and H. Weissig, editors,
          <source>Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology, August 19-23</source>
          ,
          <year>2000</year>
          , La Jolla / San Diego, CA, USA, pages
          <volume>75</volume>
          {
          <fpage>85</fpage>
          . AAAI,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>L.</given-names>
            <surname>Cerf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Besson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Robardet</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.-F.</given-names>
            <surname>Boulicaut</surname>
          </string-name>
          .
          <article-title>Closed patterns meet n-ary relations</article-title>
          .
          <source>TKDD</source>
          ,
          <volume>3</volume>
          (
          <issue>1</issue>
          ),
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>V.</given-names>
            <surname>Codocedo</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Napoli</surname>
          </string-name>
          .
          <article-title>Lattice-based biclustering using Partition Pattern Structures</article-title>
          .
          <source>In 21st European Conference on Arti cial Intelligence (ECAI)</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>A. V.</given-names>
            <surname>Freitas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Ayadi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Elloumi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Oliveira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Oliveira</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.-K.</given-names>
            <surname>Hao</surname>
          </string-name>
          .
          <article-title>Biological Knowledge Discovery Handbook: Preprocessing, Mining, and Postprocessing of Biological Data, chapter Survey on Biclustering of Gene Expression Data</article-title>
          . John Wiley &amp; Sons, Inc.,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>B.</given-names>
            <surname>Ganter</surname>
          </string-name>
          and
          <string-name>
            <given-names>S. O.</given-names>
            <surname>Kuznetsov</surname>
          </string-name>
          .
          <article-title>Pattern structures and their projections</article-title>
          .
          <source>In ICCS '01: Proceedings of the 9th International Conference on Conceptual Structures</source>
          , pages
          <volume>129</volume>
          {
          <fpage>142</fpage>
          . Vol.
          <volume>2120</volume>
          , Springer-Verlag,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>B.</given-names>
            <surname>Ganter</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.</given-names>
            <surname>Wille</surname>
          </string-name>
          .
          <source>Formal Concept Analysis</source>
          . Springer,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10. R. Jaschke,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hotho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Schmitz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Ganter</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Stumme</surname>
          </string-name>
          .
          <article-title>Trias - an algorithm for mining iceberg tri-lattices</article-title>
          .
          <source>In ICDM</source>
          , pages
          <volume>907</volume>
          {
          <fpage>911</fpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>M. Kaytoue</surname>
            ,
            <given-names>S. O.</given-names>
          </string-name>
          <string-name>
            <surname>Kuznetsov</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Macko</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Napoli</surname>
          </string-name>
          .
          <article-title>Biclustering meets triadic concept analysis</article-title>
          .
          <source>Annals of Mathematics and Arti cial Intelligence</source>
          ,
          <volume>70</volume>
          (
          <issue>1-2</issue>
          ),
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>M. Kaytoue</surname>
            ,
            <given-names>S. O.</given-names>
          </string-name>
          <string-name>
            <surname>Kuznetsov</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Napoli</surname>
          </string-name>
          .
          <article-title>Biclustering numerical data in formal concept analysis</article-title>
          . In P. Valtchev and R. Jaschke, editors,
          <source>ICFCA</source>
          , volume
          <volume>6628</volume>
          <source>of LNCS</source>
          , pages
          <volume>135</volume>
          {
          <fpage>150</fpage>
          . Springer,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>M. Kaytoue</surname>
            ,
            <given-names>S. O.</given-names>
          </string-name>
          <string-name>
            <surname>Kuznetsov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Napoli</surname>
            , and
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Duplessis</surname>
          </string-name>
          .
          <article-title>Mining gene expression data with pattern structures in formal concept analysis</article-title>
          .
          <source>Information Science</source>
          ,
          <volume>181</volume>
          (
          <issue>10</issue>
          ):
          <year>1989</year>
          {
          <year>2001</year>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <given-names>S. O.</given-names>
            <surname>Kuznetsov</surname>
          </string-name>
          .
          <article-title>Galois connections in data analysis: Contributions from the soviet era and modern russian research</article-title>
          . In B. Ganter, G. Stumme, and R. Wille, editors,
          <source>Formal Concept Analysis</source>
          , volume
          <volume>3626</volume>
          of Lecture Notes in Computer Science, pages
          <volume>196</volume>
          {
          <fpage>225</fpage>
          . Springer,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <given-names>S. O.</given-names>
            <surname>Kuznetsov</surname>
          </string-name>
          and
          <string-name>
            <surname>S. A. Obiedkov.</surname>
          </string-name>
          <article-title>Comparing performance of algorithms for generating concept lattices</article-title>
          .
          <source>J. Exp. Theor. Artif. Intell.</source>
          ,
          <volume>14</volume>
          (
          <issue>2-3</issue>
          ):
          <volume>189</volume>
          {
          <fpage>216</fpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <given-names>F.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.</given-names>
            <surname>Wille</surname>
          </string-name>
          .
          <article-title>A triadic approach to formal concept analysis</article-title>
          .
          <source>In ICCS</source>
          , volume
          <volume>954</volume>
          <source>of LNCS</source>
          , pages
          <volume>32</volume>
          {
          <fpage>43</fpage>
          . Springer,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <given-names>S.</given-names>
            <surname>Madeira</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Oliveira</surname>
          </string-name>
          .
          <article-title>Biclustering algorithms for biological data analysis: a survey</article-title>
          .
          <source>IEEE/ACM Transactions on Computational Biology and Bioinformatics</source>
          ,
          <volume>1</volume>
          (
          <issue>1</issue>
          ):
          <volume>24</volume>
          {
          <fpage>45</fpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <given-names>N.</given-names>
            <surname>Rogovschi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Labiod</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Nadif</surname>
          </string-name>
          .
          <article-title>A spectral algorithm for topographical co-clustering</article-title>
          .
          <source>In IJCNN</source>
          , pages
          <article-title>1{6</article-title>
          . IEEE,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19. T. Uno,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kiyomi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Arimura</surname>
          </string-name>
          .
          <article-title>Lcm ver. 2: E cient mining algorithms for frequent/closed/maximal itemsets</article-title>
          . In R. J. B.
          <string-name>
            <surname>Jr.</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Goethals</surname>
          </string-name>
          , and M. J. Zaki, editors,
          <source>FIMI</source>
          , volume
          <volume>126</volume>
          <source>of CEUR Workshop Proceedings. CEUR-WS.org</source>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20. D. van der Merwe, S. Obiedkov, and
          <string-name>
            <given-names>D.</given-names>
            <surname>Kourie</surname>
          </string-name>
          .
          <article-title>AddIntent: A New Incremental Algorithm for Constructing Concept Lattices</article-title>
          . In P. Eklund, editor,
          <source>Concept Lattices</source>
          , volume
          <volume>2961</volume>
          <source>of LNCS</source>
          , pages
          <volume>205</volume>
          {
          <fpage>206</fpage>
          . Springer, Berlin/Heidelberg,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <given-names>R.</given-names>
            <surname>Veroneze</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Banerjee</surname>
          </string-name>
          , and
          <string-name>
            <given-names>F. J. V.</given-names>
            <surname>Zuben</surname>
          </string-name>
          .
          <article-title>Enumerating all maximal biclusters in real-valued datasets</article-title>
          .
          <source>CoRR, abs/1403.3562</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          Proof. 1. B =
          <article-title>attr(d). We prove that m 2 attr(b) $ m 2 B. Since B = A , then, by the de nition of maximal bicluster we have that 8m 2 M : m 2 B $ jA(m)j , if and only if jmin(A(m)) max(A(m))j if and only if (by the de nition of d) m 2 attr(d). tu 2. We need to prove that A = d and that A = d. A = d holds by the de nition of d. As for A = d , we take g 2 d , which means that 8m 2 M : g(m) 2 d(m), also if m 2 B, which implies that g 2 A by de nition of maximal bicluster</article-title>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>