<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Computing Left-Minimal Direct Basis of implications</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Pablo Cordero</string-name>
          <email>pcordero@uma.es</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Manuel Enciso</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Angel Mora</string-name>
          <email>amora@ctima.uma.es</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Manuel Ojeda-Aciego</string-name>
          <email>aciego@ctima.uma.es</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Ma ́laga</institution>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <fpage>293</fpage>
      <lpage>298</lpage>
      <abstract>
        <p>The most popular basis in Formal Concept Analysis is the Duquenne-Guigues basis, which ensure minimality in the number of dependencies and it is built with pseudo-intents, and some method to calculate these basis from an arbitrary set of implications have been introduced. We propose in this paper, an automated method to calculate a left-minimal direct basis from the set of all implications built between a closed set and its corresponding minimal generators. The new basis also has the minimal property demanded in the Duquenne-Guigues basis. It is minimal in the cardinal of the set of implications, and minimal in the size of the left-hand side of the implications.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Formal Concept Analysis [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] has shown to be a powerful framework to discover
knowledge inside date sets. It provides a solid and formal theory which enhances
other well known approaches. The main element of FCA community are binary
relations between objects and attributes, which are described using matrixes
(contexts) and represent the appearance of the attributes in the corresponding
objects. One outstanding element in FCA is attribute implication, which is used
to specify a constraint in the context. Thus we write an implication between
attribute sets X and Y in the form X →Y whenever any object in the context
which has all the attributes in X also has all the attributes in Y .
      </p>
      <p>
        Attribute implication can be managed syntactically using Armstrong’s
Axioms [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], a sound and complete inference system. This axiomatic system allows
us to derive new attribute implications that hold in a given context. This
“inference” relation leads to the following problem: How to characterize the minimal
set of implications for a given set of implications? Among the different basis
notions, the Duquenne-Guigues basis [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] also called stem basis seems has to be
cited because of their widely acceptation in the FCA area and because of its
minimality notion (w.r.t. the number of implications). Nevertheless, minimality
in the number of implications is a criteria that may be enhanced.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] K. Bertet and B. Monjardet provided a set of orthogonal
characteristics of the basis and established the equivalence of five definitions presented
by different authors in several areas which correspond with the same notion of
basis.
c paper author(s), 2013. Published in Manuel Ojeda-Aciego, Jan Outrata (Eds.): CLA
2013, pp. 293{298, ISBN 978{2{7466{6566{8, Laboratory L3i, University of La
Rochelle, 2013. Copying permitted only for private and academic purposes.
In [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] we present a method to compute all the closed sets and its minimal
generators from a context. This information allows us to built a set of
implications whose left had side is a minimal generator and with its closed set in the
right hand side. We propose in this paper a definition of basis with the good
minimality property of Duquenne-Guigues basis and the characteristics of the above
implications: minimal information in the left had side and a fast computation of
attributes closures from them.
      </p>
      <p>We also introduce an automated method to calculate from a set of
implications a left-minimal direct basis. The new method is based on the Simplification
Logic for Funcional Dependency SLFD [?], a sound and complete inference
system for implications. The main characteristics of SLFD is that its inference system
is not built around the transitivity rule, like other well known Armstrong-like
axiomatic systems for implications.</p>
      <p>
        The work is organized as follows. In Section 2 we summarize preliminary
concepts and results on FCA concerning implications, basis, etc. In Section 3 we
outline the automated method that we have proposed in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] for the computation
of minimal generators and we introduce a new method to calculate the
leftminimal direct basis from the original set of implications. The paper ends with
a Conclusion Section.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Background</title>
      <p>
        We will use the well-kwon notation used on Formal Concept Analysis (FCA) [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
For the analysis of the information contained in the context K = (G, M, I), a
direction is the study of the pair (closed sets - minimal generators). The set
of attributes A is said to be a minimal generator (mingen) if, for all set of
attributes X ⊆ A if X00 = A00 then X = A.
      </p>
      <p>
        A relevant notion in the framework of Formal Concept Analysis is the concept
of attribute implication [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. This area is devoted to obtain knowledge from a
context that is a table in which attributes and objects are related. An attribute
implication is an expression A → B where A and B are sets of attributes. A
context satisfies A → B if every object that has all the attributes in A has also
all the attributes in B. It is well known that the sets of attribute implications
that are valid in a context satisfies the Armstrong’s Axioms. Although the two
interpretations of formulas (functional dependency and attribute implication)
are different, they have the same concept of semantic entailment. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]
      </p>
      <p>Alternatively, attribute implications allow us to capture all the information
which can be deduced from a context. The set of all valid implications in a
context may be syntactically managed by means of the following inference system
known as Armstrong’s axioms. An implication basis of K is defined as a set L
of implications of K from which any valid implication for K can be deduced by
using Armstrong rules.</p>
      <p>
        The goal is to obtain an implication basis with minimal size. This condition
is satisfied by the so-called Duquenne-Guigues (or stem) basis [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. However,
the definition of the Duquenne-Guigues basis refers to minimality only in the
cardinality of the set of formulas, but as we have showed in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] with an illustrative
example, redundant attributes use to appear in this kind of minimal basis.
      </p>
      <p>
        In the following, a summary of some interesting result of this survey are
showed. More specifically we present the definitions of some characteristics
studied in the survey that will be used to identify the kind of basis we introduce in
this paper. In the practice, it is interesting considering some properties [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] related
with minimality of them, in order to achieve efficiency.
      </p>
      <p>Definition 1. A set of implications Γ it is said
– minimal or non-redundant if Γ \ {X → Y } is not equivalent to Γ .
– minimum if if it is of least cardinality, that is, | Γ |≤| Γ 0 | for all set of
implications Γ 0 equivalent to Γ .
– optimal if || Γ ||≤|| Γ 0 || for all set of implications Γ 0 equivalent to Γ , where
the size of Γ is defined as
|| Γ ||=</p>
      <p>X
{X→Y ∈Γ }
(| X | + | Y |)</p>
      <p>And finally the two characteristics that constitutes the center of our basis
are introduced:
Definition 2. Let Γ = {X0 → Y0, . . . Xn → Yn} be a set of implications, it is
said a left-minimal basis if there does not exist a Xi → Yi and a subset Xi0 ( Xi
such that Γ \ {Xi → Yi} ∪ {Xi0 → Yi} is equivalent to Γ .</p>
      <p>A set of implications Γ and is said direct if for all implication A → B the set
A ∪ B is a closed set w.r.t. Γ .
3</p>
    </sec>
    <sec id="sec-3">
      <title>Obtaining basis from minimal generators</title>
      <p>
        Some methods to obtain generators of closed sets have been studied in [
        <xref ref-type="bibr" rid="ref12 ref13 ref7">7, 12,
13</xref>
        ]. Moreover, minimal generators [
        <xref ref-type="bibr" rid="ref10 ref11">10, 11</xref>
        ] appear in the literature under different
names in various fields, for instance they are the minimal keys of the tables in
relational databases. In [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], the authors emphasize the importance of studying
minimal generators although “they have been paid little attention so far in the
FCA literature”.
      </p>
      <p>
        We agree with these authors about the importance of the study of closed sets
and minimal generators. They constitute a source of essential information to
analyze a formal context. As we mention in the introduction, in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] we illustrated
the use of the Simplification paradigm to guide the search of all minimal
generator sets. Thus, we introduce a method named MinGen [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] which computes a list
of all pairs Φ =&lt;closed set, minimal generators&gt; from an arbitrary set of
implications The goal of this paper can be considered as the reserve direction of
the way we presented there. We will introduce a method to transform these set of
pairs into a basis, preserving two good properties (fast computation of attribute
closure and minimal left hand side in the implications) and providing
minimality in the number of implications.Thus, our goal is to achieve from the minimal
generators and closed sets a basis of implications fulfilling left-minimality and
directness.
      </p>
      <p>
        In the literature, the most cited algorithm to compute Duquenne-Guigues
basis is the Ganter Algorithm [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Algorithm 1 is an adaptation of the algorithm
showed in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] to compute a Duquenne-Guigues basis from a set of LSI (labelled
set of items) [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], and we obtain a Duquenne-Guigues basis.
      </p>
      <p>Algorithm 1: Algorithm for computing a Duquenne-Guigues basis
input : An LSI Φ
output: A Duquenne-Guigues basis
T := ∅;
foreach hB, mg(B)i ∈ Φ do</p>
      <p>foreach A ∈ mg(B) do T := T ∪ {A → B}
repeat</p>
      <p>S := T ;
foreach A → B, C → D ∈ T such that A C and B 6⊆ C do</p>
      <p>T := T r {C → D};
if B ∪ C 6= D then T := T ∪ {BC → D}
until T = S ;
S := ∅;
foreach A → B ∈ T do S := S ∪ {A → B r A};
return S</p>
      <p>
        In the following example, we show how to link the above algorithm with the
work presented in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>
        Example 1. For the input T = {ab → c, ac → bd, b → d, d → c}, Algorithm
MinGen 0 (see [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]) returns Φ = {&lt; abcd, {ab, ac, ad} &gt;, &lt; bcd, {b} &gt;, &lt; cd, {d} &gt;
} and from here Algorithm 1 renders Γ = {d → cd, b → bcd, ac → abcd}, which
corresponds to a Duquenne-Guigues basis.
      </p>
      <p>The following theorem ensures the minimality (w.r.t. the cardinality) of the
Duquenne-Guigues basis.</p>
      <p>Theorem 1. Any Duquenne-Guigues basis is a minimum basis.</p>
      <p>In the following, we describe the algorihtm 2 to calculate a Left-Minimal
Direct Basis based on Algorithm 1. The algorithm is polynomial and it is described
searching a good understanding. Of course, an implementation using lectic order
would improve considerably its efficiency.</p>
      <p>First, we consider the following equivalence rules.</p>
      <p>Definition 3 (Aggregation rules). Let A, B, C and D be sets of attributes.
1. If A ⊆ C then {A → B, C → D} ≡ {A → B, BC → D r B}.
2. If A ⊆ C ⊆ A ∪ B then {A → B, C → D} ≡ {A → BD}.
Algorithm 2: Algorithm for computing a Left-Minimal Direct Basis
input : An LSI Φ
output: A Minimal Direct Basis
T := ∅;
foreach hB, mg(B)i ∈ Φ do</p>
      <p>foreach A ∈ mg(B) do T := T ∪ {(A, A → B)}
repeat</p>
      <p>S := T ;
foreach (M, A → B), (N, C → D) ∈ T such that A</p>
      <p>T := T r {(N, C → D)};
if B ∪ C 6= D then T := T ∪ {(N, BC → D)}
until T = S ;
S := ∅;
foreach (M, A → B) ∈ T do S := S ∪ {M → B r M };
return S
C and B 6⊆ C do</p>
      <p>We let for an extended version of this paper the proof that these equivalences
rules are derived rules of equivalences presented in Theorem ??.</p>
      <p>We propose in the Algorithm 2 the use of the two aggregation rules for
reducing the number of implications and also the consequent of implications.
Theorem 2. Let T = {(M1, A1 → B1), . . .} be a set of pairs with minimal
generators and implications obtained from minimal generators and closed sets.
The exhaustive application of the two Aggregation rules produces a left-minimal
direct basis.</p>
      <p>
        Example 2. Let T = {b → agh, d → a, bn → h, ab → def g, abc → djk} be
a set of implications, Algorithm MinGen 0 ([
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]) returns a list with closed sets
and their minimal generators, Φ = {&lt; abdef gh, {b} &gt;, &lt; abcdef ghkj, {bc} &gt;,
&lt; abdef ghn, {bn} &gt;, &lt; abcdef ghkn, {bcn} &gt;, &lt; ad, {d} &gt;}
      </p>
      <p>In the first step of the Algorithm 2 with Φ we build T = {(b, b→abdefgh),
(bc,bc →abcdefghkj), (bn, bn → abdef ghn), (bcn, bcn → abcdef ghkn), (d, d →
ad)}.</p>
      <p>Then, we apply the Aggregation rules foreach couple of elements in T . At
the end of these comparisons, we have T = {(b, b → abdef gh), (bc, abcdef gh →
abcdef ghkj), (d, d → ad)}. And from this, in the last foreach of the Algorithm 2,
it renders the following left-minimal direct basis S = {b → adef gh, bc →
adef ghkj, d → a}.</p>
      <p>By the other side, Algorithm 1 return the following Duquenne-Guigues basis
{b → adef gh, abcdef gh → kj, d → a} which in a minimal basis, but not a
left-minimal one.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>In this paper we present an algorithm which allows the transformation of a set
of all closed sets and their corresponding minimal generators into a left-minimal
direct basis. The study about the soundness, completeness, and complexity of
the algorithms proposed are left to a extended paper.</p>
      <p>The new method uses some equivalences deduced in the SLFD Logic and
follows the Lectic order to traverse the list of minimal generators and implications
associated and return a set of implications but with good properties.</p>
      <p>
        As future work we propose to extend the Duquenne-Guigues basis definition
to consider a generalized fuzzy extension of implications. We propose the
definition introduced in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] that has been shown to be the most general one. In [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] a
non trivial extension of the SLFD Logic for the generalized definition of fuzzy
functional dependency was introduced. The generalized version of the SLFD Logic will
be used to develop a method to get basis for the generalized fuzzy implications.
      </p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgment</title>
      <p>Supported by Grants TIN2011-28084 of the Science and Innovation Ministry of
Spain, and No. P09-FQM-5233 of the Junta de Andaluc´ıa.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1. William W. Armstrong,
          <article-title>Dependency structures of data base relationships</article-title>
          ,
          <source>Proc. IFIP Congress</source>
          . North Holland, Amsterdam:
          <fpage>580</fpage>
          -
          <lpage>583</lpage>
          ,
          <year>1974</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Radim</given-names>
            <surname>Belohla</surname>
          </string-name>
          <article-title>´vek, and Vil´em Vychodil, Functional Dependencies of Data Tables Over Domains with Similarity Relations</article-title>
          ,
          <source>Proc. of the 2nd Indian International Conference on Artificial Intelligence</source>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Radim</given-names>
            <surname>Belohla</surname>
          </string-name>
          <article-title>´vek, and Vil´em Vychodil, Formal Concept Analysis with Constraints by Closure Operators</article-title>
          , Lecture Notes in Computer Science,
          <volume>4068</volume>
          :
          <fpage>131</fpage>
          -
          <lpage>143</lpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4. Radim Belohla´vek, Pablo Cordero,
          <string-name>
            <given-names>Manuel</given-names>
            <surname>Enciso</surname>
          </string-name>
          ,
          <article-title>A´ngel Mora, and Vil´em Vychodil, An Efficient Reasoning Method for Dependencies over Similarity</article-title>
          and
          <source>Ordinal Data, Lecture Notes in Computer Science</source>
          ,
          <volume>7647</volume>
          :
          <fpage>408</fpage>
          -
          <lpage>419</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>K.</given-names>
            <surname>Bertet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Monjardet</surname>
          </string-name>
          ,
          <article-title>The multiple facets of the canonical direct unit implicational basis</article-title>
          ,
          <source>Theor. Comput. Sci.</source>
          ,
          <volume>411</volume>
          (
          <fpage>22</fpage>
          -24):
          <fpage>2155</fpage>
          -
          <lpage>2166</lpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Pablo</given-names>
            <surname>Cordero</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Manuel</given-names>
            <surname>Enciso</surname>
          </string-name>
          ,
          <article-title>A´ngel Mora, Manuel Ojeda-Aciego, Computing Minimal Generators from Implications: a Logic-guided Approach</article-title>
          ,
          <string-name>
            <surname>CLA</surname>
          </string-name>
          <year>2012</year>
          :
          <fpage>187</fpage>
          -
          <lpage>198</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>A.</given-names>
            <surname>Day</surname>
          </string-name>
          ,
          <article-title>The lattice theory of functional dependencies and normal decompositions</article-title>
          ,
          <source>International Journal of Algebra and Computation</source>
          ,
          <volume>2</volume>
          : pp.
          <fpage>209</fpage>
          -
          <lpage>431</lpage>
          ,
          <year>1990</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>B.</given-names>
            <surname>Ganter</surname>
          </string-name>
          ,
          <article-title>Two basic algorithms in concept analysis</article-title>
          ,
          <source>Technische Hochschule, Darmstadt</source>
          ,
          <year>1984</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>J.L.</given-names>
            <surname>Guigues</surname>
          </string-name>
          and
          <string-name>
            <given-names>V.</given-names>
            <surname>Duquenne</surname>
          </string-name>
          ,
          <article-title>Familles minimales d'implications informatives r´esultant d'un tableau de donn´ees binaires</article-title>
          ,
          <source>Math. Sci. Humaines</source>
          :
          <volume>95</volume>
          ,
          <fpage>5</fpage>
          -
          <lpage>18</lpage>
          ,
          <year>1986</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <given-names>L.</given-names>
            <surname>Szathmary</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Napoli</surname>
          </string-name>
          and
          <string-name>
            <given-names>S. O.</given-names>
            <surname>Kuznetsov</surname>
          </string-name>
          , ZART:
          <string-name>
            <given-names>A Multifunctional</given-names>
            <surname>Itemset Mining Algorithm</surname>
          </string-name>
          ,
          <source>Proc. of the 6th Intl. Conf. on Concept Lattices and Their Applications (CLA '08)</source>
          :
          <fpage>47</fpage>
          -
          <lpage>58</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <given-names>L.</given-names>
            <surname>Szathmary</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Valtchev</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Napoli</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.</given-names>
            <surname>Godin</surname>
          </string-name>
          ,
          <article-title>Efficient Vertical Mining of Frequent Closures and Generators</article-title>
          , LNCS
          <volume>5772</volume>
          :
          <fpage>393</fpage>
          -
          <lpage>404</lpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>C. Frambourg</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Valtchev</surname>
            , and
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Godin</surname>
          </string-name>
          ,
          <article-title>Merge-based computation of minimal generators</article-title>
          .
          <source>Discrete Mathematics, LNCS</source>
          <volume>3596</volume>
          :
          <fpage>181</fpage>
          -
          <lpage>194</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13. K. Nehm´e, P. Valtchev,
          <string-name>
            <given-names>M. H.</given-names>
            <surname>Rouane</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Godin</surname>
          </string-name>
          ,
          <article-title>On Computing the Minimal Generator Family for Concept Lattices and Icebergs</article-title>
          , LNCS
          <volume>3403</volume>
          :
          <fpage>192</fpage>
          -
          <lpage>207</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>