<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Simple Structure in Boolean Matrix Factorization</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Martin Trnecka</string-name>
          <email>martin.trnecka@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marketa Trneckova</string-name>
          <email>marketa.trneckova@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Dept. Computer Science Palacky ́ University Olomouc</institution>
          ,
          <addr-line>Olomouc</addr-line>
          ,
          <country country="CZ">Czech Republic</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2033</year>
      </pub-date>
      <fpage>0000</fpage>
      <lpage>0001</lpage>
      <abstract>
        <p>We are going back to the root of Boolean matrix factorization (BMF)-to factor analysis-and we examine an implementation of the simple structure in BMF. We propose a Boolean counterpart of the simple structure, i.e. a novel way how to evaluate interpretability of factors. Moreover, we discuss the proposed approach from the formal concept analysis perspective, and we provide an analysis of the interpretability of results obtained via selected the state-of-the-art BMF algorithms on real-world data. Additionally, we propose a novel BMF algorithm utilizing a main criterion for the factor selection based on the simple structure.</p>
      </abstract>
      <kwd-group>
        <kwd>Boolean factor analysis</kwd>
        <kwd>Interpretation of results</kwd>
        <kwd>Simple structure</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Boolean matrix factorization (BMF) is a natural and immensely popular way of
summarizing binary (yes/no) data via new fundamental variables, called factors,
that are implicitly hidden in the data. In the past, the BMF was successfully used
in various fields, e.g. role-based access control [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], computational biology [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ],
recommender systems [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], logic circuit synthesis [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], classification [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], computer
network analysis [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] and many other fields of study.
      </p>
      <p>The general aim of BMF is to simplify complex data. More precisely, for a
given Boolean matrix I ∈ {0, 1}m×n, with m objects and n attributes, to find
matrices A ∈ {0, 1}m×k and B ∈ {0, 1}k×n for which
Operator ◦ represents Boolean matrix multiplication, i.e.</p>
      <p>I ≈ A ◦ B.
and the factors, and B captures a relation between the factors and the original
attributes.</p>
      <p>
        While, BMF is widely used tool in general data-mining, the main motivation
for the BMF comes from the psychology and the social sciences. In a broad sense,
BMF is an implementation of the general idea of factor analysis introduced by
psychologist Charles Spearman [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. This is not surprising. In a fact BMF comes
from the psychology, where Boolean data often occur (see e.g. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]). Surprisingly,
in BMF only small attention to the crucial aspect of the factor analysis—an
interpretation of factors itself—is payed.
      </p>
      <p>
        In the factor analysis, the interpretation of one particular factor is based on
its loading scores, i.e. how much are the original attributes involved in the factor
description. This view to factors is called a Royce’s model [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] of the factor. The
interpretation of a factorization, i.e. interpretation of all factors, is driven by the
law of parsimony, well known as Occam’s razor, i.e. we should pick the simplest
explanation of facts. A solution selected via the parsimony law is called simple
structure.
      </p>
      <p>
        In BMF, the question of how to interpret one particular factor is usually
answered with a help of formal concept analysis (FCA)[
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], however the Royce
model may also be adopted, especially in the case of general factorization. Before
we explain this, let us remind the basic connection between BMF and FCA.
      </p>
      <p>For every I ∈ {0, 1}n×m one may associate a pair h↑, ↓i of arrow operators
defined by</p>
      <p>C↑ = {j ∈ Y | ∀i ∈ C : Iij = 1} and D↓ = {i ∈ X | ∀j ∈ D : Iij = 1},
where C is a subset of all objects X = {1, . . . , m} of I and D is subset of all
attributes Y = {1, . . . , n} of I. A pair hC, Di for which C↑ = D and D↓ = C
is called formal concept. The set of all formal concepts for I is defined in the
following way</p>
      <p>B(I) = {hC, Di | C ⊆ X, D ⊆ Y, C↑ = D, D↓ = C}.</p>
      <p>
        Now, we explain an important connection—originally described in a pioneer
work [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]—between a set of formal concepts and the Boolean matrix factorization.
Every set F = {hC1, D1i, . . . , hCk, Dki} ⊆ B(I), where the indexing of the formal
concepts hCl, Dli is fixed, induces the m × k and k × n Boolean matrices AF
and BF by
(AF )il =
1, if i ∈ Cl,
0, if i 6∈ Cl,
and (BF )lj =
1, if j ∈ Dl,
0, if j 6∈ Dl,
for l = 1, . . . , k. That is, the lth column of AF and lth row BF are the
characteristic vectors of Cl and Dl, respectively. The set F is also called a set of factor
concepts. The advantage of this approach is that a factor can be seen as a formal
concept, i.e. the interpretation of the factor is straightforward: factor = formal
concept—an entity with extent and intent part.
      </p>
      <p>The Boolean counterpart of simple structure or an elementary way how to
evaluate whether the factorization is easily interpretable is missing.</p>
      <p>
        In the paper we are going back to the root of BMF—to factor analysis—
and we discuss an implementation of the simple structure introduced in [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]
by L. L. Thurstone in BMF. The main contribution of this work is following:
(i) we propose a Boolean counterpart of the simple structure, which may be
utilized in BMF, i.e. a novel way how to evaluate a quality (interpretability)
of factors and we discuss it from the formal concept analysis perspective; (ii)
we evaluate the interpretability of factorization delivered by the state-of-the-art
BMF algorithms; and (iii) we propose a novel BMF algorithm which uses the
simple structure as a main criterion for the factor selection.
      </p>
      <p>The rest of this paper is organized as follows. The next section provides basic
insight to the simple structure and discusses its formalization that can be used
in BMF. In Section 3, an evaluation of selected existing the state-of-the-art BMF
algorithms from an interpretability viewpoint is performed. Section 4 describes a
new BMF algorithm using our formalization of simple structure as a criterion for
the factor selection as well as its experimental evaluation. The paper is concluded
by Section 5.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Simple structure</title>
      <p>In the factor analysis, the question of the interpretability of factors is decided by
the law of parsimony, well known as Occam’s razor. In other words the simplest
explanation is the best one. Such explanation is selected via the parsimony law,
and it is called a simple structure.</p>
      <p>Even though, the BMF has motivation in the factor analysis, an
interpretation of factors has only a small attention in contemporary BMF research. This is
surprising because the interpretation of factors is a crucial part of factor analysis.
2.1</p>
      <sec id="sec-2-1">
        <title>Thurstone’s simple structure criteria</title>
        <p>
          In 1947 Thurstone formalized in his work [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ] five criteria of the simple
structure. Later, Cattell in [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] claimed that the simple structure factors are (usually)
simple to interpret. Unfortunately, the Thurstone’s formalization of good factors
is informal, vague and verbally described. According to Thurstone, interpretable
factors satisfy the following five conditions:
1. Each row of the rotated matrix should contain at least one zero.
2. In each factor, the minimum number of zero loadings should be the number
of factors in the rotation.
3. For every pair of factors there should be variables with zero loadings on one
and significant loadings on the other.
4. For every pair of factors a large portion of the loadings should be zero, at
least in a matrix with a large number of factors.
5. For every pair of factors there should be only a few variables with significant
loadings on both factors.
        </p>
        <p>Let us note, that the loading matrix is the factor-attribute matrix. Basically,
the criterion of the simple structure is derived from properties of the loading
matrix. This approach adopts the Royce definition of factor, namely “factor is
a construct operationally defined by its factor loadings”. In other words, factors
are represented via attributes that are manifestation of them. In our terminology,
factors are represented via rows of matrix B.</p>
        <p>The third and the fifth criterion are of overriding importance, namely the
loading matrix is the simple structure if each pair of factors have a few high
loadings.
2.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Simple structure for BMF</title>
        <p>
          There have been many attempts—without noticeable success—to formalize the
simple structure (see e.g. [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]) in factor analysis. In [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ] a formalization of the
Thurstone’s five criteria via logical formulas for a decomposition of matrices over
a finite scale is proposed. Despite the fact, that the BMF is a special case of the
decomposition over a finite scale [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ], in the following part we discuss that the
approach in [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ] cannot be straightforwardly adopted in the Boolean case.
        </p>
        <p>
          In BMF, the first Thurstone’s criterion is always met, with an exception
where the input data contain an object which have all attributes. The second
criterion may not be easy to satisfy, especially if an exact decomposition is
required. In such case BMF algorithms may produce greater number of factors
than attributes (for more details see e.g. [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]). The third criterion can be
understood as follows: for every pair of factors, there should be an attribute which is
a manifestation of one of them and is not a manifestation of the other one. As a
consequence of this, the third criterion is always met in BMF. The penultimate
criterion is in conflict with in FCA well-known fact that the maximal
rectangles are the best choice from the interpretability point of view. Namely, let us
consider a concept “dog”. If we cut some attributes from them, the
interpretation of this concept suffers. The last criterion says, that the number of common
attributes for each pair of factors should be minimal.
        </p>
        <p>
          Since the first and the third criterion are meaningless in BMF and the fourth
criterion violates a basic view on Boolean factors, we can conclude, that the
well-interpretable factorization satisfies the condition that all pairs of factors
have the number of common attributes as small as possible. This is consistent
with an observation presented [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], where a kind of dependencies between factors
in a presence of noise in data was observed. Obviously, if the sets of attributes,
that describe factors, are very similar, such factorization is hard to interpret.
        </p>
        <p>We define a measure, adopting the Royce model of factor, which calculate a
diversity (interpretability) of two sets of attributes A and B:
div(A, B) = 1 − max
|A ∩ B| , |A ∩ B| .</p>
        <p>|A| |B|</p>
        <p>A high value of div(A, B) indicates that sets A and B have a small number
of common attributes. div(A, B) = 1 means that the sets have no common
attributes at all.</p>
        <p>For the set of factors F , we may compute the diversity as an average diversity
of each pair of factors, i.e.</p>
        <p>div(F ) = X X div(Di, Dj )/number of pairs</p>
        <p>i j&gt;i
for Di, Dj such that hCi, Dii, hCj , Dj i ∈ F . Note, the diversity is equal to 1 for
the set of factors that have no common attributes.</p>
        <p>
          We employed the diversity measure to evaluate the quality (interpretability)
of factorization. Note that a different approach to the quality was proposed in [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ].
In what follows we emphasize a fact that the BMF problem is a set covering
problem [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ], i.e. the number of entries that are covered by a particular factor is
an important measure.
3
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Comparison of existing methods</title>
      <p>
        In the following section we address the question of the interpretability of
factors that are obtained via existing BMF algorithms. We compare—by means of
the diversity measure—results delivered by selected state-of-the-art BMF
algorithms, namely GreConD [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], Asso [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], GreEss [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], PaNDa [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], Hyper [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ],
8M [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], GreConD+ [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. An overview of these algorithms can be found e.g.
in [
        <xref ref-type="bibr" rid="ref2 ref3 ref4">2–4</xref>
        ].
3.1
      </p>
      <sec id="sec-3-1">
        <title>Datasets</title>
        <p>
          We used several, in BMF standard, real-world datasets from [
          <xref ref-type="bibr" rid="ref11 ref16 ref3">3, 11, 16</xref>
          ] that are
listed, together with their basic characteristics and the numbers of factors that
are required by selected algorithms to achieve an exact decomposition in Table 1.
        </p>
        <p>Moreover we build three new datasets CLA, Rio medal and Rio
participation.1 CLA data include as objects 128 regular contributors (authors that have
more than one contribution) on CLA conference. The attributes of CLA data
consist of 13 individual conferences (years). Both Rio datasets include 207
objects representing countries participated on Olympic games in Rio in 2016 and
28 attributes that represent sport disciplines such as aquatic sports, cycling,
gymnastics, etc. We used two relations between objects and attributes. The first
one reflects that a particular country had a representative in a sport discipline
(Rio participation). The second one reflects if a particular country wins some
medal in sport discipline (Rio medal).
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Experimental evaluation</title>
        <p>
          We computed factorizations for each dataset in Table 1 via above mentioned
BMF methods. Table 2 shows the interpretability (the diversity measure) for sets
1 All datasets as well as implementation of below presented algorithm GreDConD
are available online at
https://github.com/Marketa-Trneckova/CLA2020-SimpleStructure-BMF.
of factors that achieve a prescribe coverage of input data (column c). Namely,
75%, 90%, 95% and 100% coverage is shown. The value N/A in the table means,
that particular algorithm is not able to achieve prescribed coverage. The coverage
is computed as a percentage of nonzero entries in data that are covered by some
factors (for more details see e.g. [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]).
        </p>
        <p>The highest values of the diversity are obtained via algorithm Hyper, which
for almost all datasets returns value 1 or a value close to 1. The main reason
is that Hyper usually produces factors including only one attribute—such
factors are really easy to interpret. This can be seen as a drawback of the simple
structure, because such factors are usually not useful in practice.</p>
        <p>Algorithms GreEss and GreConD return comparable results. Both of them
use formal concepts as factors and as a result of this both produce so-called
from-below decomposition, i.e. no zero entries in input data are covered by some
factor. In almost all cases GreEss slightly outperform GreConD.</p>
        <p>GreConD+, PaNDa and Asso return sets of factors that are not able to
cover the whole data. For example in many cases factors computed via PaNDa
cover less that 70% of input data. Differently from GreEss and GreConD,
factors produced by GreConD+, PaNDa and Asso are rectangles in data
that may contain zeros. The Royce’s model of the factor enables to evaluate
such factors. Surprisingly, 8M—the oldest BMF method—produces very good
factors, however each factor contains a large number of zero entries.
dataset
Breast
CLA
DBLP
Domino
Ecoli
Chess
Iris
Mushroom
Paleo
Post
Rio medal</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Improving existing BMF algorithms</title>
      <p>
        Almost all BMF algorithms—directly or indirectly—utilize the coverage as main
criterion for factor selection (for more details see e.g. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]). Despite the fact that
the coverage is an important criterion, it does not reflect the interpretability of
factors. Reasonable question is if we can improve existing BMF algorithms to
give a more interpretable factorization that still covers a large part of data.
      </p>
      <p>We choose GreConD—one of the most successful BMF algorithms—as a
base for a new algorithm. GreConD works as follows. To produce matrices AF
and BF GreConD uses a particular greedy search for factor concepts which
allows to compute factor formal concepts “on demand”. The algorithm constructs
the factor concepts by adding sequentially “promising columns” to candidate
hC, Di to factor concept. A new column j that maximize the number of newly
covered entries of the input data is added to hC, Di. This is repeated until no
such columns exist. If there is no such column, the hC, Di is added to the set F .</p>
      <sec id="sec-4-1">
        <title>Algorithm 1: GreDConD algorithm</title>
        <p>Input: Boolean matrix I.</p>
        <p>Output: Set F of factor concepts.</p>
        <p>We use an architecture of GreConD and we modify the computation of the
cost value. A pseudocode of the modification, which we called GreDConD2 is
depicted in Algorithm 1.</p>
        <p>A value of the cost (line 11) is computed as the number of newly covered
entries of the input matrix I multiplied by a value of diversity obtained via
procedure Diversity depicted in Algorithm 2.</p>
      </sec>
      <sec id="sec-4-2">
        <title>Algorithm 2: Diversity procedure</title>
        <p>Input: Set F of factor concepts, candidate D to the set F</p>
        <p>Output: The average diversity d.
1 foreach hAi, Bii ∈ F do
2 si ← 1 − max( |B|iB∩iD|| , |B|iD∩|D| )
3 end
4 d = P|i|F=F|1| si
5 return d</p>
        <p>Such setting of the factor selection criteria tends to prefer large factors—
factors that cover a large part of the data—that are more diverse and thus
potentially easily interpretable. Note, that the others BMF algorithms may be
improved in a similar way. In what follows we provide an experimental evaluation
of the new algorithm.
4.1</p>
      </sec>
      <sec id="sec-4-3">
        <title>Experimental evaluation</title>
        <p>We perform with GreDConD experiments described in Section 4.1. From
Table 2 one may clearly observe that GreDConD algorithm outperform
GreConD as well as GreEss (with an exception of CLA, Paleo, Rio participation
and Zoo extended). Both are its main competitors.</p>
        <p>Additionally we observed, in the case of GreConD, GreEss, GreDConD,
that the interpretability of factors tends to decrease when an exact coverage is
achieved. This points to a well-known problem, regarding the deciding what is a
good coverage for data. The decrease indicates, that factors, that are not simply
interpretable were added to factorization. As a very promising further research
seems to be investigating, if the diversity measure can be used as a stopping
criterion for a factor enumeration.</p>
        <p>Moreover, we deeply analyzed the first three factors. Figures 1, 2 and 3
represent attributes of datasets in order Zoo extended, Rio participation and Breast
dataset and show how many of the first three factors involves this attribute. The
darker square represents the higher number of factors having this attribute in
its loading. White color means, that no factor explain this attribute, black color
means, that all three factors have this attribute.
2 GreDConD is the abbreviation for Greedy Diversity Concepts on Demand.</p>
        <p>One may see from Figures 1, 2 and 3, that the first three factors obtained
via GreDConD share less attributes in comparison to other methods. Almost
the same results may be observed in the case of GreEss but these factors are
narrower and explained by smaller number of attributes (see Figure 2).</p>
        <p>Since the number of factors is an important characteristic, we should mention
that the number of factors (see Table 1) required for an exact decomposition via
GreDConD is comparable—sometimes even smaller—to the number of factors
delivered by GreConD algorithm.</p>
        <p>GreDConD</p>
        <p>GreConD</p>
        <p>GreEss
Hyper</p>
        <p>8M
GreConD+</p>
        <p>Asso
PaNDa</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>We propose a Boolean counterpart of the simple structure—which provides a
measure of interpretability of results in factor analysis—that can be utilized in
BMF. We presented a new measure that reflects the Thurston’s criterion for
the simple structure, and we evaluate factorizations delivered by the
state-ofthe-art BMF algorithms via the presented measure. Additionally we propose an
improvement one of existing BMF algorithm which uses the simple structure as
a main criterion for the factor selection. The new presented algorithm returns
slightly better results in a comparison to similar methods.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>Supported by Junior research Grant No. JG 2020 003 of the Palacky´ University
Olomouc. Support by Grant No. IGA PrF 2020 019 of IGA of Palacky´ University
is also acknowledged.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Belohlavek</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grissa</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guillaume</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nguifo</surname>
            ,
            <given-names>E.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Outrata</surname>
          </string-name>
          , J.:
          <article-title>Boolean factors as a means of clustering of interestingness measures of association rules</article-title>
          .
          <source>Annals of Mathematics and Artificial Intelligence</source>
          <volume>70</volume>
          (
          <issue>1-2</issue>
          ),
          <fpage>151</fpage>
          -
          <lpage>184</lpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Belohlavek</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Outrata</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Trnecka</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Toward quality assessment of Boolean matrix factorizations</article-title>
          .
          <source>Inf. Sci</source>
          .
          <volume>459</volume>
          ,
          <fpage>71</fpage>
          -
          <lpage>85</lpage>
          (
          <year>2018</year>
          ). https://doi.org/10.1016/j.ins.
          <year>2018</year>
          .
          <volume>05</volume>
          .016
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Belohlavek</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Trnecka</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>From-below approximations in Boolean matrix factorization: Geometry and new algorithm</article-title>
          .
          <source>J. Comput. Syst. Sci</source>
          .
          <volume>81</volume>
          (
          <issue>8</issue>
          ),
          <fpage>1678</fpage>
          -
          <lpage>1697</lpage>
          (
          <year>2015</year>
          ). https://doi.org/10.1016/j.jcss.
          <year>2015</year>
          .
          <volume>06</volume>
          .002
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Belohlavek</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Trnecka</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Handling noise in boolean matrix factorization</article-title>
          .
          <source>Int. J. Approx. Reasoning</source>
          <volume>96</volume>
          ,
          <fpage>78</fpage>
          -
          <lpage>94</lpage>
          (
          <year>2018</year>
          ). https://doi.org/10.1016/j.ijar.
          <year>2018</year>
          .
          <volume>03</volume>
          .006
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Belohlavek</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Trnecka</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>A new algorithm for boolean matrix factorization which admits overcovering</article-title>
          .
          <source>Discrete Applied Mathematics</source>
          <volume>249</volume>
          ,
          <fpage>36</fpage>
          -
          <lpage>52</lpage>
          (
          <year>2018</year>
          ). https://doi.org/10.1016/j.dam.
          <year>2017</year>
          .
          <volume>12</volume>
          .044
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Belohlavek</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vychodil</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Discovery of optimal factors in binary data via a novel method of matrix decomposition</article-title>
          .
          <source>J. Comput. Syst. Sci</source>
          .
          <volume>76</volume>
          (
          <issue>1</issue>
          ),
          <fpage>3</fpage>
          -
          <lpage>20</lpage>
          (
          <year>2010</year>
          ). https://doi.org/10.1016/j.jcss.
          <year>2009</year>
          .
          <volume>05</volume>
          .002
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Boeck</surname>
            ,
            <given-names>P.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosenberg</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Hierarchical classes: Model and data analysis</article-title>
          .
          <source>Psychometrika</source>
          <volume>53</volume>
          ,
          <fpage>361</fpage>
          -
          <lpage>381</lpage>
          (
          <year>1988</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Carroll</surname>
            ,
            <given-names>J.B.</given-names>
          </string-name>
          :
          <article-title>An analytical solution for approximating simple structure in factor analysis</article-title>
          .
          <source>Psychometrika</source>
          <volume>18</volume>
          (
          <issue>1</issue>
          ),
          <fpage>23</fpage>
          -
          <lpage>38</lpage>
          (
          <year>1953</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Cattell</surname>
          </string-name>
          , R.B.:
          <article-title>The scientific use of factor analysis in behavioral and life sciences</article-title>
          .
          <source>Springer US</source>
          (
          <year>1978</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Dixon</surname>
          </string-name>
          , W.:
          <article-title>Bmdp statistical software manual to accompany the 7.0 software release, vols 1-3</article-title>
          . (
          <year>1992</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Ene</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horne</surname>
            ,
            <given-names>W.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Milosavljevic</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rao</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schreiber</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tarjan</surname>
            ,
            <given-names>R.E.</given-names>
          </string-name>
          :
          <article-title>Fast exact and heuristic methods for role minimization problems</article-title>
          . In: Ray,
          <string-name>
            <given-names>I.</given-names>
            ,
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <surname>N.</surname>
          </string-name>
          <source>(eds.) 13th ACM Symposium on Access Control Models and Technologies</source>
          ,
          <string-name>
            <surname>SACMAT</surname>
          </string-name>
          <year>2008</year>
          ,
          <article-title>Estes Park</article-title>
          , CO, USA, June 11-13,
          <year>2008</year>
          , Proceedings. pp.
          <fpage>1</fpage>
          -
          <lpage>10</lpage>
          . ACM (
          <year>2008</year>
          ). https://doi.org/10.1145/1377836.1377838
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Ganter</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wille</surname>
          </string-name>
          , R.:
          <source>Formal Concept Analysis Mathematical Foundations</source>
          . Springer-Verlag, Berlin, Heidelberg (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Hashemi</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tann</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reda</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>Approximate logic synthesis using Boolean matrix factorization</article-title>
          .
          <source>In: Approximate Circuits</source>
          , pp.
          <fpage>141</fpage>
          -
          <lpage>154</lpage>
          . Springer (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Ignatov</surname>
            ,
            <given-names>D.I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nenova</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Konstantinova</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Konstantinov</surname>
            ,
            <given-names>A.V.</given-names>
          </string-name>
          :
          <article-title>Boolean matrix factorisation for collaborative filtering: An fca-based approach</article-title>
          . In: Agre,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Hitzler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Krisnadhi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.A.</given-names>
            ,
            <surname>Kuznetsov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.O</given-names>
            . (eds.) Artificial Intelligence: Methodology,
            <surname>Systems</surname>
          </string-name>
          , and Applications - 16th International Conference, AIMSA 2014, Varna, Bulgaria,
          <source>September 11-13</source>
          ,
          <year>2014</year>
          .
          <source>Proceedings. Lecture Notes in Computer Science</source>
          , vol.
          <volume>8722</volume>
          , pp.
          <fpage>47</fpage>
          -
          <lpage>58</lpage>
          . Springer (
          <year>2014</year>
          ). https://doi.org/10.1007/978-3-
          <fpage>319</fpage>
          - 10554-3 5, https://doi.org/10.1007/978-3-
          <fpage>319</fpage>
          -10554-3 5
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Kocayusufoglu</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hoang</surname>
            ,
            <given-names>M.X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Singh</surname>
            ,
            <given-names>A.K.</given-names>
          </string-name>
          :
          <article-title>Summarizing network processes with network-constrained Boolean matrix factorization</article-title>
          .
          <source>In: 2018 IEEE International Conference on Data Mining (ICDM)</source>
          . pp.
          <fpage>237</fpage>
          -
          <lpage>246</lpage>
          . IEEE (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Lichman</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>UCI machine learning repository (</article-title>
          <year>2013</year>
          ), http://archive.ics.uci.edu/ml
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Lucchese</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Orlando</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Perego</surname>
          </string-name>
          , R.:
          <article-title>Mining top-k patterns from binary datasets in presence of noise</article-title>
          .
          <source>In: Proceedings of the SIAM International Conference on Data Mining, SDM</source>
          <year>2010</year>
          , April 29 - May 1,
          <year>2010</year>
          , Columbus, Ohio, USA. pp.
          <fpage>165</fpage>
          -
          <lpage>176</lpage>
          . SIAM (
          <year>2010</year>
          ). https://doi.org/10.1137/1.9781611972801.15
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Miettinen</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , Mielika¨inen, T.,
          <string-name>
            <surname>Gionis</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Das</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mannila</surname>
          </string-name>
          , H.:
          <article-title>The discrete basis problem</article-title>
          .
          <source>IEEE Trans. Knowl. Data Eng</source>
          .
          <volume>20</volume>
          (
          <issue>10</issue>
          ),
          <fpage>1348</fpage>
          -
          <lpage>1362</lpage>
          (
          <year>2008</year>
          ). https://doi.org/10.1109/TKDE.
          <year>2008</year>
          .53
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Nau</surname>
            ,
            <given-names>D.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Markowsky</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Woodbury</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Amos</surname>
          </string-name>
          , D.B.:
          <article-title>A mathematical analysis of human leukocyte antigen serology</article-title>
          .
          <source>Mathematical Biosciences</source>
          <volume>40</volume>
          (
          <issue>3-4</issue>
          ),
          <fpage>243</fpage>
          -
          <lpage>270</lpage>
          (
          <year>1978</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Royce</surname>
            ,
            <given-names>J.R.</given-names>
          </string-name>
          :
          <article-title>Factors as theoretical constructs</article-title>
          .
          <source>American Psychologist</source>
          <volume>18</volume>
          (
          <issue>8</issue>
          ),
          <volume>522</volume>
          (
          <year>1963</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Spearman</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          : ”
          <article-title>General intelligence,” objectively determined and measured</article-title>
          .
          <source>The American Journal of Psychology</source>
          <volume>15</volume>
          (
          <issue>2</issue>
          ),
          <fpage>201</fpage>
          -
          <lpage>292</lpage>
          (
          <year>1904</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Thurstone</surname>
            ,
            <given-names>L.L.</given-names>
          </string-name>
          :
          <article-title>Multiple-factor analysis; a development and expansion of the vectors of mind</article-title>
          . (
          <year>1947</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Trnecka</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Trneckova</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Can we measure the interpretability of factors?</article-title>
          <source>In: Proceedings of the 14th International Conference on Concept Lattices and Their Applications</source>
          . pp.
          <fpage>179</fpage>
          -
          <lpage>190</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Xiang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jin</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fuhry</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dragan</surname>
            ,
            <given-names>F.F.</given-names>
          </string-name>
          :
          <article-title>Summarizing transactional databases with overlapped hyperrectangles</article-title>
          .
          <source>Data Min. Knowl. Discov</source>
          .
          <volume>23</volume>
          (
          <issue>2</issue>
          ),
          <fpage>215</fpage>
          -
          <lpage>251</lpage>
          (
          <year>2011</year>
          ). https://doi.org/10.1007/s10618-010-0203-9
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>