<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Time(s) &gt;</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>FCA and pattern structures for mining care tra jectories</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Aleksey Buzmakov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Elias Egho</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nicolas Jay</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sergei O. Kuznetsov</string-name>
          <email>skuznetsov@hse.ru</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Amedeo Napoli</string-name>
          <email>amedeo.napolig@loria.fr</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Chedy Rassi</string-name>
          <email>chedy.raissig@inria.fr</email>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>National Research University Higher School of Economics</institution>
          ,
          <addr-line>Moscow</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>U. de Lorraine)</institution>
          ,
          <addr-line>Vand uvre-les-Nancy</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <volume>1</volume>
      <issue>4</issue>
      <abstract>
        <p>In this paper, we are interested in the analysis of sequential data and we propose an original framework based on Formal Concept Analysis (FCA). For that, we introduce sequential pattern structures, an original speci cation of pattern structures for dealing with sequential data. Pattern structures are used in FCA for dealing with complex data such as intervals or graphs. Here they are adapted to sequences. For that, we introduce a subsumption operation for sequence comparison, based on subsequence matching. Then, a projection, i.e. a kind of data reduction of sequential pattern structures, is suggested in order to increase the e ciency of the approach. Finally, we discuss an application to a dataset including patient trajectories (the motivation of this work), which is a sequential dataset and can be processed with the introduced framework. This research work provides a new and e cient extension of FCA to deal with complex (not binary) data, which can be an alternative to the analysis of sequential datasets.</p>
      </abstract>
      <kwd-group>
        <kwd>formal concept analysis</kwd>
        <kwd>pattern structures</kwd>
        <kwd>sequential pattern structures</kwd>
        <kwd>sequences</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Sequence data is largely present and used in many applications. Consequently,
mining sequential patterns from sequence data has become an important and
crucial data mining task. In the last two decades, the main emphasis has been on
developing e cient mining algorithms and e ective pattern representations [1{5].
However, the problem with traditional sequential pattern mining algorithms (and
generally with all pattern enumeration algorithms) is that they generate a large
number of frequent sequences while few of them are truly relevant. Moreover, in
some particular cases, only sequential patterns of a certain type are of interest
and should be mined rst. Are we able to develop a framework for taking into
account only patterns of required types? In addition, another drawback for these
pattern enumeration algorithms is that they depend on a user selected support
threshold, which is usually hard to be properly set by non-experts. How can one
avoid the setting of support threshold while having optimal pattern analysis?</p>
      <p>
        The above questions can be answered by addressing the problem of analyzing
sequential data with the formal concept analysis framework (FCA), an elegant
mathematical approach to data analysis [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], and pattern structures, an extension
of FCA that handles complex data [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. We explain the usage of projections which
are mathematical functions respecting certain properties and allow to reduce
the computational costs by reducing the volume of resulting patterns. Such a
reduction helps an expert to interpret the extracted sequential patterns and
reduces the famous \pattern ooding".
      </p>
      <p>In this paper, we develop a novel and e cient approach for working with
sequential pattern structures in FCA. The rest of the paper is organized as
follows. Section 1 introduces main de nitions of FCA and pattern structures.
The next section de nes sequential pattern structures. Then, before concluding
the results are presented and discussed in Section 3.
1</p>
    </sec>
    <sec id="sec-2">
      <title>FCA and Pattern Structures</title>
      <p>
        FCA [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] is a mathematical formalism having many applications in data analysis.
Pattern structures is a generalization of FCA for dealing with complex
structures, such as sequences or graphs [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. As it is a generalization it is enough to
introduce only pattern structures.
      </p>
      <p>De nition 1. A pattern structure is a triple (G; (D; u); ), where G is a set of
objects, (D; u) is a complete meet-semilattice of descriptions and : G ! D
maps an object to a description.</p>
      <p>The lattice operation in the semilattice (u) corresponds to the similarity
between two descriptions d1 and d2, i.e. the description which is common between
d1 and d2. Standard FCA can be presented in terms of pattern structures in the
following way. The set of objects G remains, while the semilattice of descriptions
is (}(M ); \), where }(M ) is a powerset of M , and, thus, a description is a set
of attributes, and the similarity operation corresponding to the set intersection,
i.e. the similarity is the set of shared attributes. If x = fa; b; cg and y = fa; c; dg
then x u y = x \ y = fa; cg. The mapping : G ! }(M ) is given by, (g) =
fm 2 M j (g; m) 2 Ig, returning the describing set of attributes.</p>
      <p>The Galois connection for a pattern structure (G; (D; u); ) between the set
of objects and the semilattice of descriptions is de ned as follows:
A := l (g);</p>
      <p>g2A
d := fg 2 G j d v (g)g;
for A</p>
      <p>G
for d 2 D
Given a set of objects A, A returns the description which is common to all
objects in A. And given a description d, d is the set of all objects whose
description subsumes d. The partial order on D (v) is de ned w.r.t. the similarity
operation u: c v d , c u d = c, and c is subsumed by d.</p>
      <p>De nition 2. A pattern concept of a pattern structure (G; (D; u); ) is a pair
(A; d) where A G and d 2 D such that A = d and d = A, A is called the
pattern extent and d is called the pattern intent.</p>
      <p>As in the standard case of FCA, a pattern concept corresponds to the
maximal set of objects A whose description subsumes the description d, while there
is no e 2 D, subsuming d, i.e. d v e, describing every object in A. The set of all
concepts can be partially ordered w.r.t. partial order on extents (dually, intent
patterns, i.e v), within a concept lattice.</p>
      <p>
        It is wroth mentioning, that the size of the concept lattice can be exponential
w.r.t. to the number of objects, and, thus, we need a special ranking method to
select the most interesting concepts for further analysis. Several such techniques
are considered in [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], where it is shown that stability index [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] is more reliable
in noisy data. Thus, we decided to use this index in the current work.
      </p>
      <p>An example of pattern structures is given by Table 1a and described in the
next sections, the corresponding lattice is depicted in Figure 1a.
2
2.1</p>
    </sec>
    <sec id="sec-3">
      <title>Sequential Pattern Structures</title>
      <sec id="sec-3-1">
        <title>An Example of Sequential Data</title>
        <p>Patient Trajectory
p1 hfag ; fc; dg ; fb; ag ; fdgi
p2 hfc; dg ; fb; dg ; fa; dgi
p3 hfc; dg ; fbg ; fag ; fa; dgi
(a) Sequential dataset.</p>
        <p>Subsequences
ss1 hfc; dg ; fbg ; fdgi ss5
ss2 hfdg ; fagi ss6
ss3 hfc; dg ; fbgi ss7
ss4 hfc; dg ; fbg ; fagi</p>
        <p>Subsequences
hfag ; fdgi
hfa; dgi
hfagi
(b) Some common subsequences.</p>
        <p>
          A medical trajectory of a patient is a sequence of hospitalizations, where every
hospitalization is described by a set of medical procedures the patient underwent.
An example of medical trajectories for three patients is given in Table 1a. Patient
p1 had four hospitalizations and during the second hospitalization he underwent
procedures c and d. Patients may have a di erent number of hospitalizations.
Hereafter we use the following notation, di erent sequences are enumerated in
superscript (p1), while elements of a sequence are enumerated in the subscript
(p12 = fc; dg). One important task is to nd the characteristic subsequences of
hospitalizations for patients to optimize hospitalization processes. For example,
we can nd a strange sequence and, thus, motivate the deeper analysis of the
problems behind or we can nd typical sequences that allow us to estimate the
treatment costs for a patient.
A sequence is constituted of elements from an alphabet. The classical
subsequence matching task requires no special properties of the alphabet. Several
generalization of the classical case were made by introducing subsequence
relation based on itemset alphabet [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] or on multidimensional and multilevel
alphabet [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]. Here, we generalize the previous cases, requiring for an alphabet
to form a semilattice (E; uE )1. This generalization allows one to process in a
uni ed way all types of complex sequential data.
        </p>
        <p>De nition 3. A sequence t = ht1; :::; tki is a subsequence of a sequence s =
hs1; :::; sni, denoted t s, i k n and there exist j1; ::jk such that 1 j1 &lt;
j2 &lt; ::: &lt; jk n and for all i 2 f1; 2; :::; kg, ti vE sji .</p>
        <p>With complex sequences and such kind of subsequences the calculation
procedures can be di cult, thus, to simplify the procedure, only \restricted"
subsequences are considered, where only the order of consequent elements is taken
into account, i.e. given a j1 in De nition 3, ji = ji 1 + 1 for all i 2 f2; 3; :::; kg.
Such a restriction makes sens for our data, because a hospitalization is a discrete
event and it is likely that the next hospitalization has a relation with the
previous one, for example, hospitalizations for treating aftere ects of chemotherapy.
Below the word \subsequence" refers to a \restricted" subsequence.</p>
        <p>Based on De nition 3 and on the alphabet (}(P ); \), the sequence ss1 in
Table 1b is a subsequence of p1 because if we set ji = i + 1 (De nition 3) then
ss11 v p21 (fc; dg fc; dg), ss12 v p31 (fbg fb; ag) and ss13 v p41 (fdg fdg).
2.3</p>
      </sec>
      <sec id="sec-3-2">
        <title>Meet-semilattice of Sequences</title>
        <p>
          Using the previous de nitions, we can precisely de ne the sequential pattern
structure. For that, we make an analogy with the pattern structures for graphs [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]
where the meet-semilattice operation u respects subgraph isomorphism. Thus,
we introduce a sequential meet-semilattice respecting subsequence relation. Given
an alphabet lattice (E; uE ), D consists of sets of sequences based on E, such
that if d 2 D contains a sequence s then all subsequences of s should be included
into d, 8s 2 d; @s~ s : s~ 2= d. Similarity operation is the set intersection for two
sets of sequences. Given two patterns d1; d2 2 D, the set intersection operation
ensures that if a sequence s belongs to d1 u d2 then any subsequence of s belongs
to d1 ud2 and thus (d1 ud2) 2 D. As the set intersection operation is idempotent,
commutative and associative, (D; u) is a valid semilattice.
1 In this paper we consider two semilattices, the rst one is on the characters of the
alphabet, (E; uE), and the second one is introduced by pattern structures, (D; u).
        </p>
        <p>However, the set of all possible subsequences can be rather large. Thus, it is
more e cient and representable to keep a pattern d 2 D as a set of all maximal
sequences d~, d~ = fs 2 d j @s 2 d : s sg . Below, every pattern is given only
by the set of maximal sequences. For example, p2 u p3 = ss4; ss6 (see
Tables 1a and 1b), i.e. ss4; ss6 is the set of maximal common subsequences
between fp2g and fp3g, correspondingly ss4; ss6 u p1 = ss3; ss7 . Moreover,
representing a pattern by the set of maximal sequences allows for an e cient
implementation of the intersection \u" (see Section 3.1 for more details).</p>
        <p>The sequential pattern structure for the example given by Subsection 2.1
is (G; (D; u); ), where G = p1; p2; p3 , (D; u) is the semilattice of sequential
descriptions, and is the mapping associating an object in G to a description in
D shown in Table 1a. Figure 1a shows the resulting lattice of sequential pattern
concepts for this particular pattern structure (G; (D; u); ).
2.4</p>
      </sec>
      <sec id="sec-3-3">
        <title>Projections of Sequential Pattern Structures</title>
        <p>
          Pattern structures can be hard to process due to the usually large number of
concepts in the concept lattice and the complexity of the involved descriptions
and the similarity operation. Moreover, a given pattern structure can produce
a lattice with a lot of patterns which are not interesting for an expert. Can we
save computational time by deleting some unnecessary patterns? Projections of
pattern structures \simplify" to some degree the computation and allow one
to work with a reduced description. In fact, projections can be considered as
constraints (or lters) on patterns respecting certain mathematical properties.
These mathematical properties guarantee that the projection of a lattice is a
lattice where projected concepts have certain correspondence to original ones.
We introduce projections on sequential patterns, adapting them from [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ].
        </p>
        <p>
          A projection : D ! D is an operator, which is monotone (x v y )
(x) v (y)), contractive ( (x) v x) and idempotent ( ( (x)) = (x)). A
projection preserves the semilattice operation u as follows. Under a projection
, the pattern structure (G; (D; u); ) becomes the projected pattern structure
((G; (D; u); )) = (G; (D ; u ); ), where D = (D) = fd 2 D j 9d 2
D : (d ) = dg and 8x; y 2 D; x u y := (x u y). The concepts of a projected
pattern structure have a \similar" concept in the initial pattern structure [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ].
        </p>
        <p>One possible projection for sequential pattern structures comes from the
following observation. In many cases it can be more interesting to analyze quite
long common subsequences rather than small ones. For example, if we prefer
common subsequences of length &gt; 2, then between p1 and p2 in Table 1a there
is only one maximal common subsequence, ss1 in Table 1b, while ss2 in Table 1b
is too short to be considered as a common subsequence. Such kind of projections
we call Minimal Length Projection (MLP) and depends on the parameter l,
the minimal allowed length of the sequences in a pattern. The projected pattern
concept lattice for MLP 3 is shown in Figure 1b. In the experimentation section
we compare MLP projections with di erent value of the parameter. Projections
are very useful, as they reduce the computational costs in a meaningful manner.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Sequential Pattern Structure Evaluation</title>
      <sec id="sec-4-1">
        <title>3.1 Implementation</title>
        <p>
          Nearly all state-of-the-art FCA algorithms can be adapted to process pattern
structures. We adapted AddIntent algorithm [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ], as the lattice structure is
important for us to calculate stability (see the algorithm for calculating stability
in [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]). To compute the semilattice operation (u, v) between two sets of
sequences S = fs1; :::sng and T = ft1; :::; tmg, S u T is calculated according to
Section 2.3, i.e. maximal sequences among all maximal common subsequences
for any pair of si and tj. To nd all common subsequences of two sequences,
the following observations is useful. If ss = hss1; :::; ssli is a subsequence of
s = hs1; :::; sni with jis = ks+i (De nition 3: ks is the index di erence from which
ss is a subsequence of s) and a subsequence of t = ht1; :::; tmi with jit = kt + i
(likewise), then for any index i 2 f1; 2; :::; lg, ssi vE (sjis u tjit ). Thus, to nd
maximal common subsequences between s and t, we, rst, align s and t in all
possible ways, and then we compute the resulting intersections and keep only
the maximal ones. Let us consider two possible alignments of s1 and s2:
s1 = hfag ; fc; dg ; fb; ag; fdg i
s2 = h fc; dg;fb; dg ; fa; dgi
ssl = h ; ; fdg i
s1 = hfag ; fc; dg;fb; ag; fdg i
s2 = h fc; dg;fb; dg;fa; dg i
ssr = h fc; dg; fbg ; fdg i
The left intersection ssl is smaller than ssr and, thus, is not kept.
3.2
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>Experiments and Discussion</title>
        <p>The experiments are carried out on an \Intel(R) Core(TM) i7-2600 CPU @
3.40GHz" computer with 8Gb of memory under the Ubuntu 12.04 operating
system. The algorithms are not parallelized and are coded in C++.</p>
        <p>First, the public available dataset from UCI repository on anonymous web
data is used as a benchmark data set for scalability tests. This dataset contains
around 106 transactions, and each transaction is a sequence based on \simple"
alphabet, i.e. with no order on the elements. The overall time changes from
37279 seconds for the sequences of length M LP 5 upto 97042 seconds for the
sequences of length M LP 3. For more details see the web-page.2</p>
        <p>
          Our use-case data set comes from a French healthcare system [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. In the
experiment, 1000 patients are analyzed. The dataset describes a patient as a
sequence of hospitalizations without any timestamps. A hospitalization for a
patient is a tuple with the hospital name, the cause for the hospitalization and
the set of procedures the patient underwent. All the eld of a hospitalization are
joined into one single set, which describes one element of a sequence.
        </p>
        <p>Table 2 shows the nal times and the lattice sizes for di erent projections.
The calculation time with projections changes from 3120 to 4510 seconds
depending on the projection parameter. The lattice sizes for the di erent projections
2 http://www.loria.fr/~abuzmako/FCA4AI2013/experiment-uci.html
changes from 105 to 5 105. Notice that the ratio for the lattice size is of 5 while
the ratio for computation time is 1:5. Deletion of short sequences can
dramatically change the lattice size, since the shorter a sequence is, the more probable it
is a subsequence of a patient trajectory, but the computation of the semilattice
operation is easily processed with shorter sequences. The calculation without
projection takes a lot of time with relatively small lattice size (40 times more for
the runtime and 6 times more for the lattice size w.r.t the projection l = 2). The
reason for that is memory swapping to the hard disk. Thus, the best projection
for that dataset is l = 2 as its computation time is reasonable and it preserves
the most of the information among the other projections.</p>
        <p>
          Table 3 shows some interesting concept intents and the sizes of the
corresponding extents. The concept are selected among the most stable ones [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. The
concept c1 corresponds to 28% of the patients having at least 8 consequent
hospitalizations because of chemotherapy. Now we can estimate the minimal cost of
the overall procedures for a patient (we know the price of chemotherapy and we
know the expected number of procedures). Concept c2 covers 7% of the patients,
and its intent is more interesting: 12 hospitalizations for chemotherapy
following the hospitalization for adjustment of a chemotherapy device and an artery
catheter installation. Both concepts c1 and c2 can be found within any of the
considered projections (with l 2 f2; 3; ::; 6g), while the concept c3 can be found
only within the most speci c projection (l = 2). The concept c3 covers 19% of
the patients and describes patients with at least two consequent hospitalizations
accompanied with a chest radiography. In this way, we have a kind of control of
the trajectory quality, because we could have an under-examination of a patient
during the rst hospitalization.
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>In this paper, we present an approach for analyzing sequential datasets within
the framework of pattern structures, an extension of FCA dealing with complex
(non binary) data. Using pattern structures leads to the construction of a pattern
concept lattice, which does not require the setting of a support threshold, as
usually needed in sequential pattern mining. Another point worth to be noticed
is that the use of projections gives a lot of exibility especially for mining and
interpreting special kinds of patterns. The framework is tested on a real-life
dataset recording patient trajectories in a healthcare system. Interesting patterns
are extracted and then interpreted, showing the feasibility and usefulness of
the approach. For the future work, it is important to more deeply investigate
projections, their potentialities w.r.t. the types of patterns.</p>
      <p>Acknowledgements: this research received funding from the Basic Research
Program at the National Research University Higher School of Economics
(Russia) and from the BioIntelligence project (France).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1. Han,
          <string-name>
            <given-names>J</given-names>
            .,
            <surname>Pei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Mortazavi-Asl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            ,
            <surname>Dayal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>U.</given-names>
            ,
            <surname>Hsu</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.:</surname>
          </string-name>
          <article-title>FreeSpan: frequent pattern-projected sequential pattern mining</article-title>
          .
          <source>In: Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining</source>
          . (
          <year>2000</year>
          )
          <volume>355</volume>
          {
          <fpage>359</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Pei</surname>
            ,
            <given-names>J</given-names>
            ., Han, J
          </string-name>
          .,
          <string-name>
            <surname>Mortazavi-Asl</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pinto</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dayal</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hsu</surname>
            ,
            <given-names>M.C.</given-names>
          </string-name>
          :
          <string-name>
            <surname>Pre xSpan Mining Sequential Patterns E ciently by</surname>
          </string-name>
          <article-title>Pre x Projected Pattern Growth</article-title>
          .
          <source>In: 17th International Conference on Data Engineering</source>
          . (
          <year>2001</year>
          )
          <volume>215</volume>
          {
          <fpage>226</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Yan</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          , Han,
          <string-name>
            <given-names>J</given-names>
            .,
            <surname>Afshar</surname>
          </string-name>
          , R.:
          <article-title>CloSpan: Mining Closed Sequential Patterns in Large Databases</article-title>
          . In: In SDM. (
          <year>2003</year>
          )
          <volume>166</volume>
          {
          <fpage>177</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Ding</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lo</surname>
            , D., Han,
            <given-names>J</given-names>
          </string-name>
          .,
          <string-name>
            <surname>Khoo</surname>
          </string-name>
          , S.C.:
          <article-title>E cient Mining of Closed Repetitive Gapped Subsequences from a Sequence Database</article-title>
          .
          <source>In: 2009 IEEE 25th International Conference on Data Engineering</source>
          , IEEE (March
          <year>2009</year>
          )
          <volume>1024</volume>
          {
          <fpage>1035</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5. Rassi,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Calders</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Poncelet</surname>
          </string-name>
          ,
          <string-name>
            <surname>P.</surname>
          </string-name>
          :
          <article-title>Mining conjunctive sequential patterns</article-title>
          .
          <source>Data Min. Knowl. Discov</source>
          .
          <volume>17</volume>
          (
          <issue>1</issue>
          ) (
          <year>2008</year>
          )
          <volume>77</volume>
          {
          <fpage>93</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Ganter</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wille</surname>
          </string-name>
          , R.:
          <source>Formal Concept Analysis: Mathematical Foundations. 1st edn</source>
          . Springer, Secaucus, NJ, USA (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Ganter</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kuznetsov</surname>
            ,
            <given-names>S.O.</given-names>
          </string-name>
          :
          <article-title>Pattern Structures and Their Projections</article-title>
          . In Delugach, H.,
          <string-name>
            <surname>Stumme</surname>
          </string-name>
          , G., eds.:
          <source>Conceptual Structures: Broadening the Base SE - 10. Volume 2120 of LNCS</source>
          . Springer Berlin Heidelberg (
          <year>2001</year>
          )
          <volume>129</volume>
          {
          <fpage>142</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Klimushkin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Obiedkov</surname>
            ,
            <given-names>S.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roth</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Approaches to the Selection of Relevant Concepts in the Case of Noisy Data</article-title>
          .
          <source>In: Proceedings of the 8th international conference on Formal Concept Analysis. ICFCA'10</source>
          ,
          <string-name>
            <surname>Springer</surname>
          </string-name>
          (
          <year>2010</year>
          )
          <volume>255</volume>
          {
          <fpage>266</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Kuznetsov</surname>
            ,
            <given-names>S.O.</given-names>
          </string-name>
          :
          <article-title>On stability of a formal concept</article-title>
          .
          <source>Annals of Mathematics and Arti cial Intelligence</source>
          <volume>49</volume>
          (
          <issue>1-4</issue>
          ) (
          <year>2007</year>
          )
          <volume>101</volume>
          {
          <fpage>115</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Casas-Garriga</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>Summarizing Sequential Data with Closed Partial Orders</article-title>
          . In: SDM. (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Plantevit</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Laurent</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Laurent</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Teisseire</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Choong</surname>
            ,
            <given-names>Y.W.</given-names>
          </string-name>
          :
          <article-title>Mining multidimensional and multilevel sequential patterns</article-title>
          .
          <source>ACM Transactions on Knowledge Discovery from Data</source>
          <volume>4</volume>
          (
          <issue>1</issue>
          ) (
          <year>January 2010</year>
          )
          <volume>1</volume>
          {
          <fpage>37</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Kuznetsov</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Learning of Simple Conceptual Graphs from Positive and Negative Examples</article-title>
          . Volume
          <volume>1704</volume>
          <source>of LNCS</source>
          . Springer (
          <year>1999</year>
          )
          <volume>384</volume>
          {
          <fpage>391</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Merwe</surname>
            ,
            <given-names>D.V.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Obiedkov</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kourie</surname>
            ,
            <given-names>D.:</given-names>
          </string-name>
          <article-title>AddIntent: A new incremental algorithm for constructing concept lattices</article-title>
          . In Goos, G.,
          <string-name>
            <surname>Hartmanis</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leeuwen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eklund</surname>
          </string-name>
          , P., eds.: Concept Lattices. Volume
          <volume>2961</volume>
          . Springer (
          <year>2004</year>
          )
          <volume>372</volume>
          {
          <fpage>385</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Roth</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Obiedkov</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kourie</surname>
            ,
            <given-names>D.G.</given-names>
          </string-name>
          :
          <article-title>On succinct representation of knowledge community taxonomies with formal concept analysis A Formal Concept Analysis Approach in Applied Epistemology</article-title>
          .
          <source>International Journal of Foundations of Computer Science</source>
          <volume>19</volume>
          (
          <issue>02</issue>
          ) (
          <year>April 2008</year>
          )
          <volume>383</volume>
          {
          <fpage>404</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Fetter</surname>
            ,
            <given-names>R.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shin</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Freeman</surname>
            ,
            <given-names>J.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Averill</surname>
            ,
            <given-names>R.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thompson</surname>
            ,
            <given-names>J.D.</given-names>
          </string-name>
          :
          <article-title>Case mix de nition by diagnosis-related groups</article-title>
          .
          <source>Med Care</source>
          <volume>18</volume>
          (
          <issue>2</issue>
          ) (
          <year>February 1980</year>
          )
          <volume>1</volume>
          {
          <fpage>53</fpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>