<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>An Application of Formal Concept Analysis to Neural Decoding</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Dominik Endres</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Peter F¨oldi´ak</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Uta Priss</string-name>
          <email>u.priss@napier.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>School of Computing, Napier University</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>School of Psychology, University of St. Andrews</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2008</year>
      </pub-date>
      <fpage>181</fpage>
      <lpage>192</lpage>
      <abstract>
        <p>This paper proposes a novel application of Formal Concept Analysis (FCA) to neural decoding: the semantic relationships between the neural representations of large sets of stimuli are explored using concept lattices. In particular, the effects of neural code sparsity are modelled using the lattices. An exact Bayesian approach is employed to construct the formal context needed by FCA. This method is explained using an example of neurophysiological data from the high-level visual cortical area STSa. Prominent features of the resulting concept lattices are discussed, including indications for a product-of-experts code in real neurons.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Mammalian brains consist of billions of neurons, each capable of independent
electrical activity. From an information-theoretic perspective, the patterns of
activation of these neurons can be understood as the codewords comprising the
neural code. The neural code describes which pattern of activity corresponds
to what information item. We are interested in the (high-level) visual system,
where such items may indicate the presence of a stimulus object or the value of
some stimulus attribute, assuming that each time this item is represented the
neural activity pattern will be the same or at least similar. Neural decoding is
the attempt to reconstruct the stimulus from the observed pattern of activation
in a given population of neurons [
        <xref ref-type="bibr" rid="ref1 ref2 ref3 ref4">1,2,3,4</xref>
        ]. Popular decoding quality measures,
such as Fisher’s linear discriminant [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] or mutual information [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] capture how
accurately a stimulus can be determined from a neural activity pattern (e.g.,
[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]). While these measures are certainly useful, they provide little information
about the structure of the neural code, which is what we are concerned with
here. Furthermore, we would also like to elucidate how this structure relates to
the represented information items, i.e. we are interested in the semantic aspects
of the neural code.
      </p>
      <p>
        To explore the relationship between the representations of related items,
F¨oldi´ak [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] demonstrated that a sparse neural code can be interpreted as a graph
(a kind of ”semantic net”). Each codeword can then be represented as a set of
active units (a subset of all units). The codewords can now be partially ordered
under set inclusion: codeword A ≤ codeword B iff the set of active neurons
of A is a subset of the active neurons of B. This ordering relation is capable
of capturing semantic relationships between the represented information items.
There is a duality between the information items and the sets representing them:
a more general class corresponds to a smaller subset of active neurons, and more
specific items are represented by larger sets [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Additionally, storing codewords
as sets is especially efficient for sparse codes, i.e. codes with a low activity ratio
(i.e. few active units in each codeword). These findings by Foldiak [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] did not
employ the terminology and tools of Formal Concept Analysis (FCA) [
        <xref ref-type="bibr" rid="ref8 ref9">8,9</xref>
        ]. But
because this duality is a Galois connection, it is of interest to apply FCA to
such data. The resulting concept lattices are an interesting representation of the
relationships implicit in the code.
      </p>
      <p>
        We would also like to be able to represent how the relationship between
sets of active neurons translates into the corresponding relationship between
the encoded stimuli. In our application, the stimuli are the formal objects, and
the neurons are the formal attributes. The FCA approach exploits the duality
of extensional and intensional descriptions and allows to visually explore the
data in lattice diagrams. FCA has shown to be useful for data exploration and
knowledge discovery in numerous applications in a variety of fields [
        <xref ref-type="bibr" rid="ref10 ref11">10,11</xref>
        ].
      </p>
      <p>
        This paper does not include an introduction to FCA because FCA is well
described in the literature (e.g., [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]). We use the phrase reduced labelling to refer
to line diagrams of concept lattices which have labels only attached to object
concepts and attribute concepts. As a reminder, an object concept is the smallest
(w.r.t. the conceptual ordering in a concept lattice) concept of whose extent the
object is a member. Analogously, an attribute concept is the largest concept of
whose intent the attribute is a member. Full labelling refers to line diagrams of
concept lattices where concepts are depicted with their full extent and intent.
      </p>
      <p>We provide more details on sparse coding in section 2 and demonstrate how
the sparseness (or denseness) of the neural code affects the structure of the
concept lattice in section 3. Section 4 describes the generative classifier model
which we use to build the formal context from the responses of neurons in the
high-level visual cortex of monkeys. Finally, we discuss the concept lattices so
obtained in section 5.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Sparse coding</title>
      <p>
        One feature of neural codes which has attracted a considerable amount of interest
is its sparseness. As detailed in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], sparse coding is to be distinguished from
local and dense distributed coding. At one extreme of low average activity ratio
are local codes, in which each item is represented by a separate neuron or a small
set of neurons. This way there is no overlap between the representations of any
two items in the sense that no neuron takes part in the representation of more
than one item. An analogy might be the coding of characters on a computer
keyboard (without the Shift and Control keys), where each key encodes a single
character. It should be noted that locality of coding does not necessarily imply
that only one neuron encodes an item, it only says that the neurons are highly
selective, corresponding to single significant items of the environment (e.g. a
“grandmother cell” - a hypothetical neuron that has the exact and only purpose
to be activated when someone sees, hears or thinks about their grandmother).
      </p>
      <p>The other extreme (approximate average activity ratio of 0.5) corresponds
to dense, or holographic coding. Here, an information item is represented by the
combination of activities of all neurons. For N binary neurons this implies a
representational capacity of 2N . Given the billions of neurons in a human brain,
2N is beyond astronomical. As the number of neurons in the brain (or even just in
a single cortical area, such as primary visual cortex) is substantially higher than
the number of receptor cells (e.g. in the retina), the representational capacity
of a dense code in the brain is much greater than what we can experience in a
lifetime (the factor of the number of moments in a lifetime adds the requirement
of only about 40 extra neurons). Therefore the greatest part of this capacity is
redundant.</p>
      <p>
        Sparse codes (small average activity ratio) are a favourable compromise
between dense and local codes ([
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]). The small representational capacity of local
codes can be remedied with a modest fraction of active units per pattern
because representational capacity grows exponentially with average activity ratio
(for small average activity ratios). Thus, distinct items are much less likely to
interfere when represented simultaneously. Furthermore, it is much more likely
that a single layer network can learn to generate a target output if the input
has a sparse representation. This is due to the higher proportion of mappings
being implementable by a linear discriminant function. Learning in single layer
networks is therefore simpler, faster and substantially more plausible in terms of
biological implementation. By controlling sparseness, the amount of redundancy
necessary for fault tolerance can be chosen. With the right choice of code, a
relatively small amount of redundancy can lead to highly fault-tolerant decoding.
For instance, the failure of a small number of units may not make the
representation undecodable. Moreover, a sparse distributed code can represent values at
higher accuracy than a local code. Such distributed coding is often referred to
as coarse coding.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Concept lattices of local, sparse and dense codes</title>
      <p>
        In the case of a binary neural code, the sparseness of a codeword is inversely
related to the fraction of active neurons. The average sparseness across all
codewords is the sparseness of the code [
        <xref ref-type="bibr" rid="ref12 ref13">13,12</xref>
        ]. Sparse codes, i.e. codes where this
fraction is low, are found interesting for a variety of reasons: they offer a good
compromise between encoding capacity, ease of decoding and robustness [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ];
they seem to be employed in the mammalian visual processing system [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]; and
they are well suited to representing the visual environment we live in [
        <xref ref-type="bibr" rid="ref16 ref17">16,17</xref>
        ]. It is
also possible to define sparseness for graded or even continuous-valued responses
(see e.g. [
        <xref ref-type="bibr" rid="ref12 ref18 ref4">18,4,12</xref>
        ]). To study what structural effects different levels of
sparseness would have on a neural code, we generated random codes, i.e. each of 10
stimuli was associated with randomly drawn responses of 10 neurons, subject to
the constraints that the code be perfectly decodable and that the sparseness of
each codeword was equal to the sparseness of the code. Fig.1 shows the contexts
(represented as cross-tables) and the concept lattices of a local code (activity
ratio 0.1), a sparse code (activity ratio 0.2) and a dense code (activity ratio 0.5).
In a local code, the response patters to different stimuli have no overlapping
activations, hence the lattice representing this code is an anti-chain with top and
bottom element added. Each concept in the anti-chain introduces (at least) one
stimulus and (at least) one neuron. In contrast, a dense code results in a larger
number of concepts which introduce neither a stimulus nor a neuron. The lattice
of the dense code also contains substantially longer chains between the top and
bottom nodes than in the case of sparse and local codes.
      </p>
      <p>The most obvious differences between the lattices is the total number of
concepts. A dense code, even for a small number of stimuli, will give rise to a large
number of concepts, because the neuron sets representing the stimuli are very
probably going to have non-empty intersections. These intersections are
potentially the intents of concepts which are larger than those concepts that introduce
the stimuli. Hence, the latter are found towards the bottom of the lattice. This
implies that they have large intents, which is of course a consequence of the
density of the code. Determining these intents thus requires the observation of a
large number of neurons, which is unappealing from a decoding perspective. The
local code does not have this drawback, but is hampered by a small encoding
capacity (maximal number of concepts with non-empty extents): the concept
lattice in fig.1 is the largest one which can be constructed for a local code
comprised of 10 binary neurons. Which of the above structures is most appropriate
depends on the conceptual structure of the environment to be encoded and the
appropriate sparseness that can be selected.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Building a formal context from responses of high-level visual neurons</title>
      <p>
        To explore whether FCA is a suitable tool for interpreting real neural codes,
we constructed formal contexts from the responses of high-level visual cortical
cells in area STSa (part of the temporal lobe) of monkeys. Characterising the
responses of these cells is a difficult task. They exhibit complex nonlinearities and
invariances which make it impossible to apply linear techniques, such as reverse
correlation [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ], that were shown to be useful in understanding the responses
of neurons in early visual areas [
        <xref ref-type="bibr" rid="ref20 ref21">20,21</xref>
        ]. The concept lattices obtained by FCA
might enable us to display and browse these invariances: if the response of a
subset of cells indicates the presence of an invariant feature in a stimulus, then
all stimuli having this feature should form the extent of a concept whose intent
is given by the responding cells.
4.1
      </p>
      <sec id="sec-4-1">
        <title>Physiological data</title>
        <p>
          The data were obtained through [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ], where the experimental details can be
found. Briefly, spike trains were obtained from single neurons within the upper
and lower banks of the superior temporal sulcus (STSa) of an awake and
behaving monkey (Macaca mulatta) via standard extracellular recording techniques
[
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]. During the recordings, the monkey had to perform a fixation task. This
area contains cells which are responsive to faces. Extracellular voltage
fluctuations were measured, and the stereotypical action potentials (i.e. ’spikes’) of the
neuron were detected and their temporal sequence was recorded resulting in a
’spike train’. These spike trains were turned into distinct samples, each of which
contained the spikes from −300 ms before to 600 ms after the stimulus onset
with a temporal resolution of 1 ms. The stimulus set consisted of 1704 images,
containing colour and black and white views of human and monkey head and
body, animals, fruits, natural outdoor scenes, abstract drawings and cartoons.
Stimuli were presented for 55 ms each without inter-stimulus gaps in random
sequences. While this rapid serial visual presentation (RSVP) paradigm
complicates the task of extracting stimulus-related information from the spike trains,
it has the advantage of allowing for the testing of a large number of stimuli. A
given cell was tested on a subset of 600 or 1200 of these stimuli, each stimulus
was presented between 1-15 times.
        </p>
        <p>
          The data were previously analysed with respect to the stimulus selectivity of
individual cells only. Previous neural population decoding studies were aimed at
identifying stimulus lables (e.g. [
          <xref ref-type="bibr" rid="ref2 ref3">2,3</xref>
          ]) only. This paper presents the first analysis
of the semantic structure of neural data with FCA.
4.2
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>Bayesian thresholding</title>
        <p>In order to apply FCA, we extracted a binary attribute from the raw spike
trains. We could use many-valued attributes to describe the neural response,
but we will employ a simple binary thresholding as a starting point. This binary
attribute should be as informative about the stimulus as possible, to allow for
the construction of meaningful concepts. We do this by Bayesian thresholding,
as detailed below. This procedure also avails us of a null hypothesis H0 =”the
responses contain no information about the stimuli”.</p>
        <p>
          A standard way of obtaining binary responses from neurons is thresholding
the spike count within a certain time window. This is a relatively straightforward
task, if the stimuli are presented well separated in time and a large number of
trials per stimulus are available. Then latencies and response offsets are often
clearly discernible and thus choosing the time window is not too difficult.
However, under RSVP conditions with few trial per stimulus, response separation
becomes more tricky, as the responses to subsequent stimuli will tend to follow
each other without an intermediate return to baseline activity. Moreover, neural
responses tend to be rather noisy. We will therefore employ a simplified version
of the generative Bayesian Bin classification algorithm (BBCa) [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ], which was
shown to perform well on RSVP data [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ].
        </p>
        <p>
          BBCa was designed for the purpose of inferring stimulus labels g from a
continuous-valued, scalar measure z of a neural response. The range of z is
divided into a number of contiguous bins. Within each bin, the observation model
for the g is a Bernoulli scheme with G types and with a Dirichlet prior over its
parameters. It is shown in [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ] that one can iterate/integrate over all possible
bin boundary configurations efficiently, thus making exact Bayesian inference
feasible. Moreover, the marginal likelihood (or model evidence) becomes thus
available, which can be used to infer the posterior distribution over all spike
counting windows. We make two simplifications to BBCa: 1) z is discrete, because
we are counting spikes and 2) we use models with only 1 bin boundary Z0 in the
range r of z, i.e.
        </p>
        <p>P (g = li|z = zi) =
pli if zi ≤ Z0
qli otherwise
(1)</p>
        <p>X pg = 1 ,</p>
        <p>g
p(p0, . . . , pG) =
p(q0, . . . , qG) =
p(Z0) =
1
|r|
.</p>
        <p>X qg = 1
g
Γ Pg αg
Qg Γ (αg)
Γ Pg βg
Qg Γ (βg)</p>
        <p>
          Y pgαg−1
Y qgβg−1
g
g
(2)
(3)
(4)
(5)
We have no a priori preferences for any stimulus label, thus we choose ∀g : αg =
βg = 1. Since the Dirichlet priors on the pg and qg are conjugate to the likelihood
of the data (eqn.(1)), the posteriors can be computed in closed form. Further
details of the posterior computation after observing a set of stimulus-response
pairs (li, zi) are analogous to [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ].
        </p>
        <p>The bin membership (higher bin = stimulus has attribute) of a given neural
response can then serve as the binary attribute required for FCA, since BBCa
weighs bin configurations by their classification (i.e. stimulus label decoding)
performance. We proceed in a straight Bayesian fashion: since the bin
membership is the only variable we are interested in, all other parameters (counting
window size and position, class membership probabilities, bin boundaries) are
marginalised. This minimises the risk of spurious results due to ”contrived”
information (i.e. choices of parameters) made at some stage of the inference process.
Afterwards, the probability that the response belongs to the upper bin is
thresholded at a probability of 0.5, i.e. if the probability is larger than 0.5, then there
will be a cross in the context. Instead of this simple binarisation, other methods
of conceptual scaling could be used.</p>
        <p>Since BBCa yields exact model evidences, it can also be used for model
comparison. Running the algorithm with no bin boundaries in the range of z
effectively yields the probability of the data given the ”null hypothesis” H0: z
does not contain any information about g. We can then compare it against the
alternative hypothesis described above (i.e. the information which bin z is in
tells us something about g) to determine whether the cell has responded at all.
4.3</p>
      </sec>
      <sec id="sec-4-3">
        <title>Cell selection</title>
        <p>The experimental data consisted of recordings from 26 cells. To minimise the
risk that the computed neural responses were a result of random fluctuations,
we excluded a cell if 1) H0 was more probable than 10−6 or 2) the posterior
standard deviations of the counting window parameters were larger than 20 ms,
indicating large uncertainties about the response timing. Cells which did not
respond above the threshold included all cells excluded by the above criteria
(except one). Furthermore, since not all cells were tested on all stimuli, we also
had to select tuples of subsets of cells and stimuli such that all cells in a tuple
were tested on all stimuli. Incidentally, this selection can also be accomplished
with FCA, by determining the concepts of a context with gIm =”stimulus g was
tested on cell m” and selecting those with a large number of stimuli × number
of cells. One of these cell and stimulus subset pairs (16 cells, 310 stimuli) was
selected for further exemplary analysis, but the lattices computed from the other
subset pairs displayed similar features.
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Results</title>
      <p>To analyse the neural code, the thresholded neural responses were used to build
stimulus-by-cell-response contexts. We performed FCA on these with
ColibriConcepts1, created stimulus image montages2 and plotted the lattices3. In
these graphs, the images represent the formal objects. The top of the frame
around each concept image contains the concept number and the list of cells in
the intent (which, unfortunately, may be difficult to see in the printed version of
the graphs. Moreover, the list is truncated if more than 6 cells are in the intent.).</p>
      <p>Fig.2 shows a lattice which has an emphasis on ”face” and ”head” concepts.
The concepts introducing human and cartoon faces (i.e. with extents consisting
of general ”face” images) tend to be higher up in the lattice and their intents
tend to be small. In contrast, the lower concepts introduce mostly single monkey
faces (and faces of the monkey’s caregivers), with the bottom concepts having
intents of ≥ 7 cells. We may interpret this as an indication that the neural code
has a higher ”resolution” for faces of conspecifics (and other ”important” faces)
than for faces in general, i.e. other monkeys are represented in greater detail in a
monkey’s brain than humans or cartoons. This feature can be observed in most
lattices we generated. Thus, monkey STSa cells are not just responsive to faces
in general, but to specific subclasses, such as monkey faces, in particular.</p>
      <p>
        Fig.3 shows a subgraph from a lattice with full labelling. Full labelling is
of interest in these applications because viewing the full extent simultaneously
gives an impression of “what this concept is about”. The concepts in the left
half of the graph are face concepts, whereas the extents of the concepts in the
right half also contain a number of non-face stimuli. Most of the latter have
something ”roundish” about them. The bottom concept, being subordinate to
both the ”round” and the ”face” concepts, contains a stimulus with both
characteristics, which points towards a product-of-experts (PoE) encoding [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ]. In
PoE, each ’expert’ can be thought of as an attribute (or attribute combination)
of the represented item. These experts are expected to correspond to meaningful
aspects of the information items. Several examples of this kind can be found in
the other graphs of the complete concept lattices, which cannot be included in
this paper.
1 available at http://code.google.com/p/colibri-concepts/
2 via ImageMagick, available at http://www.imagemagick.org
3 with Graphviz, available at http://www.graphviz.org
We demonstrated the potential usefulness of FCA for the exploration and
interpretation of neural codes. This technique is feasible even for high-level visual
codes, where linear decoding methods [
        <xref ref-type="bibr" rid="ref20 ref21">20,21</xref>
        ] fail, and it provides qualitative
information about the structure of the code which goes beyond stimulus label
decoding [
        <xref ref-type="bibr" rid="ref1 ref2 ref3 ref4">1,2,3,4</xref>
        ]. The semantic structure of neural data has previously been
analysed with tree-based clustering methods [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ]. Imposing a tree structure on
the data may be inappropriate for neural data that reflects a more general
semantic structure, as supported by our results.
      </p>
      <p>
        Clearly, however, our application of FCA for this analysis is still in its
infancy. It would be very interesting to repeat the analysis presented here on data
obtained from simultaneous multi-cell recordings, to elucidate whether the
conceptual structures derived by FCA are used for decoding by real brains. On
a larger scale than single neurons, FCA could also be employed to study the
relationships in fMRI data [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ].
      </p>
      <p>Acknowledgements D. Endres was supported by MRC fellowship G0501319.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Georgopoulos</surname>
            ,
            <given-names>A.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schwartz</surname>
            ,
            <given-names>A.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kettner</surname>
            ,
            <given-names>R.E.</given-names>
          </string-name>
          :
          <article-title>Neuronal population coding of movement direction</article-title>
          .
          <source>Science</source>
          <volume>233</volume>
          (
          <issue>4771</issue>
          ) (
          <year>1986</year>
          )
          <fpage>1416</fpage>
          -
          <lpage>1419</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. Fo¨ldi´ak, P.: The 'Ideal Homunculus'
          <article-title>: statistical inference from neural population responses</article-title>
          . In Eeckmann, F.,
          <string-name>
            <surname>Bower</surname>
            , J., eds.: Computation and
            <given-names>Neural</given-names>
          </string-name>
          <string-name>
            <surname>Systems</surname>
          </string-name>
          . Kluwer Academic Publishers, Norwell, MA (
          <year>1993</year>
          )
          <fpage>55</fpage>
          -
          <lpage>60</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Oram</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>F</surname>
          </string-name>
          ¨oldi´ak,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Perrett</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Sengpiel</surname>
          </string-name>
          ,
          <string-name>
            <surname>F.</surname>
          </string-name>
          :
          <article-title>The 'Ideal Homunculus': decoding neural population signals</article-title>
          .
          <source>Trends In Neurosciences</source>
          <volume>21</volume>
          (
          <year>June 1998</year>
          )
          <fpage>259</fpage>
          -
          <lpage>265</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Quiroga</surname>
            ,
            <given-names>R.Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reddy</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koch</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fried</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Decoding Visual Inputs From Multiple Neurons in the Human Temporal Lobe</article-title>
          .
          <source>J Neurophysiol</source>
          <volume>98</volume>
          (
          <issue>4</issue>
          ) (
          <year>2007</year>
          )
          <fpage>1997</fpage>
          -
          <lpage>2007</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Duda</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hart</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stork</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Pattern classification</article-title>
          . John Wiley &amp; Sons, New York, Chichester (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Cover</surname>
            ,
            <given-names>T.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thomas</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          :
          <article-title>Elements of Information Theory</article-title>
          . John Wiley &amp; Sons, New York (
          <year>1991</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7. Fo¨ldi´ak, P.:
          <article-title>Sparse neural representation for semantic indexing</article-title>
          .
          <source>In: XIII Conference of the European Society of Cognitive Psychology (ESCOP-2003)</source>
          .
          <article-title>(</article-title>
          <year>2003</year>
          ) http://www.st-andrews.ac.uk/∼pf2/escopill2.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Wille</surname>
          </string-name>
          , R.:
          <article-title>Restructuring lattice theory: an approach based on hierarchies of concepts</article-title>
          .
          <source>In Rival</source>
          , I., ed.:
          <article-title>Ordered sets</article-title>
          .
          <source>Reidel</source>
          , Dordrecht-Boston (
          <year>1982</year>
          )
          <fpage>445</fpage>
          -
          <lpage>470</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Ganter</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wille</surname>
          </string-name>
          , R.:
          <source>Formal Concept Analysis: Mathematical foundations</source>
          . Springer (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Ganter</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stumme</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wille</surname>
          </string-name>
          , R., eds.:
          <source>Formal Concept Analysis, Foundations and Applications</source>
          . Volume
          <volume>3626</volume>
          of Lecture Notes in Computer Science. Springer (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Priss</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Formal concept analysis in information science</article-title>
          .
          <source>Annual Review of Information Science and Technology</source>
          <volume>40</volume>
          (
          <year>2006</year>
          )
          <fpage>521</fpage>
          -
          <lpage>543</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12. Fo¨ldi´ak,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Endres</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.</surname>
          </string-name>
          :
          <article-title>Sparse coding</article-title>
          .
          <source>Scholarpedia</source>
          <volume>3</volume>
          (
          <issue>1</issue>
          ) (
          <year>2008</year>
          ) 2984 http://www.scholarpedia.org/article/Sparse coding.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13. Fo¨ldi´ak, P.:
          <article-title>Sparse coding in the primate cortex</article-title>
          . In Arbib, M.A., ed.:
          <article-title>The Handbook of Brain Theory and Neural Networks. second edn</article-title>
          . MIT Press (
          <year>2002</year>
          )
          <fpage>1064</fpage>
          -
          <lpage>1068</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14. Fo¨ldi´ak, P.:
          <article-title>Forming sparse representations by local anti-Hebbian learning</article-title>
          .
          <source>Biological Cybernetics</source>
          <volume>64</volume>
          (
          <year>1990</year>
          )
          <fpage>165</fpage>
          -
          <lpage>170</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Olshausen</surname>
            ,
            <given-names>B.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Field</surname>
            ,
            <given-names>D.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pelah</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Sparse coding with an overcomplete basis set: a strategy employed by V1</article-title>
          .
          <source>Vision Res</source>
          .
          <volume>37</volume>
          (
          <issue>23</issue>
          ) (
          <year>1997</year>
          )
          <fpage>3311</fpage>
          -
          <lpage>3325</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Olshausen</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          : Learning Linear, Sparse,
          <string-name>
            <given-names>Factorial</given-names>
            <surname>Codes</surname>
          </string-name>
          .
          <source>Technical Report AIM 1580</source>
          (
          <year>1996</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Simoncelli</surname>
            ,
            <given-names>E.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Olshausen</surname>
            ,
            <given-names>B.A.</given-names>
          </string-name>
          :
          <article-title>Natural image statistics and neural representation</article-title>
          .
          <source>Annual Review of Neuroscience</source>
          <volume>24</volume>
          (
          <year>2001</year>
          )
          <fpage>1193</fpage>
          -
          <lpage>1216</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Rolls</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Treves</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>The relative advantages of sparse versus distributed encoding for neuronal networks in the brain</article-title>
          .
          <source>Network</source>
          <volume>1</volume>
          (
          <year>1990</year>
          )
          <fpage>407</fpage>
          -
          <lpage>421</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Dayan</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Abbott</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Theoretical Neuroscience</article-title>
          . MIT Press, London, Cambridge (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Jones</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Palmer</surname>
            ,
            <given-names>L.A.</given-names>
          </string-name>
          :
          <article-title>An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex</article-title>
          .
          <source>Journal of Neurophysiology</source>
          <volume>58</volume>
          (
          <issue>6</issue>
          ) (
          <year>1987</year>
          )
          <fpage>1233</fpage>
          -
          <lpage>1258</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Ringach</surname>
            ,
            <given-names>D.L.</given-names>
          </string-name>
          :
          <article-title>Spatial structure and symmetry of simple-cell receptive fields in macaque primary visual cortex</article-title>
          .
          <source>Journal of Neurophysiology</source>
          <volume>88</volume>
          (
          <year>2002</year>
          )
          <fpage>455</fpage>
          -
          <lpage>463</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22. Fo¨ldi´ak,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Xiao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Keysers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Edwards</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Perrett</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.</surname>
          </string-name>
          :
          <article-title>Rapid serial visual presentation for the determination of neural selectivity in area STSa</article-title>
          . Progress in Brain Research (
          <year>2004</year>
          )
          <fpage>107</fpage>
          -
          <lpage>116</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Oram</surname>
            ,
            <given-names>M.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Perrett</surname>
            ,
            <given-names>D.I.</given-names>
          </string-name>
          :
          <article-title>Time course of neural responses discriminating different views of the face and head</article-title>
          .
          <source>Journal of Neurophysiology</source>
          <volume>68</volume>
          (
          <issue>1</issue>
          ) (
          <year>1992</year>
          )
          <fpage>70</fpage>
          -
          <lpage>84</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Endres</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , Fo¨ldi´ak, P.:
          <article-title>Exact Bayesian bin classification: a fast alternative to bayesian classification and its application to neural response analysis</article-title>
          .
          <source>Journal of Computational Neuroscience</source>
          <volume>24</volume>
          (
          <issue>1</issue>
          ) (
          <year>2008</year>
          )
          <fpage>24</fpage>
          -
          <lpage>35</lpage>
          DOI: 10.1007/s10827-007-0039-5.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Endres</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Bayesian and Information-Theoretic Tools for Neuroscience</article-title>
          .
          <source>PhD thesis</source>
          , School of Psychology, University of St. Andrews,
          <string-name>
            <surname>U.K.</surname>
          </string-name>
          (
          <year>2006</year>
          ) http://hdl.handle.net/10023/162.
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Hinton</surname>
          </string-name>
          , G.:
          <article-title>Products of experts</article-title>
          .
          <source>In: Ninth International Conference on Artificial Neural Networks ICANN 99. Number 470 in ICANN</source>
          (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Kiani</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Esteky</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mirpour</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tanaka</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Object category structure in response patterns of neuronal population in monkey inferior temporal cortex</article-title>
          .
          <source>Journal of Neurophysiology</source>
          <volume>97</volume>
          (
          <issue>6</issue>
          ) (
          <year>April 2007</year>
          )
          <fpage>4296</fpage>
          -
          <lpage>4309</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Kay</surname>
            ,
            <given-names>K.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Naselaris</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Prenger</surname>
            ,
            <given-names>R.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gallant</surname>
            ,
            <given-names>J.L.</given-names>
          </string-name>
          :
          <article-title>Identifying natural images from human brain activity</article-title>
          .
          <source>Nature</source>
          <volume>452</volume>
          (
          <year>2008</year>
          )
          <fpage>352</fpage>
          -
          <lpage>255</lpage>
          http://dx.doi.org/10.1038/nature06713.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>