<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Statistical Invariants of Spatial Form: From Local AND to Numerosity</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Christoph ZETZSCHE</string-name>
          <email>zetzsche@informatik.uni-bremen.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Konrad GADZICKI</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tobias KLUTH</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Cognitive Neuroinformatics, University of Bremen</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <fpage>163</fpage>
      <lpage>172</lpage>
      <abstract>
        <p>Theories of the processing and representation of spatial form have to take into account recent results on the importance of holistic properties. Numerous experiments showed the importance of “set properties”, “ensemble representations” and “summary statistics”, ranging from the “gist of a scene” to something like “numerosity”. These results are sometimes difficult to interpret, since we do not exactly know how and on which level they can be computed by the neural machinery of the cortex. According to the standard model of a local-to-global neural hierarchy with a gradual increase of scale and complexity, the ensemble properties have to be regarded as high-level features. But empirical results indicate that many of them are primary perceptual properties and may thus be attributed to earlier processing stages. Here we investigate the prerequisites and the neurobiological plausibility for the computation of ensemble properties. We show that the cortex can easily compute common statistical functions, like a probability distribution function or an autocorrelation function, and that it can also compute abstract invariants, like the number of items in a set. These computations can be performed on fairly early levels and require only two well-accepted properties of cortical neurons, linear summation of afferent inputs and variants of nonlinear cortical gain control.</p>
      </abstract>
      <kwd-group>
        <kwd />
        <kwd>shape invariants</kwd>
        <kwd>peripheral vision</kwd>
        <kwd>ensemble statistics</kwd>
        <kwd>numerosity</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Recent evidence shows that our representation of the world is essentially determined by
holistic properties [
        <xref ref-type="bibr" rid="ref1 ref2 ref3 ref4 ref5 ref6">1,2,3,4,5,6</xref>
        ]. These properties are described as “set properties”,
“ensemble properties”, or they are characterized as “summary statistics”. They reach from
the average orientation of elements in a display [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] over the “gist of a scene”[
        <xref ref-type="bibr" rid="ref7 ref8">7,8</xref>
        ], to the
“numerosity” of objects in a scene [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. For many of these properties we do not exactly
know by which kind of neural mechanisms and on which level of the cortex they are
computed. According to the standard view of the cortical representation of shape, these
properties have to be considered as high-level features because the cortex is organized in
form of a local-to-global processing hierarchy in which features with increasing order of
abstraction are computed in a progression of levels [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. At the bottom, simple and
locally restricted geometrical features are computed, whereas global and complex
properties are represented at the top levels of the hierarchy. Across levels, invariance is
systematically increased such that the final stages are independent of translations, rotations, size
changes, and other transformations of the input. However convincing this view seems on
first sight, it creates some conceptual difficulties.
      </p>
      <p>
        The major difficulty concerns the question of what exactly is a low-level and a
highlevel property. Gestalt theorists already claimed that features considered high-level
according to a structuralistic view are primary and basic in terms of perception. Further
doubts have been raised by global precedence effects [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Similar problems arise with
the recently discovered ensemble properties. The gist of a scene, a high-level feature
according to the classical view, can be recognized in 150 msec [
        <xref ref-type="bibr" rid="ref12 ref13 ref14 ref7">7,12,13,14</xref>
        ] and can be
modeled using low-level visual features [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. In addition, categories can be shown to be
faster processed than basic objects, contrary to the established view of the latter as
entrylevel representations [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. A summary statistics approach, also based on low-level visual
features, can explain the holistic processing properties in the periphery of the visual field
[
        <xref ref-type="bibr" rid="ref16 ref17 ref4">4,16,17</xref>
        ]. What is additionally required in these models are statistical measures, like
probability distributions and autocorrelation functions, from which it is not known how
and on which level of the cortical hierarchy they can be realized.
      </p>
      <p>
        One of the most abstract ensemble properties seems to be the number of elements
in a spatial configuration. However, the ability to recognize this number is not restricted
to humans with mature cognitive abilities but has also been found in infants and animals
[
        <xref ref-type="bibr" rid="ref18 ref9">9,18</xref>
        ], recently even in invertebrates [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. Neural reactions to numerosity are fast (100
msecs in macaques [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]). And finally there is evidence for a “direct visual sense for
number” since number seems to be a primary visual property like color, orientation or
motion, to which the visual system can be adapted by prolonged viewing [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ].
      </p>
      <p>The above observations on ensemble properties raise a number of questions, from
which the following are addressed in this paper: Sect. 1: Can the cortex compute a
probability distribution? Sect. 2: And also an autocorrelation function? By which kind of
neural hardware can this be achieved? Sect.3: Can the shape of individual objects also
be characterized by such mechanisms? Sect. 4: What is necessary to compute such an
abstract property like the number of elements in a spatial configuration? Can this be
achieved in early sensory stages?</p>
    </sec>
    <sec id="sec-2">
      <title>1. Neural Computation of a Probability Distribution</title>
      <p>Formally, the probability density function pe(e) of a random variable e is defined via the
cumulative distribution function: pe(e) , dPe(e) with Pe(e) = Pr[e  e]. Their empirical
de
counterparts, the histogram and the cumulative histogram, are defined by use of indicator
functions. For this we divide the real line into m bins (e(i), e(i+1)] with bin size D e =
e(i+1) e(i). For each bin i, an indicator function is defined as
An illustration of such a function is shown in Fig. 1a. From N samples ek of the
random variable e we then obtain the histogram as h(i) = N1 Â kN=1 Qi(ek). The cumulative
histogram He(e) can be computed by changing the bins to (e(1), e(i+1)] (cf. Fig. 1b), and
by performing the same summation as for the normal histogram. The reverse cumulative
(a)
(b)
(c)
histogram H¯ (i) is simply the reversed version of the cumulative histogram. The
corresponding bins are D ei = (e(i), e(m+1)] and the indicator functions are defined as (Fig. 1c)
Albrecht and Hamilton (1982)
(b)</p>
      <p>How does all this relate to visual cortex? Has the architecture shown in Fig. 2a any
neurobiological plausibility? The final summation stage is no problem since the most
basic capability of neurons is computation of a linear sum of their inputs. But how about
the indicator functions? They have two special properties: First, the indicator functions
come with different sensitivities. An individual function does only generate a non-zero
output if the input e exceeds a certain level, a kind of threshold, which determines the
sensitivity of the element e(i) in Eq. (2) and Fig. 1c. To cover the complete range of
values, different functions with different sensitivities are needed (Fig. 2a). Second, the
indicator functions exhibit a certain independence of the input level. Once the input is
clearly larger than the threshold, the output remains constant (Fig. 1c).</p>
      <p>
        Do we know of neurons which have such properties, a range of different
sensitivities, and a certain independence of the input strength? Indeed, cortical gain control (or
normalization), as first described in early visual cortex (e.g. [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]) but now believed to
exist throughout the brain [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ], yields exactly these properties. Gain-controlled neurons
(Fig. 2b) exhibit a remarkable similarity to the indicator functions used to compute the
reverse cumulative histogram, since they (i) come with different sensitivities, and (ii)
provide an independence of the input strength in certain response ranges.
      </p>
      <p>The computation of a reverse cumulative histogram thus is well in reach of the
cortex. We only have to modify the architecture of Fig. 2a by the smoother response
functions of cortical neurons. The information about a probability distribution available to the
visual cortex is illustrated in Fig. 3. The reconstructed distributions, as estimated from the
neural reverse cumulative histograms, are a kind of Parzen-windowed (lowpass-filtered)
versions of the original distributions.</p>
    </sec>
    <sec id="sec-3">
      <title>2. Neural Implementation of Auto- and Cross-Correlation Functions</title>
      <p>
        A key feature of the recent statistical summary approach to peripheral vision [
        <xref ref-type="bibr" rid="ref16 ref24 ref4 ref6">4,6,24,16</xref>
        ]
is the usage of auto- and cross-correlation functions. These functions are defined as
h(i) =
1 N/2
N k= Â N/2+1
e(k) g(i + k),
      </p>
      <p>
        (4)
where autocorrelation results if e(k) = g(k) and where indicates multiplication. With
respect to their neural computation, the outer summation is no problem, but the
crucial function is the nonlinear multiplicative interaction between two variables. A
neural implementation could make use of the Babylonian trick ab = 14 [(a + b)2 (a b)2]
[
        <xref ref-type="bibr" rid="ref25 ref26 ref27">25,26,27</xref>
        ], but this requires two or more neurons for the computation and thus far there
is neither evidence for such a systematic pairing of neurons nor for actual multiplicative
interactions in the visual cortex. However, exact multiplication is not the key factor: a
reasonable statistical measure merely requires provision of a matching function such that
e(k) and g(i + k) generate a large contribution to the autocorrelation function if they are
similar, and a small contribution if they are dissimilar. For this, it is sufficient to provide
a neural operation which is AND-like [
        <xref ref-type="bibr" rid="ref27 ref28">27,28</xref>
        ]. Surprisingly, such an AND-like operation
can be achieved by the very same neural hardware as used before, the cortical gain
control mechanism, as shown in [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]. Cortical gain control [
        <xref ref-type="bibr" rid="ref22 ref29">22,29</xref>
        ] applied to two different
features si(x, y) and s j(x, y) can be written as
where k = k(i, j), e is a constant which controls the steepness of the response and Q is a
threshold. The resulting nonlinear combination is comparable with an AND-like
operation of two features and causes a substantial nonlinear increase of the neural selectivity,
as illustrated in Fig. 4.
      </p>
      <p>Of course there will be differences between a formal autocorrelation function and
the neurobiological version, but the essential feature, the signaling of good matches in
dependence of the relative shifts will be preserved (Fig. 5).</p>
    </sec>
    <sec id="sec-4">
      <title>3. Figural Properties from Integrals</title>
      <p>We extracted different features sr,q from the image luminance function l = l(x, y) by
applying a Gabor-like filter operation sr,q (x, y) = (l ⇤ F 1(Hr,q ))(x, y) where F 1
denotes the inverse Fourier transformation and the filter kernel Hr,q is defined in the
spectral space. We distinguish two cases (even and odd) which can be seen in the following
definition in polar coordinates:</p>
      <p>Hre,vqen( fr, fq ) :=
( cos2 ⇣ p fr r ⌘ cos2 ⇣ p fq q ⌘</p>
      <p>2 2 fr,h 2 2 fq ,h
0
, ( fr, fq ) 2 W r,q
, else,
with W r,q := {( fr, fq )| fr 2 [r 2 fr,h, r + 2 fr,h] ^ fq 2 [q 2 fq ,h, q + 2 fq ,h] \ [q + p
2 fq ,h, q + p + 2 fq ,h]}, where fr,h denotes the half-bandwidth in radial direction and fq ,h
denotes the half-bandwidth in angular direction. Hro,qdd is defined as the Hilbert
transformed even symmetric filter kernel.</p>
      <p>Various AND combinations of these oriented features (see caption Fig. 6) are
obtained by the gain-control mechanism described in Eq. (5). The integration over the
whole domain results in global features Fk := RR2 gk(x, y) d(x, y) which capture basic
shape properties (Fig. 6).</p>
    </sec>
    <sec id="sec-5">
      <title>4. Numerosity and Topology</title>
      <p>
        One of the most fundamental and abstract ensemble properties is the number of elements
of a set. Recent evidence (see Introduction) raised the question at which cortical level
the underlying computations are performed. In this processing, a high degree of
invariance has to be achieved, since numerosity can be recognized largely independent of other
properties like size, shape and positioning of elements. Models which address this
question in a neurobiologically plausible fashion, starting from individual pixels or neural
receptors instead of an abstract type of input, are rare. To our knowledge, the first approach
in this direction has been made in [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ]. A widely known model [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ] has a shape-invariant
mapping to number which is based on linear DOG filters of different sizes, which
substantially limits the invariance properties. A more recent model is based on unsupervised
learning but has only employed moderate shape variations [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ]. In [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ] we suggested
that the necessary invariance properties may be obtained by use of a theorem which
connects local measurements of the differential geometry of the image surface with global
topological properties [
        <xref ref-type="bibr" rid="ref30 ref33">30,33</xref>
        ]. In the following we will build upon this concept.
      </p>
      <p>The key factor of our approach is a relation between surface properties and a
topological invariant as described by the famous Gauss-Bonnet theorem. In order to apply
this to the image luminance function l = l(x, y) we interpret this function as a surface
S := {(x, y, z) 2 R3|(x, y) 2 W , z = l(x, y)} in three-dimensional real space. We then apply
the formula for the Gaussian curvature</p>
      <p>
        K(x, y) =
lxx(x, y)lyy(x, y) lxy(x, y)2
(1 + lx(x, y)2 + ly(x, y)2)2
,
(6)
(7)
where subscript denotes the differentiation in the respective direction (e.g. lxy = ∂∂x2∂ly ).
The numerator of (6) can also be written as D = l 1l 2 where l 1,2 are the eigenvalues of
the Hessian matrix of the luminance function l(x, y) which represent the partial second
derivatives in the principal directions. The values and signs of the eigenvalues give us
the information about the shape of the luminance surface S in each point, whether it
is elliptic, hyperbolic, parabolic, or planar. Since Gaussian curvature results from the
multiplication of the second derivatives l 1,2 it is zero for the latter two cases. It has been
shown that this measure can be generalized in various ways, in particular towards the use
of neurophysiologically realistic Gabor-like filters instead of the derivatives [
        <xref ref-type="bibr" rid="ref27 ref30">27,30</xref>
        ]. The
crucial point, however, is the need for AND combinations of oriented features [
        <xref ref-type="bibr" rid="ref27 ref30">27,30</xref>
        ]
which can be obtained as before by the neural mechanism of cortical gain control [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ].
      </p>
      <p>The following corollary from the Gauss-Bonnet theorem is the basis for the
invariance properties in the context of numerosity.</p>
      <p>Corollary 4.1 Let S ⇢</p>
      <p>R3 be a closed two-dimensional Riemannian manifold. Then</p>
      <p>Z
S
K dA = 4p (1
g)
where K is the Gaussian curvature and g is the genus of the surface S.</p>
      <p>We consider the special case where the luminance function consists of multiple objects
(polyhedra with orthogonal corners) with constant luminance level. We compare the
surface of this luminance function to the surface of a cuboid with holes that are shaped like
the polyhedra. The trick is that the latter surface has a genus which is determined by the
number of holes in the cuboid and which can be determined by the integration of the
local curvature according to Eq. (7). If we can find the corresponding contributions of
the integral on the image surface, we can use this integral to count the number of
objects. We assume the corners to be locally sufficiently smooth such that the surfaces are
Riemannian manifolds. The Gaussian curvature K then is zero almost everywhere except
on the corners. We hence have to consider only the contributions of the corners. It turns
out that these contributions can be computed from the elliptic regions only if we use
different signs for upwards and downwards oriented elliptic regions. We thus introduce the
following operator which distinguishes the different types of ellipticity in the luminance
function. Let l 1 l 2, then the operator N(x, y) := | min(0, l 1(x, y))| | max(0, l 2(x, y))|
is always zero if the surface is hyperbolic and has a positive sign for positive
ellipticity and a negative one for negative ellipticity. We thus can calculate the numerosity
feature which has the ability of counting objects in an image by counting the holes in an
imaginary cuboid as follows:</p>
      <p>F =</p>
      <p>Z</p>
      <p>N(x, y)
W (1 + lx(x, y)2 + ly(x, y)2) 23 d(x, y).
(8)
The crucial feature of this measure are contributions of fixed size and with appropriate
signs from the corners. The denominator can thus be replaced by a neural gain control
mechanism and an appropriate renormalization. For the implementation here we use a
shortcut which gives us straight access to the eigenvalues. The numerator D(x, y) of (6)
can be rewritten as
D(x, y) = lxxlyy</p>
      <p>(luu lvv)2 =
(9)
with u := x cos(p /4) + y sin(p /4) and v := x sin(p /4) + y cos(p /4). The eigenvalues
then are l 1,2 = 12 (D l ± |e |) and we can directly use them to compute N(x, y). Application
of this computation to a number of test images is shown in Fig. 7.</p>
      <p>50
100
150
200
250
50
100
150
200
250
5 100
0−5 150
−10 200
−15
50 100 150 200 250 −20 250
4.0
rechteckstruktur01c</p>
    </sec>
    <sec id="sec-6">
      <title>5. Conclusion</title>
      <p>
        Recent evidence shows that ensemble properties play an important role in perception and
cognition. In this paper, we have investigated by which neural operations and on which
processing level statistical ensemble properties can be computed by the cortex.
Computation of a probability distribution requires indicator functions with different
sensitivities, and our reinterpretation of cortical gain control suggests that this could be a basic
function of this neural mechanism. The second potential of cortical gain control is the
computation of AND-like feature combinations. Together with the linear summation
capabilities of neurons this enables the computation of powerful invariants and summary
features. We have repeatedly argued that AND-like feature combinations are essential
for our understanding of the visual system [
        <xref ref-type="bibr" rid="ref27 ref28 ref30 ref34 ref35 ref36">27,30,34,35,36,28</xref>
        ]. The increased selectivity
of nonlinear AND operators, as compared to their linear counterparts, is a prerequisite
for the usefulness of integrals over the respective responses [
        <xref ref-type="bibr" rid="ref28 ref30">30,28</xref>
        ]. We have shown that
such integrals of AND features are relevant for the understanding of texture perception
[
        <xref ref-type="bibr" rid="ref37">37</xref>
        ], of numerosity estimation [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ], and of invariance in general [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]. Recently, integrals
over AND-like feature combinations in form of auto- and cross-correlation functions
have been suggested for the understanding of peripheral vision [
        <xref ref-type="bibr" rid="ref16 ref17 ref4">4,16,17</xref>
        ].
      </p>
      <p>
        A somewhat surprising point is that linear summation and cortical gain control, two
widely accepted properties of cortical neurons, are the only requirements for the
computation of ensemble properties. These functions are already available at early stages of
the cortex, but also in other cortical areas [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. The computation of ensemble properties
may thus be an ubiquitous phenomenon in the cortex.
      </p>
      <p>Acknowledgement
This work was supported by DFG, SFB/TR8 Spatial Cognition, project A5-[ActionSpace].</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S. C.</given-names>
            <surname>Dakin</surname>
          </string-name>
          and
          <string-name>
            <given-names>R. J.</given-names>
            <surname>Watt</surname>
          </string-name>
          .
          <article-title>The computation of orientation statistics from visual texture</article-title>
          .
          <source>Vision Res</source>
          ,
          <volume>37</volume>
          (
          <issue>22</issue>
          ):
          <fpage>3181</fpage>
          -
          <lpage>3192</lpage>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>D.</given-names>
            <surname>Ariely</surname>
          </string-name>
          . Seeing Sets:
          <article-title>Representation by Statistical Properties</article-title>
          .
          <source>Psychol Sci</source>
          ,
          <volume>12</volume>
          (
          <issue>2</issue>
          ):
          <fpage>157</fpage>
          -
          <lpage>162</lpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Lin</given-names>
            <surname>Chen</surname>
          </string-name>
          .
          <article-title>The topological approach to perceptual organization</article-title>
          .
          <source>Visual Cognition</source>
          ,
          <volume>12</volume>
          (
          <issue>4</issue>
          ):
          <fpage>553</fpage>
          -
          <lpage>637</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>B.</given-names>
            <surname>Balas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Nakano</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Rosenholtz</surname>
          </string-name>
          .
          <article-title>A summary-statistic representation in peripheral vision explains visual crowding</article-title>
          .
          <source>J Vis</source>
          ,
          <volume>9</volume>
          (
          <issue>12</issue>
          ):
          <fpage>13</fpage>
          .
          <fpage>1</fpage>
          -
          <lpage>18</lpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>G. A.</given-names>
            <surname>Alvarez</surname>
          </string-name>
          .
          <article-title>Representing multiple objects as an ensemble enhances visual cognition</article-title>
          .
          <source>Trends Cog Sci</source>
          ,
          <volume>15</volume>
          (
          <issue>3</issue>
          ):
          <fpage>122</fpage>
          -
          <lpage>31</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>R.</given-names>
            <surname>Rosenholtz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Huang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K.</given-names>
            <surname>Ehinger</surname>
          </string-name>
          .
          <article-title>Rethinking the role of top-down attention in vision: effects attributable to a lossy representation in peripheral vision</article-title>
          .
          <source>Front Psychol</source>
          ,
          <volume>3</volume>
          :
          <fpage>13</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Thorpe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Fize</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Marlot</surname>
          </string-name>
          .
          <article-title>Speed of processing in the human visual system</article-title>
          .
          <source>Nature</source>
          ,
          <volume>381</volume>
          (
          <issue>6582</issue>
          ):
          <fpage>520</fpage>
          -
          <lpage>522</lpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A.</given-names>
            <surname>Oliva</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Torralba</surname>
          </string-name>
          .
          <article-title>Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope</article-title>
          .
          <source>International Journal of Computer Vision</source>
          ,
          <volume>42</volume>
          (
          <issue>3</issue>
          ):
          <fpage>145</fpage>
          -
          <lpage>175</lpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>E. M.</given-names>
            <surname>Brannon</surname>
          </string-name>
          .
          <article-title>The representation of numerical magnitude</article-title>
          .
          <source>Curr Opin Neurobiol</source>
          ,
          <volume>16</volume>
          (
          <issue>2</issue>
          ):
          <fpage>222</fpage>
          -
          <lpage>9</lpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J.</given-names>
            <surname>Hegde</surname>
          </string-name>
          and
          <string-name>
            <given-names>D.J.</given-names>
            <surname>Felleman</surname>
          </string-name>
          .
          <article-title>Reappraising the Functional Implications of the Primate Visual Anatomical Hierarchy</article-title>
          .
          <source>The Neuroscientist</source>
          ,
          <volume>13</volume>
          (
          <issue>5</issue>
          ):
          <fpage>416</fpage>
          -
          <lpage>421</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>D.</given-names>
            <surname>Navon</surname>
          </string-name>
          .
          <article-title>Forest before trees: The precedence of global features in visual perception</article-title>
          .
          <source>Cognitive Psychology</source>
          ,
          <volume>9</volume>
          (
          <issue>3</issue>
          ):
          <fpage>353</fpage>
          -
          <lpage>383</lpage>
          ,
          <year>1977</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M. R.</given-names>
            <surname>Greene</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Oliva</surname>
          </string-name>
          .
          <source>The Briefest of Glances. Psychol Sci</source>
          ,
          <volume>20</volume>
          (
          <issue>4</issue>
          ):
          <fpage>464</fpage>
          -
          <lpage>472</lpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>J.</given-names>
            <surname>Hegde</surname>
          </string-name>
          <article-title>´. Time course of visual perception: coarse-to-fine processing and beyond</article-title>
          .
          <source>Prog Neurobiol</source>
          ,
          <volume>84</volume>
          (
          <issue>4</issue>
          ):
          <fpage>405</fpage>
          -
          <lpage>39</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>M.</given-names>
            <surname>Fabre-Thorpe</surname>
          </string-name>
          .
          <article-title>The characteristics and limits of rapid visual categorization</article-title>
          .
          <source>Front Psychol</source>
          ,
          <volume>2</volume>
          :
          <fpage>243</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>M. J-M Mace´</surname>
            ,
            <given-names>O. R.</given-names>
          </string-name>
          <string-name>
            <surname>Joubert</surname>
          </string-name>
          ,
          <string-name>
            <surname>J-L. Nespoulous</surname>
            , and
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Fabre-Thorpe</surname>
          </string-name>
          .
          <article-title>The time-course of visual categorizations: you spot the animal faster than the bird</article-title>
          .
          <source>PloS one</source>
          ,
          <volume>4</volume>
          (
          <issue>6</issue>
          ):e5927,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>J.</given-names>
            <surname>Freeman</surname>
          </string-name>
          and
          <string-name>
            <given-names>E. P.</given-names>
            <surname>Simoncelli</surname>
          </string-name>
          .
          <article-title>Metamers of the ventral stream</article-title>
          .
          <source>Nature neuroscience</source>
          ,
          <volume>14</volume>
          (
          <issue>9</issue>
          ):
          <fpage>1195</fpage>
          -
          <lpage>1201</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>H.</given-names>
            <surname>Strasburger</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Rentschler</surname>
          </string-name>
          , and
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Ju¨ttner. Peripheral vision and pattern recognition: a review</article-title>
          .
          <source>J Vis</source>
          ,
          <volume>11</volume>
          (
          <issue>5</issue>
          ):
          <fpage>13</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>A.</given-names>
            <surname>Nieder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. J.</given-names>
            <surname>Freedman</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E. K.</given-names>
            <surname>Miller</surname>
          </string-name>
          .
          <article-title>Representation of the quantity of visual items in the primate prefrontal cortex</article-title>
          .
          <source>Science</source>
          ,
          <volume>297</volume>
          (
          <issue>5587</issue>
          ):
          <fpage>1708</fpage>
          -
          <lpage>11</lpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>H. J.</given-names>
            <surname>Gross</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Pahl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Si</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Tautz</surname>
          </string-name>
          , and
          <string-name>
            <surname>S. Zhang.</surname>
          </string-name>
          <article-title>Number-based visual generalisation in the honeybee</article-title>
          .
          <source>PloS one</source>
          ,
          <volume>4</volume>
          (
          <issue>1</issue>
          ):e4263,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>J. D.</given-names>
            <surname>Roitman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. M.</given-names>
            <surname>Brannon</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M. L.</given-names>
            <surname>Platt</surname>
          </string-name>
          .
          <article-title>Monotonic coding of numerosity in macaque lateral intraparietal area</article-title>
          .
          <source>PLoS biology</source>
          ,
          <volume>5</volume>
          (
          <issue>8</issue>
          ):e208,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>J.</given-names>
            <surname>Ross</surname>
          </string-name>
          and
          <string-name>
            <given-names>D. C.</given-names>
            <surname>Burr</surname>
          </string-name>
          .
          <article-title>Vision senses number directly</article-title>
          .
          <source>Journal of vision</source>
          ,
          <volume>10</volume>
          (
          <issue>2</issue>
          ):
          <fpage>10</fpage>
          .
          <fpage>1</fpage>
          -
          <issue>8</issue>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>D. G.</given-names>
            <surname>Albrecht</surname>
          </string-name>
          and
          <string-name>
            <given-names>D. B.</given-names>
            <surname>Hamilton</surname>
          </string-name>
          .
          <article-title>Striate cortex of monkey and cat: contrast response function</article-title>
          .
          <source>J Neurophysiol</source>
          ,
          <volume>48</volume>
          (
          <issue>1</issue>
          ):
          <fpage>217</fpage>
          -
          <lpage>237</lpage>
          ,
          <year>Jul 1982</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>M.</given-names>
            <surname>Carandini</surname>
          </string-name>
          and
          <string-name>
            <given-names>D. J.</given-names>
            <surname>Heeger</surname>
          </string-name>
          .
          <article-title>Normalization as a canonical neural computation</article-title>
          .
          <source>Nature Reviews Neurosci</source>
          ,
          <volume>13</volume>
          :
          <fpage>51</fpage>
          -
          <lpage>62</lpage>
          ,
          <year>Jul 2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>R.</given-names>
            <surname>Rosenholtz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Raj</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. J.</given-names>
            <surname>Balas</surname>
          </string-name>
          , and
          <string-name>
            <given-names>L.</given-names>
            <surname>Ilie</surname>
          </string-name>
          .
          <article-title>A summary statistic representation in peripheral vision explains visual search</article-title>
          .
          <source>J Vis</source>
          ,
          <volume>12</volume>
          (
          <issue>4</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>17</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>H.L.</given-names>
            <surname>Resnikoff</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.O.</given-names>
            <surname>Wells</surname>
          </string-name>
          . Mathematics in Civilization. Popular Science Series. Dover,
          <year>1984</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>E. H.</given-names>
            <surname>Adelson</surname>
          </string-name>
          and
          <string-name>
            <given-names>J. R.</given-names>
            <surname>Bergen</surname>
          </string-name>
          .
          <article-title>Spatiotemporal energy models for the perception of motion</article-title>
          .
          <source>J. Opt. Soc. Am. A</source>
          ,
          <volume>2</volume>
          (
          <issue>2</issue>
          ):
          <fpage>284</fpage>
          -
          <lpage>99</lpage>
          ,
          <year>1985</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>C.</given-names>
            <surname>Zetzsche</surname>
          </string-name>
          and
          <string-name>
            <given-names>E.</given-names>
            <surname>Barth</surname>
          </string-name>
          .
          <article-title>Fundamental limits of linear filters in the visual processing of two-dimensional signals</article-title>
          .
          <source>Vision Res</source>
          ,
          <volume>30</volume>
          (
          <issue>7</issue>
          ):
          <fpage>1111</fpage>
          -
          <lpage>1117</lpage>
          ,
          <year>1990</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>C.</given-names>
            <surname>Zetzsche</surname>
          </string-name>
          and
          <string-name>
            <given-names>U.</given-names>
            <surname>Nuding</surname>
          </string-name>
          .
          <article-title>Nonlinear and higher-order approaches to the encoding of natural scenes</article-title>
          .
          <source>Network</source>
          ,
          <volume>16</volume>
          (
          <issue>2-3</issue>
          ):
          <fpage>191</fpage>
          -
          <lpage>221</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>D.J.</given-names>
            <surname>Heeger</surname>
          </string-name>
          .
          <article-title>Normalization of cell responses in cat striate cortex</article-title>
          .
          <source>Visual Neurosci</source>
          ,
          <volume>9</volume>
          (
          <issue>2</issue>
          ):
          <fpage>181</fpage>
          -
          <lpage>198</lpage>
          ,
          <year>1992</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>C.</given-names>
            <surname>Zetzsche</surname>
          </string-name>
          and
          <string-name>
            <given-names>E.</given-names>
            <surname>Barth</surname>
          </string-name>
          .
          <article-title>Image surface predicates and the neural encoding of two-dimensional signal variations</article-title>
          . In B. E. Rogowitz and Jan P. A., editors,
          <source>Proc SPIE</source>
          , volume
          <volume>1249</volume>
          , pages
          <fpage>160</fpage>
          -
          <lpage>177</lpage>
          ,
          <year>1990</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>S.</given-names>
            <surname>Dehaene</surname>
          </string-name>
          and
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Changeux</surname>
          </string-name>
          .
          <article-title>Development of elementary numerical abilities: a neuronal model</article-title>
          .
          <source>J. Cogn. Neurosci.</source>
          ,
          <volume>5</volume>
          (
          <issue>4</issue>
          ):
          <fpage>390</fpage>
          -
          <lpage>407</lpage>
          ,
          <year>1993</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>I.</given-names>
            <surname>Stoianov</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Zorzi</surname>
          </string-name>
          .
          <article-title>Emergence of a 'visual number sense' in hierarchical generative models</article-title>
          .
          <source>Nat Neurosci</source>
          ,
          <volume>15</volume>
          (
          <issue>2</issue>
          ):
          <fpage>194</fpage>
          -
          <lpage>6</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <surname>M. Ferraro E. Barth</surname>
            and
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Zetzsche</surname>
          </string-name>
          .
          <article-title>Global topological properties of images derived from local curvature features</article-title>
          . In L. P.
          <string-name>
            <surname>Cordella</surname>
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Arcelli</surname>
          </string-name>
          and G. Sanniti di Baja, editors,
          <source>Visual Form 2001. Lecture Notes in Computer Science</source>
          , pages
          <fpage>285</fpage>
          -
          <lpage>294</lpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>C.</given-names>
            <surname>Zetzsche</surname>
          </string-name>
          , E. Barth, and
          <string-name>
            <given-names>B.</given-names>
            <surname>Wegmann</surname>
          </string-name>
          .
          <article-title>The importance of intrinsically two-dimensional image features in biological vision and picture coding</article-title>
          . In A. B. Watson, editor,
          <source>Digital images and human vision</source>
          , pages
          <fpage>109</fpage>
          -
          <lpage>138</lpage>
          . MIT Press, Cambridge, MA,
          <year>1993</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>G.</given-names>
            <surname>Krieger</surname>
          </string-name>
          and
          <string-name>
            <given-names>C.</given-names>
            <surname>Zetzsche</surname>
          </string-name>
          .
          <article-title>Nonlinear image operators for the evaluation of local intrinsic dimensionality</article-title>
          .
          <source>IEEE Transactions Image Processing</source>
          ,
          <volume>5</volume>
          :
          <fpage>1026</fpage>
          -
          <lpage>1042</lpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>C.</given-names>
            <surname>Zetzsche</surname>
          </string-name>
          and
          <string-name>
            <given-names>G.</given-names>
            <surname>Krieger</surname>
          </string-name>
          .
          <article-title>Nonlinear mechanisms and higher-order statistics in biological vision and electronic image processing: review and perspectives</article-title>
          .
          <source>J Electronic Imaging</source>
          ,
          <volume>10</volume>
          (
          <issue>1</issue>
          ):
          <fpage>56</fpage>
          -
          <lpage>99</lpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [37]
          <string-name>
            <given-names>E.</given-names>
            <surname>Barth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Zetzsche</surname>
          </string-name>
          ,
          <string-name>
            <surname>and I. Rentschler.</surname>
          </string-name>
          <article-title>Intrinsic 2D features as textons</article-title>
          .
          <source>J. Opt. Soc. Am. A</source>
          ,
          <volume>15</volume>
          (
          <issue>7</issue>
          ):
          <fpage>1723</fpage>
          -
          <lpage>1732</lpage>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>