<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Forensic vs. Computing writing features as seen by Rex, the intuitive document retriever</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Vlad Atanasiu</string-name>
          <email>atanasiu@alum.mit</email>
        </contrib>
      </contrib-group>
      <pub-date>
        <year>2011</year>
      </pub-date>
      <fpage>16</fpage>
      <lpage>20</lpage>
      <abstract>
        <p>-The paper reveals the superficial matching between script features as understood by forensic experts and computer scientists and advocates the development of computational instruments tailored to fit the features traditionally used by the forensic community. In particular, and including other areas of graphonomics and the general public, there exists a demand for software for the analysis of intuitive features, think “slant” or “roundness,” as opposed to analytical features, like “Fourier transform” or “entropy.” Rex, a software with such a capability, is introduced and used to explore the potentialities of this approach for script forensics. An investigation of properties of the script contour orientation, the feature used by Rex, is also presented.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Index Terms—script features, contour orientation,
computational graphonomics, handwriting forensics,</p>
      <p>
        Semiotics — That much forensic handwriting expertise is
subjective and would profit from mathematics and computing
in its quest for objectivity and replicability is publicly admitted
[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], but the less advertised side of reality is that of software
insisting to treat the users on feasts of mathematics and
technology without actually meeting their needs [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. At the root of
this dialogue of the deaf lies, among other interesting factors of
the sociology of science, the very words “writing feature.” For
forensic experts the “feature” is usually intuitively
comprehensible, such as “slant” [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], while for computer scientists the most
powerful “features” are mathematical concepts, like “Fourier
components” or “fractal dimension,” which need specialized
knowledge for their properties to be understood. Developing
measurement software for intuitive features not only gives
forensic professionals tools which they know how to handle, but
also allows them to communicate about their work — an
essential aspect in respect to testimony in court. Intuitive features
additionally benefit the design of computer systems, improving
the ergonomy of user interfaces as exemplified in section iii.
      </p>
      <p>Cognition — An interesting viewpoint on the debate over
intuitive and analytic features is to consider mathematics as an
evolutionary outcrop of the neural computing capacities of the
brain. Intuition is evolutionary unconscious learning by
interaction with the environment to which conscious analysis
supplements when novelties arise. Thus the two can be envisioned as a
continuum, mathematics progressively becoming intuitive.</p>
      <p>Sociology — To think that the divergence of the two feature
types is a function of mathematical educational level is
overlooking a fundamental distinction. Writer identification and
verification are main mobiles of computational handwriting
forensics, and because here only results count, it can use any method
without even the need of thorough understanding insofar as it
is better. This evolutionary mindset of a goal-focused black box
approach is faced by the knowledge-oriented crystal ball
attitude seen in the traditional graphonomical research, which adds
to the control tasks mentioned above a considerable interest in
the handwriting ecosystem, i.e. the structures and dynamics of
handwriting features across populations and the underlying
factors: material, cognitive, biomechanical, sociocultural.</p>
      <p>
        Linguistics — The issues with the term “feature” extend to
a further worldview cloaking inconspicuously its users. The
proposition “This font is Roman” is considered in philosophy
either as an expression on a property owned by the font
(objectivism) or attributed to the font by an observer (subjectivism)
[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The difference is one of lifestyle: the world is there for
truth to be discovered or for models to be invented. Translated
at lexical level this is what defines the terms “feature” and
“descriptor,” among their numerous handwriting related synonyms
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. To this author “descriptor” seems more appropriate since it
doesn’t presuppose anything about the object (it just is) and it’s
easier and more fun to be critical about a model than a truth.
Incidentally, while “feature” prevails in graphonomics,
“descriptor” has a foothold in the wider pattern recognition community,
as witnessed in a wording like “shape descriptor.”
      </p>
      <p>Implications — Computer scientists have to consider in
common intelligence with forensic experts three issues worth
mentioning because they bear an influence on how the software
presented later in the paper is to be used. The issues are the desired
precision of the analysis, the definition of the features and the
affordability to analyze them in the current state of the art. I will
illustrate this through two visual examples.</p>
      <p>
        Precision — Fig. 1 presents three bitmap circles of various
sizes for which the orientation along their contour is measured
(details in section iii). Being circles, we would expect that all
orientations be equally well represented, but due to the discrete
nature of the underlying raster in which the shapes live the
distribution is biased towards the orthogonal direction — the
distribution will peak at 0 and 90 degrees ([
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], for hexagonal
grids see [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]). Making a model of the distortion and applying
it to arbitrary orientation profiles should solve the issue, but it
turns out that the distortion is shape specific. For example, a
vertical line has no distortion at all, so there is no need for
correction. A somewhat better choice is to increase the image
resolution at capture time or after, with the drawback of generating
voluminous files and knowing that often only low resolution
images are available. This digital geometry problem is
compounded upstream by the design of discrete Gaussian filters for
orientation measurement [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], and downstream by digitization,
the same physical document producing at pixel level different
shapes depending on its alignment with the digital grid of the
imaging system, hence affecting the replicability of results [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ],
[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. A number of techniques address these issues [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]–[
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] but
the implications for handwriting analysis have yet to be fully
explored, starting with the question of how much precision is
needed for which application. High accuracy graphonomics is
therefore an area open to investigation.
      </p>
      <p>Definition — I discuss now the slant of three Roman script
characters as perceived by a human and raise the question of
how this simple feature should be defined. In the case of I the
slant is vertical and corresponds to the shape’s axis of
equilibrium through its center of gravity — here the slant is a physical
property of the object. For an O there is no way to tell how the
character is oriented would the baseline be unknown — slant is
here a property of the object relative to the surrounding. The
slant of y can be considered as upright only if we are able to
identify the shape as character “y” and be aware of the
convention that this lower case letter has to be considered vertical
despite its physical right-leaning — this is a case of semantic slant.
A deeper examination might reveal even more criteria. In
conclusion, a slant analysis algorithm implementing human expert
behavior appears to be more challenging than suspected, given
first the very difficulty to define the feature, and secondly due to
the mix of perceptual and cultural considerations to model.</p>
      <p>Afordability — The last sentence leads to the issue of
affordability: do we have the technological means to perform
comprehensive slant analysis since we need to recognize unconstrained
handwritten characters? This task not being presently solved,
a positive answer can be given only if we are happy with a
certain degree of imprecision, its exact amount having to be
determined. Some of the fine computational forensic expertise
that we would wish to attain is thus yet out of reach.</p>
    </sec>
    <sec id="sec-2">
      <title>III. Rex, the intuitive document retriever</title>
      <p>
        Rationale — Written documents in databases can be retrieved
by appearance by one of the following methods: visual (using
a reference document), semantic (describing script features),
haptic (by drawing) and exogenous (from document ecosystem
metadata). Semantic retrieval is convenient because it is intuitive
(it takes place via a graphical and natural-language interface),
free of any preexisting model (not always available) and can
describe aspects of a script (contrary to the holistic approach of
visual retrieval). The software that grew out of these
considerations, called Rex, suits the demand for tools supporting forensic
specific features as described above (Fig. 2) [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]–[
        <xref ref-type="bibr" rid="ref18">18</xref>
        ].
      </p>
      <p>
        Technicalities — The software measures the local orientation
along the writing contour, a popular computational
graphonomics feature [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. This is done by applying on the binary image
of the contour an anisotropic Gaussian filter bank with one
degree of radial displacement. At this stage of this well-known
approach two innovations are introduced, in addition to the fine
grained resolution. First, after deriving the probability density
function from the orientations’ frequency count, statistical
properties of the distribution are obtained. Second, it was discovered
that these statistics correlate with various script features of the
intuitive type, perceived as distinct one from another, such as
“slant,” “roundness” or “density” (Fig. 3). To sum up, Rex
behaves like a handy, multipurpose Swiss army knife.
      </p>
      <p>Applications — The Swiss reference is not fortuitous, since
the handwriting documents presently used by Rex originate in
that country (IAM Handwriting Database 3.0 [20]). This shows
again the surprising versatility of the tool in that it is not only
a document browser, but also a teaching tool about
handwriting. In addition to learning about individual documents, Rex
provides an insight in the make-up of a population of
writers — that of the canton of Bern from where most of the dataset
writers hail (Fig. 4). The question that immediately springs to
mind —“Do writers from other parts of the world have
similar characteristics?” — is typical of the richness of research and
pedagogical possibilities opened by such an instrument (indeed,
the few Greek, Chinese and other foreigners among the
contributors show scriptural characteristics apart form the Swiss
majority). If the present usage of Rex is rather limited to a browser
of a specific dataset and much development can be imagined, it
is nevertheless also an intriguing tool to experiment with as a
testbed for other computational forensic applications.</p>
    </sec>
    <sec id="sec-3">
      <title>IV. Properties of the orientation feature</title>
      <sec id="sec-3-1">
        <title>While contour orientation is a concept easy enough to grasp,</title>
        <p>
          it has a number of less apparent properties with implications for
the expertise work. They reveal why studies find orientation not
the best performing biometric instrument [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ].
        </p>
        <p>Rotation — The feature is evidently not rotation invariant,
meaning that the same document will have different
measurement profiles depending on, for example, the skew of the
paper in a scanner (Fig. 5.1–2). However the difference is only a
translation of the profile, thus the bias can be corrected.</p>
        <p>Organization — Contour orientation exhibits some unusual
cases of shape invariance, all deriving from its low sensitivity
to the spatial organization of pixels, due to the fact that, by
definition, the measure is done locally. It is thus possible to have
perceptually different shapes with the same orientation profile.
Fig. 5.5 demonstrates scrambling invariance.</p>
        <p>Localization — The various informations that can be read
in the global orientation profile can’t be traced to specific
locations in the written document. If there is, say slant variation in
a particular line, we see it in the profile, but can’t localize the
given line and even not know if the variation is concentrated in
one line or spread over the entire document.</p>
        <p>Convexity — For 180° shape rotations the profiles are
identical, leading to shape confusion (Fig. 5.3–4).</p>
        <p>Neighborhood — Fig. 5.6 shows that lines and circles in
certain configurations can look the same to the orientation
instrument: it is unaware about the neighborhood.</p>
        <p>Additivity — Shapes contribute linearly to profiles,
facilitating combinatorial pattern simulations from primitives.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>V. Conclusions</title>
      <sec id="sec-4-1">
        <title>I conclude by reminding that forensic and computational script features are usually not identical, that they need to be thoroughly explored to be safely used, and that public software, like Rex, introduced here, are excellent learning opportunities.</title>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>B.</given-names>
            <surname>Found</surname>
          </string-name>
          and
          <string-name>
            <given-names>D.</given-names>
            <surname>Rogers</surname>
          </string-name>
          , “
          <article-title>The Probative Character of Forensic Document Examiners' Identification and Elimination Opinions on Questioned Signatures,”</article-title>
          <source>in Proc. 13th Conf. of the Intl. Graphonomics Society</source>
          , Melbourne, Australia,
          <year>2007</year>
          , pp.
          <fpage>171</fpage>
          -
          <lpage>174</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>R.J.</given-names>
            <surname>Verduijn</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.E. van den Heuvel</surname>
          </string-name>
          , R.D. Stoel, “
          <source>Forensic Requirements for Automated Handwriting Analysis Systems,” in Proc. 15th Conf. of the Intl. Graphonomics Society</source>
          , Cancun, Mexico,
          <year>2011</year>
          , pp.
          <fpage>132</fpage>
          -
          <lpage>135</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>R.A.</given-names>
            <surname>Huber</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.M.</given-names>
            <surname>Headrick</surname>
          </string-name>
          , Handwriting Identification:
          <article-title>Facts and Fundamentals</article-title>
          . Boca Raton, FL: CRC,
          <year>1999</year>
          , pp.
          <fpage>89</fpage>
          -
          <lpage>91</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>D.H.</given-names>
            <surname>Mulder</surname>
          </string-name>
          , “Objectivity,”
          <source>The Internet Encyclopedia of Philosophy, September</source>
          <volume>6</volume>
          ,
          <year>2004</year>
          [Online]. http://www.iep.utm.edu/objectiv/ [
          <source>Acessed: August 3</source>
          ,
          <year>2011</year>
          ].
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Ch. Swoyer</surname>
            and
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Orilia</surname>
          </string-name>
          , “Properties,” The Stanford Encyclopedia of Philosophy,
          <year>July 2011</year>
          [Online]. Acessible: http://plato.stanford.edu/entries/ properties/ [
          <source>Accessed: August 3</source>
          ,
          <year>2011</year>
          ].
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>G.</given-names>
            <surname>Gonzato</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Mulargia</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Ciccotti</surname>
          </string-name>
          , “
          <article-title>Measuring the fractal dimension of ideal and actual objects: implications for application in geology and geophysics</article-title>
          ,” Geophysical J. Intl., vol.
          <volume>142</volume>
          ,
          <year>2000</year>
          , pp.
          <fpage>108</fpage>
          -
          <lpage>116</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>R.</given-names>
            <surname>Klette</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Rosenfeld</surname>
          </string-name>
          , Digital Geometry, San Francisco, CA: Morgan Kaufmann,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>R.C.</given-names>
            <surname>Staunton</surname>
          </string-name>
          and
          <string-name>
            <given-names>N.</given-names>
            <surname>Storey</surname>
          </string-name>
          , “
          <article-title>A Comparison Between Square and Hexagonal Sampling Methods for Pipeline Image Processing,” in SPIE Conf. Optics, Illumination, and Image Sensing for Machine Vision</article-title>
          , Philadelphia, PA, USA,
          <year>1989</year>
          , pp.
          <fpage>142</fpage>
          -
          <lpage>151</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>E. R.</given-names>
            <surname>Davies</surname>
          </string-name>
          , “
          <article-title>Design of optimal Gaussian operators in small neighbourhoods,” Image and Vision Computing</article-title>
          , vol.
          <volume>5</volume>
          (
          <issue>3</issue>
          ),
          <year>1987</year>
          , pp.
          <fpage>199</fpage>
          -
          <lpage>205</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>B.</given-names>
            <surname>Nagy</surname>
          </string-name>
          , “
          <article-title>An algorithm to find the number of the digitizations of discs with afixed radius</article-title>
          ,
          <source>” Electronic Notes in Discrete Mathematics</source>
          , vol.
          <volume>20</volume>
          ,
          <year>2005</year>
          , pp.
          <fpage>607</fpage>
          -
          <lpage>622</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>M.N.</given-names>
            <surname>Huxley</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Žunić</surname>
          </string-name>
          , “Different Digitisations of Displaced Discs,”
          <source>Foundations of Computational Mathematics</source>
          ,
          <year>2006</year>
          , pp.
          <fpage>255</fpage>
          -
          <lpage>268</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>F. de Vieilleville</surname>
          </string-name>
          and J.
          <string-name>
            <surname>-O. Lachaud</surname>
          </string-name>
          , “
          <article-title>Comparison and improvement of tangent estimators on digital curves,” Pattern Recognition</article-title>
          , vol.
          <volume>42</volume>
          (
          <issue>8</issue>
          ),
          <year>2009</year>
          , pp.
          <fpage>1693</fpage>
          -
          <lpage>1707</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>D.</given-names>
            <surname>Coeurjolly</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.</given-names>
            <surname>Klette</surname>
          </string-name>
          , “
          <article-title>A Comparative Evaluation of Length Estimators of Digital Curves,”</article-title>
          <source>IEEE Trans. Pattern Analysis and Machine Intelligence</source>
          , vol.
          <volume>26</volume>
          (
          <issue>2</issue>
          ),
          <year>2004</year>
          , pp.
          <fpage>252</fpage>
          -
          <lpage>258</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>L.J. van Vliet and P.W.</given-names>
            <surname>Verbeek</surname>
          </string-name>
          , “
          <article-title>Curvature and bending energy in digitized 2D and 3D images,”</article-title>
          <source>in Proc. 8th Scandinavian Conf. on Image Analysis, Tromsø, Norway</source>
          ,
          <year>1993</year>
          , pp.
          <fpage>1403</fpage>
          -
          <lpage>1410</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>S.-</given-names>
            <surname>Ch</surname>
          </string-name>
          . Pei and J.-W. Horng, “
          <article-title>Fitting digital curve using circular arcs,” Pattern Recognition</article-title>
          , vol.
          <volume>28</volume>
          (
          <issue>1</issue>
          ),
          <year>1995</year>
          , pp.
          <fpage>107</fpage>
          -
          <lpage>116</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>V.</given-names>
            <surname>Atanasiu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Likforman-Sulem</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Vincent</surname>
          </string-name>
          , “
          <article-title>Rex, a description-based retriever for written documents</article-title>
          ,
          <source>” April</source>
          <year>2011</year>
          [Online]. Accessible: http:// glyph.telecom-paristech.
          <source>fr [Accessed: June</source>
          <volume>30</volume>
          ,
          <year>2011</year>
          ].
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>V.</given-names>
            <surname>Atanasiu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Likforman-Sulem</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Vincent</surname>
          </string-name>
          , “Talking Script.
          <article-title>Retrieval of written documents by description of script features,” Gazette du Livre Medieval</article-title>
          , to be published.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>V.</given-names>
            <surname>Atanasiu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Likforman-Sulem</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>N.</given-names>
            <surname>Vincent</surname>
          </string-name>
          , “
          <article-title>Writer Retrieval-Exploration of a Novel Biometric Scenario Using Perceptual Features Derived from Script Orientation,”</article-title>
          <source>in Proc. 11th Intl. Conf. on Document Analysis and Recognition</source>
          , Beijing, China,
          <year>2011</year>
          , to be published.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>M.L. Bulacu</surname>
          </string-name>
          , “
          <article-title>Statistical pattern recognition for automatic writer identification and verification</article-title>
          ,”
          <source>Ph.D. dissertation</source>
          , Artif. Intell. Inst.,
          <source>Univ. of Groningen</source>
          , The Netherlands,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>