=Paper=
{{Paper
|id=None
|storemode=property
|title=Classification of Features into Strong and Weak Features for an Intelligent Online Signature Verification System
|pdfUrl=https://ceur-ws.org/Vol-768/Paper3.pdf
|volume=Vol-768
|dblpUrl=https://dblp.org/rec/conf/icdar/TariqSH11
}}
==Classification of Features into Strong and Weak Features for an Intelligent Online Signature Verification System==
Proceedings of the 1st International Workshop on Automated Forensic Handwriting Analysis (AFHA) 2011
Classification of Features into Strong and Weak Features for an Intelligent Online
Signature Verification System
Saad Tariq, Saqib Sarwar & Waqar Hussain
Department of Electrical Engineering
Air University
Islamabad, Pakistan
tariq.saad@live.com
Abstract—This paper presents an efficient algorithm for the additional features such as pen pressure, pen speed and pen
classification of features into strong and weak features for tilt angle have made the process of forging online signatures
every distinct subject to create an intelligent online signature more difficult. Equal error rate of available online
verification system. Whereas Euclidean distance classifier is signature verification systems lies between 1 to 10%.
used for validation processes and low error rates obtained
Still a lot of work is needed to be done to reduce Equal
illustrate the feasibility of the algorithm for an online signature
verification system. error rate (EER) to make online signature verification the
most secure way of personal identification.
Keywords-Signature Biometrics; intellegient signature
verification ; online signature verification; classification of
features; strong features; weak features; dynamic signature
verification; euclidean distance classifer II. FEATURE EXTRACTION
Feature extraction phase is one of the crucial phases of
I. INTRODUCTION an on-line signature verification system. The
Today, with the astonishing growth of the Internet and discriminative power of the features and their flexibility
Intranet, E-commerce and E-finance become the hottest to the variation within the reference signatures of a
topics on this planet. Doing business through the public writer, play one of the major roles in the whole
network makes personal identification data security more verification process. While features related to the
and more crucial as well. How to protect the private signature shape are not dependent on the data acquisition
identification from being pirated is the key issue that the device, presence of dynamic features, such as pressure at
Internet and intranet clients would be concerned with the pen-tip or pen-tilt, depends on the hardware used.
before such E-business could be widely accepted since Features may be classified as global or local, where
authentication has become an essential part of highly global features identify signature’s properties as a whole
computerized services and/or security-sensitive and local ones correspond to properties specific to a
installations in modern society. sampling point. For example, signature bounding box,
Signature verification fulfills all the above described average signing speed, trajectory length or are global
circumstances and can play a vital role in protection and features, and Local features include curvature change
personal identification as it is a popular means of between consecutive points on the signature trajectory or
endorsement historically. Although such signatures are distance are local features. Features may also be
never the same for the same person at diverse times, classified as temporal (related to the dynamics) and
there appears to be no practical problem for human spatial (related to the shape).
beings to discriminate visually the real signature from the These features can be referred as human traits, as they
forged one. It will be extremely useful when an can vary from person to person and can be classified as
electronic device can display at least the same virtuosity. strong or weak for every distinct individual. If we make a
Signature verification systems are usually built following list of these features, more than 100 features are present
either on-line or off-line approaches, depending on the kind and even new features can be derived depending on their
of data and application involved. On-line systems generally discriminative power.
present a better performance than the off-line system but
require the necessary presence of the author during both the
acquisition of the reference data and the verification process
limiting its use. In online signature verification systems,
11
Proceedings of the 1st International Workshop on Automated Forensic Handwriting Analysis (AFHA) 2011
III. DATABASE & COMPILATION sample point of y-coordinate, zeros of x-velocity,
standard deviation of x-coordinates, standard deviation
A. System of y-coordinates, total number of samples, time taken,
For the purpose of signature verification we made an length, zero crossings of x-velocity, zero crossings of y-
experimental setup in which a person is enrolled in the velocity, zero crossings of x-acceleration, zero crossings
database by taking some of his/her signatures and a template of y-acceleration, zeros in x-acceleration, zeros in y-
is created and stored against the name and ID of the specified acceleration.
person. A new signature from that person can then be A pressure sensitive tablet was used that records pressure at
checked against the enrolled template to validate the person. every sample taken, providing with a very strong local
Furthermore we will discuss about the technique used in our feature of pressure.
system, database and how we optimized features as strong C. Optimization & Experimental Setup
and weak features.
Here is an important discussion that how we opted only 9
B. Database Completion features out of those 26 features for our system. As we know
A comprehensive database was created by obtaining the that a large number of features have been proposed by
signatures from the students. Signatures were gathered from researchers for online signature verification [2], [3], [4].
a total of hundred subjects with ten signatures from each However, a little work has been done in measuring the
subject. So a total of thousand signatures were collected to consistency and discriminative power of these features [5],
create the original signature database. WACOM INtuous4 [6]. On the basis of consistency and discriminative power
tablet with a sampling rate of 200 samples per seconds was features can be divided into strong and weak features, where
used for this purpose. presence of the strong features decreases the FRR while on
To form the forgeries database we performed a total 10 the other hand presence of some weak features also
forgeries per person, among which were five zero-effort decreases FRR but increases FAR. Thus there is a need to
forgeries and five skilled forgeries. The forgeries that are select the best features set.
performed by first training the counterfeiter to copy the The approach we used for classification of strong and
precise dynamics of the original signer are skilled forgeries. weak features is by using difference between mean to
A forger is trained by showing him plots of the original standard deviation ratio of each feature from the feature
signature being performed or by training the original signer vector and from the forgeries features vector set. Thus the
himself. mean/standard-deviation difference of each feature from the
template of 100 subjects was taken. The standard deviation
of a feature shows how large a deviation from the enrolled
IV. SIGNATURE VERIFICATION TECHNIQUE template can be tolerated (i.e. large deviated signature could
In the first phase, a signature verification technique was be classified as true for large standard deviation).
successfully put into operation for the classification of
original and forged signatures using Euclidean Classifier.
The technique is previously implemented by H. Dullink, B.
Van Daalen, J. Nijhuis, L. Spaanenburg, and H. Zuuidhof
[1].
A. No Pre-Processing
The technique we implemented did not use any
preprocessing because the tablet used had a sampling rate of In (1), Mo/STDo is the mean/standard-deviation ratio of
200 samples per second. Therefore it was not essential to the feature of original signatures and Mf/STDf is the
smooth or normalize the signature datasets, which were mean/standard-deviation ratio of the feature of forgery
required if we had used the signatures collected from a signature. The features with large value of mean/standard-
tablets with low resolution. Re-sampling and resizing was deviation difference as compared to others were taken as
also skipped considering the fact that valuable data is lost strong features and others as weak features eliminating
while pre-processing the data. which results in considerable good results.
A number of original signature’s features have a large
B. Feature Extraction mean/standard-deviation ratio and of course it will decrease
Among the list of features that can be extracted a total of FRR but contrary to it forgery signature’s features having a
26 features were extracted. The features extracted were large mean/standard-deviation will decrease FAR. So
standard deviation of x-acceleration, standard deviation therefore to obtain best results we took the difference
of y-acceleration, average pressure, standard deviation of between the original signature and forgery signature.
x-velocity, standard deviation of y-velocity, number of D. Optimization Results
pen-up samples, pen down time/total time taken,
standard deviation of y / change in y, pen down time, As computed using (1) nearly 14 features have greater C
RMS velocity / maximum velocity, average jerk, jerk than other 16 features. As researchers have discussed earlier
RMS, maximum sample point x-coordinate, maximum that too many features may decrease FRR but increase FAR
12
Proceedings of the 1st International Workshop on Automated Forensic Handwriting Analysis (AFHA) 2011
[7] therefore we have to choose between the best of them. Features such as total time, pen-down time and total
The 14 features with greater C are standard deviation in y- samples are all time dependent features so therefore for a
velocity, total samples, number of zeros in y- versatile verification engine we opted total time to be the
acceleration, number of zeros in x-acceleration, zero best among them. Moreover standard deviation of y-velocity
crossings in x-acceleration, zero crossings in y- is another feature having a greater result but on the standard
acceleration, zero crossings in x-velocity, zero crossings deviation of x-velocity has a very small difference, therefore
in y-velocity, length, average pressure, total time, this ambiguous result made us step down with these features
number of zeros in y-velocity, number of zeros in x- too.
velocity and pen-down time.
TABLE I. CALCULATIONS OF EQUATION (1)
V. INTELLIGENT ONLINE SIGNATUARE VERIFICATION
Feature Mo/STDo Mf/STDf C The experimental setup and optimization proposed above
gave very good results but still as we have discussed earlier
Std Dev y/∆y -4.8766 -1.9296 2.9
that signature and its features are personal traits and they
T(pen-down)/T(total) 23.3710 17.6752 5.4 may vary person to person. Thus to make this system
N (pen-ups) 3.8719 0.8551 2.95 efficient and intelligent we made it route person to person.
Standard Deviation vy 25.2692 13.9054 12.3
As we had a list of 9 most efficient features, we decided to
choose 5 out it but based on subject itself. These 5 features
Standard Deviation vx 2.8116 1.9122 0.9 may vary person to person. While recording a template from
N(vy=0) 5.8355 1.2074 4.6 a subject all these features were stored in the template but at
Average v/v( max.) 5.7595 3.3267 2.45 the time of verification we proposed a system in which only
5 features were compared against its template based on the
(x1-xmin)/average x 4.5197 2.8109 1.7
following results.
Total Samples 15.9329 2.1116 13.79 X = C/ Vx - STDf (2)
(x1-xmax)/average x -7.4158 -8.2712 0.8
N(max. y) 15.9590 17.7610 1.81 Where C is the difference between the mean/standard-
deviation ratio of the feature of original signatures and the
Standard Deviation of ay 3.1448 4.0654 0.92 mean/standard-deviation ratio of the feature of forgery
Standard Deviation of ax 1.6747 2.1500 0.48 signature from (1) which is already calculated and Vx is
Number of zeros in ay 7.7817 1.0288 6.78 current value of the sample and STDf is the standard
deviation of the forgery signature already stored. So among
Number of zeros in ax 8.5880 1.2653 7.30
the 9 features, only 5 features are opted which have a greater
Zero cross. X- 9.0654 1.3230 7.68 value of X from (2).
acceleration
Zero cross. Y- 9.6669 1.2263 8.44 A. Comparison
acceleration For comparison we need a reference. So for the
Zero cross. X-velocity 12.8354 1.5204 11.31 enrollment process we selected 5 original signatures from
each signature extracted the 9 features described above to
Zero cross. Y-velocity 13.5760 1.2228 12.35
create a reference template. The template contains the mean,
Length 7.5981 1.7094 5.89 standard deviations and their difference stored in 3 vectors R,
rms jerk 2.6554 1.9491 0.71 S and C respectively. If we want to compare a signature
average jerk 2.7470 2.4410 0.26
(original or forged) with the template we will first compute
the feature vector of that signature and corresponding vector
N(max. x) 15.4440 13.6379 1.81 X using (2). Then the greater 5 features depending on the
Average Pressure 12.1289 2.2516 9.87 value of X will be stored in a vector T. To compare the
Total Time 15.9329 2.1116 13.82 signature we will simply opt out those 5 features from R and
S and a distance vector D will be computed using Euclidean
Number of zeros in vy 8.8355 1.2074 7.63 classifier.
Number of zeros in vx 8.5746 1.2781 7.30 D=R–T (3)
(y1-ymax)/average y -3.9525 -3.1871 0.77 Then the distance vector V will be normalized by dividing
each value by the corresponding standard deviation in the
(x1-xmin)/average x 8.0218 5.4126 2.62
vector S to obtain a vector Z whose mean is then computed
Pen-down Time 29.7766 2.9390 26.87 and finally the computed norm is compared to a pre-defined
Highlighted features are with greater results threshold.
13
Proceedings of the 1st International Workshop on Automated Forensic Handwriting Analysis (AFHA) 2011
this system to make it more efficient by using other
classifiers and updating signature over time with tablets
B. Results
with better sampling rates.
Results for FRR, FAR of the template of 5 signatures of 100
subjects were computed with threshold from 4 to 9 for this
intelligent online signature verification system and best REFERENCES
results were obtained.
[1] H. Dullink, B. van Daalen, J. Nijhuis, L. Spaanenburg and H.
Zuidhof, Implementing a DSP Kernel for Online Dynamic
TABLE II. CALCULATIONS OF FFR AND FAR (1) Handwritten Signature Verification Using the TMS320 DSP Family,
EFRIE, France December 1995, SPRA 304.J. Clerk Maxwell, A
Treatise on Electricity and Magnetism, 3rd ed., vol. 2. Oxford:
Threshold FRR FAR Clarendon, 1892, pp.68–73.
[2] Charles E. Pippin, Dynamic Signature Verification using Local and
4 11.57% 0.72% Global Features, Georgia Institute of Technology, July 2004.
[3] T. S. Tolba, A Virtual-Reality-Based System for Dynamic Signature
5 11.20% 3.92% Verification, Digital Signal Processing Vol. 9, pp. 241-266, 1999.
(article available online at http://www.idealibrary.com)
6 4.53% 8.02% [4] V. S. Nalwa, Automatic On-Line Signature Verification, Proceedings
of IEEE, vol. 85, pp. 215-239, 1997.M. Young, The Technical
7 2.06% 13.62% Writer’s Handbook. Mill Valley, CA: University Science, 1989.
[5] Hao Feng and Chan Choong Wah, Online signature verification using
8 1.13% 19.89% a new extreme points warping technique, PRL(24), No. 16, pp. 2943-
2951, December 2003.
[6] H. Goto, Y. Hasegawa, and M. Tanaka, “Efficient Scheduling
9 0.66% 27.02% Focusing on the Duality of MPL Representatives,” Proc. IEEE Symp.
Computational Intelligence in Scheduling (SCIS 07), IEEE Press,
Dec. 2007, pp. 57-64, doi:10.1109/SCIS.2007.357670.
[7] G. Lorette and R. Plamondon, Dynamic approaches to handwritten
Results obtained from our implementation are very better signature verification, Computer Processing of handwriting, World
Scientific, 1990, 21-47.
than a number of techniques implemented because we used
very strong features and an intelligent system to classify
them person to person. Anyways more work can be done on
14
Proceedings of the 1st International Workshop on Automated Forensic Handwriting Analysis (AFHA) 2011
15