<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Adversarial Robustness for Face Recognition: How to Introduce Ensemble Diversity among Feature Extractors?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Takuma Amada</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kazuya Kakizaki</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Toshinori Araki</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Seng Pei Liew</string-name>
          <email>sengpei.liew@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Joseph Keshet</string-name>
          <email>jkeshet@cs.biu.ac.il</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jun Furukawa</string-name>
          <email>jun.furukawa1971g@nec.com</email>
          <email>jun.furukawa@necam.com</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Bar-Ilan University</institution>
          ,
          <addr-line>Ramat Gan, 52900</addr-line>
          ,
          <country country="IL">Israel</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>NEC Corporation</institution>
          ,
          <addr-line>7-1, Shiba, 5-chome Minato-ku, Tokyo 108-8001</addr-line>
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>NEC Israel Research Center</institution>
          ,
          <addr-line>2 Maskit Street, Herzliya Pituach</addr-line>
          ,
          <country country="IL">Israel</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>An adversarial example (AX) is a maliciously crafted input that humans recognize correctly while machine learning models do not. This paper considers how to make deep learning-based face recognition systems robust against AXs. A large number of studies have proposed methods for protecting machine learning classifiers from AXs. One of the most successful methods among them is to prepare an ensemble of classifiers and promote diversity among them. Face recognition, however, typically relies on feature extractors instead of classifiers. We found that directly applying this successful method to feature extractors leads to failure. We show that this failure is due to a lack of true diversity among the feature extractors and fix it by synchronizing the direction of features among the models. Our method significantly enhances the robustness against AXs under both white-box and black-box settings while slightly increasing the accuracy. We also compared our method with adversarial training.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Deep neural networks (DNNs) have become core
components of many essential services, as their performance
has surpassed human recognition capability in
many tasks
        <xref ref-type="bibr" rid="ref15 ref15 ref23 ref23 ref23 ref36 ref43">(Parkhi, Vedaldi, and Zisserman 2015; Schroff,
Kalenichenko, and Philbin 2015; Szegedy et al. 2015; He
et al. 2016)</xref>
        . Face recognition is one of the most widely used
services that rely on DNNs
        <xref ref-type="bibr" rid="ref49">(Sun et al. 2014)</xref>
        , ranging from
immigration inspection to smartphone authentication.
However, the vulnerability of deep learning under adversarial
examples has begun to threaten its promises
        <xref ref-type="bibr" rid="ref52">(Szegedy et al.
2014)</xref>
        .
        A wide variety of such attacks is surveyed in
        <xref ref-type="bibr" rid="ref46">(Singh et al. 2020)</xref>
        .
      </p>
      <p>An adversarial example (AX) is an imperceptibly
perturbed input that deceives a machine learning model into
misclassifying it, while a human can still correctly
classify it. Some attack methods need full access to the
model's internals (the white-box setting), while others need only oracle
access to the model (the black-box setting). The threat became
very plausible when Sharif et al. (2016) showed that physical
glasses could deceive a machine learning model in the
black-box setting. As a result of such an attack, a person
on a blacklist may wear slightly fancy glasses to evade
immigration inspection performed by machines.</p>
      <p>
        Many works have proposed ways to prevent AXs, and many
of them have been broken in turn, leading to an arms race between
defenders and attackers. Model ensembling
        <xref ref-type="bibr" rid="ref1 ref20 ref21 ref25 ref4 ref5 ref62 ref8 ref9">(Abbasi and
Gagné 2017; Dabouei et al. 2020; Kariyappa and Qureshi
2019)</xref>
        is one of the most successful prevention methods
among them. In particular, the adaptive diversity promoting
(ADP) method
        <xref ref-type="bibr" rid="ref34">(Pang et al. 2019)</xref>
        that promotes the
diversity of models in the ensemble has been the most successful.
Although, like all other defenses, it can be weakened when attacked
adaptively
        <xref ref-type="bibr" rid="ref46">(Tramèr et al. 2020)</xref>
        , such a strategy has an
advantage. It is orthogonal to other defensive approaches that
focus on enhancing single-model adversarial robustness and
can be used in tandem to achieve further adversarial
robustness. Adversarial training
        <xref ref-type="bibr" rid="ref10 ref25 ref62 ref8">(Zhong and Deng 2019)</xref>
        , another
successful AX prevention method, is an example of such a
tandem method.
      </p>
      <p>We are interested in applying the ADP method to enhance
the robustness of face recognition against AXs. Face
recognition commonly relies on a machine learning feature
extractor since it enables the service to register a huge
number of new faces without retraining the network. We
experimented with ADP’s direct application and found that it does
not improve feature extractors’ robustness against AXs.</p>
      <p>We attribute this failure to the fact that the directions of
different models' features are not comparable in a
meaningful way, and thus their diversity is insignificant. We
propose letting all the models in the ensemble share the weight
vectors of their final layers so that features of different models
can be compared in a common coordinate system. We also
propose promoting the diversity of the feature vectors directly
rather than the diversity of the classifiers. Figure 1 illustrates
our method.</p>
      <p>
        We have experimented with ADP, our methods, and
several variants. We trained them with various losses such as
ArcFace
        <xref ref-type="bibr" rid="ref10 ref62">(Deng et al. 2019)</xref>
        and CosFace
        <xref ref-type="bibr" rid="ref56">(Wang et al. 2018)</xref>
        ,
with the MS1MV2 dataset
        <xref ref-type="bibr" rid="ref10 ref62">(Deng et al. 2019)</xref>
        , the refined
version of the MS-Celeb-1M, and evaluated on VGG2 (Cao
et al. 2018). We measured the robustness by AXs adaptively
generated by I-FGSM, an iterative variant of the Fast Gradient
Sign Method (FGSM)
        <xref ref-type="bibr" rid="ref14 ref15 ref23">(Goodfellow, Shlens, and Szegedy
2015)</xref>
        , Basic Iterative Method (BIM)
        <xref ref-type="bibr" rid="ref1 ref20 ref21 ref27 ref29 ref32 ref33 ref39 ref4 ref5">(Kurakin, Goodfellow,
and Bengio 2017a; Madry et al. 2018)</xref>
        , and Carlini &amp;
Wagner (CW) attack
        <xref ref-type="bibr" rid="ref1 ref20 ref21 ref4 ref5">(Carlini and Wagner 2017b)</xref>
        . We confirmed
that ADP affects neither accuracy nor robustness against
AXs. On the other hand, our method significantly enhanced
the robustness to AXs in both white-box and black-box
settings without harming its accuracy at all.
      </p>
      <p>Despite this enhancement, our model's robustness is not
absolute: in the white-box setting, given a
sufficiently large perturbation, one can still generate successful AXs
for all legitimate samples. Although our method does show
significantly lower transferability than others in the black-box
setting, we found that well-forged AXs with
sufficiently large perturbations can again deceive the machine
recognition with very high probability.</p>
    </sec>
    <sec id="sec-2">
      <title>Preliminaries</title>
      <sec id="sec-2-1">
        <title>Face Recognition by DNN Feature Extractor</title>
        <p>Face recognition tasks typically fall into either face
identification or face verification. The former is also called closed-set
face classification and assumes the probed image belongs to
one of the objects enrolled in the gallery. The latter is also
called open-set face classification and can reject a probed
image with no corresponding object in the gallery. We focus
on open-set face recognition.</p>
        <p>
          Modern open-set face recognition systems commonly
consist of a DNN-based feature extractor that maps an input
image into a low dimensional feature space
          <xref ref-type="bibr" rid="ref49">(Sun et al. 2014)</xref>
          .
Then we can measure the similarity of the two images by
the distance between their two corresponding feature
vectors. The commonly used distances are the Euclidean
distance and the cosine distance. A feature extractor excels in a
face recognition service since, unlike a classifier, it requires no
retraining when a new face is registered. 1
        </p>
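        <p>As a concrete illustration of verification by feature distance, the following minimal NumPy sketch compares two feature vectors by cosine distance. It is illustrative only; the threshold value is an assumption for the example, not a value from this paper.</p>

```python
import numpy as np

def cosine_distance(u, v):
    """Cosine distance 1 - cos(angle) between two feature vectors."""
    u = u / np.linalg.norm(u)
    v = v / np.linalg.norm(v)
    return 1.0 - float(np.dot(u, v))

def same_identity(feat_probe, feat_registered, threshold=0.6):
    """Declare a match when the cosine distance is below a threshold.
    The threshold 0.6 is an illustrative value, not one from the paper."""
    return cosine_distance(feat_probe, feat_registered) < threshold

rng = np.random.default_rng(0)
f = rng.standard_normal(512)
assert same_identity(f, 2.0 * f)   # same direction: distance 0, match
assert not same_identity(f, -f)    # opposite direction: distance 2, no match
```

        <p>Registering a new identity then amounts to storing its feature vector; no retraining of the network is involved.</p>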
        <p>
          There are two major ways of training the DNN-based
feature extractors. One way is to train a normal multiclass
DNN classifier initially and then regard the output of the
penultimate layer of the DNN as the feature vector (i.e.,
the DNN without the last fully connected layer is a
feature extractor)
          <xref ref-type="bibr" rid="ref15 ref23 ref36 ref49">(Parkhi, Vedaldi, and Zisserman 2015; Sun
et al. 2014)</xref>
          . Another approach is to train the feature
extractor directly using the triplet loss (this is also called
metric learning)
          <xref ref-type="bibr" rid="ref15 ref23 ref43">(Schroff, Kalenichenko, and Philbin 2015)</xref>
          . A
triplet consists of two matching face thumbnails and a
nonmatching face thumbnail, and the loss aims to separate the
positive pair from the negative by a distance margin. As
the number of triplet combinations increases exponentially,
triplet loss tends to be harder to scale. We focus on the
former approach in this work.
        </p>
        <p>
          Feature extractors trained through simple softmax outputs
are effective for closed-set classification tasks but are not
discriminative enough for open-set verification. An angular
margin penalty such as ArcFace
          <xref ref-type="bibr" rid="ref10 ref62">(Deng et al. 2019)</xref>
          ,
CosFace
          <xref ref-type="bibr" rid="ref56">(Wang et al. 2018)</xref>
          , and Sphereface
          <xref ref-type="bibr" rid="ref30">(Liu et al. 2017)</xref>
          ,
and other approaches such as
          <xref ref-type="bibr" rid="ref55">(Wan et al. 2018)</xref>
           are forms of
the loss function that have successfully improved the
discriminative power of face verification by feature vectors. The
angular margin penalty modifies the final prediction layer to
enforce that the true label's prediction be more restrictive and
discriminative than the vanilla softmax's. In this
way, the loss transfers the penalty on the prediction into
the distance between features, yielding features with high inter-class
variance and low intra-class variance.
        </p>
        <p>We briefly review the angular margin penalty. We start
from the cross-entropy loss $L_{CE}(x, y)$ for a trainable
parameter $\theta$, an input $x$, and its true label $y$. We assume $g(x; \theta)$
is the classification over $n$ classes, namely the fully
connected final layer's softmax output, whose input is the output
$f(x; \theta) \in \mathbb{R}^d$ of the $d$-dimensional penultimate layer. Then,
letting $W_j \in \mathbb{R}^d$ and $b_j \in \mathbb{R}$ for $j = 1, \ldots, n$ be, respectively,
the final weight vector and the final bias for the $j$-th prediction,
$$L_{CE}(x, y) := -\mathbf{1}_y^\top \log g(x) = -\log \frac{e^{W_y^\top f(x) + b_y}}{\sum_{\ell=1}^{n} e^{W_\ell^\top f(x) + b_\ell}}.$$</p>
        <p>The loss function of ArcFace, $L_{ARC; s, m}(x, y)$,
L2-normalizes $f$ and $W_j$, sets $b_\ell = 0$, and introduces a penalty
$m$ and a smoothness (scale) hyperparameter $s$. With $\cos \theta(x, \ell) =
\frac{W_\ell^\top f(x)}{\lVert W_\ell \rVert\,\lVert f(x) \rVert}$, it is
$$L_{ARC; s, m}(x, y)
= -\log \frac{e^{s \cos(\theta(x, y) + m)}}{e^{s \cos(\theta(x, y) + m)} + \sum_{\ell \in \{1, \ldots, n\} \setminus \{y\}} e^{s \cos \theta(x, \ell)}}.$$</p>
        <p>1Because of this property, we need to evaluate the accuracy
of feature-extractor-based face recognition on datasets that do not
share labels (identities of the owners of the faces) with the training
dataset.</p>
        <p>The loss function of CosFace takes a slightly different form
of the penalty. We call each model of both ArcFace and
CosFace a single model and compare it with other models in
our experiments. The feature extractor trained by this
network is $\tilde{f}(\cdot; \theta) \in \mathbb{R}^d$, the L2-normalization of $f(\cdot; \theta)$.</p>
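        <p>To make the formula concrete, the following is a minimal NumPy sketch of the ArcFace loss for a single sample. The weights and features are random toy stand-ins, and the default $s$ and $m$ follow the values used later in our experiments; it is not our training implementation.</p>

```python
import numpy as np

def arcface_loss(f, W, y, s=64.0, m=0.5):
    """ArcFace loss for one sample.
    f: (d,) feature; W: (n, d) final-layer weights; y: true label index."""
    f = f / np.linalg.norm(f)
    W = W / np.linalg.norm(W, axis=1, keepdims=True)
    cos = W @ f                              # cos(theta(x, l)) for each class l
    logits = s * cos
    # replace the true-class logit using the additive angular margin m
    theta_y = np.arccos(np.clip(cos[y], -1.0, 1.0))
    logits[y] = s * np.cos(theta_y + m)
    # negative log-softmax of the true class
    logits -= logits.max()                   # numerical stability
    return float(-logits[y] + np.log(np.exp(logits).sum()))

rng = np.random.default_rng(0)
W = rng.standard_normal((10, 512))
f_easy = W[3] * 5.0                          # feature aligned with class 3
f_hard = rng.standard_normal(512)            # random feature
assert arcface_loss(f_easy, W, y=3) < arcface_loss(f_hard, W, y=3)
```

        <p>A well-aligned feature incurs a small loss even after the margin is added, while a misaligned one is penalized heavily, which is how the margin sharpens the learned features.</p>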
      </sec>
      <sec id="sec-2-2">
        <title>Adversarial Examples</title>
        <p>
          While deep neural networks show high
accuracy in their tasks, they are vulnerable to AXs
          <xref ref-type="bibr" rid="ref1 ref14 ref15 ref20 ref21 ref23 ref4 ref5">(Goodfellow, Shlens, and Szegedy 2015; Carlini and Wagner 2017b)</xref>
          .
An AX is a crafted input $x_{adv}$ that differs from
a source image $x_s$ by $\delta$, i.e., $x_{adv} = x_s + \delta$, such that
the target classifier misclassifies it while humans do not.
Many works search for a small $\delta$ that humans cannot
perceive, while some works such as
          <xref ref-type="bibr" rid="ref24">(Kakizaki and Yoshida
2020)</xref>
          search for a large $\delta$ that humans consider natural.
I-FGSM, an iterative variant of the Fast Gradient Sign
Method (FGSM)
          <xref ref-type="bibr" rid="ref14 ref15 ref23 ref32">(Goodfellow, Shlens, and Szegedy 2015;
Madry et al. 2018)</xref>
          , searches for $\delta$ by moving in the negative
gradient direction with respect to the target label. The Basic Iterative Method
(BIM)
          <xref ref-type="bibr" rid="ref1 ref20 ref21 ref27 ref29 ref33 ref39 ref4 ref5">(Kurakin, Goodfellow, and Bengio 2017a)</xref>
          is an
extension of I-FGSM where the search is within a given
boundary. Carlini &amp; Wagner (CW) attack
          <xref ref-type="bibr" rid="ref1 ref20 ref21 ref4 ref5">(Carlini and Wagner
2017b)</xref>
          searches for the best $\delta$ by formalizing the problem as an
optimization. The above I-FGSM, BIM, and CW methods are
among the major strong attacks available today for generating AXs.
        </p>
        <p>
          <xref ref-type="bibr" rid="ref1 ref20 ref21 ref39 ref4 ref5">(Rozsa, Günther, and Boult 2017)</xref>
          proposed a general
method, called LOTS, to generate AXs such that an internal
layer representation is close to that of a target by iteratively
adding perturbations to the source input. It uses a Euclidean
loss defined on the internal layer representations of the
origin and the target. It applies its gradient to the source input to
manipulate the source input’s internal layer representation.
We can generate an AX for feature extractors by applying
LOTS to the feature layer, which is also an internal
representation of the classifier from which the feature extractor is derived.
LOTS is sufficiently general to employ I-FGSM, BIM, and
CW for the underlying perturbation method.
        </p>
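        <p>The following toy NumPy sketch illustrates the LOTS idea under a strong simplifying assumption: the "internal layer" is a fixed linear map, so the gradient of the Euclidean loss is available in closed form. In a real attack the gradient would be obtained by backpropagation through the network.</p>

```python
import numpy as np

# Toy sketch of LOTS: drive the internal representation f(x) of a
# source input toward a target representation f_t by gradient descent
# on the Euclidean loss L(x) = 0.5 * ||f(x) - f_t||^2.  Here f is a
# fixed linear map A, so grad L = A^T (f(x) - f_t).
rng = np.random.default_rng(0)
A = rng.standard_normal((8, 16))        # toy feature extractor f(x) = A x
f = lambda x: A @ x

x_source = rng.standard_normal(16)
x_target = rng.standard_normal(16)
f_t = f(x_target)                       # target internal representation

x_adv = x_source.copy()
step = 0.01
for _ in range(2000):
    grad = A.T @ (f(x_adv) - f_t)       # gradient of the Euclidean loss
    x_adv -= step * grad                # move the representation toward f_t

# the adversarial representation ends up much closer to the target's
assert np.linalg.norm(f(x_adv) - f_t) < np.linalg.norm(f(x_source) - f_t)
```

        <p>Swapping the fixed step for an I-FGSM, BIM, or CW update rule yields the corresponding LOTS variants discussed above.</p>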
      </sec>
      <sec id="sec-2-3">
        <title>Defenses against Adversarial Examples</title>
        <p>
          Various studies have proposed methods to make DNNs
robust against AXs. They include adversarial
training
          <xref ref-type="bibr" rid="ref14 ref15 ref23 ref32">(Goodfellow, Shlens, and Szegedy 2015; Madry et al.
2018)</xref>
          , feature transformation
          <xref ref-type="bibr" rid="ref57 ref59 ref61">(Xu, Evans, and Qi 2018)</xref>
          ,
statistical analysis
          <xref ref-type="bibr" rid="ref57 ref61">(Zheng and Hong 2018)</xref>
          , manifold
learning
          <xref ref-type="bibr" rid="ref41 ref57 ref61">(Samangouei, Kabkab, and Chellappa 2018)</xref>
          ,
knowledge distillation
          <xref ref-type="bibr" rid="ref35">(Papernot et al. 2016)</xref>
          , selective
dropout
          <xref ref-type="bibr" rid="ref16 ref19">(Goswami et al. 2018, 2019)</xref>
          , an ensemble of
models, and more. Despite their partial success, none of them
completely prevents AXs from deceiving DNNs with
cleverer attack techniques such as
          <xref ref-type="bibr" rid="ref1 ref2 ref20 ref21 ref27 ref29 ref33 ref39 ref4 ref5 ref57 ref61">(Carlini and Wagner 2017a;
Athalye, Carlini, and Wagner 2018)</xref>
          . Adversarial
training, a relatively successful defense, trains DNNs by
generating AXs and including them in the training data
          <xref ref-type="bibr" rid="ref1 ref14 ref15 ref20 ref21 ref23 ref27 ref29 ref4 ref5">(Goodfellow, Shlens, and Szegedy 2015; Kurakin, Goodfellow, and
Bengio 2017b)</xref>
          . However, it cannot sufficiently defend the
model against some of the AXs that have not appeared
during the training
          <xref ref-type="bibr" rid="ref12">(Gilmer et al. 2019)</xref>
          . There have also been
studies on certified defenses, whose aim is to train
DNNs in a provably robust fashion
          <xref ref-type="bibr" rid="ref1 ref20 ref21 ref25 ref37 ref4 ref5 ref57 ref57 ref57 ref61 ref61 ref62 ref7 ref8">(Cohen, Rosenfeld, and
Kolter 2019; Hein and Andriushchenko 2017; Wong et al.
2018; Raghunathan, Steinhardt, and Liang 2018; Wong and
Kolter 2018)</xref>
          . However, this approach’s successes are largely
limited to simple DNN architectures and datasets with low
resolution.
        </p>
        <p>
          Some works such as
          <xref ref-type="bibr" rid="ref40">(Russakovsky et al. 2015)</xref>
          showed
that an ensemble of different models improves
generalizability in image classification tasks. It has since turned out that
ensemble models are also a successful defense against AXs.
          <xref ref-type="bibr" rid="ref34">(Pang et al. 2019)</xref>
          proposed adaptive diversity promoting
(ADP), which trains an ensemble of models so that their
non-maximal predictions vary greatly. Since the non-maximal
predictions differ greatly, it is hard to align them into the maximal
prediction of an AX.
        </p>
      </sec>
      <sec id="sec-2-4">
        <title>Direct Integration of the Adaptive Diversity Promoting (ADP) Method into Feature Extractors</title>
        <p>Despite the relative success of the ensemble model, all of these
works are validated only for classification. We can directly
integrate the adaptive diversity promoting (ADP) method
into the angular margin penalty for feature extraction. We
assume that our ensemble is composed of $K$ models, and
thus all parameters have $K$ duplicates. Each model predicts
over $n$ labels in training. Let the output of the $k$-th model in the ensemble that
represents the $j$-th label be $g_{k,j}$. The
ensemble prediction $G_j$ for the $j$-th label is the average of all the
models' predictions, that is, $G_j(x) = \frac{1}{K} \sum_{k=1}^{K} g_{k,j}(x)$.
Then, the Shannon entropy of the distribution $\{G_j\}_{j=1,\ldots,n}$ is
$$H(G(x)) = -\sum_{j=1}^{n} G_j(x) \log(G_j(x)).$$</p>
        <p>Let $g_{k,\setminus y} \in \mathbb{R}^{n-1}$ be the $(n-1)$-dimensional
vector that is $g_k \in \mathbb{R}^n$ without the $y$-th element,
i.e., the true prediction. Let $\tilde{g}_{k,\setminus y} \in \mathbb{R}^{n-1}$ be the L2-normalized
$g_{k,\setminus y}$, and let the $(n-1) \times K$ matrix be $M(x, y) =
(\tilde{g}_{1,\setminus y}(x), \ldots, \tilde{g}_{K,\setminus y}(x)) \in \mathbb{R}^{(n-1) \times K}$. Then, the ensemble
diversity of the non-maximal (non-$y$) predictions at $x$ is
$$ED(x, y) = \det(M(x, y)^\top M(x, y)).$$
Here, the product is of a $K \times (n-1)$ matrix and an $(n-1) \times K$ matrix, whose result is a
$K \times K$ matrix. Geometrically, it is the (squared) volume of the parallelotope
that $\{\tilde{g}_{1,\setminus y}(x), \ldots, \tilde{g}_{K,\setminus y}(x)\}$ spans. With hyperparameters
$\alpha$ and $\beta$, the regularizer for promoting adaptive diversity is
$$ADP_{\alpha,\beta}(x, y) = \alpha \cdot H(G(x)) + \beta \cdot \log(ED(x, y)).$$</p>
        <p>With the per-model losses $\{L^k_{M; s, m}(x, y)\}_{k=1,\ldots,K}$, where $M$ stands for either
ArcFace (ARC) or CosFace (COS), the ADP method
trains the ensemble by optimizing the parameters to minimize
$$L_{E,M,ADP; s, m, \alpha, \beta} := \sum_{k=1}^{K} L^k_{M; s, m}(x, y) - ADP_{\alpha,\beta}(x, y).$$</p>
        <p>The feature extractor is the normalized penultimate-layer
output $(\tilde{f}_k \in \mathbb{R}^d)_{k=1,\ldots,K}$. We let the ensemble feature $F$
be the average of all the feature extractors' outputs:
$$F(x) = \frac{1}{K} \sum_{k=1}^{K} \tilde{f}_k(x).$$
Here, we omitted $\theta$. We use the Euclidean distance between
the ensemble feature $F(x)$ of an input $x$ and the
registered feature $F(x')$ of another input $x'$ as the measure of
similarity for our verification task.</p>
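        <p>The ensemble-diversity term above can be sketched in a few lines of NumPy. This toy version operates on given prediction vectors and only illustrates the Gram-determinant construction; it is not the training code, where the term is differentiated together with the loss.</p>

```python
import numpy as np

def ensemble_diversity(preds, y):
    """ED(x, y): stack the L2-normalized non-maximal prediction vectors
    of K models as columns of M and return det(M^T M), the squared
    volume of the parallelotope they span.
    preds: (K, n) per-model predictions; y: index of the true label."""
    M = np.delete(preds, y, axis=1)                   # drop the y-th entry
    M = M / np.linalg.norm(M, axis=1, keepdims=True)  # L2-normalize each row
    M = M.T                                           # shape (n-1, K)
    return float(np.linalg.det(M.T @ M))              # K x K Gram determinant

K, n, y = 3, 5, 2
rng = np.random.default_rng(1)
aligned = np.tile(rng.random(n), (K, 1))              # identical models
diverse = rng.random((K, n))                          # differing models
assert ensemble_diversity(aligned, y) < 1e-9          # no diversity: det ~ 0
assert ensemble_diversity(diverse, y) > ensemble_diversity(aligned, y)
```

        <p>When all models agree, the columns of $M$ coincide, the Gram matrix is rank one, and the diversity collapses to zero, which is exactly what the regularizer penalizes.</p>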
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Our Approach</title>
      <sec id="sec-3-1">
        <title>Problem and Challenge</title>
        <p>We will later see in our experiments that ADP scarcely
enhances the robustness of the network against AXs. We see
its cause as follows. Let the normalized weight vectors of the
final layers of all the models be $\{\tilde{W}^k_\ell\}_{(k,\ell) \in \{1,\ldots,K\} \times \{1,\ldots,n\}}$.
Here, $k$ runs over models in the ensemble, and $\ell$ runs over
labels. Let $\{\tilde{f}_k\}_k$ be the feature extractors; again, $k$ runs over
models in the ensemble. The diversity promotion in the
ensemble aims to enforce that any perturbation of an input $x$ brings
it to an $x'$ whose features $\{\tilde{f}_k(x')\}_k$ in different models are
close to $\tilde{W}^k_\ell$ for different $\ell$'s. ADP is likely to achieve
such a property. However, a difference in the directions of
features does not imply a difference in the directions relative to the
learned features, i.e., the weight vectors. For example,
suppose that $\tilde{f}_1(x)$ near $\tilde{W}^1_1$ moves in a very different direction
from the direction in which $\tilde{f}_2(x)$ near $\tilde{W}^2_1$ moves when we
perturb $x$ by $\delta$. If these directions are nevertheless such that
$\tilde{f}_1(x)$ moves toward $\tilde{W}^1_2$ and $\tilde{f}_2(x)$ moves toward $\tilde{W}^2_2$, the
difference in directions does not prevent AXs. Since the weight
vectors are independent among different models, the
previous approaches do not prevent such situations from
occurring. Therefore, we need to promote the features
of all the ensemble models to be diverse relative to the
respective weight vectors' positions.</p>
      </sec>
      <sec id="sec-3-2">
        <title>Our Solution</title>
      </sec>
      <sec id="sec-3-3">
        <title>Shared Representative Vector</title>
        <p>We propose to share the
weight vectors $\{\tilde{W}^k_\ell\}_{(k,\ell) \in \{1,\ldots,K\} \times \{1,\ldots,n\}}$ of the final
layer among all the models; that is, $\tilde{W}^k_\ell$ is the same for all $k$,
and we let $\tilde{W}_\ell$ denote this shared vector. The change promotes all
features of label $y$ to be close to the same $\tilde{W}_y$ independently of the
model. Hence, the adversary needs to find a perturbation
of $x$ that moves all features $\{\tilde{f}_k(x)\}_{k=1,\ldots,K}$ from $\tilde{W}_y$ to
$\tilde{W}_{y'}$. Suppose the directions of the susceptibility of features
to perturbation differ among models. Then,
it is hard for an adversary to find a perturbation that moves all the
features in the same direction. If we promote diversity of
features, we can expect the chart of features around a weight
vector to differ among models, and hence that the
ensemble increases its robustness to AXs.</p>
        <p>We call the weight vectors $\tilde{W}_\ell$ that all models share the
shared representative vectors (SRV). Let $\theta_k(x, \ell)$ be
the angle, i.e., the arc, between $\tilde{W}_\ell$ and $\tilde{f}_k(x)$ in $\mathbb{R}^d$. That is,
$\theta_k(x, \ell)$ is such that
$$\cos \theta_k(x, \ell) = \frac{W_\ell^\top f_k(x)}{\lVert W_\ell \rVert\,\lVert f_k(x) \rVert} = \tilde{W}_\ell^\top \tilde{f}_k(x).$$
We have the loss function with the shared representative
vectors as
$$L_{E,ARC,SRV; s, m}(x, y)
= -\sum_{k=1}^{K} \log \frac{e^{s \cos(\theta_k(x, y) + m)}}{e^{s \cos(\theta_k(x, y) + m)} + \sum_{\ell \in \{1, \ldots, n\} \setminus \{y\}} e^{s \cos \theta_k(x, \ell)}}.$$
We expect feature extractors trained by the following
loss function to be robust to AXs.</p>
        <p>$$L_{E,ARC,SRV,FDP; s, m, \gamma}(x, y)
= L_{E,ARC,SRV; s, m}(x, y) - FDP_\gamma(x).$$
We define $L_{E,COS,SRV,FDP; s, m, \gamma}(x, y)$ similarly.</p>
        <p>Feature Diversity Promotion: In ADP, we promote the
non-maximal predictions of the models in the ensemble to be
diverse. However, it is the features, rather than the predictions,
that we want to make hard for adversaries to manipulate. Hence, we
choose to promote the diversity of the ensemble features
directly. We can measure the diversity $ED_{feat}$ of the ensemble
features at $x$ in the same manner as before. Let $\tilde{F}(x) =
(\tilde{f}_1(x), \ldots, \tilde{f}_K(x)) \in \mathbb{R}^{d \times K}$ be a $d \times K$ matrix. Then,
$$ED_{feat}(x) = \det(\tilde{F}(x)^\top \tilde{F}(x)).$$
Here, the determinant is of a $K \times K$ matrix. We promote
the ensemble feature extractors to be such that their features
are diverse by the following regularizer:
$$FDP_\gamma(x) = \gamma \cdot \log(ED_{feat}(x)).$$
The weighting coefficient $\gamma$ is a hyperparameter. The
regularizer has no Shannon entropy term, unlike $ADP_{\alpha,\beta}$,
since we do not need to balance the features against values such as the
maximal prediction in ADP.</p>
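        <p>A minimal NumPy sketch of the FDP regularizer follows, assuming the $K$ models' features arrive as rows of an array. It is illustrative only; in training, the term is differentiated together with the loss rather than evaluated in isolation.</p>

```python
import numpy as np

def fdp(features, gamma=10.0):
    """FDP regularizer: with the K L2-normalized feature vectors as
    columns of F (d x K), return gamma * log det(F^T F).
    features: (K, d) array, one row per model in the ensemble."""
    F = (features / np.linalg.norm(features, axis=1, keepdims=True)).T
    ed_feat = np.linalg.det(F.T @ F)     # K x K Gram determinant
    return gamma * np.log(ed_feat)

rng = np.random.default_rng(0)
orthogonal = np.eye(3, 512)              # maximally diverse features
similar = np.tile(rng.standard_normal(512), (3, 1)) \
          + 0.01 * rng.standard_normal((3, 512))
assert fdp(orthogonal) > fdp(similar)    # nearly parallel features are penalized
```

        <p>Since the regularizer is subtracted from the loss, maximizing it pushes the $K$ features toward mutually distinct directions around the shared representative vectors.</p>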
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Experiments</title>
      <sec id="sec-4-1">
        <title>Implementation Details</title>
        <p>
          We followed the same training process as in
          <xref ref-type="bibr" rid="ref10 ref62">(Deng et al.
2019)</xref>
          . We adopt the MS1MV2 dataset
          <xref ref-type="bibr" rid="ref10 ref62">(Deng et al. 2019)</xref>
          , the
refined version of the MS-Celeb-1M dataset, for the
training, and VGG2 for the verification. The training set of
the MS1MV2 dataset includes 5.8M face images of 85K
identities. For data preprocessing, we crop face images to
the size of 112×112 and align them by utilizing
MTCNN
          <xref ref-type="bibr" rid="ref60">(Zhang et al. 2016)</xref>
          .
        </p>
        <p>
          For the embedding network, we employ the widely used
CNN architecture, MobileFacenet
          <xref ref-type="bibr" rid="ref6">(Chen et al. 2018)</xref>
          .
After the last convolutional layer, we explore the BN
          <xref ref-type="bibr" rid="ref15 ref23">(Ioffe
and Szegedy 2015)</xref>
          -Dropout
          <xref ref-type="bibr" rid="ref48">(Srivastava et al. 2014)</xref>
          -FC-BN
structure to get the final 512-dimensional embedding
feature.
        </p>
        <p>
          We follow
          <xref ref-type="bibr" rid="ref56">(Wang et al. 2018)</xref>
          to set the feature scale
$s = 64$ and choose the angular margin of ArcFace as $m = 0.5$.
We choose the angular margin of CosFace as $m = 0.35$.
        </p>
        <p>We set the batch size to 256 and train the ensemble
consisting of three feature extractors on one NVIDIA Tesla V100
(32GB) GPU. We set the initial learning rate to $10^{-3}$ and
divide it by 10 at epochs 12, 15, and 18. The training process
finishes at epoch 20.</p>
        <p>
          We have experimented with the effectiveness of
regularizers $ADP_{\alpha,\beta}(x, y)$ and $FDP_\gamma(x)$ on the robustness of
feature extractors trained by the ensemble model of
ArcFace and CosFace. The regularizer hyperparameters we
tested are $(\alpha, \beta) = (2.0, 0.5), (2.0, 10.0)$, and $(2.0, 50.0)$
for ADP, and $\gamma = 1.0, 10.0$, and $50.0$ for FDP. We also
compared our method with one of the best adversarial training,
which exploits margin-based triplet embedding
regularization
          <xref ref-type="bibr" rid="ref10 ref25 ref62 ref8">(Zhong and Deng 2019)</xref>
          . They did not experiment
with the MobileFacenet architecture that we adopted. They also observed
that the robustness varies depending on the margin in the
triplet embedding regularization term. Hence, we tried
several hyperparameter values, $m = 0.2, 0.6, 1.4$, and $3.0$, to
find the best one for MobileFacenet.
        </p>
        <p>We applied attacks such as I-FGSM, BIM, and CW to
the following models in the LOTS framework: (1) "single
model," which is the original model; (2) "baseline," which
is a simple ensemble of original models; (3) "ADP," which
is a simple ensemble model with the ADP regularizer; (4)
"FDP," which is a simple ensemble model with the FDP
regularizer; (5) "SRV," which is an ensemble with a shared
representative vector; (6) "SRV+ADP," which is SRV with the ADP
regularizer; (7) "AdvT for $m = 0.2, 0.6, 1.4, 3.0$," which is
adversarial training; and (8) "Our method," which is our full
model combining SRV with the FDP regularizer. The number of
models in each ensemble is three ($K = 3$).</p>
        <p>All the attacks are adaptive, meaning we applied the
attack methods to the robustified networks themselves. We could
compare our method only with those previous methods that we
experimented with ourselves, since their reported results are under
different conditions.</p>
      </sec>
      <sec id="sec-4-2">
        <title>Performance on Legitimate Samples</title>
        <p>We verified the accuracy of verification on the VGG2
dataset. Table 1 shows the ROC-AUCs of the single model,
ADP, AdvT, and our method (SRV+FDP) for ArcFace and
CosFace. The hyperparameters yielding the best
ROC-AUC are $(\alpha, \beta) = (2.0, 0.5)$, $m = 0.6$, and $\gamma = 10.0$. Our
proposed SRV+FDP is not only no worse than the original
single model but performs best.</p>
        <p>We also show the accuracy of verification on
several legitimate datasets of different conditions in Table 2.</p>
        <p>[Tables 1 and 2 appeared here, listing ROC-AUC and accuracy values for the single model, Baseline, ADP, FDP, SRV, SRV+ADP, AdvT, and Our method under the ArcFace and CosFace losses; the tabular layout could not be recovered from the extraction.]</p>
        <p>
          These datasets are LFW (Huang et al. 2007),
AgeDB-30
          <xref ref-type="bibr" rid="ref33">(Moschoglou et al. 2017)</xref>
          , and CFP-FP
          <xref ref-type="bibr" rid="ref44">(Sengupta et al.
2016)</xref>
          , which provide face data in unconstrained settings.
We can see that our SRV+FDP is often slightly better
than the single model. On the other hand, ADP and
SRV+ADP are often worse than the single model. We can
use our method in tandem with other methods. The
experiments are with ArcFace and CosFace.
        </p>
      </sec>
      <sec id="sec-4-6">
        <title>Robustness against Adversarial Examples in the White-Box Setting</title>
        <p>We generate AXs by LOTS with I-FGSM, BIM, and CW in
the white-box setting. We did not limit the number of
iterations in any attack; it swells beyond 1,000 in some
cases. We chose a rather large boundary ($\epsilon = 0.1$) for BIM.
We randomly sampled 1000 pairs of different-identity images
from the VGG2 test dataset and generated 500 AXs. As we
imposed few constraints on the AXs, all the attacks were successful
with their different perturbation sizes. We set the learning
rate of the underlying SGD to 0.1. We note that we can still
recognize the images with the largest perturbation with our own
eyes.</p>
        <p>We compared the robustness of the various methods through
the size of the perturbation. We define the ε-attack success rate
as</p>
        <p>Acc = |{x_adv ∈ AX : ||x_adv − x_s||_2 &lt; ε}| / |AX|.   (1)</p>
        <p>Here, x_s is a legitimate sample, and x_adv is the AX created
from x_s. AX is the set of all successful AXs generated
in the white-box setting. This measurement represents the
proportion of adversarial samples whose perturbation size
is less than ε among all legitimate images. We can say that the
larger the perturbation needed to fool the feature extractors, the
more robust they are.</p>
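        <p>Equation (1) can be computed directly from the stored perturbations; a minimal sketch (function and variable names are ours, not the paper's):</p>

```python
import numpy as np

def attack_success_rate(x_advs, x_srcs, eps):
    """Eq. (1): fraction of successful AXs whose L2 perturbation is below eps."""
    dists = [np.linalg.norm(x_adv - x_src) for x_adv, x_src in zip(x_advs, x_srcs)]
    return sum(d < eps for d in dists) / len(dists)
```

        <p>Sweeping eps over a range then yields curves like those in Figure 2: a curve that rises only at larger eps indicates a more robust model.</p>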
        <p>Graphs in the upper row of Figure 2 show the Acc of
LOTS via I-FGSM, BIM, and CW, respectively, for
ArcFace. We observed that (1) the baseline, ADP, and the FDP
do not enhance the robustness against AXs in a noticeable
manner, (2) SRV alone enhances robustness, (3) the
addition of ADP to SRV does not enhance robustness, (4) Our</p>
        <p>FDP+SRV has the best robustness. We see the same trends
for the other common loss function, CosFace, in the graphs in
the lower row of Figure 2 (LOTS via I-FGSM, BIM, and CW).</p>
        <p>Graphs in Figure 3 compare our method with
adversarial training by the Acc of LOTS via I-FGSM, BIM, and
CW, respectively, for ArcFace. The adversarial training is
performed with several values of the hyperparameter "m". Differences in
robustness among them are marginal. We see that our method
enjoys stronger robustness in all cases.</p>
      </sec>
      <sec id="sec-4-8">
        <title>Robustness against Adversarial Examples in Black-Box Setting</title>
        <p>We first generate AXs by LOTS with I-FGSM, BIM, and
CW against the single model in a white-box setting. We
generated several sets of AXs with different distances between
their features and the target features. All of these distances
are sufficiently small that all AXs are successful attacks.
Although an AX with a smaller distance is harder to generate, we
successfully generated AXs for all input images by searching
for a longer time. On average, it took 10.97 seconds to
generate an AX whose distance from the target is 0.2. As in the
white-box experiments, we did not restrict the
number of iterations, and we chose a rather large perturbation
boundary. We applied these AXs to each model for evaluation;
the results are shown in Figure 4.</p>
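        <p>The transferability evaluation described above could be sketched as follows; `models` maps names to feature extractors, and an AX counts as transferred when its features on a held-out model fall within a match threshold of the target identity's features (all names and the threshold are illustrative assumptions, not the paper's code):</p>

```python
import numpy as np

def transfer_rates(x_advs, f_targets, models, tol=0.5):
    """Per-model fraction of AXs that still match the target identity's features."""
    rates = {}
    for name, extract in models.items():
        hits = [np.linalg.norm(extract(x) - f) < tol
                for x, f in zip(x_advs, f_targets)]
        rates[name] = sum(hits) / len(hits)
    return rates
```

        <p>Repeating this for AX sets crafted at different feature distances gives transferability as a function of that distance, as plotted in Figure 4.</p>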
        <p>The results show that our method significantly
suppresses the transferability of AXs. We can also see a clear
relation between the transferability of the AXs and the
distance between features. We consider this is because every
member of our ensemble is derived from the same single
model. The transferability surprisingly reaches 1 when the
distance between the AX and the target in feature space
becomes sufficiently small. However, this comes at the cost of a
much larger perturbation in the input images, as shown in
Figure 5. The transferability for all models approaches zero
as the distance between features becomes large, because an
AX is no longer a successful attack at such a large distance.
That we eventually reach complete transferability at a small
distance in feature space indicates that an essential
improvement within the single model is necessary, unless we can
strictly forbid small perturbation sizes by some means. We
consider this a clear limitation of the ensemble-model method.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>Introducing ensemble diversity among feature extractors for
adversarial robustness was not straightforward. We proposed a novel
method that effectively increases the robustness of
feature extractors. Our method might not completely prevent
adversarial examples when the perturbation is large. However,
it can be used in tandem with other methods to
increase their robustness. Our method also showed stronger
robustness than one of the adversarial training methods.
</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Abbasi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Gagné</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <year>2017</year>
          .
          <article-title>Robustness to Adversarial Examples through an Ensemble of Specialists</article-title>
          .
          <source>In 5th International Conference on Learning Representations, ICLR</source>
          <year>2017</year>
          , Toulon, France,
          <source>April 24-26</source>
          ,
          <year>2017</year>
          , Workshop Track Proceedings.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Athalye</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Carlini</surname>
          </string-name>
          , N.; and
          <string-name>
            <surname>Wagner</surname>
            ,
            <given-names>D. A.</given-names>
          </string-name>
          <year>2018</year>
          .
          <article-title>Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples</article-title>
          .
          <source>In Proceedings of the 35th International Conference on Machine Learning</source>
          ,
          <string-name>
            <surname>ICML</surname>
          </string-name>
          <year>2018</year>
          , Stockholmsmässan, Stockholm, Sweden,
          <source>July 10-15</source>
          ,
          <year>2018</year>
          ,
          <fpage>274</fpage>
          -
          <lpage>283</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          2018.
          <article-title>VGGFace2: A Dataset for Recognising Faces across Pose and Age</article-title>
          . In FG,
          <fpage>67</fpage>
          -
          <lpage>74</lpage>
          . IEEE Computer Society.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Carlini</surname>
          </string-name>
          , N.; and
          <string-name>
            <surname>Wagner</surname>
            ,
            <given-names>D. A.</given-names>
          </string-name>
          <year>2017a</year>
          .
          <article-title>Adversarial Examples Are Not Easily Detected: Bypassing Ten Detection Methods</article-title>
          .
          <source>In Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security</source>
          ,
          <source>AISec@CCS</source>
          <year>2017</year>
          , Dallas, TX, USA, November 3,
          <year>2017</year>
          ,
          <fpage>3</fpage>
          -
          <lpage>14</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Carlini</surname>
          </string-name>
          , N.; and
          <string-name>
            <surname>Wagner</surname>
            ,
            <given-names>D. A.</given-names>
          </string-name>
          <year>2017b</year>
          .
          <article-title>Towards Evaluating the Robustness of Neural Networks</article-title>
          .
          <source>In IEEE Symposium on Security and Privacy</source>
          ,
          <volume>39</volume>
          -
          <fpage>57</fpage>
          . IEEE Computer Society.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; Liu,
          <string-name>
            <given-names>Y.</given-names>
            ;
            <surname>Gao</surname>
          </string-name>
          ,
          <string-name>
            <surname>X.</surname>
          </string-name>
          ; and Han,
          <string-name>
            <surname>Z.</surname>
          </string-name>
          <year>2018</year>
          .
          <article-title>MobileFaceNets: Efficient CNNs for Accurate Real-time Face Verification on Mobile Devices</article-title>
          . CoRR abs/1804.07573.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Cohen</surname>
            ,
            <given-names>J. M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Rosenfeld</surname>
          </string-name>
          , E.; and
          <string-name>
            <surname>Kolter</surname>
            ,
            <given-names>J. Z.</given-names>
          </string-name>
          <year>2019</year>
          . Certified Adversarial Robustness via Randomized Smoothing.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          In
          <string-name>
            <surname>Chaudhuri</surname>
          </string-name>
          , K.; and Salakhutdinov, R., eds.,
          <source>Proceedings of the 36th International Conference on Machine Learning</source>
          ,
          <string-name>
            <surname>ICML</surname>
          </string-name>
          <year>2019</year>
          ,
          <volume>9</volume>
          -
          <fpage>15</fpage>
          June 2019, Long Beach, California, USA, volume
          <volume>97</volume>
          <source>of Proceedings of Machine Learning Research</source>
          ,
          <volume>1310</volume>
          -
          <fpage>1320</fpage>
          . PMLR.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Dabouei</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Soleymani</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Taherkhani</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Dawson</surname>
          </string-name>
          , J.; and
          <string-name>
            <surname>Nasrabadi</surname>
            ,
            <given-names>N. M.</given-names>
          </string-name>
          <year>2020</year>
          .
          <article-title>Exploiting Joint Robustness to Adversarial Perturbations</article-title>
          .
          <source>In The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).</source>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Deng</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Guo</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ; Xue, N.; and
          <string-name>
            <surname>Zafeiriou</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <year>2019</year>
          .
          <article-title>ArcFace: Additive Angular Margin Loss for Deep Face Recognition</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          In CVPR
          ,
          <fpage>4690</fpage>
          -
          <lpage>4699</lpage>
          . Computer Vision Foundation / IEEE.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Gilmer</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Ford</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Carlini</surname>
          </string-name>
          , N.; and
          <string-name>
            <surname>Cubuk</surname>
            ,
            <given-names>E. D.</given-names>
          </string-name>
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <article-title>Adversarial Examples Are a Natural Consequence of Test Error in Noise</article-title>
          .
          <source>In Proceedings of the 36th International Conference on Machine Learning</source>
          ,
          <string-name>
            <surname>ICML</surname>
          </string-name>
          <year>2019</year>
          ,
          <volume>9</volume>
          -
          <fpage>15</fpage>
          June 2019, Long Beach, California, USA,
          <fpage>2280</fpage>
          -
          <lpage>2289</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Goodfellow</surname>
            ,
            <given-names>I. J.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Shlens</surname>
          </string-name>
          , J.; and
          <string-name>
            <surname>Szegedy</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <article-title>Explaining and Harnessing Adversarial Examples</article-title>
          .
          <source>In 3rd International Conference on Learning Representations, ICLR</source>
          <year>2015</year>
          , San Diego, CA, USA, May 7-
          <issue>9</issue>
          ,
          <year>2015</year>
          , Conference Track Proceedings.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Goswami</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Agarwal</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Ratha</surname>
            ,
            <given-names>N. K.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Singh</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ; and Vatsa,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <year>2019</year>
          .
          <article-title>Detecting and Mitigating Adversarial Perturbations for Robust Face Recognition</article-title>
          .
          <source>Int. J. Comput.</source>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          Vis.
          <volume>127</volume>
          (
          <issue>6-7</issue>
          ):
          <fpage>719</fpage>
          -
          <lpage>742</lpage>
          . doi:
          <volume>10</volume>
          .1007/s11263-019-01160-w.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>URL https://doi.org/10.1007/s11263-019-01160-w.</mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>Goswami</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ; Ratha,
          <string-name>
            <given-names>N. K.</given-names>
            ;
            <surname>Agarwal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ;
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <surname>R.</surname>
          </string-name>
          ; and Vatsa,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <year>2018</year>
          .
          <article-title>Unravelling Robustness of Deep Learning Based Face Recognition Against Adversarial Attacks</article-title>
          . In
          <string-name>
            <surname>McIlraith</surname>
            ,
            <given-names>S. A.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Weinberger</surname>
          </string-name>
          , K. Q., eds.,
          <source>Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18)</source>
          ,
          <fpage>6829</fpage>
          -
          <lpage>6836</lpage>
          . AAAI Press.
        </mixed-citation>
      </ref>
      <ref id="ref19b">
        <mixed-citation>
          <string-name>
            <surname>He</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Ren</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Sun</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <year>2016</year>
          .
          <article-title>Deep Residual Learning for Image Recognition</article-title>
          .
          <source>In 2016 IEEE Conference on Computer Vision</source>
          and Pattern Recognition,
          <string-name>
            <surname>CVPR</surname>
          </string-name>
          <year>2016</year>
          ,
          <string-name>
            <surname>Las</surname>
            <given-names>Vegas</given-names>
          </string-name>
          ,
          <string-name>
            <surname>NV</surname>
          </string-name>
          , USA, June 27-30,
          <year>2016</year>
          ,
          <fpage>770</fpage>
          -
          <lpage>778</lpage>
          . IEEE Computer Society.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>Hein</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ; and Andriushchenko,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <year>2017</year>
          .
          <article-title>Formal Guarantees on the Robustness of a Classifier against Adversarial Manipulation</article-title>
          . In Guyon, I.; von Luxburg, U.;
          <string-name>
            <surname>Bengio</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; Wallach,
          <string-name>
            <given-names>H. M.</given-names>
            ;
            <surname>Fergus</surname>
          </string-name>
          ,
          <string-name>
            <surname>R.</surname>
          </string-name>
          ; Vishwanathan,
          <string-name>
            <surname>S.</surname>
          </string-name>
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <surname>V. N.</surname>
          </string-name>
          ; and Garnett, R., eds.,
          <source>Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems</source>
          <year>2017</year>
          ,
          <fpage>4</fpage>
          -9
          <source>December</source>
          <year>2017</year>
          , Long Beach, CA, USA,
          <fpage>2266</fpage>
          -
          <lpage>2276</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>G. B.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Ramesh</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Berg</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Learned-Miller</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <year>2007</year>
          .
          <article-title>Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments</article-title>
          .
          <source>Technical Report 07-49</source>
          , University of Massachusetts, Amherst.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <surname>Ioffe</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Szegedy</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <year>2015</year>
          .
          <article-title>Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift</article-title>
          . In
          <string-name>
            <surname>Bach</surname>
            ,
            <given-names>F. R.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Blei</surname>
          </string-name>
          , D. M., eds.,
          <source>Proceedings of the 32nd International Conference on Machine Learning</source>
          ,
          <string-name>
            <surname>ICML</surname>
          </string-name>
          <year>2015</year>
          , Lille, France,
          <fpage>6</fpage>
          -
          <issue>11</issue>
          <year>July 2015</year>
          , volume
          <volume>37</volume>
          <source>of JMLR Workshop and Conference Proceedings</source>
          ,
          <fpage>448</fpage>
          -
          <lpage>456</lpage>
          . JMLR.org.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <surname>Kakizaki</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Yoshida</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <year>2020</year>
          .
          <article-title>Adversarial Image Translation: Unrestricted Adversarial Examples in Face Recognition Systems</article-title>
          . In SafeAI@AAAI, volume
          <volume>2560</volume>
          <source>of CEUR Workshop Proceedings</source>
          ,
          <fpage>6</fpage>
          -
          <lpage>13</lpage>
          . CEUR-WS.org.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <surname>Kariyappa</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; and Qureshi,
          <string-name>
            <surname>M. K.</surname>
          </string-name>
          <year>2019</year>
          .
          <article-title>Improving Adversarial Robustness of Ensembles with Diversity Training</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          CoRR abs/1901.09981.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          <string-name>
            <surname>Kurakin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Goodfellow</surname>
            ,
            <given-names>I. J</given-names>
          </string-name>
          .; and
          <string-name>
            <surname>Bengio</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <year>2017a</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          <article-title>Adversarial examples in the physical world</article-title>
          .
          <source>In 5th International Conference on Learning Representations, ICLR</source>
          <year>2017</year>
          , Toulon, France,
          <source>April 24-26</source>
          ,
          <year>2017</year>
          , Workshop Track Proceedings.
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          <string-name>
            <surname>Kurakin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Goodfellow</surname>
            ,
            <given-names>I. J</given-names>
          </string-name>
          .; and
          <string-name>
            <surname>Bengio</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <year>2017b</year>
          .
          <source>Adversarial Machine Learning at Scale. In 5th International Conference on Learning Representations, ICLR</source>
          <year>2017</year>
          , Toulon, France,
          <source>April 24-26</source>
          ,
          <year>2017</year>
          , Conference Track Proceedings.
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ; Wen,
          <string-name>
            <given-names>Y.</given-names>
            ;
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            ;
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ;
            <surname>Raj</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ; and
            <surname>Song</surname>
          </string-name>
          ,
          <string-name>
            <surname>L.</surname>
          </string-name>
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          <article-title>SphereFace: Deep Hypersphere Embedding for Face Recognition</article-title>
          .
          <source>In CVPR</source>
          ,
          <fpage>6738</fpage>
          -
          <lpage>6746</lpage>
          . IEEE Computer Society.
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          <string-name>
            <surname>Madry</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Makelov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Schmidt</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Tsipras</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Vladu</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <year>2018</year>
          .
          <article-title>Towards Deep Learning Models Resistant to Adversarial Attacks</article-title>
          .
          <source>In 6th International Conference on Learning Representations, ICLR</source>
          <year>2018</year>
          , Vancouver, BC, Canada, April 30 - May 3,
          <year>2018</year>
          , Conference Track Proceedings.
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          <string-name>
            <surname>Moschoglou</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Papaioannou</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Sagonas</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Deng</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Kotsia</surname>
            ,
            <given-names>I.;</given-names>
          </string-name>
          and
          <string-name>
            <surname>Zafeiriou</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <year>2017</year>
          .
          <article-title>AgeDB: The First Manually Collected, In-the-Wild Age Database</article-title>
          .
          <source>In CVPR Workshops</source>
          ,
          <year>1997</year>
          -
          <fpage>2005</fpage>
          . IEEE Computer Society.
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          <string-name>
            <surname>Pang</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Du</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Chen</surname>
          </string-name>
          , N.; and
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <year>2019</year>
          .
          <article-title>Improving Adversarial Robustness via Promoting Ensemble Diversity</article-title>
          . In ICML, volume
          <volume>97</volume>
          <source>of Proceedings of Machine Learning Research</source>
          ,
          <volume>4970</volume>
          -
          <fpage>4979</fpage>
          . PMLR.
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          <string-name>
            <surname>Papernot</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>McDaniel</surname>
            ,
            <given-names>P. D.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Jha</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Swami</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <year>2016</year>
          .
          <article-title>Distillation as a Defense to Adversarial Perturbations Against Deep Neural Networks</article-title>
          . In
          <source>IEEE Symposium on Security and Privacy, SP 2016</source>
          , San Jose, CA, USA, May 22-26,
          <year>2016</year>
          ,
          <fpage>582</fpage>
          -
          <lpage>597</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          <string-name>
            <surname>Parkhi</surname>
            ,
            <given-names>O. M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Vedaldi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Zisserman</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <year>2015</year>
          .
          <article-title>Deep Face Recognition</article-title>
          . In
          <string-name>
            <surname>Xie</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Jones</surname>
            ,
            <given-names>M. W.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Tam</surname>
            ,
            <given-names>G. K. L.</given-names>
          </string-name>
          , eds.,
          <source>Proceedings of the British Machine Vision Conference 2015, BMVC 2015</source>
          , Swansea, UK, September 7-10,
          <year>2015</year>
          ,
          <fpage>41.1</fpage>
          -
          <lpage>41.12</lpage>
          . BMVA Press.
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          <string-name>
            <surname>Raghunathan</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Steinhardt</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Liang</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          <article-title>Semidefinite relaxations for certifying robustness to adversarial examples</article-title>
          . In
          <string-name>
            <surname>Bengio</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Wallach</surname>
            ,
            <given-names>H. M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Larochelle</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Grauman</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Cesa-Bianchi</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Garnett</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , eds.,
          <source>Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems</source>
          <year>2018</year>
          , NeurIPS 2018, 3-8 December
          <year>2018</year>
          , Montréal, Canada,
          <fpage>10900</fpage>
          -
          <lpage>10910</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          <string-name>
            <surname>Rozsa</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Günther</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Boult</surname>
            ,
            <given-names>T. E.</given-names>
          </string-name>
          <year>2017</year>
          .
          <article-title>LOTS about attacking deep features</article-title>
          .
          <source>In IJCB</source>
          ,
          <fpage>168</fpage>
          -
          <lpage>176</lpage>
          . IEEE.
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          <string-name>
            <surname>Russakovsky</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Deng</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Su</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Krause</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Satheesh</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Ma</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Karpathy</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Khosla</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Bernstein</surname>
            ,
            <given-names>M. S.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Berg</surname>
            ,
            <given-names>A. C.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <year>2015</year>
          .
          <article-title>ImageNet Large Scale Visual Recognition Challenge</article-title>
          .
          <source>Int. J. Comput. Vis.</source>
          <volume>115</volume>
          (
          <issue>3</issue>
          ):
          <fpage>211</fpage>
          -
          <lpage>252</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          <string-name>
            <surname>Samangouei</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Kabkab</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Chellappa</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          <article-title>Defense-GAN: Protecting Classifiers Against Adversarial Attacks Using Generative Models</article-title>
          .
          <source>In 6th International Conference on Learning Representations, ICLR</source>
          <year>2018</year>
          , Vancouver, BC, Canada, April 30 - May 3,
          <year>2018</year>
          , Conference Track Proceedings.
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          <string-name>
            <surname>Schroff</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Kalenichenko</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Philbin</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <year>2015</year>
          .
          <article-title>FaceNet: A unified embedding for face recognition and clustering</article-title>
          .
          <source>In CVPR</source>
          ,
          <fpage>815</fpage>
          -
          <lpage>823</lpage>
          . IEEE Computer Society.
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          <string-name>
            <surname>Sengupta</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Castillo</surname>
            ,
            <given-names>C. D.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Patel</surname>
            ,
            <given-names>V. M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Chellappa</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Jacobs</surname>
            ,
            <given-names>D. W.</given-names>
          </string-name>
          <year>2016</year>
          .
          <article-title>Frontal to profile face verification in the wild</article-title>
          .
          <source>In WACV</source>
          ,
          <fpage>1</fpage>
          -
          <lpage>9</lpage>
          . IEEE Computer Society.
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          <string-name>
            <surname>Sharif</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Bhagavatula</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Bauer</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Reiter</surname>
            ,
            <given-names>M. K.</given-names>
          </string-name>
          <year>2016</year>
          .
          <article-title>Accessorize to a Crime: Real and Stealthy Attacks on State-of-the-Art Face Recognition</article-title>
          . In
          <string-name>
            <surname>Weippl</surname>
            ,
            <given-names>E. R.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Katzenbeisser</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Kruegel</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Myers</surname>
            ,
            <given-names>A. C.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Halevi</surname>
          </string-name>
          , S., eds.,
          <source>Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security</source>
          , Vienna, Austria, October 24-28,
          <year>2016</year>
          ,
          <fpage>1528</fpage>
          -
          <lpage>1540</lpage>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          <string-name>
            <surname>Singh</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Agarwal</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Singh</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Nagpal</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Vatsa</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>2020</year>
          .
          <article-title>On the Robustness of Face Recognition Algorithms Against Attacks and Bias</article-title>
          .
          <source>In The Thirty-Fourth AAAI Conference on Artificial Intelligence</source>
          ,
          <source>AAAI</source>
          <year>2020</year>
          , The Thirty-Second
          <source>Innovative Applications of Artificial Intelligence Conference</source>
          ,
          <source>IAAI 2020, The Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI</source>
          <year>2020</year>
          , New York, NY, USA, February 7-12,
          <year>2020</year>
          ,
          <fpage>13583</fpage>
          -
          <lpage>13589</lpage>
          . AAAI Press. URL https://aaai.org/ojs/index.php/AAAI/article/view/7085.
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          <string-name>
            <surname>Srivastava</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Hinton</surname>
            ,
            <given-names>G. E.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Krizhevsky</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Sutskever</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Salakhutdinov</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <year>2014</year>
          .
          <article-title>Dropout: a simple way to prevent neural networks from overfitting</article-title>
          .
          <source>J. Mach. Learn. Res.</source>
        </mixed-citation>
      </ref>
      <ref id="ref49">
        <mixed-citation>
          <string-name>
            <surname>Sun</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Tang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref50">
        <mixed-citation>
          <string-name>
            <given-names>Deep</given-names>
            <surname>Learning Face Representation by Joint IdentificationVerification. In</surname>
          </string-name>
          <string-name>
            <surname>NIPS</surname>
          </string-name>
          ,
          <year>1988</year>
          -
          <fpage>1996</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref51">
        <mixed-citation>
          <string-name>
            <surname>Szegedy</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Jia</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Sermanet</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Reed</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Anguelov</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Erhan</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Vanhoucke</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Rabinovich</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <year>2015</year>
          .
          <article-title>Going deeper with convolutions</article-title>
          . In
          <source>IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015</source>
          , Boston, MA, USA, June 7-12,
          <year>2015</year>
          ,
          <fpage>1</fpage>
          -
          <lpage>9</lpage>
          . IEEE Computer Society.
        </mixed-citation>
      </ref>
      <ref id="ref52">
        <mixed-citation>
          <string-name>
            <surname>Szegedy</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Zaremba</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Sutskever</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Bruna</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Erhan</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Goodfellow</surname>
            ,
            <given-names>I. J.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Fergus</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <year>2014</year>
          .
          <article-title>Intriguing properties of neural networks</article-title>
          . In
          <string-name>
            <surname>Bengio</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>LeCun</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          , eds.,
          <source>2nd International Conference on Learning Representations, ICLR 2014</source>
          , Banff, AB, Canada, April 14-16,
          <year>2014</year>
          , Conference Track Proceedings.
        </mixed-citation>
      </ref>
      <ref id="ref53">
        <mixed-citation>
          <string-name>
            <surname>Tramèr</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Carlini</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Brendel</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Madry</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <year>2020</year>
          .
          <article-title>On Adaptive Attacks to Adversarial Example Defenses</article-title>
          . In
          <string-name>
            <surname>Larochelle</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Ranzato</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Hadsell</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Balcan</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , eds.,
          <source>Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems</source>
          <year>2020</year>
          , NeurIPS 2020, December 6-12,
          <year>2020</year>
          , virtual. URL https://proceedings.neurips.cc/paper/2020/hash/11f38f8ecd71867b42433548d1078e38-Abstract.html.
        </mixed-citation>
      </ref>
      <ref id="ref55">
        <mixed-citation>
          <string-name>
            <surname>Wan</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Zhong</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <year>2018</year>
          .
          <article-title>Rethinking Feature Distribution for Loss Functions in Image Classification</article-title>
          .
          <source>In CVPR</source>
          ,
          <fpage>9117</fpage>
          -
          <lpage>9126</lpage>
          . IEEE Computer Society.
        </mixed-citation>
      </ref>
      <ref id="ref56">
        <mixed-citation>
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Ji</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Gong</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          <year>2018</year>
          .
          <article-title>CosFace: Large Margin Cosine Loss for Deep Face Recognition</article-title>
          .
          <source>In CVPR</source>
          ,
          <fpage>5265</fpage>
          -
          <lpage>5274</lpage>
          . IEEE Computer Society.
        </mixed-citation>
      </ref>
      <ref id="ref57">
        <mixed-citation>
          <string-name>
            <surname>Wong</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Kolter</surname>
            ,
            <given-names>J. Z.</given-names>
          </string-name>
          <year>2018</year>
          .
          <article-title>Provable Defenses against Adversarial Examples via the Convex Outer Adversarial Polytope</article-title>
          . In
          <string-name>
            <surname>Dy</surname>
            ,
            <given-names>J. G.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Krause</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , eds.,
          <source>Proceedings of the 35th International Conference on Machine Learning</source>
          ,
          <source>ICML 2018</source>
          , Stockholmsmässan, Stockholm, Sweden, July 10-15,
          <year>2018</year>
          , volume
          <volume>80</volume>
          of Proceedings of Machine Learning Research,
          <fpage>5283</fpage>
          -
          <lpage>5292</lpage>
          . PMLR.
        </mixed-citation>
      </ref>
      <ref id="ref58">
        <mixed-citation>
          <string-name>
            <surname>Wong</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Schmidt</surname>
            ,
            <given-names>F. R.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Metzen</surname>
            ,
            <given-names>J. H.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Kolter</surname>
            ,
            <given-names>J. Z.</given-names>
          </string-name>
          <year>2018</year>
          .
          <article-title>Scaling provable adversarial defenses</article-title>
          . In
          <string-name>
            <surname>Bengio</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Wallach</surname>
            ,
            <given-names>H. M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Larochelle</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Grauman</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Cesa-Bianchi</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Garnett</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , eds.,
          <source>Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems</source>
          <year>2018</year>
          , NeurIPS 2018, 3-8 December
          <year>2018</year>
          , Montréal, Canada,
          <fpage>8410</fpage>
          -
          <lpage>8419</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref59">
        <mixed-citation>
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Evans</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Qi</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <year>2018</year>
          .
          <article-title>Feature Squeezing: Detecting Adversarial Examples in Deep Neural Networks</article-title>
          .
          In
          <source>25th Annual Network and Distributed System Security Symposium, NDSS 2018</source>
          , San Diego, California, USA, February 18-21,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref60">
        <mixed-citation>
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Qiao</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <year>2016</year>
          .
          <article-title>Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks</article-title>
          .
          <source>CoRR abs/1604.02878</source>
          .
        </mixed-citation>
      </ref>
      <ref id="ref61">
        <mixed-citation>
          <string-name>
            <surname>Zheng</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Hong</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <year>2018</year>
          .
          <article-title>Robust Detection of Adversarial Attacks by Modeling the Intrinsic Properties of Deep Neural Networks</article-title>
          .
          <source>In Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems</source>
          <year>2018</year>
          , NeurIPS 2018, 3-8 December
          <year>2018</year>
          , Montréal, Canada,
          <fpage>7924</fpage>
          -
          <lpage>7933</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref62">
        <mixed-citation>
          <string-name>
            <surname>Zhong</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Deng</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          <year>2019</year>
          .
          <article-title>Adversarial Learning With Margin-Based Triplet Embedding Regularization</article-title>
          . In ICCV,
          <fpage>6548</fpage>
          -
          <lpage>6557</lpage>
          . IEEE.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>