<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Privacy Safe Representation Learning via Frequency Filtering Encoder</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Jonghu</forename><surname>Jeong</surname></persName>
							<email>jonghu.jeong@deepingsource.io</email>
							<affiliation key="aff0">
								<orgName type="institution">Deeping Source Inc</orgName>
								<address>
									<addrLine>508, Eonju-ro, Gangnam-gu</addrLine>
									<settlement>Seoul</settlement>
									<country key="KR">Republic of Korea</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Minyong</forename><surname>Cho</surname></persName>
							<email>minyong.cho@deepingsource.io</email>
							<affiliation key="aff0">
								<orgName type="institution">Deeping Source Inc</orgName>
								<address>
									<addrLine>508, Eonju-ro, Gangnam-gu</addrLine>
									<settlement>Seoul</settlement>
									<country key="KR">Republic of Korea</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Philipp</forename><surname>Benz</surname></persName>
							<email>philipp.benz@deepingsource.io</email>
							<affiliation key="aff0">
								<orgName type="institution">Deeping Source Inc</orgName>
								<address>
									<addrLine>508, Eonju-ro, Gangnam-gu</addrLine>
									<settlement>Seoul</settlement>
									<country key="KR">Republic of Korea</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Jinwoo</forename><surname>Hwang</surname></persName>
							<email>jinwoo.hwang@deepingsource.io</email>
							<affiliation key="aff0">
								<orgName type="institution">Deeping Source Inc</orgName>
								<address>
									<addrLine>508, Eonju-ro, Gangnam-gu</addrLine>
									<settlement>Seoul</settlement>
									<country key="KR">Republic of Korea</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Jeewook</forename><surname>Kim</surname></persName>
							<email>jeewook.kim@deepingsource.io</email>
							<affiliation key="aff0">
								<orgName type="institution">Deeping Source Inc</orgName>
								<address>
									<addrLine>508, Eonju-ro, Gangnam-gu</addrLine>
									<settlement>Seoul</settlement>
									<country key="KR">Republic of Korea</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Seungkwan</forename><surname>Lee</surname></persName>
							<email>seungkwan.lee@deepingsource.io</email>
							<affiliation key="aff0">
								<orgName type="institution">Deeping Source Inc</orgName>
								<address>
									<addrLine>508, Eonju-ro, Gangnam-gu</addrLine>
									<settlement>Seoul</settlement>
									<country key="KR">Republic of Korea</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Tae-Hoon</forename><surname>Kim</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Deeping Source Inc</orgName>
								<address>
									<addrLine>508, Eonju-ro, Gangnam-gu</addrLine>
									<settlement>Seoul</settlement>
									<country key="KR">Republic of Korea</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Privacy Safe Representation Learning via Frequency Filtering Encoder</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">10EB8A5541D71929056DBAC88FA07D62</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T23:22+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>privacy-preserving machine learning</term>
					<term>adversarial representation learning</term>
					<term>image frequency filtering</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Deep learning models are increasingly deployed in real-world applications. These models often run on the server side and receive user data in an information-rich representation to solve a specific task, such as image classification. Since images can contain sensitive information that users might not be willing to share, privacy protection becomes increasingly important. Adversarial Representation Learning (ARL) is a common approach to train an encoder that runs on the client side and obfuscates an image. It is assumed that the obfuscated image can safely be transmitted and used for the task on the server without privacy concerns. However, in this work, we find that a trained reconstruction attacker can successfully recover the original images of existing ARL methods. To this end, we introduce a novel ARL method enhanced through low-pass filtering, limiting the amount of information that can be encoded in the frequency domain. Our experimental results reveal that our approach withstands reconstruction attacks while outperforming previous state-of-the-art methods regarding the privacy-utility trade-off. We further conduct a user study to qualitatively assess our defense against the reconstruction attack.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Service providers, such as Amazon Rekognition and Microsoft Cognitive Services, have deployed deep learning models in many real-world applications in recent years. The models running on the providers' servers receive and process user data in an information-rich representation to solve a specific task. For example, users send face images from their smartphones (client) to the server and receive the processed results, such as a face identification. However, the raw images can also contain additional information that users do not consent to reveal or share, violating the users' privacy. An adversary could take over and abuse the users' images. In one possible attack scenario, adversaries train a new attacker model (e.g., a neural network) that retrieves private attributes, such as gender, emotional state, and race. Even the service provider itself could have malicious intent without the users' knowledge. Hence, an obfuscation method should be used to protect the users' privacy.</p><p>Figure <ref type="figure">1</ref>: An overview of our proposed method. The proposed method trains an encoder that obfuscates an input image through a neural net and leverages a frequency filtering module to safely transmit a privacy-sensitive image from the client side to the server side. The frequency filtering module helps the encoder remove private information effectively from the image while retaining its utility for a particular task of interest (utility task) on the server side. The encoder is trained with the conventional ARL scheme and then deployed to the client side. Even if data leaks during transmission, malicious attackers cannot abuse the obfuscated image for a privacy breach attack (privacy task), since the transmitted data contains information that is only useful for the utility task.</p><p>For privacy protection with deep learning models, several prior works exist, ranging from federated learning <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b1">2]</ref>, split learning <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b3">4]</ref>, differential privacy <ref type="bibr" target="#b4">[5,</ref><ref type="bibr" target="#b5">6,</ref><ref type="bibr" target="#b6">7]</ref>, and homomorphic encryption <ref type="bibr" target="#b7">[8,</ref><ref type="bibr" target="#b8">9,</ref><ref type="bibr" target="#b9">10]</ref> to instance hiding mechanisms <ref type="bibr" target="#b10">[11,</ref><ref type="bibr" target="#b11">12,</ref><ref type="bibr" target="#b12">13,</ref><ref type="bibr" target="#b13">14]</ref>, GAN-based obfuscation techniques <ref type="bibr" target="#b14">[15,</ref><ref type="bibr" target="#b15">16]</ref>, and adversarial representation learning <ref type="bibr" target="#b16">[17]</ref>. Among these works, however, adversarial representation learning (ARL) is the one suited for a service provider to serve users with an obfuscation method. For example, federated learning and instance hiding focus on model training with privacy-safe data, not on inference with obfuscated data <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b10">11]</ref>. Furthermore, several existing methods suffer from privacy leakage <ref type="bibr" target="#b17">[18,</ref><ref type="bibr" target="#b18">19,</ref><ref type="bibr" target="#b19">20]</ref>, and the computational complexity of others is too large for practical deployment <ref type="bibr" target="#b7">[8,</ref><ref type="bibr" target="#b8">9,</ref><ref type="bibr" target="#b9">10]</ref>. With ARL, the service provider can train an obfuscator model and deploy it to make data obfuscation possible on the user side <ref type="bibr" target="#b20">[21,</ref><ref type="bibr" target="#b21">22]</ref>.</p><p>Most previous ARL methods solve the problem of privacy-safe transmission by optimizing 1) a utility task loss and 2) a proxy adversary task loss <ref type="bibr" target="#b22">[23,</ref><ref type="bibr" target="#b20">21,</ref><ref type="bibr" target="#b23">24,</ref><ref type="bibr" target="#b21">22]</ref>. They also introduce specific loss formulations, model architecture designs, and training schemes. The methods are evaluated quantitatively by their performance on both the utility and adversary tasks. Note that there usually exists a trade-off between privacy and utility. We use a reconstruction attack to test the quality of the obfuscation. In a reconstruction attack, a new model is trained that takes the obfuscated representation as input and outputs the original image. As demonstrated in Figure <ref type="figure">2</ref>, the original data of existing ARL methods can successfully be recovered from the obfuscated representation. This result suggests that private information is still encoded in the obfuscated representations.</p><p>We present a novel ARL method that leverages frequency filtering, specifically an extreme low-pass filter (Figure <ref type="figure">1</ref>). Filtering the representation in the frequency domain effectively limits the amount of information that can be encoded. Our experimental results show that our approach outperforms previous state-of-the-art methods regarding the privacy-utility trade-off. We also show that our proposed method withstands the reconstruction attack better than existing ARL methods, as evaluated through visual metrics and a user study.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Related Work</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Data-privacy in Computer Vision</head><p>For privacy-safe data transmission, several approaches have been proposed to tackle the problem of raw image sharing. Federated learning <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b1">2]</ref> and split learning <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b3">4]</ref> aim to train a machine learning model without directly sharing raw images, by sharing gradients or a processed representation instead. These methods usually focus on model training, not on inference with obfuscated data. Homomorphic encryption <ref type="bibr" target="#b7">[8,</ref><ref type="bibr" target="#b8">9,</ref><ref type="bibr" target="#b9">10]</ref> attempts to train models on encrypted data, such that the data can be shared in encrypted form and processed without decryption. Currently, this approach suffers from a considerably high computational cost. Instance hiding mechanisms <ref type="bibr" target="#b10">[11,</ref><ref type="bibr" target="#b11">12,</ref><ref type="bibr" target="#b12">13,</ref><ref type="bibr" target="#b13">14]</ref> introduce random pixel mixing and clipping algorithms to perturb images. The perturbed images are used only for training, while the original images are used for inference, which means there are still potential threats of data breaches when inferring the target.</p><p>Figure <ref type="figure">2</ref>: Results of the reconstruction attack with various methods on CelebA. For a successful defense, the reconstructed image should not reveal 1) the identity of the original image and 2) the privacy attribute (in this case, gender). Our method successfully defends against the reconstruction attack while all other approaches fail. Detailed results are further discussed in Section 5.</p><p>Adversarial Representation Learning (ARL) Another line of work focuses on the ARL training framework to address the utility-privacy trade-off of (a) mitigating privacy disclosure while (b) maintaining task utility. ARL methods have found application in practical scenarios, such as information censoring <ref type="bibr" target="#b24">[25]</ref>, learning fair representations <ref type="bibr" target="#b25">[26,</ref><ref type="bibr" target="#b26">27]</ref>, the mitigation of information leakage <ref type="bibr" target="#b22">[23,</ref><ref type="bibr" target="#b20">21,</ref><ref type="bibr" target="#b23">24]</ref>, collaborative inference <ref type="bibr" target="#b27">[28,</ref><ref type="bibr" target="#b28">29,</ref><ref type="bibr" target="#b21">22]</ref>, and GAN-based obfuscation techniques <ref type="bibr" target="#b14">[15,</ref><ref type="bibr" target="#b15">16]</ref>. Commonly, the ARL framework consists of three entities: 1) an obfuscator, which transforms input data into a private representation that retains utility, 2) a task model, performing the utility task on the data representation, and 3) a proxy adversary, attempting to extract sensitive attributes. Recent approaches <ref type="bibr" target="#b29">[30,</ref><ref type="bibr" target="#b30">31,</ref><ref type="bibr" target="#b31">32,</ref><ref type="bibr" target="#b23">24]</ref> represent each component as a deep neural network (DNN). MaxEnt <ref type="bibr" target="#b22">[23]</ref> formulates the ARL problem as an adversarial non-zero-sum game and minimizes the amount of non-utility information, which it quantifies through entropy. Adversarial representation learning with non-linear functions through kernel representations, with theoretical guarantees, is introduced in <ref type="bibr" target="#b32">[33]</ref>. While most previous methods represent the obfuscated output as the intermediate feature of a DNN, Bertran et al. <ref type="bibr" target="#b20">[21]</ref> leverage domain-preserving transformations, i.e., images to images. The above-mentioned ARL methods mainly focus on designing special loss functions or model architectures. To the best of our knowledge, our method is the first ARL method that focuses on effectively encoding a privacy-safe representation in the frequency domain.</p><p>There are three common attacks on privacy in machine learning. The first is the membership inference attack <ref type="bibr" target="#b33">[34]</ref>, which attempts to infer whether a data sample was used to train a machine learning model. This attack targets the server-side model rather than the transmitted data. The second is the inversion attack <ref type="bibr" target="#b34">[35]</ref>, which attempts to infer raw data from a processed representation. This is the same attack scenario as the aforementioned reconstruction attack. The last is the information leakage attack <ref type="bibr" target="#b22">[23]</ref>, in which adversaries attempt to infer privacy-related information from an obfuscated representation. In this work, the inversion attack and the information leakage attack are considered, as they are potential threats to transmitted privacy-sensitive images.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Frequency Perspective in Computer Vision</head><p>Prior works have explored the behavior of DNNs from a frequency perspective. Overall, there is solid evidence that both high-frequency and low-frequency features can be helpful for classification <ref type="bibr" target="#b35">[36,</ref><ref type="bibr" target="#b36">37]</ref>. It has been demonstrated that DNNs have an increased bias toward texture compared to the object's shape <ref type="bibr" target="#b37">[38]</ref>. On the other hand, DNNs trained only on low-pass filtered images also generalize well and are capable of achieving high accuracies <ref type="bibr" target="#b35">[36]</ref>. Yin et al. <ref type="bibr" target="#b35">[36]</ref> show that adversarial training and Gaussian data augmentation shift DNNs towards utilizing low-frequency information in the input. Wang et al. <ref type="bibr" target="#b36">[37]</ref> point out that convolutional neural networks (CNNs) mainly exploit high-frequency components. Similarly, Abello et al. <ref type="bibr" target="#b38">[39]</ref> find that mid- or high-level frequencies are disproportionately critical for CNNs. Ilyas et al. <ref type="bibr" target="#b39">[40]</ref> show the related finding that human-imperceptible features with high-frequency properties are sufficient for a model to exhibit high generalization capability.</p><p>In this work, we leverage the previous insight that information can be encoded in different frequency ranges of images. We propose encoding information in the low-frequency band of images to securely transfer them between different parties.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Problem Formulation</head><p>We consider an image dataset 𝑥 ∼ 𝒳 ∈ R 𝐻 ×𝑊 ×3 , where 𝐻 and 𝑊 represent height and width, respectively, along with a number of attributes 𝑦 ∼ 𝒴. Some of the attributes are private attributes 𝑦 𝑝 ∼ 𝒴 𝑝 and some are utility attributes 𝑦 𝑡 ∼ 𝒴 𝑡 , such that 𝒴 = 𝒴 𝑡 ∪ 𝒴 𝑝 . Given a utility task model 𝑓 𝑡 , we search for an intermediate representation x , from which 𝑓 𝑡 can infer the utility attributes, but not the privacy attributes. This transformation can be represented through a DNN 𝑜, termed the obfuscator, resulting in 𝑜(𝑥) = x . Note that in prior works, the intermediate representation x was often a feature map differing in shape from the original input image. However, similar to <ref type="bibr" target="#b20">[21]</ref>, we represent the obfuscated representation in the same shape as the original input image. This setting allows us to leverage existing image transformation techniques, such as transforming the representations into a 2D Fourier representation. Additionally, this form of intermediate representation allows us to analyze the representations visually.</p><p>Threat Model Given the above problem formulation, an attacker can attempt to retrieve information about the private attributes from the intermediate representation. This can be realized either by directly inferring private information from the intermediate representation (information leakage attack) or by reconstructing the original input image from the intermediate representation (reconstruction attack). In the information leakage attack scenario, an attacker is able to obtain data pairs consisting of intermediate representations and their respective private attributes { x , 𝑦 𝑝 }. The attacker can then attempt to train a model 𝑓 𝑎 that leaks the private information from the representations, 𝑓 𝑎 ( x ) = 𝑦 𝑝 . In the reconstruction attack, given pairs of the original image and the intermediate representation {𝑥, x }, the attacker attempts to obtain a model 𝑓 𝑟 that retrieves the original image 𝑥 from the intermediate representation, 𝑓 𝑟 ( x ) = 𝑥. In this work, we represent both attacker models 𝑓 𝑎 and 𝑓 𝑟 through DNNs, since they have proven powerful for image processing tasks.</p></div>
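To make the two attack objectives concrete, here is a minimal NumPy sketch (ours, not the authors' code; the attacker networks 𝑓 𝑎 and 𝑓 𝑟 themselves are left abstract, and the function names are hypothetical) of the losses such attackers would typically be trained with:

```python
import numpy as np

def leakage_loss(adv_logits, y_p):
    """Information leakage attack: f_a classifies the private attribute
    y_p from the obfuscated representation (softmax cross-entropy)."""
    z = adv_logits - adv_logits.max()          # shift for numerical stability
    return -(z[y_p] - np.log(np.exp(z).sum()))

def reconstruction_loss(x, x_rec):
    """Reconstruction (inversion) attack: f_r regresses the original image
    from the representation; pixel-wise MSE is the usual training objective."""
    return float(np.mean((x - x_rec) ** 2))
```

A perfect reconstruction drives the MSE to zero, while uniform adversary logits over n classes give a leakage loss of log n, i.e., a random guess.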
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Methodology</head><p>Fourier Transformation The Fourier transform is a common tool for frequency analysis <ref type="bibr" target="#b40">[41]</ref>. We consider the 2D discrete Fourier transformation ℱ ∶ R 𝑊 ×𝐻 → C 𝑊 ×𝐻 and the inverse Fourier transformation ℱ −1 . After applying ℱ to an image, low frequencies are located in the center of the Fourier image, while high frequencies are located toward the boundaries. For low-pass filtering, we set all frequency components outside of a central circle with radius 𝑟 in the frequency domain to zero and apply ℱ −1 afterward. We normalize the radius to the range [0, 1] by considering the center of the image as 0 and the corner as 1. We denote low-pass filtering as 𝐿𝑃.</p><p>Frequency Obfuscation We depict our proposed methodology in Figure <ref type="figure">1</ref>. Given an input image, the objective is to obfuscate the image to achieve the best privacy-utility trade-off. Our obfuscator module consists of an encoder architecture followed by frequency filtering. We choose the commonly used U-Net <ref type="bibr" target="#b41">[42]</ref> architecture as our encoder and pass the original image through it. Formally, we express this as 𝑒(𝑥), where 𝑒 denotes the encoder. The subsequent frequency filtering is realized via a low-pass filter, 𝐿𝑃(𝑒(𝑥)). This completes the generation of the intermediate representation through the obfuscator, x = 𝑜(𝑥) = 𝐿𝑃(𝑒(𝑥)). During obfuscator training, we leverage a task model and a proxy adversary. The objective of the task model is to predict the utility attribute from the intermediate representation. The task loss is calculated as 𝑙 𝑡 = 𝔼[ℒ 𝑡 (𝑓 𝑡 (𝑜(𝑥)), 𝑦 𝑡 )], where ℒ 𝑡 indicates the task loss function, which is the cross-entropy function in our setup. The objective of the proxy adversary model is to leak the privacy attribute from the intermediate representation. The proxy adversary loss is calculated as 𝑙 𝑝 = 𝔼[ℒ 𝑝 (𝑓 𝑎 (𝑜(𝑥)), 𝑦 𝑝 )], where ℒ 𝑝 indicates the privacy loss function, which is also the cross-entropy function. The obfuscator loss is</p><formula xml:id="formula_0">𝑙 𝑜 = 𝑙 𝑡 − 𝑙 𝑝 .</formula><p>Similar to the scenario introduced in DISCO <ref type="bibr" target="#b21">[22]</ref>, a practical application scenario of our proposed approach is when the obfuscator module resides on a trusted client device, which sends the intermediate feature representations to a server. Since an adversary can intercept the communication between client and server, or the server itself can be malicious, we consider the server side an untrusted entity.</p></div>
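The low-pass filtering step 𝐿𝑃 and the obfuscator objective 𝑙 𝑜 = 𝑙 𝑡 − 𝑙 𝑝 can be sketched as follows. This is an illustrative NumPy version under our own naming (the encoder 𝑒 and the task/adversary networks are left abstract), not the authors' implementation:

```python
import numpy as np

def low_pass(x, r):
    """Zero all frequency components outside a centered circle of
    normalized radius r (0 = image center, 1 = corner), then invert."""
    H, W = x.shape
    F = np.fft.fftshift(np.fft.fft2(x))        # move low freqs to the center
    yy, xx = np.mgrid[0:H, 0:W]
    dist = np.hypot(yy - H / 2.0, xx - W / 2.0)
    dist /= np.hypot(H / 2.0, W / 2.0)         # normalize so corner -> 1
    F[dist > r] = 0.0                          # keep only the low band
    return np.real(np.fft.ifft2(np.fft.ifftshift(F)))

def cross_entropy(logits, label):
    z = logits - logits.max()                  # stable softmax cross-entropy
    return -(z[label] - np.log(np.exp(z).sum()))

def obfuscator_loss(task_logits, y_t, adv_logits, y_p):
    """l_o = l_t - l_p: reward the utility task, punish the proxy adversary."""
    return cross_entropy(task_logits, y_t) - cross_entropy(adv_logits, y_p)
```

In training, the intermediate representation would be `low_pass(e(x), r)` with an extreme radius such as r = 0.01, so the encoder is forced to pack utility information into the surviving low-frequency band.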
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Evaluation Protocol</head><p>In the following, we outline our evaluation protocol, which follows the general ARL evaluation protocol <ref type="bibr" target="#b21">[22,</ref><ref type="bibr" target="#b22">23]</ref>. Given an image classification dataset, we specify certain classes as the utility and privacy tasks, respectively. Based on the chosen tasks, following our proposed method, we obtain an obfuscator and a utility task model. Note that this includes training proxy adversaries. After training, we evaluate the models on the utility task and report the accuracy as utility. Then we freeze the weights of the obfuscator and train an adversary model to predict the privacy attributes, reporting its accuracy as privacy. To assess the privacy-utility trade-off, we measure their difference (Δ).</p><p>Additionally, we report the performance bounds. Theoretically, the utility (higher is better) is upper bounded by 100%. In practice, however, we consider the upper bound to be the utility performance of a ResNet18 <ref type="bibr" target="#b42">[43]</ref> model trained on the original images. For privacy (lower is better), we consider the lower bound to be the random guess for the privacy attribute.</p><p>We also perform a reconstruction attack on the obfuscated images to recover the corresponding original images. We evaluate the reconstruction attacks quantitatively and qualitatively by calculating similarity scores between the original and reconstructed images and by conducting a user study on the reconstructed images.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Experiments</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.">Setup</head><p>Datasets We conduct experiments on CelebA <ref type="bibr" target="#b43">[44]</ref>, FairFace <ref type="bibr" target="#b44">[45]</ref>, and CIFAR10 <ref type="bibr" target="#b45">[46]</ref>. Following the utility and privacy task setting from DISCO <ref type="bibr" target="#b21">[22]</ref>, we set "Smiling" as the utility attribute and "Male" as the privacy attribute for CelebA, and "Gender" as the utility attribute and "Race" as the privacy attribute for FairFace. For CIFAR10, the utility task is defined as classifying living objects (e.g., "bird", "cat", etc.) versus non-living objects (e.g., "airplane", "automobile", etc.), and the privacy task as classifying the 10 individual classes.</p></div>
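For illustration, the binary CIFAR10 utility label can be derived from the standard 10-way label as below. The living/non-living grouping is our reading of the paper's examples, and the helper name is hypothetical:

```python
# Standard CIFAR-10 class order:
# 0 airplane, 1 automobile, 2 bird, 3 cat, 4 deer,
# 5 dog, 6 frog, 7 horse, 8 ship, 9 truck
LIVING = {2, 3, 4, 5, 6, 7}  # bird, cat, deer, dog, frog, horse

def utility_label(cifar_class: int) -> int:
    """Binary utility task: 1 = living object, 0 = non-living.
    The privacy task keeps the original 10-way label unchanged."""
    return int(cifar_class in LIVING)
```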
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Implementation details</head><p>The encoder is a lightweight variant of U-Net <ref type="bibr" target="#b41">[42]</ref>, with 4× fewer intermediate feature channels than the original version. We use an extreme low-pass filter with radius 𝑟 = 0.01 for CelebA and FairFace, and 𝑟 = 0.05 for CIFAR10. We apply a center-circled filter, which can adjust the level of obfuscation by changing its radius (bandwidth); Section 6.2 discusses the effect of the radius. We normalize the radius by the length from the filter's center to the corner so that its value lies in the range [0, 1]. For both the utility and privacy task models, we use ResNet-18 <ref type="bibr" target="#b42">[43]</ref>, and we use the same dataset for training both models. We use the Adam <ref type="bibr" target="#b46">[47]</ref> optimizer for all three models, with learning rate 10 −4 for U-Net and 10 −3 for the ResNet-18 models. We evaluate the top-1 accuracy for both the utility and privacy tasks. We use the lightweight U-Net as the reconstructor for the reconstruction attack. The reconstructor adversary is trained with the MSE loss between the original and the reconstructed images. The reconstructed images are evaluated using MSE, 𝐿 1 , SSIM <ref type="bibr" target="#b47">[48]</ref>, MS-SSIM <ref type="bibr" target="#b48">[49]</ref>, PSNR <ref type="bibr" target="#b49">[50]</ref>, and LPIPS <ref type="bibr" target="#b50">[51]</ref>. MSE, 𝐿 1 , and PSNR compare the images pixel-wise, while SSIM and MS-SSIM compare structural similarity (e.g., brightness, contrast) between the images. LPIPS uses a pre-trained neural network's feature maps for comparison. These metrics are commonly used for comparing the similarity between images <ref type="bibr" target="#b21">[22,</ref><ref type="bibr" target="#b23">24,</ref><ref type="bibr" target="#b51">52]</ref> as a proxy of human vision.</p><p>Table <ref type="table" target="#tab_0">1</ref>: Evaluation of the privacy-utility trade-off. An upper/lower arrow indicates that a higher/lower value is better, respectively. Our method shows the biggest gap between privacy and utility accuracy on all datasets. Note that the privacy accuracy is based on a newly trained adversary model, which is trained against the fully trained and frozen obfuscation model.</p></div>
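As a reference for the pixel-wise metrics, a minimal sketch (ours; SSIM, MS-SSIM, and LPIPS require more machinery and are omitted here) could look like:

```python
import numpy as np

def mse(a, b):
    """Mean squared error between two images of equal shape."""
    return float(np.mean((a - b) ** 2))

def l1(a, b):
    """Mean absolute (L1) error between two images."""
    return float(np.mean(np.abs(a - b)))

def psnr(a, b, peak=1.0):
    """Peak signal-to-noise ratio in dB for images with values in [0, peak]."""
    m = mse(a, b)
    return float("inf") if m == 0.0 else 10.0 * np.log10(peak ** 2 / m)
```

For the reconstruction-attack evaluation, higher MSE/𝐿 1 and lower PSNR between the original and the reconstruction indicate a stronger defense.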
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Compared Methods</head><p>We compare our method with various baselines. As a simple baseline obfuscator, we add Gaussian noise sampled from 𝒩 (0, 𝜎 2 ) to the input image while clipping the result to the valid pixel range [0, 1]. We denote this method Noise. We use 𝜎 2 = 4 for CelebA and FairFace and 𝜎 2 = 0.64 for CIFAR10, which obfuscates the images sufficiently. To investigate the sole effect of low-pass filtering, we apply only the low-pass filter to the raw images; we name this baseline LP. Complementarily, we also compare the U-Net without the low-pass filtering module as an obfuscator, which we call U-Net. This setup is similar to DeepObfuscator <ref type="bibr" target="#b23">[24]</ref>, which uses an encoder, a task model, and a proxy adversary. However, since the code of DeepObfuscator has not been open-sourced, we use our U-Net encoder as the method to compare against. Finally, we compare our method to the state-of-the-art ARL method DISCO <ref type="bibr" target="#b21">[22]</ref>, which selectively removes features via channel pruning in the latent space.</p></div>
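The Noise baseline can be sketched in a few lines (our illustrative code; the function name and seed handling are assumptions, not the authors' implementation):

```python
import numpy as np

def noise_obfuscate(x, sigma2, seed=0):
    """Baseline 'Noise': add Gaussian noise N(0, sigma2) to an image with
    pixel values in [0, 1], then clip back into the valid pixel range."""
    rng = np.random.default_rng(seed)
    return np.clip(x + rng.normal(0.0, np.sqrt(sigma2), x.shape), 0.0, 1.0)
```

With 𝜎 2 = 4 the noise standard deviation (2.0) is twice the pixel range, so after clipping most pixels saturate at 0 or 1 and the image content is heavily destroyed, consistent with "obfuscates the images sufficiently".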
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2.">Results</head><p>Table <ref type="table" target="#tab_0">1</ref> compares the privacy and utility accuracy of each obfuscation method. Our method yields the highest gap between utility and privacy accuracy on all datasets. For the methods without an encoder (i.e., Noise and LP), both utility and privacy accuracy decrease compared to training with the original images, since these methods obfuscate images without any prior knowledge of the tasks. They cannot selectively restrict information to obtain high utility and low privacy leakage. U-Net shows high utility accuracy but fails to defend against the privacy attack, although it is trained with a proxy adversary. We conjecture that simply taking the guidance of the proxy model loss is not enough for the encoder to learn to restrict information.</p><p>Table <ref type="table" target="#tab_1">2</ref>: Similarity scores between the original images and the reconstructed ones on CelebA (columns: Method, MSE ↑, 𝐿 1 ↑, SSIM ↓, MS-SSIM ↓, PSNR ↓; an upper/lower arrow indicates that a higher/lower value is better, respectively). Our approach shows the best dissimilarity on all metrics.</p><p>Our method is a combination of LP and U-Net and learns to encode a representation into the restricted bandwidth imposed by the frequency filtering module. This limited bandwidth pushes the encoder to extract utility information effectively and to remove privacy attributes in order to fully leverage the available bandwidth. Although the same data is used to train both the utility and adversary models, a generous and unrealistically favorable condition for the attacker, we found that the adversary model performs poorly. DISCO shows the lowest privacy accuracy on all datasets. However, its utility accuracy is also lower than our method's, so its utility-privacy gap is smaller than ours.</p><p>In terms of visual quality, our obfuscated representations appear as simple globs of color, making them unrecognizable to human observers (Figure <ref type="figure">1</ref>). The obfuscated representations from other methods also appear obfuscated to the human eye. However, applying our best-effort reconstruction attack, it is possible to reconstruct the original image or infer the privacy attribute (i.e., gender) from the reconstructed images (Figure <ref type="figure">2</ref>). The reconstructions of our method successfully defend against identity reconstruction and privacy attribute leakage, with the reconstructed images all being relatively similar to each other.</p><p>Figure <ref type="figure">3</ref>: Result of the user study on reconstructed images of CelebA. We asked the participants to classify gender (male/female) on 180 images such as those in Figure <ref type="figure">2</ref>. The participants correctly distinguished the gender of the original images and of the images reconstructed from the three methods (Noise, U-Net, and DISCO) with more than 90% accuracy. Our method and LP effectively confused the participants with gender-neutral faces (45.83% and 56.9% correct-answer ratios, respectively), with ours slightly better than LP in terms of obfuscation. We also plot the ideal case of the user study to show our method's near-perfect robustness against the reconstruction attack.</p><p>The quantitative results of the reconstruction attack in Table <ref type="table" target="#tab_1">2</ref> further confirm this, since all scores achieve the best results in terms of dissimilarity for our approach. We note that an adversary model trained on the reconstructed images to infer the privacy attributes performs worse than a model trained directly on the obfuscated images, since the reconstructed images are derived from the obfuscated images.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.3.">User Study</head><p>We present a user study to demonstrate our method's robustness against the reconstruction attack on CelebA. Since the privacy task for this dataset is gender classification, the obfuscation is successful only if a human observer cannot correctly classify the gender in the reconstructed image. To conduct the experiment, we randomly sampled 30 images (15 male and 15 female) for which ResNet18 classifies the gender correctly. By doing so, we balanced the classes and avoided ambiguous labels that could skew the results. We then obfuscated the images using each of the techniques and reconstructed them with their respective attacker models from Section 5.1. Examples of reconstructed images are shown in Figure <ref type="figure">2</ref>. We presented the 180 reconstructed images to a group of participants and asked them to identify whether the person in the reconstructed image is male, female, or cannot be judged. We provided the last option to let the users skip examples that are hard to judge. The test subjects were randomly selected and consist of 30 people living in Seoul, South Korea, who are in their 20s and 30s.</p><p>(Table <ref type="table" target="#tab_2">3</ref> caption: Privacy-utility gap of the high-pass filtering module on FairFace. Our low-pass filtering module shows the best privacy-utility gap compared to the high-pass filter at the various filter radii.)</p><p>As shown in Figure <ref type="figure">3</ref>, the participants correctly identified the gender for the original images and for the images reconstructed from Noise, U-Net, and DISCO: more than 90% of the answers were correct for these three methods. LP showed a comparatively low correct ratio (56.9%) and a high "cannot judge" ratio (6.19%). Our method performed best on both counts, with the lowest correct ratio of 45.83% and the highest "cannot judge" ratio of 7.02%. 
We consider a 50% ratio of "correct" (and "wrong") answers a random guess, since the labels of the test set are balanced. Additionally, "cannot judge" answers can themselves be treated as random guesses, since without this option the users would have chosen randomly. The results indicate that our approach successfully protects against reconstruction attacks in terms of human vision, and they align with the quantitative results (Table <ref type="table" target="#tab_1">2</ref>).</p><p>In terms of obfuscation, our method shows the best results, followed by LP. This reconfirms the usefulness of our architectural design, the combination of the encoder and the frequency filtering module.</p></div>
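The random-guess reading of the user-study numbers can be made explicit. The following is our own formalization, not a computation from the paper: counting each "cannot judge" answer as a coin flip on a balanced test set adds half of that ratio to the correct-answer ratio.

```python
def adjusted_correct_ratio(correct_pct, cannot_judge_pct):
    # Treat each "cannot judge" answer as a random guess on a balanced
    # test set: half of those answers would have been correct by chance.
    return correct_pct + cannot_judge_pct / 2.0

# Reported ratios from the user study (correct %, "cannot judge" %):
ours = adjusted_correct_ratio(45.83, 7.02)  # close to the 50% random-guess line
lp = adjusted_correct_ratio(56.9, 6.19)     # clearly above random guessing
```

Under this adjustment our method sits almost exactly at chance level, which matches the claim that the reconstructed faces appear gender-neutral to human observers.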
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Ablation Study</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.1.">High-pass filter</head><p>Previously, we presented the effect of the low-pass frequency filtering module on ARL. The module appropriately limits the amount of information encoded in the obfuscated image by retaining only the low-frequency range. With a high-pass filter, we can leverage the same intuition by limiting the encoded information to the high-frequency bandwidth. However, in the following we present results indicating that the low-pass filter is the superior choice. We conduct the same experiment as in Section 5.2 on FairFace with a high-pass filtering module for 5 radii (0.80, 0.85, 0.90, 0.95, 0.99). Contrary to the low-pass filter, this filter removes the frequencies inside the filter radius, making a radius of 0.99 the most extreme high-pass filter. We call this method HP.</p><p>The respective results are presented in Table <ref type="table" target="#tab_2">3</ref>. As the filtering becomes more extreme, the utility accuracy decreases together with the privacy accuracy. The table also shows that our approach with a low-pass filter (Table <ref type="table" target="#tab_0">1</ref>) outperforms all high-pass results regarding the privacy-utility gap. The best privacy-utility gap with the high-pass filter is 63.16% at a radius of 0.95, which is 2.88 percentage points lower than with low-pass filtering. It has been demonstrated that DNNs learn more efficiently from low-pass filtered images than from high-pass filtered ones <ref type="bibr" target="#b35">[36]</ref>. With the extreme high-pass filter (r=0.99) in particular, the model failed to learn both the utility and the privacy task.</p><p>Furthermore, from a practical point of view, we need to reduce the size of the obfuscated image to lower transmission and storage costs. The widely used JPEG compression algorithm discards high-frequency components. 
If we used a high-pass filter in the ARL method, the information encoded in the high-frequency range would be lost under such compression. Therefore, encoding information into the low-frequency range is more suitable for further leveraging conventional compression algorithms.</p></div>
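The low-pass and high-pass filtering described above can be sketched as a radial mask on the shifted 2D Fourier spectrum. This is an illustrative implementation, not the authors' code; in particular, normalizing the radius by the maximum frequency magnitude is our assumption:

```python
import numpy as np

def frequency_filter(img, radius, mode="low"):
    # 2D FFT, with the zero frequency shifted to the center of the spectrum.
    spec = np.fft.fftshift(np.fft.fft2(img))
    fy = np.fft.fftshift(np.fft.fftfreq(img.shape[0]))[:, None]
    fx = np.fft.fftshift(np.fft.fftfreq(img.shape[1]))[None, :]
    # Normalized radial distance from the spectrum center, in [0, 1].
    dist = np.sqrt(fx ** 2 + fy ** 2) / np.sqrt(0.5)
    keep = dist <= radius
    if mode == "high":
        keep = ~keep  # HP removes the frequencies inside the radius
    return np.real(np.fft.ifft2(np.fft.ifftshift(spec * keep)))
```

With mode="low" and a small radius (e.g. 0.01), only the lowest frequencies survive, matching the most extreme low-pass setting; with mode="high" and r=0.99, everything except the highest frequencies is removed.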
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.2.">The effect of filter radius</head><p>One of the key components of our proposed method is the frequency filtering module. The module has only one parameter, the filter radius. To gain insight into choosing this parameter, we conducted the same experiment as in Section 5 on FairFace with 5 radii (0.01, 0.015, 0.02, 0.025, 0.03), where a radius of 0.01 is the most extreme low-pass filter.</p><p>Figure <ref type="figure" target="#fig_0">4</ref> (left) shows consistent utility accuracy and increasing privacy accuracy. The utility accuracies stay around 89% with a small variance, while the privacy accuracies increase from 23.64% to 30.45% as the radius grows. As a result, the privacy-utility gap decreases (Figure <ref type="figure" target="#fig_0">4</ref>, right).</p><p>The increased privacy accuracy aligns with our intuition of limiting the information in the obfuscated representation: a wider radius allows the representation to carry more information, making it easier for the adversary to exploit for a privacy attack. Note that the utility accuracy did not decrease even with the harshest filter; we speculate that the extremely low-pass filtered representation still carries enough information for these specific utility tasks. Figure <ref type="figure" target="#fig_0">4</ref> and Table <ref type="table" target="#tab_2">3</ref> confirm that the radius is a crucial factor for privacy and utility accuracy. Thus, the radius is a hyperparameter that should be tuned based on the privacy-utility gap.</p></div>
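The tuning rule stated above, choosing the radius that maximizes the privacy-utility gap, amounts to a one-line selection. A sketch using the high-pass numbers reported in Table 3 (accuracies in %):

```python
# (privacy accuracy, utility accuracy) per radius, from the HP rows of Table 3.
hp_results = {
    0.80: (26.19, 89.03),
    0.85: (26.28, 89.13),
    0.90: (28.94, 88.00),
    0.95: (24.96, 88.12),
    0.99: (19.03, 52.88),
}

def best_radius(results):
    # Maximize the privacy-utility gap: utility minus privacy accuracy.
    return max(results, key=lambda r: results[r][1] - results[r][0])
```

For the high-pass filter this picks r=0.95 (gap 63.16%), which is still below the 66.04% gap of the low-pass filter at r=0.01.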
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.">Conclusion</head><p>This work proposes a novel ARL method based on frequency filtering that is robust to privacy leakage attacks while maintaining task utility. Our experiments suggest that a combination of a neural-network encoder and a low-pass filter improves ARL training on both quantitative and qualitative metrics. The method outperforms the compared methods on the quantitative measures of the privacy-utility trade-off and the reconstruction attack (Section 5). Our user study suggests that the proposed method effectively defends against reconstruction attacks (Section 5.3). The ablation experiments justify the use of a low-pass filter and show that the filter radius adjusts the privacy-utility trade-off (Section 6).</p><p>For future work, we consider optimizing the client-side model to reduce the computational burden by using a lightweight architecture such as MobileNetV3 <ref type="bibr" target="#b52">[53]</ref>. Furthermore, an adaptive selection of the frequency-filtering hyperparameter might further increase the utility accuracy and decrease the privacy accuracy.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 4 :</head><label>4</label><figDesc>Figure 4: (Left) Privacy and utility accuracy under each radius of the low-pass filter. The experiments are conducted on FairFace. (Right) Privacy-utility trade-off. Delta represents the performance gap between utility and privacy.</figDesc><graphic coords="7,89.29,84.19,104.17,77.91" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Privacy and utility accuracy (%) of each obfuscation method on FairFace, CelebA, and CIFAR10. Δ denotes the gap between utility and privacy accuracy.</figDesc><table><row><cell></cell><cell></cell><cell>FairFace</cell><cell></cell><cell>CelebA</cell><cell></cell><cell></cell><cell>CIFAR10</cell></row><row><cell>Method</cell><cell cols="7">Privacy ↓ Utility ↑ Δ ↑ Privacy ↓ Utility ↑ Δ ↑ Privacy ↓ Utility ↑ Δ ↑</cell></row><row><cell>Perf. Bounds</cell><cell>19.03</cell><cell>90.16 71.13</cell><cell>57.43</cell><cell>93.32</cell><cell>35.89</cell><cell>10.00</cell><cell>98.79 78.79</cell></row><row><cell>Noise</cell><cell>42.61</cell><cell>74.33 31.72</cell><cell>91.71</cell><cell>85.38</cell><cell>-6.33</cell><cell>54.37</cell><cell>87.77 33.40</cell></row><row><cell>LP</cell><cell>31.93</cell><cell>64.77 32.84</cell><cell>76.52</cell><cell cols="2">63.69 -12.83</cell><cell>47.05</cell><cell>85.76 38.71</cell></row><row><cell>U-Net</cell><cell>51.52</cell><cell>86.40 34.88</cell><cell>87.21</cell><cell>93.12</cell><cell>5.91</cell><cell>85.05</cell><cell>95.45 10.40</cell></row><row><cell>DISCO</cell><cell>19.00</cell><cell>81.50 62.50</cell><cell>61.20</cell><cell>91.00</cell><cell>29.80</cell><cell>22.30</cell><cell>91.98 69.68</cell></row><row><cell>Ours</cell><cell>23.63</cell><cell>89.67 66.04</cell><cell>61.60</cell><cell cols="2">93.27 31.67</cell><cell>22.58</cell><cell>92.95 70.37</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2</head><label>2</label><figDesc>LPIPS ↑</figDesc><table><row><cell>Noise</cell><cell>584.88 16.97 0.6017 0.7776</cell><cell>20.46 0.3714</cell></row><row><cell>LP</cell><cell>1889.15 32.10 0.4632 0.5390</cell><cell>15.37 0.5537</cell></row><row><cell>U-Net</cell><cell>390.34 13.81 0.7505 0.8839</cell><cell>22.22 0.1809</cell></row><row><cell cols="2">DISCO 567.17 15.94 0.5765 0.7611</cell><cell>20.60 0.4351</cell></row><row><cell>Ours</cell><cell>3689.50 48.08 0.4240 0.4728</cell><cell>12.47 0.6145</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3</head><label>3</label><figDesc></figDesc><table><row><cell>Method</cell><cell cols="2">Privacy ↓ Utility ↑</cell><cell>Δ ↑</cell></row><row><cell>HP (r=0.80)</cell><cell>26.19</cell><cell>89.03</cell><cell>62.84</cell></row><row><cell>HP (r=0.85)</cell><cell>26.28</cell><cell>89.13</cell><cell>62.85</cell></row><row><cell>HP (r=0.90)</cell><cell>28.94</cell><cell>88.00</cell><cell>59.06</cell></row><row><cell>HP (r=0.95)</cell><cell>24.96</cell><cell>88.12</cell><cell>63.16</cell></row><row><cell>HP (r=0.99)</cell><cell>19.03</cell><cell>52.88</cell><cell>33.85</cell></row><row><cell>LP (r=0.01)</cell><cell>23.63</cell><cell>89.67</cell><cell>66.04</cell></row></table></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<author>
			<persName><forename type="first">J</forename><surname>Konečnỳ</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">B</forename><surname>Mcmahan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">X</forename><surname>Yu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Richtárik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">T</forename><surname>Suresh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Bacon</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1610.05492</idno>
		<title level="m">Federated learning: Strategies for improving communication efficiency</title>
				<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Advances and open problems in federated learning</title>
		<author>
			<persName><forename type="first">P</forename><surname>Kairouz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">B</forename><surname>Mcmahan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Avent</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Bellet</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Bennis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">N</forename><surname>Bhagoji</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Bonawitz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Charles</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Cormode</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Cummings</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Foundations and Trends® in Machine Learning</title>
				<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Distributed learning of deep neural network over multiple agents</title>
		<author>
			<persName><forename type="first">O</forename><surname>Gupta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Raskar</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Network and Computer Applications</title>
		<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<author>
			<persName><forename type="first">P</forename><surname>Vepakomma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Gupta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Swedish</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Raskar</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1812.00564</idno>
		<title level="m">Split learning for health: Distributed deep learning without sharing raw patient data</title>
				<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Differential privacy: A survey of results</title>
		<author>
			<persName><forename type="first">C</forename><surname>Dwork</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International conference on theory and applications of models of computation</title>
				<imprint>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<author>
			<persName><forename type="first">Z</forename><surname>Ji</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><forename type="middle">C</forename><surname>Lipton</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Elkan</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1412.7584</idno>
		<title level="m">Differential privacy and machine learning: a survey and review</title>
				<imprint>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Deep learning with differential privacy</title>
		<author>
			<persName><forename type="first">M</forename><surname>Abadi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Chu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Goodfellow</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">B</forename><surname>Mcmahan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Mironov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Talwar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Zhang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ACM SIGSAC conference on computer and communications security</title>
				<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<author>
			<persName><forename type="first">E</forename><surname>Hesamifard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Takabi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Ghasemi</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1711.05189</idno>
		<title level="m">Cryptodl: Deep neural networks over encrypted data</title>
				<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">{GAZELLE}: A low latency framework for secure neural network inference</title>
		<author>
			<persName><forename type="first">C</forename><surname>Juvekar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Vaikuntanathan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Chandrakasan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">USENIX Security Symposium</title>
				<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Towards deep neural network training on encrypted data</title>
		<author>
			<persName><forename type="first">K</forename><surname>Nandakumar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Ratha</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Pankanti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Halevi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Conference on Computer Vision and Pattern Recognition Workshops</title>
				<imprint>
			<publisher>CVPR-W</publisher>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Mixup based privacy preserving mixed collaboration learning</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Fu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Xu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Mi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Wang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Service-Oriented System Engineering (SOSE)</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Instahide: Instance-hiding schemes for private distributed learning</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Huang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Song</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Arora</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Machine Learning (ICML)</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><surname>Shin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Hwang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Kim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Park</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Bennis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S.-L</forename><surname>Kim</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2006.05148</idno>
		<title level="m">Xor mixup: Privacy-preserving data augmentation for one-shot federated learning</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<author>
			<persName><forename type="first">E</forename><surname>Borgnia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Geiping</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Cherepanova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Fowl</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Gupta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Ghiasi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Huang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Goldblum</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Goldstein</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2103.02079</idno>
		<title level="m">Dp-instahide: Provably defusing poisoning and backdoor attacks with differentially private data augmentations</title>
				<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<author>
			<persName><forename type="first">T.-H</forename><surname>Kim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Kang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Pulli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Choi</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1901.00098</idno>
		<title level="m">Training with the invisibles: Obfuscating images to share safely for learning visual recognition models</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<author>
			<persName><forename type="first">C</forename><surname>Xu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Ren</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Qin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Ren</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Ganobfuscator: Mitigating information leakage under gan via differential privacy</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Large scale adversarial representation learning</title>
		<author>
			<persName><forename type="first">J</forename><surname>Donahue</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Simonyan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Advances in Neural Information Processing Systems</title>
		<imprint>
			<biblScope unit="volume">32</biblScope>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<monogr>
		<author>
			<persName><forename type="first">L</forename><surname>Lyu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Yu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Yang</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2003.02133</idno>
		<title level="m">Threats to federated learning: A survey</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Unleashing the tiger: Inference attacks on split learning</title>
		<author>
			<persName><forename type="first">D</forename><surname>Pasquini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Ateniese</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Bernaschi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ACM SIGSAC Conference on Computer and Communications Security</title>
				<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<author>
			<persName><forename type="first">O</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Sun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Gao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Xie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Smith</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Wang</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2102.08504</idno>
		<title level="m">Label leakage and protection in two-party split learning</title>
				<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Adversarially learned representations for information obfuscation and inference</title>
		<author>
			<persName><forename type="first">M</forename><surname>Bertran</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Martinez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Papadaki</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Qiu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Rodrigues</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Reeves</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Sapiro</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Machine Learning (ICML)</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Disco: Dynamic and invariant sensitive channel obfuscation for deep neural networks</title>
		<author>
			<persName><forename type="first">A</forename><surname>Singh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Chopra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Garza</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Vepakomma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Sharma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Raskar</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Conference on Computer Vision and Pattern Recognition (CVPR)</title>
				<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Mitigating information leakage in image representations: A maximum entropy approach</title>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">C</forename><surname>Roy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">N</forename><surname>Boddeti</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition</title>
				<meeting>the IEEE/CVF Conference on Computer Vision and Pattern Recognition</meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="2586" to="2594" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Deepobfuscator: Obfuscating intermediate representations with privacy-preserving adversarial learning on smartphones</title>
		<author>
			<persName><forename type="first">A</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Guo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">D</forename><surname>Salim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Chen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Internet-of-Things Design and Implementation</title>
				<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Censoring representations with an adversary</title>
		<author>
			<persName><forename type="first">H</forename><surname>Edwards</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Storkey</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Learning Representations (ICLR)</title>
				<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<monogr>
		<author>
			<persName><forename type="first">C</forename><surname>Louizos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Swersky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Welling</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Zemel</surname></persName>
		</author>
		<title level="m">The variational fair autoencoder</title>
				<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">Learning adversarially fair and transferable representations</title>
		<author>
			<persName><forename type="first">D</forename><surname>Madras</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Creager</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Pitassi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Zemel</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Machine Learning (ICML)</title>
				<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<analytic>
		<title level="a" type="main">NoPeek: Information leakage reduction to share activations in distributed deep learning</title>
		<author>
			<persName><forename type="first">P</forename><surname>Vepakomma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Singh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Gupta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Raskar</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Data Mining Workshops (ICDMW)</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<analytic>
		<title level="a" type="main">A hybrid deep learning architecture for privacy-preserving mobile analytics</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">A</forename><surname>Osia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">S</forename><surname>Shamsabadi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Sajadmanesh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Taheri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Katevas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">R</forename><surname>Rabiee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">D</forename><surname>Lane</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Haddadi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Internet of Things Journal</title>
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<analytic>
		<title level="a" type="main">Learning privacy preserving encodings through adversarial training</title>
		<author>
			<persName><forename type="first">F</forename><surname>Pittaluga</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Koppal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Chakrabarti</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Winter Conference on Applications of Computer Vision (WACV)</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b30">
	<analytic>
		<title level="a" type="main">Privacy adversarial network: representation learning for mobile data privacy</title>
		<author>
			<persName><forename type="first">S</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Du</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Shrivastava</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Zhong</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<analytic>
		<title level="a" type="main">Towards privacy-preserving visual recognition via adversarial training: A pilot study</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Jin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">European Conference on Computer Vision (ECCV)</title>
				<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b32">
	<analytic>
		<title level="a" type="main">On the global optima of kernelized adversarial representation learning</title>
		<author>
			<persName><forename type="first">B</forename><surname>Sadeghi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Yu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Boddeti</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Computer Vision (ICCV)</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b33">
	<analytic>
		<title level="a" type="main">Membership inference attacks against machine learning models</title>
		<author>
			<persName><forename type="first">R</forename><surname>Shokri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Stronati</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Song</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Shmatikov</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Symposium on Security and Privacy (SP)</title>
				<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b34">
	<analytic>
		<title level="a" type="main">Model inversion attacks that exploit confidence information and basic countermeasures</title>
		<author>
			<persName><forename type="first">M</forename><surname>Fredrikson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Jha</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Ristenpart</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ACM SIGSAC Conference on Computer and Communications Security</title>
				<imprint>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b35">
	<analytic>
		<title level="a" type="main">A Fourier perspective on model robustness in computer vision</title>
		<author>
			<persName><forename type="first">D</forename><surname>Yin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">G</forename><surname>Lopes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Shlens</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">D</forename><surname>Cubuk</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Gilmer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Advances in Neural Information Processing Systems (NeurIPS)</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b36">
	<analytic>
		<title level="a" type="main">High-frequency component helps explain the generalization of convolutional neural networks</title>
		<author>
			<persName><forename type="first">H</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Huang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">P</forename><surname>Xing</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Conference on Computer Vision and Pattern Recognition (CVPR)</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b37">
	<analytic>
		<title level="a" type="main">ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness</title>
		<author>
			<persName><forename type="first">R</forename><surname>Geirhos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Rubisch</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Michaelis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Bethge</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">A</forename><surname>Wichmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Brendel</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Learning Representations (ICLR)</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b38">
	<analytic>
		<title level="a" type="main">Dissecting the high-frequency bias in convolutional neural networks</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">A</forename><surname>Abello</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Hirata</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Wang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition</title>
				<meeting>the IEEE/CVF Conference on Computer Vision and Pattern Recognition</meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="863" to="871" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b39">
	<analytic>
		<title level="a" type="main">Adversarial examples are not bugs, they are features</title>
		<author>
			<persName><forename type="first">A</forename><surname>Ilyas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Santurkar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Tsipras</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Engstrom</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Tran</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Madry</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Advances in Neural Information Processing Systems (NeurIPS)</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b40">
	<monogr>
		<title level="m" type="main">Two-dimensional signal and image processing</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">S</forename><surname>Lim</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1990">1990</date>
			<pubPlace>Englewood Cliffs</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b41">
	<monogr>
		<author>
			<persName><forename type="first">O</forename><surname>Ronneberger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Fischer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Brox</surname></persName>
		</author>
		<title level="m">International Conference on Medical Image Computing and Computer-Assisted Intervention</title>
				<imprint>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
	<note>U-net: Convolutional networks for biomedical image segmentation</note>
</biblStruct>

<biblStruct xml:id="b42">
	<analytic>
		<title level="a" type="main">Deep residual learning for image recognition</title>
		<author>
			<persName><forename type="first">K</forename><surname>He</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Ren</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Sun</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Conference on Computer Vision and Pattern Recognition (CVPR)</title>
				<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b43">
	<analytic>
		<title level="a" type="main">Deep learning face attributes in the wild</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Luo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Tang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Computer Vision (ICCV)</title>
				<imprint>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b44">
	<analytic>
		<title level="a" type="main">Fairface: Face attribute dataset for balanced race, gender, and age for bias measurement and mitigation</title>
		<author>
			<persName><forename type="first">K</forename><surname>Karkkainen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Joo</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Winter Conference on Applications of Computer Vision (WACV)</title>
				<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b45">
	<monogr>
		<title level="m" type="main">Learning multiple layers of features from tiny images</title>
		<author>
			<persName><forename type="first">A</forename><surname>Krizhevsky</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
	<note type="report_type">Technical Report</note>
</biblStruct>

<biblStruct xml:id="b46">
	<monogr>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">P</forename><surname>Kingma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Ba</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1412.6980</idno>
		<title level="m">Adam: A method for stochastic optimization</title>
				<imprint>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b47">
	<analytic>
		<title level="a" type="main">Image quality assessment: from error visibility to structural similarity</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Bovik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Sheikh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Simoncelli</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Transactions on Image Processing</title>
		<imprint>
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b48">
	<analytic>
		<title level="a" type="main">Multiscale structural similarity for image quality assessment</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">P</forename><surname>Simoncelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">C</forename><surname>Bovik</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The Thirty-Seventh Asilomar Conference on Signals, Systems &amp; Computers</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2003">2003</date>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="1398" to="1402" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b49">
	<analytic>
		<title level="a" type="main">Image quality metrics: PSNR vs. SSIM</title>
		<author>
			<persName><forename type="first">A</forename><surname>Horé</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Ziou</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Pattern Recognition</title>
				<imprint>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b50">
	<analytic>
		<title level="a" type="main">The unreasonable effectiveness of deep features as a perceptual metric</title>
		<author>
			<persName><forename type="first">R</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Isola</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">A</forename><surname>Efros</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Shechtman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Wang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the IEEE conference on computer vision and pattern recognition</title>
				<meeting>the IEEE conference on computer vision and pattern recognition</meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="586" to="595" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b51">
	<analytic>
		<title level="a" type="main">Analyzing and improving the image quality of stylegan</title>
		<author>
			<persName><forename type="first">T</forename><surname>Karras</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Laine</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Aittala</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Hellsten</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Lehtinen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Aila</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)</title>
				<meeting>the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)</meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b52">
	<analytic>
		<title level="a" type="main">Searching for mobilenetv3</title>
		<author>
			<persName><forename type="first">A</forename><surname>Howard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Sandler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Chu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L.-C</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Tan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Zhu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Pang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Vasudevan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the IEEE/CVF International Conference on Computer Vision</title>
				<meeting>the IEEE/CVF International Conference on Computer Vision</meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
