<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>September</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Evolutionary Counterfactual Visual Explanation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jacqueline Höllig</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
<string-name>Steffen Thoma</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cedric Kulbach</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>FZI Research Center for Information Technology</institution>
          ,
          <addr-line>Haid-und-Neu-Strasse 10-14, 76131 Karlsruhe</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2022</year>
      </pub-date>
      <volume>20</volume>
      <issue>2022</issue>
      <fpage>0000</fpage>
      <lpage>0003</lpage>
      <abstract>
        <p>The increasing success of deep learning models in recent years comes with the drawback of increasing model complexity, which makes model insights hard to obtain. However, understanding the underlying reasoning for a proposed decision becomes crucial in critical settings. Counterfactual explanations are among the most popular methods to interpret predictions of so-called black-box machine learning models. They provide a form of explanation intuitive to human thinking by building "what-if" scenarios. Despite their popularity for interpreting tabular data, they have found limited adoption in the visual domain. Current approaches to image counterfactuals rely heavily on access to model parameters, additional training data, or surrogate models. However, such additional information might not always be available. We therefore propose an evolutionary method for counterfactual image generation with a custom mutation operator based on data augmentation to overcome these limitations. We show that generating image counterfactuals based solely on an input instance and access to the prediction function is possible and performs on par with existing methods.</p>
      </abstract>
      <kwd-group>
        <kwd>Interpretability</kwd>
        <kwd>Counterfactuals</kwd>
        <kwd>Evolutionary Computation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Deep Learning models are at the forefront of artificial intelligence development, as they allow
complex decision-making and can sometimes even discover complex patterns in data that other
algorithms or humans can hardly find. Due to their complexity, those models are "black boxes"
with no human-understandable explanations for their predictions. With the adoption of such
algorithms in critical areas like medical diagnosis, autonomous driving, or airport security, a
human-interpretable explanation becomes crucial to gain trust in these algorithms. However, most
machine learning systems lack ways to make decisions transparent to humans. Currently,
interest in model-agnostic techniques of explainable and interpretable machine learning is
growing [
        <xref ref-type="bibr" rid="ref1 ref2 ref3 ref4">1, 2, 3, 4</xref>
        ]. Most of those approaches determine how much each feature or which
feature combination contributes to a particular decision (e.g., [
        <xref ref-type="bibr" rid="ref2 ref5">2, 5</xref>
        ]). Nevertheless, those methods
fail to show how a different prediction could have been achieved. According to Miller [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], an
essential factor for human-understandable explanations, besides selectivity (i.e., only some
causes of the prediction are shown), sociability (i.e., interactiveness), and exclusion of probability,
is contrastiveness. Contrastive explanations should not merely explain why an event P happened, but
rather why P happened instead of some other event Q.
      </p>
      <p>
        A specific class of algorithms that can provide contrastive explanations are counterfactuals.
Counterfactuals present a perturbation to the original input that leads to a change in the
prediction of an underlying machine learning model. The roots of counterfactuals lie in causal
reasoning and offer answers to the questions "What if?" and "Why?". They are already in daily
use in scientific and ordinary language. Therefore, they provide an intuitive concept for humans
to understand. Despite many efforts to apply counterfactuals to improve the interpretability of
machine learning models (e.g., [
        <xref ref-type="bibr" rid="ref10 ref11 ref12 ref4 ref7 ref8 ref9">7, 8, 9, 10, 11, 12, 4</xref>
        ]), most approaches are restricted to specific
input data types (e.g., [
        <xref ref-type="bibr" rid="ref13 ref7 ref8">7, 8, 13</xref>
        ]), or the underlying model concepts (e.g., [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ]). Most work
focuses on tabular data (e.g., [
        <xref ref-type="bibr" rid="ref13 ref14 ref7">7, 14, 13</xref>
        ]). The small amount of work on images uses additional
information like surrogate models [
        <xref ref-type="bibr" rid="ref12 ref15">15, 12</xref>
        ], access to training data [
        <xref ref-type="bibr" rid="ref12 ref15">15, 12</xref>
        ], or model parameters
[
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. However, in the real world, this additional information is seldom available. In particular, in
industrial, medical, or privacy-sensitive applications, the user is often not the model developer
and, thus, has no access to model parameters or the expertise to evaluate those. Furthermore,
training data is often not available due to privacy-related issues. Nevertheless, validating and
explaining decisions is crucial for the user to understand the model’s quality, trustworthiness,
and decisions.
      </p>
      <p>
        This work develops an approach to generating model-agnostic image counterfactuals in a
multi-class prediction problem. Our approach, based on NSGA-II [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], takes an input image
and the prediction function of some black-box classifier to be explained. To summarise, the main
contributions of this work show that:
1. the counterfactual optimization problem is applicable to images;
2. data-augmentation-based mutation enables better search space coverage than
uniform mutation;
3. our approach achieves state-of-the-art results on par with the approaches of
Wachter et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and Van Looveren &amp; Klaise [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        To obtain an in-depth understanding of black-box models and their predictions, the current
research focus shifts from classic explainable AI tools (e.g., LIME [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], GradCam [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], SHAP [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ],
or Saliency Maps [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]) that visualize why a particular decision was taken, to counterfactuals.
Counterfactuals show why a different decision was taken via alternatives, thereby providing
contrastiveness.
      </p>
      <p>
        The first steps to adapt counterfactuals from their roots in causal reasoning to a tool for
understanding black-box models were taken by Wachter et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. They built on the fundamentals
of Pearl [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] to develop a basic stochastic counterfactual generation approach. They proposed
the following formulation:
      </p>
      <p>x' = arg min_{x'} max_λ λ ( f(x') − y' )² + d(x, x')
(1)</p>
      <p>
        The first part pushes the model's prediction f(x') on the counterfactual x' to a new target
class y' ≠ y other than the original class y. In the second part, the distance measure d
keeps the counterfactual x' close to the original instance x, and λ balances the contributions of the
competing terms. Extending their work towards more realistic and interpretable counterfactuals,
multiple authors provide mechanisms like feature extractors [
        <xref ref-type="bibr" rid="ref15 ref8">8, 15</xref>
        ], constraints [
        <xref ref-type="bibr" rid="ref10 ref14 ref20">14, 20, 10</xref>
        ],
or prototypes [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Sharma et al. [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] built the first framework for counterfactuals applicable
to various black-box algorithms and data types without the need for extensive additional
information. They were able to show that their approach works for multiple data types but was
unable to produce human-interpretable counterfactuals on MNIST. Dandl et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] created a
general framework for tabular data by formulating a multi-objective problem for counterfactuals
solved with the genetic algorithm NSGA-II.
      </p>
      <p>
        While counterfactuals have already been widely explored for tabular data [
        <xref ref-type="bibr" rid="ref10 ref12 ref14 ref21 ref22 ref4 ref7">7, 14, 22, 10, 21,
12, 4</xref>
        ], less work can be found on images. Some of the model-agnostic approaches for tabular data
have been applied to images (e.g., [
        <xref ref-type="bibr" rid="ref21 ref4">21, 4</xref>
        ]), resulting in more adversarial samples than
counterfactuals.1 Approaches to image-specific counterfactuals focus primarily on counterfactuals for
convolutional neural networks [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] and learning of surrogate models [
        <xref ref-type="bibr" rid="ref12 ref15">15, 12</xref>
        ].
      </p>
      <p>In contrast, our approach directly operates on the input image and the classifier prediction,
eliminating the need for parameter access and training surrogate models.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology and Model</title>
      <p>
        Throughout, we consider a black-box machine learning classifier f : X → Y, where x ∈ X is a
set of input features (x = {x_1, x_2, . . . , x_n}) from the feature domain X, and y ∈ Y is a vector
of class probabilities (y = {y_1, y_2, . . . , y_|C|}, where Σ_{i=1}^{|C|} y_i = 1) over the number of classes
|C|. In this context, black-box denotes that only the model's output y is observable; the model's
inner workings are unknown. The goal of counterfactual approaches is, given an input x and a
classifier f, to provide an explanation via counter-examples, allowing a human to understand
why classifier f chose class y for data point x and not a counterfactual class y' [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
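<p>The black-box assumption above can be made concrete: the only access the method needs is a callable prediction function returning class probabilities. Below is a minimal sketch of that interface; the toy scoring rule inside is entirely hypothetical (the paper uses a trained CNN), only the call signature matters.</p>

```python
import numpy as np

def make_blackbox_predict(n_classes=10, n_pixels=28 * 28, seed=0):
    """Stand-in for an opaque classifier: callers may query predictions but
    cannot inspect parameters. The scoring rule is purely illustrative."""
    rng = np.random.default_rng(seed)
    prototypes = rng.random((n_classes, n_pixels))  # one template per class

    def predict(x):
        scores = prototypes @ np.ravel(x)
        exp = np.exp(scores - scores.max())          # numerically stable softmax
        return exp / exp.sum()                       # probability vector y

    return predict

predict = make_blackbox_predict()
y = predict(np.random.default_rng(1).random((28, 28)))  # y sums to 1
```

<p>Everything that follows (objectives, islands, mutation) only ever calls such a `predict` function and never touches model internals.</p>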
      <p>
        Adapted to the image domain, this results in: given a query image x for which a classifier f
predicts the class y, a counterfactual image x' identifies how x could be changed in a proximate
(R1) [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], sparse (R2) [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] and plausible way (R3) [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] so that the classifier maximizes the
change in the predicted class (R4). Proximity refers to the distance between the original instance
x and the counterfactual instance x', calculated with a distance measure. Sparsity is the number of feature
changes between x and x'. A plausible adaption indicates that the resulting x' is in distribution
with the data.
      </p>
      <sec id="sec-3-1">
        <title>3.1. Objectives</title>
        <p>
          Following the definition of a counterfactual and the resulting requirements (R1-R4), the
optimization problem minimizes the distance (R1) d(x, x') between the original data point x
1Adversarial samples are closely related to counterfactuals. However, in contrast to counterfactuals that aim for
small but perceptible changes to provide useful explanations, adversarial samples aim to make the changes as small
and imperceptible as possible to expose flaws in the model [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ].
and the newly generated counterfactual data point x' to obtain a counterfactual that is close to
the original (o1). Furthermore, to ensure sparse changes (R2), the optimization problem uses
the L0-norm to minimize the number of pixels subjected to change (o2), referred to as sparsity.
The third optimization objective is the output distance (R4), which maximizes the classification
probability of the counterfactual for a target class t (o3). Equation (2) shows the optimization
problem to be minimized.
        </p>
        <p>min O(x') := (o1(x, x'), o2(x, x'), o3(x'))
(2)
s.t. f(x) ≠ f(x')</p>
        <p>o1(x, x') = d(x, x')</p>
        <p>o2(x, x') = Σ_{i=1}^{n} 1(x_i ≠ x'_i)</p>
        <p>
          o3(x') = 1 − f_t(x')
As distance measure d, most approaches to counterfactuals adopt the L1- or L2-norm [
          <xref ref-type="bibr" rid="ref14 ref4">4, 14</xref>
          ].
However, on images, traditional distance functions do not sufficiently account for image
similarity, as they disregard the spatial relationships of images [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ]. Therefore, we compare the mean
absolute error (using the L1-norm) and the root mean squared error (using the L2-norm) with different
image-based similarity indices (see Section 4.1 and Appendix A2). R3 is addressed during the
algorithm design in Section 3.2.
        </p>
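<p>The three objectives of Equation (2) can be sketched directly in code. The snippet below uses the mean absolute error as the distance d; all function and variable names are ours, not from the paper's implementation.</p>

```python
import numpy as np

def objectives(x, x_cf, predict, target):
    """The three minimization objectives of Equation (2), with the mean
    absolute error as distance measure d."""
    o1 = float(np.mean(np.abs(x - x_cf)))    # proximity (R1)
    o2 = int(np.count_nonzero(x != x_cf))    # sparsity (R2): L0 pseudo-norm
    o3 = 1.0 - float(predict(x_cf)[target])  # output distance (R4)
    return o1, o2, o3

# toy black box returning fixed class probabilities (hypothetical)
predict = lambda img: np.array([0.1, 0.7, 0.2])
x = np.zeros((4, 4))
x_cf = x.copy()
x_cf[0, 0] = 1.0                             # change a single pixel
o1, o2, o3 = objectives(x, x_cf, predict, target=1)
# o1 = 1/16, o2 = 1 changed pixel, o3 = 1 - 0.7
```

<p>The constraint f(x) ≠ f(x') is handled outside these objectives, during the evolutionary search.</p>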
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Algorithm</title>
        <p>
          Our algorithm combines a modified version of NSGA-II with Island Populations and an adaption
of the auto-tuning approach of Castelli et al. [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ]. Deb et al. [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] developed NSGA-II back
in 2002, but it remains a heavily used algorithm for Multi-Objective Optimization today,
as indicator-based methods (e.g., SMS-EMOA [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ], IBEA [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ]) rely on the
additional computation of the indicator, and the results of decomposition-based methods (e.g.,
MOEA/D [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ], NSGA-III [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ]) highly depend on the shape of the Pareto front [
          <xref ref-type="bibr" rid="ref30">30</xref>
          ].3
        </p>
        <p>As Equation 2 indicates, the only mandatory inputs for the algorithm are a black-box classifier
f and an input instance x. Our algorithm generates an island I_c with a sub-population P_c for
each class c ∈ C ∖ {f(x)} that the classifier f can predict. For each island I_c, the algorithm stated
in Algorithm 1 runs in parallel, allowing the creation of counterfactuals in multiple boundary
directions at once. In every generation g, each island I_c generates new candidates Q_c by selecting,
crossing, and mutating high-performing individuals from the population P_c.
2 https://github.com/JHoelli/Evolutionary_Counterfactual_Visual_Explanations/blob/master/Supplementary_
Material.pdf.
3For full reasoning we refer to the supplementary material A2.</p>
        <p>
          Algorithm 1: Algorithm on island I_c
1: Input: Population Size N, Generations G, Original Image x
2: Output: Non-Dominated Set
The initial N individuals of an island I_c are randomly initialized with the length of the flattened
input image |x|, along with an individual crossover rate cr and mutation rate mr. The
generated individuals are evaluated on each objective stated in Equation (2). After evaluating
the individuals' fitness, non-dominated sorting is applied, and the crowding distance is
calculated according to NSGA-II [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. The assigned ranks are used as the primary criterion in
the tournament selection: two individuals are compared according to their rank. If
they have the same rank, the crowding distance is used as a secondary criterion to retain the
individual lying in the less crowded region, maintaining the population's diversity. The selected
individuals are crossed by performing a uniform crossover [
          <xref ref-type="bibr" rid="ref31">31</xref>
          ]. The uniform crossover
modifies two individuals Q[j] and Q[j − 1] in place by swapping attributes according to
the averaged crossover probability cr of the two individuals. Based on the fitness of the resulting
offspring Q[j − 1] and Q[j], a new crossover probability cr[j − 1] and cr[j] is assigned
to the corresponding offspring. The selected individuals Q[j] are mutated with a mutation
probability mr[j] by a random change of attributes; based on the performance, mr[j]
is adapted. The algorithm stops if it meets the desired number of generations or exceeds a
hypervolume [
          <xref ref-type="bibr" rid="ref32 ref33">32, 33</xref>
          ] threshold h on all islands (i.e., on all islands, the generated solutions
dominate a portion h of the objective space). The stopping criterion is applied to all islands
independently, as the goal is to achieve a high-quality non-dominated set for each of them.
        </p>
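<p>The skeleton of the per-island loop can be sketched as follows. This is a heavily simplified stand-in: it keeps only the Pareto non-dominated set each generation and refills by mutation, whereas the actual algorithm additionally uses non-dominated sorting into ranks, crowding distance, tournament selection, and uniform crossover.</p>

```python
import random

def dominates(a, b):
    """a dominates b: no worse in every objective, strictly better in one."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

def nondominated(scored):
    """Keep (individual, fitness) pairs that no other pair dominates."""
    return [(ind, f) for ind, f in scored
            if not any(dominates(g, f) for _, g in scored)]

def island_run(init_pop, evaluate, mutate, generations=30, seed=0):
    """Simplified per-island loop: evaluate, keep the non-dominated set,
    refill the population by mutating survivors."""
    rng = random.Random(seed)
    pop, size = list(init_pop), len(init_pop)
    for _ in range(generations):
        front = nondominated([(ind, evaluate(ind)) for ind in pop])
        survivors = [ind for ind, _ in front]
        pop = survivors + [mutate(rng.choice(survivors), rng)
                           for _ in range(size - len(survivors))]
    return nondominated([(ind, evaluate(ind)) for ind in pop])

# toy bi-objective problem: minimize (v^2, (v - 2)^2); Pareto set is [0, 2]
evaluate = lambda v: (v * v, (v - 2.0) ** 2)
mutate = lambda v, rng: v + rng.gauss(0.0, 0.3)
init_rng = random.Random(1)
front = island_run([init_rng.uniform(-5, 5) for _ in range(20)],
                   evaluate, mutate)
```

<p>In the paper's setting, one such loop runs per island, i.e., once per candidate target class, with the objectives of Equation (2) as `evaluate`.</p>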
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Custom Operators</title>
        <p>
          Some of the operators used by default in evolutionary programming are unsuitable for the
stated problem, as they do not account for spatial dependencies in images or enable images to
be out of distribution. In this section, we depict the adapted operators of NSGA-II.
Initialization By default, NSGA-II initializes the parent population P randomly [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. However,
initializing images with traditional stochastic techniques like random number generators
leads to a vast search space (the number of candidate solutions for an image is on the order of
(width · height · channels)! · 255!), which slows down convergence and reduces the probability of
finding a suitable solution.
        </p>
        <p>To warm-start the algorithm by introducing relevant information and to enable plausible
results (R3), we lean on the concept of superpixels. The original image x of size h × w ×
c, where h is the height, w the width, and c the channels, is divided into m patches of
size p × p × c by slicing. Therefore, an image x contains m patches S = [s_1, s_2, ..., s_m],
where a patch s_j is of size p × p × c. Each individual in a population is generated by
randomly shuffling the patch positions S.</p>
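<p>The patch-shuffle initialization can be sketched as below for a single-channel image; parameter names are ours, and the image side is assumed to be divisible by the patch size (as with 28 × 28 inputs and patch sizes 2, 4, 7, or 14).</p>

```python
import numpy as np

def patch_shuffle_init(x, patch=4, seed=None):
    """Create one individual by slicing the image into patch x patch tiles
    and randomly shuffling the tile positions (superpixel-style warm start)."""
    rng = np.random.default_rng(seed)
    h, w = x.shape
    tiles = [x[i:i + patch, j:j + patch]
             for i in range(0, h, patch) for j in range(0, w, patch)]
    order = rng.permutation(len(tiles))          # shuffled patch positions
    cols = w // patch
    rows = [np.hstack([tiles[order[r * cols + c]] for c in range(cols)])
            for r in range(h // patch)]
    return np.vstack(rows)

individual = patch_shuffle_init(np.arange(16.0).reshape(4, 4), patch=2, seed=0)
```

<p>Note that shuffling preserves the original pixel values exactly; only their spatial arrangement changes, which keeps the initial population inside the image's value distribution.</p>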
        <p>
          Mutation Traditionally, individuals are mutated to produce new offspring that differ
from their parents, thereby encouraging diversity. Using the crossover operator alone
leads to decreasing diversity and often results in local optima, as only the good parts of
the parents survive in each generation (premature convergence) [
          <xref ref-type="bibr" rid="ref34">34</xref>
          ].
The proposed mutation operator aims to prevent premature convergence and to introduce
new relevant information into the population by applying data augmentation [35]. The
idea behind using data augmentation is to ensure that the changes are still
plausible (R3) by manipulating the image with basic augmentation techniques. Only basic
techniques are used, as we do not use additional data or model parameters. The data
augmentation pipeline consists of functions for Random Flip (horizontal or vertical),
Random Rotation (by factor 0.2, resulting in a counterclockwise rotation of up to 1.25 rad),
Random Contrast (by factors between 0.1 and 1.3, resulting in each pixel being adjusted to
factor × (pixel − channel mean) + channel mean), and Zoom (with height factors between
−0.7 and −0.2, resulting in a zoom-in between 20% and 70%).
        </p>
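<p>An augmentation-based mutation in the spirit of this pipeline can be sketched as follows. This is a simplified stand-in, not the paper's exact pipeline: it substitutes a coarse 90-degree rotation for the continuous rotation and a crude center-crop zoom, and assumes pixel values in [0, 1].</p>

```python
import numpy as np

def augment_mutate(x, rng=None):
    """Mutate an image individual by one randomly chosen basic augmentation:
    flip, rotation, contrast scaling, or center zoom."""
    rng = rng or np.random.default_rng()
    op = int(rng.integers(4))
    if op == 0:                                   # random horizontal/vertical flip
        return np.flip(x, axis=int(rng.integers(2)))
    if op == 1:                                   # coarse counterclockwise rotation
        return np.rot90(x)
    if op == 2:                                   # contrast: scale around the mean
        factor = float(rng.uniform(0.1, 1.3))
        return np.clip((x - x.mean()) * factor + x.mean(), 0.0, 1.0)
    h, w = x.shape                                # zoom in: crop center, upsample
    mh, mw = h // 4, w // 4
    crop = x[mh:h - mh, mw:w - mw]
    return np.kron(crop, np.ones((2, 2)))[:h, :w]  # nearest-neighbour upsample
```

<p>Because every operation recombines or rescales existing pixel content, mutated offspring stay closer to the data distribution than offspring produced by replacing pixels with random values.</p>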
        <p>
          Parameter Optimization According to Hassanat et al. [36], the parameters of evolutionary
algorithms, especially the mutation and crossover rates, impact the obtainable results and
convergence speed. Tuning these parameters beforehand requires several preliminary
experiments to find good values before the run. Moreover, different parameter values
might be optimal at different stages of the evolutionary process: mutation can be
beneficial in the initial generations to quickly explore the search space, while crossover is more
useful once the search process is close to the optimal solution. The proposed algorithm
therefore implements self-adaptive parameter control on the individual level, according to Castelli
et al. [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ]. Each individual P[i] in a population has its own crossover probability cr[i]
and its own mutation probability mr[i]. Both are initialized with random values
between 0 and 1. During crossover, two selected individuals P[i] and P[j] generate an
offspring with the probability cr = ½ (cr[i] + cr[j]), and the resulting
offspring is assigned the crossover probability cr + δ. δ is a small positive
number if the fitness of the generated offspring improved due to crossover and a small negative
number in any other case. During mutation, an individual mutates with its mutation
rate mr[i]. The resulting individual has a mutation rate of mr[i] + δ,
where δ is a small positive number if the fitness of the generated offspring improves due
to mutation and a small negative number in any other case.
        </p>
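<p>The rate-adaptation rule above reduces to a few lines. The value of δ (here `delta=0.05`) and the clamping to [0, 1] are our choices for the sketch; the paper only specifies "a small positive/negative number".</p>

```python
def adapt_rate(rate, improved, delta=0.05):
    """Self-adaptive rate update: nudge an individual's crossover or mutation
    probability up when the offspring's fitness improved, down otherwise,
    clamped to [0, 1]."""
    return min(1.0, max(0.0, rate + (delta if improved else -delta)))

# offspring crossover rate: average the two parents' rates, then adapt
cr_offspring = adapt_rate(0.5 * (0.6 + 0.4), improved=True)
mr_offspring = adapt_rate(0.30, improved=False)
```

<p>This lets exploration-heavy settings (high mutation) dominate early and fade out as crossover starts paying off, without any pre-run tuning experiments.</p>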
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Evaluation</title>
      <p>In this section, we evaluate the performance of our counterfactual approach on the two broadly
researched image datasets MNIST [37] and Fashion MNIST [38], answering the following research
questions:</p>
      <p>Q1 How does the proposed image similarity measure influence the performance of
our algorithm? → Section 4.1
Q2 How does the proposed mutation mechanism influence the performance of our
algorithm? → Section 4.2
Q3 How does the image counterfactual approach perform compared to other
state-of-the-art methods for image counterfactuals? → Section 4.3</p>
      <p>Both datasets include 60,000 training images and 10,000 test images divided into 10 classes.
Each image is of size 28 × 28 pixels. Both datasets were split into an 80/20 train/test split. The
train set was used only for training the classification model, while the following experiments
were run on the test set.</p>
      <p>The classification model consists of two convolutional layers for both datasets, followed by
max-pooling. The output layer is flattened and fed into a two-layer feed-forward network with
ReLU activation and a softmax output layer. This model is trained for 30 epochs with a batch
size of 100 on the training set. For MNIST, the model achieves a test set accuracy of 0.9921; for
Fashion MNIST, an accuracy of 0.831. We run all experiments on an Intel(R) Xeon(R) Platinum
8180M CPU at 2.50 GHz with 1.5 TB of RAM. The code for our evaluation is made publicly
available on GitHub4.</p>
      <sec id="sec-4-1">
        <title>4.1. Q1: Distance Function</title>
        <p>
          A counterfactual optimization problem usually includes minimizing the distance to the original
data. However, on images, traditional distance measures like the root mean squared error or
4https://github.com/JHoelli/Evolutionary_Counterfactual_Visual_Explanations
mean absolute error do not sufficiently account for image similarity, as they disregard images'
spatial relationships [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ]. To validate our choice of distance function, we compare the mean
absolute error (MAE) to other popular image similarity indices: the Information Based Statistic
Similarity Measure (ISSM) [39], the Feature-Based Similarity Index (FSIM) [40], the Root Mean
Squared Error (RMSE), and the Structural Similarity Index (SSIM) [41]. All functions were
inverted and mapped to the range [0, 1]. Appendix B2 defines the distance measures and
transformations.
        </p>
        <p>For each dataset, we randomly sample 15 instances. We run the algorithm without a target
direction t on every distance d ∈ {MAE, RMSE, ISSM, FSIM, SSIM} for the selected
images and set the number of epochs to 100, as we do not want the stopping criterion to interfere.
The population size was set to 1000. The evaluation criterion is the hypervolume (i.e., the search
space coverage). The goal is to cover a high fraction of the search space in a small number of
generations.</p>
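<p>For intuition, the hypervolume of a front can be sketched for two minimization objectives as the area dominated by the front inside the box bounded by a reference point. The paper's problem has three objectives; this 2-D version only illustrates the idea behind the evaluation criterion.</p>

```python
def hypervolume_2d(front, ref):
    """Hypervolume of a 2-objective minimization front with respect to a
    reference point `ref`: the area between the front's staircase and ref."""
    pts = sorted(p for p in front if p[0] < ref[0] and p[1] < ref[1])
    area, prev = 0.0, ref[1]
    for o1, o2 in pts:                  # sweep by increasing first objective
        if o2 < prev:                   # skip dominated points
            area += (ref[0] - o1) * (prev - o2)
            prev = o2
    return area

hv = hypervolume_2d([(1, 3), (2, 2), (3, 1)], ref=(4, 4))  # staircase front
```

<p>A larger hypervolume means the non-dominated set covers more of the objective space, which is why it serves both as the comparison metric here and as the stopping criterion in Section 3.2.</p>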
        <p>
          Figure 1 shows the development of the hypervolume averaged over all samples from both
datasets. Overall, MAE has the highest search space coverage, indicating the highest likelihood
of achieving good results. After 100 epochs, the hypervolume of the algorithm optimizing MAE
as distance reaches an average of 0.7023, the highest result for any tested distance. Further,
the superiority of MAE over RMSE confirms Wachter et al. [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. The sparsity-inducing
property of the L1-norm used in MAE as distance measure is desirable for human-understandable
counterfactuals, as only a small number of variables are changed. For image examples, we refer
the reader to Section C in the appendix2.
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Q2: Mutation Operator</title>
        <p>This section evaluates the mutation operator described in Section 3.3. As a baseline, we use an
implementation of random mutation, replacing a pixel with a random number r ∈ [0, 255] with
a probability of 0.1.</p>
        <p>For both mutation types, the algorithm runs on 15 randomly chosen images per dataset. With
MAE as distance, we run the algorithm for 100 epochs with a population size of 1000 and no
target direction. Again, we evaluate the hypervolume to determine which mutation guides the
search through the space more effectively.</p>
        <p>Figure 2 shows the distribution of the hypervolume for our mutation and the random mutation.
On average, our mutation leads to better search space coverage: it covers an over
10% larger fraction of the search space than the random mutation baseline while exhibiting only minor
performance fluctuations. For image examples, we refer the reader to Section C in the appendix2.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Q3: Benchmarking</title>
        <p>
          This section compares our approach to two widely used counterfactual benchmarks: the
approach of Wachter et al. [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] and Van Looveren &amp; Klaise [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. The approach of Wachter et al.[
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]
is a simple stochastic optimization over the distance between the original image and the
counterfactual image. Like our approach, it requires only the input image and the classification
as inputs. A more sophisticated approach regarding the data distribution was developed by Van
Looveren &amp; Klaise [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] by training a surrogate model for counterfactual search. Therefore, Van
Looveren &amp; Klaise [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]'s approach is a slightly harder benchmark for our algorithm to meet, as
we do not use additional information regarding the data distribution.
        </p>
        <p>For both datasets, a representative of each class is chosen, resulting in 10 images per dataset.
Our algorithm runs on each image in every possible target direction t ∉ {f(x)} for 500 epochs
with a population size of 1000. We ran the benchmarks in two settings:
1. without a specific target class t, to get the overall best counterfactual image;
2. with every possible target direction t ∉ {f(x)}, to calculate the benchmark
metrics.</p>
        <p>
          The metrics were adapted to this context from [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ].
        </p>
        <p>• Distances: We measure the distance between a counterfactual x' and the original image
x with the L0- and the L1-norm. The L0-norm counts the number of pixels changed
between the original and the counterfactual instance and is identical to the sparsity from the
optimization problem (R2). The L1-norm calculates the average change and is consistent
with MAE (R1).
• Redundancy: Redundancy measures unnecessary proposed feature changes in a
counterfactual by successively flipping one value of x' after another back to x, with the
goal of flipping the label back from f(x') to the originally predicted outcome f(x). If the
predicted outcome does not change, we increase the redundancy counter.
• yNN: yNN (Equation (3)) evaluates the data support (R3) of a counterfactual based on
instances from the training set. Ideally, a counterfactual should be close to a factual image
from the same target class t. yNN is calculated by measuring how differently neighborhood
points around the counterfactual x' are classified, where knn are the k-nearest neighbors of the
original image x. We use a value of k = 5.</p>
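<p>Since Equation (3) is not reproduced in this excerpt, a sketch of one common reading of the yNN metric may help: the fraction of the k nearest training points around the counterfactual that the model assigns to the target class. The exact neighborhood and normalization here are our assumptions.</p>

```python
def ynn(x_cf, target, train, predict_label, k=5):
    """Data-support sketch: fraction of the k nearest training points to the
    counterfactual whose predicted label matches the target class."""
    dist = lambda a, b: sum((u - v) ** 2 for u, v in zip(a, b))
    nearest = sorted(range(len(train)), key=lambda j: dist(x_cf, train[j]))[:k]
    return sum(1 for j in nearest if predict_label(train[j]) == target) / k

# toy 2-D "images": label 0 near the origin, label 1 far away (hypothetical)
train = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (5.0, 5.0), (5.0, 6.0)]
predict_label = lambda p: 0 if p[0] + p[1] < 3 else 1
score = ynn((0.2, 0.2), target=0, train=train,
            predict_label=predict_label, k=3)
```

<p>A score near 1 means the counterfactual sits in a region the model associates with the target class, i.e., it has high data support.</p>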
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>
        In this work, we introduced an approach to generate image counterfactuals in a multiclass
classification problem by perturbing the original image with evolutionary computation and data
augmentation. Based on NSGA-II, we presented a promising direction in building counterfactuals
close to the original input with high data support, without the need to access additional
information or model parameters. Further, we show that the counterfactual optimization
problem is applicable in high-dimensional feature spaces such as images, and that mutation via
augmentation of the image data enables better search space coverage. Finally, our approach
achieves state-of-the-art results on par with the approaches of Wachter et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and Van Looveren
&amp; Klaise [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Based on the provided approach and its general applicability, we aim to further optimize
the runtime of the underlying algorithm, investigate the mutation step, and apply our
method to real-world applications.
      </p>
      <p>(Figure panels: (a) MNIST, (b) Fashion MNIST)</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This work was carried out with the support of the German Federal Ministry of Education and
Research (BMBF) within the project "MetaLearn" (Grant 02P20A013).</p>
      <p>[35] C. Shorten, T. M. Khoshgoftaar, A survey on Image Data Augmentation for Deep Learning,
J. Big Data 6 (2019).
[36] A. Hassanat, K. Almohammadi, E. Alkafaween, E. Abunawas, A. Hammouri, V. B. Prasath,
Choosing mutation and crossover ratios for genetic algorithms - a review with a new
dynamic approach, Information 10 (2019).
[37] Y. LeCun, C. Cortes, C. Burges, MNIST handwritten digit database, ATT Labs [Online].
Available: http://yann.lecun.com/exdb/mnist 2 (2010).
[38] H. Xiao, K. Rasul, R. Vollgraf, Fashion-MNIST: a Novel Image Dataset for
Benchmarking Machine Learning Algorithms, arXiv preprint arXiv:1708.07747 (2017).
[39] M. A. Aljanabi, Z. M. Hussain, N. A. A. Shnain, S. F. Lu, Design of a hybrid measure
for image similarity: a statistical, algebraic, and information-theoretic approach, Eur. J.
Remote Sens. 52 (2019) 2-15.
[40] Y. Zhang, D. W. Gong, Z. H. Ding, Handling multi-objective optimization problems
with a multi-swarm cooperative particle swarm optimizer, Expert Syst. Appl. 38 (2011)
13933-13941.
[41] U. Sara, M. Akter, M. S. Uddin, Image Quality Assessment through FSIM, SSIM, MSE and
PSNR - A Comparative Study, J. Comput. Commun. 07 (2019) 8-18.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Lundberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. I.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <article-title>A unified approach to interpreting model predictions</article-title>
          ,
          <source>Adv. Neural Inf. Process. Syst</source>
          . 2017-December (
          <year>2017</year>
          )
          <fpage>4766</fpage>
          -
          <lpage>4775</lpage>
          . arXiv:
          <volume>1705</volume>
          .
          <fpage>07874</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M. T.</given-names>
            <surname>Ribeiro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Guestrin</surname>
          </string-name>
          ,
          <article-title>"Why should I trust you?" Explaining the predictions of any classifier</article-title>
          ,
          <source>Proc. ACM SIGKDD Int. Conf. Knowl. Discov. Data Min</source>
          .
          13-17-August (
          <year>2016</year>
          )
          <fpage>1135</fpage>
          -
          <lpage>1144</lpage>
          . arXiv:
          <volume>1602</volume>
          .
          <fpage>04938</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M. T.</given-names>
            <surname>Ribeiro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Guestrin</surname>
          </string-name>
          ,
          <article-title>Anchors: High-precision model-agnostic explanations</article-title>
          ,
          <source>in: Proceedings of the AAAI conference on artificial intelligence</source>
          , volume
          <volume>32</volume>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Wachter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Mittelstadt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Russell</surname>
          </string-name>
          ,
          <article-title>Counterfactual Explanations without Opening the Black Box: Automated Decisions and the GDPR</article-title>
          ,
          <source>Harv. J. Law Technol</source>
          .
          <volume>31</volume>
          (
          <year>2017</year>
          )
          <fpage>1</fpage>
          -
          <lpage>52</lpage>
          . arXiv:
          <volume>1711</volume>
          .
          <fpage>00399</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>B.</given-names>
            <surname>Carter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mueller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Jain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Gifford</surname>
          </string-name>
          ,
          <article-title>What made you do this? Understanding black-box decisions with sufficient input subsets</article-title>
          ,
          <source>22nd Int. Conf. Artif. Intell. Stat</source>
          . (
          <year>2018</year>
          )
          <fpage>567</fpage>
          -
          <lpage>576</lpage>
          . arXiv:
          <year>1810</year>
          .03805.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>T.</given-names>
            <surname>Miller</surname>
          </string-name>
          ,
          <article-title>Explanation in artificial intelligence: Insights from the social sciences</article-title>
          ,
          <source>Artif. Intell</source>
          .
          <volume>267</volume>
          (
          <year>2019</year>
          )
          <fpage>1</fpage>
          -
          <lpage>38</lpage>
          . arXiv:
          <volume>1706</volume>
          .
          <fpage>07269</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Dandl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Molnar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Binder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Bischl</surname>
          </string-name>
          ,
          <article-title>Multi-objective counterfactual explanations</article-title>
          ,
          <source>in: International Conference on Parallel Problem Solving from Nature</source>
          , Springer,
          <year>2020</year>
          , pp.
          <fpage>448</fpage>
          -
          <lpage>469</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Goyal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ernst</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Batra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Parikh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <article-title>Counterfactual visual explanations</article-title>
          ,
          <source>in: International Conference on Machine Learning, PMLR</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>2376</fpage>
          -
          <lpage>2384</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>T.</given-names>
            <surname>Laugel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Lesot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Marsala</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Renard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Detyniecki</surname>
          </string-name>
          ,
          <article-title>The dangers of post-hoc interpretability: Unjustified counterfactual explanations</article-title>
          ,
          <source>IJCAI Int. Jt. Conf. Artif. Intell</source>
          . 2019-August (
          <year>2019</year>
          )
          <fpage>2801</fpage>
          -
          <lpage>2807</lpage>
          . arXiv:
          <year>1907</year>
          .09294.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>D.</given-names>
            <surname>Mahajan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Tan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sharma</surname>
          </string-name>
          ,
          <article-title>Preserving causal constraints in counterfactual explanations for machine learning classifiers</article-title>
          , arXiv preprint arXiv:
          <year>1912</year>
          .
          <volume>03277</volume>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>M.</given-names>
            <surname>Pawelczyk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bielawski</surname>
          </string-name>
          , J. v. d. Heuvel,
          <string-name>
            <given-names>T.</given-names>
            <surname>Richter</surname>
          </string-name>
          , G. Kasneci,
          <article-title>CARLA: a python library to benchmark algorithmic recourse and counterfactual explanation algorithms</article-title>
          ,
          <source>arXiv preprint arXiv:2108.00783</source>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>A. V.</given-names>
            <surname>Looveren</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Klaise</surname>
          </string-name>
          ,
          <article-title>Interpretable counterfactual explanations guided by prototypes</article-title>
          ,
          <source>in: Joint European Conference on Machine Learning and Knowledge Discovery in Databases</source>
          , Springer,
          <year>2021</year>
          , pp.
          <fpage>650</fpage>
          -
          <lpage>665</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>R. K.</given-names>
            <surname>Mothilal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sharma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Tan</surname>
          </string-name>
          ,
          <article-title>Explaining machine learning classifiers through diverse counterfactual explanations</article-title>
          ,
          <source>in: Proc. 2020 Conf. Fairness</source>
          , Accountability, Transpar.,
          ACM
          , New York, NY, USA,
          <year>2020</year>
          , pp.
          <fpage>607</fpage>
          -
          <lpage>617</lpage>
          . arXiv:
          <year>1905</year>
          .07697.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A.</given-names>
            <surname>Dhurandhar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. Y.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Luss</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. C.</given-names>
            <surname>Tu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Ting</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Shanmugam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Das</surname>
          </string-name>
          ,
          <article-title>Explanations based on the Missing: Towards contrastive explanations with pertinent negatives</article-title>
          ,
          <source>Adv. Neural Inf. Process. Syst</source>
          . 2018-December (
          <year>2018</year>
          )
          <fpage>592</fpage>
          -
          <lpage>603</lpage>
          . arXiv:
          <year>1802</year>
          .07623.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>S.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Kailkhura</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Loveland</surname>
          </string-name>
          , Y. Han,
          <article-title>Generative counterfactual introspection for explainable deep learning</article-title>
          ,
          <source>Glob. 2019 - 7th IEEE Glob. Conf. Signal Inf. Process. Proc. (</source>
          <year>2019</year>
          ). arXiv:
          <year>1907</year>
          .03077.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>K.</given-names>
            <surname>Deb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Pratap</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Agarwal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Meyarivan</surname>
          </string-name>
          ,
          <article-title>A fast and elitist multiobjective genetic algorithm: NSGA-II</article-title>
          ,
          <source>IEEE Trans. Evol. Comput</source>
          .
          <volume>6</volume>
          (
          <year>2002</year>
          )
          <fpage>182</fpage>
          -
          <lpage>197</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>R. R.</given-names>
            <surname>Selvaraju</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Cogswell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Das</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Vedantam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Parikh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Batra</surname>
          </string-name>
          ,
          <article-title>Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization</article-title>
          ,
          <source>Int. J. Comput. Vis</source>
          .
          <volume>128</volume>
          (
          <year>2016</year>
          )
          <fpage>336</fpage>
          -
          <lpage>359</lpage>
          . arXiv:
          <volume>1610</volume>
          .
          <fpage>02391</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>A.</given-names>
            <surname>Atrey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Clary</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Jensen</surname>
          </string-name>
          ,
          <article-title>Exploratory Not Explanatory: Counterfactual Analysis of Saliency Maps for Deep Reinforcement Learning</article-title>
          , arXiv preprint arXiv:1912.05743 (
          <year>2019</year>
          )
          <fpage>1</fpage>
          -
          <lpage>23</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>J.</given-names>
            <surname>Pearl</surname>
          </string-name>
          , Causality, Cambridge University Press,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>A.-H.</given-names>
            <surname>Karimi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Barthe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Balle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Valera</surname>
          </string-name>
          ,
          <article-title>Model-agnostic counterfactual explanations for consequential decisions</article-title>
          ,
          <source>in: International Conference on Artificial Intelligence and Statistics</source>
          , PMLR,
          <year>2020</year>
          , pp.
          <fpage>895</fpage>
          -
          <lpage>905</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>S.</given-names>
            <surname>Sharma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Henderson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ghosh</surname>
          </string-name>
          ,
          <article-title>CERTIFAI: Counterfactual explanations for robustness, transparency, interpretability, and fairness of artificial intelligence models</article-title>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>A.</given-names>
            <surname>Dhurandhar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Pedapati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Balakrishnan</surname>
          </string-name>
          , P.-Y. Chen,
          <string-name>
            <given-names>K.</given-names>
            <surname>Shanmugam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Puri</surname>
          </string-name>
          ,
          <article-title>Model agnostic contrastive explanations for structured data</article-title>
          , arXiv preprint arXiv:
          <year>1906</year>
          .
          <volume>00117</volume>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>M.</given-names>
            <surname>Pawelczyk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Broelemann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Kasneci</surname>
          </string-name>
          ,
          <article-title>Learning Model-Agnostic Counterfactual Explanations for Tabular Data</article-title>
          ,
          <source>in: Proc. Web Conf</source>
          .
          <year>2020</year>
          , ACM, New York, NY, USA,
          <year>2020</year>
          , pp.
          <fpage>3126</fpage>
          -
          <lpage>3132</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bovik</surname>
          </string-name>
          ,
          <article-title>Mean squared error: Love it or leave it? A new look at Signal Fidelity Measures</article-title>
          ,
          <source>IEEE Signal Process. Mag</source>
          .
          <volume>26</volume>
          (
          <year>2009</year>
          )
          <fpage>98</fpage>
          -
          <lpage>117</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>M.</given-names>
            <surname>Castelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Manzoni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Vanneschi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Silva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Popovič</surname>
          </string-name>
          ,
          <article-title>Self-tuning geometric semantic Genetic Programming</article-title>
          ,
          <source>Genet. Program. Evolvable Mach</source>
          .
          <volume>17</volume>
          (
          <year>2016</year>
          )
          <fpage>55</fpage>
          -
          <lpage>74</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>M.</given-names>
            <surname>Emmerich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Beume</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Naujoks</surname>
          </string-name>
          ,
          <article-title>An EMO algorithm using the hypervolume measure as selection criterion</article-title>
          ,
          <source>Lect. Notes Comput. Sci</source>
          .
          <volume>3410</volume>
          (
          <year>2005</year>
          )
          <fpage>62</fpage>
          -
          <lpage>76</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>E.</given-names>
            <surname>Zitzler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Künzli</surname>
          </string-name>
          ,
          <article-title>Indicator-based selection in multiobjective search</article-title>
          ,
          <source>Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics</source>
          )
          <volume>3242</volume>
          (
          <year>2004</year>
          )
          <fpage>832</fpage>
          -
          <lpage>842</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>Q.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <article-title>MOEA/D: A multiobjective evolutionary algorithm based on decomposition</article-title>
          ,
          <source>IEEE Trans. Evol. Comput</source>
          .
          <volume>11</volume>
          (
          <year>2007</year>
          )
          <fpage>712</fpage>
          -
          <lpage>731</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>K.</given-names>
            <surname>Deb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Jain</surname>
          </string-name>
          ,
          <article-title>An evolutionary many-objective optimization algorithm using referencepoint-based nondominated sorting approach, part i: solving problems with box constraints</article-title>
          ,
          <source>IEEE transactions on evolutionary computation 18</source>
          (
          <year>2013</year>
          )
          <fpage>577</fpage>
          -
          <lpage>601</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>H.</given-names>
            <surname>Ishibuchi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Setoguchi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Masuda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Nojima</surname>
          </string-name>
          ,
          <article-title>Performance of Decomposition-Based Many-Objective Algorithms Strongly Depends on Pareto Front Shapes</article-title>
          ,
          <source>IEEE Trans. Evol. Comput</source>
          .
          <volume>21</volume>
          (
          <year>2017</year>
          )
          <fpage>169</fpage>
          -
          <lpage>190</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>W. M.</given-names>
            <surname>Spears</surname>
          </string-name>
          , K. A. De Jong,
          <article-title>On the virtues of parameterized uniform crossover</article-title>
          ,
          <source>Technical Report May</source>
          , Naval Research Lab Washington DC,
          <year>1991</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>C. M.</given-names>
            <surname>Fonseca</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Paquete</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>López-Ibáñez</surname>
          </string-name>
          ,
          <article-title>An improved dimension-sweep algorithm for the hypervolume indicator</article-title>
          ,
          <source>in: 2006 IEEE Congr. Evol. Comput. CEC</source>
          <year>2006</year>
          , IEEE,
          <year>2006</year>
          , pp.
          <fpage>1157</fpage>
          -
          <lpage>1163</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>E.</given-names>
            <surname>Zitzler</surname>
          </string-name>
          , L. Thiele,
          <article-title>Multiobjective evolutionary algorithms: A comparative case study and the strength Pareto approach</article-title>
          ,
          <source>IEEE Trans. Evol. Comput</source>
          .
          <volume>3</volume>
          (
          <year>1999</year>
          )
          <fpage>257</fpage>
          -
          <lpage>271</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>K.</given-names>
            <surname>Deb</surname>
          </string-name>
          , Introduction to genetic algorithms,
          <source>Sadhana - Acad. Proc. Eng. Sci</source>
          .
          <volume>24</volume>
          (
          <year>1999</year>
          )
          <fpage>293</fpage>
          -
          <lpage>315</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>