Mitigating Bias in Medical Datasets: A Comparative Analysis of Generative Adversarial Networks (GANs) Based Data Generation Techniques ⋆

Mitigating Bias in Medical Datasets: A Comparative Analysis of Generative Adversarial Networks (GANs) Based Data Generation Techniques ⋆ MohamedAshik Regulated Software Research Centre (RSRC) Dundalk Institute of Technology

Dundalk Ireland

ShahulHameed Regulated Software Research Centre (RSRC) Dundalk Institute of Technology

Dundalk Ireland

AsifaMehmood Qureshi Regulated Software Research Centre (RSRC) Dundalk Institute of Technology

Dundalk Ireland

AbhishekKaushik Regulated Software Research Centre (RSRC) Dundalk Institute of Technology

Dundalk Ireland

Mitigating Bias in Medical Datasets: A Comparative Analysis of Generative Adversarial Networks (GANs) Based Data Generation Techniques ⋆ 1613-0073 0174F3DCD71F70231A151BA2779559D9 GROBID - A machine learning software for extracting information from scholarly documents Bias fairness medical datasets GANs TGAN CTGAN MedGAN MC-MedGAN

The increasing use of Artificial intelligence (AI) in the medical domain has highlighted a critical issue: bias in datasets. Biases in medical datasets can lead to skewed predictions, unfair clinical decisions, incorrect diagnoses and poor generalisation of AI models. Very often, these biases are the consequence of imbalance in the dataset. Generative Adversarial Networks (GANs) have appeared to be a promising solution for solving the data imbalance issue. Synthetic data can help mitigate bias by balancing the dataset for sensitive attributes as well as for class labels. However, the efficiency of different GAN variants in mitigating bias remains unexplored in the medical domain. This paper investigates and compares various GAN variants to identify the most effective approach to producing balanced data. In this study, we evaluated different variants of GAN on three medical datasets with the aim of contributing to the development of more fairer and inclusive AI models in the medical domain. The study shows that the performance of the Machine Learning (ML) model improves when the dataset is balanced using synthetic data samples. Moreover, the MedGAN variant performs better when compared with other variants of GAN.

Introduction

Bias in Artificial Intelligence (AI) models refers to AI systems that produce biased results that reflect and amplify human prejudices within a community, encompassing past and contemporary social injustices [1]. These biases when replicated in medical datasets can have life-threatening consequences due to incorrect diagnosis or treatment recommendations [2,3]. For example, German researchers built a skin cancer detection system using neural networks in 2016. The system was able to detect 95% of melanoma cases accurately. It was trained on 10,000 skin images and outperformed 58 dermatologists. Later, it was found that the data was highly dominated by white skin images and did not generalise well to a diverse population [4]. These biases can be handled at pre-processing, algorithmic level or in the post-processing stages of an AI model development [5]. Handling bias would help achieve fair models that do not discriminate against different groups and treat them unfairly [6]. Pre-processing techniques involve handling bias at the data level. One of the widely used techniques to mitigate bias is over-sampling. Over-sampling is the generation of synthetic data that mirrors the characteristics of real-world data. It helps to reduce bias by balancing the representation of different demographic groups so that machine learning models produce reasonable outcomes and generalise well over a diverse population [7]. There are several techniques to generate synthetic data to ensure fairness in medical datasets [8]. These techniques include SMOTE [9], FairSMOTE [10], BorderlineSMOTE [11], and Cluster-based over-sampling [12]. Moreover, deep learning is also widely used to generate artificial data because of its high efficiency and accuracy in generating data. The most commonly used algorithm is the Generative Adversarial Network (GANs) that have gained immense popularity in the research community [13].

GAN is a deep learning model that mainly consists of two neural networks: a Generator used to generate artificial data and a discriminator that tries to distinguish between real and synthetic data to improve quality. These models were first introduced to process only image data, but later different variants of GAN were proposed to process tabular data as well. These variants include Tabular GAN (TGAN) [14], Conditional Tabular GAN (CTGAN) [15], Medical GAN (MEDGAN) [16], Multi-Categorical GAN (MC-MedGAN) [17] and many more.

In this study, we evaluated various GAN variants including GAN, TGAN, CTGAN, MedGAN, and MC-MedGAN to generate synthetic samples to balance different group representations within medical datasets. The newly balanced dataset was fed into different ML models including Logistic Regression (LR), Random Forest (RF), Decision Tree (DT), and K-Nearest Neighbour (KNN) to draw a comparison. The GAN models are evaluated on three different medical datasets that consist of gender as a sensitive attribute to balance: the Asthma Disease Dataset [18], the Heart Disease Prediction Dataset [19], and the Cancer Prediction dataset [20]. The performance is evaluated using various metrics i.e., accuracy, precision, F1-score, recall, and Area Under Curve (AUC) scores. Fairness is evaluated using Equal Opportunity (EO) [21], Propensity Score (PS) [22], and Statistical Parity (SP) [23].

Motivation

In today's world, AI is an integral part of the healthcare system. The AI model must incorporate transparency and accountability. The goal of this research is to reduce bias in medical datasets that contain inherent biases due to unequal representation of different demographic groups. AI models can become unfair and imbalanced, particularly in the healthcare sector, where underrepresented groups may receive scant care. Bias in medical datasets poses a significant challenge to the reliability of predictive models [24]. This could be critical for healthcare systems since an automated model prediction has a direct effect on patients that affects their mental health, and quality of life or may risk the life of an individual [25] as well it also leads to financial loss [26]. Due to an unbiased dataset, certain populations may receive incorrect diagnoses or treatments as a result of unreliable predictions brought on by bias in datasets. Nonetheless, GANs provide a potentially helpful way to generate AI data that can assist in balancing underrepresented groups in health databases. The aim to explore how GAN-based techniques can eliminate bias through data augmentation and enable more reliable and equitable Machine Learning (ML) models motivates this effort [13]. The comparative study's main goal is to identify the optimal variant to lessen bias in medical datasets. We want to improve the quality of treatment by lowering bias and ensuring that AI systems generate reliable, accurate, and equitable forecasts for a range of demographics. Therefore, the motivation of this study is to investigate different variants of GAN including TGAN, CTGAN, MedGAN, and MC-MedGAN for their efficacy in mitigating bias and improving predictive performance on multiple medical datasets. This work will serve as a foundation for further experimentation on data generation via GAN to mitigate biases.

Hypothesis: GAN-based data generation methods can help to reduce biases and ensure fairness in medical datasets.

The formulated research questions to explore the above hypothesis are as follows:

• Does GAN-based synthetic data generation help reduce biases in medical datasets? If yes, which GAN variant performs better among basic GAN, TGAN, CTGAN, MedGAN, and MC-MedGAN?

The rest of the article is structured as follows: Section 3 highlights some of the recent related work. Section 4 explains the methodology in detail. Section 5 explains the results. Section 6 discusses the hypothesis and research questions and Section 7 concludes the discussion with future work.

Related work

GANs have gained significant attention in recent years due to their capability of generating highquality data. Therefore, this section reviews the recent methodologies that leverage GAN models to generate synthetic data. A study [27] presents the potential of GANs in generating synthetic data from observational health data and discusses some of the unique challenges associated with healthcare datasets, such as concerns about class imbalance. Observational Health Data (OHD) is highly valuable for medical research and health informatics. The use of such data is severely limited because of strict regulations. It highlights that GAN-generated synthetic data can help overcome some of the common challenges, such as bias, privacy and class imbalance. The authors argue that GANs are useful in generating healthcare data to combat the scarcity of high-quality medical datasets. Moreover, to address the challenges of drift and class imbalance of gas detection systems, [28] employed CTGAN for data augmentation. The result shows a significant improvement in the classification accuracy of each class for both Support Vector Machine (SVM) and Multi-Layer Perceptron (MLP) thus reducing bias toward the majority class. They conclude that CTGAN provides a feasible solution to generate a balanced dataset.

In another study [29] various variants of GAN including CTGAN, TGAN, and Wasserstein GAN (WGAN) are utilised for the anonymisation of real data through data synthesis. These models were compared for precision, recall, and coverage scores to evaluate the generation of realistic tabular data, handling missing and class-imbalanced data, and ensuring privacy. The results show that, although no GAN method performs best in each evaluation metric, CTGAN and TGAN produce better scores in most of the evaluation metrics. Additionally, in [30] a new variant of GAN called Multi-label Timeseries GAN (MTGAN) is proposed to generate sequential Electronic Health Record (EHR) data using a gated recurrent unit with a smooth conditional matrix, while the critic evaluates temporal features using Wasserstein distance for improving the quality of synthetic data. The results show that MTGAN generates realistic EHR data effectively and improves accuracy for uncommon diseases.

The above studies show that GANs have the potential to generate high-quality diverse datasets that can be used to handle bias in real-world datasets. Therefore, to analyse the capabilities of different GAN variants, this study aims to conduct multiple experiments and then assess the fairness within the newly generated synthetic medical datasets.

Methodology

Figure 1 shows the systematic methodology diagram used to evaluate the different variants of GANs. First, the data is preprocessed and split into standard train and test sets. Then, the data is fed into the GAN variant to generate synthetic data. The newly generated data is augmented with the real data to balance the number of samples for the sensitive attributes and the output label. Afterwards, ML models are trained on the newly generated data to evaluate the overall performance as well as the fairness of the models.

Preprocessing

The data preprocessing includes one hot encoding to replace categorical variables with numerical numbers. Afterwards, we applied z-score normalisation on each distinct numerical feature because they did not contain extreme outliers [31]. Normalisation helps to specify each variable within a specified range to simplify the model-learning process [32]. Then, the resulting dataset is split into a 70:30 ratio for train and test sets.

Generate synthetic data

In order to balance the dataset for the sensitive attribute i.e., Gender and class labels. We employed five GAN architectures: basic GAN, TGAN, CTGAN, MedGAN and MC-MedGAN. These variants are specifically designed to handle tabular and medical datasets which is the primary focus of our study. GAN is a type of neural network architecture where two networks, a generator, and a discriminator, are trained simultaneously [33,34,35]. Tabular GAN is an application-driven variant of the GAN that is designed to generate synthetic tabular data, containing rows and columns like in a spreadsheet or database [33,14,36]. The CTGAN is an extension to Tabular GAN that generates synthetic tabular data while taking into consideration the distribution of dependent target variables. This will help associate relations between columns and observe dependence relationships [15]. MedGAN is a specialised version of GAN that generates synthetic data in the medical field, mainly in tabular form containing sensitive information [16]. MC-MedGAN is a variant of MedGAN designed for handling multi-categorical variables, commonly present in medical datasets [17].

Train ML classifiers

After generating synthetic samples to balance the datasets for sensitive attribute (gender) and class labels, different commonly used ML classifiers including Logistic regression (LR), Random Forest (RF), Decision Tree (DT), K-Nearest Neighbour (KNN) with default parameters are trained on the newly generated datasets to evaluate the performance of GAN variants.

Datasets

To evaluate the performance of GAN variants, we used three different medical datasets that contain sensitive attributes. The details of each of these datasets are as follows:

Asthma Disease Dataset: The Asthma Disease Dataset [18] contains a record of 2,392 samples with 28 features. The output label is the diagnosis indicator, which is taken as 0 for the absence and 1 for a positive case. It contains 2,268 samples for class 0 as compared to 124 samples with class label 1. Also, the number of samples for males is 1212 whereas for females the count is 1180.

Heart Disease Prediction Dataset: The Heart Disease Prediction Dataset [19] consists of 13 features and 303 samples. The dataset contains 207 male and 96 female samples.

Cancer Prediction Dataset:

The Cancer Prediction Dataset [20] contains 1,500 samples with 8 features. The target variable 'diagnosis' indicates whether a patient has cancer or not (0 for no cancer and 1 for cancer). The diagnosis distribution shows 943 patients without cancer and 557 with cancer. There are 736 female samples and 764 males in total.

Results

The performance is evaluated by training different ML classifiers as mentioned in Section 4. The classifiers are assessed using accuracy, f1-score, precision, recall and AUC. Whereas the fairness of the dataset is evaluated via EO, PS, and SP. EO guarantees that all individuals receive the same treatment and meet the same requirements [21]. PS can be defined as the conditional probability of being exposed to a treatment given the observed covariates [22]. SP is a fairness criterion that requires the probability of a favourable outcome to be the same for each demographic group [23]. Tables 1, Table 2, and Table 3 show each classifier's performance on the original as well as on each generated dataset. It can be seen that MedGAN performs well for the Asthma Disease Dataset and Cancer Prediction Dataset while MC-MedGAN has a better score for the Heart Disease Dataset. Figure 2, shows the fairness metric performance on the Asthma Disease dataset. The SP, PS, and EO scores improve when the dataset is balanced for class label and gender. MEDGAN has a better performance for all three datasets followed by MC-MedGAN and TGAN. The same performance is observed for the other two datasets. The other graphs are given in Appendix A.

Overall, the results show that balancing the dataset for class labels and sensitive attributes improves the performance as well as the fairness of the model. Among different GAN variants, the MEDGAN produces good results and lower statistical, propensity and equal opportunity scores showing its great capability for reducing bias followed by MC-MedGAN. Moreover, the predictive ability of RF classifiers is better than other classifiers in terms of accuracy, precision, recall, f1-score, and AUC.

Discussion

This section discusses the overall findings of the study in view of the literature review and extensive experimentation conducted to analyse our hypothesis. Based on our research question, the experiments show that classifier performance as well as the fairness metrics score improves when the datasets are balanced for sensitive attributes and class labels. Figure 2 shows the improvement in the fairness scores across each metric when the dataset is balanced via synthetic data generation using GAN variants as compared to the original dataset. Moreover, the analysis of each GAN variant based on performance evaluation using accuracy, precision, F1-score, recall, AUC and fairness metrics via EO, PS, and SP indicates that the MedGAN produces efficient performance followed by MC-MEDGAN across all three datasets. To validate any statistically significant difference between these two methods, we applied a paired t-test on the EO, PS, and SP scores for each of these methods. The p-values for EO, PS, and SP came out to be 0.34, 0.61, and 0.30 respectively. Therefore, we fail to reject our hypothesis and conclude that these two methods are not significantly different. These GAN variants are specifically designed for medical datasets to capture the interdependencies between the different variables to generate synthetic data similar to original data properties [16,17]. However, further experimentation with other datasets including post-hoc tests will be conducted in future to provide deeper insights into the capability of GAN variants for data generation.

Conclusion and future work

In this paper, we tested different types of GANs for their capacity to produce synthetic tabular data to decrease bias in medical datasets. Our key findings are GAN-based models are effective for bias migration and GAN can provide a balanced dataset to produce generalised AI models and provide a solution AI for all and AI for good. On the other hand, traditional GANs were successful but medical domain-based GANs displayed greater performance in generating high-quality and unbiased data. It drives us to have more specific models in the future. Despite certain advantages of the GAN, we face some obstacles such as evaluation metrics. There is a need to have more standardised and compressive evaluation metrics of this model focused on decreasing bias. The studies in this article suggest that synthetic data can assist in eliminating bias and improve the effectiveness of the classifier. Moreover, MedGAN performs better in terms of SP, PS, and EO. In future, we will extend our work for various variations of GAN focused on refining GAN architecture to adapt the multimodality medical data, bias-sensitive evaluation mechanism and testing the GAN-based techniques in real-world clinical data.

Figure 1 :1Figure 1: Systematic Methodology Diagram to Evaluate GAN Variants

Figure 2 :2Figure 2: Fairness Assessment for Asthma Disease dataset (a) Statistical Parity (b) Propensity Score, (c) Equal Opportunity

Table 11Accuracy, F1-score, Precision, Recall and AUC score comparison over the Asthma

Disease Dataset Method Model Accuracy F1-score Precision Recall AUC

LR0.49890.45940.49510.4285 0.4996Original DatasetRF DT0.4926 0.48640.5050 0.40290.4901 0.47700.5210 0.4628 0.3487 0.4559KNN0.49260.48400.48920.4789 0.4903LR0.53810.52830.56000.5000 0.5487GANRF DT0.7124 0.66620.6967 0.68620.7667 0.66800.6383 0.7912 0.7053 0.6648KNN0.59580.61280.60740.6183 0.6047LR0.51470.48920.53800.4485 0.5254TGANRF DT0.7267 0.66890.7064 0.69320.7967 0.66660.6345 0.7797 0.7221 0.6669KNN0.56910.58780.58270.5929 0.5912LR0.51030.49520.54280.4553 0.5119CTGANRF DT0.7425 0.65630.7248 0.69520.8309 0.65320.6427 0.7920 0.7429 0.6509KNN0.55970.57200.58710.5577 0.5786LR0.52720.51400.54720.4845 0.5425MedGANRF DT0.7409 0.67150.7233 0.69800.8054 0.66400.6563 0.7883 0.7356 0.6694KNN0.57560.58980.61140.5695 0.5858LR0.51340.51280.51670.5134 0.5084MC-MedGANRF DT0.6927 0.62260.6916 0.61720.7015 0.62380.6927 0.7635 0.6226 0.6220KNN0.57110.57060.57050.5711 0.6062

Table 22Accuracy, F1-score, Precision, Recall and AUC score comparison over the Heart

Disease Prediction Dataset Method Model Accuracy F1-score Precision Recall AUC

LR0.70490.81250.81250.8125 0.6522Original DatasetRF DT0.8032 0.68850.8775 0.76540.8600 0.93930.8958 0.7996 0.6458 0.8261KNN0.78680.86310.87230.8541 0.7203LR0.71260.81660.85670.8957 0.7039GANRF DT0.8915 0.89150.9010 0.89880.9318 0.95230.8723 0.9621 0.8510 0.8977KNN0.86740.87910.90900.8510 0.9255LR0.73490.84410.82050.8808 0.7352TGANRF DT0.8915 0.92770.9010 0.93330.9318 0.97670.8723 0.9621 0.8936 0.9329KNN0.86740.87910.90900.8510 0.9137LR0.72500.81530.85000.9217 0.6876CTGANRF DT0.9083 0.82500.9197 0.82920.9264 0.94440.9130 0.9766 0.7391 0.8975KNN0.87500.88000.98210.7971 0.9903LR0.71080.88910.83550.8734 0.7340MedGANRF DT0.9036 0.86740.9130 0.87910.9333 0.90900.8936 0.9598 0.8510 0.9021KNN0.86740.87910.90900.8510 0.9284LR0.77460.90960.95100.8382 0.7133MC-MedGANRF DT0.9277 0.92770.9347 0.93330.9555 0.97670.9148 0.9728 0.8936 0.9320KNN0.84330.85390.90470.8085 0.9414Table 3Accuracy, F1-score, Precision, Recall and AUC score comparison over the Cancer Prediction DatasetMethodModel Accuracy F1-score Precision Recall AUCLR0.60000.57140.58820.5555 0.6846Original DatasetRF DT0.6133 0.56660.5671 0.56080.6129 0.54600.5277 0.6588 0.5763 0.6028KNN0.63660.56570.66350.4930 0.6586LR0.62180.57760.59440.6431 0.6898GANRF DT0.7527 0.68900.7580 0.71540.7500 0.66560.7661 0.8453 0.7733 0.6881KNN0.70900.73590.67980.8021 0.8002LR0.65490.58910.59200.5921 0.6958TGANRF DT0.7271 0.68490.7478 0.73040.7317 0.66760.7647 0.8331 0.8062 0.6774KNN0.71610.75740.69140.8373 0.8369LR0.66960.64580.59580.5735 0.7680CTGANRF DT0.7287 0.72690.7349 0.74090.7375 0.72240.7323 0.8342 0.7605 0.7260KNN0.66900.70530.64980.7711 0.7836LR0.77450.66980.58990.6563 0.7057MedGANRF DT0.7071 0.67950.7028 0.71190.7230 0.65340.6836 0.7852 0.7818 0.6782KNN0.70710.73710.67570.8109 0.8123LR0.74520.59230.59770.7090 0.7694MC-MedGANRF DT0.7005 0.66350.7011 0.68620.7116 0.65240.6909 0.7782 0.7236 0.6625KNN0.65430.68670.63660.7454 0.7558

Acknowledgments

This research was managed by the CREATE-DkIT project, supported by the HEA's TU-Rise programme and co-financed by the Government of Ireland and the European Union through the ERDF Southern, Eastern Midland Regional Programme 2021-27 and the Northern Western Regional Programme 2021-27. This research is also partially supported by the Research Ireland under Grant Number 21/FFP-A/9255.

A. Fairness Assessment Graphs

Large language models struggle to learn long-tail knowledge NKandpal HDeng ARoberts EWallace CRaffel International Conference on Machine Learning

PMLR

2023 Fairness and bias in artificial intelligence: A brief survey of sources, impacts, and mitigation strategies EFerrara Sci 6 3 2023 Dataset bias in diagnostic ai systems: Guidelines for dataset collection and usage JVaughn ABaral MVadari WBoag Proceedings of the ACM Conference on Health, Inference and Learning the ACM Conference on Health, Inference and Learning

Toronto, ON, Canada

2020 Man against machine: diagnostic performance of a deep learning convolutional neural network for dermoscopic melanoma recognition in comparison to 58 dermatologists HAHaenssle CFink RSchneiderbauer FToberer TBuhl ABlum AKalloo AB HHassen LThomas AEnk Annals of oncology 29 2018 A framework for understanding sources of harm throughout the machine learning life cycle HSuresh JGuttag Proceedings of the 1st ACM Conference on Equity and Access in Algorithms, Mechanisms, and Optimization the 1st ACM Conference on Equity and Access in Algorithms, Mechanisms, and Optimization 2021 A review of bias and fairness in artificial intelligence RGonzález-Sendino ESerrano JBajo PNovais 2023 Gan-based data generation for speech emotion recognition SEEskimez DDimitriadis RGmyr KKumanati INTERSPEECH 2020 Bias mitigation via synthetic data generation: A review MAShahul Hameed AMQureshi AKaushik Electronics 13 2079-9292. 2024 Smote: synthetic minority over-sampling technique NVChawla KWBowyer LOHall WPKegelmeyer Journal of artificial intelligence research 16 2002 Bias in machine learning software: Why? how? what to do? JChakraborty SMajumder TMenzies Proceedings of the 29th ACM joint meeting on european software engineering conference and symposium on the foundations of software engineering the 29th ACM joint meeting on european software engineering conference and symposium on the foundations of software engineering 2021 Borderline-smote: a new over-sampling method in imbalanced data sets learning HHan W.-YWang B.-HMao International conference on intelligent computing Springer 2005 Class imbalances versus small disjuncts TJo NJapkowicz ACM Sigkdd Explorations Newsletter 6 2004 Survey on synthetic data generation, evaluation methods and gans AFigueira BVaz Mathematics 10 2733 2022 LXu KVeeramachaneni arXiv:1811.11264 Synthesizing tabular data using generative adversarial networks 2018 arXiv preprint Modeling tabular data using conditional gan LXu MSkoularidou ACuesta-Infante KVeeramachaneni Advances in neural information processing systems 32 2019 Generating multi-label discrete patient records using generative adversarial networks EChoi SBiswal BMalin JDuke WFStewart JSun Machine learning for healthcare conference PMLR 2017 Generation and evaluation of synthetic patient data AGoncalves PRay BSoper JStevens LCoyle APSales BMC medical research methodology 20 2020 Asthma disease dataset REKharoua 2024. 2024-08-23 Heart disease prediction dataset KUjeniya 2024. 2024-08-23 Cancer prediction dataset REKharoua 2024. 2024-08-23 Equality of opportunity in supervised learning MHardt EPrice NSrebro Advances in neural information processing systems 29 2016 A brief guide to propensity score analysis AEValojerdi LJanani Medical journal of the Islamic Republic of Iran 32 122 2018 The statistical fairness field guide: perspectives from social and formal sciences ANCarey XWu AI and Ethics 3 2023 Leveraging feature bias for scalable misprediction explanation of machine learning models JGesi XShen YGeng QChen IAhmed IEEE/ACM 45th International Conference on Software Engineering (ICSE), IEEE 2023. 2023 Ai pitfalls and what not to do: mitigating bias in ai JWGichoya KThomas LACeli NSafdar IBanerjee JDBanja LSeyyed-Kalantari HTrivedi SPurkayastha The British Journal of Radiology 96 20230023 2023 Revolutionizing healthcare: the role of artificial intelligence in clinical practice SAAlowais SSAlghamdi NAlsuhebany TAlqahtani AIAlshaya SNAlmohareb AAldairem MAlrashed KBin HASaleh Badreldin BMC medical education 23 689 2023 JGeorges-Filteau ECirillo arXiv:2005.13510 Synthetic observational health data with gans: from slow adoption to a boom in medical research and ultimately digital twins? 2020 arXiv preprint Data augmentation and class imbalance compensation using ctgan to improve gas detection systems SMahinnezhad SMahinnezhad KKaur AShih 2024 IEEE International Instrumentation and Measurement Technology Conference (I2MTC), IEEE 2024 Generating Synthetic Health Data Using Machine Learning GAN Methods ESShourmasti 2022 Master's thesis Multi-label clinical time-series generation via conditional gan CLu CKReddy PWang DNie YNing IEEE Transactions on Knowledge and Data Engineering 2023 Feature-limited prediction on the uci heart disease dataset KMAlfadli AOAlmagrabi Computers, Materials & Continua 74 2023 Investigating the impact of data normalization on classification performance DSingh BSingh Applied Soft Computing 97 105524 2020 Damage gan: A generative model for imbalanced data AAnaissi YJia ABraytee MNaji WAlyassine Australasian Conference on Data Science and Machine Learning Springer 2023 Gan-based approaches for generating structured data in the medical domain MAbedi LHempel SSadeghi TKirsten Applied Sciences 12 7075 2022 Gan-based one dimensional medical data augmentation YZhang ZWang ZZhang JLiu YFeng LWee ADekker QChen ATraverso Soft Computing 27 2023 Causal-tgan: Modeling tabular data using causally-aware gan BWen YCao FYang KSubbalakshmi RChandramouli ICLR Workshop on Deep Generative Models for Highly Structured Data 2022