Strelcenia, E. and Prakoonwit, S., 2022. Generating synthetic data for credit card fraud detection using GANs. In: 2022 3rd International Conference on Computers and Artificial Intelligence Technologies (CAIT), 4-6 November 2022, Zhejiang, China.
Full text available as:
|
PDF
Generating synthetic data for credit card fraud detection using GANs.pdf - Accepted Version Available under License Creative Commons Attribution Non-commercial. 852kB | |
Copyright to original material in this document is with the original owner(s). Access to this content through BURO is granted on condition that you use it only for research, scholarly or other non-commercial purposes. If you wish to use it for any other purposes, you must contact BU via BURO@bournemouth.ac.uk. Any third party copyright material in this document remains the property of its respective owner(s). BU grants no licence for further use of that third party material. |
DOI: 10.1109/CAIT56099.2022.10072179
Abstract
Deep learning-based classifiers for object classification and recognition have been utilized in various sectors. However according to research papers deep neural networks achieve better performance using balanced datasets than imbalanced ones. It’s been observed that datasets are often imbalanced due to less fraud cases in production environments. Deep generative approaches, such as GANs have been applied as an efficient method to augment high-dimensional data. In this research study, the classifiers based on a Random Forest, Nearest Neighbor, Logistic Regression, MLP, Adaboost were trained utilizing our novel K-CGAN approach and compared using other oversampling approaches achieving higher F1 score performance metrics. Experiments demonstrate that the classifiers trained on the augmented set achieved far better performance than the same classifiers trained on the original data producing an effective fraud detection mechanism. Furthermore, this research demonstrates the problem with data imbalance and introduces a novel model that's able to generate high quality synthetic data.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Uncontrolled Keywords: | fraud; GANs; synthetic data; class imbalance |
Group: | Faculty of Science & Technology |
ID Code: | 38332 |
Deposited By: | Symplectic RT2 |
Deposited On: | 03 Apr 2023 13:10 |
Last Modified: | 03 Apr 2023 13:10 |
Downloads
Downloads per month over past year
Repository Staff Only - |