Generating bank transaction sequences with tabular GAN models

dc.contributor.authorMehri, Hamideh
dc.date.issued2024-04
dc.description.abstractThe digital age has equipped financial institutions with vast amounts of data. Privacy concerns have posed challenges to harnessing this data’s full potential. Generation of synthetic data is one of the most promising solutions for allowing analysis of the patterns and trends contained in this data without compromising privacy. Although initial methods for generating synthetic data were basic, emerging generative models have expanded the possibilities. However, generating synthetic data for unique datasets, like bank transaction sequences, remains challenging. These sequences exhibit complex variability driven by the various customer transaction behaviors, distinguishing them from the more predictable patterns in other data types. We propose BankGAN, an innovative conditional tabular GAN architecture designed specifically for synthesizing bank transaction sequences that exhibit non-uniform date patterns. We show that BankGAN outperforms a recurrent neural network (RNN)-based model in achieving superior statistical resemblance to real data. Moreover, it excels at replicating features of periodic transactions, surpassing both the RNN and transformer-based models. BankGAN distinguishes itself by generating privacy-preserving synthetic data without compromising data quality—a stark contrast to the existing models where adding privacy-preserving guarantees typically degrades performance.
dc.description.noteIncludes bibliographical references (pages 82-91)
dc.format.extentxi, 91 pages : illustrations (chiefly color)
dc.format.mediumText
dc.identifier.doihttps://doi.org/10.48336/ZSCR-DR27
dc.identifier.urihttps://hdl.handle.net/20.500.14783/14782
dc.language.isoen
dc.publisherMemorial University of Newfoundland
dc.rights.licenseThe author retains copyright ownership and moral rights in this thesis. Neither the thesis nor substantial extracts from it may be printed or otherwise reproduced without the author's permission.
dc.subjectsynthetic data
dc.subjectdeep learning
dc.subjectgenerative models
dc.subjectsequential tabular data
dc.subjectdecoder-only transformers
dc.subject.lcshData protection
dc.subject.lcshDeep learning (Machine learning)
dc.subject.lcshBanks and banking--Data processing
dc.subject.lcshElectronic data processing
dc.subject.lcshPrivacy, Right of
dc.titleGenerating bank transaction sequences with tabular GAN models
dc.typethesis
mem.campusSt. John's Campus
mem.convocationDate2024-10
mem.departmentComputer Science
mem.divisionsCompSci
mem.fullTextStatuspublic
mem.institutionMemorial University of Newfoundland
mem.isPublishedunpub
mem.thesisAuthorizedNameMehri, Hamideh
thesis.degree.disciplineComputer Science
thesis.degree.grantorMemorial University of Newfoundland
thesis.degree.levelmasters
thesis.degree.nameM. Sc.

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis.pdf
Size:
2.12 MB
Format:
Adobe Portable Document Format

Collections