This repository contains the code for the paper "Investigating Imperceptibility of Adversarial Attacks on Tabular Data: An Empirical Analysis" by Zhipeng He, Chun Ouyang, Laith Alzubaidi, Alistair Barros and Catarina Moreira. The paper has been accepted by the journal *Intelligent Systems with Applications*. The preprint version can be found on arXiv.
Adversarial attacks are a potential threat to machine learning models, causing incorrect predictions through imperceptible perturbations to the input data. While these attacks have been extensively studied on unstructured data such as images, applying them to tabular data poses new challenges. These challenges arise from the inherent heterogeneity and complex feature interdependencies of tabular data, which differ from those of image data. To account for this distinction, it is necessary to establish tailored imperceptibility criteria specific to tabular data. However, there is currently a lack of standardised metrics for assessing the imperceptibility of adversarial attacks on tabular data. To address this gap, we propose a set of key properties and corresponding metrics designed to comprehensively characterise imperceptible adversarial attacks on tabular data. These are: proximity to the original input, sparsity of altered features, deviation from the original data distribution, sensitivity to perturbing features with narrow distributions, immutability of certain features that should remain unchanged, feasibility of specific feature values that should not go beyond valid practical ranges, and feature interdependencies capturing complex relationships between data attributes. We evaluate the imperceptibility of five adversarial attacks, including both bounded and unbounded attacks, on tabular data using the proposed metrics. The results reveal a trade-off between the imperceptibility and effectiveness of these attacks. The study also identifies limitations in current attack algorithms, offering insights that can guide future research in the area. The findings from this empirical analysis provide valuable direction for enhancing the design of adversarial attack algorithms, thereby advancing adversarial machine learning on tabular data.
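As a concrete illustration of the first three properties, the sketch below computes proximity (mean L2 distance), sparsity (mean number of altered features) and deviation (mean Mahalanobis distance from the training distribution) on synthetic data. This is a minimal, self-contained sketch for illustration only, not the evaluation code of this repository; see `3_AE_performance_metrics.ipynb` for the metrics actually used.

```python
import numpy as np

def proximity_l2(x, x_adv):
    """Mean L2 distance between original and adversarial inputs."""
    return np.linalg.norm(x_adv - x, axis=1).mean()

def sparsity(x, x_adv, tol=1e-8):
    """Mean number of features altered per adversarial example."""
    return (np.abs(x_adv - x) > tol).sum(axis=1).mean()

def deviation_md(x_adv, x_train):
    """Mean Mahalanobis distance of adversarial examples from the
    training-data distribution."""
    mu = x_train.mean(axis=0)
    cov_inv = np.linalg.inv(np.cov(x_train, rowvar=False))
    d = x_adv - mu
    return np.sqrt(np.einsum("ij,jk,ik->i", d, cov_inv, d)).mean()

# Synthetic stand-in data: a batch of 64 originals with 12 features
rng = np.random.default_rng(0)
x_train = rng.normal(size=(500, 12))
x = x_train[:64]
x_adv = x + rng.normal(scale=0.05, size=x.shape)  # stand-in perturbations

print(f"proximity (L2): {proximity_l2(x, x_adv):.4f}")
print(f"sparsity:       {sparsity(x, x_adv):.2f} features altered on average")
print(f"deviation (MD): {deviation_md(x_adv, x_train):.4f}")
```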
| Dataset | Data Type | Total Inst. | Train/Test (80%:20%) | Batch/Adv Inst. (batch_size=64) | Total Feat. | Categorical Feat. | Numerical Feat. | Total Categorical Feat. after One-Hot Enc. |
|---|---|---|---|---|---|---|---|---|
| Adult/Income | Mixed | 32651 | 26048/6513 | 101/6464 | 12 | 8 | 4 | 98 |
| Breast Cancer | Num | 569 | 455/114 | 1/64 | 30 | 0 | 30 | 0 |
| COMPAS | Mixed | 7214 | 5771/1443 | 22/1408 | 11 | 7 | 4 | 19 |
| Diabetes | Num | 768 | 614/154 | 2/128 | 8 | 0 | 8 | 0 |
| German Credit | Mixed | 1000 | 800/200 | 3/192 | 20 | 15 | 5 | 58 |
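To make the table concrete, here is a minimal sketch of the preprocessing it implies: one-hot encoding of categorical features and an 80:20 train/test split. The CSV path and label column below are assumptions for illustration, not the repository's actual loading code.

```python
import pandas as pd
from sklearn.model_selection import train_test_split

# Hypothetical loading of the Adult/Income dataset; the file name
# and label column are assumptions.
df = pd.read_csv("adult.csv")
X, y = df.drop(columns=["income"]), df["income"]

# One-hot encode the 8 categorical features (the 4 numerical features
# pass through), turning 12 features into 98 + 4 = 102 columns.
X = pd.get_dummies(X)

# 80:20 split, matching the Train/Test column: 26048/6513 for Adult
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

# With batch_size=64, the 6513 test instances give 101 full batches,
# i.e. 6464 adversarial instances (the Batch/Adv Inst. column).
n_batches = len(X_test) // 64
print(n_batches, "batches,", n_batches * 64, "adversarial instances")
```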
- All model parameters can be found in `utils/models.py`
- Model training is performed in `1_AE_model_training.ipynb` (a hedged training sketch appears at the end of this README)
- Attack methods and parameters in:
- Adversarial examples are generated by `2_generate_ae.py` and `generate_ae.bat` (a hedged attack sketch also appears at the end of this README)
- Generated adversarial examples are stored in the `results` folder
- Adversarial examples are evaluated in `3_AE_performance_metrics.ipynb`
- Raw results can be found in the `results` folder
- Formatted result tables of the qualitative analysis can be found in `qualitative analysis`
- Some visualisations are produced in `5_visualisation.ipynb`, `5_visualisation2.ipynb` and `5_visualisation_poster.ipynb`. The visualisations are saved in the `Visualisation` folder.
- The poster PDF file can be found in `Poster/ADSN_2024_Poster.pdf`
- Created with $\LaTeX$; sources in `Poster/LaTex`. Inspired by the baposter template and SuperBruceJia's poster on GitHub.
- Due to the poster page limit, some details are omitted in the poster. Please refer to the paper for more details.
- More visualisations can be found in the markdown file `visualisation.md`.
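Below is a minimal sketch of the model-training step referenced above (see `1_AE_model_training.ipynb`). The architecture, hyperparameters and synthetic data are hypothetical stand-ins, not the actual configuration in `utils/models.py`.

```python
import torch
import torch.nn as nn

# Synthetic stand-in data: 614 training instances with 8 features
# (the Diabetes row in the table above), two classes
X = torch.randn(614, 8)
y = torch.randint(0, 2, (614,))

# Hypothetical feed-forward classifier for tabular input
model = nn.Sequential(nn.Linear(8, 64), nn.ReLU(), nn.Linear(64, 2))
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

for epoch in range(20):
    # mini-batches of 64, matching batch_size in the table above
    for i in range(0, len(X), 64):
        xb, yb = X[i:i + 64], y[i:i + 64]
        opt.zero_grad()
        loss_fn(model(xb), yb).backward()
        opt.step()
```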
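And here is the attack sketch referenced above: a one-step bounded (L-infinity) attack, FGSM, shown as a representative example of the bounded attacks the paper discusses. It is an illustrative stand-in for, not a copy of, the attack implementations driven by `2_generate_ae.py`; the model and `eps` value are assumptions.

```python
import torch
import torch.nn as nn

def fgsm(model, x, y, eps=0.1):
    """One-step bounded attack: shift every feature by +/- eps in the
    direction that increases the classification loss."""
    x = x.clone().detach().requires_grad_(True)
    loss = nn.CrossEntropyLoss()(model(x), y)
    loss.backward()
    return (x + eps * x.grad.sign()).detach()

# Hypothetical model and a batch of 64 instances with 30 features
# (the Breast Cancer row in the table above)
model = nn.Sequential(nn.Linear(30, 64), nn.ReLU(), nn.Linear(64, 2))
x, y = torch.randn(64, 30), torch.randint(0, 2, (64,))

x_adv = fgsm(model, x, y)
print("fraction of predictions flipped:",
      (model(x_adv).argmax(1) != model(x).argmax(1)).float().mean().item())
```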