This work presents a black-box membership inference attack framework tailored to the latest generation of image generation models. The framework exploits these models' memorization of their training data, using the reconstruction distance between generated images and the query image as the attack feature.
This repository contains code for:
- Using LoRA to fine-tune Stable Diffusion on a customized dataset.
- Generating images from the fine-tuned models based on different attack scenarios.
- Calculating the reconstruction distance between the generated and query images.
- Using the reconstruction distance as features to train an inference model, including Threshold-based, Distribution-based, and Classifier-based approaches.
Contents:
- Environment Setup
- Fine-tune image generator
- Generate Images from Models
- Calculate Reconstruction Distance
- Test Attack Accuracy
- Citation
## Environment Setup

Before running the code, make sure all dependencies are installed. Create the conda environment from environment.yml:
```bash
conda env create -f environment.yml
```
Then initialize an 🤗Accelerate environment with:
```bash
accelerate config
```
## Fine-tune image generator

After preparing the dataset, we use the 🤗diffusers `train_text_to_image_lora.py` script to fine-tune Stable Diffusion v1-5:
```bash
accelerate launch train_text_to_image_lora.py \
  --pretrained_model_name_or_path="runwayml/stable-diffusion-v1-5" \
  --train_data_dir=prepare_dataset \
  --dataloader_num_workers=8 \
  --resolution=512 --center_crop --random_flip \
  --project_name="SD v1-5" \
  --train_batch_size=4 \
  --gradient_accumulation_steps=4 \
  --max_train_steps=62500 \
  --learning_rate=1e-04 \
  --max_grad_norm=1 \
  --lr_scheduler="cosine" --lr_warmup_steps=0 \
  --output_dir=output_dir \
  --report_to=wandb \
  --resume_from_checkpoint="latest" \
  --checkpointing_steps=12500 \
  --validation_prompt=valid_prompt \
  --seed=1337
```
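For reference, `--train_data_dir` is expected to follow the 🤗datasets ImageFolder layout used by `train_text_to_image_lora.py`; the caption column name (`text` below) is the script's default and can be changed with `--caption_column`. The file names and captions here are only illustrative:

```text
prepare_dataset/
├── metadata.jsonl    # one JSON object per image, e.g. {"file_name": "0001.png", "text": "a caption"}
├── 0001.png
└── 0002.png
```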
In our attack scenarios, Attack-II and Attack-IV query the model without access to the image captions. The captioning model therefore needs to be fine-tuned on an auxiliary dataset first:
```bash
python3 blip_finetune.py --data_dir auxiliary-dataset-dir
```
## Generate Images from Models

After fine-tuning Stable Diffusion, generate images from the resulting model:
```bash
accelerate launch --gpu_ids 0 --main_process_port=28500 inference.py \
  --pretrained_model_name_or_path="runwayml/stable-diffusion-v1-5" \
  --num_validation_images=3 \
  --inference=30 \
  --output_dir=checkpoints-dir \
  --data_dir=dataset-dir \
  --save_dir=save-dir \
  --seed=1337
```
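Under the hood, this step amounts to loading the LoRA weights into a Stable Diffusion pipeline and sampling several reconstructions per query prompt. The sketch below is only illustrative (paths, the prompt, and file names are placeholders), not the exact logic of `inference.py`:

```python
import torch
from diffusers import StableDiffusionPipeline

# Load the base model and attach the LoRA weights produced by fine-tuning.
pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")
pipe.load_lora_weights("checkpoints-dir")  # placeholder LoRA checkpoint directory

# Sample several candidate reconstructions for one query caption.
generator = torch.Generator("cuda").manual_seed(1337)
images = pipe(
    "caption of the query image",   # placeholder prompt
    num_inference_steps=30,         # --inference=30
    num_images_per_prompt=3,        # --num_validation_images=3
    generator=generator,
).images
for i, image in enumerate(images):
    image.save(f"sample_{i}.png")
```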
Attack-II and Attack-IV additionally rely on the fine-tuned captioning model to produce the prompts used for image generation:
```bash
python3 build_caption.py \
  --data_dir=data-dir \
  --pretrained_model_name_or_path="runwayml/stable-diffusion-v1-5" \
  --output_dir=checkpoints-dir \
  --seed=1337 \
  --inference_step=30 \
  --model_id=model-id \
  --num_validation_images=3 \
  --save_dir=save-dir \
  --gpu_id=0
```
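For reference, the captioning step roughly corresponds to running the fine-tuned BLIP model on the query image and feeding the resulting caption to the pipeline above as the prompt. This is a hedged sketch, not the exact logic of `build_caption.py`; the checkpoint path, file name, and generation settings are placeholders:

```python
from PIL import Image
from transformers import BlipProcessor, BlipForConditionalGeneration

# Load the BLIP captioner fine-tuned on the auxiliary dataset (placeholder path).
processor = BlipProcessor.from_pretrained("Salesforce/blip-image-captioning-base")
captioner = BlipForConditionalGeneration.from_pretrained("blip-finetuned-checkpoint").to("cuda")

# Caption the query image; the caption becomes the generation prompt.
query = Image.open("query.png").convert("RGB")
inputs = processor(images=query, return_tensors="pt").to("cuda")
output_ids = captioner.generate(**inputs, max_new_tokens=30)
caption = processor.decode(output_ids[0], skip_special_tokens=True)
print(caption)
```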
## Calculate Reconstruction Distance

Calculate the reconstruction distance between the generated images and the query image:
```bash
python3 cal_embedding.py \
  --data_dir=data-dir \
  --sample_file=image-save-dir \
  --membership=0 \
  --img_num=3 \
  --gpu=0 \
  --save_dir=distance-save-dir
```
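One plausible way to compute such a distance (not necessarily the exact metric used by `cal_embedding.py`) is to embed the query image and each generated image with a pre-trained image encoder and take the smallest pairwise distance. The encoder choice and file names below are assumptions:

```python
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

# Assumed encoder: CLIP ViT-B/32 image features.
model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32").eval()
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

def embed(path):
    image = Image.open(path).convert("RGB")
    inputs = processor(images=image, return_tensors="pt")
    with torch.no_grad():
        return model.get_image_features(**inputs).squeeze(0)

query = embed("query.png")
generated = [embed(f"sample_{i}.png") for i in range(3)]  # --img_num=3 generated samples

# Reconstruction distance: smallest L2 distance between the query and any generated image.
distance = min(torch.dist(query, g).item() for g in generated)
print(distance)
```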
## Test Attack Accuracy

Use the reconstruction distances to train an inference model that predicts the membership status of the query data:
```bash
python3 test_accuracy.py \
  --target_member_dir=target-member-dir \
  --target_non_member_dir=target-non_member-dir \
  --shadow_member_dir=shadow-member-dir \
  --shadow_non_member_dir=shadow-non_member-dir \
  --method="classifier"
```
## Citation

```bibtex
@article{pang2023black,
  title={Black-box membership inference attacks against fine-tuned diffusion models},
  author={Pang, Yan and Wang, Tianhao},
  journal={arXiv preprint arXiv:2312.08207},
  year={2023}
}
```