Name		Name	Last commit message	Last commit date
parent directory ..
attack		attack
configs		configs
data		data
models		models
output		output
transform		transform
BLIP.gif		BLIP.gif
CODEOWNERS		CODEOWNERS
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE.txt		LICENSE.txt
README.md		README.md
SECURITY.md		SECURITY.md
attack_nlvr.py		attack_nlvr.py
attack_vqa.py		attack_vqa.py
cog.yaml		cog.yaml
filter_words.py		filter_words.py
prepare_nlvr.py		prepare_nlvr.py
prepare_vqa.py		prepare_vqa.py
requirements.txt		requirements.txt
utils.py		utils.py

README.md

VLAttack on the BLIP model

[BLIP Paper]

In this repository, we test VLAttack through the VQA task and NLVR task on the VQAv2 and NLVR2 datasets, respectively. We conducted VLAttack on 5K correctly predicted samples. Instructions are shown below:

Pre-trained Model Preparation

Firstly, download the pretrained BLIP model weights (BLIP with ViT-B, 14M) from the BLIP original repository. We use these weights to generate adversarial samples in our work.

Attack VQAv2

Download the VQAv2 dataset from the original website, and then set the vqa_root in ./configs/vqa.yaml
Download the finetuned VQAv2 model weights from the original repo of BLIP. Specifically, the finetuned model weights can be downloaded from here. Don't forget to set the pretrain in ./configs/vqa.yaml with the path of model_vqa.pth.
Find 5K correctly predicted samples using the python prepare_vqa.py command. After running, it will generate right_vqa_list.txt and right_vqa_ans_table.txt, which store the indexes and predictions of correctly predicted samples.
To conduct VLAttack on the VQAv2 dataset, use the python attack_vqa.py command with different --method options shown below:

Method Options:
- BSA (ours)
- VLAttack (ours)
- Co-Attack
- BERTAttack
Command: Replace METHOD_NAME with your chosen options from above:
```
python attack_vqa.py --method METHOD_NAME
```

Attack NLVR2

Download the NLVR2 dataset from the original website, and then set the image_root in ./configs/nlvr.yaml
Download the finetuned NLVR2 model weights from the original repo of BLIP. Specifically, the finetuned model weights can be downloaded from here. Don't forget to set the pretrain in ./configs/nlvr.yaml with the path of model_base_nlvr.pth.
Find 5K correctly predicted samples using the python prepare_nlvr.py command. After running, it will generate right_nlvr_list.txt and right_nlvr_ans_table.txt, which store the indexes and predictions of correctly predicted samples.
To conduct VLAttack on the NLVR2 dataset, use the python attack_nlvr.py command with above --method options. For example, run below command to conduct VLAttack:

python attack_nlvr.py --method VLAttack

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BLIP_attack

BLIP_attack

README.md

VLAttack on the BLIP model

Pre-trained Model Preparation

Attack VQAv2

Attack NLVR2

Files

BLIP_attack

Directory actions

More options

Directory actions

More options

Latest commit

History

BLIP_attack

Folders and files

parent directory

README.md

VLAttack on the BLIP model

Pre-trained Model Preparation

Attack VQAv2

Attack NLVR2