Vision-LLMs Can Fool Themselves with Self-Generated Typographic Attacks

Official implementation of Vision-LLMs Can Fool Themselves with Self-Generated Typographic Attacks.

TODO

Add Attack Generation Code.
Add Eval Code.

Setup

Prepare dataset.

StanfordCars
Download StanfordCars dataset
Aircraft
Download Aircraft
OxfordPets
Download OxfordPets
Food101
Download Food101
Flowers
Download Flowers

For each dataset, set the right path at dataset_eval.py and dataset_eval_gpt4.py

Model Setup

To setup

LLaVA/InstructBLIP, simply make sure you have transformers installed.
MiniGPT4, make sure to clone their repo, follow their instructions, and then set up the path to the config file in utils.py line 236.

Eval.

To evaluate LLaVA/InstructBlip/MiniGPT-4, run:

python dataset_eval.py --model [llava/blip/minigpt4] --method [Method] --dataset [Dataset]

To evaluate GPT-4, first set your api key at utils_models/utils_gpt4.py, and then run:

python dataset_eval_gpt4.py --method [Method] --dataset [Dataset]

Citation

If you find this repository useful please give it a star and cite as follows! :) :

    @article{qraitem2024vision,
    title={Vision-LLMs Can Fool Themselves with Self-Generated Typographic Attacks},
    author={Qraitem, Maan and Tasnim, Nazia and Saenko, Kate and Plummer, Bryan A},
    journal={arXiv preprint arXiv:2402.00626},
    year={2024}
    }

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
datasets		datasets
outputs		outputs
utils_eval		utils_eval
utils_models		utils_models
README.md		README.md
compute_results.py		compute_results.py
dataset_eval.py		dataset_eval.py
dataset_eval_gpt4.py		dataset_eval_gpt4.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vision-LLMs Can Fool Themselves with Self-Generated Typographic Attacks

TODO

Setup

Prepare dataset.

Model Setup

Eval.

Citation

About

Releases

Packages

Languages

mqraitem/Self-Gen-Typo-Attack

Folders and files

Latest commit

History

Repository files navigation

Vision-LLMs Can Fool Themselves with Self-Generated Typographic Attacks

TODO

Setup

Prepare dataset.

Model Setup

Eval.

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages