ADV-LLM

2025/1/22 update: Our paper has been accepted to NAACL2025 main conference!

This is the official repository for the paper: Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities. Work done during the internship at Microsoft Research.

The code and models are still under review by Microsoft Research. We plan to release it very soon!

Overview

Cite this work

Chung-En Sun, Xiaodong Liu, Weiwei Yang, Tsui-Wei Weng, Hao Cheng, Aidan San, Michel Galley, Jianfeng Gao, "Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities"

@article{advllm,
   title={Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities},
   author={Chung-En Sun, Xiaodong Liu, Weiwei Yang, Tsui-Wei Weng, Hao Cheng, Aidan San, Michel Galley, Jianfeng Gao},
   journal={NAACL},
   year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
fig		fig
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ADV-LLM

Overview

Cite this work

About

Releases

Packages

License

SunChungEn/ADV-LLM

Folders and files

Latest commit

History

Repository files navigation

ADV-LLM

Overview

Cite this work

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages