VulScribeR

Official repository for our paper:

VulScribeR: Exploring RAG-based Vulnerability Augmentation with LLMs

Datasets

Our Generated Vulnerable Samples

Filtered Datasets for All RQs, Unfiltered Datasets for All RQs
The unfiltered datasets contain samples produced by the Generator that have not gone through the Verification phase. They also include extra metadata showing which clean_vul pair was used for generation, along with the vulnerable lines.
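
As a rough illustration of how the extra metadata might be used, the sketch below counts how many generated samples came from each clean_vul pair. It assumes the samples are stored as JSON Lines and that the pair identifier lives under a key named `clean_vul_pair`; both the file format and every field name here are hypothetical, so check the actual dataset files for the real schema before relying on it.

```python
import json
from collections import Counter
from typing import Iterable

# ASSUMPTION: this key name is hypothetical and not taken from the dataset.
PAIR_KEY = "clean_vul_pair"


def count_samples_per_pair(lines: Iterable[str]) -> Counter:
    """Count generated samples per source clean_vul pair in JSON Lines text."""
    counts: Counter = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        # Records without the (assumed) key are grouped under "unknown".
        counts[str(record.get(PAIR_KEY, "unknown"))] += 1
    return counts


# Tiny in-memory stand-in for an unfiltered dataset file (made-up records).
example = [
    '{"code": "int f(void);", "clean_vul_pair": "pair_1", "vul_lines": [3]}',
    '{"code": "int g(void);", "clean_vul_pair": "pair_1", "vul_lines": [5, 6]}',
    '{"code": "int h(void);", "clean_vul_pair": "pair_2", "vul_lines": [2]}',
]
print(count_samples_per_pair(example))
```

To run it on a real file, pass an open file handle instead of the `example` list, after adjusting `PAIR_KEY` to match the actual metadata field.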

How to use?

See here

How to train DLVD models

Go to the models directory; the README for each model explains how to use it.


