SMILES-Prompting: A Novel Approach to LLM Jailbreak Attacks in Chemical Synthesis

Jailbreak Examples in Chemical Synthesis

In this section, we utilize the synthesis of TNT as a representative case to examine the effects of different prompting strategies on the attack GPT-4-o and Llama-3-70B-Instruct. By comparing these approaches, we highlight how varying prompts can influence the performance and vulnerability of each model under attack scenarios.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

SMILES-Prompting: A Novel Approach to LLM Jailbreak Attacks in Chemical Synthesis

Jailbreak Examples in Chemical Synthesis

Red-Team Prompting

Explicit-Prompting

Implicit-Prompting

SMILES-Prompting

Files

README.md

Latest commit

History

README.md

File metadata and controls

SMILES-Prompting: A Novel Approach to LLM Jailbreak Attacks in Chemical Synthesis

Jailbreak Examples in Chemical Synthesis

Red-Team Prompting

Explicit-Prompting

Implicit-Prompting

SMILES-Prompting