Enable more complex `prompts` #602

jmartin-tech · 2024-04-11T20:19:38Z

The current generator interface expects to receive prompts as str see: https://github.com/leondz/garak/blob/4127ae5092ad3acaba680a32011018fc564cc92a/garak/generators/base.py#L66

This initial simple submission process has worked to date; however #587 show an example of a query prompt that needs a more complex structure. In this case the Multi-modal model accepts both text and image data to generate a response.

I propose an added abstraction layer by implementing a Prompt base interface class that be extended to model these more complex prompts to be processed by each generator.

def generate(self, prompt: Prompt) -> List[str]:

or possibly also abstracting the response as well:

def generate(self, prompt: Prompt) -> List[PromptResponse]:

Prompts can then be further segmented into things like TextPrompt, MultiStepTextPrompt, VisualPrompt, VisualTextPrompt and other such constructs to that on the base functions available to allow use with different and even mixed prompt modalities for models that can accept various input patterns.

Rough example:

class Prompt:
    text = None

    def str(self)
        return self.text

class TextPrompt(Prompt):
    def __init__(self, text: str):
        self.text = text

class VisualTextPrompt(Prompt):
    image
    def __init__(self, text: str, image_path: str):
        self.text = text
        try:
            Image.open(image_path)
         except Exception:
             logger.error(f"No image found at: {image_path}")

The text was updated successfully, but these errors were encountered:

jmartin-tech · 2024-11-15T19:01:03Z

Another recent finding related to multi-modal prompts is a need to define relationships between parts of the prompt. The case identified is that some models request formats may have different expectations for referencing images in text. The current visual_jailbreak prompts include a placeholder in the text segment of the prompt that some models may need to remove or replace with an API specific linking/embedding.

jmartin-tech mentioned this issue Apr 11, 2024

[New Features] Multi-modal Jailbreaking Attack on LLaVA #587

Merged

leondz modified the milestones: release 0.9.1, rel Apr 18, 2024

leondz added the architecture Architectural upgrades label Apr 22, 2024

leondz modified the milestones: rel, release 0.9.1 Apr 23, 2024

DavidLee528 mentioned this issue May 5, 2024

Prompt Architecture Enhancement for Better Multi-modal Red Teaming #658

Draft

jmartin-tech mentioned this issue Jun 14, 2024

require probes to match input modality of the generator #738

Merged

jmartin-tech mentioned this issue Jun 28, 2024

probes: add ArtPrompt probes #617

Draft

leondz mentioned this issue Oct 25, 2024

generator: vision nims #959

Merged

3 tasks

leondz modified the milestones: release 0.9.1, 25.02 Efficiency Jan 14, 2025

leondz self-assigned this Jan 15, 2025

leondz linked a pull request Jan 27, 2025 that will close this issue

Migrate string output/input to Turn objects #1089

Open

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable more complex `prompts` #602

Enable more complex `prompts` #602

jmartin-tech commented Apr 11, 2024 •

edited

Loading

jmartin-tech commented Nov 15, 2024

Enable more complex prompts #602

Enable more complex prompts #602

Comments

jmartin-tech commented Apr 11, 2024 • edited Loading

jmartin-tech commented Nov 15, 2024

Enable more complex `prompts` #602

Enable more complex `prompts` #602

jmartin-tech commented Apr 11, 2024 •

edited

Loading