More friendly output formats #11
Comments
@dxoigmn I wanted your help in clarifying some questions.
Q1. Is the intention of this issue to make the output of llmart look more like that of tools such as Inspect and HarmBench, i.e. an interface showing the generated adversarial prompts and the responses from custom models, like this?
Q2. Or is the intention to add support for more trackers, such as WandB, CometML, Aim, MLflow, ClearML, and DVCLive, by modifying the section below? Lines 43 to 51 in 0a49cd9
Q3. Or is the intention to convert the protobuf output to more universal formats such as JSON, HDF5, CSV, etc.?
Q4. Could you please explain how this issue relates to #12, and what kind of solution you have in mind that links and fixes both of them?
It would be nice if our tool could output a dataset that is consumable by those tools. HarmBench, for example, has an evaluation script. Our tool should output a file that we can then pass as
No, we are happy with TensorBoard for now.
Yes, one option is to write a separate tool that consumes TensorBoard outputs and turns them into something consumable by HarmBench's eval script.
The relationship is that whatever this new tool outputs should be the attack with the lowest loss, not just the last attack.
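To make the "lowest loss, not the last attack" requirement concrete, here is a minimal sketch of the selection and export step such a tool might perform. It assumes the per-step loss and attack text have already been extracted from the TensorBoard event files. The names `AttackRecord`, `select_best_attack`, and `to_json`, as well as the JSON field names, are hypothetical. They are not part of llmart, and they do not reflect the input schema of HarmBench's eval script.

```python
import json
from dataclasses import asdict, dataclass


@dataclass
class AttackRecord:
    """Hypothetical container for one logged optimization step."""
    step: int    # optimization step at which the attack was logged
    loss: float  # loss value logged for that step
    attack: str  # adversarial prompt text logged for that step


def select_best_attack(records):
    """Return the record with the lowest loss (the earliest step wins ties)."""
    if not records:
        raise ValueError("no attack records to choose from")
    return min(records, key=lambda r: (r.loss, r.step))


def to_json(record):
    """Serialize one record as a JSON object string (illustrative field names)."""
    return json.dumps(asdict(record), ensure_ascii=False)


# Made-up example values: the last step is not the best one.
records = [
    AttackRecord(step=0, loss=2.31, attack="suffix A"),
    AttackRecord(step=1, loss=0.87, attack="suffix B"),
    AttackRecord(step=2, loss=1.05, attack="suffix C"),
]
best = select_best_attack(records)
print(to_json(best))
```

With these made-up values the sketch selects step 1 rather than the final step 2. Mapping the resulting JSON onto whatever layout a downstream evaluator expects would be a separate step.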
The primary way we inspect outputs is via TensorBoard. However, most other tools cannot consume TensorBoard summaries (protobufs). Your job is to research output formats that can feed into other tools (e.g., Inspect, HarmBench) and add support for these formats.
This is probably best done in conjunction with #12.