feat(plugins): ai-prompt-guard plugin #12230

ttyS0e · 2023-12-21T05:14:22Z

Summary

This commit offers another plugin that extends the functionality of "AI Proxy" in #12207.

It compares the user's llm/v1/chat or llm/v1/completions request against a series of regular expressions, in two config arrays:

Allow
Deny

If the request matches any regex pattern in deny, the caller is 400'd.

If any allow pattern is specified, but the request matches none of them, the caller is also 400'd.
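The allow/deny rules above can be sketched roughly as follows. This is a minimal illustration, not the plugin's actual code: the function name check_prompt, the injected matches callback, and the reason strings are all hypothetical, and checking deny before allow is an assumption (either order gives the same accept/reject result). The matcher is injected so the sketch runs in plain Lua; inside Kong it would presumably wrap ngx.re.find.

```lua
-- Hypothetical helper: returns true if the prompt may pass,
-- or false plus a reason if the caller should get a 400.
-- `matches(subject, pattern)` is injected so this runs outside OpenResty.
local function check_prompt(prompt, allow_patterns, deny_patterns, matches)
  -- Any single deny match rejects the request immediately.
  for _, pattern in ipairs(deny_patterns or {}) do
    if matches(prompt, pattern) then
      return false, "prompt matches a deny pattern"
    end
  end

  -- With no allow patterns configured, anything not denied passes.
  if not allow_patterns or #allow_patterns == 0 then
    return true
  end

  -- Otherwise at least one allow pattern has to match.
  for _, pattern in ipairs(allow_patterns) do
    if matches(prompt, pattern) then
      return true
    end
  end

  return false, "prompt matches no allow pattern"
end

-- Stand-in matcher for this sketch only: plain substring search, not regex.
local function plain_matcher(subject, pattern)
  return string.find(subject, pattern, 1, true) ~= nil
end

-- Inside Kong/OpenResty the matcher could wrap ngx.re.find instead
-- (assumption; not called here because `ngx` is absent in plain Lua).
local function ngx_matcher(subject, pattern)
  return ngx.re.find(subject, pattern, "jo") ~= nil
end

print(check_prompt("tell me a joke", { "joke" }, { "password" }, plain_matcher))
print(check_prompt("what is the admin password", {}, { "password" }, plain_matcher))
```

The first call passes because "joke" is allowed and nothing is denied; the second is rejected by the deny list even though no allow list is configured.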

An engineering design document is available for this feature, though the design is quite simple. Comprehensive tests are supplied.

It was developed because many of our users would like to block specific prompts, words, or phrases, or otherwise more tightly control how an AI / LLM model is used when it is called via Kong. This applies especially with the AI Proxy plugin, which will simplify this process.

Checklist

The Pull Request has tests
A changelog file has been created under changelog/unreleased/kong or skip-changelog label added on PR if changelog is unnecessary. README.md
There is a user-facing docs PR against https://github.com/Kong/docs.konghq.com - docs being discussed internally, as part of AI-Proxy plugin in feat(plugin): ai-proxy plugin #12207

fffonion

I'm wondering if this plugin can be just part of the ai-proxy plugin. The functionality of this plugin, aside from its name, seems not strictly related to AI workload.
So I would suggest we either move this to the ai-proxy plugin, or properly implement this
as a WAF feature.

ttyS0e · 2023-12-21T06:49:47Z

@fffonion Oh right sorry it's missing from the design...

This is separate so that it can be applied to e.g. a whole runtime group / Kong control plane, and govern all configured models...

Or to apply to one consumer to limit their AI usage.

kong/plugins/ai-prompt-guard/schema.lua

kong/plugins/ai-prompt-guard/access.lua

kong/plugins/ai-prompt-guard/handler.lua

kong/plugins/ai-prompt-guard/schema.lua

flrgh · 2024-01-10T01:10:36Z

kong/plugins/ai-prompt-guard/access.lua

+  if conf.deny_patterns and #conf.deny_patterns > 0 then
+    for i, v in ipairs(conf.deny_patterns) do
+      -- check each denylist; if prompt matches it, deny immediately
+      local m, err = ngx.re.match(user_prompt, v)


Prefer ngx.re.find over ngx.re.match when you don't care about capturing anything in the pattern and just need to know if it matches or not:

if ngx.re.find("subject", "pattern", "jo") then
  print("it matched!")
end

flrgh · 2024-01-10T01:14:45Z

kong/plugins/ai-prompt-guard/access.lua

+    for i, v in ipairs(conf.deny_patterns) do
+      -- check each denylist; if prompt matches it, deny immediately
+      local m, err = ngx.re.match(user_prompt, v)
+      if err then return do_internal_server_error("bad regex execution for: " .. v) end


Let's make this condition unreachable at runtime by using a custom validator for {allow,deny}_patterns in your plugin's schema.lua file:

local function is_valid_regex(s)
  local _, _, err = ngx.re.find("", s)
  if err then
    return nil, "invalid regex: " .. err
  end
  return true
end

assert(is_valid_regex("^(.*"))

ERROR: t.lua:10: invalid regex: pcre_compile() failed: missing ) in "^(.*"
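One way the validator might be attached to both pattern arrays is sketched below. This is an assumption about how schema.lua could look, not the file from this PR: the pattern_array helper is hypothetical, and fields a real plugin schema normally carries (such as protocols) are omitted. It relies on the custom_validator field attribute of Kong's schema library, so a bad pattern would be rejected when the plugin is configured rather than at request time.

```lua
-- Validator from the review comment: try compiling the pattern against
-- an empty subject and surface any compile error.
local function is_valid_regex(s)
  local _, _, err = ngx.re.find("", s)
  if err then
    return nil, "invalid regex: " .. err
  end
  return true
end

-- Hypothetical helper: build a fresh array-of-strings field definition
-- whose elements are each checked by the validator above.
local function pattern_array()
  return {
    type = "array",
    elements = {
      type = "string",
      custom_validator = is_valid_regex,
    },
  }
end

-- Partial schema sketch; field names follow the {allow,deny}_patterns
-- naming used in the review, everything else is illustrative.
local schema = {
  name = "ai-prompt-guard",
  fields = {
    { config = {
        type = "record",
        fields = {
          { allow_patterns = pattern_array() },
          { deny_patterns = pattern_array() },
        },
    } },
  },
}

-- In an actual schema.lua, the file would end by returning this table.
```

With validation done at configuration time, the `if err then` branch in access.lua should only be reachable if something unexpected goes wrong inside the regex engine.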

kong/plugins/ai-prompt-guard/access.lua

tysoekong · 2024-01-12T08:58:05Z

This is all being addressed in #12337

feat(plugins): ai-prompt-guard plugin

620171a

pull-request-size bot added the size/L label Dec 21, 2023

github-actions bot added chore Not part of the core functionality of kong, but still needed schema-change-noteworthy labels Dec 21, 2023

team-eng-enablement added the author/community PRs from the open-source community (not Kong Inc) label Dec 21, 2023

fffonion reviewed Dec 21, 2023

fix(ai-prompt-guard): various syntax fixes to schema; compatibility w…

0a7b7cd

…ith skip_transformation instruction for ai-proxy