More strict SQL validation #119

amoffat · 2023-08-24T05:38:19Z

Hey there, cool project. This prompt constraint caught my eye. It seems you are soft validating the resulting SQL query, which can be risky.

I have written a library to do hard validation, HeimdaLLM. It uses a grammar to parse, validate, and potentially edit the query created from an LLM. It gives you a frontend for rigorously constraining the output query so that it can only perform safe actions. You can read more about the attack surface that it addresses here.

Is there any interest in collaborating? HeimdaLLM provides the rigorous query validation, and dataherald could provide the LLM integration? Thoughts?

aazo11 · 2023-08-26T04:44:12Z

Hi @amoffat -- thanks for reaching out. We just merged this PR to do some more stringent blocking of DML statements, but are definitely interested to learn more and always open to collaborating. We will take a look at HeimdaLLM in the meanwhile.

amoffat · 2023-08-26T05:56:50Z

Some constructive criticism.. the regex in the PR appears to have false positives:

select field from table where field="hugh grant";

Postgres also has the concept of "non-reserved" keywords, which are keywords that can be used unquoted as column names:

select update from table;

HeimdaLLM uses full SQL grammars+parsers for each SQL dialect that can handle these cases, as well as restrict what columns can be selected and joined on. I don't want to shill my project on your project's issues, so I'll just say I'm passionate about this problem and if you'd like to chat more, reach out on the email in my profile.

amoffat closed this as completed Aug 27, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More strict SQL validation #119

More strict SQL validation #119

amoffat commented Aug 24, 2023 •

edited

Loading

aazo11 commented Aug 26, 2023

amoffat commented Aug 26, 2023 •

edited

Loading

More strict SQL validation #119

More strict SQL validation #119

Comments

amoffat commented Aug 24, 2023 • edited Loading

aazo11 commented Aug 26, 2023

amoffat commented Aug 26, 2023 • edited Loading

amoffat commented Aug 24, 2023 •

edited

Loading

amoffat commented Aug 26, 2023 •

edited

Loading