
Add query performance statistics #7869

Merged
merged 32 commits into from
Nov 21, 2024
Conversation

fantix
Member

@fantix fantix commented Oct 16, 2024

This is the 2nd take after #7814, forking the builtin pg_stat_statements extension from the upstream master branch. It differs in that we extract the JSON query info only once per query across parse/plan/execute runs (unless a reset cleared the hashtable row), and some custom stats columns are stored directly as hashtable columns.

Please review each commit separately.

Add the edb_stat_statements Postgres extension (forked from the master
branch of the upstream pg_stat_statements extension) to handle custom
query performance statistics. sys::QueryStats is added as a view of
the statistics.

This is done in a way that, for each stats-significant SQL query we send to
the backend, one or more comment lines of "query stats info" JSON are
prepended for the Postgres extension to ingest and record in the modified
statistics hash table. Among the stats info fields, `id: uuid` is especially
important: it identifies distinct queries and accumulates stats of the same
query onto the same hash table entry, and it reflects the settings that
affected the compilation (excluding the user schema version, for common
grouping of stats). In particular, the first 8 bytes of `id` are also used by
the Postgres extension to replace the underlying queryId of the SQL
statement, so that the same frontend query can be recognized across all
PARSE/EXECUTE operations in Postgres for stats recording.

System queries, DDLs, and unrecognized queries are not recorded.

Refs #7725
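The queryId substitution described above can be sketched in plain C. This is a hedged illustration only: the helper name `query_id_from_uuid` is made up for this sketch and is not the extension's actual API.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Illustrative sketch: derive a Postgres queryId from the first 8 bytes
 * of the frontend query's stats-info "id" UUID, as described above.
 * The helper name is hypothetical, not the extension's real function. */
static uint64_t
query_id_from_uuid(const unsigned char uuid_bytes[16])
{
    uint64_t query_id;
    memcpy(&query_id, uuid_bytes, sizeof(query_id)); /* first 8 bytes only */
    return query_id;
}
```

Because the same `id` is prepended on every PARSE/EXECUTE of the same frontend query, the derived queryId stays stable and the stats accumulate onto one hash table entry.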

* Add query info JSON to EdgeQL-compiled SQL
* Extract info in the extension and track original query
* Add `cache_key` in the stats hash table
* Add view of `sys::QueryStats`
* Add basic test
@fantix fantix marked this pull request as ready for review October 18, 2024 04:11
@fantix fantix requested review from msullivan and elprans October 18, 2024 04:11
Member

@msullivan msullivan left a comment


OK this all looks good except I realize I might not understand the memory management model in play here.

@1st1
Member

1st1 commented Nov 12, 2024

As discussed on the call, we need to make a few adjustments.

  • As @elprans mentioned, we want to exclude the migration hash from the query id, so that schema migrations don't "reset" the stats. Most schema migrations are relatively minor for real-world applications, affecting only a subset of queries anyway.

  • We want to add query "tagging". The motivation is to make it easier for users to browse the query log and identify which queries come from EdgeDB extensions like Auth, or from the EdgeDB UI (those can be quite slow, which could be confusing), or from various parts of the user application.

The plan for tagging is as follows:

  • Add a tag: string property to sys::QueryStats.
  • We should use "headers" in our protocol to apply tags on a query-by-query basis. The server should pass the "tag" to the compiler, which should embed it into the SQL query, which the extension can later parse out.
  • Our client libraries will likely get a withTag() method, so users will be able to run await client.withTag('myapp').query(...)

Tags should be documented as arbitrary strings. We can reserve the "edgedb/" namespace for EdgeDB-related things, e.g. "edgedb/ui", "edgedb/auth", etc.

I think we should put a max-length constraint on this field and limit tags to 200-or-something characters.

cc @elprans @msullivan and obviously @fantix
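As a rough illustration of the tagging plan, a tag could be validated against the proposed length cap and embedded into the prepended stats-info comment line. Everything here is an assumption for illustration: the JSON shape, the function name, and the exact cap are not a final format.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

#define MAX_TAG_LEN 200  /* assumed cap from the discussion above */

/* Hypothetical sketch: build a "-- {...}" info comment line carrying the
 * tag; returns -1 if the tag exceeds the assumed maximum length. The
 * JSON layout here is illustrative, not the extension's actual format. */
static int
format_tagged_info_line(char *buf, size_t buflen,
                        const char *query_id, const char *tag)
{
    if (strlen(tag) > MAX_TAG_LEN)
        return -1;
    return snprintf(buf, buflen,
                    "-- {\"id\": \"%s\", \"tag\": \"%s\"}\n",
                    query_id, tag);
}
```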

@msullivan
Member

I think Elvis wanted to inject the tag on the server side, to avoid compiler round trips.

If the same key exists on multiple lines, the first is effective
while the rest are simply ignored. Once all expected keys are found,
the remaining lines are also ignored.
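The first-key-wins rule above can be sketched as follows; the struct, fixed-size table, and function name are illustrative stand-ins, not the extension's actual hashtable code.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Sketch of the first-key-wins rule: once a key has been recorded,
 * later occurrences on subsequent info lines are ignored. Names and
 * the fixed-size table are illustrative only. */
struct info_field { const char *key; const char *value; };

static int
record_field(struct info_field *fields, size_t *nfields, size_t cap,
             const char *key, const char *value)
{
    for (size_t i = 0; i < *nfields; i++)
        if (strcmp(fields[i].key, key) == 0)
            return 0;               /* duplicate key: first one wins */
    if (*nfields == cap)
        return -1;                  /* table full */
    fields[*nfields].key = key;
    fields[*nfields].value = value;
    (*nfields)++;
    return 1;
}
```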
@fantix fantix force-pushed the query-stats-4 branch 2 times, most recently from 4784e0c to 205546a on November 19, 2024 18:20
Also, stop using cache_key as the stats entry ID,
calculate hash with the JSON string instead.
Member

@msullivan msullivan left a comment


This looks basically good but I want to make sure I understand the behavior in some of the edge cases where the input might be malformed.

In particular, what happens if uuid_in fails because the id is malformed?

(To some extent, this is just me not knowing postgres internals well. It might be that the answer is simple? Does failure do a longjmp or something and just abort everything?)

JsonParseErrorType parse_rv = pg_parse_json(lex, &sem);
freeJsonLexContext(lex);

if (parse_rv == JSON_SUCCESS)
Member


Does the state get mutated even if there is an error? I guess that's probably fine?

What happens to info_len on an error? Is it still updated with however much got consumed?

Member Author


Does the state get mutated even if there is an error? I guess that's probably fine?

Yes, it'll keep the parsed values and continue to the next line, which I think is fine.

What happens to info_len on an error? Is it still updated with however much got consumed?

On error, info_len will be updated to skip the whole failing line and restart on the next line.

if ((state.found & EDB_STMT_INFO_PARSE_REQUIRED) == EDB_STMT_INFO_PARSE_REQUIRED)
return info->query_id != UINT64CONST(0) ? info : NULL;

info_str += info_len + 1;
Member


The +1 makes me nervous about the case where there isn't a newline at the end of an info line?

Member


I'm not sure the case where there are untrusted entries in the query log is that likely, but this is C, so I want to be extra careful about our boundary cases.

Member Author


Right, good question. The edbss_extract_info_line() function will mark either the \n or the end of the query_str, so this +1 will either skip the \n properly, or go beyond the end of the query_str and cause the next call to edbss_extract_info_line() to return NULL.

Member


Because len is negative, at that point?

Member Author


Yes exactly
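The behavior discussed in this thread can be sketched with a simplified stand-in for edbss_extract_info_line(); the function name and shape below are illustrative, not the extension's real code.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Sketch of the scanning loop above: each call returns the next info
 * line, and *len counts up to (not including) the '\n' or the end of
 * the string. Advancing by len + 1 either skips the '\n' or steps past
 * the end, making the next call return NULL. Illustrative only. */
static const char *
next_info_line(const char *cursor, const char *query_end, size_t *len)
{
    if (cursor >= query_end)
        return NULL;                /* the +1 advance overshot the end */
    const char *nl = memchr(cursor, '\n', (size_t)(query_end - cursor));
    *len = (size_t)((nl ? nl : query_end) - cursor);
    return cursor;
}
```

With this shape, a final info line that lacks a trailing newline is still safe: the cursor lands one past the end, and the next call bails out before any read.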

Comment on lines +701 to +702
Datum id_datum = DirectFunctionCall1(uuid_in, CStringGetDatum(token));
pg_uuid_t *id_ptr = DatumGetUUIDP(id_datum);
Member


Can these fail?

Member Author


Yes. The outermost PG_CATCH in the current session will, like you said, do a longjmp and recover by sending an error to the peer. Such an error will be propagated to the client as an edb.errors.InvalidValueError:

ERROR 116169 - 2024-11-20T11:22:27.377 postgres: invalid input syntax for type uuid: "b2f8e457-a4f8-ab73-1979-afb333f9c"
INFO 116169 - 2024-11-20T11:22:27.377 postgres: -- {"query": "select\n    (<__std__::int64>$0 + <__std__::int64>$1)", "type": 1, "extras": "{\"cc\": {\"__internal_no_apply_query_rewrites\": false, \"__internal_query_reflschema\": false, \"__internal_testmode\": false, \"allow_bare_ddl\": \"AlwaysAllow\", \"allow_dml_in_functions\": false, \"allow_user_specified_id\": false, \"apply_access_policies\": true, \"force_database_error\": \"false\", \"query_cache_mode\": \"Default\", \"simple_scoping\": null, \"store_migration_sdl\": \"NeverStore\", \"warn_old_scoping\": null}, \"pv\": [3, 0], \"of\": \"BINARY\", \"e1\": false, \"il\": 101, \"ii\": false, \"in\": true, \"io\": false, \"dn\": \"default\"}", "id": "b2f8e457-a4f8-ab73-1979-afb333f9c"}
INFO 116169 - 2024-11-20T11:22:27.377 postgres: 	SELECT edgedb_v6_2f20a50ab0.__qh_bd20a1eba9bb696335db87182e5b207f(($1)::int8, ($2)::int8)
---------------------------------------------------------------------- Exception occurred: invalid input syntax for type std::uuid: "b2f8e457-a4f8-ab73-1979-afb333f9c" ----------------------------------------------------------------------

1. edb.errors.InvalidValueError: invalid input syntax for type std::uuid: "b2f8e457-a4f8-ab73-1979-afb333f9c"

------------------------------------------------------------------------------------------------------------------ Details -------------------------------------------------------------------------------------------------------------------


edb.errors.InvalidValueError: invalid input syntax for type std::uuid: "b2f8e457-a4f8-ab73-1979-afb333f9c"
ERROR 116111 _localdev 2024-11-20T11:22:27.377 asyncio: an error in edgedb protocol
protocol: <edb.server.protocol.binary.EdgeConnection object at 0x73c6cfabf370>
transport: <uvloop.loop._SSLProtocolTransport object at 0x73c6d48086c0>
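The recovery path described here relies on Postgres's PG_TRY/PG_CATCH machinery, which is built on setjmp/longjmp. A plain-C sketch of that mechanism, with no Postgres internals and purely illustrative names:

```c
#include <assert.h>
#include <setjmp.h>
#include <string.h>

/* Illustrative only: in Postgres, ereport(ERROR) effectively longjmps
 * back to the outermost PG_CATCH, which reports the error to the client
 * instead of crashing the session. */
static jmp_buf error_jmp;
static char error_msg[128];

static void
raise_error(const char *msg)          /* stands in for ereport(ERROR, ...) */
{
    strncpy(error_msg, msg, sizeof(error_msg) - 1);
    longjmp(error_jmp, 1);            /* never returns */
}

static const char *
parse_uuid_or_error(const char *token)
{
    if (strlen(token) != 36)          /* toy validity check only */
        raise_error("invalid input syntax for type uuid");
    return token;
}

static const char *
run_with_recovery(const char *token)  /* stands in for the PG_TRY block */
{
    if (setjmp(error_jmp) != 0)
        return error_msg;             /* "PG_CATCH": report, don't crash */
    return parse_uuid_or_error(token);
}
```

This mirrors the observed behavior: the malformed UUID aborts the current query via longjmp, the session recovers, and the error text reaches the client.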

@msullivan
Member

If you merge this now, please update #7725 with the remaining pending tasks so we can track them

@fantix fantix merged commit f5396fd into master Nov 21, 2024
23 checks passed
@fantix fantix deleted the query-stats-4 branch November 21, 2024 13:51
@fantix fantix mentioned this pull request Nov 21, 2024
7 tasks