Optimize render_stacktrace() #1571

adamchainz · 2022-01-12T09:42:40Z

This function is called by the SQL panel to render each frame of each stack trace.

Previously it used a Django template, which is slow. By generating the HTML with pure Python code, we can avoid much of this overhead.

I tried this change on a demo app I built that has 96 total queries, due to N+1 queries. The queries execute in 13ms due to use of SQLite, so the whole view is a negligible concern. Nearly all the view runtime is the toolbar itself. Without this change, the runtime is ~1300ms; with the change it’s ~1100ms. That's a saving of 15%.

I also checked the appearance of the generated HTML hasn’t changed.

This function is called by the SQL panel to render each frame of each stack trace. Previously it used a Django template, which is slow. By generating the HTML with pure Python code, we can avoid much of this overhead. I tried this change on a demo app I built that has 96 total queries, due to N+1 queries. The queries execute in 13ms due to use of SQLite, so the whole view is a negligible concern. Nearly all the view runtime is the toolbar itself. Without this change, the runtime is ~1300ms; with the change it’s ~1100ms. That's a saving of **15%**. I also checked the appearance of the generated HTML hasn’t changed.

matthiask

Thanks! I like it.

I think we had reports about pformat being slow in the past, maybe we should skip pformat-ing when PRETTIFY_SQL is False but that would be an additional change/improvement.

adamchainz · 2022-01-12T09:53:32Z

debug_toolbar/utils.py

-        except KeyError:
-            # This frame doesn't have the expected format, so skip it and move
-            # on to the next one
-            continue


this didn't make sense - list.append cannot raise a KeyError

Add a few bits of caching: 1. Add sub-function to `parse_sql` with `lru_cache`, so that repeat calls with the same query are fast. This saves a lot of processing in N+1 situations. 2. Cache constructed filter stacks in `get_filter_stack()`. This avoids recreating all the various sqlparse objects for each query. 3. Pre-compile the simplification regex. The `re` module already uses an internal LRU cache of regexes, but this avoids recompiling if the regex ever drops out of that cache. Building on top of django-commons#1571, this takes run time for the same tested view from ~1100ms to ~950ms, another ~15% saving.

matthiask · 2022-01-12T12:10:46Z

Thanks!

Add a few bits of caching: 1. Add sub-function to `parse_sql` with `lru_cache`, so that repeat calls with the same query are fast. This saves a lot of processing in N+1 situations. 2. Cache constructed filter stacks in `get_filter_stack()`. This avoids recreating all the various sqlparse objects for each query. 3. Pre-compile the simplification regex. The `re` module already uses an internal LRU cache of regexes, but this avoids recompiling if the regex ever drops out of that cache. Building on top of django-commons#1571, this takes run time for the same tested view from ~1100ms to ~950ms, another ~15% saving.

Add a few bits of caching: 1. Add sub-function to `parse_sql` with `lru_cache`, so that repeat calls with the same query are fast. This saves a lot of processing in N+1 situations. 2. Cache constructed filter stacks in `get_filter_stack()`. This avoids recreating all the various sqlparse objects for each query. 3. Pre-compile the simplification regex. The `re` module already uses an internal LRU cache of regexes, but this avoids recompiling if the regex ever drops out of that cache. Building on top of #1571, this takes run time for the same tested view from ~1100ms to ~950ms, another ~15% saving.

This function is called by the SQL panel to render each frame of each stack trace. Previously it used a Django template, which is slow. By generating the HTML with pure Python code, we can avoid much of this overhead. I tried this change on a demo app I built that has 96 total queries, due to N+1 queries. The queries execute in 13ms due to use of SQLite, so the whole view is a negligible concern. Nearly all the view runtime is the toolbar itself. Without this change, the runtime is ~1300ms; with the change it’s ~1100ms. That's a saving of **15%**. I also checked the appearance of the generated HTML hasn’t changed.

Add a few bits of caching: 1. Add sub-function to `parse_sql` with `lru_cache`, so that repeat calls with the same query are fast. This saves a lot of processing in N+1 situations. 2. Cache constructed filter stacks in `get_filter_stack()`. This avoids recreating all the various sqlparse objects for each query. 3. Pre-compile the simplification regex. The `re` module already uses an internal LRU cache of regexes, but this avoids recompiling if the regex ever drops out of that cache. Building on top of django-commons#1571, this takes run time for the same tested view from ~1100ms to ~950ms, another ~15% saving.

adamchainz force-pushed the optimize_sql_panel_1 branch from 4289268 to 6e4830e Compare January 12, 2022 09:45

adamchainz force-pushed the optimize_sql_panel_1 branch from 6e4830e to 3d77806 Compare January 12, 2022 09:50

matthiask approved these changes Jan 12, 2022

View reviewed changes

adamchainz commented Jan 12, 2022

View reviewed changes

adamchainz mentioned this pull request Jan 12, 2022

Optimize SQL reformatting #1574

Merged

matthiask merged commit f7004fe into django-commons:main Jan 12, 2022

adamchainz deleted the optimize_sql_panel_1 branch January 12, 2022 12:45

tim-schilling mentioned this pull request May 2, 2022

ValueError: not enough values to unpack (expected 2, got 1) #1612

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize render_stacktrace() #1571

Optimize render_stacktrace() #1571

adamchainz commented Jan 12, 2022

matthiask left a comment

adamchainz Jan 12, 2022

matthiask commented Jan 12, 2022

Optimize render_stacktrace() #1571

Optimize render_stacktrace() #1571

Conversation

adamchainz commented Jan 12, 2022

matthiask left a comment

Choose a reason for hiding this comment

adamchainz Jan 12, 2022

Choose a reason for hiding this comment

matthiask commented Jan 12, 2022