Add a feature for using the same number of loops as a previous run #327
Conversation
Looks good in general. I'll let Eric approve though.
LGTM
I left a couple of minor comments that you can take or leave.
It might be worth splitting out the bm_sqlglot changes into a separate PR, but it isn't a big deal. I'll leave that up to you.
@ericsnowcurrently: I think this is good to merge, but I don't have merge rights, if you don't mind... :)
Motivation:
On the Faster CPython team, we often collect pystats (counters of various interpreter events) by running the benchmark suite. It is very useful to compare the stats between two commits to see how a pull request affects the interpreter. Unfortunately, with pyperformance's default behavior where the number of loops is automatically calibrated, each benchmark may not be run the same number of times from run-to-run, making the data hard to compare.
This change adds a new argument to the `run` command which will use the same number of loops as a previous run. The number of loops for each benchmark is looked up from the metadata in the .json output of that previous run and passed to the underlying call to `pyperf` using the `--loops` argument.
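For illustration, here is a rough sketch (not the code from this PR) of how the loop counts could be recovered from a previous run's .json output via pyperf's API; the function name `loops_from_previous_run` and the file name `baseline.json` are made up for this example:

```python
# Sketch only: recover per-benchmark loop counts from a previous run's
# .json output so they can be passed back to pyperf via --loops.
import pyperf

def loops_from_previous_run(json_path):
    suite = pyperf.BenchmarkSuite.load(json_path)
    loops = {}
    for bench in suite.get_benchmarks():
        # pyperf records a "loops" entry in the benchmark metadata.
        metadata = bench.get_metadata()
        if "loops" in metadata:
            loops[bench.get_name()] = metadata["loops"]
    return loops

# e.g. {'json_dumps': 32, 'nbody': 16, ...}; each value would then be
# forwarded to the corresponding pyperf invocation as "--loops=<value>".
previous_loops = loops_from_previous_run("baseline.json")
```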
Additionally, this modifies one of the benchmarks, `sqlglot`, to be compatible with that scheme. `sqlglot` is the only `run_benchmark.py` script that runs multiple benchmarks in a single call to the script, which makes it impossible to set the number of loops independently for each of those benchmarks. It has been updated to use the pattern from other "suites" of benchmarks (e.g. `async_tree`), where each benchmark has its own `.toml` file and is run independently. This should still be backward compatible with older data collected from this benchmark, but `pyperformance run -b sqlglot` will now only run a single benchmark.