
Feature: Measuring the airspeed velocity of an unladen signac #629

Merged (13 commits, Nov 26, 2021)

Conversation

bdice (Member) commented Oct 10, 2021

Description

This PR uses asv (airspeed velocity) to measure performance of the signac package. It could replace our existing benchmark.py script. I haven't yet tried all the features of asv, but it does a lot of things we care about and is a very powerful / helpful tool:

  • isolated testing of each commit in a virtual environment (which it handles for you)
  • automated testing from git ranges: running asv run v1.3.0..master tests every commit from v1.3.0 to master (with the current commit's set of benchmarks, so it doesn't require any git trickery!)
  • publishing results into a static HTML site that can be hosted/shared: asv publish
  • local preview of the HTML site: asv preview
  • checking that the benchmark suite runs correctly during development: asv dev
  • profiling+visualization with snakeviz: e.g. asv profile 'benchmarks.ProjectBench.time_iterate_load_sp(.*)' --gui=snakeviz
  • direct comparison of two commits with asv compare
  • automatic detection of performance regressions 👀
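
Getting asv running is mostly a matter of adding an `asv.conf.json` at the repository root. A minimal sketch follows; the values are illustrative, not the actual configuration added in this PR:

```json
{
    "version": 1,
    "project": "signac",
    "project_url": "https://signac.io",
    "repo": "..",
    "branches": ["master"],
    "environment_type": "virtualenv",
    "benchmark_dir": "benchmarks",
    "env_dir": ".asv/env",
    "results_dir": ".asv/results",
    "html_dir": ".asv/html"
}
```

With a config like this in place, `asv run`, `asv publish`, and `asv preview` operate on the `benchmarks/` directory and write results under `.asv/`.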

Preview of asv results from v1.3.0 to master (this link may be removed in the future): https://glotzerlab.github.io/signac/

To-do:

Motivation and Context

I have always wanted to try asv and I think this is a great tool for understanding the project's longitudinal improvements. From a reporting/visualization/profiling perspective, asv seems like a massive step forward. Especially since releases 1.6.0 and 1.7.0 were very carefully tested for performance, it would be nice to add this tool to ensure that we keep high performance into signac 2.0 and beyond.

We can separately discuss whether to remove benchmark.py. For now, I'm happy to keep both. To get rid of benchmark.py, we'll need to decide how to incorporate its flags, like those controlling document sizes, number of state point keys, etc. I would recommend adding those as benchmark parameters, and letting users edit the benchmark file if they need to test particular configurations (e.g. large documents).
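As a sketch of how benchmark.py's flags could become asv parameters: asv benchmark classes declare `params`/`param_names`, and asv runs every combination as a separate benchmark series. The class and parameter names below are illustrative (not the actual contents of benchmarks.py), and plain JSON files stand in for signac's storage:

```python
import json
import tempfile
from pathlib import Path


class DocumentBench:
    # asv runs every combination of these parameter values and reports
    # each combination as its own benchmark series.
    params = ([10, 100], [1, 10])
    param_names = ["num_docs", "num_keys"]

    def setup(self, num_docs, num_keys):
        # Fresh workspace per benchmark invocation, so no state leaks
        # between parameter combinations.
        self._tmp = tempfile.TemporaryDirectory()
        self.root = Path(self._tmp.name)
        for i in range(num_docs):
            doc = {f"key_{k}": k for k in range(num_keys)}
            (self.root / f"doc_{i}.json").write_text(json.dumps(doc))

    def time_load_documents(self, num_docs, num_keys):
        # The timed body: read every document back from disk.
        for path in self.root.glob("doc_*.json"):
            json.loads(path.read_text())

    def teardown(self, num_docs, num_keys):
        self._tmp.cleanup()
```

Tuning a "large documents" configuration would then just mean editing the `params` lists and re-running `asv run`.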

Types of Changes

  • Documentation update
  • Bug fix
  • New feature
  • Breaking change [1]

[1] The change breaks (or has the potential to break) existing functionality.

Checklist:

If necessary:

  • I have updated the API documentation as part of the package doc-strings.
  • I have created a separate pull request to update the framework documentation on signac-docs and linked it here.
  • I have updated the changelog and added all related issue and pull request numbers for future reference (if applicable). See example below.

@bdice bdice requested review from a team as code owners October 10, 2021 18:28
@bdice bdice requested review from b-butler and SchoeniPhlippsn and removed request for a team October 10, 2021 18:28
@bdice bdice self-assigned this Oct 10, 2021
@bdice bdice added the enhancement New feature or request label Oct 10, 2021
codecov bot commented Oct 10, 2021

Codecov Report

Merging #629 (73625e9) into master (641b03b) will increase coverage by 0.39%.
The diff coverage is n/a.

❗ Current head 73625e9 differs from pull request most recent head 7b8a501. Consider uploading reports for the commit 7b8a501 to get more accurate results

@@            Coverage Diff             @@
##           master     #629      +/-   ##
==========================================
+ Coverage   77.92%   78.31%   +0.39%     
==========================================
  Files          65       65              
  Lines        7079     7079              
  Branches     1310     1310              
==========================================
+ Hits         5516     5544      +28     
+ Misses       1249     1227      -22     
+ Partials      314      308       -6     
Impacted Files Coverage Δ
signac/core/h5store.py 91.06% <ø> (ø)
signac/contrib/project.py 85.15% <0.00%> (+0.36%) ⬆️
signac/__main__.py 71.89% <0.00%> (+0.55%) ⬆️
signac/contrib/filesystems.py 51.00% <0.00%> (+1.00%) ⬆️
signac/contrib/linked_view.py 83.91% <0.00%> (+1.39%) ⬆️
...nac/synced_collections/backends/collection_json.py 100.00% <0.00%> (+1.48%) ⬆️
...ed_collections/buffers/file_buffered_collection.py 99.17% <0.00%> (+1.65%) ⬆️
signac/core/dict_manager.py 88.23% <0.00%> (+2.35%) ⬆️
signac/common/host.py 40.00% <0.00%> (+2.85%) ⬆️
signac/contrib/migration/__init__.py 82.45% <0.00%> (+3.50%) ⬆️
... and 1 more

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 641b03b...7b8a501.

@bdice bdice added this to the v1.8.0 milestone Oct 10, 2021
csadorf (Contributor) commented Oct 11, 2021

That's really cool! The only thing that bothers me is the lack of axis labels. 😄

I think we can get rid of the old benchmark tests as long as it is super easy to still run benchmarks on your computer and as part of the CI.

bdice (Member, Author) commented Oct 11, 2021

That's really cool! The only thing that bothers is me is the lack of axis labels. 😄

You can get axis labels if you click through into the full chart, just not on the overview. Example: https://glotzerlab.github.io/signac/#benchmarks.ProjectBench.time_iterate

The overview chart is called a sparkline: https://en.wikipedia.org/wiki/Sparkline 😄

b-butler (Member) left a comment

I love how GitHub colors the JSON comments red; it's like Christmas with the red and green stripes.

benchmarks/benchmarks.py
Comment on lines +95 to +97
def time_iterate_load_sp(self, *params):
for _ in range(10):
[job.sp() for job in self.project]
A Member commented:
Given lazy loading, it would be interesting to see single-pass speeds as well, unless this loads on every pass. (I am not too familiar with the loading that takes place here.)

bdice (Member, Author) commented Nov 8, 2021

I'd like to leave the scope of what we benchmark the same in this PR as we have currently implemented in benchmark.py. I am unsure how lazy loading / caching would work if we added another test that loads statepoints here. Ideally we want to have an empty cache when each test executes, but I can't remember if the setup/teardown are run for each test method or each test class. (It's been a few weeks since I worked on this.)
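
For reference, asv's documentation describes `setup()` and `teardown()` as running per benchmark method and per parameter combination, not once per class, so an empty cache per test can be arranged in `setup()`. A minimal sketch of that pattern (class and attribute names are hypothetical; the dict stands in for signac's state point cache):

```python
class LazyLoadBench:
    def setup(self):
        # asv calls setup() before every benchmark method invocation,
        # so each benchmark starts with a cold cache.
        self._cache = {}  # stands in for signac's state point cache

    def _load(self, key):
        # First access is a "cold" load; later accesses hit the cache.
        if key not in self._cache:
            self._cache[key] = {"value": key * 2}
        return self._cache[key]

    def time_first_pass(self):
        # Times the cold, single-pass case, since the cache was just reset.
        for key in range(100):
            self._load(key)
```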

csadorf (Contributor) commented Nov 2, 2021

@SchoeniPhlippsn Do you think you will be able to have a look at this soon?
@bdice Regardless of the missing review, this might be ready to move forward. Any way I can help?

bdice (Member, Author) commented Nov 2, 2021

@SchoeniPhlippsn Do you think you will be able to have a look at this soon?
@bdice Regardless of the missing review, this might be ready to move forward. Any way I can help?

Yes! The next action item is to move some additional options like document size, statepoint key/value sizes, etc. into the new benchmark script. I think we can ignore the ability to set those via command line and just edit the script (and rerun asv) if those values need to be set for a particular benchmark.

SchoeniPhlippsn left a comment

Sorry for the late response. I missed this request for review.
I don't have any more comments. So as soon as Brandon's review has been addressed, I think this is good to go.

bdice (Member, Author) commented Nov 8, 2021

@b-butler @csadorf What do you think we should do with benchmark.py? Currently we use benchmark.py for CI testing to ensure that we haven't introduced serious regressions, so I don't want to delete it. I would vote to keep the existing benchmark.py and its corresponding CI tests. The only downside is that we will have two ways to perform benchmarks, but otherwise we have to re-implement all the CI tooling for performance tests with asv in this PR.

raise TypeError("N must be an integer!")

temp_dir = TemporaryDirectory()
project = signac.init_project(f"benchmark-N={N}", root=temp_dir.name)
A Contributor commented:

Any reason to not use (a potentially extended/adapted variant of)

def TemporaryProject(name=None, cls=None, **kwargs):
and
def init_jobs(project, nested=False, listed=False, heterogeneous=False):
here?

bdice (Member, Author) replied:

The TemporaryProject yields a project when used as a context manager. The benchmark script requires something we can store (like self.project and self.temp_dir) in setup and then destroy during the teardown method. I'm not aware of a clean way to use that context manager here.
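
One stdlib pattern that can bridge a context manager across the split setup/teardown lifecycle is `contextlib.ExitStack`, stored on `self`. A sketch under that assumption, using `TemporaryDirectory` as a stand-in (the commented `TemporaryProject` line shows where the signac helper could slot in, but is untested here):

```python
import contextlib
import tempfile
from pathlib import Path


class ContextManagedBench:
    def setup(self):
        # ExitStack keeps entered context managers open until it is closed,
        # letting a `with`-style resource span setup() and teardown().
        self._stack = contextlib.ExitStack()
        self.tmp_dir = Path(
            self._stack.enter_context(tempfile.TemporaryDirectory())
        )
        # A signac TemporaryProject could hypothetically be entered the
        # same way:
        # self.project = self._stack.enter_context(TemporaryProject())

    def teardown(self):
        # Closing the stack exits every entered context manager.
        self._stack.close()
```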

The Contributor replied:

Well, even if you don't use TemporaryProject, you could still use the testing.init_jobs() function?

bdice (Member, Author) replied:

Hmm, I had forgotten about that function. It looks like it is currently designed for tests of projects with complex schemas, not tests of performance with varying data sizes. We would probably have to change that function to support the arguments like num_keys, num_doc_keys, data_size, data_std, etc., and that feels out of scope for this PR. I'll finalize and merge as-is, since you indicated this is a non-blocking issue.

csadorf (Contributor) commented Nov 8, 2021

@b-butler @csadorf What do you think we should do with benchmark.py? Currently we use benchmark.py for CI testing to ensure that we haven't introduced serious regressions, so I don't want to delete it. I would vote to keep the existing benchmark.py and its corresponding CI tests. The only downside is that we will have two ways to perform benchmarks, but otherwise we have to re-implement all the CI tooling for performance tests with asv in this PR.

I think it's fine to leave it for now, but add an in-code comment where appropriate about the purpose of each method and about the motivation to keep both.

b-butler (Member) left a comment

I am fine with this as is.

@bdice bdice requested a review from csadorf November 14, 2021 05:16
bdice (Member, Author) commented Nov 14, 2021

@csadorf Feel free to give this a final pass of review if you'd like. Otherwise it's ready to merge. I added docs here: https://signac--629.org.readthedocs.build/projects/core/en/629/support.html#benchmarking

csadorf (Contributor) left a comment

I have one more question that I would like to have considered, but it does not block approval.

Not sure how all of these http->https fixes made it into this PR, but I will allow it. 😆


@bdice
Copy link
Member Author

bdice commented Nov 26, 2021

Not sure how all of these http->https fixes made it into this PR, but I will allow it. 😆

My mistake, I meant to push that fix directly to master and didn't notice that it was on this branch.

@bdice bdice enabled auto-merge (squash) November 26, 2021 21:55
@bdice bdice merged commit d002db2 into master Nov 26, 2021
@bdice bdice deleted the feature/asv branch November 26, 2021 22:13
Labels: enhancement (New feature or request)

4 participants