Run perf continuously #113

Jongy · 2021-06-20T01:01:29Z

Description

This PR changes SystemProfiler (which runs perf) to run in true continuous mode (like PyPerf & phpspy, and more to come, hopefully).
Continuous mode produces more accurate results because we don't miss anything - we're always sampling. Also, avoiding the re-starts of perf should yield better performance.

Supersedes #103.
~~Based on #89.~~

Motivation and Context

Performance & accuracy.

How Has This Been Tested?

Basic sanity works.
I ran it for 30 mins, then ran master for 30 mins, and compared the results - they were very much the same (on the same machine with similar load)

I will decide on more elaborate tests & preferably add them to our test suite.

Checklist:

I have read the CONTRIBUTING document.
I have updated the relevant documentation.
I have added tests for new logic.
Test performance effect - ~~does it really help in benchmarks & do we see less perf in the flamegraph now :)~~ I ran master and this branch on the same machine, and compared the numbers of --log-cpu-usage & how much of perf I see in the graph. Both went down.
Complete all TODOs in code
Time to get rid of the --profiling-interval option? With perf now being continuous, there's no sense in allowing duration != interval.

This is done for 2 reasons: 1. More correct & accurate: collect samples *all* the time, without excluding any interval. 2. For performance reasons - this still has to be benched, but we believe that avoiding the start-stop-start-stop-... of "perf record" will be beneficial.

We now profile foreverrrr

Jongy · 2021-06-21T19:46:24Z

Time to get rid of the --profiling-interval option? With perf now being continuous, there's no sense in allowing duration != interval.

We have decided to get rid of it, in favor of true continuity.

…t it)

I don't see why it's needed *here*. We can add such a check in the main file.

buildid recording & cache is enabled by default in perf, but disabled in "switch output" mode (see explanation in Linux commit 0c1d46a8796e830). We had it enabled (and specifically handled it in #17) because I had in mind that it's required for perf to properly resolve symbols across namespaces. I have proved this incorrect by testing, so there's no reason to keep these flags on - we run "script" right after record is done, it's okay to access files just by path.

tests/test_java.py

michelhe · 2021-07-18T13:17:27Z

gprofiler/perf.py


 logger = get_logger_adapter(__name__)

-PERF_BUILDID_DIR = os.path.join(TEMPORARY_STORAGE_PATH, "perf-buildids")
+
+class PerfProcess:


Looks like your can derive from ProfilerBase here and get away with much of the code

Not really. It's not the same API. For example, its result isn't ProcessToStackSampleCounters but plain perf script.

michelhe · 2021-07-18T13:33:12Z

gprofiler/perf.py

+    def stop(self) -> None:
+        if self._process is not None:
+            self._process.terminate()  # okay to call even if process is already dead
+            self._process.wait()


terminate sends SIGTERM, which kills perf.

But I agree timeouts are nice nonetheless

But we'll add timeouts somewhen else (we have plenty of other missing timeouts - for example, when running py-spy, I happened to see it hang). So in another PR. There's an issue open (or I will open one)

michelhe · 2021-07-18T13:40:36Z

Changes LGTM

Jongy · 2021-07-18T16:39:04Z

I'll update the README

Jongy added the enhancement New feature or request label Jun 20, 2021

Base automatically changed from no-perf to master June 20, 2021 12:43

Jongy force-pushed the continuous-perf-2 branch from 15a8374 to 01aec64 Compare June 20, 2021 12:47

Jongy added 3 commits June 21, 2021 15:37

utils: Extract wait_for_file_by_prefix()

a8062ce

Remove profiling interval

770a28b

We now profile foreverrrr

Jongy force-pushed the continuous-perf-2 branch from 01aec64 to 770a28b Compare June 21, 2021 19:46

Jongy added 10 commits June 27, 2021 03:53

Merge remote-tracking branch 'master' into continuous-perf-2

f2e2f66

perf: Add some logs

ce1021b

Merge branch 'master' into continuous-perf-2

48170c5

Merge branch 'master' into continuous-perf-2

6346b22

Merge remote-tracking branch 'master' into continuous-perf-2

54162ad

perf: Remove TODO since I checked it

3bcd845

perf: Comment-out reading from stderr, remove stdout (don't care abou…

11a7830

…t it)

perf: Remove disk check

6922f6e

I don't see why it's needed *here*. We can add such a check in the main file.

Change error() -> exception() log

4d277cc

tests: Don't run perf where not needed

3589888

Jongy marked this pull request as ready for review July 15, 2021 16:09

Jongy requested a review from michelhe July 15, 2021 16:09

Jongy requested a review from guybortnikov July 15, 2021 22:29

Jongy mentioned this pull request Jul 15, 2021

Preliminary NodeJS support #126

Merged

10 tasks

michelhe reviewed Jul 18, 2021

View reviewed changes

tests/test_java.py Outdated Show resolved Hide resolved

michelhe reviewed Jul 18, 2021

View reviewed changes

tests: Add --no-ruby where needed

ba7e02a

Jongy requested a review from michelhe July 18, 2021 15:38

Update README

327ac85

michelhe approved these changes Jul 18, 2021

View reviewed changes

Jongy merged commit 4f310b7 into master Jul 18, 2021

Jongy deleted the continuous-perf-2 branch July 18, 2021 18:21

Jongy mentioned this pull request Jul 18, 2021

Use perfs output switching capability #103

Closed

2 tasks

Jongy mentioned this pull request Nov 1, 2022

java: async-profiler: use dump command for truly continuous profiling #569

Open

Jongy mentioned this pull request Mar 1, 2023

perf: Optionally utilize buildid cache #707

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Run perf continuously #113

Run perf continuously #113

Jongy commented Jun 20, 2021 •

edited

Loading

Jongy commented Jun 21, 2021

michelhe Jul 18, 2021

Jongy Jul 18, 2021

michelhe Jul 18, 2021

Jongy Jul 18, 2021

Jongy Jul 18, 2021

michelhe commented Jul 18, 2021

Jongy commented Jul 18, 2021

Run perf continuously #113

Run perf continuously #113

Conversation

Jongy commented Jun 20, 2021 • edited Loading

Description

Motivation and Context

How Has This Been Tested?

Checklist:

Jongy commented Jun 21, 2021

michelhe Jul 18, 2021

Choose a reason for hiding this comment

Jongy Jul 18, 2021

Choose a reason for hiding this comment

michelhe Jul 18, 2021

Choose a reason for hiding this comment

Jongy Jul 18, 2021

Choose a reason for hiding this comment

Jongy Jul 18, 2021

Choose a reason for hiding this comment

michelhe commented Jul 18, 2021

Jongy commented Jul 18, 2021

Jongy commented Jun 20, 2021 •

edited

Loading