profiling and optimizations #189

vogler · 2021-04-14T09:32:55Z

Originally posted by @vogler in #188 (comment)

This likely needs some more in-depth profiling (like most of Goblint really...) because it's not obvious at all. For example, if using just DefExc in a 4-component IntDomTuple, most operations would also be doing lots of BatOption.map calls on None.

The assumption was that, with n activated out of m total domains, a binop costs:

- n * (1+2) matches for IntDomList (one for List.map2 plus one for each of the two arguments)
- m matches for IntDomTuple

I think I ran some test where it was faster, but it would be better to have some inline benchmarks [1] in bench for questions like this.
The main motivation was to reduce boilerplate and make adding domains easier.
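To make the two cost models above concrete, here is a minimal sketch of both representations. All types and functions (def_exc, interval, add_tuple, add_list, ...) are hypothetical stand-ins, not Goblint's actual IntDomTuple or IntDomList code; they only illustrate where the matches happen.

```ocaml
(* Hypothetical stand-ins for two integer domains; not Goblint's real modules. *)
type def_exc = int          (* pretend "definite value" domain *)
type interval = int * int   (* pretend interval domain *)

(* Tuple style: one slot per domain, None when the domain is deactivated.
   A binop does one match per slot, so m matches in total. *)
type tuple = def_exc option * interval option

let opt_map2 f a b =
  match a, b with
  | Some x, Some y -> Some (f x y)
  | _ -> None

let add_tuple ((d1, i1) : tuple) ((d2, i2) : tuple) : tuple =
  ( opt_map2 ( + ) d1 d2,
    opt_map2 (fun (l1, u1) (l2, u2) -> (l1 + l2, u1 + u2)) i1 i2 )

(* List style: only activated domains are stored, each behind a tag.
   A binop walks both lists (List.map2) and matches on both arguments. *)
type elem = DefExc of def_exc | Interval of interval

let add_elem a b =
  match a, b with
  | DefExc x, DefExc y -> DefExc (x + y)
  | Interval (l1, u1), Interval (l2, u2) -> Interval (l1 + l2, u1 + u2)
  | _ -> invalid_arg "add_elem: mismatched domains"

let add_list (xs : elem list) (ys : elem list) : elem list =
  List.map2 add_elem xs ys
```

With only the first domain activated, add_tuple still matches on the None slot of the second domain, while add_list touches a single element but pays for the list traversal and the tag match. Which one wins in practice is exactly what a benchmark would have to show.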

Or for example, create/create2 in IntDomTuple make repeated get_bool calls each time, which involves a lot more than looking up a locally cached bool variable (some places do this with options).

create/create2 shouldn't be called often compared to map*.
But good point in general. Maybe it's a good idea to include some memoization for GobConfig.get_* instead of letting consumers implement it repeatedly.
Edit: done: 5c73008
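For illustration, memoizing a lookup function could look roughly like the sketch below. This is not the code from 5c73008; slow_get_bool, memoize and the lookups counter are hypothetical names, and the "configuration" is a dummy computation.

```ocaml
(* Hypothetical slow lookup standing in for a GobConfig.get_* style accessor. *)
let lookups = ref 0

let slow_get_bool (key : string) : bool =
  incr lookups;                  (* count how often the slow path runs *)
  String.length key mod 2 = 0    (* dummy "configuration" value *)

(* Generic memoizer: cache results per key in a hash table. *)
let memoize (f : 'a -> 'b) : 'a -> 'b =
  let cache = Hashtbl.create 16 in
  fun key ->
    match Hashtbl.find_opt cache key with
    | Some v -> v
    | None ->
      let v = f key in
      Hashtbl.add cache key v;
      v

(* Memoized accessor: the slow path runs at most once per key. *)
let get_bool = memoize slow_get_bool
```

Repeated calls such as get_bool "ab" then hit the hash table instead of the slow path. A cache like this is only valid while the configuration does not change, so a real implementation would presumably need to clear it whenever an option is set.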

My general suspicion and experience with #164 is that we have subtle but massive performance bottlenecks in all sorts of subtle places, but nobody has ever profiled Goblint properly. Which isn't surprising because OCaml profiling tooling is almost non-existent. I tried and still ended up just guessing and using Stats.time to pin the problem.

True, perf report [2] works, but it's hard to find bottlenecks with it: for example, it does not list things like get_bool. I haven't tried whether enabling frame pointers helps.
The equality stuff in bench/hashcons also looks interesting. What speaks against let (=) a b = a == b || a = b?
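As a small sketch of that idea, the definition can be tried in isolation. It is named fast_equal here (a hypothetical name) instead of shadowing (=), so that the comparison with the standard structural equality stays visible.

```ocaml
(* Physical equality first, structural equality as the fallback.
   The definition is not recursive: the inner (=) is Stdlib's. *)
let fast_equal a b = a == b || a = b

(* A shared value and a structurally equal copy of it. *)
let big = List.init 10_000 (fun i -> i)
let copy = List.map (fun i -> i) big

(* Same closure on both sides: Stdlib's (=) would raise here. *)
let succ_fn x = x + 1
let same_closure = fast_equal succ_fn succ_fn
```

fast_equal big big returns after the pointer comparison, whereas fast_equal big copy still traverses both lists. One thing that may speak against it: the result differs from the standard (=) wherever a value is not structurally equal to itself. A list containing nan compares unequal to itself with (=) but equal with the shortcut, and comparing a closure with itself raises with (=) but returns true with the shortcut.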

[1] https://github.com/Chris00/ocaml-benchmark
[2] https://ocaml.org/learn/tutorials/performance_and_profiling.html#Using-perf-on-Linux


vogler · 2021-04-14T09:37:31Z

- How to profile? Is perf the best option?
  - Do frame pointers help to filter for functions like get_bool?
- 5c73008: memoize GobConfig.get_* -> ~10%?
- Is IntDomTuple faster than IntDomList or some Obj list implementation like in MCP2?
- What speaks against let (=) a b = a == b || a = b to speed up everything? See bench/hashcons.
- Profile derived implementations ("Reduce equal & compare boilerplate using deriving eq & ord", #227); see "inf-recursion now runs ~6x longer before stack overflow", #265 (comment).

vogler · 2021-07-27T12:14:06Z

Example of profiling with perf that shows how name mangling and inlining make it hard to use: #265 (comment)

sim642 · 2021-07-27T12:32:27Z

Regarding name mangling I found this but didn't try it: https://discuss.ocaml.org/t/ann-perf-demangling-of-ocaml-symbols-a-short-introduction-to-perf/7143. Supposedly those improvements are also going upstream into perf.

michael-schwarz · 2022-11-12T07:44:14Z

We currently have a practical course at TUM investigating this and related issues.

jerhard · 2023-11-14T14:57:58Z

I would suggest closing this issue, as we have already looked at optimizing by introducing short-circuiting, caching of options, and using flambda. The remaining issue of profiling and optimization then seems overly broad.

michael-schwarz · 2023-11-16T09:47:43Z

Close as overly broad for now. We should open dedicated issues if we have new ideas.

sim642 added the performance (Analysis time, memory usage) label on Apr 14, 2021

sim642 added a commit that referenced this issue on Jul 20, 2021: Document profiling with perf (issue #208, issue #189), 3040a99

michael-schwarz added the practical-course (Practical Course at TUM) label on Nov 12, 2022

michael-schwarz closed this as not planned on Nov 16, 2023
