BinaryHeap: Use full sift down in .pop() #30534

bluss · 2015-12-23T03:08:26Z

BinaryHeap: Use full sift down in .pop()

.sift_down can either choose to compare the element on the way down (and
place it during descent), or to sift down an element fully, then sift
back up to place it.

A previous PR changed .sift_down() to the former behavior, which is much
faster for relatively small heaps and for elements that are cheap to
compare.

A benchmarking run suggested that BinaryHeap::pop() suffers
improportionally from this, and that it should use the second strategy
instead. It's logical since .pop() brings last element from the
heapified vector into index 0, it's very likely that this element will
end up at the bottom again.

Closes #29969
Previous PR #29811

.sift_down can either choose to compare the element on the way down (and place it during descent), or to sift down an element fully, then sift back up to place it. A previous PR changed .sift_down() to the former behavior, which is much faster for relatively small heaps and for elements that are cheap to compare. A benchmarking run suggested that BinaryHeap::pop() suffers improportionally from this, and that it should use the second strategy instead. It's logical since .pop() brings last element from the heapified vector into index 0, it's very likely that this element will end up at the bottom again.

rust-highfive · 2015-12-23T03:08:37Z

r? @alexcrichton

(rust_highfive has picked a reviewer for you, use r? to override)

bluss · 2015-12-23T03:08:50Z

r? @gankro

Gankra · 2015-12-23T03:40:29Z

Can you post the numbers, for posterity?

bluss · 2015-12-23T04:13:45Z

Sure, they're here #29969 (comment) and the direct link is https://gist.github.com/bluss/b179df31a8c683c087c7

I guess they could be interpreted in different ways. It's only testing two different data set sizes, so it's not exactly a well resolved picture of the asymptotics.

Gankra · 2015-12-23T05:10:45Z

TL;DR version for time travellers (EDIT: THIS IS BACKWARDS, SEE BELOW):

This reduces benchmark time to be 1/2 to 1/5th for most inputs (small heaps, or cheap comparisons), but for certain inputs makes the benchmark time 2x slower (large heap of expensive comparisons).

Gankra · 2015-12-23T05:31:29Z

ARGH.

Ok I had it backwards. This PR is is basically regressing the latest perf of pop ops for small/simple cases in favour of asymptotic gains on large/complex cases. I'm used to integer/float keys for heaps, so this seems backwards!

CC @rust-lang/libs, what do you think we should optimize more for?

bluss · 2015-12-23T05:50:51Z

cc @dgrunwald

Gankra · 2016-01-06T23:47:37Z

We discussed this at the libs team triage today. There was some wondering if we could dynamically (or statically?) branch on anything to "guess" which strategy would be best. In particular, we could check the size of the heap? The "large heap" case seems like the more pressing one than the "complex key" one.

What do you think?

bluss · 2016-01-07T17:55:38Z

I'd prefer to merge this change, and put down improved BinaryHeap strategy in the issue list. I'm unlikely to set out for such a major project right now.

Gankra · 2016-01-09T22:09:22Z

Ok, that seems reasonable.

@bors r+

bors · 2016-01-09T22:09:23Z

📌 Commit 52883ab has been approved by Gankro

bors · 2016-01-11T02:21:31Z

⌛ Testing commit 52883ab with merge ffdfeeb...

bors · 2016-01-11T02:24:23Z

💔 Test failed - auto-win-gnu-64-nopt-t

alexcrichton · 2016-01-11T04:12:24Z

@bors: retry

On Mon, Jan 11, 2016 at 3:24 PM, bors [email protected] wrote:

[image: 💔] Test failed - auto-win-gnu-64-nopt-t
http://buildbot.rust-lang.org/builders/auto-win-gnu-64-nopt-t/builds/2644

—
Reply to this email directly or view it on GitHub
#30534 (comment).

BinaryHeap: Use full sift down in .pop() .sift_down can either choose to compare the element on the way down (and place it during descent), or to sift down an element fully, then sift back up to place it. A previous PR changed .sift_down() to the former behavior, which is much faster for relatively small heaps and for elements that are cheap to compare. A benchmarking run suggested that BinaryHeap::pop() suffers improportionally from this, and that it should use the second strategy instead. It's logical since .pop() brings last element from the heapified vector into index 0, it's very likely that this element will end up at the bottom again. Closes #29969 Previous PR #29811

bors · 2016-01-11T19:03:20Z

⌛ Testing commit 52883ab with merge 1586005...

bors · 2016-01-11T20:59:13Z

☀️ Test successful - auto-linux-32-nopt-t, auto-linux-32-opt, auto-linux-64-debug-opt, auto-linux-64-nopt-t, auto-linux-64-opt, auto-linux-64-x-android-t, auto-linux-cross-opt, auto-linux-musl-64-opt, auto-mac-32-opt, auto-mac-64-nopt-t, auto-mac-64-opt, auto-win-gnu-32-nopt-t, auto-win-gnu-32-opt, auto-win-gnu-64-nopt-t, auto-win-gnu-64-opt, auto-win-msvc-32-opt, auto-win-msvc-64-opt

rust-highfive assigned alexcrichton Dec 23, 2015

rust-highfive assigned Gankra and unassigned alexcrichton Dec 23, 2015

alexcrichton added the T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. label Dec 23, 2015

Gankra mentioned this pull request Jan 9, 2016

Fine-tune binary heap sifting logic #30802

Open

bors merged commit 52883ab into rust-lang:master Jan 11, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BinaryHeap: Use full sift down in .pop() #30534

BinaryHeap: Use full sift down in .pop() #30534

bluss commented Dec 23, 2015

rust-highfive commented Dec 23, 2015

bluss commented Dec 23, 2015

Gankra commented Dec 23, 2015

bluss commented Dec 23, 2015

Gankra commented Dec 23, 2015

Gankra commented Dec 23, 2015

bluss commented Dec 23, 2015

Gankra commented Jan 6, 2016

bluss commented Jan 7, 2016

Gankra commented Jan 9, 2016

bors commented Jan 9, 2016

bors commented Jan 11, 2016

bors commented Jan 11, 2016

alexcrichton commented Jan 11, 2016

bors commented Jan 11, 2016

bors commented Jan 11, 2016

BinaryHeap: Use full sift down in .pop() #30534

BinaryHeap: Use full sift down in .pop() #30534

Conversation

bluss commented Dec 23, 2015

rust-highfive commented Dec 23, 2015

bluss commented Dec 23, 2015

Gankra commented Dec 23, 2015

bluss commented Dec 23, 2015

Gankra commented Dec 23, 2015

Gankra commented Dec 23, 2015

bluss commented Dec 23, 2015

Gankra commented Jan 6, 2016

bluss commented Jan 7, 2016

Gankra commented Jan 9, 2016

bors commented Jan 9, 2016

bors commented Jan 11, 2016

bors commented Jan 11, 2016

alexcrichton commented Jan 11, 2016

bors commented Jan 11, 2016

bors commented Jan 11, 2016