automatic shrinking of hash table capacity is very expensive #17645

thestinger · 2014-09-30T03:42:28Z

This interacts poorly with with_capacity since it will just end up shrinking the allocation immediately. It's very widely used since FromIter uses it with the iterator size hint. Even in code that's not pre-allocating capacity, the resize strategy results in many reallocations as the hash table shrinks and then grows again. Allocator improvements are possible but this is always going to be extremely expensive for huge collections on most platforms.

The text was updated successfully, but these errors were encountered:

Gankra · 2014-09-30T03:54:21Z

If you ask for a hashmap with_capacity, it will record that number as a minimum and never shrink the map below that.

thestinger · 2014-09-30T03:59:18Z

I think it's trying to be too clever. It will always have problems reminiscent of the segmented stack thrashing issues. It would be better to just have the caller use shrink_to_fit as is done for vectors. There's a lot of value in having simple and predictable performance characteristics. Implementing jemalloc/jemalloc#134 would reduce the problem smaller for huge hash tables but I think it's still too aggressive as a default.

Gankra · 2014-09-30T04:15:01Z

cc @pczarn

jfager · 2014-09-30T07:57:17Z

Automatic shrinking is pretty surprising behavior, and only seems to be hinted at in the rustdoc via the reserve method. +1 to stop doing it.

mitsuhiko · 2014-10-01T16:24:51Z

+1 on explicit calls for more predictable performance. I was quite surprised to learn that this is what it does currently.

arthurprs · 2014-10-06T00:02:25Z

+1 same as @mitsuhiko

pczarn · 2014-10-10T22:19:18Z

I don't mind explicit calls. This is a simple approach. ResizePolicy would allow opt-in automatic shrinking in the future.

Just keep in mind that the capacity of a hash table affects the performance of accesses, even though it's more predicable. (Is it?)

pczarn · 2015-01-14T17:59:10Z

Fixed by #18770.

Gankra · 2015-01-14T18:15:17Z

🎊

Fix path resolution for child mods of those expanded by `include!` Child modules wouldn't use the correct candidate paths due to a branch that doesn't seem to be doing what it's intended to do. Removing the branch fixes the problem and all existing test cases pass. Having no knowledge of how any of this works, I believe this fixes rust-lang#17645. Using another test that writes the included mod directly into `lib.rs` instead, I found the difference can be traced to the candidate files we use to look up mods. A separate branch for if the file comes from an `include!` macro doesn't take into account the original mod we're contained within: ```rust None if file_id.macro_file().map_or(false, |it| it.is_include_macro(db.upcast())) => { candidate_files.push(format!("{}.rs", name.display(db.upcast()))); candidate_files.push(format!("{}/mod.rs", name.display(db.upcast()))); } ``` I'm not sure why this branch exists. Tracing the branch back takes us to 3bb9efb but it doesn't say *why* the branch was added. The test case that was added in this commit passes with the branch removed, so I think it's just superfluous at this point.

thestinger added A-libs I-slow Issue: Problems and improvements with respect to performance of generated code. labels Sep 30, 2014

Gankra closed this as completed Jan 14, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

automatic shrinking of hash table capacity is very expensive #17645

automatic shrinking of hash table capacity is very expensive #17645

thestinger commented Sep 30, 2014

Gankra commented Sep 30, 2014

thestinger commented Sep 30, 2014

Gankra commented Sep 30, 2014

jfager commented Sep 30, 2014

mitsuhiko commented Oct 1, 2014

arthurprs commented Oct 6, 2014

pczarn commented Oct 10, 2014

pczarn commented Jan 14, 2015

Gankra commented Jan 14, 2015

automatic shrinking of hash table capacity is very expensive #17645

automatic shrinking of hash table capacity is very expensive #17645

Comments

thestinger commented Sep 30, 2014

Gankra commented Sep 30, 2014

thestinger commented Sep 30, 2014

Gankra commented Sep 30, 2014

jfager commented Sep 30, 2014

mitsuhiko commented Oct 1, 2014

arthurprs commented Oct 6, 2014

pczarn commented Oct 10, 2014

pczarn commented Jan 14, 2015

Gankra commented Jan 14, 2015