cherrypick-1.1: storage: avoid replica thrashing when localities are different sizes #20934
Conversation
(force-pushed from 72e35a4 to 1cbb256)
Review status: 0 of 6 files reviewed at latest revision, all discussions resolved, some commit checks failed.
Updating a target store's write stats immediately after rebalancing was recently addressed in cockroachdb#18425. With that change, if `updateLocalStoreAfterRebalance` is called before the `StorePool` has seen the `StoreDescriptor` in gossip, it will trigger an NPE. This change fixes the problem by making the update a no-op if the descriptor has not yet been seen in gossip.
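A minimal sketch in Go of the guard described above; the types, fields, and map layout here are simplified stand-ins for illustration, not the actual CockroachDB `StorePool` code:

```go
package storepool

// Hypothetical, simplified types; the real StorePool tracks much more state.
type StoreID int

type StoreCapacity struct {
	RangeCount      int32
	WritesPerSecond float64
}

type StoreDescriptor struct {
	Capacity StoreCapacity
}

type storeDetail struct {
	desc *StoreDescriptor // remains nil until the store is seen in gossip
}

type StorePool struct {
	details map[StoreID]*storeDetail
}

// updateLocalStoreAfterRebalance applies the eager stats update, but only if
// the store's descriptor has already arrived via gossip; otherwise it is a
// no-op instead of dereferencing a nil descriptor.
func (sp *StorePool) updateLocalStoreAfterRebalance(
	storeID StoreID, rangeDelta int32, writesDelta float64,
) {
	detail, ok := sp.details[storeID]
	if !ok || detail.desc == nil {
		// Descriptor not yet seen in gossip; skip the update and let the
		// next gossip refresh bring the local stats up to date.
		return
	}
	detail.desc.Capacity.RangeCount += rangeDelta
	detail.desc.Capacity.WritesPerSecond += writesDelta
}
```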
… to improve balance
If the first target attempted was rejected due to the simulation claiming that it would be immediately removed, we would reuse the modified `rangeInfo.Desc.Replicas` that had the target added to it, messing with future iterations of the loop. Also, we weren't properly modifying the `candidates` slice, meaning that we could end up trying the same replica multiple times. Release note (bug fix): Improve data rebalancing to make thrashing back and forth between nodes much less likely.
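A small, self-contained sketch of the two fixes described above, using hypothetical names rather than the actual allocator code: each iteration copies the existing replicas instead of appending to the shared slice, and the candidates slice is shrunk so a rejected store is never retried.

```go
package main

import "fmt"

type ReplicaDescriptor struct{ StoreID int }

// simulateWouldRemove stands in for the allocator's removal simulation; here
// it simply rejects even store IDs so the loop exercises several candidates.
func simulateWouldRemove(replicas []ReplicaDescriptor, target int) bool {
	return target%2 == 0
}

// pickTarget sketches the fixed loop: copy-on-write for the replica set, and
// a properly shrinking candidates slice.
func pickTarget(existing []ReplicaDescriptor, candidates []int) (int, bool) {
	for len(candidates) > 0 {
		target := candidates[0]
		candidates = candidates[1:] // drop it so the same store isn't tried twice

		// Copy before appending so a rejected candidate doesn't leak into the
		// replica set seen by the next iteration's simulation.
		replicas := make([]ReplicaDescriptor, len(existing), len(existing)+1)
		copy(replicas, existing)
		replicas = append(replicas, ReplicaDescriptor{StoreID: target})

		if !simulateWouldRemove(replicas, target) {
			return target, true
		}
	}
	return 0, false
}

func main() {
	existing := []ReplicaDescriptor{{1}, {3}, {5}}
	if target, ok := pickTarget(existing, []int{2, 4, 7}); ok {
		fmt.Println("rebalance target:", target)
	}
}
```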
Skipping the simulation when raftStatus.Progress is nil can make for undesirable thrashing of replicas, as seen when testing cockroachdb#20241. It's better to run the simulation without properly filtering replicas than to not run it at all. Release note: None
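A rough sketch of the fallback described here, with hypothetical type and function names: filter replicas by raft progress only when that information is actually available, and otherwise hand the simulation the full, unfiltered set.

```go
package allocator

// Hypothetical, simplified stand-ins for the raft status and replica types.
type RaftProgress struct{ Match uint64 }

type RaftStatus struct {
	Commit   uint64
	Progress map[uint64]RaftProgress // nil when this node isn't the raft leader
}

type ReplicaDescriptor struct{ ReplicaID uint64 }

// replicasForSimulation falls back to the unfiltered replica set when raft
// progress is unknown, rather than skipping the removal simulation entirely.
func replicasForSimulation(status *RaftStatus, replicas []ReplicaDescriptor) []ReplicaDescriptor {
	if status == nil || status.Progress == nil {
		// No progress info available; simulating with every replica is
		// better than not simulating at all.
		return replicas
	}
	// Keep only replicas that have caught up to the commit index.
	filtered := replicas[:0:0]
	for _, r := range replicas {
		if pr, ok := status.Progress[r.ReplicaID]; ok && pr.Match >= status.Commit {
			filtered = append(filtered, r)
		}
	}
	return filtered
}
```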
Fixes cockroachdb#20241. Release note (bug fix): Avoid rebalance thrashing when localities have very different numbers of nodes.
(force-pushed from 1cbb256 to f01eb24)
I've tested this with all the multi-node allocsim configs as well as some other arbitrary nodes-per-locality configurations I made up. It didn't always end up super balanced within the small localities (due to #20751), but did its job of maintaining a rough balance and avoided thrashing in all of them. I'll merge this later tonight unless someone speaks up before then.
This is the minimal set of commits, unmodified other than to resolve merge conflicts, needed to fix #20241 on the release-1.1 branch.
This is larger than our typical cherrypick, and while the code is pretty well tested, it hasn't been fully tested without a few other allocator commits that are on the master branch. If @bdarnell and others think it's worth cherrypicking something like this, I can do some additional manual testing of this branch with allocsim and/or spinning indigo back up.