Driver performance degradation with TokenAwarePolicy when (nodes < RF) #452

michoecho · 2022-05-26T08:45:25Z

scylla-rust-driver/scylla/src/transport/load_balancing/token_aware.rs

Lines 26 to 29 in 652f131

    
           cluster 
        
               .ring_range(token) 
        
               .unique() 
        
               .take(replication_factor)

Scrutinize the above. Usually (when nodes >= RF) the above ends after just a few iterations. Since tokens are distributed between nodes randomly, we will quickly find RF unique nodes. But if the number of nodes in the ring is below RF, this code will iterate over all tokens, which takes significant work.

I bumped into this during a benchmark. It caused the driver to spend more than 50% of its CPU time in plan() when driving a 2-node cluster with RF=3.

The text was updated successfully, but these errors were encountered:

piodul · 2022-05-27T10:13:21Z

Good catch. Looking at the code of network_topology_strategy_replicas I suspect we might have a similar problem with NetworkTopologyStrategy, too (but not 100% sure, didn't try to reproduce).

@havaker is refactoring load balancing right now (#449), so I think we should either fix it during or after the refactor.

…iable token_aware code uses cluster.ring_range(token).unique() to iterate over candidate replicas until enough candidates are found to satisfy the RF. This behaves badly when the number of candidates is smaller than RF -- we always iterate over the entire ring, which is very wasteful (it was seen to slow down the driver by a factor of >2 in a simple performance test). Fix that by ending the iteration early when all unique candidate nodes were already considered. Fixes scylladb#452

havaker · 2023-03-23T11:06:23Z

Fixed in #612.

michoecho mentioned this issue Feb 17, 2023

token_aware: avoid iterating over the entire ring when RF is unsatisfiable #648

Closed

6 tasks

havaker closed this as completed Mar 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Driver performance degradation with TokenAwarePolicy when (nodes < RF) #452

Driver performance degradation with TokenAwarePolicy when (nodes < RF) #452

michoecho commented May 26, 2022

piodul commented May 27, 2022

havaker commented Mar 23, 2023

Driver performance degradation with TokenAwarePolicy when (nodes < RF) #452

Driver performance degradation with TokenAwarePolicy when (nodes < RF) #452

Comments

michoecho commented May 26, 2022

piodul commented May 27, 2022

havaker commented Mar 23, 2023