roachtest: run tests against larger clusters #26978
Labels
A-kv-client
Relating to the KV client and the KV interface.
C-enhancement
Solution expected to add code/behavior + preserve backward-compat (pg compat issues are exception)
A user has privately shared multiple issues found working with generally ~32 node clusters, including stuck requests during IMPORT and minutes of failed liveness heartbeats (the first likely and the second definitely related to a GC'able replica that isn't getting removed in due time).
While we generally have some theory and understanding about these issues, we should be more proactive about running into them ourselves. A few tests that run heavy rebalancing plus chaos along with regular activity (that is somewhat latency-sensitive, i.e. we would notice if it got stuck for 30+s) on most ranges should be sufficient.
The text was updated successfully, but these errors were encountered: