Fix flaky TopicReplicasChangeST#testKafkaTopicReplicaChangePositiveRoundTrip #10339
/azp run regression
Azure Pipelines successfully started running 1 pipeline(s).
/packit test --labels regression
As a side effect, all CC tests should become faster if this works.
What does this have to do with
Yes, sorry, let me fix it. Working on too many flaky tests :D
…undTrip This change is an attempt to fix the flaky TopicReplicasChangeST#testKafkaTopicReplicaChangePositiveRoundTrip. Looking at the logs, I can confirm that the timeout is indeed caused by Cruise Control's cluster model not being ready in time. The Cruise Control setup proposed here seems to make cluster model generation faster. It reduces the partition and replica count of Cruise Control's topics and shortens the metrics window. On my machine, the whole test suite went down from 26 minutes to 13 minutes, which is a 50% improvement. It would be good if someone else could confirm that by running TopicReplicasChangeST before and after this change. With this change, I wasn't able to trigger the issue locally after many runs. I would suggest running the regression tests 2 or 3 times to confirm this really helps. Signed-off-by: Federico Valeri <[email protected]>
a7ca402 to 7206661
Signed-off-by: Federico Valeri <[email protected]>
/azp run regression
Azure Pipelines successfully started running 1 pipeline(s).
/packit test --labels regression
Looks like it worked. Wdyt?
The results are good: 1 flake on Azure. Not sure whether it is connected to this change or whether it could be improved.
systemtest/src/main/java/io/strimzi/systemtest/templates/crd/KafkaTemplates.java
Signed-off-by: Federico Valeri <[email protected]>
Thanks for this @fvaleri 👍 Good job
@kyguy @ppatierno are you good with these changes?
Type of change
Description
This change is an attempt to fix the flaky TopicReplicasChangeST.testKafkaTopicReplicaChangePositiveRoundTrip. Looking at the logs, I can confirm that the timeout is indeed caused by Cruise Control's cluster model not being ready in time.
The Cruise Control setup proposed here seems to make cluster model generation faster. It reduces the partition and replica count of Cruise Control's topics and shortens the metrics window. On my machine, the whole test suite went down from 26 minutes to 13 minutes, which is a 50% improvement. It would be good if someone else could confirm that by running TopicReplicasChangeST before and after this change.
With this change, I wasn't able to trigger the issue locally after many runs. I would suggest running the regression tests 2 or 3 times to confirm this really helps.
Should fix #10295.
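For readers who want a concrete picture of the kind of setup described above, here is a minimal sketch in plain Java. It is not the code from this PR: the class name, the helper methods and every value are hypothetical, and the option names are assumed to match the upstream Cruise Control configuration keys for the sample store topics and metrics windows. Check them against the Cruise Control and Strimzi documentation before relying on them.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper for illustration only; not part of Strimzi or of this PR.
class CruiseControlTestConfig {

    // Builds a Cruise Control config map aimed at a small test cluster.
    // All values are illustrative assumptions, not the ones used in the PR.
    static Map<String, Object> fastModelConfig() {
        Map<String, Object> config = new LinkedHashMap<>();
        // Fewer partitions and replicas for Cruise Control's sample store topics.
        config.put("sample.store.topic.replication.factor", 1);
        config.put("partition.sample.store.topic.partition.count", 1);
        config.put("broker.sample.store.topic.partition.count", 1);
        // Shorter and fewer metrics windows, with more frequent sampling.
        config.put("metric.sampling.interval.ms", 5_000);
        config.put("partition.metrics.window.ms", 10_000);
        config.put("broker.metrics.window.ms", 10_000);
        config.put("num.partition.metrics.windows", 1);
        config.put("num.broker.metrics.windows", 1);
        return config;
    }

    // Total time span covered by the configured partition metrics windows
    // (window length multiplied by the number of windows).
    static long partitionWindowSpanMs(Map<String, Object> config) {
        long windowMs = ((Number) config.get("partition.metrics.window.ms")).longValue();
        long windows = ((Number) config.get("num.partition.metrics.windows")).longValue();
        return windowMs * windows;
    }

    public static void main(String[] args) {
        Map<String, Object> config = fastModelConfig();
        config.forEach((key, value) -> System.out.println(key + "=" + value));
        System.out.println("partition window span (ms): " + partitionWindowSpanMs(config));
    }
}
```

In a system test, a map like this would typically be passed to the Cruise Control section of the Kafka custom resource. How that is wired in KafkaTemplates.java is not shown here.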
Checklist