roachtest: read from cache when checking for preemption #122018

renatolabs
Contributor

Previously, the check for VM preemptions after test failure would perform a `Sync` operation, making API calls to GCE in order to find clusters. However, that shouldn't be necessary -- roachtest itself created the cluster, so the cluster data should exist in the cache.

This commit also adds more debug information when the cluster cannot be found (a list of existing clusters). While this shouldn't happen in practice in regular runs, we have seen it fail in the past.

Fixes: #121488

Release note: None

@cockroach-teamcity
Member

This change is Reviewable

@renatolabs renatolabs marked this pull request as ready for review April 9, 2024 18:41
@renatolabs renatolabs requested a review from a team as a code owner April 9, 2024 18:41
@renatolabs renatolabs requested review from herkolategan and DarrylWong and removed request for a team April 9, 2024 18:41
@renatolabs
Contributor Author

Did a quick test to make sure preemptions continue to work in this branch: link.

@renatolabs renatolabs force-pushed the rc/roachtest-cluster-not-found-debug-info branch from bd26a48 to 2743138 Compare April 9, 2024 19:14
@renatolabs
Contributor Author

TFTR!

bors r=srosenberg

@craig
Contributor

craig bot commented Apr 9, 2024

Build failed:

@renatolabs
Contributor Author

bors retry

@craig craig bot merged commit 8ff9f39 into cockroachdb:master Apr 9, 2024
22 checks passed
@renatolabs renatolabs deleted the rc/roachtest-cluster-not-found-debug-info branch April 10, 2024 13:59
@renatolabs
Contributor Author

blathers backport 24.1

@vidit-bhat
Contributor

vidit-bhat commented May 25, 2024

We would need to backport this PR to 23.2 to be able to backport #124243 and, consequently, #124443. I need the `getCachedCluster` and `bucketVMsByProvider` functions.

@vidit-bhat
Contributor

blathers backport 23.2

blathers-crl bot commented May 25, 2024

Encountered an error creating backports. Some common things that can go wrong:

  1. The backport branch might have already existed.
  2. There was a merge conflict.
  3. The backport branch contained merge commits.

You might need to create your backport manually using the backport tool.


error creating merge commit from 2743138 to blathers/backport-release-23.2-122018: POST https://api.github.com/repos/cockroachdb/cockroach/merges: 409 Merge conflict []

You may need to manually resolve merge conflicts with the backport tool.

Backport to branch 23.2 failed. See errors above.


🦉 Hoot! I am a Blathers, a bot for CockroachDB. My owner is dev-inf.

Successfully merging this pull request may close these issues.

roachtest: ycsb/F/nodes=3/cpu=32 failed
4 participants