-
Notifications
You must be signed in to change notification settings - Fork 559
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[GCP] skip MIG termination for only CPU instances #4654
base: master
Are you sure you want to change the base?
Conversation
Is this the case for the issue in #4594? IIRC, for CPU machines, we will skip the MIG code path for creation, so the termination should also correctly skip that? |
The skypilot/sky/provision/gcp/mig_utils.py Line 144 in 63c04b4
It seems that it skips in my reproduce: Skip resize request cancellation and Skip deletion. ![]() I am not sure if it is as expected. I have tried If it is not as expected, which means that the permission is not the root cause, right? If so, let me figure it out. |
Please double check the code path for launching a CPU instance with the |
Hi Michaelvll, Thanks for the explanation.
Just want to make sure:
I find that
We need to keep the above tags/attributes as For creation, I found that current code path will check both MIG config and availablity of Accelerators to make GCPNodeType as MIG. here If the node do not have TPUs, the node type will be GCPNodeType.COMPUTE, and then So, for termination, we could also rely on (1) MIG config and (2) the existence of TPUs to decide whether skip or not. I am not sure if it is correct, please check it. Consequently, under above situation, for termination, the code path will not call API requests for GCP ManagedInstanceGroup.
|
Resolve for #4594
check:
https://cloud.google.com/compute/docs/reference/rest/beta/instanceGroupManagerResizeRequests/get
https://cloud.google.com/compute/docs/reference/rest/beta/instanceGroupManagerResizeRequests/cancel
Tested (run the relevant ones):
bash format.sh
pytest tests/test_smoke.py
pytest tests/test_smoke.py::test_fill_in_the_name
conda deactivate; bash -i tests/backward_compatibility_tests.sh