KEP-5007: DRA Device Binding Conditions #5012
KobayashiD27
commented
Dec 20, 2024
- One-line enhancement description: Some network- or fabric-attached devices need to be attached to a node before a pod using them can be scheduled.
- Issue link: DRA: Device Binding Conditions #5007
- Other comments:
@pohly
keps/sig-scheduling/5007-device-attach-before-pod-scheduled/README.md
/ok-to-test
I'm a bit time-constrained right now. Not sure whether I can review for 1.33.
@x13n - FYI
@pohly
Discussing it in the meeting would be a good first step to get more people on board.
/cc @zvonkok
I have implemented an experimental version of this DRA scheduler feature. I hope it will be useful for your review.
@johnbelamaric
keps/sig-scheduling/5007-device-attach-before-pod-scheduled/kep.yaml
API review: some small gaps in the mechanics (+listType) and suggestions to make the API description a bit more complete.
UsageRestrictedToNode bool

// BindingTimeout indicates the prepare timeout period.
// If the timeout period is exceeded before all BindingConditions reach a True state, the scheduler clears the allocation in the ResourceClaim and reschedules the Pod.
Suggested change:
- // If the timeout period is exceeded before all BindingConditions reach a True state, the scheduler clears the allocation in the ResourceClaim and reschedules the Pod.
+ // If the timeout period is exceeded before all BindingConditions reach a True state, the scheduler clears the allocation in the ResourceClaim and reschedules the Pod.
+ //
+ // The default timeout if not set is 10 minutes.
One could argue that this should be set via defaulting, but then we get that value set also when there aren't any binding conditions, or need to be very (too!) clever with the defaulting.
// BindingTimeout indicates the prepare timeout period.
// If the timeout period is exceeded before all BindingConditions reach a True state, the scheduler clears the allocation in the ResourceClaim and reschedules the Pod.
//
// +optional
Suggested change:
- // +optional
+ // The default timeout if not set is 10 minutes.
+ //
+ // +optional
Thank you for your review. I have reflected all of your suggestions and comments.
/lgtm
/lgtm
(FYI, I'll be taking two days off starting tomorrow, so feel free to DM me on Slack, or just skip me if it isn't critical. I've only joined a few discussions in this PR, so there is likely nothing that needs me anyway.)
/assign @alculquicondor
For SIG Scheduling approval. For reviews, see #5012 (comment) (@sanposhiho), #5012 (comment) (@dom4ha) and #5012 (comment) (summary by @dom4ha). PRR is done and only needs final approval, see #5012 (comment).
/approve
for SIG Scheduling, based on the discussions during the last meeting.
@alculquicondor @johnbelamaric
/approve
[APPROVALNOTIFIER] This PR is APPROVED
This pull-request has been approved by: alculquicondor, johnbelamaric, KobayashiD27, pohly
@johnbelamaric @pohly @klueska
I will create a separate issue about this (updating ResourceSlices). Let's discuss it there.