
Mega Issue: Node Disruption Lifecycle Taints #624

Open
1 of 5 tasks
njtran opened this issue Oct 20, 2023 · 9 comments
Labels
deprovisioning (Issues related to node deprovisioning) · kind/feature (Categorizes issue or PR as related to a new feature) · v1 (Issues requiring resolution by the v1 milestone)

Comments

njtran (Contributor) commented Oct 20, 2023

Description

What problem are you trying to solve?
Karpenter has historically driven node disruption through annotations and processes maintained in memory.

Karpenter should instead drive disruption through its own taint mechanism(s) as it discovers and executes disruption actions.

This issue proposes that each node owned by Karpenter will be in one of four states (see the sketch after this list):

  1. Not Disrupting (No Taints) - Karpenter doesn't want to disrupt this node, and neither does the user.
  2. Candidate (PreferNoSchedule Taint) - Karpenter identifies the node as a possible target for any of its programmatic disruption mechanisms: expiration, drift, or consolidation. A node that's chosen as a candidate can always be removed from candidacy.
  3. Disrupting (NoSchedule Taint) - Karpenter has validated and executed the disruption action for the node, and has begun the standard disruption flow.
    Karpenter can fail to disrupt a node. If it does, the node goes back to Not Disrupting, where it may be picked up as a Candidate again later.
  4. Terminating (NoExecute Taint) - Karpenter has deleted the node, triggering the finalization logic, where the last of the pods (e.g. DaemonSets) need to be evicted before terminating the underlying instance and then removing the node.
    Once a node has begun terminating, there's no turning back. Karpenter will eventually terminate it.
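A minimal sketch of how these states could map onto Kubernetes taints. The key `karpenter.sh/disruption` and the values below are assumptions for illustration only; the real implementation may pick different keys, values, or structure:

```go
package disruption

import corev1 "k8s.io/api/core/v1"

// Hypothetical taints, one per proposed state (keys/values are assumptions).
var (
	// State 2: Candidate. A soft taint: the scheduler prefers other nodes
	// for new pods but can still use this one if needed.
	CandidateTaint = corev1.Taint{
		Key:    "karpenter.sh/disruption",
		Value:  "candidate",
		Effect: corev1.TaintEffectPreferNoSchedule,
	}

	// State 3: Disrupting. A hard taint: no new pods are scheduled while
	// Karpenter validates and executes the disruption action.
	DisruptingTaint = corev1.Taint{
		Key:    "karpenter.sh/disruption",
		Value:  "disrupting",
		Effect: corev1.TaintEffectNoSchedule,
	}

	// State 4: Terminating. Evicts remaining pods that lack a matching
	// toleration while the node is finalized and the instance deleted.
	TerminatingTaint = corev1.Taint{
		Key:    "karpenter.sh/disruption",
		Value:  "terminating",
		Effect: corev1.TaintEffectNoExecute,
	}
)

// State 1 (Not Disrupting) is simply the absence of these taints.
```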

Related Issues:

  • Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
  • Please do not leave "+1" or "me too" comments, they generate extra noise for issue followers and do not help prioritize the request
  • If you are interested in working on this issue or have submitted a pull request, please leave a comment
Legion2 commented Dec 5, 2023

I really like the idea of this issue. This would fix the spread-out behavior of the default scheduler when new pods are continuously added but there is plenty of idle capacity. With the described behavior, Karpenter would taint some of the nodes with PreferNoSchedule, causing the scheduler to bin-pack the new pods onto the remaining nodes instead of distributing them across all underutilized nodes.
I hope there will be policies or configuration in place that allow Karpenter to identify nodes as disruption candidates even when they are still running some small jobs which cannot be evicted.
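For context on why a PreferNoSchedule candidate taint steers pods away without hard-blocking them: the scheduler only scores a node lower for pods that do not tolerate the soft taint, so a workload that is fine landing on a candidate node could carry a matching toleration and be scheduled there normally. A sketch, reusing the hypothetical taint key from the earlier example:

```go
package disruption

import corev1 "k8s.io/api/core/v1"

// Hypothetical: a toleration that lets a workload (e.g. a short batch job)
// schedule onto candidate nodes without the scheduler penalizing those
// nodes in scoring. The key/value must match whatever taint Karpenter
// actually applies.
var tolerateCandidate = corev1.Toleration{
	Key:      "karpenter.sh/disruption",
	Operator: corev1.TolerationOpEqual,
	Value:    "candidate",
	Effect:   corev1.TaintEffectPreferNoSchedule,
}
```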

@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue as fresh with /remove-lifecycle stale
  • Close this issue with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

k8s-ci-robot added the lifecycle/stale label on Mar 4, 2024
njtran removed the lifecycle/stale label on Mar 12, 2024
@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue as fresh with /remove-lifecycle stale
  • Close this issue with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

k8s-ci-robot added the lifecycle/stale label on Jun 10, 2024
jmdeal (Member) commented Jun 10, 2024

/remove-lifecycle stale

k8s-ci-robot removed the lifecycle/stale label on Jun 10, 2024
Nuru commented Jul 1, 2024

Please be sure to handle the use case where a Pod running on a Node adds a "do-not-evict" annotation while it is running. Of course there will be an unavoidable race condition, but it is important to realize that just because the Node is tainted, it does not mean that annotated Pods will not appear on the Node.

It would be good for my use case if there were a way for a Pod to get notified that Karpenter is considering consolidating the node (NoSchedule Taint added) so it can immediately decide to either quit or annotate itself, which would give the Pod a head start in the race and avoid most if not all real-world mishaps.

One way to do this would be via another annotation, such as ok-to-disrupt or prefer-to-disrupt or something, that tells Karpenter to send the pod some signal other than SIGTERM that the Pod can respond to (and by default would ignore) when Karpenter considers the Node a likely consolidation target. This would have to happen after the Node is tainted, so that when the Pod quits and is immediately replaced with a new Pod by the Deployment, the new Pod does not get scheduled onto the same Node. We would also want a configurable delay between the taint and notification in step 3 and the actual termination in step 4, so we can be sure to give the Pod enough time to respond and block termination.
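One way a workload could get that head start today, without a new signal mechanism, is a small watcher (a sidecar or an in-process goroutine) that checks its own Node for the disruption taint and then triggers the application's graceful-shutdown or self-annotation logic. A rough sketch only, assuming in-cluster credentials, a NODE_NAME env var injected via the downward API, RBAC to get Nodes, and the hypothetical karpenter.sh/disruption taint key used in the earlier examples:

```go
package main

import (
	"context"
	"fmt"
	"os"
	"time"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/rest"
)

func main() {
	cfg, err := rest.InClusterConfig()
	if err != nil {
		panic(err)
	}
	client := kubernetes.NewForConfigOrDie(cfg)

	// NODE_NAME is assumed to be injected via the downward API
	// (fieldRef: spec.nodeName) in the pod spec.
	nodeName := os.Getenv("NODE_NAME")

	for {
		node, err := client.CoreV1().Nodes().Get(context.TODO(), nodeName, metav1.GetOptions{})
		if err == nil {
			for _, t := range node.Spec.Taints {
				// Hypothetical taint key/value; adjust to whatever Karpenter ships.
				if t.Key == "karpenter.sh/disruption" && t.Value == "disrupting" {
					fmt.Println("node is being disrupted; starting graceful shutdown")
					// Here the pod could exit, annotate itself, or flip a readiness gate.
					os.Exit(0)
				}
			}
		}
		time.Sleep(10 * time.Second)
	}
}
```

A production version would more likely use a watch or informer rather than polling, and the key/value checked must track whatever taint Karpenter actually applies.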

@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue as fresh with /remove-lifecycle stale
  • Close this issue with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-triage-robot

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue as fresh with /remove-lifecycle rotten
  • Close this issue with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

k8s-ci-robot added the lifecycle/rotten label and removed the lifecycle/stale label on Oct 29, 2024
Nuru commented Oct 30, 2024

/remove-lifecycle rotten

k8s-ci-robot removed the lifecycle/rotten label on Oct 30, 2024
@riyas-rawther

/remove-lifecycle rotten
