Skip to content

Commit

Permalink
Update karpenter-nodepool.yaml
Browse files Browse the repository at this point in the history
Reducing karpenter node expiration times
  • Loading branch information
SebastianScherer88 authored Dec 5, 2024
1 parent 915f0a3 commit e86f60d
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions kubernetes/manifests/karpenter/karpenter-nodepool.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -29,7 +29,7 @@ spec:
disruption:
consolidationPolicy: WhenEmpty
consolidateAfter: 15m # ~5 minutes required for larger images, otherwise karpenter will deprovision before pods are running
expireAfter: 24h
expireAfter: 2h
---
apiVersion: karpenter.sh/v1beta1
kind: NodePool
Expand Down Expand Up @@ -61,7 +61,7 @@ spec:
disruption:
consolidationPolicy: WhenEmpty
consolidateAfter: 15m # dont go lower to prevent decomissioning bc of image pull phase
expireAfter: 3h # limit GPU nodes to 3 ours - enough to train annotated transformer on one GPU (~2.8h)
expireAfter: 1h # limit GPU nodes to 3 ours - enough to train annotated transformer on one GPU (~2.8h)
---
apiVersion: karpenter.sh/v1beta1
kind: NodePool
Expand Down Expand Up @@ -96,4 +96,4 @@ spec:
disruption:
consolidationPolicy: WhenEmpty
consolidateAfter: 30m # larger training images take longer to pull
expireAfter: 240h # limit GPU nodes to 12h ours
expireAfter: 24h # limit GPU nodes to 12h ours

0 comments on commit e86f60d

Please sign in to comment.