Init container gets OOM killed on new cluster POD startup #1566
Comments
can you please upload must-gather without changes? the |
There are no other resources. I'm only creating a cluster and it fails on init container OOM without producing any logs so nothing is actually booting up. Do you anticipate anything apart from the cluster STS pods to be running? |
collected again |
sorry, my bad, we don't yet collect events related to the scyllaclusters. can you please run it with |
here you go scylla-operator-must-gather-gdc4dj7vl8p8.zip |
Thanks, it is weird that we don't hit it on other platforms. I think the resources should be fixed and cp should use only as much memory as it has available, not sure if k3d behaves differently or if it's somehow connected to the hardware.
initContainers:
- command:
  - /bin/sh
  - -c
  - cp -a /usr/bin/scylla-operator /mnt/shared
  image: docker.io/scylladb/scylla-operator@sha256:942098adb09134460264c7470d06efa2d5ee32e98354bb3929f9a16f12cf8b4a
  name: sidecar-injection
  resources:
    limits:
      cpu: 10m
      memory: 50Mi
    requests:
      cpu: 10m
      memory: 50Mi
status:
  initContainerStatuses:
  - containerID: containerd://51ceeb40ef28485c803b983b42e48b0ec12a704cee93b99f79e23ed51edcb2af
    image: sha256:fb3b6307d762133e6e718eb4c61078b423859be3d21d54f294f8e3e3e91095d3
    imageID: docker.io/scylladb/scylla-operator@sha256:942098adb09134460264c7470d06efa2d5ee32e98354bb3929f9a16f12cf8b4a
    lastState:
      terminated:
        containerID: containerd://676594f27b4b1d80d24e8f2171f97f04814da13d45f36b30d22d7c083a29b740
        exitCode: 137
        finishedAt: "2023-11-15T08:25:22Z"
        reason: OOMKilled
        startedAt: "2023-11-15T08:25:17Z"
/assign @rzetelskik |
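For anyone checking whether they are hitting the same thing, here is a rough sketch (not part of the operator) that scans a pod object, as returned by `kubectl get pod <pod-name> -o json`, for init containers whose current or last state was terminated with reason `OOMKilled`. The function name and the sample data are made up for illustration; the sample only mirrors the fields shown in the status above, and the `name` field is assumed to match the `sidecar-injection` container.

```python
def oom_killed_init_containers(pod: dict) -> list:
    """Return names of init containers terminated with reason OOMKilled."""
    names = []
    statuses = pod.get("status", {}).get("initContainerStatuses", [])
    for cs in statuses:
        # Check both the current state and the previous (last) state.
        for key in ("state", "lastState"):
            terminated = (cs.get(key) or {}).get("terminated") or {}
            if terminated.get("reason") == "OOMKilled":
                names.append(cs.get("name", "<unnamed>"))
                break
    return names


# Hypothetical sample shaped like the status excerpt in this thread.
sample_pod = {
    "status": {
        "initContainerStatuses": [
            {
                "name": "sidecar-injection",  # assumed; not shown in the excerpt
                "state": {"waiting": {"reason": "CrashLoopBackOff"}},
                "lastState": {
                    "terminated": {"exitCode": 137, "reason": "OOMKilled"}
                },
            }
        ]
    }
}

print(oom_killed_init_containers(sample_pod))
```

In practice the dict would come from `json.load` over the `kubectl` output rather than a hand-written sample.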
The Scylla Operator project currently lacks enough contributors to adequately respond to all issues. This bot triages un-triaged issues according to the following rules:
You can:
/lifecycle stale |
/remove-lifecycle stale |
/triage accepted |
What happened?
Created a local cluster using k3s
Created the operator
Created the cluster and got an OOM kill on the init container
Edited the STS and increased the memory to 150Mi, after which the cluster got created
Unfortunately the init container limits seem to be hard-coded so there is no way to influence the allocation.
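The manual STS edit described above could be expressed as a JSON patch. This is only a sketch of the workaround, not a supported fix: it assumes the `sidecar-injection` init container is at index 0 of `initContainers`, `<sts-name>` is a placeholder for the actual StatefulSet name, `memory_patch` is a helper invented here, and the operator may overwrite the change when it reconciles the StatefulSet.

```python
import json


def memory_patch(init_index: int, memory: str) -> list:
    """Build JSON Patch operations that set an init container's memory request and limit."""
    base = f"/spec/template/spec/initContainers/{init_index}/resources"
    return [
        {"op": "replace", "path": f"{base}/limits/memory", "value": memory},
        {"op": "replace", "path": f"{base}/requests/memory", "value": memory},
    ]


# Index 0 is an assumption; check the StatefulSet spec for the real position.
patch = json.dumps(memory_patch(0, "150Mi"))

# Print the command instead of running it, so it can be reviewed first.
print(f"kubectl patch statefulset <sts-name> --type=json -p '{patch}'")
```

Requests and limits are kept equal here because that is how they appear in the generated pod spec.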
What did you expect to happen?
No OOM, or at least the ability to change the initContainer limits
How can we reproduce it (as minimally and precisely as possible)?
config.yaml
Create cluster
Scylla Operator version
v1.12.0-alpha.0-102-geb68db4
also reproducible on v1.11.0
Kubernetes platform name and version
reproduced on 1.21 & on 1.25
Kubernetes platform info:
Please attach the must-gather archive.
scylla-operator-must-gather-hdqcl4psgfqd.zip
Anything else we need to know?
No response