Issues: NVIDIA/gpu-operator
NOTICE: Containers losing access to GPUs with error: "Failed ...
#485
opened Feb 7, 2023 by
cdesiniotis
Open
gpu-operator error causes pods on time-sliced H100 node to restart intermittently
#1310
opened Mar 5, 2025 by
vinkamath
how to remove mig from a node if the nvidia-device-plugin pod does not start
#1309
opened Mar 5, 2025 by
okyspace
k8s-driver-manager container image is not included as part of the OLM Operator Bundle
#1294
opened Feb 26, 2025 by
tginer
Adding profile to 'mig.config' does not apply MIG configuration
#1286
opened Feb 19, 2025 by
simgyuryeol
GSP Firmware not loaded properly, make nvidia-vgpu-manager-daemonset CrashLoopBackOff
#1278
opened Feb 17, 2025 by
rjhaikal
nvidia.com/gpu-driver-upgrade-enabled: "true" even when the driver.enabled=false
#1277
opened Feb 15, 2025 by
davidshen84
Examples showing how to specify custom MIG config in values.yaml are misformatted
#1259
opened Feb 7, 2025 by
tomtseng
Nvidia driver daemonset does not run due to apt-cache issue.
#1244
opened Jan 30, 2025 by
ScottWatsonWork
Anyway to find if gpu operator has completed node discovery
#1216
opened Jan 21, 2025 by
SSushmitha8