OS/Version: Red Hat Enterprise Linux CoreOS release 4.12
Kernel Version: 4.18.0-372.69.1.el8_6.x86_64
Container Runtime Type/Version: CRI-O
OpenShift 4.12.29
GPU Operator Version: 23.9.1
2. Issue or feature description
The nvidia-driver-daemonset-xx pod reports "Startup probe failed: No devices were found" in its events, but the V100 GPU is visible on the host OS. Below is the "lspci" output.
@garyyang85 "No devices were found" typically indicates that GPU initialization failed. Can you get the system logs by running dmesg | grep -i nvrm on the host?
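In case it helps, here is a minimal sketch of collecting those logs. It assumes you either have a shell on the GPU node or have `oc` access from a workstation. The function name `filter_nvrm` and the `NODE` variable are hypothetical, introduced only for illustration.

```shell
# Keep only kernel log lines that mention the NVIDIA kernel module tag "NVRM".
# Reads log text on stdin, so it works with live dmesg output or a saved log.
# The "|| true" keeps the exit status at 0 when no line matches.
filter_nvrm() {
  grep -i 'nvrm' || true
}

# On the GPU node itself:
#   dmesg | filter_nvrm
#
# From a workstation (assumption: NODE holds the name of the GPU node):
#   oc debug "node/${NODE}" -- chroot /host dmesg | filter_nvrm
```

If the filter prints nothing at all, that is worth reporting too, since it may mean the NVIDIA kernel module logged nothing during startup.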
1. Quick Debug Information
3. Steps to reproduce the issue
Deploy the GPU Operator with the cluster-policy definition.