You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
ubuntu@hpc-03:~$ lxc info --show-log windows11-deleteme
Name: windows11-deleteme
Status: STOPPED
Type: virtual-machine
Architecture: x86_64
Location: hpc-04
Created: 2025/03/14 11:59 CET
Last Used: 2025/03/20 12:14 CET
Error: open /var/snap/lxd/common/lxd/logs/windows11-deleteme/qemu.log: no such file or directory
Issue description
one of our customers is trying to attach a gpu to an existing windows 11 vm, but can't make it work.
gpu attachment does work on a ubuntu desktop vm, which we stopped after validating it, for this report purpose.
$ lxc info --resources | grep -i nvidia -B10 -A10
...
Card 1:
NUMA node: 1
Vendor: NVIDIA Corporation (10de)
Product: GA107GL [A2 / A16] (25b6)
PCI address: 0000:81:00.0
Driver: nvidia (550.120)
DRM:
ID: 1
Card: card1 (226:1)
Control: controlD65 (226:1)
Render: renderD128 (226:128)
NVIDIA information:
Architecture: 8.6
Brand: Nvidia
Model: NVIDIA A2
CUDA Version: 12.4
NVRM Version: 550.120
UUID: GPU-4a335844-1d7d-07e8-a2c1-f750fec0994f
SR-IOV information:
Current number of VFs: 0
Maximum number of VFs: 16
Steps to reproduce
first steps were done through the lxd ui (the UI was served from node3/hpc-03).
tpm device is removed after installation, because sometimes vm instance gets stuck on start operation and tpm removal solved it (will re-validate this with customer and open separate issue if needed)
installed fresh Windows windows 11 24H2, with tpm device ; ok
turned Vm off with tpm attached; ok
turned VM on with tpm attached; ok
shut down Windows, tpm detached, start windows; ok
shut down Windows, attach GPU, start Windows ; ko cannot start instance
some more context
test for this bug report were done from a new fresh windows vm windows11-deleteme on a new instance config, following official tutorial
please can you clarify what specifically shut down Windows, attach GPU, start Windows ; ko cannot start instance means.
Does an LXD throws any error on lxc start <instance_name> command or it's just Windows doesn't boot anymore?
I was able to reproduce a behavior when Windows 11 does not boot after attaching an external GPU and when NVIDIA Display driver is installed inside. In my case symptom is that Windows boot process is getting stuck with a UEFI firmware logo and windows loading spinner. Are you experiencing the same symptoms or not?
Please confirm
Distribution
Ubuntu server
Distribution version
22.04.5 LTS
Output of "snap list --all lxd core20 core22 core24 snapd"
ubuntu@hpc-01:~/offline_snaps$ $ snap list Name Version Rev Tracking Publisher Notes core 16-2.61.4-20240607 17200 - canonical✓ core core22 20250110 1748 - canonical✓ base core24 20240920 609 - canonical✓ base lxd 5.21.3-75def3c 32455 - canonical✓ - microceph 19.2.0+snap2fbf0bad05 1271 - canonical✓ - microcloud 2.1.0-3e8b183 1144 - canonical✓ - microovn 24.03.2+snapa2c59c105b 667 - canonical✓ - snapd 2.67 23545 - canonical✓ snapd
Output of "lxc info" or system info if it fails
ubuntu@hpc-03:~$ lxc info --show-log windows11-deleteme Name: windows11-deleteme Status: STOPPED Type: virtual-machine Architecture: x86_64 Location: hpc-04 Created: 2025/03/14 11:59 CET Last Used: 2025/03/20 12:14 CET Error: open /var/snap/lxd/common/lxd/logs/windows11-deleteme/qemu.log: no such file or directory
Issue description
one of our customers is trying to attach a gpu to an existing windows 11 vm, but can't make it work.
gpu attachment does work on a ubuntu desktop vm, which we stopped after validating it, for this report purpose.
GPU details
NVIDIA Ampere A2, PCIe, 60W, 16GB Passive
Steps to reproduce
first steps were done through the lxd ui (the UI was served from node3/hpc-03).
tpm device is removed after installation, because sometimes vm instance gets stuck on start operation and tpm removal solved it (will re-validate this with customer and open separate issue if needed)
some more context
windows11-deleteme
on a new instance config, following official tutorialwe launched, on same node where the instance was located,
sudo journalctl -u snap.lxd.daemon -n 300
andtail -n 300 /var/snap/lxd/common/lxd/logs/lxd.log
but nothing relevant there, except some noise feat(ux,warning messages): spam on Error getting disk usage / Cannot get disk usage of unmounted volume when ceph.rbd.du is false #15254probably not related, but in case it gives some hint, on some previous instance tests, also with same win11 image, but on another instance
Information to attach
dmesg
)lxc info NAME --show-log
)lxc config show NAME --expanded
)/var/log/lxd/lxd.log
or/var/snap/lxd/common/lxd/logs/lxd.log
)--debug
--debug
(or uselxc monitor
while reproducing the issue)The text was updated successfully, but these errors were encountered: