-
-
Notifications
You must be signed in to change notification settings - Fork 48
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Whonix DispVM Fails with "Got empty response from qubesd" "KeyError" unhandled exception while calling src=b'dom0' meth=b'admin.vm.List' #5121
Comments
Does this happen with non-Whonix DisposableVMs?
Note that |
Hi Andrew, thanks for the fast response :)
No. I only experience this issue with the whonix DisposableVM.
Noted, thank you. |
The error is happening on |
No, so this indeed looks different. |
I just timed it. For two runs, the time it takes from initiating the
|
Sure, here it is:
|
This is the problem. Still looks very similar to #4918 . Can you confirm that setting |
Here's the output before running
And here's the query after initiating the
And note from my comment above that I validated that this DispVM halts prematurely before 40 seconds. iirc even the default is 60, yes? |
Yes, should be 60 sec by default... Maybe for some reason qrexec_timeout isn't applied at all? Lets go deeper:
(you may need to install strace in dom0 first) |
Ok, I installed strace with Indeed, it appears to pass the
|
I assume there wasn't 5 minutes before that failure, right? |
Another idea - check To get some deeper info, lets add
You should see somewhere a call like |
It took 36 seconds between the time I executed the It does take 5 minutes (actually, 5 minutes and 30 seconds) for the
Here you go:
I did a
I haven't noticed any issues with time drift on my system's time clock. And I'm not making any modifications to my system's cpu clock. fwiw, I have an Asus ux305 laptop. |
ok, so I increased the "initial memory" of the DispVM Template = Thanks for your assistance in this! But I still don't understand why it wouldn't ever encounter issues when I attempted to start the DispVM Template itself.. Is 400M supposed to be generally sufficient for a whonix DispVM? Also, the max is 4000M (note the extra zero), so I'd expect it to grow to 4000M = 4G without crashing, right? Is this a bug? My laptop has 8G of RAM, and I conducted these tests this morning with 5 VMs running. Just checking my RAM usage (after the tests) without the whonix DispVM running, Edit: I also didn't get any |
See also #4135 |
Those number (initial 400MB, max 4000MB) should be ok. Do you use some custom kernel that could lack balloon driver? Or some custom kernel parameters? Otherwise, balloon driver in linux kernel should give back the extra 3.6GB to Xen, before starting any userspace process (systemd etc). And you clearly see those processes starting, no? |
I have made no modifications to the kernel or kernel parameters.
Sorry, I know nothing about this Xen balloon driver. How can I trace the memory given to the VM (and returned to Xen)? Is there some log file I can watch to better understand what's happening?
I dropped the initial RAM down to 400M and attempted to start a new whonix DispVM to answer this question. I tail'd the guest console log, and I confirmed that it made it all the way to the final Interestingly, when I was searching for 'systemd' in the guest console log, I saw these lines:
That may or may not be an issue. Indeed, I also found a line suggesting that the Xen balloon driver was (attempted to be) loaded.
|
I encountered this issue as well and was able to fix it by closing some VMs in order to free up memory. |
Qubes 4.0.3, Xen-4.8.5-28.fc25. I ran into similar problem after switching from Ubuntu bionic template to Ubuntu focal by unman. Dependent VMs with large private storage crash on first boot under new template with error:
In 'xl dmesg' output I saw:
Trying different VM kernels didn't help. Affected VMs all have memory balancing enabled and balloon driver loading, 400 MB initial RAM, 4000 MB max RAM. What they have in common is that they have private storage max size of at least 20-30 GB; other VMs with less than 10 GB private storage don't crash with new template. What seems to help is slightly increasing the VM private storage max size via Qubes Manager, and then booting the affected VM once with large initial RAM assignment, 2-3 GB. After it boots successfully once, I can lower initial RAM assignment back to 400 MB, and VMs will boot successfully. |
Similar issue is fixed in R4.1 here:
I guess I need to backport those too. |
fwiw, when a DispVM halts immediately after starting, I usually just launch it again with |
This issue is being closed because:
If anyone believes that this issue should be reopened and reassigned to an active milestone, please leave a brief comment. |
Qubes OS version:
Qubes release 4.0 (R4.0)
Affected component(s) or functionality:
whonix-ws-14-dvm DispVm
Steps to reproduce the behavior:
The dom0 terminal where I launched the VM eventually returns this (after some delay following the "halted"
notify-send
:Per the msg above, I left
journalctl -f
running. In-between the "started" notify-send and the "halted" notify-send, it outputted the following:As for the guest console log, sometimes it finishes as expected with the "host login:" line, and sometimes it ends prematurely. This log doesn't appear to be relevant; the issue appears to lie in dom0 (I think).
I never have this issue when attempting to launch the actual TemplateVM = 'whonix-ws-14-dvm'.
Expected or desired behavior:
I can use apps in my new disposable whonix 14 vm without it halting prematurely
Actual behavior:
The VM halts unexpectedly.
General notes:
Unfortunately, this is not 100% reproducible. About 5% of the time, my whonix DispVM starts as desired. 95% of the time, it just halts prematurely.
I up'd the qrexec_timeout to 300 to be sure, no change.
I checked /var/cache on the TemplateVM 'whonix-ws-14-dvm', which is 301M, and there is only one
tor-browser
dir in/var/cache/tb-browser/.tb
-- no backuptor-browser
dirs.I have consulted the following relevant documentation:
https://dev.qubes-os.org/projects/core-admin-client/en/latest/manpages/qvm-run.html
I am aware of the following related, non-duplicate issues:
#4918
The text was updated successfully, but these errors were encountered: