Allow nvproxy to be enabled on platform/kvm #11436
Labels: type: enhancement (New feature or request)
Description
Per https://github.com/google/gvisor/blob/master/g3doc/proposals/nvidia_driver_proxy.md, nvproxy is currently incompatible with platform/kvm for two reasons:
1. platform/kvm configures all guest page table entries to enable write-back caching, which is not generally correct for device mappings. On Intel CPUs, KVM mostly mitigates this by mapping uncacheable and write-combining pages as UC in EPT[^1]; the hardware takes roughly the intersection of the EPT and guest page table memory type[^2]. On AMD CPUs, the hardware has the same capability[^3], but KVM does not configure it[^4], so guest page table entries must set the memory type correctly to obtain correct behavior. I don't think plumbing memory type through the sentry should be too difficult (see the sketch after this list), but matching the kernel driver's memory types will be tricky; we will probably have to approximate conservatively in some cases, which may degrade performance.
2. Mappings of any given file offset in `/dev/nvidia-uvm` must be at the matching virtual address. In KVM, providing mappings of `/dev/nvidia-uvm` to the guest requires mapping it into the KVM-using process's (the sentry's) address space and then forwarding it into the guest using a KVM memslot. Thus any UVM mapping at an offset that conflicts with an existing sentry mapping is unimplementable. AFAIK this only affects CUDA - Vulkan and NVENC/DEC do not use UVM - so this should no longer block use of nvproxy with platform/kvm in any case. But it is still a problem for CUDA.
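For the first reason, plumbing a memory type through the sentry ultimately bottoms out in the flag bits platform/kvm writes into guest page table entries. The sketch below is not gVisor code; `MemoryType` and `pteFlagsFor` are hypothetical names, and it assumes the architectural power-on default PAT (PA0=WB, PA1=WT, PA2=UC-, PA3=UC), under which write-combining cannot be expressed at all without also reprogramming the guest's IA32_PAT MSR.

```go
// Package memtype illustrates how a requested caching mode could be encoded
// into x86-64 page table entry bits. Hypothetical sketch, not gVisor code.
package memtype

// PTE flag bits that select a PAT entry for 4 KiB mappings. (For 2 MiB and
// 1 GiB mappings the PAT bit moves to bit 12.)
const (
	ptePWT uint64 = 1 << 3 // Page-level Write-Through
	ptePCD uint64 = 1 << 4 // Page-level Cache Disable
	ptePAT uint64 = 1 << 7 // PAT bit; selects PA4-PA7, unused here
)

// MemoryType is a requested caching mode for a device or memory mapping.
type MemoryType int

const (
	WriteBack MemoryType = iota
	WriteThrough
	UncacheableMinus // UC-: uncacheable unless MTRRs allow write-combining
	Uncacheable
)

// pteFlagsFor returns the PWT/PCD bits selecting a PAT entry for mt, assuming
// the architectural default PAT (PA0=WB, PA1=WT, PA2=UC-, PA3=UC). Note that
// write-combining is not representable without reprogramming IA32_PAT, which
// is one reason matching the NVIDIA driver's memory types exactly is hard.
func pteFlagsFor(mt MemoryType) uint64 {
	switch mt {
	case WriteBack:
		return 0 // PAT index 0: WB
	case WriteThrough:
		return ptePWT // PAT index 1: WT
	case UncacheableMinus:
		return ptePCD // PAT index 2: UC-
	default:
		return ptePCD | ptePWT // PAT index 3: UC
	}
}
```

On AMD, where KVM does not apply a memory type in the NPT, these guest PTE bits are the only lever the sentry has, which is why they need to be correct rather than uniformly write-back.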
Experimentally:

- `/dev/nvidia-uvm` is mapped at a fixed address (in `cuda_malloc_test` this happens to be 0x205000000, but I'm unsure if this is consistent between binaries). This happens to work (at least in my minimal testing) if `nvproxy.uvmFDMemmapFile.MapInternal()` is changed to attempt a mapping using `MAP_FIXED_NOREPLACE` (see the sketch after this list).
- `cudaMallocManaged()` reserves application address space using `mmap(MAP_PRIVATE|MAP_ANONYMOUS)` and then overwrites the reservation mapping with a `MAP_FIXED` mapping of `/dev/nvidia-uvm`, which sometimes collides with an existing sentry mapping depending on ASLR for both the sentry and application.
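The `MAP_FIXED_NOREPLACE` experiment in the first bullet can be sketched as follows. This is not the real `nvproxy.uvmFDMemmapFile.MapInternal()`; `mapUVMAt` is a hypothetical helper that only shows the relevant mmap flags: the mapping is requested at the virtual address equal to the file offset, and `MAP_FIXED_NOREPLACE` makes the kernel return `EEXIST` instead of silently replacing an existing sentry mapping the way `MAP_FIXED` would.

```go
// Package uvmmap sketches mapping /dev/nvidia-uvm at its required address.
// Hypothetical illustration, not the actual nvproxy implementation.
package uvmmap

import (
	"fmt"

	"golang.org/x/sys/unix"
)

// mapUVMAt maps length bytes of the nvidia-uvm FD at the virtual address
// equal to offset, failing rather than replacing any existing mapping.
func mapUVMAt(uvmFD int, offset, length uintptr) (uintptr, error) {
	// UVM requires a mapping of file offset X to live at virtual address X,
	// so the offset doubles as the requested address. MAP_FIXED_NOREPLACE
	// surfaces a collision with an existing sentry mapping as EEXIST instead
	// of clobbering it (which plain MAP_FIXED would do).
	addr, _, errno := unix.Syscall6(unix.SYS_MMAP,
		offset, // addr: must equal the file offset
		length,
		uintptr(unix.PROT_READ|unix.PROT_WRITE),
		uintptr(unix.MAP_SHARED|unix.MAP_FIXED_NOREPLACE),
		uintptr(uvmFD),
		offset, // file offset
	)
	if errno != 0 {
		// Typically EEXIST: the range collides with an existing sentry
		// mapping, so this UVM mapping is unimplementable under KVM.
		return 0, fmt.Errorf("mmap(%#x, %#x) of /dev/nvidia-uvm: %w", offset, length, errno)
	}
	return addr, nil
}
```

Per the second bullet, `cudaMallocManaged()` remains a problem regardless: its later `MAP_FIXED` remap of `/dev/nvidia-uvm` can still land on a sentry mapping, depending on ASLR.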
Options:

- Leave `uvmFDMemmapFile.MapInternal()` unchanged, which will unconditionally cause CUDA binaries to fail on platform/kvm.
- Use `MAP_FIXED_NOREPLACE` for the sentry mapping. This should cause CUDA binaries that don't use `cudaMallocManaged()` to succeed (good), but will cause CUDA binaries that do to flake (bad). This is probably unacceptable for use cases that run arbitrary GPU workloads, but might be fine for others; AFAIK use of `cudaMallocManaged()` is uncommon for performance reasons.
- Use `MAP_FIXED_NOREPLACE`, and use a custom ELF loader to load the sentry in an address range that is less likely to collide with application addresses. I'm not sure this would actually work since IIUC the interfering sentry mappings are from runtime mmaps.
- Use `MAP_FIXED_NOREPLACE`, and also try to avoid returning application mappings that would collide with existing sentry mappings. This is racy and leaks information about the sentry to applications, so it's probably undesirable.
- Disallow nvproxy + platform/kvm when `NVIDIA_DRIVER_CAPABILITIES` contains `compute` (a sketch of such a check follows this list). I mention this option mostly to rule it out; some containers in practice specify `NVIDIA_DRIVER_CAPABILITIES=all` even if only e.g. graphics support is required, and in fact NVIDIA's Vulkan support requires `libnvidia-gpucomp.so`, which `libnvidia-container` only mounts when `--compute` is specified[^5], so this is necessary!
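For that last option, the gate would presumably look something like the check below. This is a hypothetical sketch (not gVisor or libnvidia-container code); it mainly shows why such a gate would also trip for containers that set `NVIDIA_DRIVER_CAPABILITIES=all`, which is why the option is ruled out.

```go
// Package nvcaps sketches a check of NVIDIA_DRIVER_CAPABILITIES.
// Hypothetical illustration only.
package nvcaps

import (
	"os"
	"strings"
)

// requestsCompute reports whether the comma-separated capability list asks
// for the "compute" capability, either explicitly or via "all".
func requestsCompute(caps string) bool {
	for _, c := range strings.Split(caps, ",") {
		switch strings.TrimSpace(c) {
		case "compute", "all":
			return true
		}
	}
	return false
}

// A gate of the form "disallow nvproxy + platform/kvm when compute is
// requested" would therefore also reject NVIDIA_DRIVER_CAPABILITIES=all,
// which many graphics-only containers use in practice.
func computeRequestedFromEnv() bool {
	return requestsCompute(os.Getenv("NVIDIA_DRIVER_CAPABILITIES"))
}
```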
Is this feature related to a specific bug?

No response

Do you have a specific solution in mind?

No response
Footnotes

[^1]: Linux: `arch/x86/kvm/mmu/spte.c:make_spte()` => `kvm_is_mmio_pfn()`, `arch/x86/kvm/vmx/vmx.c:vmx_get_mt_mask()`
[^2]: Intel 64 and IA-32 Software Developer's Manual, Vol. 3, Sec. 30.3.7.2, "Memory Type Used for Translated Guest-Physical Addresses"
[^3]: AMD64 Architecture Programmer's Manual, Vol. 2, Sec. 15.25.8, "Combining Memory Types, MTRRs"
[^4]: SVM does not set `shadow_memtype_mask` or implement the `get_mt_mask` static call; see also fc07e76ac7ff ('Revert "KVM: SVM: use NPT page attributes"')
[^5]: https://github.com/NVIDIA/libnvidia-container/blob/95d3e86522976061e856724867ebcaf75c4e9b60/src/nvc_info.c#L85