Reduce CPU load during GPU initialization #118

maxpla3 · 2023-08-10T16:32:35Z

The CPU load during initialization is constantly at 100% for each postcli process. This has been reported from many users.

This issue is due to the specific Nvidia implementation of the OpenCL synchronization. The enqueued buffer read operation is constantly probing the status of the running OpenCL kernel, which puts the CPU under high load.

A workaround would be to put the CPU thread in sleep for a defined duration right after enqueuing the kernel, and only then enqueue a buffer read. The sleep duration can be obtained by averaging the execution time over a number of kernel executions and then subtracting a safety factor from it (e.g. sleep duration is 90% of kernel execution).
Ideally, the kernel execution time should be measured periodically by disabling the sleep for a couple of kernel executions every few seconds. This ensures that if the kernel execution time decreases below the sleep duration (e.g. becuase the user increased the power limit of the GPU), the decrease is properly detected.

Tests on a RTX 3080 Ti show, that a sleep duration of 25ms reduces the CPU load to 10% while maintaining the initialization speed. While a further increase of the sleep duration to 30ms reduced the CPU load to 2.5% it has a negative impact on the initialization speed.

sleep (ms)	CPU load (%)	init. speed (MiB/s)
0	100	3.30
25	10	3.30
30	2.5	2.75

KVolc5O1 · 2023-08-10T16:42:51Z

I experience the same issue on my node. Would be nice if there was a fix to save some electricity 👍

poszu · 2023-08-11T14:29:42Z

Thanks for creating the issue @maxpla3 👍 . I will take it as soon as I finish other tasks.

poszu self-assigned this Aug 11, 2023

poszu added enhancement New feature or request good first issue Good for newcomers labels Aug 11, 2023

poszu added this to Dev team kanban Aug 11, 2023

github-project-automation bot moved this to 📋 Backlog in Dev team kanban Aug 11, 2023

poszu moved this from 📋 Backlog to 🏗 Doing in Dev team kanban Feb 15, 2024

poszu mentioned this issue Feb 15, 2024

Reduce cpu load during init #188

Merged

poszu closed this as completed in #188 Feb 16, 2024

github-project-automation bot moved this from 🏗 Doing to ✅ Done in Dev team kanban Feb 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce CPU load during GPU initialization #118

Reduce CPU load during GPU initialization #118

maxpla3 commented Aug 10, 2023 •

edited

Loading

KVolc5O1 commented Aug 10, 2023

poszu commented Aug 11, 2023

Reduce CPU load during GPU initialization #118

Reduce CPU load during GPU initialization #118

Comments

maxpla3 commented Aug 10, 2023 • edited Loading

KVolc5O1 commented Aug 10, 2023

poszu commented Aug 11, 2023

maxpla3 commented Aug 10, 2023 •

edited

Loading