-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Validate CUDA SHM region registration size #7178
Conversation
krishung5
commented
May 2, 2024
•
edited
Loading
edited
- Validate CUDA SHM region size during registration
- Add CUDA SHM registration tests
- Refactor tests for CUDA SHM and System SHM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Refactor and general approach LGTM! Only minor comments, thanks for tackling this so quickly @krishung5 !
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
want to review level of detail passed back to client / user of api
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Want to normalize to a single error message for "failed to register shared memory .. invalid args"
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Minor suggestions. Thanks for refactoring the tests
Made the error message less informative for |
src/shared_memory_manager.cc
Outdated
@@ -255,11 +255,57 @@ OpenCudaIPCRegion( | |||
cudaError_t err = cudaIpcOpenMemHandle( | |||
data_ptr, *cuda_shm_handle, cudaIpcMemLazyEnablePeerAccess); | |||
if (err != cudaSuccess) { | |||
// Should not pass the detailed error message back to the client. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
// Should not pass the detailed error message back to the client. | |
// Log detailed error message and send generic error to client |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Updated!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
error logging looks good - one slight suggested tweak to comment