Update RMM tests based on deprecated CNMeM #359

jakirkham · 2020-08-12T02:52:04Z

The recent RMM PR ( rapidsai/rmm#466 ) deprecated CNMeM and made some changes for handling memory resources for multiple devices (needed by XGBoost). As a result, we are seeing a few test failures in Dask-CUDA. This makes the necessary changes to update these tests.

As `get_default_resource_type` was dropped in RMM recently, use the newly introduce `rmm.mr.get_per_device_resource` instead to access the resource on device `0` (configured to a unique device for each worker). Since the memory resource itself is not realistically serializable, instead grab the type of each resource to send back. This is all done within a `lambda` to allow for a function that can be run on each worker.

As CNMeM has been dropped from Python and replaced with RMM's own pool resource, just check for that pool resource instead.

harrism

I think there's a function for that. :)

dask_cuda/tests/test_dask_cuda_worker.py

dask_cuda/tests/test_local_cuda_cluster.py

Simplify the resource type checks a bit. Thanks Mark! :) Co-authored-by: Mark Harris <[email protected]>

jakirkham · 2020-08-12T03:03:57Z

Much cleaner. Thanks Mark! :)

codecov-commenter · 2020-08-12T03:13:24Z

Codecov Report

Merging #359 into branch-0.15 will increase coverage by 0.39%.
The diff coverage is n/a.

@@               Coverage Diff               @@
##           branch-0.15     #359      +/-   ##
===============================================
+ Coverage        59.65%   60.04%   +0.39%     
===============================================
  Files               17       17              
  Lines             1321     1334      +13     
===============================================
+ Hits               788      801      +13     
  Misses             533      533

Impacted Files	Coverage Δ
dask_cuda/device_host_file.py	`98.64% <0.00%> (+0.03%)`	⬆️
dask_cuda/cli/dask_cuda_worker.py	`96.77% <0.00%> (+0.05%)`	⬆️
dask_cuda/initialize.py	`92.59% <0.00%> (+0.28%)`	⬆️
dask_cuda/_version.py	`44.80% <0.00%> (+0.39%)`	⬆️
dask_cuda/is_device_object.py	`88.88% <0.00%> (+3.88%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 5f06e05...cd4b9ac. Read the comment docs.

pentschev

LGTM, thanks @jakirkham !

jakirkham added 3 commits August 11, 2020 19:43

Access memory resources from public module

58013f8

Test for RMM's PoolMemoryResource

69680e7

As CNMeM has been dropped from Python and replaced with RMM's own pool resource, just check for that pool resource instead.

jakirkham requested a review from a team as a code owner August 12, 2020 02:52

harrism suggested changes Aug 12, 2020

View reviewed changes

dask_cuda/tests/test_dask_cuda_worker.py Outdated Show resolved Hide resolved

dask_cuda/tests/test_local_cuda_cluster.py Outdated Show resolved Hide resolved

Use get_current_device_resource_type

cd4b9ac

Simplify the resource type checks a bit. Thanks Mark! :) Co-authored-by: Mark Harris <[email protected]>

This was referenced Aug 12, 2020

Fix a black error in explicit comms #360

Merged

Fix an isort error #361

Merged

pentschev approved these changes Aug 12, 2020

View reviewed changes

pentschev merged commit 859ea21 into rapidsai:branch-0.15 Aug 12, 2020

jakirkham deleted the fix_mr_tsts_cnmem_dep branch August 13, 2020 07:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update RMM tests based on deprecated CNMeM #359

Update RMM tests based on deprecated CNMeM #359

jakirkham commented Aug 12, 2020

harrism left a comment

jakirkham commented Aug 12, 2020

codecov-commenter commented Aug 12, 2020 •

edited

Loading

pentschev left a comment

Update RMM tests based on deprecated CNMeM #359

Update RMM tests based on deprecated CNMeM #359

Conversation

jakirkham commented Aug 12, 2020

harrism left a comment

Choose a reason for hiding this comment

jakirkham commented Aug 12, 2020

codecov-commenter commented Aug 12, 2020 • edited Loading

Codecov Report

pentschev left a comment

Choose a reason for hiding this comment

codecov-commenter commented Aug 12, 2020 •

edited

Loading