Offload and modules with unused submodules #442

sgugger · 2022-06-13T19:56:18Z

This PR deals with models with unused submodules combined with offload (on CPU and disk). By unused submodules, we mean a something like the ModuleWithUnusedSubModules introduced in the test file in this PR: a module that defines a submodule, but does not call it during its forward, just does some operations using its weights.

Currently, the offload will fail for such modules because the weights are fetched in a pre-forward hook and the forward method is never called. This PR fixes that by introducing a new argument named load_all_weights_classes (if you have any better suggestion, please let me know as I'm not super happy with this name) which contains the name of such submodules used (like no_split_module_classes contains the names of the modules that shouldn't be split across devices).

HuggingFaceDocBuilderDev · 2022-06-13T19:59:24Z

The documentation is not available anymore as the PR was closed or merged.

muellerzr

Thanks! The fix makes sense to me, left one naming suggestion/alternative

src/accelerate/hooks.py

pacman100

In attach_align_device_hook_on_block, I think below change is needed

if not isinstance(execution_device, Mapping):
-        execution_device = {key: offload for key in offload.keys()}
+        execution_device = {key: execution_device for key in offload.keys()}

Left a minor suggestion too.
Apart from that everything LGTM!

src/accelerate/hooks.py

Co-authored-by: Sourab Mangrulkar <[email protected]>

Offload and modules with unused submodules

23362f4

sgugger requested review from LysandreJik and muellerzr June 13, 2022 19:56

muellerzr approved these changes Jun 13, 2022

View reviewed changes

src/accelerate/hooks.py Outdated Show resolved Hide resolved

Renaming

1f5b11d

sgugger requested a review from pacman100 June 14, 2022 14:27

pacman100 reviewed Jun 17, 2022

View reviewed changes

src/accelerate/hooks.py Outdated Show resolved Hide resolved

sgugger and others added 2 commits June 17, 2022 14:29

Update src/accelerate/hooks.py

4977cd4

Co-authored-by: Sourab Mangrulkar <[email protected]>

Address review comment

81e9bdb

sgugger merged commit eeaba59 into main Jun 18, 2022

sgugger deleted the cpu_offload_unused_subs branch June 18, 2022 00:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Offload and modules with unused submodules #442

Offload and modules with unused submodules #442

sgugger commented Jun 13, 2022

HuggingFaceDocBuilderDev commented Jun 13, 2022 •

edited

Loading

muellerzr left a comment

pacman100 left a comment

Offload and modules with unused submodules #442

Offload and modules with unused submodules #442

Conversation

sgugger commented Jun 13, 2022

HuggingFaceDocBuilderDev commented Jun 13, 2022 • edited Loading

muellerzr left a comment

Choose a reason for hiding this comment

pacman100 left a comment

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Jun 13, 2022 •

edited

Loading