[`core`] Add `max_grad_norm` support #177

younesbelkada · 2023-02-27T15:16:28Z

What does this PR do?

This PR adds the support of clipping gradients norm, that was not supported before but implemented in other frameworks such as clean RL: https://github.com/vwxyzjn/cleanrl/blob/2e41da2a3649c50f27121d74896110fe8f69dd52/cleanrl/ppo.py#L286

This PR also adds a nice test

I can confirm multi-GPU training works with this PR (ran 1 PPO step of gpt2-sentiment with 2 GPU)

HuggingFaceDocBuilderDev · 2023-02-27T15:19:28Z

The documentation is not available anymore as the PR was closed or merged.

tests/test_ppo_trainer.py

edbeeching

LGTM, thanks.

lvwerra

LGTM, thanks for adding @younesbelkada!

add max_grad_norm support

eb2ab5e

younesbelkada commented Feb 27, 2023

View reviewed changes

tests/test_ppo_trainer.py Outdated Show resolved Hide resolved

Update tests/test_ppo_trainer.py

6a83f6e

younesbelkada requested review from lvwerra, edbeeching and natolambert February 27, 2023 15:31

edbeeching approved these changes Feb 27, 2023

View reviewed changes

lvwerra approved these changes Feb 28, 2023

View reviewed changes

younesbelkada merged commit 8855022 into huggingface:main Feb 28, 2023

younesbelkada deleted the add-grad-clipping branch February 28, 2023 09:52