-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] Gather Stat is not working for DeepSpeed? #960
Comments
looks like a valid fix to me! 👍 |
maybe I will try to make a pull request for this. Also, I feel like there are some redundant for loop that we can avoid in the code |
Hi everyone, yes this looks like a reasonable change, would you be happy to open a quick PR for it? |
I have created the simple PR fix. Please help review: #997 |
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. |
I guess the issue is mainly caused by this line:
trl/trl/trainer/ppo_trainer.py
Line 333 in 6ff0fac
The
self.accelerator.distributed_type
isDEEPSPEED
when I use DeepSpeed Zero stage 2 withaccelerate
.It works when I change this line to
But I think, to make this more general, we can try something like
Any suggestions?
The text was updated successfully, but these errors were encountered: