Training loop temporarily hangs after every 4 steps #1322
Comments
Hi! Thanks for your contribution, great first issue!
@PyTorchLightning/core-contributors
Thanks for the issue! Would it be possible to post the code that reproduces this error? I've only seen this sort of behaviour before when the number of data loading workers is low - are you working with large data here (e.g. big images)?
I increased the number of workers and it works perfectly now, thank you very much! You can close this issue.
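For anyone landing here with the same symptom, here is a minimal sketch of the fix that worked above: raise `num_workers` on the `DataLoader` that feeds the Trainer. `num_workers` is a standard `torch.utils.data.DataLoader` argument; the helper names `suggest_num_workers` and `build_loader`, the `reserve` and `cap` parameters, and the "leave one core free" rule of thumb are assumptions made for illustration, not something from this thread.

```python
import os


def suggest_num_workers(reserve=1, cap=None):
    """Hypothetical helper: pick a worker count from the CPU count.

    Leaves `reserve` cores for the main training process (a rule of
    thumb, not an official recommendation) and optionally caps the result.
    """
    cpus = os.cpu_count() or 1  # os.cpu_count() may return None
    workers = max(1, cpus - reserve)
    if cap is not None:
        workers = min(workers, cap)
    return workers


def build_loader(dataset, batch_size=32):
    """Hypothetical helper: build a DataLoader with several worker processes."""
    # Imported here so the helper above can be used without PyTorch installed.
    from torch.utils.data import DataLoader

    return DataLoader(
        dataset,
        batch_size=batch_size,
        shuffle=True,
        num_workers=suggest_num_workers(),  # the setting that fixed the stalls
    )
```

The best value depends on how expensive each sample is to load, so it is probably worth trying a few settings and watching whether the periodic pause goes away.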
Should we throw a warning when users use few workers?
If possible, sure! Seems like an obvious solution now, but it could save a couple of hours for other people. :)
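A rough sketch of what such a warning could look like, assuming a plain check on the worker count. This is not what Lightning actually implements: the function name `warn_if_few_workers`, the `threshold` default, the `cpu_count` override and the message text are all hypothetical.

```python
import os
import warnings


def warn_if_few_workers(num_workers, threshold=2, cpu_count=None):
    """Hypothetical check: warn when a loader uses few workers on a multi-core machine.

    Returns True if a warning was emitted, False otherwise.
    """
    cpus = cpu_count if cpu_count is not None else (os.cpu_count() or 1)
    if num_workers <= threshold and cpus > threshold:
        warnings.warn(
            f"The dataloader uses num_workers={num_workers}, which may be a "
            f"bottleneck. This machine reports {cpus} CPUs; consider "
            f"increasing num_workers.",
            UserWarning,
            stacklevel=2,
        )
        return True
    return False
```

Calling it with the loader's `num_workers` before training starts would surface the hint once, without changing any behaviour.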
I am porting some of my code to PyTorch Lightning, and everything seems to work fine. However, for some reason, after every 4 training steps I see a temporary hang (~1 second), which is severely slowing down my overall training time. Am I missing some obvious configuration? This is my Trainer configuration: