-
-
Notifications
You must be signed in to change notification settings - Fork 391
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WandB batch metrics logging error #1290
Comments
Hi! Thank you for your contribution! Please re-check all issue template checklists - unfilled issues would be closed automatically. And do not forget to join our slack for collaboration. |
maybe, @AyushExel could help 👀 |
|
@ivan-chai I remember working on this. the reason that the batch metrics are also logged on epoch level is that W&B supports 1 global step and you cannot edit or log at previous steps, so I decided to log everything on epoch level. There are other methods that will end up dropping data if any previous step is encountered. |
Thank you for your response! May be it is better to always log sample_step? |
yeah, I think, we could just use |
@Scitator @ivan-chai Is the |
Yes ;) |
@AyushExel could you please drop a PR with such a small hotfix? |
@Scitator sounds good. I'll do it within 2 days |
@Scitator I'm working on it now. I noticed that the logger API now supports artifacts as well. I'll add that too. I just need a bit of clarification on the usage.
here, |
yup, looks like so
here is another example - catalyst/catalyst/loggers/neptune.py Line 281 in e1d78b7
|
btw, @ditwoo why do we use the profiler in such a way? 😂 |
@Scitator in the 2nd example that you linked, it'll raise exception if both |
@AyushExel yes, and I think it's correct behavior. |
Okay, thanks for the clarification. I'll update the logger with the intended artifacts use case. |
@Scitator I have fixed this and added artifacts support. Can you provide an example training script using catalyst trainer which log artifacts? I want to check all the use cases before submitting the PR. Sorry, I couldn't find something relevant in the quickstart section of docs. |
btw, we are going to release another version tomorrow, so, the fix would truly welcome ;) |
Just running tests. PR coming soon |
🐛 Bug Report
In wandb all batch metrics are logged as single value per epoch.
Expected behavior
Batch metrics must be logged once per step.
Additional context
The problem is here:
https://github.com/catalyst-team/catalyst/blob/master/catalyst/loggers/wandb.py#L115
Step must be equal to global_sample_step, not global_epoch_step.
The text was updated successfully, but these errors were encountered: