[bugfix] Accumulated_gradient and TensoBoard #4738

tchaton · 2020-11-18T12:12:34Z

What does this PR do?

This PR tries to improve the logging display on TensorBoard when using accumulated_grad_batches > 1.
It also introduces log_epoch_metrics_on_step parameter within the trainer.

log_epoch_metrics_on_step idea was dropped !

Fixes #4304

log_epoch_metrics_on_step = True

log_epoch_metrics_on_step = False

Before submitting

Was this discussed/approved via a Github issue? (no need for typos and docs improvements)
Did you read the contributor guideline, Pull Request section?
Did you make sure your PR does only one thing, instead of bundling different changes together? Otherwise, we ask you to create a separate PR for every change.
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?
Did you verify new and existing tests pass locally with your changes?
If you made a notable change (that affects users), did you update the CHANGELOG?

PR review

Anyone in the community is free to review the PR once the tests have passed.
Before you start reviewing make sure you have read Review guidelines. In in short, see following bullet-list:

Is this pull request ready for review? (if not, please submit in draft mode)
Check that all items from Before submitting are resolved
Make sure the title is self explanatory and the description concisely explains the PR
Add labels and milestones (and optionally projects) to the PR so it can be classified; Bugfixes should be including in bug-fix release milestones (m.f.X) and features should be included in (m.X.b) releases.

Did you have fun?

Make sure you had fun coding 🙃

pep8speaks · 2020-11-18T12:12:38Z

Hello @tchaton! Thanks for updating this PR.

In the file pytorch_lightning/trainer/connectors/logger_connector/logger_connector.py:

Line 193:121: E501 line too long (122 > 120 characters)
Line 194:121: E501 line too long (134 > 120 characters)

Comment last updated at 2020-11-25 12:01:40 UTC

…thub.com/PyTorchLightning/pytorch-lightning into bugfix/4304_tensorboard_accumulated_grad

codecov · 2020-11-18T16:19:50Z

Codecov Report

Merging #4738 (f5cb188) into master (d24a267) will increase coverage by 0%.
The diff coverage is 100%.

@@          Coverage Diff           @@
##           master   #4738   +/-   ##
======================================
  Coverage      93%     93%           
======================================
  Files         118     118           
  Lines        9031    9033    +2     
======================================
+ Hits         8403    8405    +2     
  Misses        628     628

pytorch_lightning/trainer/connectors/logger_connector/logger_connector.py

pytorch_lightning/trainer/trainer.py

tchaton · 2020-11-19T08:45:35Z

@tchaton can we split this to 2 PRs?

Hey @edenlightning, I removed the parameters as you suggested. This PR contains only the fix for tensorboard logging in case of accumulated_gradient > 1

Borda

sure about the APPI, and pls update docs

pytorch_lightning/trainer/connectors/logger_connector/logger_connector.py

SkafteNicki

LGTM

tchaton · 2020-11-25T12:01:30Z

Hey @williamFalcon , can you review this one ?

resolve bug

47efe42

tchaton self-assigned this Nov 18, 2020

tchaton added bug Something isn't working logging Related to the `LoggerConnector` and `log()` labels Nov 18, 2020

tchaton added this to the 1.1 milestone Nov 18, 2020

tchaton added 6 commits November 18, 2020 12:14

Merge branch 'master' into bugfix/4304_tensorboard_accumulated_grad

e961b44

update

dbd740d

Merge branch 'bugfix/4304_tensorboard_accumulated_grad' of https://gi…

f5ef9f5

…thub.com/PyTorchLightning/pytorch-lightning into bugfix/4304_tensorboard_accumulated_grad

update

891d0ed

modify one test

07816d8

Merge branch 'master' into bugfix/4304_tensorboard_accumulated_grad

5030043

tchaton marked this pull request as ready for review November 18, 2020 14:04

tchaton requested review from ananyahjha93, awaelchli, Borda, justusschock, nateraw, SeanNaren, teddykoker and williamFalcon as code owners November 18, 2020 14:04

tchaton added 4 commits November 18, 2020 14:19

Merge branch 'master' into bugfix/4304_tensorboard_accumulated_grad

dfab61a

remove paramters

f571f4d

Merge branch 'bugfix/4304_tensorboard_accumulated_grad' of https://gi…

7d2ea62

…thub.com/PyTorchLightning/pytorch-lightning into bugfix/4304_tensorboard_accumulated_grad

Merge branch 'master' into bugfix/4304_tensorboard_accumulated_grad

24cee2c

Merge branch 'master' into bugfix/4304_tensorboard_accumulated_grad

de12d31

Borda reviewed Nov 18, 2020

View reviewed changes

pytorch_lightning/trainer/connectors/logger_connector/logger_connector.py Outdated Show resolved Hide resolved

pytorch_lightning/trainer/trainer.py Outdated Show resolved Hide resolved

tchaton added 2 commits November 18, 2020 19:30

update on comments

011b65f

Merge branch 'master' into bugfix/4304_tensorboard_accumulated_grad

fdce1a9

tchaton added 2 commits November 19, 2020 08:19

Merge branch 'master' into bugfix/4304_tensorboard_accumulated_grad

c18a070

Merge branch 'master' into bugfix/4304_tensorboard_accumulated_grad

88e1afa

tchaton added 4 commits November 19, 2020 16:54

Merge branch 'master' into bugfix/4304_tensorboard_accumulated_grad

7de5c99

Merge branch 'master' into bugfix/4304_tensorboard_accumulated_grad

df6571b

Merge branch 'master' into bugfix/4304_tensorboard_accumulated_grad

f6d0f0a

Merge branch 'master' into bugfix/4304_tensorboard_accumulated_grad

ef357ca

tchaton modified the milestones: 1.1, 1.0.x Nov 20, 2020

tchaton changed the title ~~Accumulated_gradient and tensorboard~~ [bugfix] Accumulated_gradient and TensoBoard Nov 20, 2020

Borda reviewed Nov 20, 2020

View reviewed changes

pytorch_lightning/trainer/connectors/logger_connector/logger_connector.py Show resolved Hide resolved

pytorch_lightning/trainer/connectors/logger_connector/logger_connector.py Show resolved Hide resolved

tchaton added 3 commits November 23, 2020 08:33

update changelog

b86fa58

Merge branch 'master' into bugfix/4304_tensorboard_accumulated_grad

5898183

Merge branch 'master' into bugfix/4304_tensorboard_accumulated_grad

0dd02d0

SkafteNicki reviewed Nov 23, 2020

View reviewed changes

pytorch_lightning/trainer/connectors/logger_connector/logger_connector.py Show resolved Hide resolved

tchaton added 3 commits November 23, 2020 17:26

Merge branch 'master' into bugfix/4304_tensorboard_accumulated_grad

a28bbec

update docstring

fa3a57b

Merge branch 'master' into bugfix/4304_tensorboard_accumulated_grad

3f7bb6a

justusschock approved these changes Nov 24, 2020

View reviewed changes

SkafteNicki approved these changes Nov 24, 2020

View reviewed changes

Merge branch 'master' into bugfix/4304_tensorboard_accumulated_grad

21fe93a

SeanNaren approved these changes Nov 24, 2020

View reviewed changes

Merge branch 'master' into bugfix/4304_tensorboard_accumulated_grad

d8ac3c5

Merge branch 'master' into bugfix/4304_tensorboard_accumulated_grad

f5cb188

tchaton added the ready PRs ready to be merged label Nov 25, 2020

williamFalcon approved these changes Nov 25, 2020

View reviewed changes

tchaton merged commit 204a0a2 into master Nov 25, 2020

Borda deleted the bugfix/4304_tensorboard_accumulated_grad branch November 29, 2020 22:11

Tomiinek mentioned this pull request Jan 7, 2021

W&B logger not working as expected with accumulate_grad_batches>1 #5405

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[bugfix] Accumulated_gradient and TensoBoard #4738

[bugfix] Accumulated_gradient and TensoBoard #4738

tchaton commented Nov 18, 2020 •

edited

Loading

pep8speaks commented Nov 18, 2020 •

edited

Loading

codecov bot commented Nov 18, 2020 •

edited

Loading

tchaton commented Nov 19, 2020

Borda left a comment

SkafteNicki left a comment

tchaton commented Nov 25, 2020

[bugfix] Accumulated_gradient and TensoBoard #4738

[bugfix] Accumulated_gradient and TensoBoard #4738

Conversation

tchaton commented Nov 18, 2020 • edited Loading

What does this PR do?

Before submitting

PR review

Did you have fun?

pep8speaks commented Nov 18, 2020 • edited Loading

Comment last updated at 2020-11-25 12:01:40 UTC

codecov bot commented Nov 18, 2020 • edited Loading

Codecov Report

tchaton commented Nov 19, 2020

Borda left a comment

Choose a reason for hiding this comment

SkafteNicki left a comment

Choose a reason for hiding this comment

tchaton commented Nov 25, 2020

tchaton commented Nov 18, 2020 •

edited

Loading

pep8speaks commented Nov 18, 2020 •

edited

Loading

codecov bot commented Nov 18, 2020 •

edited

Loading