Conversation
I think this design is too heavy. I don't like putting a core functionality like `loss.backward()` into a callback. It makes it too hard to see what's going on in the trainer. Instead, can we use the normal `TrainerCallback`, and give it some extra methods, like `pre_backward()` and `post_backward()`? Can you solve your problem with that?
I can't do adversarial training without …

You could leave one …
@dirkgr I have made the revisions you suggested :)
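For reference, a minimal sketch of the hook design suggested above: a callback exposing `pre_backward()`/`post_backward()` methods, with the trainer itself still owning the `loss.backward()` call. The signatures and the trainer-side helper are assumptions for illustration, not AllenNLP's actual API.

```python
from typing import Iterable

import torch


class BackwardHooksCallback:
    """Illustrative only: a callback with hooks around backpropagation."""

    def pre_backward(self, trainer, loss: torch.Tensor) -> None:
        """Called just before loss.backward(), e.g. to inspect or scale the loss."""

    def post_backward(self, trainer, loss: torch.Tensor) -> None:
        """Called just after loss.backward(), e.g. to manipulate gradients."""


def backward_with_hooks(
    trainer, loss: torch.Tensor, callbacks: Iterable[BackwardHooksCallback]
) -> None:
    # The trainer keeps ownership of the actual backward call, so it stays
    # visible in the training loop; callbacks only act around it.
    for callback in callbacks:
        callback.pre_backward(trainer, loss)
    loss.backward()
    for callback in callbacks:
        callback.post_backward(trainer, loss)
```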
Other than the changelog entry, this is great!
CHANGELOG.md
Outdated
@@ -7,6 +7,10 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0

## Unreleased

### Added

- Added `BackwardCallback`, a training callback which allows for control over backpropagation and gradient manipulation.
This comment isn't accurate anymore, is it?
if not backward_called:
    trainer._scaler.scale(loss).backward()  # type: ignore
    return True
return False
Is it an error if this gets called with `backward_called == True`? Should we throw an exception in that case?
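One way to make that case fail loudly instead of being silently skipped is sketched below. The `OnBackwardException` name appears in the merged commit message further down; the helper function is purely illustrative.

```python
import torch


class OnBackwardException(Exception):
    """Raised when backpropagation has already been run for the current batch."""


def run_backward(loss: torch.Tensor, backward_called: bool) -> bool:
    # Raise rather than return False when backward() was already called,
    # so conflicting callbacks surface as an error instead of a silent no-op.
    if backward_called:
        raise OnBackwardException("loss.backward() was already called for this batch.")
    loss.backward()
    return True
```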
tests/training/trainer_test.py
Outdated
if not backward_called:
    loss.backward()
    for param in trainer.model.parameters():
        param.grad *= 0.0
Is that really the best way to do that?
Suggested change: replace `param.grad *= 0.0` with `param.zero_()`.
I don't know for sure, but I would guess that `zero_()` is faster.
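For reference, a minimal sketch of in-place zeroing, assuming the intent of the test is to zero the gradients: in PyTorch, `param.zero_()` would zero the parameter tensor itself, so the in-place call goes on `param.grad`. The helper name is made up for illustration.

```python
import torch


def zero_gradients(model: torch.nn.Module) -> None:
    """Zero every parameter gradient in place."""
    for param in model.parameters():
        if param.grad is not None:
            # zero_() mutates the existing gradient tensor and avoids the
            # extra elementwise multiply that `param.grad *= 0.0` performs.
            param.grad.zero_()
```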
on_backward trainer callback
This is great! Will make a great update for tomorrow's meeting, too!
* added BackwardCallback
* finished tests
* fixed linting issue
* revised design per Dirk's suggestion
* added OnBackwardException, changed loss to batch_outputs, etc.

Co-authored-by: Arjun Subramonian <[email protected]>
Additions proposed in this pull request:

- `on_backward` training callback, which allows for control over backpropagation and gradient manipulation.
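As a usage illustration, a callback that takes over backpropagation might look roughly like the sketch below. The `on_backward` hook comes from this PR; its exact signature (`trainer`, `batch_outputs`, `backward_called`) is inferred from the snippets and commit message above and should be treated as an assumption rather than the final API.

```python
from typing import Any, Dict

import torch

from allennlp.training.callbacks import TrainerCallback


class ScaledBackwardCallback(TrainerCallback):
    """Illustrative only: runs backward itself, then rescales the gradients."""

    def on_backward(
        self,
        trainer,
        batch_outputs: Dict[str, torch.Tensor],
        backward_called: bool,
        **kwargs: Any,
    ) -> bool:
        if backward_called:
            # Another callback already handled backpropagation for this batch.
            return False
        batch_outputs["loss"].backward()
        # Arbitrary gradient manipulation, e.g. as a building block for
        # adversarial training.
        for param in trainer.model.parameters():
            if param.grad is not None:
                param.grad *= 0.5
        return True  # tell the trainer that backward() has already run
```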