There should be a warning or an exception if using `amp_backend="native"` and setting an `amp_level` (it doesn't use it) #9739

JulesGM · 2021-09-28T19:03:54Z

🚀 Feature

There should be a warning or an exception if using Trainer(amp_backend="native") and setting a value to the amp_level argument (of Trainer).

Motivation

It's not obvious for someone who hasn't looked into the code or who knows a lot about Apex that using Pytorch's native amp backend that the amp_level argument will have no effect, giving a false understanding of what is actually happening. Knowing how much of the model was made fp16 is really important to understand what is happening in the training, to make training reproducible.

With amp_type="apex" (amp_level used):
https://github.com/PyTorchLightning/pytorch-lightning/blob/master/pytorch_lightning/trainer/connectors/accelerator_connector.py#L586
With amp_type="native" (not used):
https://github.com/PyTorchLightning/pytorch-lightning/blob/master/pytorch_lightning/trainer/connectors/accelerator_connector.py#L573

Pitch

There should be an exception or a warning added, maybe around https://github.com/PyTorchLightning/pytorch-lightning/blob/master/pytorch_lightning/trainer/connectors/accelerator_connector.py#L573, when Trainer(amp_level=...) is set and Trainer(amp_backend="native"). Again, Knowing how much of the model was made fp16 is really important to understand what is happening in the training, to make training reproducible.

Alternatives

Rename amp_level to apex_level to make it more obvious that it only works with the apex backend. Breaks backwards compatibility.

The text was updated successfully, but these errors were encountered:

JulesGM · 2021-09-28T19:06:38Z

To keep the default amp_level behavior, the default of amp_level could be None in the constructor and set to O2 after amp_backend has been verified to be apex. Afaik this is a pretty standard way of doing things.

JulesGM added the feature Is an improvement or enhancement label Sep 28, 2021

JulesGM changed the title ~~There should be a warning or an exception if using the native amp backend and setting an amp_level (it doesn't use it)~~ There should be a warning or an exception if using the amp_backend="native" and setting an amp_level (it doesn't use it) Sep 28, 2021

JulesGM changed the title ~~There should be a warning or an exception if using the amp_backend="native" and setting an amp_level (it doesn't use it)~~ There should be a warning or an exception if using amp_backend="native" and setting an amp_level (it doesn't use it) Sep 28, 2021

rohitgr7 mentioned this issue Sep 29, 2021

Raise an exception if using amp_level with native amp_backend #9755

Merged

12 tasks

rohitgr7 closed this as completed in #9755 Oct 1, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

There should be a warning or an exception if using `amp_backend="native"` and setting an `amp_level` (it doesn't use it) #9739

There should be a warning or an exception if using `amp_backend="native"` and setting an `amp_level` (it doesn't use it) #9739

JulesGM commented Sep 28, 2021

JulesGM commented Sep 28, 2021

There should be a warning or an exception if using amp_backend="native" and setting an amp_level (it doesn't use it) #9739

There should be a warning or an exception if using amp_backend="native" and setting an amp_level (it doesn't use it) #9739

Comments

JulesGM commented Sep 28, 2021

🚀 Feature

Motivation

Pitch

Alternatives

JulesGM commented Sep 28, 2021

There should be a warning or an exception if using `amp_backend="native"` and setting an `amp_level` (it doesn't use it) #9739

There should be a warning or an exception if using `amp_backend="native"` and setting an `amp_level` (it doesn't use it) #9739