
Make swa_lrs as required inside SWACallback #11822

Closed
rohitgr7 opened this issue Feb 9, 2022 · 1 comment · Fixed by #12556
Labels: callback: swa, feature
Milestone: 1.7
Comments

rohitgr7 (Contributor) commented on Feb 9, 2022

Proposed Enhancement

Currently when swa_lrs is not set here:
https://github.com/PyTorchLightning/pytorch-lightning/blob/e3820da28a0cd0982dd1c65d7da1a0e2180454c1/pytorch_lightning/callbacks/stochastic_weight_avg.py#L34-L38

we initialize it to the optimizer's learning rates here:
https://github.com/PyTorchLightning/pytorch-lightning/blob/e3820da28a0cd0982dd1c65d7da1a0e2180454c1/pytorch_lightning/callbacks/stochastic_weight_avg.py#L167-L168

but during the SWALR scheduler update the learning rates never actually change, because the annealing factor `alpha` cancels out: SWALR anneals each parameter group's learning rate as `swa_lr * (1 - alpha) + lr * alpha`, which is simply `lr` whenever `swa_lr == lr`.

https://github.com/pytorch/pytorch/blob/bf233aa049c4b479fd6cb19f9b8672bb2d42b0e2/torch/optim/swa_utils.py#L281-L286
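The cancellation is easy to reproduce with plain PyTorch. Here is a minimal sketch (not the Lightning callback code) that seeds `SWALR` with the optimizer's own learning rate, mirroring the current default, and shows that the learning rate never moves:

```python
import torch
from torch.optim.swa_utils import SWALR

model = torch.nn.Linear(4, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

# swa_lr deliberately set equal to the optimizer's lr, as the current default does
swa_scheduler = SWALR(optimizer, swa_lr=0.1, anneal_epochs=5, anneal_strategy="cos")

for epoch in range(5):
    optimizer.step()
    swa_scheduler.step()
    # swa_lr * (1 - alpha) + lr * alpha == lr when swa_lr == lr, so this prints
    # 0.1 at every step regardless of the annealing progress.
    print(epoch, optimizer.param_groups[0]["lr"])
```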

Motivation

If we keep it as it is, it will lead to issues like this: #9453

Pitch

Make `swa_lrs` a required argument of the callback, since it is a required parameter in SWALR as well, and don't initialize it with any default:
https://github.com/pytorch/pytorch/blob/bf233aa049c4b479fd6cb19f9b8672bb2d42b0e2/torch/optim/swa_utils.py#L231
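For illustration, a rough, hypothetical sketch of what the pitched signature could look like (an illustrative stub, not the actual callback, and the real change in #12556 may differ): `swa_lrs` loses its default and is validated eagerly, mirroring SWALR where `swa_lr` is required.

```python
from typing import List, Union


class StochasticWeightAveraging:  # illustrative stub, not the real Lightning callback
    def __init__(
        self,
        swa_lrs: Union[float, List[float]],  # required: no default anymore
        swa_epoch_start: Union[int, float] = 0.8,
        annealing_epochs: int = 10,
        annealing_strategy: str = "cos",
    ) -> None:
        wrong_type = not isinstance(swa_lrs, (float, list))
        wrong_float = isinstance(swa_lrs, float) and swa_lrs <= 0
        wrong_list = isinstance(swa_lrs, list) and not all(
            isinstance(lr, float) and lr > 0 for lr in swa_lrs
        )
        if wrong_type or wrong_float or wrong_list:
            raise ValueError("swa_lrs must be a positive float or a list of positive floats.")
        self._swa_lrs = swa_lrs
```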


cc @Borda @justusschock @awaelchli @akihironitta @rohitgr7 @carmocca

rohitgr7 added the feature, refactor, and callback: swa labels on Feb 9, 2022
rohitgr7 added this to the future milestone on Feb 9, 2022
rohitgr7 removed the refactor label on Feb 9, 2022
rohitgr7 modified the milestones: future → 1.7 on Feb 23, 2022
felipemello1 commented

Hi @rohitgr7, I think the solution could be improved and simplified. As a user, I already have my own learning rate scheduler. I don't want to have to figure out what my learning rate will be at epoch * 0.8 and use that LR as the starting point. Instead, if a scheduler is already configured, just don't override it with the SWA scheduler.
