Tensorboard fails to log found lr when auto_lr_find is enabled. #3219

LeeJZh · 2020-08-27T07:26:13Z

🐛 Bug

To Reproduce

Steps to reproduce the behavior:

run job with --auto_lr_find enabled
note the found lr
open tensorboard hparams tab
note the logged lr
they are different, the logged lr is the default lr mannully assigned

Code sample

Expected behavior

the logged lr is the found lr

Environment

Please copy and paste the output from our
environment collection script
(or fill out the checklist below manually).

You can get the script and run it with:

wget https://raw.githubusercontent.com/PyTorchLightning/pytorch-lightning/master/tests/collect_env_details.py
# For security purposes, please check the contents of collect_env_details.py before running it.
python collect_env_details.py

PyTorch Version (e.g., 1.0): 1.6.0
OS (e.g., Linux): ubuntu 16.04
How you installed PyTorch (conda, pip, source): pip
Build command you used (if compiling from source):
Python version: 3.7.7
CUDA/cuDNN version: 10.2
GPU models and configuration: V100
Any other relevant information:

Additional context

The text was updated successfully, but these errors were encountered:

awaelchli · 2020-08-27T10:16:06Z

Hi, thanks for reporting this. I noticed you did not specify the PL version. Could you check that you are on 0.9, because we recently fixed a bug regarding the lr_find setting the learning rate attribute on hparams. #2821

LeeJZh · 2020-08-28T03:56:07Z

Hi, thanks for reporting this. I noticed you did not specify the PL version. Could you check that you are on 0.9, because we recently fixed a bug regarding the lr_find setting the learning rate attribute on hparams. #2821

yes pl version 0.9

ddrevicky · 2020-09-25T16:09:29Z

I've looked at this and this actually has nothing to do with TensorBoard or any other logger. PR #3293 added a tune() method which extracted the learning rate finder out of fit(). Looking at it, it seems that William intended it to be used separately from fit().

Anyway, learning rate finder is not called in fit() at all so the learning rate the user sets is used (and logged by TensorBoard). So learning rate finder doesn't work now with fit() and also auto_scale_batch_size does not since it was also extracted to the tune() method.

vedal · 2021-04-16T21:21:06Z

I had to both set Trainer(..., auto_lr_find=True) and call trainer.tune(model, datamodule=datamodule) explicitly to make this work before calling trainer.fit() in version 1.2.8

LeeJZh added bug Something isn't working help wanted Open to be worked on labels Aug 27, 2020

awaelchli added the information needed label Aug 27, 2020

awaelchli self-assigned this Aug 27, 2020

edenlightning added this to the 0.9.x milestone Sep 1, 2020

Borda added the good first issue Good for newcomers label Sep 15, 2020

edenlightning removed the information needed label Sep 22, 2020

edenlightning unassigned awaelchli Sep 22, 2020

edenlightning added v1.0 allowed labels Sep 22, 2020

edenlightning added docs Documentation related and removed bug Something isn't working good first issue Good for newcomers Hacktoberfest labels Oct 2, 2020

edenlightning modified the milestones: 0.9.x, 1.0 Oct 4, 2020

edenlightning closed this as completed Oct 4, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tensorboard fails to log found lr when auto_lr_find is enabled. #3219

Tensorboard fails to log found lr when auto_lr_find is enabled. #3219

LeeJZh commented Aug 27, 2020

awaelchli commented Aug 27, 2020

LeeJZh commented Aug 28, 2020

ddrevicky commented Sep 25, 2020

vedal commented Apr 16, 2021 •

edited

Loading

Tensorboard fails to log found lr when auto_lr_find is enabled. #3219

Tensorboard fails to log found lr when auto_lr_find is enabled. #3219

Comments

LeeJZh commented Aug 27, 2020

🐛 Bug

To Reproduce

Code sample

Expected behavior

Environment

Additional context

awaelchli commented Aug 27, 2020

LeeJZh commented Aug 28, 2020

ddrevicky commented Sep 25, 2020

vedal commented Apr 16, 2021 • edited Loading

vedal commented Apr 16, 2021 •

edited

Loading