
Deprecate and remove on_epoch_start/end and on_batch_start/end hooks #10807

Closed
rohitgr7 opened this issue Nov 29, 2021 · 7 comments
Labels
deprecation (Includes a deprecation), hooks (Related to the hooks API)
Milestone

Comments

@rohitgr7
Contributor

rohitgr7 commented Nov 29, 2021

Proposed refactor

I propose to deprecate and remove on_epoch_start/end and on_batch_start/end hooks.

Motivation

  • on_epoch_start/on_epoch_end: These two hooks currently run within each mode (train/val/test). We already have on_train_epoch_start/on_val_epoch_start/on_test_epoch_start. The reason we kept the generic versions was to provide a common hook called in every mode, so that users can configure operations that need to happen in each of them. But that is a fairly specific use case, and users can configure it easily without this hook. It can also be confusing for val/test, because an epoch doesn't mean anything during evaluation. Finally, many other mode-specific hooks don't have a generic version that runs for all modes. For example, there are no separate on_start/on_end hooks or an on_dataloader hook that run alongside on_{train/val/test}_start/end, so why the special treatment here?

  • on_batch_start/on_batch_end: These run alongside on_train_batch_start/on_train_batch_end and provide no additional value, so we should remove them as well. We could make them run within each mode, but then the same points as above apply: do we even need them?
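The per-mode replacement the proposal points to can be sketched as follows (a minimal illustration, not from the issue; the class and attribute names are hypothetical, and in real code the class would subclass pytorch_lightning.LightningModule):

```python
class MyModel:
    # Hypothetical sketch: in real code this would subclass
    # pytorch_lightning.LightningModule. Instead of relying on the
    # generic on_epoch_start, each mode-specific hook delegates to a
    # shared helper, so the common logic is written only once.

    def _reset_epoch_state(self):
        # Hypothetical per-epoch bookkeeping the user wants in every mode.
        self.epoch_outputs = []

    def on_train_epoch_start(self):
        self._reset_epoch_state()

    def on_validation_epoch_start(self):
        self._reset_epoch_state()

    def on_test_epoch_start(self):
        self._reset_epoch_state()
```

This keeps each per-mode hook explicit while sharing the common logic, which is also the pattern suggested later in the thread for the batch hooks.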

History:
All of these hooks seem to have been added at the project's inception; the best I could find is this PR, which is very old.
The behavior that enabled on_epoch_start/end across modes was, I think, discussed and approved over Slack, and I added it 😅 a while back: #6498

Pitch

Simply deprecate and remove.

Additional context


If you enjoy Lightning, check out our other projects! ⚡

  • Metrics: Machine learning metrics for distributed, scalable PyTorch applications.

  • Lite: enables pure PyTorch users to scale their existing code on any kind of device while retaining full control over their own loops and optimization logic.

  • Flash: The fastest way to get a Lightning baseline! A collection of tasks for fast prototyping, baselining, fine-tuning, and solving problems with deep learning.

  • Bolts: Pretrained SOTA Deep Learning models, callbacks, and more for research and production with PyTorch Lightning and PyTorch.

  • Lightning Transformers: Flexible interface for high-performance research using SOTA Transformers, leveraging PyTorch Lightning, Transformers, and Hydra.

cc @tchaton @carmocca @awaelchli @Borda @ninginthecloud @justusschock @akihironitta

@rohitgr7 rohitgr7 added the refactor and hooks (Related to the hooks API) labels Nov 29, 2021
@rohitgr7 rohitgr7 added this to the 1.6 milestone Nov 29, 2021
@carmocca carmocca added the deprecation (Includes a deprecation) label and removed the refactor label Nov 29, 2021
@carmocca
Contributor

Can you explore the depths of our git history and link here the PRs/discussion that added them initially?

@rohitgr7
Contributor Author

@carmocca updated the description with details.

@awaelchli
Contributor

There was also the PR from @williamFalcon that added all the on_train_batch_start etc. hooks (I can't find it). That was the beginning of the overlap between these hooks.
Note that on_batch_start/end were initially training-only and ONLY ran during training. Later we decided to make them run in all modes. I am not a fan of constantly changing our minds and reverting decisions.

An additional argument that can be added to your issue is that there are other mode-specific hooks (actually many) that don't have a special version of a hook that runs for all modes (example: x_dataloader()).

@rohitgr7
Contributor Author

rohitgr7 commented Nov 30, 2021

Note that the on_batch_start/end initially were only for training and they ONLY ran for training. Later we made the decision to make them run generally in all modes.

These still run only in training mode.

An additional argument that can be added to your issue is that there are other mode-specific hooks (actually many) that don't have a special version of a hook that runs for all modes (example: x_dataloader()).

Yep, I mentioned one example above and have updated the description.

@carmocca
Contributor

Later we made the decision to make them run generally in all modes

I also remember this, but the problem is that nobody actually went through and implemented them for all modes.

@rohitgr7
Contributor Author

Yeah, but I still don't think this is necessary.

one can simply do:

def _common_batch_start(self, ...):
    ...

def on_train_batch_start(self, ...):
    self._common_batch_start(...)

def on_val_batch_start(self, ...):
    self._common_batch_start(...)

At least this makes sure the user implements exactly what they need. But if for some reason they miss the docs and implement on_batch_start thinking that it will run only for training, that can lead to issues. Also, if they log inside this hook, we would have to add extra checks to detect the default on_step/on_epoch values within self.log for each mode.
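The "extra checks" concern above can be illustrated with a hedged sketch. The function name and the default values here are assumptions for illustration only, not Lightning's actual implementation; the point is that a generic batch hook would force self.log to dispatch its defaults on the mode internally.

```python
def default_log_flags(hook_name: str) -> dict:
    # Hypothetical illustration of mode-aware self.log defaults.
    # Training batch hooks typically log per step, while evaluation
    # hooks typically aggregate per epoch, so a hook shared across
    # modes would need a mode check like this inside the logger.
    if hook_name.startswith("on_train"):
        return {"on_step": True, "on_epoch": False}
    return {"on_step": False, "on_epoch": True}
```

With mode-specific hooks such as on_train_batch_start, the defaults are unambiguous and no such dispatch is needed.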

@ruro
Contributor

ruro commented Apr 22, 2022

I am sorry, but in my opinion this was the wrong decision. "The user can just implement it themselves" is a bad argument when your library advertises itself as a tool for reducing boilerplate. I know that it's just 9 lines, but in my opinion, it's the thought that counts.

With pytorch-lightning, I often find myself writing code that looks like this:

    def some_method(self, ...):
         return self.some_other_method(...)

And if this isn't boilerplate, then I don't know what is.
