[RFC] Create explicit setup and teardown hooks for each stage on the Lightning and DataModules #6420

ananthsub · 2021-03-08T20:28:01Z

🚀 Feature

LightningModules and DataModules currently support a setup API which takes an optional stage argument.
#6386 addresses some issues in the setup/teardown lifecycle, so I was wondering if we should take this further (#6401)

Motivation

Pros of making the separate hooks for each stage:

Clarity in the API that helps forwards compatibility: In the current scheme, the Lightning trainer can pass an arbitrary value for stage that user code might not handle. With the explicit hooks, new stages becomes opt-in for users, as users must implement the corresponding hook in their lightning/data module
Consistency in the API: this matches the pattern already established for Lightning/data modules which have train/validation/test/predict defined as separate hooks
On the Lightning internals, we can remove the base datamodule wrapper class, and remove the has_setup_{stage} attributes since it'll be obvious when the hooks are called

Cons:

This requires a deprecation process and can cause thrash for users
Users now have to implement more hooks. However, a mitigation is that the refactoring should be straightforward as users can easily share code with a helper function in the lightning/data module.

Pitch

We add the following hooks to the DataHooks base:

on_{stage}_prepare_data
on_{stage}_setup
on_{stage}_teardown

for the existing values of stage: fit, test, validate, predict

Similarly, we add corresponding hooks to the Callback base:

on_{stage}_setup
on_{stage}_teardown

During the migration, in the trainer, if the Lightning(Data)Module has this hook implemented, then we call it. Otherwise, we fallback to calling the existing setup/teardown hooks. We do the same for the callback hooks.

We could set a longer deprecation timeline for this given how prevalent these hooks are. For example, we don't deprecate prepare_data, setup, or teardown until version 1.7+.

Additionally, we should move the trainer argument prepare_data_per_node to the DataHooks base, similar to how automatic_optimization is a property of the LightningModule. This point is separate from the overall hooks discussion and could happen faster to slightly simplify the trainer API.

Alternatives

Keep the existing hooks

Additional context

The text was updated successfully, but these errors were encountered:

carmocca · 2021-03-29T13:55:13Z

One tricky issue about this is that setup is suggested for layer initialization (https://pytorch-lightning.readthedocs.io/en/latest/starter/introduction_guide.html#models-defined-by-data).

So implementing this change would need to consider this case. This also is problematic given the current direction the model parallel hook is taking: #6679 (comment)

cc: @SeanNaren

stale · 2021-04-29T03:17:35Z

This issue has been automatically marked as stale because it hasn't had any recent activity. This issue will be closed in 7 days if no further activity occurs. Thank you for your contributions, Pytorch Lightning Team!

stale · 2021-11-07T09:57:58Z

This issue has been automatically marked as stale because it hasn't had any recent activity. This issue will be closed in 7 days if no further activity occurs. Thank you for your contributions, Pytorch Lightning Team!

ananthsub added feature Is an improvement or enhancement help wanted Open to be worked on design Includes a design discussion and removed help wanted Open to be worked on labels Mar 8, 2021

ananthsub mentioned this issue Mar 22, 2021

Refactor base profilers 3/5 #6621

Merged

11 tasks

ananthsub mentioned this issue Apr 19, 2021

Mechanism to skip certain hooks #5586

Closed

stale bot added the won't fix This will not be worked on label Apr 29, 2021

kaushikb11 removed the won't fix This will not be worked on label Apr 29, 2021

edenlightning added this to the v1.4 milestone May 9, 2021

ananthsub mentioned this issue Jun 24, 2021

[Docs revamp 2/N] New doc for managing data #8034

Merged

edenlightning modified the milestones: v1.4, v1.5 Jul 1, 2021

ananthsub removed this from the v1.5 milestone Oct 7, 2021

ananthsub mentioned this issue Oct 7, 2021

Only invoke setup() once, not in both trainer.fit() and trainer.test() - #2620 follow up #9865

Closed

stale bot added the won't fix This will not be worked on label Nov 7, 2021

stale bot closed this as completed Nov 14, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RFC] Create explicit setup and teardown hooks for each stage on the Lightning and DataModules #6420

[RFC] Create explicit setup and teardown hooks for each stage on the Lightning and DataModules #6420

ananthsub commented Mar 8, 2021 •

edited

Loading

carmocca commented Mar 29, 2021 •

edited

Loading

stale bot commented Apr 29, 2021

stale bot commented Nov 7, 2021

[RFC] Create explicit setup and teardown hooks for each stage on the Lightning and DataModules #6420

[RFC] Create explicit setup and teardown hooks for each stage on the Lightning and DataModules #6420

Comments

ananthsub commented Mar 8, 2021 • edited Loading

🚀 Feature

Motivation

Pitch

Alternatives

Additional context

carmocca commented Mar 29, 2021 • edited Loading

stale bot commented Apr 29, 2021

stale bot commented Nov 7, 2021

ananthsub commented Mar 8, 2021 •

edited

Loading

carmocca commented Mar 29, 2021 •

edited

Loading