
[Prototype] 4855 compose with lazy resampling #4911

Closed
wants to merge 17 commits

Conversation

wyli
Contributor

@wyli wyli commented Aug 15, 2022

Fixes #4855

Description

The goal of this PR is to add a `lazy_resample: bool` option to `Compose`, so
that consecutive spatial transforms can be applied with a single image resampling step.
This will have several benefits:

  • reduce the memory footprint
  • speed up the compose transform
  • reduce interpolation errors

The implementation mainly consists of three types of non-breaking API modifications:

  • ensure all spatial transforms have an option to skip the array resampling step while still performing the metadata update step; this is achieved with a new base class (`LazyTransform`, with a class variable `eager_mode: bool`).
  • give `MetaTensor` a mechanism to track lazily applied changes; this is achieved by adding a `MetaTensor.evaluated` flag and a `spatial_shape` property to track the latest expected output spatial shape.
  • enable the `Compose` transform's `__call__` to determine when to evaluate the fused transforms eagerly within a sequence of transforms.
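The three modifications above can be sketched in plain Python/numpy. Note that the names `PendingTensor`, `LazyFlip`, and `evaluate` are illustrative only, not MONAI's actual API: a lazy transform records its pending affine and clears the `evaluated` flag instead of resampling, and a single fused resample happens at evaluation time.

```python
import numpy as np

class PendingTensor:
    """Toy stand-in for a MetaTensor that tracks lazily applied changes."""
    def __init__(self, array):
        self.array = np.asarray(array)
        self.pending_affines = []              # affines recorded but not yet applied
        self.spatial_shape = self.array.shape  # latest *expected* output shape
        self.evaluated = True                  # False while transforms are pending

class LazyFlip:
    """Toy lazy transform: records a flip as an affine instead of resampling."""
    eager_mode = False  # when True, behave like a regular (eager) transform

    def __call__(self, img):
        flip = np.diag([-1.0, 1.0])  # 1-D flip affine in homogeneous coordinates
        if self.eager_mode:
            img.array = img.array[::-1].copy()
            return img
        img.pending_affines.append(flip)       # metadata updated, array untouched
        img.evaluated = False
        return img

def evaluate(img):
    """Fuse all pending affines and resample once (here: just apply flips)."""
    fused = np.eye(2)
    for m in img.pending_affines:
        fused = m @ fused
    if fused[0, 0] < 0:                        # net effect is a single flip
        img.array = img.array[::-1].copy()
    img.pending_affines.clear()
    img.evaluated = True
    return img
```

With this sketch, two consecutive lazy flips fuse to the identity, so the final evaluation touches the array zero times instead of twice; this is the source of both the speed-up and the reduced interpolation error.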

Some examples of the improved composed-transform quality, based on this sequence:

```py
xforms = [
    mt.LoadImageD(keys, ensure_channel_first=True),
    mt.Orientationd(keys, "RAS"),
    mt.SpacingD(keys, (1.5, 1.5, 1.5)),
    mt.CenterScaleCropD(keys, roi_scale=0.9),
    # mt.CropForegroundD(keys, source_key="seg", k_divisible=5),
    mt.RandRotateD(keys, prob=1.0, range_y=np.pi / 2, range_x=np.pi / 3),
    mt.RandSpatialCropD(keys, roi_size=(76, 87, 73)),
    mt.RandScaleCropD(keys, roi_scale=0.9),
    mt.Resized(keys, (30, 40, 60)),
    # mt.NormalizeIntensityd(keys),
    mt.ZoomD(keys, 1.3, keep_size=False),
    mt.FlipD(keys),
    mt.Rotate90D(keys),
    mt.RandAffined(keys),
    mt.ResizeWithPadOrCropd(keys, spatial_size=(32, 43, 54)),
    mt.DivisiblePadD(keys, k=3),
]
xform = mt.Compose(xforms, lazy_resample=True, mode='nearest')
xform.set_random_state(0)
```

Before this PR (`xform = mt.Compose(xforms)`):

Screenshot 2022-08-15 at 11 33 06

Screenshot 2022-08-15 at 11 34 08

After this PR (`xform = mt.Compose(xforms, lazy_resample=True, mode='nearest')`):

Screenshot 2022-08-15 at 11 34 30

Screenshot 2022-08-15 at 11 34 42

After this PR (`xform = mt.Compose(xforms, lazy_resample=True, mode=3, padding_mode="reflect")`):
Screenshot 2022-08-19 at 14 42 45
Screenshot 2022-08-19 at 14 42 35

TODOs:

  • unit tests
  • skip resampling when the affine transform is close to the identity
  • lazy resample in the inverse
  • discuss and improve the API design
  • compatibility with the cache-based datasets
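For the "skip resampling when the affine transform is close to the identity" item, one possible check (a sketch with hypothetical helpers, not code from this PR) is a tolerance comparison against the identity matrix:

```python
import numpy as np

def is_near_identity(affine, atol=1e-4):
    """True when the homogeneous affine is close enough to the identity
    that resampling would only introduce interpolation error."""
    affine = np.asarray(affine, dtype=np.float64)
    return np.allclose(affine, np.eye(affine.shape[0]), atol=atol)

def maybe_resample(img, affine, resample_fn):
    # skip the expensive (and lossy) resampling for a no-op affine
    if is_near_identity(affine):
        return img
    return resample_fn(img, affine)
```

The tolerance is a design choice: too tight and harmless rounding noise still triggers a resample; too loose and genuine sub-voxel adjustments are silently dropped.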

Status

Hold

Types of changes

  • Non-breaking change (fix or new feature that would not break existing functionality).
  • Breaking change (fix or new feature that would cause existing functionality to change).
  • New tests added to cover the changes.
  • Integration tests passed locally by running `./runtests.sh -f -u --net --coverage`.
  • Quick tests passed locally by running `./runtests.sh --quick --unittests --disttests`.
  • In-line docstrings updated.
  • Documentation updated, tested `make html` command in the `docs/` folder.

Signed-off-by: Wenqi Li <[email protected]>
@atbenmurray
Contributor

atbenmurray commented Aug 15, 2022

Initial thoughts:

  • It certainly cuts out quite a lot of the reworking that I thought would be necessary.
  • The way you use the MetaTensor to do the heavy lifting for the feature certainly seems to work well.
  • I think that my points on Friday about Compose and the interaction between the various things that might want to change the effective pipeline still stand.
  • I think it is still worth looking at refactoring the transforms so that they give people a simple way to write new transforms on the same principle.
  • Do you intend for this to be possible if MetaTensors aren't used, or do we require MetaTensor as a prerequisite?

Signed-off-by: Wenqi Li <[email protected]>
@wyli wyli force-pushed the compose-resampling branch from 4bb8b5d to 7c6f714 Compare August 15, 2022 12:55
@atbenmurray atbenmurray mentioned this pull request Aug 17, 2022
18 tasks
@Nic-Ma
Contributor

Nic-Ma commented Aug 18, 2022

Hi @atbenmurray ,

I think we are all aligned that, eventually, we will use MetaTensor as the unique data type; it can simplify the codebase and provide flexible information for future features.
CC @ericspod @wyli please feel free to comment if you have different thinking.

Thanks.

@wyli wyli force-pushed the compose-resampling branch from 6e70166 to b89e989 Compare August 19, 2022 10:25
Signed-off-by: Wenqi Li <[email protected]>
@atbenmurray
Contributor

> Hi @atbenmurray ,
>
> I think we are all aligned that, eventually, we will use MetaTensor as the unique data type; it can simplify the codebase and provide flexible information for future features. CC @ericspod @wyli please feel free to comment if you have different thoughts.
>
> Thanks.

Hi Nic. We could indeed assume MetaTensor for everything, and that makes life a lot simpler internally. The main issue I see with that is that people who want to write transforms have to retrofit their code with MetaTensors. We always stated that we wanted MONAI to be as close to vanilla PyTorch as possible and that extensions were additive. I think we have mechanisms that allow us to handle pending transforms independently of MetaTensors (they are WIP in #4922, in atmostonce/functional.py and atmostonce/array.py). Can we have that as a subtopic of the Friday meeting?

@atbenmurray
Contributor

I see two major points to address, as discussed Wednesday:

Pending transform representation
As it stands, the MetaTensor has an `evaluated` flag. That flag indicates whether the transforms are pending or historical. If you have lazy transforms followed by a non-lazy transform followed by more lazy transforms:

La -> Lb -> Nc -> Ld -> Le

the lazy transforms La and Lb will be applied twice. Applying La and Lb at the point we reach the non-lazy transform Nc flips the flag to `evaluated = True`. Upon reaching Ld, the flag is flipped back to `False`. At the end of Le, a final apply must be done, but the transform history of the tensor is still La, Lb, Nc, Ld (we can argue about Nc, but there are scenarios in which it does nothing to the MetaTensor, as it is not aware of those capabilities). A pending list, such as the one added to MetaTensor in #4922, distinguishes between applied transforms and pending transforms. All pending transforms become applied transforms on the MetaTensor once applied. There are other ways, but this is a simple and (IMHO) clean approach.
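The pending-list bookkeeping can be illustrated with a minimal sketch (illustrative names, not the actual #4922 implementation): pending transforms accumulate until an apply, then move into the applied history exactly once, so La and Lb are never re-applied.

```python
class TrackedTensor:
    """Minimal pending/applied bookkeeping in the spirit of a pending list."""
    def __init__(self):
        self.applied = []   # transforms that have actually touched the data
        self.pending = []   # transforms recorded but not yet executed

    def push_lazy(self, name):
        self.pending.append(name)

    def apply_pending(self):
        # fuse and execute all pending transforms once, then promote
        # them to the applied history so they cannot run a second time
        self.applied.extend(self.pending)
        self.pending.clear()

t = TrackedTensor()
t.push_lazy("La"); t.push_lazy("Lb")
t.apply_pending()            # the non-lazy Nc forces evaluation here
t.applied.append("Nc")       # Nc executes eagerly and is recorded as applied
t.push_lazy("Ld"); t.push_lazy("Le")
t.apply_pending()            # final apply at the end of the pipeline
```

After the run, the applied history is La, Lb, Nc, Ld, Le and the pending list is empty: the single boolean flag's ambiguity about which transforms still need executing never arises.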

@atbenmurray atbenmurray mentioned this pull request Oct 3, 2022
33 tasks
@myron
Collaborator

myron commented Oct 26, 2022

> Hi Nic. We could indeed assume MetaTensor for everything, and that makes life a lot simpler internally. The main issue I see with that is that people who want to write transforms have to retrofit their code with MetaTensors. We always stated that we wanted MONAI to be as close to vanilla PyTorch as possible and that extensions were additive. I think we have mechanisms that allow us to handle pending transforms independently of MetaTensors (they are WIP in #4922, in atmostonce/functional.py and atmostonce/array.py). Can we have that as a subtopic of the Friday meeting?

@atbenmurray I agree with your comment completely. I also think we need to be close to vanilla PyTorch - as close as possible.

MetaTensor already deviates from PyTorch (by creating a custom subclass of torch.Tensor), which introduces a high learning curve for any new user of MONAI (to understand or modify it), and may lead to bugs such as #5283.

By putting more things in MetaTensor, it will deviate from PyTorch even further. We should consider alternative ways to compose spatial transforms, e.g. by having a class to fuse the transforms (one that composes spatial grids, keeps track of intermediate state, and does interpolation only once at the end); see here #4855 (comment)
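The fusing idea amounts to composing the homogeneous affine matrices first and interpolating only once at the end. A minimal numpy sketch (`fuse_affines` is a hypothetical helper, not an existing MONAI class):

```python
import numpy as np

def fuse_affines(affines):
    """Compose a sequence of homogeneous affines into a single matrix.
    Matrices are listed in application order, so the last one ends up leftmost."""
    fused = np.eye(affines[0].shape[0])
    for m in affines:
        fused = m @ fused
    return fused

# Applying the fused matrix to a point equals applying each step in turn,
# so image data only needs to be interpolated once against `fused`.
scale = np.diag([2.0, 2.0, 1.0])                 # 2-D scale (homogeneous)
shift = np.array([[1.0, 0.0, 3.0],
                  [0.0, 1.0, -1.0],
                  [0.0, 0.0, 1.0]])              # 2-D translation
fused = fuse_affines([scale, shift])

p = np.array([1.0, 2.0, 1.0])                    # point (1, 2)
step_by_step = shift @ (scale @ p)
at_once = fused @ p
```

Because matrix composition is exact, the only interpolation error incurred is the single resample at the end, regardless of how many spatial transforms were fused.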

@atbenmurray
Contributor

atbenmurray commented Oct 28, 2022

> Hi Nic. We could indeed assume MetaTensor for everything, and that makes life a lot simpler internally. The main issue I see with that is that people who want to write transforms have to retrofit their code with MetaTensors. We always stated that we wanted MONAI to be as close to vanilla PyTorch as possible and that extensions were additive. I think we have mechanisms that allow us to handle pending transforms independently of MetaTensors (they are WIP in #4922, in atmostonce/functional.py and atmostonce/array.py). Can we have that as a subtopic of the Friday meeting?
>
> @atbenmurray I agree with your comment completely. I also think we need to be close to vanilla PyTorch - as close as possible.
>
> MetaTensor already deviates from PyTorch (by creating a custom subclass of torch.Tensor), which introduces a high learning curve for any new user of MONAI (to understand or modify it), and may lead to bugs such as #5283.
>
> By putting more things in MetaTensor, it will deviate from PyTorch even further. We should consider alternative ways to compose spatial transforms, e.g. by having a class to fuse the transforms (one that composes spatial grids, keeps track of intermediate state, and does interpolation only once at the end); see here #4855 (comment)

At the moment, the implementation of #5420 containing the apply method that performs the accumulation of transforms and lazy resampling can work with MetaTensors, or with standard tensors plus separate transform descriptions. However, given that transforms always convert to MetaTensor, other work would have to happen to support the use of plain tensors in preprocessing once more, and that would go back on the decision to use MetaTensor exclusively.

@wyli wyli mentioned this pull request Jan 17, 2023
7 tasks
@wyli
Contributor Author

wyli commented Jan 17, 2023

#5860 has a more recent implementation

@wyli wyli closed this Jan 17, 2023
@wyli wyli deleted the compose-resampling branch February 8, 2023 12:15
wyli added a commit that referenced this pull request Mar 23, 2023
part of #4855

upgrade #4911 to use the latest dev API

### Description
Example usage:

For a sequence of spatial transforms:

```py
xforms = [
    mt.LoadImageD(keys, ensure_channel_first=True),
    mt.Orientationd(keys, "RAS"),
    mt.SpacingD(keys, (1.5, 1.5, 1.5)),
    mt.CenterScaleCropD(keys, roi_scale=0.9),
    # mt.CropForegroundD(keys, source_key="seg", k_divisible=5),
    mt.RandRotateD(keys, prob=1.0, range_y=np.pi / 2, range_x=np.pi / 3),
    mt.RandSpatialCropD(keys, roi_size=(76, 87, 73)),
    mt.RandScaleCropD(keys, roi_scale=0.9),
    mt.Resized(keys, (30, 40, 60)),
    # mt.NormalizeIntensityd(keys),
    mt.ZoomD(keys, 1.3, keep_size=False),
    mt.FlipD(keys),
    mt.Rotate90D(keys),
    mt.RandAffined(keys),
    mt.ResizeWithPadOrCropd(keys, spatial_size=(32, 43, 54)),
    mt.DivisiblePadD(keys, k=3),
]
lazy_kwargs = dict(mode=("bilinear", 0), padding_mode=("border", "nearest"), dtype=(torch.float32, torch.uint8))
xform = mt.Compose(xforms, lazy_evaluation=True, overrides=lazy_kwargs, override_keys=keys)
xform.set_random_state(0)
```
`lazy_evaluation=True` preserves more details:
![Screenshot 2023-01-17 at 00 31 40](https://user-images.githubusercontent.com/831580/212784981-ea39833b-54ab-42fb-bc03-38b012281857.png)
compared with the regular compose:
![Screenshot 2023-01-17 at 00 31 43](https://user-images.githubusercontent.com/831580/212785016-ba3be8ff-f17f-47b4-8025-cd351a637a82.png)



### Types of changes

- [x] Non-breaking change (fix or new feature that would not break existing functionality).
- [ ] Breaking change (fix or new feature that would cause existing functionality to change).
- [ ] New tests added to cover the changes.
- [ ] Integration tests passed locally by running `./runtests.sh -f -u --net --coverage`.
- [ ] Quick tests passed locally by running `./runtests.sh --quick --unittests --disttests`.
- [ ] In-line docstrings updated.
- [ ] Documentation updated, tested `make html` command in the `docs/` folder.

---------

Signed-off-by: Wenqi Li <[email protected]>
Signed-off-by: Yiheng Wang <[email protected]>
Signed-off-by: KumoLiu <[email protected]>
Signed-off-by: Ben Murray <[email protected]>
Co-authored-by: Ben Murray <[email protected]>
Co-authored-by: binliu <[email protected]>
Co-authored-by: Yiheng Wang <[email protected]>
Co-authored-by: YunLiu <[email protected]>
Co-authored-by: Yiheng Wang <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: KumoLiu <[email protected]>
jak0bw pushed a commit to jak0bw/MONAI that referenced this pull request Mar 28, 2023
jak0bw pushed a commit to jak0bw/MONAI that referenced this pull request Mar 28, 2023
Development

Successfully merging this pull request may close these issues.

Lazy Resampling
4 participants