
Remove addressed workaround in ResizeV2 #7606

Merged — 3 commits merged into pytorch:main on May 22, 2023

Conversation

NicolasHug (Member) commented May 19, 2023:

This PR removes a workaround that is no longer needed: the original problem was fixed in torch core in pytorch/pytorch#101136.

I can confirm that the same stress-tests from #7557 (review) still pass properly (and that those tests were hitting the workaround code).

cc @vfdev-5

@NicolasHug NicolasHug requested a review from vfdev-5 May 19, 2023 13:11
pytorch-bot bot commented May 19, 2023:

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/vision/7606

Note: Links to docs will display an error until the docs builds have been completed.

❌ 26 New Failures as of commit 017cdfe

This comment was automatically generated by Dr. CI and updates every 15 minutes.

  -elif interpolation == InterpolationMode.BILINEAR and image.device.type == "cpu":
  +elif (
  +    interpolation == InterpolationMode.BILINEAR
  +    and image.is_cpu
NicolasHug (Member, author) commented:

Just a nit from me; I thought it was simpler than image.device.type == "cpu", but I can put it back.

pmeier (Collaborator) commented:

I can attest that this actually works, but

  1. I've never seen it used anywhere.
  2. It is undocumented. That looks like a bug though, since is_cuda and is_meta are documented.

Thus, I would prefer to leave it as is, but no strong opinion. How did you learn about it?

NicolasHug (Member, author) commented:

  torchvision/transforms/v2/functional/_geometry.py:195: error: "Tensor" has no
  attribute "is_cpu"  [attr-defined]
                  and image.is_cpu
                      ^~~~~~~~~~~~
  Found 1 error in 1 file (checked 236 source files)
  Error: Process completed with exit code 1.

bwhahahaha.
Anyway.

> How did you learn about it?

I was reminded that is_cuda exists, so I guessed that is_cpu should exist as well. It's used in the torch core code-base, but sparsely. I'll revert anyway to avoid fighting mypy.
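As a side note, the two spellings are equivalent at runtime; a minimal sketch in plain torch (nothing torchvision-specific):

```python
import torch

# Minimal sketch: Tensor.is_cpu and device.type == "cpu" agree.
# (is_cpu exists at runtime but, at the time of this PR, was missing
# from the type stubs, which is why mypy rejected it.)
t = torch.empty(3)

print(t.device.type == "cpu")  # True for a default CPU tensor
print(t.is_cpu)                # same answer, shorter spelling
```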

NicolasHug (Member, author) commented:

I wonder whether this is actually what we want.

We're going from this on main:

transform               median
--------------------  --------
PILToTensor                292
RandomResizedCrop          432
RandomHorizontalFlip        82
ConvertDtype                84
Normalize                  595
--------------------  --------
Total                     1472

To this on this PR:

transform               median
--------------------  --------
PILToTensor                270
RandomResizedCrop          534
RandomHorizontalFlip        67
ConvertDtype                77
Normalize                  137
--------------------  --------
Total                     1090

RandomResizedCrop being slightly slower and Normalize being significantly faster shows that the output is allocated as CF now.

I find that a bit surprising because, from offline discussions with @vfdev-5, I thought the output would be preserved as CL. Isn't that what test_memory_format_consistency_resize_image_tensor() is supposed to enforce as well?
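For reference, the memory-format behaviour under discussion can be probed directly with plain torch. This is a sketch, not the torchvision code path, and the sizes are made up; for a genuinely batched (N > 1) channels_last input the format is preserved, while the ambiguous N == 1 "fake batch" case is what the removed workaround was papering over:

```python
import torch
import torch.nn.functional as F

# Sketch: probe whether interpolate() keeps the input's memory format.
x = torch.rand(2, 3, 32, 32).to(memory_format=torch.channels_last)
y = F.interpolate(x, size=(16, 16), mode="bilinear", antialias=True)

# For N > 1 the channels_last layout survives the resize.
print(y.is_contiguous(memory_format=torch.channels_last))
```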

pmeier (Collaborator) approved these changes:

Thanks Nicolas!


  # uint8 dtype support for bilinear mode is limited to cpu and
  # according to our benchmarks non-AVX CPUs should prefer u8->f32->interpolate->u8 path
  if "AVX2" in torch.backends.cpu.get_cpu_capability():
pmeier (Collaborator) commented:

This could have been in the original elif already, or am I missing something?

NicolasHug (Member, author) commented May 22, 2023:

Yes, there's nothing before or after that block, so it's logically the same.

vfdev-5 (Collaborator) commented May 22, 2023:

For visibility, here is what happens in this PR:

  • PIL to Tensor returns a CL-like 3D tensor.
  • RandomCrop makes it non-contiguous CL.
  • In Resize we have:

        strides = image.stride()
        if image.is_contiguous(memory_format=torch.channels_last) and image.shape[0] == 1 and numel != strides[0]:
            # There is a weird behaviour in torch core where the output tensor of `interpolate()` can be allocated as
            # contiguous even though the input is un-ambiguously channels_last (https://github.com/pytorch/pytorch/issues/68430).
            # In particular this happens for the typical torchvision use-case of single CHW images where we fake the batch dim
            # to become 1CHW. Below, we restride those tensors to trick torch core into properly allocating the output as
            # channels_last, thus preserving the memory format of the input. This is not just for format consistency:
            # for uint8 bilinear images, this also avoids an extra copy (re-packing) of the output and saves time.
            # TODO: when https://github.com/pytorch/pytorch/issues/68430 is fixed (possibly by https://github.com/pytorch/pytorch/pull/100373),
            # we should be able to remove this hack.
            new_strides = list(strides)
            new_strides[0] = numel
            image = image.as_strided((1, num_channels, old_height, old_width), new_strides)

    which should hint torch core into allocating the output with the same memory format. This does not work here: because of the slicing, the input fails the image.is_contiguous(memory_format=torch.channels_last) check, so the output is CF. And since the output is CF, the upsample AVX code copies the input into CL format (https://github.com/pytorch/pytorch/blob/4f2c007a1b5170c2aa0d47e388ff9e07c7a7d354/aten/src/ATen/native/cpu/UpSampleKernelAVXAntialias.h#L323-L331), so there is an additional call to pack_rgb.
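The failing check can be reproduced with a few lines of plain torch (a sketch of the scenario above: a PIL-style HWC buffer viewed as CHW, then cropped; sizes are made up):

```python
import torch

# A PIL-backed image is HWC in memory; PILToTensor permutes it to CHW,
# so the faked 1CHW batch reports channels_last contiguity.
img = torch.rand(32, 32, 3).permute(2, 0, 1)  # CHW view of HWC data
print(img.unsqueeze(0).is_contiguous(memory_format=torch.channels_last))  # True

# Cropping (as RandomCrop does) slices the view; the strides no longer
# match channels_last, so the restride hack in Resize never fires and
# interpolate() allocates a contiguous (CF) output.
crop = img[:, 4:28, 4:28]
print(crop.unsqueeze(0).is_contiguous(memory_format=torch.channels_last))  # False
```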

vfdev-5 (Collaborator) approved these changes:

LGTM, let's merge it and think of another way to avoid the memory format change later.

@NicolasHug NicolasHug merged commit 6ccc712 into pytorch:main May 22, 2023
github-actions bot commented:

Hey @NicolasHug!

You merged this PR, but no labels were added. The list of valid labels is available at https://github.com/pytorch/vision/blob/main/.github/process_commit.py

facebook-github-bot pushed a commit that referenced this pull request May 23, 2023
Reviewed By: vmoens

Differential Revision: D46071408

fbshipit-source-id: 8216a893fc11741260c6c741bfa609cbe4a31a54