Add Modulated Deformable Convolution layer #2196
Conversation
Thanks for your pull request. It looks like this may be your first contribution to a Google open source project (if not, look below for help). Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA). 📝 Please visit https://cla.developers.google.com/ to sign. Once you've signed (or fixed any issues), please reply here.
What to do if you already signed the CLA: Individual signers / Corporate signers.
ℹ️ Googlers: Go here for more info.
I signed the CLA.
This reverts commit 44bf775.
Compliant test fails due to |
I found that Addons has |
All green! Ready for review! |
I couldn't wait to make the GPU kernel until the CPU one was reviewed!
/cc @tanzhenyu @dynamicwebpaige for ecosystem check. |
I put this under ecosystem review. If no project in the TF ecosystem is interested in this or has it on its own internal roadmap, we will try to review it here.
This is one single feature, deformable convolution layer. I don't see how this can be split into multiple pull requests. |
Thanks @Licht-T I look forward to this feature! Can't wait to use this for experiments. |
This PR is now near 3k lines. Have we already evaluated the performance gap against a compositional (even if v1) implementation like https://tensorlayer.readthedocs.io/en/latest/_modules/tensorlayer/layers/convolution/deformable_conv.html ?
if self.data_format != "channels_first":
    raise ValueError("`channels_last` data format is not supported.")
"channels_last" is the default data_format in tf.keras.layers.Conv2D. If user is selecting "channels_last", the output should be transposed, instead of raising error.
`swapaxis` incurs a significant computing cost. IMO, `swapaxis` should not be applied implicitly.
The code looks great, we are using it successfully in one of our projects. Is anybody still working on it? It's been more than a year of waiting on deformable conv :) ... |
Thanks for a lot of comments. I'm back! Now I have much time to focus on this PR and will resolve some conflicts this week. |
That's difficult as
|
@bhack Not tested yet. I'll try. According to tensorlayer/TensorLayer#641 (comment), TensorLayer implementation is 100 times slower than the original MXNet implementation. That's why we need the optimized C++ code for Deformable Convolution.
|
It is 3 years old. We could check what happens now with the current compile stack. As a general side note: /cc @yarri-oss @tomerk As custom kernels are going to create quite a large code and maintainership overhead in the repo, can we have a contact point in the XLA/HLO team to just understand if and when we could achieve enough performance with a compositional/compiling path (e.g. trigger some FRs to the compiler stack)? We really want to reduce the number of maintained custom ops in the library. See also https://discuss.tensorflow.org/t/tfr-compositional-ops/ |
@bhack Thanks for your comments! I did some tests. In the best case, this PR is approx. 1000 times faster than the TensorLayer one.
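A rough comparison of this kind can be made with a small stdlib harness (this is a hypothetical sketch, not the measurement setup actually used; any TF graph warm-up or first-call tracing should happen before the timed region):

```python
import timeit

def measure_speedup(fast_fn, slow_fn, number=20):
    """Rough wall-clock speedup factor (slow over fast).
    Both callables take no arguments: bind inputs beforehand and
    do any warm-up (e.g. tf.function tracing) before passing them in."""
    t_fast = timeit.timeit(fast_fn, number=number) / number
    t_slow = timeit.timeit(slow_fn, number=number) / number
    return t_slow / t_fast
```

Averaging over several runs smooths out scheduler noise, but wall-clock ratios like the "approx. 1000x" above still depend heavily on input shapes and hardware.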
|
I know this PR is quite a large amount of code. I don't want to make maintainers annoyed by the large custom-ops code, but I just want to provide the Deformable Convolution, which is one of the greatest achievements in CNN research, to TensorFlow users. If we can achieve good performance in another way like XLA/HLO, I should/will re-implement this PR in that way. |
@Licht-T Yes, thank you for the effort. In this specific case we will have near 3k lines to maintain. Also, I think that you need to test the tensorlayer implementation with You can inspect the compiled function with: https://www.tensorflow.org/xla#inspect_compiled_programs If you could profile it, it would also be interesting to see if there is any specific dominant op. https://www.tensorflow.org/tensorboard/tensorboard_profiling_keras You can use profiler start and stop if you don't want to use callbacks |
That looks really great! We are currently using deformable convolutions for a project and are using the TensorLayer version. But it's too slow for our use case. So I tried your implementation but ran into problems. How do I correctly build the code including the deformable conv layer? So far I cloned your repo, switched to the add-deformable-conv branch (tried add-deformable-conv-internal as well) and followed the instructions to install tensorflow_addons from source. The build process runs fine (just some bazel warnings). However, installing the wheel and trying a simple code
throws this error:
What am I doing wrong? Some help would be much appreciated :) Some additional info: I'm using Ubuntu 18.04 and the warnings are several
and a
|
Hey, I've tried this deformable convolution implementation in my project, and it seems to work as expected. However, the performance hit compared to a normal convolution is huge. Would it be possible to support NHWC ordering and mixed_fp16/fp16? An additional thought: unless I'm mistaken, the modulated convolution could be implemented as first a layer that grabs the values at the predicted positions (with bilinear interpolation), applies the mask, and concatenates the values (for example, for a 3x3 convolution the number of output channels would be multiplied by 9), then a standard 1x1 convolution. |
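The compositional formulation suggested in that comment (bilinear gather at predicted positions, modulation mask, then a plain convolution) can be illustrated with a small single-channel NumPy sketch. Names and shapes here are illustrative, not the PR's API, and for brevity the final 1x1 convolution is folded directly into the weighted sum, which is mathematically equivalent:

```python
import numpy as np

def bilinear_sample(img, y, x):
    """Sample a (H, W) image at fractional coords (y, x), zero-padded."""
    H, W = img.shape
    y0, x0 = int(np.floor(y)), int(np.floor(x))
    val = 0.0
    for dy in (0, 1):
        for dx in (0, 1):
            yy, xx = y0 + dy, x0 + dx
            w = (1 - abs(y - yy)) * (1 - abs(x - xx))
            if w > 0 and 0 <= yy < H and 0 <= xx < W:
                val += w * img[yy, xx]
    return val

def deform_conv2d_single(img, offsets, mask, kernel):
    """Modulated deformable 3x3 conv on one (H, W) channel, stride 1.
    offsets: (H, W, 9, 2) learned (dy, dx) per kernel tap,
    mask:    (H, W, 9) learned modulation scalars,
    kernel:  (3, 3) ordinary conv weights.
    Compositional view: gather -> modulate -> weighted sum."""
    H, W = img.shape
    out = np.zeros((H, W))
    for i in range(H):
        for j in range(W):
            acc = 0.0
            for k in range(9):
                ky, kx = divmod(k, 3)
                y = i + ky - 1 + offsets[i, j, k, 0]
                x = j + kx - 1 + offsets[i, j, k, 1]
                acc += kernel[ky, kx] * mask[i, j, k] * bilinear_sample(img, y, x)
            out[i, j] = acc
    return out
```

With all offsets zero and all mask values one, this reduces exactly to an ordinary 3x3 convolution, which is a useful sanity check for any implementation.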
@Licht-T I tried compiling it using CUDA 11.6 and cuDNN 8.3, but I am getting the following error on usage:
|
Changing every occurrence of |
Thanks for the contribution, but I don't think we have the bandwidth here to add another custom op to the repo. Please try to contribute a Python-only version in Keras-CV: https://github.com/keras-team/keras-cv |
Hi @bhack, If I were to give this PR another try but attempted to split it into smaller, more bite-sized PRs, would you/others be willing to review and merge? Another caveat is that I would want my initial "hello world" implementation to be a 3D deformable convolution as opposed to 2D within this PR. It would be my intent to leave a 2D implementation to someone else who wants to modify/extend my work (where I would be happy to support and review in that endeavour, but would be leaving someone else to spearhead it). Although I can't prove that I'll stick around after the PR, I do maintain a package in my field which pulls in outside contributions, and having felt some of the pain of having code owners disappear, I hope my word has enough weight when I say I want to stick around and help when there is future work/issues/PRs going on around the code that I contribute. Cheers,
@axeldavy, if I was to take this on, might you want to help me? |
@SimonBiggs Can you open a ticket to propose this in Keras-CV https://github.com/keras-team/keras-cv/ As many of the computer vision related contributions are converging there. |
I notice the following comment re custom ops: Would that make this contribution the first custom op in Keras-CV that pulls in CUDA code? I would aim to use the approach that @axeldavy mentioned, so the custom op section would hopefully have a reasonably smaller surface area. |
We could start with a ticket there to discuss the topic and how to cover this convolution before creating a PR. |
Thanks @bhack, New issue now over at keras-team/keras-cv#1140. Cheers :), |
Description
This is the implementation of Modulated Deformable Convolution. This fixes #179. There is another PR, #1129, but it is stale and contaminated by code of unknown license. So I re-implemented this based on Torchvision's implementation.
I knew this would be too big to get reviewed, so, at first, I made the CPU-only kernel with Eigen. When the CPU kernel is good enough, I will move to the next step: the GPU kernel implementation. UPDATE: GPU kernel available!
Type of change
Checklist:
How Has This Been Tested?
pytest unit testing