New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Implement Box blur fast filter that could approximate gaussian filter #223

Open

light-le wants to merge 7 commits into kornia:main from light-le:box-blur-fast

light-le commented Jan 16, 2025 •

edited

Loading

solve #168. The algorithm was derived from this blog post

light-le added 3 commits

January 16, 2025 16:32


          implement box_blur_fast_kernels_1d with test


          implement fast_horizontal_filter with test

2c5c101


          implement box_blur_fast ops with tests

johnnv1 reviewed

View reviewed changes

Member

johnnv1 left a comment

can you add benchmarks to it as well?

Author

light-le commented Jan 17, 2025

You mean in crates/kornia-imgproc/benches/bench_filters.rs ? Sure ok

edgarriba requested changes

View reviewed changes

crates/kornia-imgproc/src/filter/kernels.rs Show resolved Hide resolved

crates/kornia-imgproc/src/filter/kernels.rs Show resolved Hide resolved

crates/kornia-imgproc/src/filter/kernels.rs

@@ @@ -92,4 +122,11 @@ mod tests { @@
                           assert_eq!(k, expected[i]);
                       }
                   }
+                  #[test]

Member

edgarriba Jan 18, 2025

Maybe add also some test that it’s not only ones ?

Author

light-le Jan 25, 2025

Ok I'll add a few more tests

crates/kornia-imgproc/src/filter/ops.rs Outdated

+              /// * `src` - The source image with shape (H, W, C).
+              /// * `dst` - The destination image with shape (H, W, C).
+              /// * `kernel_size` - The size of the kernel (kernel_x, kernel_y).
+              /// * `sigma` - The sigma of the gaussian kernel.

Member

edgarriba Jan 18, 2025

Specify that it should be x-y ordered

Author

light-le Jan 25, 2025

Right

crates/kornia-imgproc/src/filter/ops.rs

+              mod tests {
+                  use super::*;
+                  #[test]

Member

edgarriba Jan 18, 2025

I would some simple numbers test too similar to the other functions to verify that’s doing the right thing

Author

light-le Jan 25, 2025

So I added 2 tests here: test_box_blur_fast() and test_gaussian_blur(). Both has the same input (0..25) to show that the outputs are not that much different.

I did attempt to use the same input (all 0.0, 9.0 in the middle) as test_fast_horizontal_filter(), if that's how you mean by this comment. The result was a little disappointing as there's a big difference between the outputs of the 2 methods. I figured it's because the test input was odd. It might be fitting for test_fast_horizontal_filter() but not for these. Therefore I went with something more randomized.

Member

edgarriba Jan 26, 2025

Why should be different? The test you describe should give you a box of ones, right ?

crates/kornia-imgproc/src/filter/separable_filter.rs

@@ @@ -88,6 +88,72 @@ pub fn separable_filter<const C: usize>( @@
                   Ok(())
               }
+              /// Apply a fast filter horizontally, take advantage of property where all

Member

edgarriba Jan 18, 2025

Split header docs. Usually there’s a single line explaining in short the purpose of the function followed by end of line then you can add some clarification, formulation of anything needed

crates/kornia-imgproc/src/filter/separable_filter.rs Outdated

+              /// * `src` - The source image with shape (H, W, C).
+              /// * `dst_transposed` - The destination image with shape (W, H, C).
+              /// * `half_kernel_x_size` - Half of the kernel at weight 1. The total size would be 2*this+1
+              pub fn fast_horizontal_filter<const C: usize>(

Member

edgarriba Jan 18, 2025

Suggested change

      
            pub fn fast_horizontal_filter<const C: usize>(
          
            fn fast_horizontal_filter<const C: usize>(

I wouldn’t make public to users, this is more a utility function

Author

light-le Jan 25, 2025

I change it to pub(crate) because just removing pub would not expose it to ops.rs/box_blur_fast()

crates/kornia-imgproc/src/filter/separable_filter.rs Outdated

+              /// * `half_kernel_x_size` - Half of the kernel at weight 1. The total size would be 2*this+1
+              pub fn fast_horizontal_filter<const C: usize>(
+                  src: &Image<f32, C>,
+                  dst_transposed: &mut Image<f32, C>,

Member

edgarriba Jan 18, 2025

For consistency just dst ?

Author

light-le Jan 25, 2025

You're right. I was signaling that the result would be transposed. But it could be placed in the docstring.

crates/kornia-imgproc/src/filter/separable_filter.rs

+                              row_acc[ch] += src_data[kernel_pix_offset];
+                          }
+                          leftmost_pixel[ch] = *source_pixel;
+                          rightmost_pixel[ch] = src_data[pix_offset+((src.cols()-1)*C)];

Member

edgarriba Jan 18, 2025

(src.cols()-1)*C) could be computed outside before the loops

crates/kornia-imgproc/src/filter/separable_filter.rs

+                      if c == 0 {
+                          row_acc[ch] = *source_pixel * (half_kernel_x_size+1) as f32;
+                          let mut kernel_pix_offset = pix_offset;

Member

edgarriba Jan 18, 2025

Wondering wether this offset could be precomputed beforehand as you are computing several times in the top level function

Member

edgarriba commented Jan 18, 2025

@johnnv1 any idea why python tests are failing (I believe it’s unrelated to this PR). Shouldn’t we be using the new just commands in https://github.com/kornia/kornia-rs/blob/main/.github/workflows/python_test.yml#L40

Member

johnnv1 commented Jan 19, 2025

@johnnv1 any idea why python tests are failing (I believe it’s unrelated to this PR). Shouldn’t we be using the new just commands in https://github.com/kornia/kornia-rs/blob/main/.github/workflows/python_test.yml#L40

yeah, seems unrelated, but should be working

johnnv1 closed this

johnnv1 reopened this

light-le and others added 4 commits

January 25, 2025 06:34


          Apply docstring suggestions from code review

37fc576

Co-authored-by: Edgar Riba <[email protected]>


          Merge branch 'kornia:main' into box-blur-fast

5facd4f


          add more tests for box_blur_fast_kernels_1d

849d57a


          some fixes from suggestions

b7b86ce

edgarriba requested changes

View reviewed changes

Member

edgarriba left a comment

Can you expand this benchmark and report the numbers so that we know wether this method is really making what’s expected ?

https://github.com/kornia/kornia-rs/blob/main/crates/kornia-imgproc/benches/bench_filters.rs

I highly suggest once you have the benchmark setup that you play around with it and try to do micro optimisations like reusing as much as possible pre-computed variables as I suggested in the review to see how affects in the benchmarks.

crates/kornia-imgproc/src/filter/ops.rs

+                  let mut input_img = src.clone();
+                  for (half_kernel_x_size, half_kernel_y_size) in half_kernel_x_sizes.iter().zip(half_kernel_y_sizes.iter()) {
+                      let mut transposed = Image::<f32, C>::from_size_val(transposed_size, 0.0)?;

Member

edgarriba Jan 26, 2025

You can allocate once outside the loop

crates/kornia-imgproc/src/filter/ops.rs

+              mod tests {
+                  use super::*;
+                  #[test]

Member

edgarriba Jan 26, 2025

Why should be different? The test you describe should give you a box of ones, right ?

edgarriba linked an issue

that may be closed by this pull request

Implement fast-box-blur #168

Open

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet