Optimize multiplication for Normed #213

kimikage · 2020-08-14T08:03:18Z

This adds wrapping_mul, saturating_mul and checked_mul binary operations. However, this does not specialize them for Fixed and does not change * for Fixed.

This applies the wrapping arithmetic as the default arithmetic of * for the abstract FixedPoint, but as mentioned above, the *for Fixed is not affected by the default arithmetic. Furthermore, * for Normed overrides the default arithmetic with the "checked" arithmetic for backward compatibility. (Strictly speaking, the error type is changed from ArgumentError to OverflowError.)

This replaces most of Normed's implementation of multiplication with integer operations. This improves the speed in many cases and the accuracy in some cases.

Fixes #174

codecov · 2020-08-14T08:19:50Z

Codecov Report

Merging #213 into master will increase coverage by 0.61%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master     #213      +/-   ##
==========================================
+ Coverage   90.39%   91.01%   +0.61%     
==========================================
  Files           6        6              
  Lines         510      534      +24     
==========================================
+ Hits          461      486      +25     
+ Misses         49       48       -1

Impacted Files	Coverage Δ
src/FixedPointNumbers.jl	`87.75% <100.00%> (+0.32%)`	⬆️
src/normed.jl	`92.14% <100.00%> (+0.86%)`	⬆️
src/utilities.jl	`96.00% <0.00%> (+4.00%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 5794adf...1dea1e9. Read the comment docs.

kimikage · 2020-08-14T11:57:52Z

Benchmark

x_n0f8  = collect(rand(N0f8,  1000, 1000)); y_n0f8  = collect(rand(N0f8,  1000, 1000));
x_n0f16 = collect(rand(N0f16, 1000, 1000)); y_n0f16 = collect(rand(N0f16, 1000, 1000));
x_n4f4  = collect(rand(N4f4,  1000, 1000)); y_n4f4  = collect(rand(N4f4,  1000, 1000));
x_n4f12 = collect(rand(N4f12, 1000, 1000)); y_n4f12 = collect(rand(N4f12, 1000, 1000));
z_n4f4  = N4f4.( rand(1000, 1000));
z_n4f12 = N4f12.(rand(1000, 1000));

@btime $x_n0f8 .* $y_n0f8;
@btime $x_n0f16 .* $y_n0f16;
@btime saturating_mul.($x_n4f4, $y_n4f4);
@btime saturating_mul.($x_n4f12, $y_n4f12);
@btime $x_n4f4 .* $z_n4f4;
@btime $x_n4f12 .* $z_n4f12;

	Julia v1.5.0 before	Julia v1.5.0 after	Julia v1.4.2 before	Julia v1.4.2 after
`x_n0f8 .* y_n0f8`	1.965 ms	77.600 μs	1.978 ms	85.100 μs
`x_n0f16 .* y_n0f16`	2.172 ms	165.900 μs	2.168 ms	166.200 μs
`saturating_mul.(x_n4f4, y_n4f4)`	-	105.900 μs	-	104.100 μs
`saturating_mul.(x_n4f12, y_n4f12)`	-	304.200 μs	-	302.700 μs
`$x_n4f4 .* $z_n4f4` (checked_mul)	2.018 ms	794.400 μs	1.991 ms	794.700 μs
`$x_n4f12 .* $z_n4f12`(checked_mul)	2.636 ms	1.086 ms	2.688 ms	1.090 ms

timholy

Remarkable work as usual!

This adds `wrapping_mul`, `saturating_mul` and `checked_mul` binary operations. However, this does not specialize them for `Fixed` and does not change `*` for `Fixed`. This replaces most of Normed's implementation of multiplication with integer operations. This improves the speed in many cases and the accuracy in some cases.

kimikage · 2020-08-24T05:38:00Z

Thank you for all your reviews.
As for the default arithmetic settings, I will ask for feedback again (in Discourse?) before the release of v0.9.

This adds `wrapping_mul` and `checked_mul` binary operations for `Normed`. This replaces most of Normed's implementation of multiplication with integer operations. This improves the speed in many cases and the accuracy in some cases.

kimikage force-pushed the mul_normed branch from e66a277 to 3cc3dfc Compare August 14, 2020 09:58

kimikage force-pushed the mul_normed branch 2 times, most recently from e330f1d to 817f18f Compare August 15, 2020 16:05

kimikage changed the title ~~[WIP] Optimize multiplication for Normed~~ Optimize multiplication for Normed Aug 15, 2020

kimikage marked this pull request as ready for review August 15, 2020 16:24

kimikage force-pushed the mul_normed branch from 817f18f to 0834e19 Compare August 16, 2020 01:00

kimikage mentioned this pull request Aug 17, 2020

Overflow checked arithmetics #41

Open

kimikage force-pushed the mul_normed branch from 0834e19 to 57f5610 Compare August 19, 2020 10:38

This was referenced Aug 20, 2020

Problems with rem for Fixed #219

Closed

Specialize multiplication for Fixed #220

Merged

timholy approved these changes Aug 23, 2020

View reviewed changes

kimikage force-pushed the mul_normed branch from 57f5610 to 1dea1e9 Compare August 24, 2020 03:13

kimikage merged commit 134646f into JuliaMath:master Aug 24, 2020

kimikage deleted the mul_normed branch August 24, 2020 05:39

kimikage mentioned this pull request Aug 31, 2020

Add checked arithmetic for / #222

Merged

kimikage mentioned this pull request Sep 10, 2020

Don't wrap round #227

Open

kimikage mentioned this pull request Oct 29, 2020

Change the default arithmetic of * for Normed to wrapping #236

Merged

kimikage mentioned this pull request Apr 27, 2021

Broken optimization of Normed multiplication and inefficient multiplication JuliaGraphics/ColorVectorSpace.jl#166

Closed

kimikage mentioned this pull request Apr 30, 2024

[RFC] Backports for v0.8.5 #293

Closed

38 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize multiplication for Normed #213

Optimize multiplication for Normed #213

kimikage commented Aug 14, 2020 •

edited

Loading

codecov bot commented Aug 14, 2020 •

edited

Loading

kimikage commented Aug 14, 2020

timholy left a comment

kimikage commented Aug 24, 2020

Optimize multiplication for Normed #213

Optimize multiplication for Normed #213

Conversation

kimikage commented Aug 14, 2020 • edited Loading

codecov bot commented Aug 14, 2020 • edited Loading

Codecov Report

kimikage commented Aug 14, 2020

Benchmark

timholy left a comment

Choose a reason for hiding this comment

kimikage commented Aug 24, 2020

kimikage commented Aug 14, 2020 •

edited

Loading

codecov bot commented Aug 14, 2020 •

edited

Loading