Fix some simplification rules for floating-point arithmetic operations #7515

jonahgao · 2023-09-09T17:30:12Z

Which issue does this PR close?

N/A

Rationale for this change

Similar to #7503.

Because of the presence of NaN, some simplification rules will no longer be applicable to floating-point types.

What changes are included in this PR?

Are these changes tested?

Yes

Are there any user-facing changes?

No

alamb · 2023-09-11T18:02:30Z

datafusion/optimizer/src/simplify_expressions/expr_simplifier.rs

            Expr::BinaryExpr(BinaryExpr {
                left,
                op: Modulo,
                right,
            }) if !info.nullable(&left)? && is_zero(&right) => {
-                return Err(DataFusionError::ArrowError(ArrowError::DivideByZero));
+                match info.get_data_type(&left)? {


I don't really understand the rationale for float % float --> NaN (rather than error)

But on the other hand, postgres doesn't seem to support % on floating point values:

select 1.0::float % 0::float; operator does not exist: double precision % double precision LINE 1: select 1.0::float % 0::float; ^ HINT: No operator matches the given name and argument types. You might need to add explicit type casts.

@alamb
DataFusion utilizes the rem() function from the arrow-rs to perform the Modulo operation.

The modification here ensures that it behaves the same as the rem() function in arrow-rs, i.e., float % 0. --> NAN.
https://github.com/apache/arrow-rs/blob/77455d48cd6609045a4728ba908123de9d0b62fd/arrow-arith/src/numeric.rs#L71-L77

And in the IEEE 754-2008 standard：

7.2 Invalid operation 7.2.0
For operations producing results in floating-point format, the default result of an operation that signals the
invalid operation exception shall be a quiet NaN...
These operations are:
...
f) remainder: remainder(x, y), when y is zero or x is infinite...
...

Ref: https://en.wikipedia.org/wiki/NaN#Operations_generating_NaN

alamb

Thank you @jonahgao -- these changes look good to me (though as I mention, I don't understand the behavior for % vs /).

As always, very nicely tested 🏅

alamb · 2023-09-12T13:53:04Z

Thanks for the clarification @jonahgao

Fix some simplification rules for floating-point arithmetic operations

4632f80

github-actions bot added optimizer Optimizer rules sqllogictest SQL Logic Tests (.slt) labels Sep 9, 2023

alamb reviewed Sep 11, 2023

View reviewed changes

alamb approved these changes Sep 11, 2023

View reviewed changes

alamb merged commit 87527c4 into apache:main Sep 11, 2023

jonahgao deleted the float_simplification branch September 12, 2023 03:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix some simplification rules for floating-point arithmetic operations #7515

Fix some simplification rules for floating-point arithmetic operations #7515

jonahgao commented Sep 9, 2023

alamb Sep 11, 2023

jonahgao Sep 12, 2023

alamb left a comment

alamb commented Sep 12, 2023

Fix some simplification rules for floating-point arithmetic operations #7515

Fix some simplification rules for floating-point arithmetic operations #7515

Conversation

jonahgao commented Sep 9, 2023

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

alamb Sep 11, 2023

Choose a reason for hiding this comment

jonahgao Sep 12, 2023

Choose a reason for hiding this comment

alamb left a comment

Choose a reason for hiding this comment

alamb commented Sep 12, 2023