Fix column indices in EnforceDistribution optimizer in Partial AggregateMode #4878

jonmmease · 2023-01-11T21:34:27Z

Which issue does this PR close?

Closes #4873.

Rationale for this change

See #4873 for a failing example and some preliminary investigation.

The issue is in the EnforceDistribution physical optimizer (the query from the issue works without it). After looking at this for a while, I think I see what's going on. Normally, the column expressions for a group by have indices that correspond to the schema of the input operation. But for the case of a partial aggregation, these indices need to be updated to correspond to the schema of the partial aggregation instead of the input.

This make sense to me, and it fixes the query, but this is the first time I've looked at the Physical optimizer so someone who's more familiar with that side of things should definitely take a look before merging this.

Are these changes tested?

I added a test case based on the query reported in the issue.

…ateMode Column expressions need to be updated to correspond with the partial aggregation schema rather than the input schema.

apache/datafusion#4878

* Add VegaFusionTable::with_ordering method * Use explicit ordering column instead of relying on the row_number window function in each transform * Support impute null * Add impute tests and fix serialization of null value * Add ordering to inline tables * Remove test case workaround to get consistent ordering * Remove order column when inline dataset has no transforms * Update DataFusion branch to include fix (apache/datafusion#4878) * Add area streamgraph spec that used to trigger error * Remove row number from joinaggregate transform

alamb

Thank you @jonmmease -- reading the 🕵️ story on #4873 (comment) is fascinating

The fact that this code fixes a bug and has test coverage is quite compelling to me

@mingmwang and @yahoNanJing , who I think are familiar / authors of this code, can you please take a look if you have time. Also cc @metesynnada

alamb · 2023-01-12T22:24:58Z

datafusion/core/src/physical_optimizer/dist_enforcement.rs

+                            group_expr.as_any().downcast_ref::<Column>()
+                        {
+                            Arc::new(Column::new(group_col.name(), idx))
+                                as Arc<dyn PhysicalExpr>


I think you can do something like this to get the rust compiler to do the right cast for you:

Suggested change

as Arc<dyn PhysicalExpr>

as _

However the rest of this module is in the same as Arc<...> style so no need to change it in this PR

Good catch!! It is a bug.

alamb · 2023-01-12T22:32:50Z

datafusion/core/tests/sql/joins.rs

@@ -2810,3 +2810,61 @@ async fn type_coercion_join_with_filter_and_equi_expr() -> Result<()> {

    Ok(())
 }
+
+#[tokio::test]
+async fn test_cross_join_to_groupby_with_different_key_ordering() -> Result<()> {


I verified that this test does indeed panic without the test code

---- sql::joins::test_cross_join_to_groupby_with_different_key_ordering stdout ---- thread 'sql::joins::test_cross_join_to_groupby_with_different_key_ordering' panicked at 'called `Option::unwrap()` on a `None` value', datafusion/core/src/physical_plan/joins/hash_join.rs:923:17 stack backtrace:

mingmwang · 2023-01-13T09:27:52Z

The Column Index in the physical plan is confusing and error prone, I will take a closer look at this PR.

mingmwang · 2023-01-13T10:24:22Z

Regarding the fix, I think you can call below methods to simply the logic. I had test it on my local and it works.

                    // Build new group expressions that correspond to the output of partial_agg
                    let new_final_group: Vec<Arc<dyn PhysicalExpr>> =
                        partial_agg.output_group_expr();
                    let new_group_by= PhysicalGroupBy::new_single(
                        new_final_group
                            .iter()
                            .enumerate()
                            .map(|(i, expr)| (expr.clone(), partial_agg.group_expr().expr()[i].1.clone()))
                            .collect(),
                    );

metesynnada

LGTM, thanks for the fix! I will look closer in a couple of hours.

jonmmease · 2023-01-13T12:17:22Z

Thanks for taking a look @alamb @mingmwang @metesynnada! I made the simplification @mingmwang suggested in f32a8d9.

metesynnada · 2023-01-14T01:30:34Z

LGTM, good work 😀

alamb · 2023-01-14T11:32:16Z

Thank you everyone for your help!

ursabot · 2023-01-14T11:42:54Z

Benchmark runs are scheduled for baseline = a9ddcd3 and contender = dee0dd8. dee0dd8 is a master commit associated with this PR. Results will be available as each benchmark for each run completes.
Conbench compare runs links:
[Skipped ⚠️ Benchmarking of arrow-datafusion-commits is not supported on ec2-t3-xlarge-us-east-2] ec2-t3-xlarge-us-east-2
[Skipped ⚠️ Benchmarking of arrow-datafusion-commits is not supported on test-mac-arm] test-mac-arm
[Skipped ⚠️ Benchmarking of arrow-datafusion-commits is not supported on ursa-i9-9960x] ursa-i9-9960x
[Skipped ⚠️ Benchmarking of arrow-datafusion-commits is not supported on ursa-thinkcentre-m75q] ursa-thinkcentre-m75q
Buildkite builds:
Supported benchmarks:
ec2-t3-xlarge-us-east-2: Supported benchmark langs: Python, R. Runs only benchmarks with cloud = True
test-mac-arm: Supported benchmark langs: C++, Python, R
ursa-i9-9960x: Supported benchmark langs: Python, R, JavaScript
ursa-thinkcentre-m75q: Supported benchmark langs: C++, Java

…ateMode (apache#4878) * Fix column indices in EnforceDistribution optimizer in Partial AggregateMode Column expressions need to be updated to correspond with the partial aggregation schema rather than the input schema. * Simplify new_group_by calculation

…ateMode (apache#4878) * Fix column indices in EnforceDistribution optimizer in Partial AggregateMode Column expressions need to be updated to correspond with the partial aggregation schema rather than the input schema. * Simplify new_group_by calculation (cherry picked from commit dee0dd8)

…ateMode (#4878) (#4959)

* Add VegaFusionTable::with_ordering method * Use explicit ordering column instead of relying on the row_number window function in each transform * Support impute null * Add impute tests and fix serialization of null value * Add ordering to inline tables * Remove test case workaround to get consistent ordering * Remove order column when inline dataset has no transforms * Update DataFusion branch to include fix (apache/datafusion#4878) * Add area streamgraph spec that used to trigger error * Remove row number from joinaggregate transform

Fix column indices in EnforceDistribution optimizer in Partial Aggreg…

a2f1160

…ateMode Column expressions need to be updated to correspond with the partial aggregation schema rather than the input schema.

github-actions bot added the core Core DataFusion crate label Jan 11, 2023

jonmmease mentioned this pull request Jan 11, 2023

panic when GROUP BY column order doesn't match USING column order #4873

Closed

jonmmease added a commit to vega/vegafusion that referenced this pull request Jan 11, 2023

Update DataFusion branch to include fix

7b460b0

apache/datafusion#4878

jonmmease mentioned this pull request Jan 11, 2023

Improve transform ordering using explicit ordering column vega/vegafusion#222

Merged

alamb approved these changes Jan 12, 2023

View reviewed changes

Simplify new_group_by calculation

f32a8d9

metesynnada reviewed Jan 13, 2023

View reviewed changes

alamb merged commit dee0dd8 into apache:master Jan 14, 2023

jonmmease deleted the jonmmease/GH4873 branch January 14, 2023 12:39

andygrove mentioned this pull request Jan 17, 2023

[maint-16.x] Cherry pick PRs related to windowed aggregations #4956

Closed

jonmmease mentioned this pull request Jan 18, 2023

Maint-16.x Backport: #4878 #4959

Merged

andygrove pushed a commit that referenced this pull request Jan 18, 2023

Fix column indices in EnforceDistribution optimizer in Partial Aggreg…

35e34d4

…ateMode (#4878) (#4959)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix column indices in EnforceDistribution optimizer in Partial AggregateMode #4878

Fix column indices in EnforceDistribution optimizer in Partial AggregateMode #4878

jonmmease commented Jan 11, 2023

alamb left a comment

alamb Jan 12, 2023

mingmwang Jan 13, 2023 •

edited

Loading

alamb Jan 12, 2023

mingmwang commented Jan 13, 2023

mingmwang commented Jan 13, 2023

metesynnada left a comment •

edited

Loading

jonmmease commented Jan 13, 2023

metesynnada commented Jan 14, 2023

alamb commented Jan 14, 2023

ursabot commented Jan 14, 2023

Fix column indices in EnforceDistribution optimizer in Partial AggregateMode #4878

Fix column indices in EnforceDistribution optimizer in Partial AggregateMode #4878

Conversation

jonmmease commented Jan 11, 2023

Which issue does this PR close?

Rationale for this change

Are these changes tested?

alamb left a comment

Choose a reason for hiding this comment

alamb Jan 12, 2023

Choose a reason for hiding this comment

mingmwang Jan 13, 2023 • edited Loading

Choose a reason for hiding this comment

alamb Jan 12, 2023

Choose a reason for hiding this comment

mingmwang commented Jan 13, 2023

mingmwang commented Jan 13, 2023

metesynnada left a comment • edited Loading

Choose a reason for hiding this comment

jonmmease commented Jan 13, 2023

metesynnada commented Jan 14, 2023

alamb commented Jan 14, 2023

ursabot commented Jan 14, 2023

mingmwang Jan 13, 2023 •

edited

Loading

metesynnada left a comment •

edited

Loading