REF: Reductions #53261

jbrockmendel · 2023-05-16T16:17:53Z

We have reductions implemented in nanops, _libs.groupby, and _libs.window.aggregations. We should refactor these with the following goals in mind:

Have one/fewer distinct implementations
Avoid copies, particularly in the nanops versions where we do something like values[notna(values)]
Chunked-friendliness, so that we can re-write ArrowExtensionArray._groupby_op to operate chunk-by-chunk, avoiding a copy in multi-chunk cases. (This could also be useful for hypothetical distributed EAs)
Avoid casting/inference in nanops
update Do axis=1 reductions without transposing/copying, inspired by PERF: axis=1 reductions with EA dtypes #54341

The implementation of group_skew is derived from https://www.johndcook.com/blog/skewness_kurtosis/ which includes a method for "adding" multiple RunningStats instances. Something like that could be adapted for 3).

The text was updated successfully, but these errors were encountered:

mroeschke · 2023-05-16T17:55:17Z

Just noting that _libs.window.aggregations should ideally keep its implementation since the sliding window aggregation is performant sensitive. I think the other reductions could be implemented in terms of the sliding windowing aggregation i.e. they would be non-overlapping windows

jbrockmendel added Bug Needs Triage Issue that has not been reviewed by a pandas team member labels May 16, 2023

mroeschke added Refactor Internal refactoring of code Reduction Operations sum, mean, min, max, etc. and removed Bug Needs Triage Issue that has not been reviewed by a pandas team member labels May 16, 2023

mroeschke mentioned this issue Jun 20, 2023

ENH: Add separate numba kernels for groupby aggregations #53731

Merged

5 tasks

jbrockmendel mentioned this issue Jul 10, 2023

BUG: GroupBy.std floating point error #51332

Open

jbrockmendel mentioned this issue Aug 3, 2023

PERF: axis=1 reductions with EA dtypes #54341

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

REF: Reductions #53261

REF: Reductions #53261

jbrockmendel commented May 16, 2023 •

edited

Loading

mroeschke commented May 16, 2023

REF: Reductions #53261

REF: Reductions #53261

Comments

jbrockmendel commented May 16, 2023 • edited Loading

mroeschke commented May 16, 2023

jbrockmendel commented May 16, 2023 •

edited

Loading