feat: support more scalar operations for duckdb, Increase width for ipython #1877

MarcoGorelli · 2025-01-27T19:29:56Z

Very similar to the recent PySpark PR #1870

MarcoGorelli · 2025-01-27T19:31:27Z

narwhals/utils.py

-        terminal_width = 80
+        terminal_width = int(os.getenv("COLUMNS", 80))  # noqa: PLW1508
    native_lines = native_repr.splitlines()
    max_native_width = max(len(line) for line in native_lines)

-    if max_native_width + 2 < terminal_width:
+    if max_native_width + 2 <= terminal_width:


drive-by, but I noticed that on Kaggle notebooks os.get_terminal_size() raises, whereas COLUMNS is set to 100. COLUMNS seems to be the IPython standard
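A minimal sketch of the fallback described above, for readers who want to try it outside the diff. The helper names `terminal_width` and `fits_in_terminal` are hypothetical and not taken from narwhals, and the surrounding try/except is an assumption about how the changed line is reached; only the `COLUMNS` lookup and the `<=` comparison come from the diff itself.

```python
import os


def terminal_width(default: int = 80) -> int:
    """Best-effort terminal width, falling back to COLUMNS (hypothetical helper)."""
    try:
        return os.get_terminal_size().columns
    except OSError:
        # No real terminal attached (reported above for Kaggle notebooks):
        # fall back to the COLUMNS environment variable, then to the default.
        return int(os.getenv("COLUMNS", str(default)))


def fits_in_terminal(max_native_width: int, width: int) -> bool:
    """Mirror the changed comparison: a repr that fills the width exactly still fits."""
    # The "+ 2" is copied from the diff; what it accounts for is not stated there.
    return max_native_width + 2 <= width
```

With the old strict `<`, a repr whose widest line is 78 characters would not have counted as fitting an 80-column terminal; with `<=` it does.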

tpch/execute.py

MarcoGorelli · 2025-01-27T20:01:35Z

uurghh

gonna have to come back to this

MarcoGorelli · 2025-01-27T20:06:56Z

🤔 we might need _is_literal 😭

MarcoGorelli · 2025-01-27T20:56:27Z

just jotting down some thoughts:

by "aggregation", I mean both nw.col('a').mean() and nw.lit(1)
literal aggregation: nw.lit(1)
non-literal aggregation: nw.col('a').mean()
non-aggregation: nw.col('a').round()

n-ary operation between expressions:

broadcasting:

if there's at least one non-literal aggregation and at least one non-aggregation, then each non-literal aggregation needs an "over ()"
in all other cases, leave the expressions as they are

kind of the output:

if all the expressions are literal aggregations, then the output is a literal aggregation
if all the expressions aggregate, then the output expression also aggregates
in all other cases, the output doesn't aggregate
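The broadcasting part of the n-ary rule above can be sketched as follows. This is an illustration under stated assumptions, not the code from the PR: the member names follow the ExprKind docstring suggested later in this thread, and `operands_needing_over` is a hypothetical helper that works on kinds rather than on real DuckDB expressions.

```python
from __future__ import annotations

from enum import Enum, auto


class ExprKind(Enum):
    LITERAL = auto()  # literal aggregation, e.g. nw.lit(1)
    AGGREGATION = auto()  # non-literal aggregation, e.g. nw.col('a').mean()
    TRANSFORM = auto()  # non-aggregation, e.g. nw.col('a').round()


def operands_needing_over(kinds: list[ExprKind]) -> list[bool]:
    """For each operand, say whether it would need an "over ()" (hypothetical helper)."""
    # Broadcasting only matters when a non-aggregation is present.
    has_transform = any(kind is ExprKind.TRANSFORM for kind in kinds)
    # Only non-literal aggregations are flagged; literals and transforms stay as they are.
    return [has_transform and kind is ExprKind.AGGREGATION for kind in kinds]
```

For example, in something like nw.col('a').round() + nw.col('a').mean(), only the second operand would be flagged.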

select:

if all expressions aggregate or are literals, use agg
in all other cases, use .select. broadcasting should already have happened by this stage

with_columns:

aggregations which aren't literals need an "over ()"
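A rough sketch of the select and with_columns rules above, again working on kinds only. The names `select_method` and `with_columns_needs_over` are hypothetical, and the strings "agg" and "select" simply echo the wording of the notes; they are not claims about which DuckDB calls narwhals ends up making.

```python
from __future__ import annotations

from enum import Enum, auto


class ExprKind(Enum):
    LITERAL = auto()  # e.g. nw.lit(1)
    AGGREGATION = auto()  # e.g. nw.col('a').mean()
    TRANSFORM = auto()  # e.g. nw.col('a').round()


def select_method(kinds: list[ExprKind]) -> str:
    """Pick the path for select: "agg" if everything aggregates or is a literal."""
    if all(kind is not ExprKind.TRANSFORM for kind in kinds):
        return "agg"
    # Broadcasting is assumed to have happened already at this stage.
    return "select"


def with_columns_needs_over(kind: ExprKind) -> bool:
    """In with_columns, only non-literal aggregations need an "over ()"."""
    return kind is ExprKind.AGGREGATION
```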

MarcoGorelli · 2025-01-27T22:39:12Z

tpch/execute.py

-DUCKDB_SKIPS = ["q14", "q15"]
+DUCKDB_SKIPS = ["q15"]


just one left!

🥳

FBruzzesi

Amazing 🤩 I think this approach will be very useful elsewhere 😉

Left a tiny suggestion on the ExprKind docstring, feel free to expand on it

FBruzzesi · 2025-01-27T23:00:36Z

narwhals/_duckdb/utils.py

-    # it means that it was a scalar (e.g. nw.col('a') + 1), and so we default
-    # to `True`.
-    return lhs._returns_scalar and getattr(rhs, "_returns_scalar", True)
+def n_ary_operation_expr_kind(*args: DuckDBExpr | Any) -> ExprKind:


FBruzzesi · 2025-01-27T23:05:44Z

narwhals/_duckdb/utils.py

+class ExprKind(Enum):
+    LITERAL = auto()  # e.g. nw.lit(1)


Might be worth adding a tiny docstring that describes the interaction between values:

Suggested change

-class ExprKind(Enum):
-    LITERAL = auto()  # e.g. nw.lit(1)
+class ExprKind(Enum):
+    """Describe which kind of expression we are dealing with.
+
+    Composition rule is:
+    - LITERAL & LITERAL -> LITERAL
+    - TRANSFORM & X -> TRANSFORM
+    - X & TRANSFORM -> TRANSFORM
+    - all remaining cases -> AGGREGATION
+    """
+
+    LITERAL = auto()  # e.g. nw.lit(1)
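The composition rule in the suggested docstring can be read as a small function over kinds. The sketch below is only an illustration: `combine_kinds` is a hypothetical name, it takes ExprKind values directly rather than expressions, and it is not the body of n_ary_operation_expr_kind from the PR.

```python
from enum import Enum, auto


class ExprKind(Enum):
    LITERAL = auto()  # e.g. nw.lit(1)
    AGGREGATION = auto()  # e.g. nw.col('a').mean()
    TRANSFORM = auto()  # e.g. nw.col('a').round()


def combine_kinds(*kinds: ExprKind) -> ExprKind:
    """Apply the docstring's composition rule to any number of operands."""
    if all(kind is ExprKind.LITERAL for kind in kinds):
        return ExprKind.LITERAL  # LITERAL & LITERAL -> LITERAL
    if any(kind is ExprKind.TRANSFORM for kind in kinds):
        return ExprKind.TRANSFORM  # TRANSFORM & X, X & TRANSFORM -> TRANSFORM
    return ExprKind.AGGREGATION  # all remaining cases
```

Checking literals first and transforms second keeps the three branches in the same order as the docstring.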

MarcoGorelli · 2025-01-28T07:47:52Z

thanks for your review!

MarcoGorelli added 2 commits January 27, 2025 13:50

feat: increase terminal width for ipython

80840f4

feat: support for scalar operations for duckdb

6889bd8

fixup

701aebd

MarcoGorelli marked this pull request as ready for review January 27, 2025 19:44

MarcoGorelli added the enhancement New feature or request label Jan 27, 2025

MarcoGorelli marked this pull request as draft January 27, 2025 19:58

MarcoGorelli added 4 commits January 27, 2025 22:21

eureka!!!

76116ba

Merge remote-tracking branch 'upstream/main' into increase-width-for-ipython

3444c90

fixup horizontal aggs

9676570

xfail new test for dask

9f71692

MarcoGorelli marked this pull request as ready for review January 27, 2025 22:38

FBruzzesi approved these changes Jan 27, 2025

extra docstring

868af9e

MarcoGorelli merged commit b4ff6a0 into narwhals-dev:main Jan 28, 2025
22 of 23 checks passed
