Feature/narwhalify arb imp #390

alphanumericmale · 2025-03-06T15:10:08Z

Narwhalification of the arbitrary imputer #315

tubular/imputers.py

tests/imputers/test_ArbitraryImputer.py

liamholmes31 · 2025-03-13T14:44:17Z

I think one of the other issues is coming from here:
https://github.com/azukds/tubular/blob/9308197248d24d5452315b742843b2c3f67007bc/tests/imputers/test_BaseImputer.py#L222C1-L222C17

Should be:

expected_df_3["c"] = expected_df_3["c"].cat.add_categories(
    transformer.impute_values_["c"],
)
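The reason the category has to be added first can be shown in isolation: a pandas categorical column only accepts values that are already among its categories. The sketch below uses a made-up column and impute value, not the actual test fixture.

```python
import pandas as pd

# Assumed stand-ins for the test's expected frame and the fitted impute value.
impute_value = "z"
expected = pd.DataFrame({"c": pd.Series(["a", None, "b"], dtype="category")})

# Register the impute value as a category first...
expected["c"] = expected["c"].cat.add_categories(impute_value)
# ...so that filling nulls with it is permitted on the categorical column.
expected["c"] = expected["c"].fillna(impute_value)
```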

liamholmes31 · 2025-03-18T10:35:35Z

Note, also:

- cleaned up some leftover debug prints in the package
- updated narwhals to avoid CI failures from a deprecated argument

cjmwills · 2025-03-20T16:58:00Z

tubular/imputers.py

-        impute_value: float | str,
-        columns: str | list[str],
-        **kwargs: dict[str, bool],
+        impute_value: Union[int, float, str, bool],


Q: Just checking we're happy to now accept bool values in impute_value? I ask only because it looks like that wasn't allowed before.

Yeah, I think this should be OK. I imagine it was just left out by oversight previously, unless you can think of a reason to exclude it?

We have a test that makes sure the falsey value False works, so that covers one potential issue.

No reason I can think of, to be fair, and like you said, the test is a good check for it. I've also just noticed that the docstring for this will need updating to include bool. Currently it reads: impute_value : int or float or str
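The falsey concern raised in this thread can be sketched in plain Python. Both helpers below are hypothetical and are not taken from tubular; they only illustrate why a truthiness check would wrongly reject False (or 0), while an explicit type check and a comparison against None would not.

```python
from typing import Union

ImputeValue = Union[int, float, str, bool]


def check_impute_value(impute_value: ImputeValue) -> ImputeValue:
    """Hypothetical validator: reject anything outside the accepted types."""
    # bool is a subclass of int, so listing it is redundant for isinstance,
    # but it keeps the check aligned with the type hint and docstring.
    if not isinstance(impute_value, (int, float, str, bool)):
        msg = (
            "impute_value should be int, float, str or bool, "
            f"got {type(impute_value).__name__}"
        )
        raise TypeError(msg)
    return impute_value


def is_set(impute_value) -> bool:
    """Hypothetical guard: compare against None instead of using truthiness."""
    # `if impute_value:` would treat False, 0 and "" as missing.
    return impute_value is not None
```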

cjmwills · 2025-03-27T09:29:16Z

tests/imputers/test_ArbitraryImputer.py

+        ):
+            transformer.transform(df)
+
+    @pytest.mark.parametrize("library", ["pandas", "polars"])


Q: Strangely, there are a couple of the pandas tests here where the transformer doesn't appear to work properly. With the parameters (pandas, 'a', 'String', 'z') the transformer doesn't impute the value z; it just leaves the value as null. And for (pandas, 'c', 'Boolean', True) it actually appears to impute with False. Can you double-check you're seeing the same thing?

The problem is that the tests still pass, because the dtype has been preserved.
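One way to close the gap described here is for the test to assert on the imputed values as well as on the dtype. The helper below is a hypothetical sketch using pandas only; `assert_imputed` is not part of the tubular test suite, and the small frame is made up for illustration.

```python
import pandas as pd


def assert_imputed(original, result, column, impute_value):
    """Hypothetical test helper: check values as well as dtype."""
    # The dtype must be preserved by the transform.
    assert result[column].dtype == original[column].dtype
    # No nulls should survive imputation.
    assert not result[column].isna().any()
    # Rows that were null must now hold exactly the impute value.
    was_null = original[column].isna()
    assert (result.loc[was_null, column] == impute_value).all()


original = pd.DataFrame({"a": pd.array(["x", None], dtype="string")})
imputed = original.fillna({"a": "z"})
assert_imputed(original, imputed, "a", "z")
```

With a check like this, a result that keeps the dtype but leaves the null in place, or fills it with the wrong value, fails the test instead of passing.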

alphanumericmale added 3 commits March 6, 2025 14:43

tests narwhalified

14ece40

failing tests atm

e0c88ed

some progress

18da6d5

liamholmes31 reviewed Mar 6, 2025

tubular/imputers.py Outdated

tubular/imputers.py Outdated

liamholmes31 reviewed Mar 6, 2025

tests/imputers/test_ArbitraryImputer.py Outdated

davidhopkinson26 reviewed Mar 6, 2025

tests/imputers/test_ArbitraryImputer.py Outdated

tubular/imputers.py Outdated

tubular/imputers.py Outdated

tubular/imputers.py Outdated

davidhopkinson26 reviewed Mar 6, 2025

tubular/imputers.py Outdated

tubular/imputers.py Outdated

alphanumericmale added 2 commits March 9, 2025 23:41

downcast df parametrized

2d10a54

mostly converted - nan to string conversion issue

ab09ade

Boluwatife28 added 2 commits March 14, 2025 15:45

Used narwhals logic Arb imp

650a73f

changes to test_ArbitraryImputer

f28ef50

liamholmes31 reviewed Mar 14, 2025

tubular/imputers.py Outdated

liamholmes31 reviewed Mar 14, 2025

tubular/imputers.py Outdated

liamholmes31 reviewed Mar 14, 2025

tubular/imputers.py Outdated

Boluwatife28 and others added 4 commits March 16, 2025 23:35

narwhal logic added

e02f5ab

Expected outcome test for new cat level added

ec10aa2

narwhalifying arbitrary imputer

01a1ef1

finished narwhalifying arb imputer

7f9a284

liamholmes31 marked this pull request as ready for review March 18, 2025 10:17

updated narwhals due to backcompatibility errors in CI

abc7c1d

liamholmes31 added 2 commits March 19, 2025 14:46

Merge branch 'main' into feature/narwhalify_arb_imp

b1d1f91

added falsey test to arb imputer

24fac90

cjmwills self-assigned this Mar 20, 2025

cjmwills self-requested a review March 20, 2025 16:59

cjmwills reviewed Mar 25, 2025

liamholmes31 added 3 commits March 25, 2025 13:53

fixed type in arb imputer cat handling

cbb9bdf

removed extra from_native call in arb imputer

2b77049

removed if not polars compatible statements in base imputer tests

0d3c4ac

improved ArbImputer type handling

d08906a

cjmwills reviewed Mar 27, 2025

alphanumericmale commented Mar 6, 2025

liamholmes31 commented Mar 13, 2025

liamholmes31 commented Mar 18, 2025

cjmwills Mar 20, 2025

liamholmes31 Mar 25, 2025

cjmwills Mar 26, 2025

cjmwills Mar 27, 2025

cjmwills Mar 27, 2025
