BUG/API: make setitem-inplace preserve dtype when possible with PandasArray, IntegerArray, FloatingArray #39044

jbrockmendel · 2021-01-08T22:30:49Z

closes #xxxx
tests added / passed
Ensure all linting tests pass, see here for how to run them
whatsnew entry

xref #38896 (doesn't close). In general, df.loc[:, "A"] = foo tries to operate in-place before falling back to casting. This makes sure we do that when foo is a PandasArray, IntegerArray, or FloatingArray.

I guess we could/should do the same for BooleanArray.

pandas/core/dtypes/missing.py

jreback

will really need to look closely. this is adding non-trivial code.

pandas/core/dtypes/missing.py

pandas/core/internals/blocks.py

pandas/tests/extension/test_floating.py

jreback · 2021-01-16T01:36:46Z

pandas/tests/extension/test_numpy.py

@@ -28,6 +32,31 @@ def dtype(request):
    return PandasDtype(np.dtype(request.param))


+orig_setitem = pd.core.internals.Block.setitem


can you use monkeypatch instead?

this does use monkeypatch. the monkeypatched method calls the original method
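
The pattern described in this reply (patch a method, but have the patched version call the saved original) can be sketched as follows. This is only an illustration under stated assumptions: `Block` here is a hypothetical stand-in class, not pandas' internal `Block`, and the standard library's `unittest.mock.patch.object` is used in place of pytest's `monkeypatch` fixture so the sketch runs on its own.

```python
from unittest import mock


class Block:
    """Hypothetical stand-in for an internal block class (not pandas' Block)."""

    def __init__(self, values):
        self.values = list(values)

    def setitem(self, indexer, value):
        self.values[indexer] = value
        return self


# Keep a reference to the original method before patching,
# mirroring `orig_setitem = ...Block.setitem` in the diff above.
orig_setitem = Block.setitem

calls = []


def patched_setitem(self, indexer, value):
    # Record the call, then delegate to the original implementation.
    calls.append((indexer, value))
    return orig_setitem(self, indexer, value)


blk = Block([1, 2, 3])

# The patch is active only inside the `with` block and is undone on exit.
with mock.patch.object(Block, "setitem", patched_setitem):
    blk.setitem(0, 10)  # goes through the wrapper

blk.setitem(1, 20)  # original method runs directly; nothing is recorded
```

With pytest, the equivalent step would be `monkeypatch.setattr(Block, "setitem", patched_setitem)` inside a test function, which is likewise undone automatically when the test finishes.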

pandas/core/dtypes/missing.py

jbrockmendel · 2021-01-25T16:52:36Z

rebased + green

pandas/core/dtypes/missing.py

jbrockmendel · 2021-01-29T03:33:55Z

ok, maybe if you can add a docstring to the function it will be more obvious that we are trying to match the len here

docstring updated + green

pandas/tests/extension/test_floating.py

jorisvandenbossche

Can you summarize which behaviours are changed?
Eg from the description I would suppose that

df = pd.DataFrame({'a': [1, 2, 3]})
df.loc[:, 'a'] = pd.array([3, 4, 5])

changed behaviour to preserve the original dtype, but I don't see that directly tested?

Does this need a whatsnew?
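
One way to check the case raised in this comment directly is to compare the column dtype before and after the assignment. The snippet below is a minimal sketch built on the two lines quoted above; it is not a test from this PR, and the variable names `before` and `after` are illustrative. The result depends on the pandas version in use, so no outcome is asserted.

```python
import pandas as pd

df = pd.DataFrame({"a": [1, 2, 3]})
before = df["a"].dtype

# Assign a masked-integer array through a full-column .loc setitem.
df.loc[:, "a"] = pd.array([3, 4, 5])
after = df["a"].dtype

# Whether `after` matches `before` depends on the pandas version in use;
# this sketch only reports the two dtypes and does not assume either outcome.
print(f"before: {before}, after: {after}")
```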

jorisvandenbossche · 2021-01-29T18:45:07Z

pandas/tests/extension/test_integer.py

@@ -193,7 +197,20 @@ class TestGetitem(base.BaseGetitemTests):


 class TestSetitem(base.BaseSetitemTests):
-    pass
+    def test_setitem_series(self, data, full_indexer):


Can you indicate here why it is overriding the base class?

comment added

jorisvandenbossche · 2021-01-29T18:47:54Z

pandas/tests/extension/test_integer.py

+        if not data._mask.any():
+            # GH#38896 like we do with ndarray, we set the values inplace
+            #  but cast to the new numpy dtype
+            expected = pd.Series(data.to_numpy(data.dtype.numpy_dtype), name="data")


Why are we converting to the numpy dtype here? That's also not the original dtype?
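
For context on this question, the sketch below shows the distinction between the masked extension dtype and the NumPy dtype that the test converts to. It uses the public `isna()` in place of the private `_mask` attribute from the diff, and the names `numpy_dtype` and `converted` are illustrative, not taken from the PR.

```python
import numpy as np
import pandas as pd

data = pd.array([1, 2, 3], dtype="Int64")  # masked integer array, no missing values

# Public equivalent of the `not data._mask.any()` check in the diff.
has_missing = bool(data.isna().any())

# The plain NumPy dtype that backs the masked dtype.
numpy_dtype = data.dtype.numpy_dtype

# Same construction as the `expected` Series in the diff.
converted = pd.Series(data.to_numpy(numpy_dtype), name="data")

print(data.dtype)       # masked extension dtype
print(converted.dtype)  # plain NumPy dtype
```

The two printed dtypes differ only in kind (extension versus NumPy), which is the point of the question: the expected result matches neither the dtype of the array being set nor, necessarily, the dtype the column started with.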

jreback · 2021-02-07T17:35:40Z

am ok with this, @jorisvandenbossche if any addl comments.

jorisvandenbossche · 2021-02-07T22:14:18Z

There are some questions in my last review above for which I was still waiting on an answer

jbrockmendel · 2021-02-07T23:51:41Z

let's stick a pin in this until #39163 goes in; it will be easier to address outstanding questions/comments at that point

jbrockmendel · 2021-02-18T00:15:28Z

mothballing until after #39163

BUG/API: make setitem-inplace preserve dtype when possible with Panda…

6fffb02

…sArray, IntegerArray, FloatingArray

jbrockmendel commented Jan 8, 2021

pandas/core/dtypes/missing.py (outdated, resolved)

jreback requested changes Jan 9, 2021

pandas/core/dtypes/missing.py (outdated, resolved)

pandas/core/dtypes/missing.py (resolved)

jbrockmendel added 5 commits January 12, 2021 12:58

Merge branch 'master' of https://github.com/pandas-dev/pandas into bu…

160f3f7

…g-38896

do patching inside test_numpy

de10708

Merge branch 'master' of https://github.com/pandas-dev/pandas into bu…

ba98a99

…g-38896

Merge branch 'master' of https://github.com/pandas-dev/pandas into bu…

84261a7

…g-38896

move kludge to test_numpy

284f36a

jreback added the Dtype Conversions, ExtensionArray, and Indexing labels Jan 16, 2021

jreback requested changes Jan 16, 2021

jbrockmendel added 2 commits January 15, 2021 18:05

Merge branch 'master' of https://github.com/pandas-dev/pandas into bu…

2639b5c

…g-38896

staticmethod -> function

b2aa366

This was referenced Jan 19, 2021

API: setitem copy/view behavior ndarray vs Categorical vs other EA #38896

Closed

WIP/REF: BlockManager.setitem_blockwise #39302

Closed

BUG: setting dt64 values into Series[int] incorrectly casting dt64->int #39266

Merged

jreback requested changes Jan 20, 2021

pandas/core/dtypes/missing.py (resolved)

jbrockmendel added 3 commits January 20, 2021 15:54

typo fixup+test

7aeb2b5

Merge branch 'master' into bug-38896

715a602

Merge branch 'master' of https://github.com/pandas-dev/pandas into bu…

b55155b

…g-38896

jbrockmendel added 2 commits January 25, 2021 10:56

Merge branch 'master' of https://github.com/pandas-dev/pandas into bu…

c72e566

…g-38896

Merge branch 'master' of https://github.com/pandas-dev/pandas into bu…

0b9f343

…g-38896

jreback requested changes Jan 28, 2021

pandas/core/dtypes/missing.py (resolved)

jreback added this to the 1.3 milestone Jan 28, 2021

jbrockmendel added 3 commits January 27, 2021 19:45

Merge branch 'master' of https://github.com/pandas-dev/pandas into bu…

cd6adbe

…g-38896

Merge branch 'master' of https://github.com/pandas-dev/pandas into bu…

071ab1b

…g-38896

docstring

6ab44af

jorisvandenbossche reviewed Jan 29, 2021

pandas/tests/extension/test_floating.py (outdated, resolved)

jorisvandenbossche requested changes Jan 29, 2021

jbrockmendel added 5 commits January 29, 2021 15:05

Merge branch 'master' of https://github.com/pandas-dev/pandas into bu…

daacff8

…g-38896

comment, revert floating

450bf73

Merge branch 'master' into bug-38896

dba7c11

Merge branch 'master' into bug-38896

9258cbb

Merge branch 'master' of https://github.com/pandas-dev/pandas into bu…

88309ab

…g-38896

jreback approved these changes Feb 7, 2021

Merge branch 'master' into bug-38896

1847209

Merge branch 'master' into bug-38896

6b8cc31

jbrockmendel closed this Feb 18, 2021

jbrockmendel added the Mothballed label Feb 18, 2021

jbrockmendel removed the Mothballed label Jul 16, 2021

jbrockmendel deleted the bug-38896 branch July 16, 2021 16:08
