AttributeError: 'IntBlock'/'FloatBlock'/etc. object has no attribute 'sp_index' #17198

Bthompso8784 · 2017-08-08T12:56:39Z

Code Sample, a copy-pastable example if possible

import pandas as pd
# AttributeError: 'IntBlock' object has no attribute 'sp_index'
s = pd.SparseSeries([1, 2])
print(s.where(s >= 2, 0))
# AttributeError: 'FloatBlock' object has no attribute 'sp_index'
df = pd.SparseDataFrame([[1, 2], [3, 4]])
print(df.quantile())

Problem description

On the surface, SparseSeries and SparseDataFrame should produce the same results as Series and DataFrame, respectively.

Expected Output

0 0
1 2
dtype: int64
0 2.0
1 3.0
Name: 0.5, dtype: float64

Output of `pd.show_versions()`

INSTALLED VERSIONS

commit: None
python: 3.6.1.final.0
python-bits: 64
OS: Windows
OS-release: 10
machine: AMD64
processor: Intel64 Family 6 Model 94 Stepping 3, GenuineIntel
byteorder: little
LC_ALL: None
LANG: en
LOCALE: None.None

pandas: 0.20.1
pytest: 3.0.7
pip: 9.0.1
setuptools: 36.0.1
Cython: 0.25.2
numpy: 1.12.1
scipy: 0.19.0
xarray: None
IPython: 5.3.0
sphinx: 1.6.3
patsy: 0.4.1
dateutil: 2.6.1
pytz: 2017.2
blosc: None
bottleneck: 1.2.1
tables: 3.2.2
numexpr: 2.6.2
feather: None
matplotlib: 2.0.2
openpyxl: 2.4.7
xlrd: 1.0.0
xlwt: 1.2.0
xlsxwriter: 0.9.6
lxml: 3.7.3
bs4: 4.6.0
html5lib: 0.999
sqlalchemy: 1.1.9
pymysql: None
psycopg2: None
jinja2: 2.9.6
s3fs: None
pandas_gbq: None
pandas_datareader: None

The text was updated successfully, but these errors were encountered:

jorisvandenbossche · 2017-08-08T13:47:05Z

@Bthompso8784 Thanks for the report.

For the first case (s.where(s >= 2, 0)) I actually get a segfault (core dumped) on master.
For the second case (df.quantile()), it is seems to be the repr that produces this error, because df.quantile() seems to work only partially (it creates an invalid SparseSeries). Not sure if it should actually return a SparseSeries rather than a plain Series (which is what eg df.mean() is doing).

kernc · 2017-11-14T13:30:02Z

Can't reproduce this on 0.22.0.dev0+116, Python 3.5. The output is as expected.

jreback · 2017-11-14T13:37:27Z

@kernc can you do a PR with validation tests?

@Licht-T in addition to you have been improving sparse lately

jorisvandenbossche · 2017-11-14T14:37:12Z

The where case still doesn't work for me on master (I currently get a RecursionError, while got a segfault in august).
But the df.quantile() seems to work now.

kernc · 2017-11-14T14:42:40Z

Ah, right, can confirm RecursionError. I was on a branch where I appear to have inadvertently fixed it.

ColinSuess · 2018-08-15T04:22:20Z

Confirmed both issues repro in Pandas 0.23.4 with Python 3.7 amd64 on Win7SP1x64.

mroeschke · 2019-10-13T00:11:13Z

This looks to work on master with SparseDtype. Could use a test:

In [8]: s = pd.Series(pd.SparseArray([1, 2]))

In [9]: s
Out[9]:
0    1
1    2
dtype: Sparse[int64, 0]

In [10]: s.where(s >= 2, 0)
Out[10]:
0    0
1    2
dtype: Sparse[int64, 0]

In [11]: s1 = pd.Series(pd.SparseArray([3, 4]))

In [12]: df = pd.DataFrame({0: s, 1: s1})

In [13]: df
Out[13]:
   0  1
0  1  3
1  2  4

In [14]: df.dtypes
Out[14]:
0    Sparse[int64, 0]
1    Sparse[int64, 0]
dtype: object

In [15]: df.quantile()
Out[15]:
0    1.5
1    3.5
Name: 0.5, dtype: float64

In [16]: pd.__version__
Out[16]: '0.26.0.dev0+555.gf7d162b18'

jorisvandenbossche added Bug Sparse Sparse Data Type labels Aug 8, 2017

jorisvandenbossche added this to the Next Major Release milestone Aug 8, 2017

Licht-T mentioned this issue Aug 31, 2017

BUG: Fix wrong SparseBlock initialization #17386

Closed

4 tasks

jreback modified the milestones: Next Major Release, 0.21.1 Nov 14, 2017

jreback added Needs Backport and removed Needs Backport labels Nov 14, 2017

jorisvandenbossche modified the milestones: 0.21.1, 0.22.0 Nov 14, 2017

kernc mentioned this issue Nov 14, 2017

ENH: SparseDataFrame/SparseSeries value assignment #17785

Closed

4 tasks

jreback modified the milestones: 0.23.0, Next Major Release Apr 14, 2018

mroeschke added good first issue Needs Tests Unit test(s) needed to prevent regressions and removed Bug Sparse Sparse Data Type labels Oct 13, 2019

jbrockmendel mentioned this issue Dec 12, 2019

TST: tests for needs-test issues #30222

Merged

11 tasks

jreback modified the milestones: Contributions Welcome, 1.0 Dec 12, 2019

WillAyd closed this as completed in #30222 Dec 12, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AttributeError: 'IntBlock'/'FloatBlock'/etc. object has no attribute 'sp_index' #17198

AttributeError: 'IntBlock'/'FloatBlock'/etc. object has no attribute 'sp_index' #17198

Bthompso8784 commented Aug 8, 2017

INSTALLED VERSIONS

jorisvandenbossche commented Aug 8, 2017 •

edited

Loading

kernc commented Nov 14, 2017

jreback commented Nov 14, 2017

jorisvandenbossche commented Nov 14, 2017

kernc commented Nov 14, 2017

ColinSuess commented Aug 15, 2018

mroeschke commented Oct 13, 2019

AttributeError: 'IntBlock'/'FloatBlock'/etc. object has no attribute 'sp_index' #17198

AttributeError: 'IntBlock'/'FloatBlock'/etc. object has no attribute 'sp_index' #17198

Comments

Bthompso8784 commented Aug 8, 2017

Code Sample, a copy-pastable example if possible

Problem description

Expected Output

Output of pd.show_versions()

INSTALLED VERSIONS

jorisvandenbossche commented Aug 8, 2017 • edited Loading

kernc commented Nov 14, 2017

jreback commented Nov 14, 2017

jorisvandenbossche commented Nov 14, 2017

kernc commented Nov 14, 2017

ColinSuess commented Aug 15, 2018

mroeschke commented Oct 13, 2019

Output of `pd.show_versions()`

jorisvandenbossche commented Aug 8, 2017 •

edited

Loading