Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Object data types are no longer inferred from data frames #3177

Closed
joelostblom opened this issue Aug 29, 2023 · 4 comments · Fixed by #3179
Closed

Object data types are no longer inferred from data frames #3177

joelostblom opened this issue Aug 29, 2023 · 4 comments · Fixed by #3179
Assignees
Labels

Comments

@joelostblom
Copy link
Contributor

Support for inferring the data type from object columns in data frames used to work, but is not working anymore:

import altair as alt
import pandas as pd

alt.Chart(
    pd.DataFrame({'not_ints': pd.Series([1, 2, 3], dtype='object')})
).mark_circle().encode(x='not_ints')
NotImplementedError: Non-string object dtypes are not supported yet

@jonmmease I think this is a regression from #3114 that added support for the dataframe interchange as it used to work fine in Altair 5 (see screenshot below). Do you think it is possible to add a workaround similar to what you have done for datetime data until there is official pyarrow support?

Altair 5.0:
image

@jonmmease
Copy link
Contributor

Yeah, this is from #3114. I don't think the workaround datetime workaround is going to work here because the error message doesn't include enough information to determine the correct type to infer.

I think we could catch NotImplementedErrors exceptions raised from this line:

https://github.com/altair-viz/altair/blob/17e8d07fee3c3d891c9667a14a2a6b3df3d3b581/altair/utils/core.py#L589

And then try again with the pandas implementation (if the datarame is a pandas DataFrame).

I'll take this. Do you think warrants an immediate 5.1.1?

@jonmmease jonmmease self-assigned this Aug 29, 2023
@mattijn
Copy link
Contributor

mattijn commented Aug 29, 2023

Or maybe using the infer_objects() function?
I can imagine doing a 5.1.1 for this

@joelostblom
Copy link
Contributor Author

I'll take this. Do you think warrants an immediate 5.1.1?

Great, thank you! I think I quick 5.1.1 would be helpful if you both have time for that, but not critical if time is tight. I agree with @mattijn that infer_objects could be a good option for just restoring the regressed behavior (pandas df support). In terms of also supporting inference in other dataframes, I guess we can either wait for it to be added in pyarrow and be content with raising an error for now (lowest effort and totally ok with me) or try to infer ourselves by inspecting the objects (maybe with a logic inspired from pd.inferobject? Could be brittle and higher maintenance though). What do you think?

mayagbarnes added a commit to streamlit/streamlit that referenced this issue Aug 29, 2023
Altair v5.1.0 includes a regression on inferring data types (see vega/altair#3177 for more details). To unblock, excluding this version but will revisit based on whether patch released.
@jonmmease
Copy link
Contributor

I'll get this fixed shortly. I'll tag you for review @mattijn, and then you can make the 5.1.1 release at your convenience.

eric-skydio pushed a commit to eric-skydio/streamlit that referenced this issue Dec 20, 2023
Altair v5.1.0 includes a regression on inferring data types (see vega/altair#3177 for more details). To unblock, excluding this version but will revisit based on whether patch released.
zyxue pushed a commit to zyxue/streamlit that referenced this issue Mar 22, 2024
Altair v5.1.0 includes a regression on inferring data types (see vega/altair#3177 for more details). To unblock, excluding this version but will revisit based on whether patch released.
zyxue pushed a commit to zyxue/streamlit that referenced this issue Apr 16, 2024
Altair v5.1.0 includes a regression on inferring data types (see vega/altair#3177 for more details). To unblock, excluding this version but will revisit based on whether patch released.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants