-
Notifications
You must be signed in to change notification settings - Fork 655
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FEAT-#5936: support pandas 2.0.2 #5995
Merged
Merged
Changes from 184 commits
Commits
Show all changes
190 commits
Select commit
Hold shift + click to select a range
6887fb0
FEAT-#5936: support pandas 2.0.0
anmyachev c0c9cf2
remove Int64Index, UInt64Index, Float64Index
anmyachev 93b7f66
remove pandas.datetime
anmyachev eca0422
remove convert_to_index_sliceable
anmyachev 929edcc
remove 'ensure_clean_dir'
anmyachev 025d14a
Merge branch 'master' of https://github.com/modin-project/modin into …
anmyachev 417bf0c
disable 'xarray' until it adds support for pandas 2.0.0
anmyachev b9a83b5
remove 'mad' [2]
anmyachev af9b32e
add 'dtype_backend' parameter for all functions and remove is_monotonic
anmyachev 5626de0
remove 'tshift' and 'iteritems'
anmyachev 9ee440c
remove 'slice_shift'
anmyachev 3cb0007
remove 'append'
anmyachev 2b0fbfc
add new parameter: 'axis' for 'add_suffix', 'add_prefix'
anmyachev 510b907
use copy=None instead of copy=True
anmyachev 3cddf29
remove deprecated parameters in 'between_time'
anmyachev bc5e97f
update 'drop_duplicates'
anmyachev 0d5672e
update 'dropna'
anmyachev f586ad1
update 'factorize'
anmyachev be041e8
update 'groupby'
anmyachev bdead97
update 'kurt'
anmyachev 78217ef
update 'mask'
anmyachev f0af110
update 'mean' and 'max'
anmyachev 7daad2d
update 'min', 'median'
anmyachev 1a37565
update 'resample', 'rank'
anmyachev 3b40481
update 'sem', 'skew'
anmyachev 79c4b63
update 'var', 'sum', 'std'
anmyachev 24f6f5c
update 'to_json'
anmyachev db67661
update 'all', 'any'
anmyachev 886a7cf
update 'describe', 'clip', 'count'
anmyachev 2cc84a8
update 'resample', 'expanding'
anmyachev 2bc36c9
update 'take', 'to_excel', 'to_latex'
anmyachev 9814acd
update 'where'
anmyachev 9ba3042
remove 'pad', 'backfill'
anmyachev fb0693c
add 'dt.unit', 'dt.as_unit'
anmyachev efe8dec
remove 'lookup'
anmyachev 6a024fb
update 'corr', 'corrwith', 'cov'
anmyachev dd931ca
update 'merge', 'pivot'
anmyachev 80eede9
update 'to_dict', 'quantile'
anmyachev b277c58
update 'info'
anmyachev f285848
update 'read_sql'
anmyachev 7a272b7
update top level functions
anmyachev 61d3180
fix
anmyachev bdb3bfe
Merge branch 'master' into pandas-2.0.0
anmyachev 4c5ea4b
remove 'TimeAppend' benchmark for HDK
anmyachev 0ae862e
update 'rename_axis'
anmyachev 28d6440
update 'drop'
anmyachev 4344caa
remove 'resample_pad'
anmyachev 172a1d8
disable 'exercise_3' notebook
anmyachev 739331c
remove 'warn_bad_lines', 'error_bad_lines' for hdk and tests
anmyachev d752d59
fix TestCsv tests
anmyachev 27dae06
fix some parquet cases
anmyachev e13a13d
fix 'drop' test; remove 'Series__array_wrap__'
anmyachev 498e5eb
fix 'test_get'
anmyachev bbbf82e
fix some docs
anmyachev b300345
fix 'test_internals.py'
anmyachev e4ed3f8
fix 'time_drop'
anmyachev 24174aa
fix 'test_binary.py'
anmyachev cc78e9e
remove 'lookup' op, remove 'base' param for resample, remove 'null_co…
anmyachev 23e644d
remove 'inplace' parameter for 'as_ordered', 'as_unordered', 'reorder…
anmyachev 8e6fa4f
more fixes for 'test_groupby.py'
anmyachev 103ca00
Disallow passing non-keyword arguments to 'interpolate'
anmyachev eccb018
fix for 'test_indexing'
anmyachev bbdf297
fix 'test_loc_series'
anmyachev 7932315
fixes for 'test_reduce.py'; affects usage of 'level', 'numeric_only' …
anmyachev 5b47230
fix for 'test_udf.py'
anmyachev ff9821c
remove 'dt.week', 'dt.weekofyear'
anmyachev 98f99cf
fix 'Series.describe' and 'test_between_time'
anmyachev 5f3279d
remove 'inplace' parameter for some 'cat.' functions
anmyachev 48ef8b3
update 'str.split' and 'str.rsplit' methods
anmyachev 26c50bf
remove '.week' and 'weekofyear' in tests; remove workarounds for #3142
anmyachev 46ccf5b
small fixes for 'test_series.py'
anmyachev 7a39531
fix 'test_series_dt_api_equality'
anmyachev 9578709
remove 'squeeze' field for groupby objects
anmyachev 5b0295c
fixes for gropby.skew/__iter__; use 'eval_general' when op doesn't su…
anmyachev 8127bbc
update 'std' and 'var' signatures; use 'numeric_only=True' explicitly…
anmyachev aaa4c58
add 'numeric_only=False' for cumsum/cummax/cummin/cumprod
anmyachev 0ba1a50
'skew' fixes; add 'numeric_only=True' for cummax/cummin/cumprod/cumsu…
anmyachev fc4e157
add fix for 'test_general.py'
anmyachev c56acec
remove 'inplace' parameter for 'set_axis' (leftovers)
anmyachev 2abbf1f
add 'dtype_backend' parameter for 'read_sql_table' base imple
anmyachev d37b614
Merge branch 'master' into pandas-2.0.0
anmyachev 22eddf0
fix 'test_skew_corner_cases' on Dask engine
anmyachev c8e4190
fixes for 'test_groupby.py'
anmyachev 7dcd7a2
fixes for 'test_groupby.py'
anmyachev 024b626
fix 'test_to_dense'
anmyachev 6d88c92
fix 'test_read_spss'
anmyachev eeea136
fix 'test_read_orc'
anmyachev 357aa03
Merge branch 'master' of https://github.com/modin-project/modin into …
anmyachev 1ac3c86
Fix df.mean(numeric_only=True)
vnlitvinov d5a67c4
Properly validate kwargs for stat functions
vnlitvinov 58dd2d7
Improve tests for udf, stop checking df stat funcs defaulting to pandas
vnlitvinov 6184f47
fixup! Properly validate kwargs for stat functions
vnlitvinov 0c1cd44
Fix black formatting
vnlitvinov 09d7d5f
fix flake8
anmyachev 59b804a
Merge branch 'master' of https://github.com/modin-project/modin into …
anmyachev 0ddb2c1
Uncomment all tests to see the status
vnlitvinov 4f112fd
Merge branch 'master' of https://github.com/modin-project/modin into …
anmyachev 7639004
Merge branch 'pandas-2.0.0' of https://github.com/anmyachev/modin int…
anmyachev 6702b3a
fix after merge
anmyachev 548d182
fixes
anmyachev 872fa58
fixes
anmyachev f941c5d
fixes
anmyachev 0c92b9e
fix expanding
anmyachev 52c0688
use pandas==2.0.1
anmyachev 916aed3
xarray still does not work with pandas>=2.0.0
anmyachev b87a421
fix describe
anmyachev b53747a
use 'format=mixed' for hdk tests
anmyachev 44b5951
fix describe
anmyachev 63cf78b
fixes for groupby
anmyachev c6c4bc6
Merge branch 'master' of https://github.com/modin-project/modin into …
anmyachev 4d08bc0
groupby fixes
anmyachev e960efc
groupby fixes
anmyachev 63fc347
fix 'test_read_csv_error_handling'
anmyachev a74c66f
temp skip 'test_read_csv_error_handling'
anmyachev 74fe647
fix Series.value_counts
anmyachev 8d82f54
read_csv_glob fix
anmyachev 728bfcb
fix for dt.year/manth/day dtype; fix concat
anmyachev 3b476d1
fix read_html
anmyachev 0b9762c
add comment for xarray's tests
anmyachev b9be4bb
Merge branch 'master' of https://github.com/modin-project/modin into …
anmyachev 9636b10
fix
anmyachev be6e750
fix
anmyachev 005b386
Apply suggestions from code review
anmyachev 139f694
change 'fill_method' value for 'pct_change'
anmyachev 1c1f7b0
Merge branch 'master' of https://github.com/modin-project/modin into …
anmyachev 9baa5c3
fixes for 'read_sql'
anmyachev 9a6cba2
fixes
anmyachev 85fd9c4
use pandas==2.0.2
anmyachev ac16843
fix 'infer_objects'
anmyachev 0f03510
remove 'line_terminator'
anmyachev adfbf4c
skip 'test_fillna_sanity' for hdk
anmyachev 70228cd
skip 'test_read_csv_error_handling' again
anmyachev 6ce3231
fix experimental 'read_sql'
anmyachev 7457882
Merge branch 'master' of https://github.com/modin-project/modin into …
anmyachev c647abe
remove '_AXIS_ORDERS', '_AXIS_LEN'
anmyachev b6a1f5d
remove debug stuff
anmyachev 00b9f24
fix 'add_perfix', 'add_suffix'
anmyachev 277070a
Update modin/pandas/series.py
anmyachev 81305f8
fixes for 'skew'
anmyachev b286373
fix 'add_prefix', 'add_suffix' for BaseOnPython
anmyachev d09c4d5
fix 'read_parquet'; add test for 'dtype_backend' param
anmyachev 1243b4a
fix 'read_parquet'
anmyachev 981ab28
add 'dtype_backend' test for 'read_csv'
anmyachev e0b9cc6
add test for 'read_fwf' and 'read_excel'
anmyachev 807d7f8
keyword only parameters for 'read_excel'
anmyachev 7aac716
test 'dtype_backend' for 'read_json'
anmyachev 9d2e857
test 'dtype_backend' for 'read_sql'
anmyachev 4dfc07f
test 'dtype_backend' for 'read_feather'
anmyachev c6ef04b
test 'dtype_backend' for 'convert_dtypes'
anmyachev 030c09c
fix 'test_read_sql_dtype_backend'
anmyachev 77e61b0
Merge branch 'master' of https://github.com/modin-project/modin into …
anmyachev 89f5b22
add tests for 'dt.unit', 'dt.as_unit'
anmyachev 5af97ba
fix 'test_read_spss'
anmyachev 21ba8d9
add 'test_to_xarray_mock'
anmyachev b9b25b2
add test cases for 'pivot'
anmyachev a6210c6
revert some changes
anmyachev 78ac576
skip new 'pivot' test cases on 'BaseOnPython'
anmyachev a8c0823
skip also for 'hdk'
anmyachev 99271ff
revert some changes
anmyachev 0fddb6a
fixes
anmyachev 7edd9bd
revert some changes
anmyachev c124765
address review comments
anmyachev 0c922ab
address review comments[2]
anmyachev 2f7fd7c
change 'collections' imports
anmyachev f3835b6
use 'os.PathLike'
anmyachev 997bea8
fix pandas version for pip
anmyachev 353bdc5
try to fix mypy
anmyachev 761192d
forgotten 'pyarrow' pin
anmyachev 3fba35f
xfail hdk tests
anmyachev 2be7d42
remove 'MD01'
anmyachev a08f152
use 'Optional'
anmyachev 594b3e6
update 'to_dict'
anmyachev c0fd0cc
update 'set_axis' calls in dataframe.__init__
anmyachev dc7e115
Update modin/pandas/test/test_groupby.py
anmyachev 6b6802c
use False value for 'return_tuple_when_iterating'
anmyachev 53a399b
update 'pivot'
anmyachev 0944041
update type hints for 'copy' parameter
anmyachev 9cfd7a1
fixes
anmyachev be3c148
remove unused code
anmyachev d24d68c
update comment for 'test_groupby_api_equality'
anmyachev 2dc03a6
add 'FIXME'
anmyachev e6c5ec9
changes in 'test_groupby.py'
anmyachev 276b383
update comments
anmyachev 6d7e8d2
try to avoid old logic for processing 'numeric_only' param
anmyachev 8121867
address review comments
anmyachev c068d61
Merge branch 'master' of https://github.com/modin-project/modin into …
anmyachev 4222292
try to exclude 'datetime' type for hdk tests
anmyachev 7b8d207
Update modin/pandas/series.py
anmyachev c1e7cb8
fix
anmyachev 3e7e55a
Merge branch 'master' into pandas-2.0.0
anmyachev File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -276,23 +276,6 @@ def time_concat(self, shapes, how, axis, ignore_index): | |
) | ||
|
||
|
||
class TimeAppend: | ||
param_names = ["shapes", "sort"] | ||
params = [ | ||
get_benchmark_shapes("TimeAppend"), | ||
[False, True], | ||
] | ||
|
||
def setup(self, shapes, sort): | ||
self.df1 = generate_dataframe("int", *shapes[0], RAND_LOW, RAND_HIGH) | ||
self.df2 = generate_dataframe("int", *shapes[1], RAND_LOW, RAND_HIGH) | ||
if sort: | ||
self.df1.columns = self.df1.columns[::-1] | ||
|
||
def time_append(self, shapes, sort): | ||
execute(self.df1.append(self.df2, sort=sort)) | ||
|
||
|
||
class TimeBinaryOp: | ||
param_names = ["shapes", "binary_op", "axis"] | ||
params = [ | ||
|
@@ -488,7 +471,7 @@ def setup(self, shape, axis, drop_ncols): | |
self.labels = self.df.axes[axis][:drop_count] | ||
|
||
def time_drop(self, shape, axis, drop_ncols): | ||
execute(self.df.drop(self.labels, axis)) | ||
execute(self.df.drop(self.labels, axis=axis)) | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
|
||
|
||
|
||
class TimeHead: | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This operation has been removed in pandas 2.0