[data] support for read_sql to handle tasks concurrently. #49424
9507221 to 4126c25
Signed-off-by: jukejian <[email protected]>
67d65fd to ef0feee
@bveeramani @richardliaw can you help me review this PR?
Hi, I will take this, but I am currently traveling. I will get back to you after 1/10. Thanks!
Got it, thanks. Wish you a pleasant holiday.
Signed-off-by: Richard Liaw <[email protected]>
Hi, I created a PR on your fork to discuss some changes: https://github.com/Jay-ju/ray/pull/1/files
Signed-off-by: Richard Liaw <[email protected]>
Cleanup `fix_read_sql_parallelism`
Signed-off-by: Richard Liaw <[email protected]>
Signed-off-by: jukejian <[email protected]>
Signed-off-by: Richard Liaw <[email protected]>
e4b140f to 04d742f
… fix_read_sql_parallelism
python/ray/data/read_api.py (outdated)

    if len(read_tasks) < parallelism:
        parallelism = len(read_tasks)

What happens if we don't add this? I think it might be causing some tests to fail:
[2025-01-28T19:29:11Z] FAILED python/ray/data/tests/test_splitblocks.py::test_small_file_split - ass...
[2025-01-28T19:29:11Z] FAILED python/ray/data/tests/test_splitblocks.py::test_large_file_additional_split
Oh, good catch. Let me try.
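For readers following the thread, here is a minimal standalone sketch of what the guard under review does: it caps the requested parallelism at the number of read tasks. The helper name `cap_parallelism` and the placeholder task list are hypothetical and only serve the illustration; this is not the code in `read_api.py`.

```python
def cap_parallelism(parallelism: int, read_tasks: list) -> int:
    """Return parallelism, lowered to len(read_tasks) when there are fewer tasks."""
    # Same condition as the snippet under review, wrapped in a hypothetical helper.
    if len(read_tasks) < parallelism:
        parallelism = len(read_tasks)
    return parallelism


# Placeholder read tasks; real read tasks would be datasource objects.
read_tasks = ["task-0", "task-1", "task-2"]

capped = cap_parallelism(8, read_tasks)     # more parallelism requested than tasks
unchanged = cap_parallelism(2, read_tasks)  # fewer requested than tasks
```

With the guard in place, a request for more parallelism than there are read tasks is silently lowered, which is the behavior the question above asks about removing.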
Signed-off-by: Richard Liaw <[email protected]>
…t#49424)

## Related issue number

Parallel SQL reads support by using MOD/CAT/Custom hashes.

Closes ray-project#49206

## Checks

- [ ] I've signed off every commit (by using the -s flag, i.e., `git commit -s`) in this PR.
- [ ] I've run `scripts/format.sh` to lint the changes in this PR.
- [ ] I've included any doc changes needed for https://docs.ray.io/en/master/.
- [ ] I've added any new APIs to the API Reference. For example, if I added a method in Tune, I've added it in `doc/source/tune/api/` under the corresponding `.rst` file.
- [ ] I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
  - [ ] Unit tests
  - [ ] Release tests
  - [ ] This PR is not tested :(

---------

Signed-off-by: jukejian <[email protected]>
Signed-off-by: Richard Liaw <[email protected]>
Co-authored-by: Richard Liaw <[email protected]>
Why are these changes needed?
For more details, see the discussion in the corresponding issue, #49206.
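As a rough illustration of the MOD-based sharding idea named in the commit message, the sketch below splits one SQL query into several disjoint queries that could each be handled by a separate read task. It is not the implementation in this PR: the helper `build_shard_queries`, the `movie` table, and the use of an in-memory SQLite database are all assumptions made for the example, and it assumes a non-negative integer shard key.

```python
import sqlite3


def build_shard_queries(base_sql: str, shard_key: str, num_shards: int) -> list[str]:
    """Wrap base_sql so that query i returns only rows with shard_key % num_shards == i.

    Hypothetical helper; assumes shard_key is a non-negative integer column.
    """
    if num_shards < 1:
        raise ValueError("num_shards must be >= 1")
    return [
        f"SELECT * FROM ({base_sql}) AS t WHERE t.{shard_key} % {num_shards} = {i}"
        for i in range(num_shards)
    ]


# Set up a small in-memory table to shard (illustrative data only).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE movie (id INTEGER PRIMARY KEY, title TEXT)")
conn.executemany(
    "INSERT INTO movie VALUES (?, ?)",
    [(i, f"movie-{i}") for i in range(10)],
)

queries = build_shard_queries("SELECT id, title FROM movie", "id", 4)

# Each query could run in its own read task; here they run sequentially.
shards = [conn.execute(q).fetchall() for q in queries]
all_ids = sorted(row[0] for shard in shards for row in shard)
```

Because every row satisfies exactly one of the `%` predicates, the shards are disjoint and together cover the full result set, which is what makes the reads safe to run concurrently.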
Related issue number
Closes #49206
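Checks
- I've signed off every commit (by using the -s flag, i.e., `git commit -s`) in this PR.
- I've run `scripts/format.sh` to lint the changes in this PR.
- I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a method in Tune, I've added it in `doc/source/tune/api/` under the corresponding `.rst` file.
- I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
  - Unit tests
  - Release tests
  - This PR is not tested :(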