Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FIX] Improve the Warning Displayed for Uneven Batch Distributions #4920

Merged
merged 2 commits into from
Feb 11, 2025

Conversation

alexbarghi-nv
Copy link
Member

In order to better debug issues with batch distribution, this PR updates the warning message to show how many batches each rank received. Partially resolves rapidsai/cugraph-gnn#130.

@alexbarghi-nv alexbarghi-nv self-assigned this Feb 5, 2025
Copy link

copy-pr-bot bot commented Feb 5, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@alexbarghi-nv alexbarghi-nv added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Feb 5, 2025
@github-actions github-actions bot added python and removed improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Feb 5, 2025
@alexbarghi-nv alexbarghi-nv added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Feb 5, 2025
@alexbarghi-nv alexbarghi-nv marked this pull request as ready for review February 5, 2025 20:09
@alexbarghi-nv alexbarghi-nv requested a review from a team as a code owner February 5, 2025 20:09
@alexbarghi-nv
Copy link
Member Author

/merge

@rapids-bot rapids-bot bot merged commit 6e5ca5a into rapidsai:branch-25.04 Feb 11, 2025
76 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
improvement Improvement / enhancement to an existing function non-breaking Non-breaking change python
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[BUG] Warn users if they have too many GPUs
2 participants