We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
tracking the issue here
No response
The text was updated successfully, but these errors were encountered:
first step: merge init_device into worker.__init__, before we create the model runner and cache engine, so that they can know the pp rank.
init_device
worker.__init__
initialize distributed environment has complicated interference with spec decode. the temporary solution is to store the rank in the parallel config.
Sorry, something went wrong.
Successfully merging a pull request may close this issue.
🚀 The feature, motivation and pitch
tracking the issue here
Alternatives
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: