We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
As titled. Make DeepSeek V2 MLA Faster!
No response
The text was updated successfully, but these errors were encountered:
Is there a specific timeline for this?
Sorry, something went wrong.
bmm fp8 has been implemented with flashinfer-ai/flashinfer#469 fp8 e5m2 kv cache has been implemented with #1204
Currently, there is no adaptation for DeepSeek V2 as we are focusing on other higher priority tasks. Expected to be completed within these few days.
done
ispobock
zhyncs
No branches or pull requests
Checklist
Motivation
As titled. Make DeepSeek V2 MLA Faster!
Related resources
No response
The text was updated successfully, but these errors were encountered: