Fallback to triton if we fail to compile for CUDA #223

Open
wants to merge 1 commit into
base: main

zbowling
Contributor

The CUDA driver might be available even when the full CUDA toolkit is not installed, in which case there is no nvcc to compile the kernel. When that happens, we use triton as a fallback.
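A minimal sketch of the fallback pattern described above, not the code in this PR. The names `nvcc_available`, `load_kernel`, `build_cuda`, and `build_triton` are hypothetical; in practice `build_cuda` would presumably wrap the JIT build (for example a call to `torch.utils.cpp_extension.load`), and `build_triton` would return the triton implementation.

```python
import shutil
from typing import Callable, Tuple, TypeVar

K = TypeVar("K")


def nvcc_available() -> bool:
    """Cheap pre-check: True if an nvcc binary is on PATH (not a guarantee the build works)."""
    return shutil.which("nvcc") is not None


def load_kernel(
    build_cuda: Callable[[], K],    # hypothetical: JIT-compiles and returns the CUDA kernel
    build_triton: Callable[[], K],  # hypothetical: returns the triton kernel
) -> Tuple[str, K]:
    """Try the CUDA build first; fall back to triton if it raises."""
    try:
        return "cuda", build_cuda()
    except Exception as exc:  # e.g. nvcc missing or a compiler error
        print(f"CUDA build failed ({exc!r}); falling back to triton")
        return "triton", build_triton()
```

Catching the build failure itself, rather than only checking for nvcc up front, also covers cases where nvcc exists but the compile still fails (assumed here to surface as an exception).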

We could consider compiling the kernel ahead of time (AOT) instead of JIT-compiling the kernel extension. However, this greatly increases packaging complexity, since we would have to target multiple PyTorch extension ABIs.

@zbowling zbowling changed the title Fallback triton if we fail to compile for CUDA Fallback to triton if we fail to compile for CUDA Feb 26, 2025