For pm-gpu: use small kernels as default for shoc/p3 #7099
doh! Yep, I was testing with it ON/OFF, and committed before turning it back ON. Fixed.
if this runs for you, then gucci 🍏
…o next (PR #7099) For pm-gpu, we see the small kernel path (SMALL_KERNELS) performing faster in all situations. Also BFB with monolithic kernels. Currently, this only impacts SHOC/P3. [bfb]
merged to next. Late Friday, but what's the worst that could happen?
Noting that with daily testing, all of the non-DEBUG cases are BFB.
For pm-gpu, we see the small kernel path (SMALL_KERNELS) performing faster in all situations.
The results are also BFB (bit-for-bit) with the monolithic kernels.
Currently, this only impacts SHOC/P3.
[bfb]
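
For readers unfamiliar with the term, here is a minimal sketch of what a bit-for-bit (BFB) comparison means, as opposed to a tolerance-based one. It is only an illustration of the concept: the function names and the sample values are hypothetical, and this is not how the project's test infrastructure performs its comparisons.

```python
import struct


def bits(x: float) -> bytes:
    """Return the IEEE-754 double-precision bytes of x."""
    return struct.pack("<d", x)


def is_bfb(a, b) -> bool:
    """True only if both sequences have equal length and every pair of values has identical bits."""
    return len(a) == len(b) and all(bits(x) == bits(y) for x, y in zip(a, b))


# Hypothetical outputs standing in for the two kernel paths.
monolithic = [0.1 + 0.2, 1.0, -0.0]
small_kernels = [0.1 + 0.2, 1.0, -0.0]

# Close in value, but 0.3 and 0.0 do not share bits with 0.1 + 0.2 and -0.0.
perturbed = [0.3, 1.0, 0.0]

print(is_bfb(monolithic, small_kernels))  # True
print(is_bfb(monolithic, perturbed))      # False
```

The second comparison fails even though the values agree to roughly 1e-16, which is why a BFB result is a stronger statement than "answers match within tolerance".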