-
Notifications
You must be signed in to change notification settings - Fork 99
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Nightly Trilinos failures in Stokhos, cuda/11.2.2, non-UVM build #1959
Comments
I'm trying to reproduce this now |
@ndellingwood I reproduced this and got a backtrace. It seems that the issue is simply that
So I think to fix this, Stokhos changes are needed. But those Stokhos changes would only be compatible with KK develop, not the 4.1 in Trilinos now. Should I try to patch the three recent spmv-related PRs (#1932, #1937 and #1953) into Trilinos, as well as the Stokhos update? |
Actually, I may be able to fix this in Stokhos using version ifdefs to work with both develop and master KK. |
OK, this was actually pretty easy to fix - trilinos/Trilinos#12190 should take care of it and only needed changes in Stokhos. I haven't finished testing it locally so it's a draft. |
Thanks for addressing this so quickly @brian-kelley ! |
That PR actually didn't fix the issue for kokkos-kernels develop, even though it passed PR testing with kokkos-kernels master. I'm now trying a different way to fix the issue and will open a new PR |
Nightly Trilinos failures in Stokhos, cuda/11.2.2, non-UVM builds in the following tests:
The build had previously been broken until merge of #1937 to resolve synchronization of kokkos-kernels@develop with Trilinos@develop; there is some discrepancy between the changes in #1937 and trilinos/Trilinos#12103 (I think the changes were motivated by failing Stokhos tests) - @cwpearson @brian-kelley could the discrepancy between that kokkos-kernels vs Trilinos result in the Stokhos failures shown here? Or is there an additional change needed in Stokhos to support the use of exec space instances with spmv added with #1932 ? Or something else?
To pinpoint if the incompatibility came with the streams update I can test before the bsr changes merged to both repos, e.g.
6c06bd0
trilinos/Trilinos@eece9b3
Reproducer (Weaver rhel8 queue):
The text was updated successfully, but these errors were encountered: