Kokkos + KokkosKernels Release to 3.5.00 #9836

Merged
merged 22 commits into from
Oct 29, 2021
76e1cd9
tpetra: move sort_crs_matrix out of Impl namespace
ndellingwood Jun 10, 2021
da10abe
amesos2: move sort_crs_matrix out of Impl namespace
ndellingwood Jul 8, 2021
f8c911d
zoltan2: modify "Vector" alias in Test_Sphynx
ndellingwood Jul 12, 2021
544eeda
sacado, stokhos: replace KOKKOS_IMPL_CUDA_* macros with Cuda functions
ndellingwood Jul 22, 2021
60e27ce
tpetra: move sort_crs_matrix out of Impl namespace
ndellingwood Jun 10, 2021
c0e1471
amesos2: move sort_crs_matrix out of Impl namespace
ndellingwood Jul 8, 2021
a891ab2
zoltan2: modify "Vector" alias in Test_Sphynx
ndellingwood Jul 12, 2021
f86c91a
sacado, stokhos: replace KOKKOS_IMPL_CUDA_* macros with Cuda functions
ndellingwood Jul 22, 2021
d72eb85
atdm/contributed/weaver: update modules
ndellingwood Oct 1, 2021
e6aee98
intrepid2: workaround intel internal compiler error in Intrepid2_Data
ndellingwood Oct 1, 2021
08f1f07
Merge branch 'kokkos-promotion' of https://github.com/trilinos/Trilin…
ndellingwood Oct 1, 2021
536a384
tpetra: move sort_crs_* out of Impl namespace
ndellingwood Oct 14, 2021
bed59f2
Merge branch 'develop' into kokkos-promotion
ndellingwood Oct 14, 2021
fa115c1
ifpack2,sacado: rename CUDA_SAFE_CALL -> KOKKOS_IMPL_CUDA_SAFE_CALL
ndellingwood Oct 14, 2021
7b63df4
Update packages to move Kokkos::Timer out of impl namespace
ndellingwood Oct 15, 2021
448daeb
intrepid2: remove deprecation warnings
ndellingwood Oct 22, 2021
eb03da3
amesos2: resolve unused warning in superlu interface
ndellingwood Oct 22, 2021
2ac5617
stokhos: resolve -Werror
ndellingwood Oct 22, 2021
4275f6b
Snapshot of kokkos.git from commit 8dc4a906d43ae8eacc951cc5d7e95ad2df…
ndellingwood Oct 28, 2021
cd7a9c7
Snapshot of kokkos-kernels.git from commit 14d29f0a04f9fc959c7c96d98e…
ndellingwood Oct 28, 2021
6909d93
Intrepid2 - deep copy range match
kyungjoo-kim Oct 28, 2021
d63e635
intrepid2: resolve signed-unsigned warning
ndellingwood Oct 29, 2021
4 changes: 3 additions & 1 deletion packages/intrepid2/src/Shared/Intrepid2_PointToolsDef.hpp
@@ -321,7 +321,9 @@ getWarpBlendLatticeLine( Kokkos::DynRankView<pointValueType,pointPropertie
// this should be fixed after view and dynrankview is interoperatable
auto z = Kokkos::DynRankView<pointValueType,Kokkos::HostSpace>(zHost.data() + offset, np-offset);

- Kokkos::deep_copy(pts, z);
+ const auto common_range = range_type(0, std::min(pts.extent(0), z.extent(0)));
Contributor

@kyungjoo-kim It seems to me that a cleaner way to fix this is to declare z as:
auto z = Kokkos::DynRankView<pointValueType,Kokkos::HostSpace>(zHost.data() + offset, s);
There is no reason to create it larger than s = np-2*offset and then take a subview of it. This would avoid two subviews.

Contributor

You are right. It will be fixed in the next commit.

Contributor

We are wrong. The common range is necessary. I printed both dimensions: in some cases pts is larger and in others z is larger. I did not dig further, as the current solution still resolves the deep copy mismatch issue.

+ Kokkos::deep_copy(Kokkos::subview(pts, common_range),
+ Kokkos::subview(z, common_range));
}
}
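
For readers without a Kokkos build at hand, the common-range idea discussed in the review thread can be sketched in plain C++. This is only an analogue under stated assumptions: `copy_common_range` is a hypothetical helper, not part of Kokkos or Intrepid2, and `std::vector` stands in for the rank-1 views `pts` and `z`. It copies the leading `min(dst.size(), src.size())` elements and leaves the rest of the destination untouched, which mirrors taking a subview over `common_range` on both sides before the deep copy.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical helper for illustration only (not a Kokkos or Intrepid2 API).
// Copies the elements that both buffers have in common, starting at index 0,
// and returns how many elements were copied.
template <typename T>
std::size_t copy_common_range(std::vector<T>& dst, const std::vector<T>& src) {
  // Analogue of range_type(0, std::min(pts.extent(0), z.extent(0))).
  const std::size_t n = std::min(dst.size(), src.size());

  // Analogue of deep_copy(subview(pts, common_range), subview(z, common_range)):
  // only the overlapping prefix is written, so neither buffer is overrun.
  std::copy_n(src.begin(), n, dst.begin());
  return n;
}
```

Because the length is computed from both sides, the helper behaves the same whether the destination or the source is the longer buffer, which matches the observation in the thread that either `pts` or `z` can be the larger one.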
