-
Notifications
You must be signed in to change notification settings - Fork 579
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Kokkos: Not rebuilding libkokkoscore correctly on rebuilds? #6855
Comments
Adding @jjwilke |
I will post back where what I find and try to characterize this problem. I have build directories that have been rebuilding for months that show this problem. When I blew away the build directory and let it build from scratch, the correct libkokkoscore.a got built (see SPAR-767). Hopefully I can simulate what happened in a new source and build directory so we can reproduce this. Stay tuned ... |
Linking a couple kokkos issues that may help investigating: kokkos/kokkos#1902 |
That is not the problem. When you build Trilinos from scratch you get 'libkokkoscore.a'. Of course that big refactoring that is what I am suspecting is the cause here. I will see. |
FYI: I have been able to create a local reproducer. I will post a detailed comment with instructions on how to run the simulation and prove that the Kokkos_Serial.cpp.o file is not getting rebuilt correctly (but when you build from scratch it does). Using ninja debugging aids it should hopefully be pretty easy to figure out what the problem is and how to fix this so this does not happen in the future with rebuilds of Trilinos. |
) This is to get all of the ATDM Trilinos builds to build from scratch tomorrow so that the SPARC Trilinos Integration builds will start working against on 2020-02-21. I will then revert this commit the next day and get back to work debugging the core problem with the Kokkos rebuild problem in #6855.
…y-manual-updates (SPAR-767, #6855)
@trilinos/kokkos, FYI: More evidence in #7195 that Kokkos is not rebuilding correctly after the big CMake refactor. I will add detailed instructions on how to reproduce the problem. |
Do we have reproducer instructions yet? |
I will get this updated when I can. Just have a bunch of other stuff right now. |
We have not seen this problem in a long time. Perhaps it has been fixed? I could still provide reproducibility instructions but not sure what the point would be at this point. |
CC: @crtrott I think another example of this problem is documented in #8638 (comment) (expand TEST DETAILS and look at section ride). Trying to rebuild the build
I had to delete the build |
FYI: I just his this error after a local rebuild attempt of Trilinos:
I had to blow away the entire build dir and configure from scratch to make this error go away. (Just deleting the After the big Kokkos CMake build system refactoring 1.5 years ago something went wrong that has broken rebuilds in some cases. Some dependency rule must not have been set correctly as part of that refactoring. At some point, someone needs to take some time to debug what is happening and fix this. Before this, I can't remember the last time I had a build error with Trilinos doing a rebuild after reconfiguring from scratch (i.e. |
This issue has had no activity for 365 days and is marked for closure. It will be closed after an additional 30 days of inactivity. |
This issue was closed due to inactivity for 395 days. |
As part of debugging a SPARC Trilinos Integration Build error (see SPAR-767), it was discovered that
libkokkoscore
was not getting correctly rebuilt after the Kokkos 2.99 promotion after the merging of the PR #6671This issue is to try to characterize the problem and see if it can be reproduced so it can be fixed. We really need to maintain the ability to rebuild Trilinos reliability since it saves a massive amount of CPU cycles in the ATDM Trilinos builds and saves a lot developer time if they can just rebuild and not have to blow away their build directories all the time.
The text was updated successfully, but these errors were encountered: