-
Notifications
You must be signed in to change notification settings - Fork 578
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ATDM: ATS-2 testing documentation needs environment variable info #6902
Comments
@kddevin, I will look into adding some documentation for this to the atdm/READM.md file. However, I just noticed that it looks like
That is not good. We will have to debug why the driver that is setting Also an FYI: The Trilinos Framework team does not support that ATDM Trilinos build configurations currently. Perhaps that will change in the future but at this point there is no connection between the ATDM Trilinos testing efforts and that Trilinos Framework efforts. (Therefore, I removed the mention of the Trilinos Framework team and removed the |
…trilinos#6902) Turns out that the tests in the "cuda-aware-mpi" builds on 'vortex' were not actually setting TPETRA_ASSUME_CUDA_AWARE_MPI=1. This commit fixes that and it also documents for users how to run the test suite with and without CUDA-aware MPI in Tpetra.
…anual-updates (CDOFA-100, #6902)
… mention trilinos_jsrun (CDOFA-100, trilinos#6902)
… mention trilinos_jsrun (CDOFA-100, trilinos#6902)
…6902-cuda-aware-mpi Automatically Merged using Trilinos Pull Request AutoTester PR Title: Fix up and document handling of CUDA-aware MPI with Tpetra (CDOFA-100, #6902) PR Author: bartlettroscoe
Thanks for the PR #6904 |
It is printed out by
If you compare that to the CUDA-aware running of that same test here you see:
How much of that info needs to be stated in that README.md file? Can we just say:
? |
I like showing the examples, as you did in this github issue, so that people know what they are looking for. |
@kddevin, okay, I will post a new PR and let you review it |
…s:develop' (b6d9daf). * trilinos-develop: ATDM: ats2: Update documentation for TPETRA_ASSUME_CUDA_AWARE_MPI and mention trilinos_jsrun (CDOFA-100, trilinos#6902) Allow for default setting of TPETRA_ASSUME_CUDA_AWARE_MPI=0 in trilinos_jsrun (CDOFA-100) Fix typo in trilinos_jsrun default value for Tpetra cuda-aware MueLu: Use Amesos2 parameter list option for matrices with non-contiguous Maps. Fix up and document handling of CUDA-aware MPI with Tpetra (CDOFA-100, trilinos#6902) Framework: Enhance failure messaging in PR test driver Framework: Fixing bug in PR test script Framework: Fixing PR test driver python script tests
…s:develop' (b6d9daf). * trilinos-develop: ATDM: ats2: Update documentation for TPETRA_ASSUME_CUDA_AWARE_MPI and mention trilinos_jsrun (CDOFA-100, trilinos#6902) Allow for default setting of TPETRA_ASSUME_CUDA_AWARE_MPI=0 in trilinos_jsrun (CDOFA-100) Fix typo in trilinos_jsrun default value for Tpetra cuda-aware MueLu: Use Amesos2 parameter list option for matrices with non-contiguous Maps. Fix up and document handling of CUDA-aware MPI with Tpetra (CDOFA-100, trilinos#6902) Framework: Enhance failure messaging in PR test driver Framework: Fixing bug in PR test script Framework: Fixing PR test driver python script tests
@jjellio, responding to your comment from the merged PR #6904 below (since this is the issue about documentation) ...
I can see some advantages to having the build name keyword
Right. I could not find any documentation for the env var The only documentation for this should not be in the ATDM Trilinos configuration system and just for ATS-2. That is not good.
Is there is any disadvantage to turning off the CUDA hooks for a single-rank MPI job? I can't think of any for smart code. (Software that is running on one MPI rank should not be calling any MPI communication routines if it is smart.)
@jjellio, would you be willing to try to post a new PR to extend the documentation in the section: and negotiate with @kddevin on what she would consider to be adequate documentation for this system? |
@jjellio and @kddevin , it occurred to me that we could define the ATDM Trilinos build name keywords
That could not be more clear. It would also simplify the logic in our ctest -S driver scripts as well and in our instructions on the README.md page. @kddevin, I think that would solve the CUDA-aware MPI documentation problem, no? |
Sure, the naming convention plus the BEFORE/AFTER examples you have above would be great. Thanks. |
This issue has had no activity for 365 days and is marked for closure. It will be closed after an additional 30 days of inactivity. |
This issue was closed due to inactivity for 395 days. |
Documentation
ATS-2 testing documentation has no info on use of environment variables such as TPETRA_ASSUME_CUDA_AWARE_MPI in testing.
https://github.com/trilinos/Trilinos/blob/develop/cmake/std/atdm/README.md#ats-2
For users to reproduce errors seen on the dashboard, they need to know where to look in CDash for environment variable settings and how to set them the same way on the platform. The location of these settings in CDash is not obvious.
This documentation is needed because ATS-2 testing now uses a single build with different environment variable settings for multiple tests. Previously, all environment information could be gleaned from the build configuration, as currently described in the documentation link above.
The text was updated successfully, but these errors were encountered: