Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enable TeuchosNumerics_LAPACK_test_MPI_1 test in 'debug' Power8 and Power8 builds and disable just the STEQR() test #4064

Conversation

bartlettroscoe
Copy link
Member

CC: @trilinos/teuchos, @fryeguy52

Description

Enable TeuchosNumerics_LAPACK_test_MPI_1 test in 'debug' Power8 and Power8 builds and disable just the STEQR() test.

NOTE: This test was being disabled in some release-debug builds where it did not need to be disabled. In fact, the STEQR() function does not segfault when using optimized compiler options (which are used in a release-debug build).

Motivation and Context

We need to be running LAPACK tests for the functions being used by Trilinos and not disable them all (see #2410).

How Has This Been Tested?

On 'ride' I did:

$ bsub -x -Is -q rhel7F -n 16 \
  ./checkin-test-atdm.sh \
  cuda-9.2-debug gnu-openmp-debug \
  --enable-packages=TeuchosNumerics --local-do-all

which returned:

PASSED (NOT READY TO PUSH): Trilinos: ride10

Mon Dec 17 08:51:43 MST 2018

Enabled Packages: TeuchosNumerics

Build test results:
-------------------
1) cuda-9.2-debug => passed: passed=16,notpassed=0 (1.50 min)
2) gnu-openmp-debug => passed: passed=16,notpassed=0 (0.60 min)

and showed:

$ for build_name in cuda-9.2-debug gnu-openmp-debug ; do grep -nH TeuchosNumerics_LAPACK_test ${build_name}/ctest.out ; done | grep "Pass"

cuda-9.2-debug/ctest.out:33:16/16 Test  #9: TeuchosNumerics_LAPACK_test_MPI_1 ..................   Passed    0.43 sec
gnu-openmp-debug/ctest.out:30:13/16 Test  #9: TeuchosNumerics_LAPACK_test_MPI_1 ..................   Passed    1.88 sec

And I ran this on 'waterman' using:

$ bsub -x -Is -n 20 \
  ./checkin-test-atdm.sh \
  cuda-9.2-debug cuda-9.2-release-debug gnu-openmp-release-debug \
  --enable-packages=TeuchosNumerics --local-do-all

which returned:

PASSED (NOT READY TO PUSH): Trilinos: waterman1

Mon Dec 17 08:56:33 MST 2018

Enabled Packages: TeuchosNumerics

Build test results:
-------------------
1) cuda-9.2-debug => passed: passed=16,notpassed=0 (1.20 min)
2) cuda-9.2-release-debug => passed: passed=16,notpassed=0 (1.34 min)
3) gnu-openmp-release-debug => passed: passed=16,notpassed=0 (1.08 min)

and showed:

$ for build_name in cuda-9.2-debug cuda-9.2-release-debug gnu-openmp-release-debug ; do grep -nH TeuchosNumerics_LAPACK_test ${build_name}/ctest.out ; done | grep "Pass"

cuda-9.2-debug/ctest.out:32:15/16 Test  #9: TeuchosNumerics_LAPACK_test_MPI_1 ..................   Passed    2.20 sec
cuda-9.2-release-debug/ctest.out:29:12/16 Test  #9: TeuchosNumerics_LAPACK_test_MPI_1 ..................   Passed    2.24 sec
gnu-openmp-release-debug/ctest.out:26: 9/16 Test  #9: TeuchosNumerics_LAPACK_test_MPI_1 ..................   Passed    4.10 sec

Checklist

  • My commit messages mention the appropriate GitHub issue numbers.
  • All new and existing tests passed.
  • No new compiler warnings were introduced.

This is the only function that continues to segfault on Power8 and Power9
builds using NETLIB LAPACK in debug builds.  All of the Trilinos code
downstream has removed a dependence on this LAPACK function. This will allow
the rest of the LAPACK tests to be run in Power8 and Power9 'debug' builds.
…EQR() test (trilinos#2410)

At least this way we are running the Teuchos LAPACK tests for the LAPACK
functions being used downstream in Trilinos.
@bartlettroscoe bartlettroscoe added type: bug The primary issue is a bug in Trilinos code or tests pkg: Teuchos Issues primarily dealing with the Teuchos Package client: ATDM Any issue primarily impacting the ATDM project ATDM Env Issue Issue with ATDM build or test caused (at least partly) by the env, not a bug in Trilinos PA: Framework Issues that fall under the Trilinos Framework Product Area labels Dec 17, 2018
@bartlettroscoe bartlettroscoe added the stage: in progress Work on the issue has started label Dec 17, 2018
@bartlettroscoe bartlettroscoe added the AT: AUTOMERGE Causes the PR autotester to automatically merge the PR branch once approvals are completed label Dec 17, 2018
@trilinos-autotester
Copy link
Contributor

Status Flag 'Pre-Test Inspection' - Auto Inspected - Inspection Is Not Necessary for this Pull Request.

Copy link
Contributor

@hkthorn hkthorn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me. This is a good idea to move forward with improving LAPACK test coverage.

@trilinos-autotester
Copy link
Contributor

Status Flag 'Pull Request AutoTester' - Failure: Timed out waiting for job Trilinos_pullrequest_intel_17.0.1 to start: Total Wait = 603

  • Other jobs have been previously started - We must stop them...

@bartlettroscoe bartlettroscoe added the AT: RETEST Causes the PR autotester to run a new round of PR tests on the next iteration label Dec 17, 2018
@mhoemmen
Copy link
Contributor

Thanks @bartlettroscoe ! :D

@trilinos-autotester
Copy link
Contributor

Status Flag 'Pull Request AutoTester' - User Requested Retest - Label AT: RETEST will be reset after testing.

@trilinos-autotester
Copy link
Contributor

Status Flag 'Pull Request AutoTester' - Failure: Timed out waiting for job Trilinos_pullrequest_gcc_7.2.0 to start: Total Wait = 603

@trilinos-autotester trilinos-autotester removed the AT: RETEST Causes the PR autotester to run a new round of PR tests on the next iteration label Dec 17, 2018
@bartlettroscoe bartlettroscoe added the AT: RETEST Causes the PR autotester to run a new round of PR tests on the next iteration label Dec 18, 2018
@trilinos-autotester
Copy link
Contributor

Status Flag 'Pull Request AutoTester' - User Requested Retest - Label AT: RETEST will be reset after testing.

@trilinos-autotester
Copy link
Contributor

Status Flag 'Pull Request AutoTester' - Testing Jenkins Projects:

Pull Request Auto Testing STARTING (click to expand)

Build Information

Test Name: Trilinos_pullrequest_intel_17.0.1

  • Build Num: 1933
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
PULLREQUESTNUM 4064
TEST_REPO_ALIAS TRILINOS
TRILINOS_SOURCE_BRANCH 2410-power-disable-lapack-steqr
TRILINOS_SOURCE_REPO https://github.com/bartlettroscoe/Trilinos
TRILINOS_SOURCE_SHA 71e9543
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA f94ba63

Build Information

Test Name: Trilinos_pullrequest_gcc_4.9.3

  • Build Num: 2499
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
COMPILER_MODULE sems-gcc/4.9.3
JENKINS_BUILD_TYPE Release
JENKINS_COMM_TYPE MPI
JENKINS_DO_COMPLEX OFF
JENKINS_JOB_TYPE Experimental
MPI_MODULE sems-openmpi/1.8.7
PULLREQUESTNUM 4064
TEST_REPO_ALIAS TRILINOS
TRILINOS_SOURCE_BRANCH 2410-power-disable-lapack-steqr
TRILINOS_SOURCE_REPO https://github.com/bartlettroscoe/Trilinos
TRILINOS_SOURCE_SHA 71e9543
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA f94ba63

Build Information

Test Name: Trilinos_pullrequest_gcc_4.8.4

  • Build Num: 2131
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
COMPILER_MODULE sems-gcc/4.8.4
JENKINS_BUILD_TYPE Release
JENKINS_COMM_TYPE MPI
JENKINS_DO_COMPLEX OFF
JENKINS_JOB_TYPE Experimental
MPI_MODULE sems-openmpi/1.8.7
PULLREQUESTNUM 4064
TEST_REPO_ALIAS TRILINOS
TRILINOS_SOURCE_BRANCH 2410-power-disable-lapack-steqr
TRILINOS_SOURCE_REPO https://github.com/bartlettroscoe/Trilinos
TRILINOS_SOURCE_SHA 71e9543
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA f94ba63

Build Information

Test Name: Trilinos_pullrequest_gcc_4.9.3_SERIAL

  • Build Num: 424
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
PULLREQUESTNUM 4064
TEST_REPO_ALIAS TRILINOS
TRILINOS_SOURCE_BRANCH 2410-power-disable-lapack-steqr
TRILINOS_SOURCE_REPO https://github.com/bartlettroscoe/Trilinos
TRILINOS_SOURCE_SHA 71e9543
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA f94ba63

Build Information

Test Name: Trilinos_pullrequest_gcc_7.2.0

  • Build Num: 38
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
PULLREQUESTNUM 4064
TEST_REPO_ALIAS TRILINOS
TRILINOS_SOURCE_BRANCH 2410-power-disable-lapack-steqr
TRILINOS_SOURCE_REPO https://github.com/bartlettroscoe/Trilinos
TRILINOS_SOURCE_SHA 71e9543
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA f94ba63

Using Repos:

Repo: TRILINOS (bartlettroscoe/Trilinos)
  • Branch: 2410-power-disable-lapack-steqr
  • SHA: 71e9543
  • Mode: TEST_REPO

Pull Request Author: bartlettroscoe

@trilinos-autotester
Copy link
Contributor

Status Flag 'Pull Request AutoTester' - Jenkins Testing: all Jobs PASSED

Pull Request Auto Testing has PASSED (click to expand)

Build Information

Test Name: Trilinos_pullrequest_intel_17.0.1

  • Build Num: 1933
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
PULLREQUESTNUM 4064
TEST_REPO_ALIAS TRILINOS
TRILINOS_SOURCE_BRANCH 2410-power-disable-lapack-steqr
TRILINOS_SOURCE_REPO https://github.com/bartlettroscoe/Trilinos
TRILINOS_SOURCE_SHA 71e9543
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA f94ba63

Build Information

Test Name: Trilinos_pullrequest_gcc_4.9.3

  • Build Num: 2499
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
COMPILER_MODULE sems-gcc/4.9.3
JENKINS_BUILD_TYPE Release
JENKINS_COMM_TYPE MPI
JENKINS_DO_COMPLEX OFF
JENKINS_JOB_TYPE Experimental
MPI_MODULE sems-openmpi/1.8.7
PULLREQUESTNUM 4064
TEST_REPO_ALIAS TRILINOS
TRILINOS_SOURCE_BRANCH 2410-power-disable-lapack-steqr
TRILINOS_SOURCE_REPO https://github.com/bartlettroscoe/Trilinos
TRILINOS_SOURCE_SHA 71e9543
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA f94ba63

Build Information

Test Name: Trilinos_pullrequest_gcc_4.8.4

  • Build Num: 2131
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
COMPILER_MODULE sems-gcc/4.8.4
JENKINS_BUILD_TYPE Release
JENKINS_COMM_TYPE MPI
JENKINS_DO_COMPLEX OFF
JENKINS_JOB_TYPE Experimental
MPI_MODULE sems-openmpi/1.8.7
PULLREQUESTNUM 4064
TEST_REPO_ALIAS TRILINOS
TRILINOS_SOURCE_BRANCH 2410-power-disable-lapack-steqr
TRILINOS_SOURCE_REPO https://github.com/bartlettroscoe/Trilinos
TRILINOS_SOURCE_SHA 71e9543
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA f94ba63

Build Information

Test Name: Trilinos_pullrequest_gcc_4.9.3_SERIAL

  • Build Num: 424
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
PULLREQUESTNUM 4064
TEST_REPO_ALIAS TRILINOS
TRILINOS_SOURCE_BRANCH 2410-power-disable-lapack-steqr
TRILINOS_SOURCE_REPO https://github.com/bartlettroscoe/Trilinos
TRILINOS_SOURCE_SHA 71e9543
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA f94ba63

Build Information

Test Name: Trilinos_pullrequest_gcc_7.2.0

  • Build Num: 38
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
PULLREQUESTNUM 4064
TEST_REPO_ALIAS TRILINOS
TRILINOS_SOURCE_BRANCH 2410-power-disable-lapack-steqr
TRILINOS_SOURCE_REPO https://github.com/bartlettroscoe/Trilinos
TRILINOS_SOURCE_SHA 71e9543
TRILINOS_TARGET_BRANCH develop
TRILINOS_TARGET_REPO https://github.com/trilinos/Trilinos
TRILINOS_TARGET_SHA f94ba63


CDash Test Results for PR# 4064.

@trilinos-autotester trilinos-autotester removed the AT: RETEST Causes the PR autotester to run a new round of PR tests on the next iteration label Dec 18, 2018
@trilinos-autotester
Copy link
Contributor

Status Flag 'Pre-Merge Inspection' - SUCCESS: The last commit to this Pull Request has been INSPECTED AND APPROVED by [ mhoemmen ]!

@trilinos-autotester
Copy link
Contributor

Status Flag 'Pull Request AutoTester' - Pull Request will be Automerged

@trilinos-autotester trilinos-autotester merged commit 0079967 into trilinos:develop Dec 18, 2018
@trilinos-autotester
Copy link
Contributor

Merge on Pull Request# 4064: IS A SUCCESS - Pull Request successfully merged

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ATDM Env Issue Issue with ATDM build or test caused (at least partly) by the env, not a bug in Trilinos client: ATDM Any issue primarily impacting the ATDM project PA: Framework Issues that fall under the Trilinos Framework Product Area pkg: Teuchos Issues primarily dealing with the Teuchos Package type: bug The primary issue is a bug in Trilinos code or tests
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants