CC: @trilinos/tpetra, @kddevin (Trilinos Data Services Product Lead), @bartlettroscoe, @fryeguy52
As shown in this query, the build:
has several failing TpetraCore tests. The following are failing every day (CDash):
while these have failed randomly over the last couple of weeks (CDash):
Several more tests have failed over the last 2 weeks in a similar complex shared build shown here:
Those tests are:
Failed TpetraCore tests for the current testing day
One should be able to reproduce this failure with a sems rhel6 environment as described in:
More specifically, the commands for a sems rhel6 environment are provided at:
The exact commands to reproduce this issue should be:
$ cd <some_build_dir>/
$ source $TRILINOS_DIR/cmake/std/atdm/load-env.sh Trilinos-atdm-sems-rhel7-cuda-9.2-Volta70-complex-static-release-debug
$ cmake \
-GNinja \
-DTrilinos_CONFIGURE_OPTIONS_FILE:STRING=cmake/std/atdm/ATDMDevEnv.cmake \
-DTrilinos_ENABLE_TESTS=ON -DTrilinos_ENABLE_Tpetra=ON \
$TRILINOS_DIR
$ make NP=16
$ ctest -j8
FYI: We switched from ctest -j10 to ctest -j4 on 'ascicgpu14' in #4865, merged to 'develop' on 4/10/2019, and it seems to have addressed the problem. Putting this in review and running for a few days to verify the problem is fixed.
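For anyone who wants to try the lower parallelism level locally, here is a minimal sketch. It assumes you are in a build directory that has already been configured and built, and that the affected tests all have "TpetraCore" in their names. The helper name `run_tpetra_tests` is made up for illustration and is not part of Trilinos or the ATDM scripts.

```shell
# Hypothetical helper (not part of Trilinos): re-run only the tests whose
# names match "TpetraCore" at a chosen ctest parallel level.
# Usage: run_tpetra_tests [parallel_level]   (defaults to 4)
run_tpetra_tests() {
  jobs="${1:-4}"
  # -R selects tests by regex; --output-on-failure prints the log of any
  # test that fails, so random failures can be inspected right away.
  ctest -j"${jobs}" -R TpetraCore --output-on-failure
}
```

Running `run_tpetra_tests 10` and then `run_tpetra_tests 4` a few times each should give a rough idea of whether the random failures depend on the parallel level, though a handful of local runs is weaker evidence than the nightly history on CDash.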
As shown in the table below from our CDash analysis tool (#2933), all of these tests have been passing for at least 9 consecutive days since the merge of PR #4865.
Therefore, we can close this issue.
Closing as complete as per above.