[ISSUE] Pods Randomly Going Down During Test Runs #1873

Open
Tracked by #2774
slolatte opened this issue May 29, 2024 · 5 comments
Assignees: aabouzaid
Labels: area/ci, cycle/alpha4 (Tasks will be done in alpha4 cycle), kind/issue (Unidentified issue, it could be a bug, misconfig, or anything in between)

Comments

@slolatte

Describe the issue:

The component pods go down at random because of the nature of the GKE nodes, which causes most of the UI tests in the cross-component test suite to fail. This makes the suite flaky and unreliable, which is a problem because it is used for nightly regression testing of the Helm chart with SNAPSHOT component versions, as well as for release testing.

Actual behavior:

The pods go down at random, which results in a '503 Service Temporarily Unavailable' error in the UI.

image

image (1)

Expected behavior:

The pods should not go down and the UI should appear as normal.

How to reproduce:

Run the nightly SM Helm cross-component test suite.

Logs:

Environment:

Please note: Without the following info, it's hard to resolve the issue, and it will probably be closed.

  • Platform:
  • Helm CLI version:
  • Chart version:
  • Values file:

(for environment see details above)

@slolatte added the kind/issue label (Unidentified issue, it could be a bug, misconfig, or anything in between) on May 29, 2024
@aabouzaid self-assigned this on Jul 4, 2024
@aabouzaid added the cycle/alpha4 (Tasks will be done in alpha4 cycle) and area/ci labels on Jul 4, 2024
@aabouzaid changed the title from "[ISSUE] <Pods Randomly Going Down During Test Runs>" to "[ISSUE] Pods Randomly Going Down During Test Runs" on Jul 4, 2024
@markfarkas-camunda

After the 503 errors mentioned above, once the login screen is available again, entering the correct credentials briefly shows an error message saying "your session expired". About half a second later, the page redirects back to the login page.

@slolatte
Author

slolatte commented Jul 5, 2024

I am now also seeing this error at random:
Screenshot 2024-07-05 at 19 14 31

@slolatte
Author

slolatte commented Aug 6, 2024

@aabouzaid Is there any update on when this might be fixed?

@aabouzaid
Member

@slolatte
Author

slolatte commented Sep 9, 2024

@aabouzaid While performing release testing, I sometimes encountered 404 errors. Could this be related?

See the following Slack thread for more details.

3 participants