-
Notifications
You must be signed in to change notification settings - Fork 25.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CI] ActivateWatchTests fails occasionally on master not able to delete index #66495
Comments
Pinging @elastic/es-core-features (Team:Core/Features) |
During the teardown, the test attempts to delete write index of data stream. This is prohibited, a write index of a data stream must always exist (other backing indices can be deleted). The data stream should be removed in its entirety, which is what is supposed to happen via |
It looks like the watcher history data stream gets created after watcher has been stopped and the previous data stream has been removed:
I suspect this happens because write requests for watch history are still inflight after watcher has been stopped. |
by waiting for all tasks to complete before deleting watcher history data stream. Closes elastic#66495
by waiting for all tasks to complete before deleting watcher history data stream. Closes #66495
I see the same stacktrace in a failure today: https://gradle-enterprise.elastic.co/s/5hq6kmhcgi53i/console-log?anchor=7304 |
This happened in a PR build here: pretty sure it is not related to my changes, since the only change I did was to merge in master. |
another failure on 6.8
does not reproduce locally |
And another on master https://gradle-enterprise.elastic.co/s/haa4hv337tiaq Also this seems to be the same issue as #61538. I'll close that as this one has had more discussion. |
Another one on master https://gradle-enterprise.elastic.co/s/hy5u6idtonkea |
Happened today again on
|
Happened again off of main branch: https://gradle-enterprise.elastic.co/s/y6r4m2tn4ogo2 trace
|
Another instance: https://gradle-enterprise.elastic.co/s/zmyadhq4xjppw, same trace. |
happened again https://gradle-enterprise.elastic.co/s/e5dxrbejgnibk, I'll mute this on master |
This commit attempts to fix a set of Watcher tests that can fail due to unexpected execution of Watches after Watcher has been stopped. The theory here is that a Watch can be queued but not fully executed then Watcher is shutdown, the test does some clean up, then the queued Watch finishes execution and causes some some additional cleanup to fail. The change here ensures that when Watcher is stopped from AbstractWatcherIntegrationTestCase that it will also wait until there are no more current Watches executing. closes #66495
This commit attempts to fix a set of Watcher tests that can fail due to unexpected execution of Watches after Watcher has been stopped. The theory here is that a Watch can be queued but not fully executed then Watcher is shutdown, the test does some clean up, then the queued Watch finishes execution and causes some some additional cleanup to fail. The change here ensures that when Watcher is stopped from AbstractWatcherIntegrationTestCase that it will also wait until there are no more current Watches executing. closes elastic#66495
This commit attempts to fix a set of Watcher tests that can fail due to unexpected execution of Watches after Watcher has been stopped. The theory here is that a Watch can be queued but not fully executed then Watcher is shutdown, the test does some clean up, then the queued Watch finishes execution and causes some some additional cleanup to fail. The change here ensures that when Watcher is stopped from AbstractWatcherIntegrationTestCase that it will also wait until there are no more current Watches executing. closes #66495
This commit attempts to fix a set of Watcher tests that can fail due to unexpected execution of Watches after Watcher has been stopped. The theory here is that a Watch can be queued but not fully executed then Watcher is shutdown, the test does some clean up, then the queued Watch finishes execution and causes some some additional cleanup to fail. The change here ensures that when Watcher is stopped from AbstractWatcherIntegrationTestCase that it will also wait until there are no more current Watches executing. closes elastic#66495
This commit attempts to fix a set of Watcher tests that can fail due to unexpected execution of Watches after Watcher has been stopped. The theory here is that a Watch can be queued but not fully executed then Watcher is shutdown, the test does some clean up, then the queued Watch finishes execution and causes some some additional cleanup to fail. The change here ensures that when Watcher is stopped from AbstractWatcherIntegrationTestCase that it will also wait until there are no more current Watches executing. closes elastic#66495
Seeing this failing occasionally (4 out of ~1200 in 7 days). I wonder if this is showing some specific issue with the test or with the infrastructure as the error seems to indicate some sort of a permission issue?
Build scan:
https://gradle-enterprise.elastic.co/s/usis5fhbuqx6y/tests/:x-pack:plugin:watcher:internalClusterTest/org.elasticsearch.xpack.watcher.transport.action.activate.ActivateWatchTests/testDeactivateAndActivate#1
Repro line:
./gradlew ':x-pack:plugin:watcher:internalClusterTest' --tests "org.elasticsearch.xpack.watcher.transport.action.activate.ActivateWatchTests.testDeactivateAndActivate" -Dtests.seed=FE8DAE73AB0A616A -Dtests.security.manager=true -Dtests.locale=ru-RU -Dtests.timezone=US/Central -Druntime.java=11
Reproduces locally?:
no.
Applicable branches:
master
Failure history:
https://gradle-enterprise.elastic.co/scans/tests?search.relativeStartTime=P7D&search.timeZoneId=Europe/Berlin&tests.container=org.elasticsearch.xpack.watcher.transport.action.activate.ActivateWatchTests&tests.sortField=FAILED&tests.test=testDeactivateAndActivate&tests.unstableOnly=true
-->
Failure excerpt:
The text was updated successfully, but these errors were encountered: