-
Notifications
You must be signed in to change notification settings - Fork 3.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
webdriver/tests/ are flaky in Firefox and Safari #28925
Comments
The flakiness with the user prompts in safaridriver is known (rdar://54401037 for anyone at Apple who comes across this) |
@gsnedders if it is all caused by user prompts, do you think there's any hacky hack that could be used to make the tests more stable in the meantime? |
No idea. |
OK 😄 |
@foolip are the failures always around changing the window size of the Firefox window? We have some known intermittent failures on Linux for that, but I wonder if using Ubuntu 20.04 makes it even worse. In our CI we still have 18.04 LTS. |
@whimboo it could be, but the From https://wpt.fyi/insights you can generate views that are helpful for this: It does look like it's always those 3 tests... |
Thanks for that link. So from these failures and when I enable details I can only see an What changed recently in Firefox is the new type of content modal dialogs, which are in use by |
If you start at https://wpt.fyi/runs and start scrolling down you can find older runs, but that's a bit tedious. This isn't possible via the UI (I think) but if you add It looks like the tests were flaky then as well. Going back a year more in time it's the same. Have you checked if these are stable in Gecko CI? If yes, and if these tests are perfectly reliable locally, then you could try making logging more verbose by tweaking here: wpt/tools/ci/taskcluster-run.py Lines 75 to 84 in e545686
|
Oh, I can actually see a lot of multiple statuses set for these tests: That might explain why I haven't seen any failures in our CI. Sadly we won't have the time to dig further into this anytime soon. :/ But it's good to see it's not related to the new kind of modals. |
Quite a bit of time has been passed by and we improved the tests and our WebDriver classic implementation a lot since then. I would suggest that we close this issue and if necessary file specific issues for flakiness as seen. |
In many recent PRs touching webdriver/tests/ I've seen that the wpt.fyi checks show many differences in test results for Firefox and Safari, with the largest differences typically for Safari. Example:
https://github.com/web-platform-tests/wpt/pull/28875/checks?check_run_id=2522115464
https://github.com/web-platform-tests/wpt/pull/28875/checks?check_run_id=2521309324
This makes it difficult to make changes to WebDriver tests with confidence, because it always looks like there are some regressions. I've had to result to comparing two sets of results with manually constructed wpt.fyi URLs and filters to convince myself in a few occasions:
#28757 (comment)
#28789 (comment)
@burg @gsnedders @jgraham @whimboo is this something you observe in your own CI as well? If it could be made more consistent here in WPT's CI, the risk of regressing the tests accidentally would go down.
The text was updated successfully, but these errors were encountered: