tests: stabilize shard locations earlier in test_scrubber_tenant_snapshot #10606

Merged: 2 commits, Feb 3, 2025


jcsp (Collaborator) commented Jan 31, 2025

Problem

This test would sometimes emit unexpected log messages from the storage controller's migration requests. For some time after a shard split, the controller moves load around, and those migrations could overlap with the test's pageserver restarts.

Example: https://neon-github-public-dev.s3.amazonaws.com/reports/pr-10602/13067323736/index.html#testresult/f66f1329557a1fc5/retries

Summary of changes

  • Do a reconcile_until_idle after shard split, so that the rest of the test doesn't run concurrently with migrations
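
For illustration only, the reconcile-until-idle pattern can be sketched roughly as below. This is not the test fixture's actual code: `FakeController`, its `reconcile_all` method, and the timeout and interval parameters are hypothetical stand-ins, assuming the controller can report how many shards still needed work on each pass.

```python
import time


class FakeController:
    """Hypothetical stand-in for the storage controller (not the real fixture)."""

    def __init__(self, pending_migrations: int) -> None:
        self.pending = pending_migrations

    def reconcile_all(self) -> int:
        """Run one reconcile pass and return how many shards still needed work."""
        needed = self.pending
        if self.pending > 0:
            self.pending -= 1  # pretend one migration completes per pass
        return needed


def reconcile_until_idle(controller, timeout_secs: float = 5.0, interval: float = 0.01) -> int:
    """Poll until a reconcile pass reports no work; raise if the deadline passes.

    Returns the number of passes that were needed (assumed for illustration).
    """
    deadline = time.monotonic() + timeout_secs
    passes = 0
    while True:
        passes += 1
        if controller.reconcile_all() == 0:
            return passes
        if time.monotonic() >= deadline:
            raise TimeoutError("controller did not become idle in time")
        time.sleep(interval)


# After a (simulated) shard split, wait for background migrations to settle
# before doing anything disruptive such as restarting pageservers.
controller = FakeController(pending_migrations=3)
passes = reconcile_until_idle(controller)
```

Waiting for an idle controller before the restarts means that any migration-related log lines are produced before the part of the test that checks for unexpected errors.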

jcsp added the a/test (Area: related to testing) and a/tech_debt (Area: related to tech debt) labels Jan 31, 2025
jcsp force-pushed the jcsp/scrubber-test-reconcile branch from 7ff6cf8 to 992615a January 31, 2025 11:11
jcsp requested a review from arssher January 31, 2025 11:11

github-actions bot commented Jan 31, 2025

6740 tests run: 6401 passed, 0 failed, 339 skipped (full report)


Code coverage* (full report)

  • functions: 33.3% (8526 of 25632 functions)
  • lines: 49.0% (71475 of 145930 lines)

* collected from Rust tests only


The comment gets automatically updated with the latest test results
39c9638 at 2025-02-03T09:07:55.830Z :recycle:

jcsp (Collaborator, Author) commented Jan 31, 2025

The test failure is #10609. That scenario may have become more likely as a result of the extra tenant movements that this PR does after shard split.

jcsp marked this pull request as ready for review January 31, 2025 14:46
jcsp enabled auto-merge February 3, 2025 08:12
jcsp added this pull request to the merge queue Feb 3, 2025
Merged via the queue into main with commit f071800 Feb 3, 2025
82 checks passed
jcsp deleted the jcsp/scrubber-test-reconcile branch February 3, 2025 09:04
winter-loo pushed a commit to winter-loo/neon that referenced this pull request Feb 4, 2025
…shot (neondatabase#10606)
