restore: split & scatter regions concurrently #27034

YuJuncen · 2021-08-09T11:23:11Z

What problem does this PR solve?

Previously, when restoring many small tables, the batcher would often send small batches because of its so-called AutoCommit feature. The intent was to keep the split, scatter, and restore workers busy.

But frequently sending small batches isn't free. The split step is costly and I/O-bound even for small batches. For example, splitting 60 ranges takes about 3s, but restoring those ranges typically takes only 1s. With a single split worker, the restore worker therefore sits idle most of the time, which slows down the whole restore.

What is changed and how it works?

Instead of using a single split worker, this PR allows multiple restore batches to be split concurrently.
We added two hidden flags, --batch-flush-interval and --pd-concurrency: the former tunes the behavior of the batcher, the latter adjusts the concurrency of splitting.
More logs were also added, so the table creation speed and the download and ingest time costs can be observed in the log.
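
The idea of splitting several batches at once can be sketched as a bounded worker pool. This is only an illustrative sketch under stated assumptions, not the code in this PR: `Batch`, `SplitFunc`, and `SplitConcurrently` are hypothetical names, and the `concurrency` argument merely stands in for the kind of limit that --pd-concurrency is described as tuning.

```go
package main

import "sync"

// Batch is a hypothetical stand-in for one restore batch.
type Batch struct {
	ID     int
	Ranges int // number of key ranges that need to be split
}

// SplitFunc is a hypothetical hook that splits and scatters the regions
// for a single batch. It is an assumption of this sketch.
type SplitFunc func(b Batch) error

// SplitConcurrently runs split on every batch with at most `concurrency`
// calls in flight. It returns the batches that were split successfully
// (in completion order) and the first error observed, if any.
func SplitConcurrently(batches []Batch, concurrency int, split SplitFunc) ([]Batch, error) {
	if concurrency < 1 {
		concurrency = 1
	}
	sem := make(chan struct{}, concurrency) // counting semaphore
	var (
		wg       sync.WaitGroup
		mu       sync.Mutex
		ready    []Batch
		firstErr error
	)
	for _, b := range batches {
		sem <- struct{}{} // blocks while `concurrency` splits are running
		wg.Add(1)
		go func(b Batch) {
			defer wg.Done()
			defer func() { <-sem }() // release the slot when done
			err := split(b)
			mu.Lock()
			defer mu.Unlock()
			if err != nil {
				if firstErr == nil {
					firstErr = err
				}
				return
			}
			ready = append(ready, b)
		}(b)
	}
	wg.Wait()
	return ready, firstErr
}
```

With `concurrency` set to 1 this degenerates to the single split worker described above; larger values let slow, I/O-bound splits overlap so that the restore worker has batches to consume sooner.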

Check List

Tests

Integration test
Manual test (add detailed scripts or steps below)
An internal test shows that, on a 190GB, 6000-table workload, this PR speeds up the restore: the original version takes over 2 hours, while this version takes about 30 minutes. The latter is nearly equal to the time cost of creating the tables (see figure below).

Release note

Restoring many small tables is now faster.

ti-chi-bot · 2021-08-09T11:23:12Z

[REVIEW NOTIFICATION]

This pull request has been approved by:

Little-Wallace
kennytm

To complete the pull request process, please ask the reviewers in the list to review by filling /cc @reviewer in the comment.
After your PR has acquired the required number of LGTMs, you can assign this pull request to the committer in the list by filling /assign @committer in the comment to help you merge this pull request.

The full list of commands accepted by this bot can be found here.

Reviewer can indicate their review by submitting an approval review.
Reviewer can cancel approval by submitting a request changes review.

YuJuncen · 2021-08-09T11:24:00Z

/sig migrate

YuJuncen · 2021-08-09T11:24:23Z

/cc Little-Wallace
/cc 3pointer

3pointer · 2021-08-09T11:57:43Z

/run-integration-tests

3pointer · 2021-08-09T12:12:37Z

/run-integration-tests

kennytm · 2021-08-12T17:35:30Z

/component br

YuJuncen · 2021-08-16T03:24:52Z

/run-integration-test

YuJuncen · 2021-08-16T03:25:09Z

/run-integration-tests

YuJuncen · 2021-08-16T07:16:07Z

/run-check_dev_2

So many unit tests >= 5s...

Little-Wallace

LGTM

kennytm

rest LGTM

br/pkg/restore/import.go

Co-authored-by: kennytm

YuJuncen · 2021-08-26T03:13:23Z

/merge

ti-chi-bot · 2021-08-26T03:13:25Z

This pull request has been accepted and is ready to merge.

Commit hash: afe9bd3

ti-chi-bot · 2021-08-26T03:13:35Z

@YuJuncen: Your PR was out of date, I have automatically updated it for you.

At the same time I will also trigger all tests for you:

/run-all-tests

If the CI test fails, you just re-trigger the test that failed and the bot will merge the PR for you after the CI passes.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the ti-community-infra/tichi repository.

YuJuncen added 2 commits August 9, 2021 19:15

squash merge conc-split

d5cddad

restore: fix build

7a3110b

ti-chi-bot added the sig/migrate label Aug 9, 2021

ti-chi-bot requested review from 3pointer and Little-Wallace August 9, 2021 11:24

Merge branch 'master' into conc-split

33c9be9

YuJuncen added 7 commits August 10, 2021 14:15

restore: decrease the PD concurrenct, and increase the batch flush time

c2aeba6

Merge branch 'conc-split' of https://github.com/yujuncen/tidb into co…

6651020

…nc-split

debug

6755218

debug

e67b458

restore: retry hole in regions

393e22b

Merge branch 'master' of https://github.com/pingcap/tidb into conc-split

8b4856b

ddl: revert ddl change

4d7a945

ti-chi-bot added size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. and removed size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Aug 12, 2021

ti-chi-bot added the component/br This issue is related to BR of TiDB. label Aug 12, 2021

*: remove the complex hole-fine

a98ba23

ti-chi-bot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Aug 15, 2021

YuJuncen added 2 commits August 16, 2021 09:48

restore: revert retry-relative changes

7e60bfb

Merge branch 'master' of https://github.com/pingcap/tidb into conc-split

9882e22

YuJuncen force-pushed the conc-split branch from 4aa1ad3 to b5a4ce7 Compare August 16, 2021 02:24

ti-chi-bot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. labels Aug 16, 2021

YuJuncen and others added 2 commits August 16, 2021 10:33

restore: mute some verbose logs

1715381

Merge branch 'master' into conc-split

470323b

Merge branch 'master' of https://github.com/pingcap/tidb into conc-split

e437a8f

Merge branch 'master' into conc-split

af51ea6

Little-Wallace approved these changes Aug 17, 2021

ti-chi-bot added the status/LGT1 Indicates that a PR has LGTM 1. label Aug 17, 2021

YuJuncen and others added 4 commits August 19, 2021 10:55

*: disable concurrent split by default

545b061

Merge branch 'conc-split' of https://github.com/yujuncen/tidb into co…

4934af2

…nc-split

Merge branch 'master' into conc-split

f012d4b

Merge branch 'master' into conc-split

c5fc5b9

kennytm approved these changes Aug 25, 2021

br/pkg/restore/import.go (outdated)

br/pkg/restore/import.go (outdated)

ti-chi-bot added status/LGT2 Indicates that a PR has LGTM 2. and removed status/LGT1 Indicates that a PR has LGTM 1. labels Aug 25, 2021

kennytm mentioned this pull request Aug 25, 2021

restore: split & scatter regions concurrently pingcap/br#1363

Closed

Apply suggestions from code review

afe9bd3

Co-authored-by: kennytm

ti-chi-bot added the status/can-merge Indicates a PR has been approved by a committer. label Aug 26, 2021

Merge branch 'master' into conc-split

d7ed687

ti-chi-bot merged commit cdadfdb into pingcap:master Aug 26, 2021

joccau pushed a commit to joccau/tidb that referenced this pull request Sep 7, 2021

restore: split & scatter regions concurrently (pingcap#27034)

8414f0b
