Block sync hash traversal perf #3558

antiochp · 2021-02-03T22:48:56Z

Resolves #3554.

When we transition from "header sync" to "body sync" we identify missing blocks based on the delta between the chain of headers and the current chain of full blocks. i.e. We sync headers and then we go request the full blocks for each of those headers.
There is some additional complexity where we support a fork between the header chain and the full block chain as one is not always a subset of the other. And to do this we identify the "fork point" which is the last common header between header chain and full block chain before they begin to diverge.

Fork example -

Headers: A -> B -> C' -> D -> E

Full blocks: A -> B -> C

Header sync gave us a chain of headers building on C' while our local node was previously aware of C.
We need to "rewind" to the fork point B immediately prior to C and C' and request missing blocks [C', D, E].

The existing implementation rewinds back from the head of the header chain to identify both the fork point and a set of missing block hashes. This is fine if the delta between header chain and full block chain is small but is not suitable if the header chain must be rewound a significant number of blocks.

It fails badly in the "full archval node sync" scenario where we must rewind from height ~1,500,00 back to 1.

The PR introduces a more efficient approach -

identify height of full block chain
increase this height by the number of blocks we will request in parallel
lookup this "max" header directly on header chain via get_header_by_height()
Iterate back from this "max" header to the fork point.

This places an upper bound on the iteration required as we can lookup headers directly based on height.
This greatly reduces the amount of iteration required when identifying missing blocks to sync.

As part of this rework check_txhashset_needed() was refactored, splitting out get_fork_point() and check_txhashset_needed() into separate fns.
This allowed the implementation to be simplified significantly.

antiochp · 2021-02-15T13:47:34Z

Going to merge this to master.
We need to make sure we test sync thoroughly as part of the next release.

* sync traversal performance improvements * rework how body_sync determines block hashes to request

antiochp marked this pull request as ready for review February 9, 2021 09:29

antiochp added 2 commits February 9, 2021 12:34

wip - sync traversal performance improvements

1b2bae9

rework how body_sync determines block hashes to request

a69fd05

antiochp force-pushed the block_sync_traversal_perf branch from 6c7700b to a69fd05 Compare February 9, 2021 12:34

antiochp merged commit 7649d36 into mimblewimble:master Feb 15, 2021

antiochp deleted the block_sync_traversal_perf branch February 15, 2021 13:48

antiochp mentioned this pull request May 6, 2021

5.1.0 Release Notes #3638

Open

bayk added a commit to mwcproject/mwc-node that referenced this pull request Jun 12, 2024

grin v5.3 (0034) Block sync hash traversal perf (mimblewimble#3558)

7eb94d9

* sync traversal performance improvements * rework how body_sync determines block hashes to request

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Block sync hash traversal perf #3558

Block sync hash traversal perf #3558

antiochp commented Feb 3, 2021 •

edited

Loading

antiochp commented Feb 15, 2021

Block sync hash traversal perf #3558

Block sync hash traversal perf #3558

Conversation

antiochp commented Feb 3, 2021 • edited Loading

antiochp commented Feb 15, 2021

antiochp commented Feb 3, 2021 •

edited

Loading