[poc]blockchain: Load blockchain nodes in parallel #918

matheusd · 2017-11-14T12:31:20Z

This adds a new function on the blockchain to load all nodes from the block database during startup at once, in a parallelized fashion.

It also adds an index to track the SpentTicketsInBlock for every block on the database. This speeds up startup time by removing the need to load and decode all blocks in order to extract ticket information needed to calculate the stake difficulty.

Closes #913
Requires #915

Makes more sense to have this information in the stake package and will be needed in the next refactor stage.

matheusd · 2017-11-14T12:42:29Z

This is being sent as separate from #915 because involves deeper changes in the startup procedure.

Difference in time between versions (in seconds, on my machine, with 6 cores running the parallelized version):

| variation                          | mainnet | testnet |
| vanilla (#e145193)                 |     40  |      80 |
| w/ ticketsInBlockInfo (#915)       |     26  |      47 |
| w/ index + serial LoadAllBlocks    |     40  |      36 |
| w/ index + parallel LoadAllBlocks  |     17  |      17 |
| same as above + custom encoding    |     11  |       8 |

Marked as [poc] for the moment, because I need feedback on a few issues to continue this line of work. I expect the code itself to be heavily changed and refactored. I'm sending the work as a single function (LoadAllBlocksByBatchHeader) to make it easier to reason about, but expect to refactor into many subfunctions/different packages.

Specific stuff I need feedback on:

Is it ok to add a new index to store the SpentTicketsInBlock for every block, to speed up the startup time?
Should this index be entirely self-contained (as in the poc) or should I add it as an extra index (in the blockchain/indexers package)?
Should I add a config flag to enable/disable usage of this index?
Is it ok to load the nodes in a parallelized fashion?
Should I add a config flag to specify how many workers to use in the parallelized version?

matheusd · 2017-11-15T11:06:14Z

FYI, switching from gson to a custom binary encoding of SpentTicketsInBlock further reduces the time by about 50%. Decrediton on testnet for development loads almost instantly now.

dajohi · 2017-11-16T17:50:31Z

tests are failing on travis...

matheusd · 2017-11-17T11:17:58Z

Ok, I fixed the travis problem, but right now I'm not looking for a thorough review of the code, just really an answer whether the approach of adding a new index to speed up the startup time would be acceptable or not.

jpz · 2017-11-17T11:23:08Z

Hi - how are the before/after tests set up? Are you looking at specific markers in the logs? I'm happy to take a look.

matheusd · 2017-11-17T11:32:25Z

Yeah, look at the logs. Start a vanilla dcrd, it will hang after "Chain State: Height...". Without this patch, the next line in the logs will be "RPC server listening on...".

Check the difference in load time on current master vs this patch applied. The chain should be synced to tip before these tests.

davecgh · 2017-11-29T18:53:15Z

Thanks for the PR and I'll have to take a closer look at this before commenting definitively, but a quick look at this says it's going in the wrong direction as compared to upstream.

The upstream code has significantly sped up operation by keep all headers in memory and, more importantly as it pertains to this, reworking the way the headers are stored to and loaded from the database. This seems to completely ignore that work and go in a direction that would be quite difficult to reconcile with that work. What I'm concerned about is making it a lot harder to get the significantly better upstream code merged due to ignoring it and just going in a different direction here.

I should also note that part of the challenge for Decred is that it has some additional requirements in terms of needing pruneable stake nodes, so there will need to be some additional work done in that regard.

matheusd · 2017-11-30T06:18:51Z

Thanks for the update! I'm not as familiar with the upstream code and practices compared to dcrd, so can you tell me whether that is an ongoing work tracked by some issue/PR/discussion somewhere that I can follow?

What would be the appropriate way to make these sorts of contributions in the future? Validate they are a problem in btcd and send them there first?

davecgh · 2017-11-30T21:10:16Z

The relevant PRs start stomewhere around btcsuite/btcd#913. You'll notice there are a lot of them related to blockchain, chainviews, headers, etc.

davecgh · 2018-01-29T19:25:09Z

See the recent work in #988, #989, #990 which is starting to move the code closer to the upstream code.

matheusd · 2018-01-29T19:31:07Z

👍
Once all these new refactoring PRs are in, I'll make another try.

davecgh · 2018-01-29T19:36:19Z

Cool, thanks. Also, for reference, btcsuite/btcd#919, btcsuite/btcd#1010, and btcsuite/btcd#1014 are pretty significant changes that play into this and will be making their way into dcrd.

matheusd · 2018-02-22T18:19:50Z

superseded

matheusd added 11 commits November 9, 2017 09:21

Add instrumentation

59043f2

use ticketsInBlock instead of individual functions

92bf714

Improve formatting

f0dafc9

Move VoteVersionTuple from blockchain to blockchain/stake

5ed4bef

Makes more sense to have this information in the stake package and will be needed in the next refactor stage.

Move FindSpentTicketsInBlock to stake package

8b7323c

Refactor newBlockNode to use FindSpentTicketsInBlock

1177381

Remove instrumentation

0a1fb36

Remove unused import

0f04341

Another fix on tests

c765731

Load all block nodes from DB before calculating stake difficulty

0151a1a

Parallelized version of LoadAllBlocksByBatchHeader()

87bd958

Fix small bug

288b749

Switch to binary encoding SpentTicketsInBlock

79016e9

matheusd mentioned this pull request Jan 11, 2018

[WIP] Add fullticketindex skeleton #964

Closed

matheusd closed this Feb 22, 2018

matheusd deleted the load-node-index-parallel branch November 25, 2020 21:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[poc]blockchain: Load blockchain nodes in parallel #918

[poc]blockchain: Load blockchain nodes in parallel #918

matheusd commented Nov 14, 2017

matheusd commented Nov 14, 2017 •

edited

Loading

matheusd commented Nov 15, 2017

dajohi commented Nov 16, 2017

matheusd commented Nov 17, 2017

jpz commented Nov 17, 2017

matheusd commented Nov 17, 2017

davecgh commented Nov 29, 2017

matheusd commented Nov 30, 2017

davecgh commented Nov 30, 2017

davecgh commented Jan 29, 2018

matheusd commented Jan 29, 2018

davecgh commented Jan 29, 2018 •

edited

Loading

matheusd commented Feb 22, 2018

[poc]blockchain: Load blockchain nodes in parallel #918

[poc]blockchain: Load blockchain nodes in parallel #918

Conversation

matheusd commented Nov 14, 2017

matheusd commented Nov 14, 2017 • edited Loading

matheusd commented Nov 15, 2017

dajohi commented Nov 16, 2017

matheusd commented Nov 17, 2017

jpz commented Nov 17, 2017

matheusd commented Nov 17, 2017

davecgh commented Nov 29, 2017

matheusd commented Nov 30, 2017

davecgh commented Nov 30, 2017

davecgh commented Jan 29, 2018

matheusd commented Jan 29, 2018

davecgh commented Jan 29, 2018 • edited Loading

matheusd commented Feb 22, 2018

matheusd commented Nov 14, 2017 •

edited

Loading

davecgh commented Jan 29, 2018 •

edited

Loading