Correctly implement kademlia bootstrapping algorithm #375

Stebalien · 2019-07-12T05:10:41Z

We currently bootstrap:

A random key.
Our own key.

However, this doesn't keep the entire routing table full. The correct algorithm is to:

Keep track of the last time we've searched for a key in each of the k buckets.
If we don't search for a key in a single bucket for an hour, pick a random key in that bucket and search for it.

See the end of section 2.3 in the paper.

Stebalien · 2019-07-12T05:12:10Z

@aarshkshah1992 want to try this one?

aarshkshah1992 · 2019-07-12T05:25:19Z

@Stebalien Perfect. Let me take a look at the relevant paper section/code & come up with something.
Can you assign it to me ?

aarshkshah1992 · 2019-07-13T13:43:30Z

@Stebalien

Currently, for bootstrapping, we query for self & a random nodeID once every 5 minutes(by default).

If I understand your comment/the code & this section in the paper correctly, we will now:

Query for self only once when bootstrap is initiated & not every 5 minutes
Query only once for a random id in each of the k-buckets created because of 1).... except for the nearest(last bucket in the routing table) k-bucket. Look at the following line in the paper:

Finally, u refreshes all k-buckets further away than it's closet neighbour

Keep track of which k-buckets get queried by hooking onto the 'dht.findPeerSingle' call
Query for a random nodeID in a k-bucket that does not get queried for more than an hour in 3)

Wdyt ?

aarshkshah1992 · 2019-07-18T13:50:59Z

@Stebalien

So the k-bucket now looks like

type Bucket struct {
......

lastQueriedAtLk sync.RWMutex  
lastQueriedAt time.Time // set to the current time when a new bucket is created
}

func (b *Bucket) lastQueriedAt() time.Time {
    
}

func(b *Bucket) resetLastQueriedAt(newTime) {

}

We add a function to the routing table

func(r *RoutingTable) genRandPeerID(bucketID) peer.ID {
   if (bucketID == len(r.buckets) - 1) {
       Generated a peerID where ATLEAST bucketID bits match the host
   } else {
       Generate a peerID where EXACTLY bucketID bits match the host
   }
}

DHT.Bootstrap now has a dedicated go-routine that periodically scans the routing table to identify buckets that haven't been queried since an hour -> does a random walk for them -> resets the timer.

Two important questions are :

What should be the scan interval ? Should we do it say every 30 minutes ?
Is hooking onto dht.findPeerSingle a reasonable way to identify the peers that get queried so we can reset the timer for their buckets ?

Stebalien · 2019-07-25T03:50:02Z

Exactly.

However, in the periodic query, I'd keep the self query. That is, every hour, query self to determine the nearest bucket (there may be a closer bucket that we're simply not aware of), then proceed normally.

What should be the scan interval ? Should we do it say every 30 minutes ?

30m sounds fine. That gives us the hour delay.

Is hooking onto dht.findPeerSingle a reasonable way to identify the peers that get queried so we can reset the timer for their buckets ?

Yes.

I apologize for the insane delay. I was stalling trying to avoid context switching away from some other tasks till they were done. Thanks for the frequent reminders and please continue to bug me if I do this again.

aarshkshah1992 · 2019-10-11T05:41:33Z

@Stebalien This issue can now be closed.

Stebalien self-assigned this Jul 12, 2019

Stebalien assigned aarshkshah1992 and unassigned Stebalien Jul 12, 2019

aarshkshah1992 mentioned this issue Aug 1, 2019

k-bucket support for proper kad bootstrapping libp2p/go-libp2p-kbucket#38

Merged

aarshkshah1992 mentioned this issue Aug 19, 2019

Feature/correct bootstrapping #384

Merged

Stebalien closed this as completed Oct 11, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Correctly implement kademlia bootstrapping algorithm #375

Correctly implement kademlia bootstrapping algorithm #375

Stebalien commented Jul 12, 2019

Stebalien commented Jul 12, 2019

aarshkshah1992 commented Jul 12, 2019

aarshkshah1992 commented Jul 13, 2019 •

edited

Loading

aarshkshah1992 commented Jul 18, 2019 •

edited

Loading

Stebalien commented Jul 25, 2019

aarshkshah1992 commented Oct 11, 2019

Correctly implement kademlia bootstrapping algorithm #375

Correctly implement kademlia bootstrapping algorithm #375

Comments

Stebalien commented Jul 12, 2019

Stebalien commented Jul 12, 2019

aarshkshah1992 commented Jul 12, 2019

aarshkshah1992 commented Jul 13, 2019 • edited Loading

aarshkshah1992 commented Jul 18, 2019 • edited Loading

Stebalien commented Jul 25, 2019

aarshkshah1992 commented Oct 11, 2019

aarshkshah1992 commented Jul 13, 2019 •

edited

Loading

aarshkshah1992 commented Jul 18, 2019 •

edited

Loading