-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
swarm: emit PeerConnectedness event from swarm instead of from hosts #1574
Conversation
@@ -112,6 +112,9 @@ var _ host.Host = (*BasicHost)(nil) | |||
// HostOpts holds options that can be passed to NewHost in order to | |||
// customize construction of the *BasicHost. | |||
type HostOpts struct { | |||
// EventBus sets the event bus. Will construct a new event bus if omitted. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I feel uneasy about this.
Maybe we should add the EventBus method to the Network interface and forward from the host so that we cant end up with two of them.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not really how this could happen, but we can make this change.
Can we defer this until we consolidated all the other repos as well (#1566)? It's happening in the next release.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Another option would be to just make this entire thing private.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not a big fan of having both a Config
and HostOpts
. I expect us to merge these two (or refactor them) in some other way when we start using Fx for service construction (#1993). I suggest not making any big changes in this PR.
p2p/net/swarm/swarm.go
Outdated
@@ -310,6 +321,13 @@ func (s *Swarm) addConn(tc transport.CapableConn, dir network.Direction) (*Conn, | |||
}) | |||
c.notifyLk.Unlock() | |||
|
|||
if isFirstConn { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Are we sure we want this outside of the notifyLk
? That changes the behavior from what it used to be (inside the notifyAll).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, I think this needs to be inside. We need to make sure that the disconnect comes after the connect event.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hm. Actually, I'm not sure if this lock helps us here. We need to make this guarantee across all connections.
p2p/net/swarm/swarm.go
Outdated
s.conns.Unlock() | ||
|
||
if disconnected { | ||
s.emitter.Emit(event.EvtPeerConnectednessChanged{ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What's the reason to do this here instead of near the old place where this happened: https://github.com/libp2p/go-libp2p/blob/marco/1575/p2p/net/swarm/swarm_conn.go#L85 ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Probably to ensure that we only emit the event on the last disconnect. But it does look like there's a bit of a race and we can emit a disconnect after a connect (looks like we already have that race, actually).
So, I think the only way to sanely do this is to write something like: https://github.com/ipfs/go-bitswap/pull/565/files. We probably don't want to block some global queue while sending events. That is:
Even if this background process falls behind (someone is blocking events), that's fine. We'll just always fire the "last" event. This does mean we can get unmatched connected/disconnected events, but, IMO, that's fine. I.e.:
May be seen as:
Or
Or
Or
But never:
Importantly, the final state is always accurate, which is what users care about. At the moment, it's possible for the last event fired to be "connect", which could cause memory leaks. |
331b2b7
to
b66075a
Compare
Rebased this PR on top of the current master. I think emitting the event while holding the
The following sequence can show up in two different ways:
First way:
Second way (when the two connects are processed first):
I introduced the change in the last commit. @MarcoPolo, @Stebalien does this reasoning make sense to you? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Changes look good and reasoning makes sense.
Unfortunately, I can't find a way to test the deduplication (that no event is emitted when a second connection is dialed)
You could try this:
Make a new test transport that wraps an existing transport. Put two multiaddrs for this test transport (they might have to be slightly different if we're deduping these somewhere). In the test transport have them synchronize with a waitgroup to return a dial at the same time.
8a8ebcb
to
9c74a76
Compare
9c74a76
to
7889a65
Compare
That doesn't work either. The dial worker loop somewhere deduplicates. I have no intention of diving into that mess to figure out where. |
This reverts commit 82fbba8. The update breaks the event system. It likely happened after this libp2p/go-libp2p#1574, but we don't need to debug it right now, as a simple revert fixes this
Fixes #1571.
This was surprisingly easy. Unfortunately, I can't find a way to test the deduplication (that no event is emitted when a second connection is dialed), since the swarm is too good at preventing dials to hosts we're already connected to.