
Memberlist revendor and optimizations #2040

Merged: 2 commits, Jan 24, 2018

Conversation


@fcrisciani fcrisciani commented Dec 19, 2017

  • Revendoring memberlist diff: hashicorp/memberlist@v0.1.0...master

  • Insert back failed nodes when join is notified by memberlist

  • Delete has the same behavior as create: a key that has already been deleted cannot be deleted again


codecov-io commented Dec 19, 2017

Codecov Report

❗ No coverage uploaded for pull request base (master@5ab4ab8).
The diff coverage is 79.16%.


@@            Coverage Diff            @@
##             master    #2040   +/-   ##
=========================================
  Coverage          ?   40.45%           
=========================================
  Files             ?      138           
  Lines             ?    22167           
  Branches          ?        0           
=========================================
  Hits              ?     8967           
  Misses            ?    11887           
  Partials          ?     1313
Impacted Files                  Coverage   Δ
networkdb/event_delegate.go     66.66%     <0%> (ø)
networkdb/delegate.go           74.35%     <100%> (ø)
networkdb/networkdb.go          66.2%      <80.95%> (ø)

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 5ab4ab8...44298c9.

}
// Update the entry
entry.ltime = nDB.tableClock.Increment()
entry.node = nDB.config.NodeID
@ddebroy (Contributor) commented Dec 21, 2017

The createOrUpdateEntry call below used to be within a Lock/Unlock. Was the locking unnecessary?

Is it possible that as the entry members (ltime/node/value) are being updated, a getEntry from a parallel thread might see inconsistent state for the entry, since nDB is not being locked?

@fcrisciani (Author)

The locking was there to protect the internal data structure.

Yes, what you are saying is correct, and it affects both the update and delete code paths changed here.
At the same time, the previous logic, which acquired the same lock twice, did not prevent the race between two creates for the same key.
I will try to come up with a unit test for that and provide a different fix.
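
A minimal standalone sketch of the point above (hypothetical types and names, not the networkdb code): build the entry completely and only publish it into the shared map while the lock is held, so a concurrent reader can only ever observe the old or the new entry, never a partially updated one.

package main

import (
	"fmt"
	"sync"
)

type entry struct {
	ltime uint64
	node  string
	value []byte
}

type store struct {
	sync.RWMutex
	entries map[string]*entry
}

// put publishes a fully-built entry under the write lock.
func (s *store) put(key string, e *entry) {
	s.Lock()
	defer s.Unlock()
	s.entries[key] = e
}

// get reads under the read lock and therefore only ever sees a complete entry.
func (s *store) get(key string) (*entry, bool) {
	s.RLock()
	defer s.RUnlock()
	e, ok := s.entries[key]
	return e, ok
}

func main() {
	s := &store{entries: map[string]*entry{}}
	// Build the new entry completely before publishing it.
	e := &entry{ltime: 42, node: "nodeA", value: []byte("v1")}
	s.put("key1", e)
	if got, ok := s.get("key1"); ok {
		fmt.Println(got.node, got.ltime)
	}
}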

reapTime: nDB.config.reapEntryInterval,
}
entry.ltime = nDB.tableClock.Increment()
entry.node = nDB.config.NodeID
Contributor

Is updating entry.node necessary in the delete path?

@fcrisciani (Author)

Yes, because a deletion has the same meaning as a write, so if node A deletes a key owned by node B, node A becomes the new owner of the key. (Consider that this is all theoretical, because the current use by libnetwork has a single writer per key.)
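
A small standalone sketch of that ownership rule (hypothetical names, not the DeleteEntry code): the delete is modelled as a write, so the deleting node stamps a new logical time and takes ownership of the tombstoned key.

package main

import "fmt"

type entry struct {
	ltime    uint64
	node     string
	deleting bool
}

// deleteEntry treats the delete like a write: the deleting node advances the
// logical clock and becomes the owner of the (now tombstoned) key.
func deleteEntry(clock *uint64, self string) entry {
	*clock++
	return entry{ltime: *clock, node: self, deleting: true}
}

func main() {
	clock := uint64(10)
	old := entry{ltime: 10, node: "nodeB"}
	tomb := deleteEntry(&clock, "nodeA")
	fmt.Printf("owner %s -> %s, ltime %d -> %d\n", old.node, tomb.node, old.ltime, tomb.ltime)
}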

@fcrisciani fcrisciani force-pushed the memberlist_revendor branch 2 times, most recently from b23c9f4 to 697b17b on December 21, 2017 22:43
@ddebroy (Contributor) left a comment

LGTM with one minor comment.

if !ok || network.leaving || !nodePresent {
// I'm out of the network OR the event owner is no longer part of the network, so do not propagate
return false
}

nDB.Lock()
Contributor

How about defer nDB.Unlock() right after nDB.Lock() here? That way there is no need to unlock in the exit scenario below and again at the very end.

@fcrisciani (Author)

I was simply trying to avoid holding the lock longer than necessary, e.g. during the dispatch of the event. It is for symmetry with the other methods: after the entry is inserted into the tree, you can release the lock and handle whatever operation follows, like sending the notification to the other nodes in the Create/Update/Delete case, or, as here, dispatching the event to the application.

Contributor

Sounds good w.r.t. symmetry.
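
For reference, a standalone sketch of the symmetry pattern described above (hypothetical types, not the networkdb code): hold the lock only while the shared table is updated, then dispatch the notification after releasing it.

package main

import (
	"fmt"
	"sync"
)

type db struct {
	mu      sync.Mutex
	entries map[string]string
	events  chan string
}

// handleEvent updates the shared state under the lock, then notifies
// listeners outside the lock so a slow consumer cannot block other
// readers/writers of the table.
func (d *db) handleEvent(key, value string) {
	d.mu.Lock()
	d.entries[key] = value
	d.mu.Unlock()

	d.events <- key
}

func main() {
	d := &db{entries: map[string]string{}, events: make(chan string, 1)}
	d.handleEvent("ep1", "10.0.0.2")
	fmt.Println(<-d.events, d.entries["ep1"])
}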

@selansen (Contributor)

LGTM

@fcrisciani (Author)

Needs a round of e2e tests before being merged.

@ddebroy (Contributor) commented Dec 22, 2017

LGTM

Flavio Crisciani added 2 commits January 23, 2018 14:22
diff: hashicorp/memberlist@v0.1.0...master

Relevant changes:
 - Calculate the dial timeout using the deadline
 - Reduce the LAN min suspicion multiplier
 - Fix a deadlock in the memberlist shutdown process

Signed-off-by: Flavio Crisciani <flavio.crisciani@docker.com>
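
As an illustration of the first bullet above (a sketch under assumed names, not the actual memberlist code): deriving the dial timeout from an absolute deadline rather than using a fixed value.

package main

import (
	"fmt"
	"time"
)

// dialTimeout derives the timeout from an absolute deadline, falling back to
// a default when no deadline is set or it has already passed (simplified).
func dialTimeout(deadline time.Time, fallback time.Duration) time.Duration {
	if deadline.IsZero() {
		return fallback
	}
	if remaining := time.Until(deadline); remaining > 0 {
		return remaining
	}
	return fallback
}

func main() {
	deadline := time.Now().Add(2 * time.Second)
	fmt.Println(dialTimeout(deadline, 10*time.Second))
}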
Avoid waiting for a double notification once a node rejoins; just
put it back into the active state. Waiting for a further message does not
really add anything to the safety of the operation: the source of truth
for the node status resides inside memberlist.

Signed-off-by: Flavio Crisciani <flavio.crisciani@docker.com>
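
A minimal sketch of the behavior this commit describes (hypothetical names, not the actual event_delegate.go code): on a join notification, a node currently tracked as failed is moved straight back to the active set instead of waiting for a second message.

package main

import "fmt"

type cluster struct {
	active map[string]bool
	failed map[string]bool
}

// notifyJoin reinstates a previously failed node as soon as memberlist
// reports the join, without waiting for any further notification.
func (c *cluster) notifyJoin(name string) {
	if c.failed[name] {
		delete(c.failed, name)
	}
	c.active[name] = true
}

func main() {
	c := &cluster{
		active: map[string]bool{},
		failed: map[string]bool{"node-2": true},
	}
	c.notifyJoin("node-2")
	fmt.Println(c.active["node-2"], c.failed["node-2"]) // true false
}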
@fcrisciani (Author)

Rebased on top of the latest master; the e2e tests passed.
