Add ReconnectModifyIndex to handle reconnect lifecycle #14948

Closed · wants to merge 2 commits

Conversation

@DerekStrickland (Contributor) commented Oct 18, 2022

Closes #14925

This PR fixes a bug where, if an allocation with max_client_disconnect configured is on a node that disconnects and later reconnects, subsequent jobspec changes for that job are ignored until the max_client_disconnect interval expires. Prior to this change, Allocation.Reconnected naively checked only the time of the last reconnect event against the expiry.

This PR:

  • Adds a ReconnectModifyIndex field to the Allocation struct.
  • Updates the alloc runner to update the alloc ReconnectModifyIndex when a reconnect is processed by the client
  • Modifies Client.allocSync to send the ReconnectModifyIndex when syncing client managed attributes
  • Modifies Node.UpdateAlloc to persist the incoming ReconnectModifyIndex when generating reconnect evals
  • Renames Allocation.Reconnected to Allocation.IsReconnecting
  • Refactors Allocation.IsReconnecting to compare the ReconnectModifyIndex to the AllocModifyIndex to determine if an allocation is reconnecting (see the sketch after this list)
  • Updates all related code to match the new name and test the new logic
  • Updates GenericScheduler.computeJobAllocs to reset the ReconnectModifyIndex to 0 when processing reconnectUpdates and appends them to Plan.NodeAllocation so that the updates get persisted
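For illustration, here is a minimal sketch of the new comparison, using a stand-in type with only the relevant fields. The exact predicate (non-zero and equal to AllocModifyIndex) is an assumption based on the description above, not the PR's literal code.

package main

import "fmt"

// Stand-in for structs.Allocation with only the fields involved in
// the reconnect check; the real struct has many more fields.
type Allocation struct {
	// AllocModifyIndex is bumped by the server on each alloc update.
	AllocModifyIndex uint64
	// ReconnectModifyIndex is stamped by the client when it processes
	// a reconnect and reset to 0 by the scheduler once handled.
	ReconnectModifyIndex uint64
}

// IsReconnecting reports whether a reconnect is still pending: the
// client-stamped index still matches the alloc's AllocModifyIndex.
func (a *Allocation) IsReconnecting() bool {
	return a.ReconnectModifyIndex != 0 &&
		a.ReconnectModifyIndex == a.AllocModifyIndex
}

func main() {
	a := &Allocation{AllocModifyIndex: 42, ReconnectModifyIndex: 42}
	fmt.Println(a.IsReconnecting()) // true until the scheduler resets the index
}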

@DerekStrickland self-assigned this Oct 18, 2022
@tgross self-requested a review Oct 18, 2022
@DerekStrickland changed the title "Update alloc ModifyTime when reconnect is processed" to "Add ReconnectModifyIndex to handle reconnect lifecycle" Oct 21, 2022
@DerekStrickland marked this pull request as ready for review Oct 21, 2022
@DerekStrickland added the theme/edge, type/bug, backport/1.3.x (backport to 1.3.x release line), and backport/1.4.x (backport to 1.4.x release line) labels Oct 21, 2022
@DerekStrickland added this to the 1.4.2 milestone Oct 21, 2022
@tgross (Member) left a comment

@lgfa29 it looks like there are some correctness issues here around the state store; let's chat internally about carrying this PR.

Comment on lines +9733 to +9734
// ReconnectModifyIndex is used to determine if the server has processed the node reconnect.
ReconnectModifyIndex uint64
@tgross (Member)

We should make sure this gets onto the api.Allocation struct as well.
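For reference, a hedged sketch of what that could look like; the api package's real Allocation type has many more fields, and the placement shown here is illustrative.

package api

// Allocation mirrors the server-side structs.Allocation for API
// consumers. Only the fields relevant to this PR are shown.
type Allocation struct {
	ID               string
	AllocModifyIndex uint64
	// ReconnectModifyIndex would mirror the new server-side field so
	// API consumers can observe reconnect handling (sketch only, not
	// the actual change).
	ReconnectModifyIndex uint64
}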

Comment on lines +1351 to +1352
// Set the reconnect modify index so that the scheduler can track that the reconnect has not been processed.
alloc.ReconnectModifyIndex = ar.Alloc().AllocModifyIndex
@tgross (Member)

This value comes from the server:

// AllocModifyIndex is not updated when the client updates allocations. This
// lets the client pull only the allocs updated by the server.

But that made me remember that there are two code paths in the state store for updating allocations: one for upserting allocs from the server and one for updating allocs from the client. In any case, neither of them handles the ReconnectModifyIndex field, because for existing allocations (which is what we care about here) we copy the existing Allocation and then merge the needed fields over before inserting.

So we're not actually updating this field in Nomad's state.
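A minimal sketch of the copy-then-merge pattern being described, and where the new field would have to be merged; the helper name and field set are illustrative, not the actual state store code.

package main

import "fmt"

// Minimal stand-in for structs.Allocation (illustrative fields only).
type Allocation struct {
	ClientStatus         string
	ReconnectModifyIndex uint64
}

// mergeClientUpdate sketches the client-update path: copy the stored
// allocation, merge only the client-authoritative fields onto the
// copy, then insert the copy. Without an explicit merge line for
// ReconnectModifyIndex, the field silently keeps the stored value,
// which is the gap being pointed out here.
func mergeClientUpdate(existing, update *Allocation) *Allocation {
	merged := *existing // never mutate the object the state store holds
	merged.ClientStatus = update.ClientStatus
	merged.ReconnectModifyIndex = update.ReconnectModifyIndex // the missing merge
	return &merged
}

func main() {
	stored := &Allocation{ClientStatus: "unknown"}
	update := &Allocation{ClientStatus: "running", ReconnectModifyIndex: 42}
	fmt.Println(mergeClientUpdate(stored, update).ReconnectModifyIndex) // 42
}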

@DerekStrickland (Contributor, Author)

So in client.go at line 2033 it gets added to the stripped alloc during allocSync. That then gets sent to Node.UpdateAlloc, which updates state and triggers an eval. When the eval fires, the index is set; we then have to unset it when applying the plan. Have you tried it out? I had logging in here during development showing it all flowed through.
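For context, a hedged sketch of that stripped update; the helper name and field set are illustrative, and the real client code copies more client-managed fields (task states, deployment status, and so on).

package client

// Allocation is a minimal stand-in for structs.Allocation.
type Allocation struct {
	ID                   string
	NodeID               string
	ClientStatus         string
	ReconnectModifyIndex uint64
}

// stripAllocForSync sketches the stripped update allocSync sends:
// only client-managed fields, now carrying ReconnectModifyIndex so
// Node.UpdateAlloc can persist it and generate the reconnect eval.
func stripAllocForSync(alloc *Allocation) *Allocation {
	return &Allocation{
		ID:                   alloc.ID,
		NodeID:               alloc.NodeID,
		ClientStatus:         alloc.ClientStatus,
		ReconnectModifyIndex: alloc.ReconnectModifyIndex,
	}
}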

@tgross (Member)

> which then updates state

That's the bit where I don't see how it's happening. Any update of an existing object takes a copy first (ref state_store.go#L3474) and then modifies the copy before inserting it. So if we haven't pulled in information that the client is authoritative on, the state isn't getting updated for the transaction.

I haven't had a chance to test it out thoroughly (still trying to get 1.4.2 out! 😁), but I suspect the reason it's "working" right now is the state store corruption on line 1223. That will appear correct under some circumstances but won't have gone through raft correctly.

@@ -1220,6 +1220,7 @@ func (n *Node) UpdateAlloc(args *structs.AllocUpdateRequest, reply *structs.Gene
 if evalTriggerBy != structs.EvalTriggerJobDeregister &&
 	alloc.ClientStatus == structs.AllocClientStatusUnknown {
 	evalTriggerBy = structs.EvalTriggerReconnect
+	alloc.ReconnectModifyIndex = allocToUpdate.ReconnectModifyIndex
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This assignment corrupts the state store because alloc hasn't been copied after being queried from the state store. I'm fairly certain this line isn't needed at all, since allocToUpdate is what gets added to the batch of updates, not alloc.
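To make the hazard concrete, a minimal sketch of the unsafe write and the usual copy-first remedy, using a stand-in type; the real structs.Allocation.Copy is a deep copy.

package state

// Allocation is a minimal stand-in; structs.Allocation has many more
// fields and a deep Copy method.
type Allocation struct {
	ReconnectModifyIndex uint64
}

// Copy returns a shallow copy (the real Copy is deeper).
func (a *Allocation) Copy() *Allocation {
	c := *a
	return &c
}

// applyUpdate shows the safe pattern. Writing through fromState would
// mutate the object the state store still holds, outside a properly
// raft-applied transaction; copying first avoids that.
func applyUpdate(fromState, update *Allocation) *Allocation {
	// Unsafe: fromState.ReconnectModifyIndex = update.ReconnectModifyIndex
	safe := fromState.Copy()
	safe.ReconnectModifyIndex = update.ReconnectModifyIndex
	return safe
}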

@tgross modified the milestones: 1.4.2, 1.4.x Oct 24, 2022
@tgross (Member) commented Oct 24, 2022

Per our discussion, moving this out of 1.4.2 so that we don't risk rushing it out.

@mikenomitch (Contributor) commented Oct 27, 2022

Passing in some feedback from a customer. I think it might be related to this underlying issue since it is max_client_disconnect related, but I am not sure.

I found a scenario where I see duplicated ALLOC_INDEXes in one job.
Below are the steps to reproduce (Nomad 1.3.5):

We have a job with count=2 and max_client_disconnect set, running as below:
NOMAD_ALLOC_INDEX=0 - NodeA
NOMAD_ALLOC_INDEX=1 - NodeB

We stop the Nomad agent on NodeA; after that we temporarily have 3 allocations:
NOMAD_ALLOC_INDEX=0 - NodeA (Unknown state)
NOMAD_ALLOC_INDEX=1 - NodeB
NOMAD_ALLOC_INDEX=0 - NodeC

When we start the agent on NodeA, we again have 2 allocations; the first was recovered:
NOMAD_ALLOC_INDEX=0 - NodeA
NOMAD_ALLOC_INDEX=1 - NodeB
NodeC - allocation terminated here - as expected

I change count from 2 to 3, and the 3rd allocation appears with the same ALLOC_INDEX as the first one!
NOMAD_ALLOC_INDEX=0 - NodeA
NOMAD_ALLOC_INDEX=1 - NodeB
NOMAD_ALLOC_INDEX=0 - NodeD

Apart from that, in the state with duplicated ALLOC_INDEXes, the exec feature in the Nomad UI stopped working properly (for the affected job only).

Does this seem related or should I make a new issue?

@lgfa29 (Contributor) commented Oct 27, 2022

I think I understand the problem now 😅

I have an alternative approach in #15068 that I think makes the disconnect/reconnect flows more similar, and therefore easier to understand, but it's still early work. I will keep investigating the problem to see which solution would be better.

@lgfa29 (Contributor) commented Oct 27, 2022

@mikenomitch I think this may be related to this problem. From our docs on NOMAD_ALLOC_INDEX:

> The index is unique within a given version of a job

I think #14925 may prevent the job version from changing, which means you could end up with reused indexes.

But it may be better to open a separate issue just in case. If it's the same problem we can close both issues.

@lgfa29 (Contributor) commented Nov 2, 2022

Closing this in favour of #15068.

Thanks for all the work and guidance on this issue, @DerekStrickland!

@lgfa29 closed this Nov 2, 2022
@DerekStrickland (Contributor, Author)

@lgfa29 I'm glad you found a good solution!

github-actions bot commented Mar 3, 2023

I'm going to lock this pull request because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active contributions.
If you have found a problem that seems related to this change, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

github-actions bot locked as resolved and limited conversation to collaborators Mar 3, 2023
Labels
backport/1.3.x (backport to 1.3.x release line), backport/1.4.x (backport to 1.4.x release line), theme/edge, type/bug
Development

Successfully merging this pull request may close these issues.

Job updates not applied to reconnected allocations
4 participants