Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

timeout after 15 minutes waiting for the NNC reconciler to start #1861

Merged
merged 3 commits into from
Mar 31, 2023

Conversation

rbtr
Copy link
Contributor

@rbtr rbtr commented Mar 20, 2023

Reason for Change:

Currently we wait indefinitely for the NNC reconciler to receive an NNC and run the first reconcile before continuing execution (starting the CNS webserver, IPAM pool monitor, etc).
Instead, if the NNC Reconciler does not get an NNC and successfully run a reconcile within 15 minutes, timeout and fail to make it evident that CNS is encountering a fatal error/stuck.

Issue Fixed:

Requirements:

Notes:

@rbtr rbtr requested a review from a team as a code owner March 20, 2023 21:25
@rbtr rbtr requested review from ramiro-gamarra and neaggarwMS and removed request for a team March 20, 2023 21:25
@rbtr rbtr self-assigned this Mar 20, 2023
@rbtr rbtr added do-not-merge cns Related to CNS. labels Mar 20, 2023
@rbtr rbtr force-pushed the nnc-ready-timeout branch from 2bec491 to e587ddf Compare March 21, 2023 18:03
@rbtr rbtr removed the do-not-merge label Mar 27, 2023
@rbtr rbtr force-pushed the nnc-ready-timeout branch from e587ddf to 7a99c02 Compare March 27, 2023 18:36
@rbtr rbtr enabled auto-merge (squash) March 27, 2023 18:41
@rbtr rbtr requested a review from csfmomo March 28, 2023 20:44
@rbtr rbtr force-pushed the nnc-ready-timeout branch from 7a99c02 to caeca58 Compare March 29, 2023 17:29
@rbtr rbtr requested a review from thatmattlong March 29, 2023 17:29
@rbtr rbtr force-pushed the nnc-ready-timeout branch from caeca58 to 35a329d Compare March 29, 2023 22:02
@rbtr rbtr merged commit 18d4c70 into Azure:master Mar 31, 2023
@rbtr rbtr deleted the nnc-ready-timeout branch March 31, 2023 18:11
rbtr added a commit that referenced this pull request Sep 8, 2023
* timeout after 15 minutes waiting for the NNC reconciler to start

Signed-off-by: Evan Baker <[email protected]>

* log error and retry instead of crashing out

Signed-off-by: Evan Baker <[email protected]>

* add metric for nnc reconciler failed to start

Signed-off-by: Evan Baker <[email protected]>

---------

Signed-off-by: Evan Baker <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cns Related to CNS.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants