Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CPDEV-93296]PAAS_check_failed_after_procedure_Kubernetes_upgrade #673

Merged
merged 8 commits into from
Jul 17, 2024

Conversation

nikhil1697
Copy link
Collaborator

Description

After Kubernetes Upgrade procedure check_paas indicates inconsistencies in the IP address management (IPAM) system used by Calico in Kubernetes cluster. Specifically, it identifies the following problems:

  1. IPs with missing handles: These are IP addresses that are allocated but do not have corresponding handles in the IPAM system.
  2. Handles with no matching IPs: These are handles in the IPAM system that do not have corresponding IP addresses allocated to them.

Solution

Added a function(release_calico_leaked_ips) in the upgrade procedure. This function should parse the output of the calicoctl ipam check --show-problem-ips command, identify the problematic IPs and handles, and release them automatically.
*

Test Cases

TestCase 1

Steps:

  1. Run kubemarine upgrade
  2. Run kubemarine check_paas

Results:

Before After
ipam check indicates some problems, for more info you can use calicoctl ipam check --show-problem-ips pass_check for Calico will be successful

Unit tests

Indicate new or changed unit tests and what they do, if any.

@nikhil1697 nikhil1697 marked this pull request as draft June 5, 2024 07:49
@nikhil1697 nikhil1697 marked this pull request as ready for review June 25, 2024 07:11
@koryaga koryaga requested a review from n549 June 25, 2024 07:26
@koryaga koryaga self-assigned this Jun 25, 2024
@koryaga koryaga added the bug Something isn't working label Jun 25, 2024
@koryaga koryaga requested review from koryaga and pranavcracker July 1, 2024 08:27
@nikhil1697 nikhil1697 marked this pull request as draft July 4, 2024 08:34
@vlar0816
Copy link
Contributor

vlar0816 commented Jul 4, 2024

task release_calico_leaked_ips fails with the following message

2024-07-04 08:28:29,803 INFO *** TASK release_calico_leaked_ips ***
2024-07-04 08:28:29,803 DEBUG Getting leaked ips...
2024-07-04 08:28:30,032 DEBUG Found 0 leaked ips
rm: cannot remove '/tmp/7cdc7a2183a94e69834f91e37f425ae9.json': No such file or directory
2024-07-04 08:28:30,068 CRITICAL FAILURE!
2024-07-04 08:28:30,068 CRITICAL TASK FAILED release_calico_leaked_ips
2024-07-04 08:28:30,068 CRITICAL KME0002: Remote group exception

@Netcracker Netcracker deleted a comment from pranavcracker Jul 11, 2024
@nikhil1697 nikhil1697 requested a review from koryaga July 11, 2024 20:42
@nikhil1697 nikhil1697 marked this pull request as ready for review July 12, 2024 04:46
@nikhil1697 nikhil1697 force-pushed the CPDEV-93296_PAAS_check_fails_after_Kubernetes_upgrade branch from 62ce018 to 7673afc Compare July 12, 2024 07:41
@koryaga koryaga force-pushed the CPDEV-93296_PAAS_check_fails_after_Kubernetes_upgrade branch from 18ef2a2 to d93b486 Compare July 17, 2024 13:34
@koryaga koryaga merged commit 561a641 into main Jul 17, 2024
42 checks passed
@koryaga koryaga deleted the CPDEV-93296_PAAS_check_fails_after_Kubernetes_upgrade branch July 17, 2024 14:08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants