Error while joining already joined nodes #1461

Closed
MiroslavRepka opened this issue Aug 19, 2021 · 5 comments · Fixed by #1485

@MiroslavRepka

What happened:
When I applied an updated manifest for the cluster that adds new nodes, kubeone apply threw an error while trying to join nodes that were already joined.

What is the expected behavior:
Apply the updated manifest and join only the nodes that still need to be joined

How to reproduce the issue:

  1. Create a manifest for kubeone
  2. Run kubeone apply -m <manifest> -y
  3. Update the manifest - add new static nodes
  4. Run kubeone apply -m <manifest> -y
  5. Wait until kubeone joins the nodes
  6. See the error (a condensed command sketch of these steps is shown below)
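
A condensed sketch of the reproduction commands (the manifest name kubeone.yaml is a placeholder, not taken from the report):

kubeone apply -m kubeone.yaml -y   # step 2: initial provisioning
# step 3: edit kubeone.yaml and add new hosts under staticWorkers.hosts
#         (see the "Updated manifest" in the comments below)
kubeone apply -m kubeone.yaml -y   # step 4: re-apply with the new static nodes
# steps 5-6: on the affected 1.3.0 alpha builds, this second apply fails while
#            trying to join nodes that are already part of the cluster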

Anything else we need to know?
The bug no longer occurred after installing kubeone version 1.2.3.

Information about the environment:
KubeOne version (kubeone version): 1.3.0 (not sure which alpha, since kubeone was reinstalled)
Operating system: Pop!_OS 21.04
Provider you're deploying cluster on: Hetzner
Operating system you're deploying on: Ubuntu

@MiroslavRepka MiroslavRepka added the kind/bug Categorizes issue or PR as related to a bug. label Aug 19, 2021
@kron4eg
Member

kron4eg commented Aug 19, 2021

@MiroslavRepka, a few points would help to triage this issue:

  • Please provide the manifests, both the one before the edit and the one after
  • Please capture the verbose log from kubeone apply by using the additional -v flag and upload it as well

@MiroslavRepka
Author

Original manifest

apiVersion: kubeone.io/v1beta1
kind: KubeOneCluster
name: cluster

versions:
  kubernetes: 'v1.19.0'

clusterNetwork:
  cni:
    external: {}

cloudProvider:
  none: {}
  external: false

addons:
  enable: true
  path: "addons"

apiEndpoint:
  host: 'x.x.x.x'
  port: 6443

controlPlane:
  hosts:
  - publicAddress: 'x.x.x.x'
    privateAddress: '192.168.2.1'
    sshPrivateKeyFile: './private.pem'

staticWorkers:
  hosts:
  - publicAddress: 'x.x.x.x'
    privateAddress: '192.168.2.2'
    sshPrivateKeyFile: './private.pem'

machineController:
  deploy: false

Updated manifest

apiVersion: kubeone.io/v1beta1
kind: KubeOneCluster
name: cluster

versions:
  kubernetes: 'v1.19.0'

clusterNetwork:
  cni:
    external: {}

cloudProvider:
  none: {}
  external: false

addons:
  enable: true
  path: "addons"

apiEndpoint:
  host: 'x.x.x.x'
  port: 6443

controlPlane:
  hosts:
  - publicAddress: 'x.x.x.x'
    privateAddress: '192.168.2.1'
    sshPrivateKeyFile: './private.pem'

staticWorkers:
  hosts:
  - publicAddress: 'x.x.x.x'
    privateAddress: '192.168.2.2'
    sshPrivateKeyFile: './private.pem'
  - publicAddress: 'x.x.x.x'
    privateAddress: '192.168.2.3'
    sshPrivateKeyFile: './private.pem'

machineController:
  deploy: false

I will provide a full log later today, since I no longer have version 1.3.0 installed. Here is the error message that occurred for every node that was already joined.

ERRO[11:07:25 CEST] + export PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/snap/bin:/sbin:/usr/local/bin:/opt/bin
+ PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/snap/bin:/sbin:/usr/local/bin:/opt/bin
+ [[ -f /etc/kubernetes/admin.conf ]]
+ sudo kubeadm join --config=./kubeone/cfg/worker_1.yaml
error execution phase preflight: [preflight] Some fatal errors occurred:
        [ERROR FileAvailable--etc-kubernetes-kubelet.conf]: /etc/kubernetes/kubelet.conf already exists
        [ERROR Port-10250]: Port 10250 is in use
        [ERROR FileAvailable--etc-kubernetes-pki-ca.crt]: /etc/kubernetes/pki/ca.crt already exists
[preflight] If you know what you are doing, you can make a check non-fatal with `--ignore-preflight-errors=...`
To see the stack trace of this error execute with --v=5 or higher

@kron4eg

@kron4eg
Member

kron4eg commented Aug 19, 2021

@xmudrii didn't we recently remove some small short-circuit "check & exit"? It seems like that is what happened here. KubeOne should check whether a node is already initialized and skip kubeadm join in that case (a rough sketch of such a guard follows below).
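
For illustration only (a hypothetical guard, not the actual fix from #1485): a check like the following before the join step would skip nodes that already carry a kubelet config, the same file kubeadm's preflight complains about in the error output above.

# Hypothetical pre-join guard; assumes /etc/kubernetes/kubelet.conf is only
# present on nodes that have already joined the cluster.
if sudo test -f /etc/kubernetes/kubelet.conf; then
    echo "node already joined, skipping kubeadm join"
else
    sudo kubeadm join --config=./kubeone/cfg/worker_1.yaml
fi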

@kron4eg kron4eg added this to the KubeOne 1.3 milestone Aug 19, 2021
@randrusiak

I can confirm I have the same issue, which only occurs in the 1.3.0 alpha versions. In the latest stable version everything works.

@xmudrii xmudrii added priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now. sig/cluster-management Denotes a PR or issue as being assigned to SIG Cluster Management. labels Sep 1, 2021
@xmudrii xmudrii self-assigned this Sep 1, 2021
@xmudrii
Member

xmudrii commented Sep 2, 2021

@MiroslavRepka @randrusiak Thank you for reporting the issue! We have fixed it in #1485 and the fix will be included in the upcoming 1.3.0-rc.0 release.
