-
Notifications
You must be signed in to change notification settings - Fork 113
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature] Rancher RKE support #1092
Comments
Hi @jonathon2nd thank you for trying Liqo! Based on tunnel-operator logs:
it seems that the iptables module is not loaded on the host where the Liqo-Gateway is running. I would suggest you to load the iptables module on the host and check if the problem is resolved. |
After manually running Thank you! |
Probably some configuration generated by the k3s provider are wrong. I would suggest you to uninstall Liqo and then install it again.
Then you need to change in the After that you have change the file you can install Liqo with:
@jonathon2nd keep an eye on this issue #1094 |
Thanks @alacuku ! I will keep an eye on the issue. So looking at the values, both of these look fine.
The other I am not sure about
This directs to the cluster I am deploying to, but I am not sure if Liqo is happy with going through Rancher. I will play around with modifying this. EDIT: In addition, the Controller-Manager finally fails with this error.
It looks like it is able to derive the api url for the cluster curling the url from a test box shows that it is up
|
Another thing I have noticed, is that Calico on the workers start to freak out when Liqo is installed, and we are not in BGP mode but we are using IPVS mode.
Adding the configuration as specified by https://doc.liqo.io/installation/advanced/ Also, I have tested with changing the address in values.yaml with one of the master ip's, no change in error on the Controller-Manager. EDIT:
For some reason, calico is still complaining
So, to get a working install for the record:
So I am still confused. I have tried the configureaton change with calico before when installing with liqocli and it did not work. The helm install must be different in some way with generating the values first and then installing with them unmodified. Will do some more testing and clarify more hopefully. |
Thanks again for the help. It is very much appreciated. I have two clusters now, and I am attempting to get cluster1 to peer to cluster2, as described in https://doc.liqo.io/usage/peering/ I run
However, when I view
I have not seen any pods with errors, so having a hard time figuring out where the hang up is. Does the above look familiar @alacuku ? |
Could you check the logs of |
ah I missed it. hmmm, strange. The url does load. Seem to be having some networking troubles, looking into it.
|
Hi @jonathon2nd. Do you confirm that The following log line:
suggests that something is wrong with the dns resolution of the remote api server. I would suggest to continue on the slack channel and come back here and update the issue after we have a solution. |
That sounds good to me @alacuku. 😄 |
Current state. Wanted to make a consolidated post to document where I am at with the assistance of liqo devs 👏 During testing we upgraded to Rancher 2.6.3. I have deployed two rke ks8 1.21.9 clusters with calico.
Make change to calico as Liqo requires: https://doc.liqo.io/installation/advanced/
Restart all nodes. I then get the kubeconfigs for each clsuter and remove the rancher management cluster entry from them. I do not want rancher cluster downtime to effect liwo of two downstream clusters. Then generate the values for install with At this point I install liqo on both clusters, generate an enrollment command from cluster2 and run on cluster1. Everything is installed and setup properly. I then run through the demo, https://doc.liqo.io/gettingstarted/helloworld/test/.
Ping to local works, but fails to remote
Upon looking at iptables, there is no forward rule for liqo and the default is drop.
We are confused as to how this could be blocking liqo traffic with a So to test, I ran
However, when we deploy a more complex app, such as postgres-ha, networking does not work. Postgres pods on cluster1 can not reach pods on cluster2. Going to test out a couple changes, will update when done. |
Alright, we got a working setup. Like I mentioned before I tested out using changes from submariner for liqo, this was when I was trying to get anything working. Run another iptables command So for cluster2
And ran these yamls on the appropriate cluster
After that and setting up two fresh clusters, everything works (mostly). postgres-ha: Only issue left that we can see at the moment, nodeport are not quite working right. 2022-02-17.15-13-52.mp4 |
Another update, the cidr change for cluster2 is not necessary. During testing I redid clusters with terraform and I added the two iptable rules to conf.
Even though at vm creation those interfaces do not exist, it does not error and when the interfaces are added they work. So, I redid the clusters and left the cidr's alone, and I have not encountered any other issues. The port is still flipping on cluster2, but my redis sentinel deployment is working and FA/HA is working. |
There is also the minor issue of
|
Also, regarding 'Liqo does not support multi-level affinities/node-selection': https://liqo-io.slack.com/archives/C014Z7F25KP/p1642730194018100 This was a bit of a block regarding db's, as we want to have pods spread out to as many hosts without pod crowding. To solve this for our test redis, I added the following to the redis values. For worker nodes on cluster1 I added The The
|
Left this open, RKE is yeet. |
Is your feature request related to a problem? Please describe.
I would like to be able to deploy Liqo to a Rancher managed, k8s RKE cluster.
Describe the solution you'd like
I would like RKE to be a supported option when installing Liqo, similar to k3s https://doc.liqo.io/installation/?provider=K3s
Describe alternatives you've considered
The only install option that functions at all when attempting to install is k3s. When attempting kind or kubeadm, nothing happens.
Additional context
First two options do not do anything
k3s does install, but the gateway fails to start
When the above is happening on a cluster with Calico, calico gives the following errors. Providing in case they are insightful. The errors occur when applying https://doc.liqo.io/installation/advanced/#calico or not on a fresh cluster.
I have tested with and without IPVS (which we use), I have tried flannel or calico (which we use).
I would also like to mention we use Rocky Linux, I am unsure if that plays a part in the issues I am encountering.
Thank you for developing Liqo, it looks very exciting and something we would very much love to use for building out our multi cluster infra more.
The text was updated successfully, but these errors were encountered: