Controller first boot (or two) fails when combined with istio-proxy #1968
Comments
@Stono please add the flag |
@Stono usually this means there's some networking issue |
Hey @aledbf - I added that -v=10. It does look like connectivity, I just don't understand why it then works after a crash. It's as if that initial call to the k8s master to get the version fails?
|
Please check whether the node where it works is the same one where it fails |
@aledbf I've confirmed the node is the same |
@aledbf I've finally got to the bottom of this.
I've actually worked around this by having a custom entrypoint on the
|
@Stono please try |
No joy I'm afraid @aledbf. As you can see, 1 restart on both instances of my controller:
Looking at the first try logs:
Then the second:
This is marginally better than before (typically I had a couple of pod crashes)! From looking at your code though, if I'm reading it right, you're only doing 50ms backoffs? That probably needs to be a bit longer, to give the other istio pod time to boot and pull its configuration from pilot |
@Stono try |
Still the same :'( @aledbf
Looking at the time between the logs though Perhaps some more logs showing "Trying..." or something? |
@Stono please use |
Will check it out in the morning, thanks!
On 14 Feb 2018 8:08 pm, "Manuel Alejandro de Brito Fontes" wrote:
@Stono please use quay.io/aledbf/nginx-ingress-controller:0.329
|
Yup, only seeing the error once @aledbf:
I'd expect to be seeing |
@Stono please use |
and also |
@Stono I think 0.331 is the one. I am sorry to bother you with the tests, but I cannot reproduce this issue. Edit: the log about the retries only appears if you use |
It's OK @aledbf - no joy with 331 either, this is running with
|
@Stono last test, I promise :) |
Your retry works now, it's just still a bit too fast haha.
|
|
I've gotta nip out for a few hours but will test that when I'm back - sorry! |
It worked, yay! Nice work @aledbf
|
@Stono this takes too long. Is this "normal"? |
@Stono if you remove the istio proxy there is no problem, right? |
@aledbf actually that is a really good point, I didn't realise just how long that was (30 seconds). Maybe someone from the Istio team can offer us support (I've raised a cross issue).
But yes, to answer your question: without istio-proxy it boots instantly, first time (we have two clusters, one is istio, one is not) |
👍 |
Hey,
I've observed this behaviour, which is the container failing to boot one or two times with every update we do.
Then it'll crash, and come back to life:
Any ideas?