Configurable Serf ReconnectTimeout and TombstoneTimeout #333
Comments
Hey, is there a concern with this? Nomad won't schedule to the failed nodes; they just remain in the system in case they reconnect. But I agree, we need to expose more of the configuration, so a PR would be appreciated!
It's only a user interface issue. And yes, it's only important to those who run Nomad in highly changeable environments like spot instances. For example, we bid on 10 spot instances. After an hour or two somebody outbids us, and we decide to launch 10 regular instances. Five hours later spot instance prices go down, so we decide to run 10 spot instances again and shut down the regular ones. This leads to 20 instances in the "leave" state that will never rejoin the cluster. Not related to this ticket, but one more question:
I'm going to lock this issue because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active issues.
Part of our infrastructure runs on AWS spot instances. I played with Nomad on spot instances and ended up with 50 members in the "failed" state. I only run 3 instances concurrently, and I can imagine what will happen when I run 50 agents on spot instances. The default 3-day timeout is huge if you are running on spot instances.
If you agree, I can open a PR.
Thanks.