auto-restart: enable for stopping workflows #5203

Open
dpmatthews opened this issue Oct 19, 2022 · 2 comments

Comments

@dpmatthews
Contributor

If a host gets condemned, any workflows which are in the stopping state do not currently get migrated. This causes problems because any such workflows can keep running indefinitely. We need to allow such workflows to be migrated.

@dpmatthews dpmatthews added this to the cylc-8.x milestone Oct 19, 2022
@oliver-sanders
Member

Workflows which are in the stopping state cannot be safely migrated as we cannot preserve their state in a restart.

The stopping state should be transient, so this problem should never happen. The recent cases where workflows got stuck in the stopping state are being addressed as part of other issues.

@oliver-sanders
Member

The issue arises when workflows have been stopped in normal shutdown mode, where the workflow waits for active jobs to complete before stopping. In this case, the jobs might have a high execution time limit (or even no limit at all), so the workflow can stay in the stopping state for a long time.

The tricky thing is restarting the workflow in the stopping state without any spawning / queueing logic being activated. We might be able to abuse something like --stop-after-cycle-point to avoid the need for a new flag.
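
To make the requirement concrete, here is a minimal toy sketch of a restart that preserves the stopping state. It is not Cylc code: `ToyScheduler`, `Mode`, `snapshot`, `restart` and `main_loop_step` are all hypothetical names invented for illustration, and the real scheduler's state is far richer. The point is only that the restarted scheduler keeps draining active jobs and never reaches the spawning / queueing branch.

```python
from dataclasses import dataclass, field
from enum import Enum


class Mode(Enum):
    RUNNING = "running"
    # Normal shutdown: wait for active jobs to finish, release nothing new.
    STOPPING = "stopping"


@dataclass
class ToyScheduler:
    """Hypothetical stand-in for a scheduler; not Cylc's real class."""

    mode: Mode
    active_jobs: set = field(default_factory=set)
    waiting_tasks: list = field(default_factory=list)

    def snapshot(self) -> dict:
        """Return the state that would have to survive a migration."""
        return {
            "mode": self.mode.value,
            "active_jobs": sorted(self.active_jobs),
            "waiting_tasks": list(self.waiting_tasks),
        }

    @classmethod
    def restart(cls, snap: dict) -> "ToyScheduler":
        """Rebuild a scheduler from a snapshot, including its mode."""
        return cls(
            mode=Mode(snap["mode"]),
            active_jobs=set(snap["active_jobs"]),
            waiting_tasks=list(snap["waiting_tasks"]),
        )

    def main_loop_step(self, finished: set) -> bool:
        """Run one iteration; return True when the scheduler should exit."""
        self.active_jobs -= finished
        if self.mode is Mode.STOPPING:
            # Skip spawning / queueing entirely and just drain active jobs.
            return not self.active_jobs
        # Running mode: release every waiting task into the active set.
        while self.waiting_tasks:
            self.active_jobs.add(self.waiting_tasks.pop(0))
        return False


# Usage: a stopping workflow is "migrated" and keeps draining on the new host.
old = ToyScheduler(Mode.STOPPING, {"a.1", "b.1"}, ["c.1"])
new = ToyScheduler.restart(old.snapshot())
new.main_loop_step({"a.1"})  # b.1 still active, so not finished yet
```

In this toy model the mode is simply stored in the snapshot. Reusing an existing stop option, as suggested above, would be an alternative way to get the same "no new spawning" behaviour without persisting a new piece of state.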
