-
Notifications
You must be signed in to change notification settings - Fork 40.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Job Controller #1624
Comments
Also discussed in #503. |
See also OpenShift's proposed job concept: https://github.com/openshift/openshift-pep/blob/master/openshift-pep-013-openshift-3.md Potential features include:
|
We have not yet implemented the generic concept, instead using pods directly. Initially it seems like each of the individual places run-once pods were used would be related, the actual flows were easy to control. Advantages of a unified job resource would be that you could easily extend it for your own use - downside is that you still have to control / ensure the job resource gets created (so you have two sync loops instead of one). Was hoping to see how others might use jobs before we proceeded further. The pattern was definitely common, but our uses had subtle differences that may not abstract well. |
That's interesting. Another option is that we could just make individual pods easier to use for these workflow sorts of scenarios (which we should do regardless). Some features I could imagine would be useful for that:
Anything else? |
Logs and the ability to read logs and get pod info long after a pod is deleted.= |
Some application frameworks, such as mapreduce/hadoop-style workloads, may take on the controller responsibilities themselves rather than relying upon a shared service. An example might be the Application Master in YARN. |
@thockin @smarterclayton @bgrant0607 After a long journey through the issues about job management in kubernetes, I have submitted a GSOC proposal for this topic. Your feedback would be most appreciated. |
We'll take a look @TamerTas. Thanks! |
To document later: Some batch workloads are data processing/analysis workloads of independent utility. Others support serving workloads, such as:
|
/sub |
cc @mwielgus |
@soltysh please link your ongoing job proposal here so Kube folks can get a chance to look. The proposal will be coming soon here while Maciej prototypes |
It's already linked here, but here you go openshift/origin#3693. It covers both the job controller part and the cron scheduler. |
Could we discuss this at an upcoming community hangout? |
Absolutely On Jul 21, 2015, at 3:06 PM, Brian Grant [email protected] wrote: Could we discuss this at an upcoming community hangout? — |
I'm not working on the job controller, a better owner might be Mike or @soltysh |
@erictune has been shepherding this, so assigning to him. |
/sub |
@aronchick Continuing from #14186 (comment) Your primary concern is to clarify that this is a batch job (e.g., LSF, Load Leveler, Printer Job) as opposed to an indefinitely running "Job" (e.g., Borg, Aurora, Nomad)? |
And we do distinguish those, but as:
That's the terminology we have in k8s. @aronchick what's your opinion on that? |
We now have this as a beta feature, in head and to appear in 1.1. Closing. |
@erictune are you going to close it? |
UPSTREAM: 119107: Stop using deprecated API
We need something like ReplicationController that runs RestartOnFailure and RestartNever pods "to completion", collects results, etc
The text was updated successfully, but these errors were encountered: