Feature: Versioning on Tasks/Pipelines #1839

pierretasci · 2020-01-09T21:47:14Z

Abstract

A way to define a specific version of a named pipeline or task so that the spec of that pipeline or task can be referenced at a specific moment in time. For example, the way docker containers work today with tags whereby I am able to run container:tag for any valid tag of the same container name.

Use Cases

In a production system, it is likely a project's Tekton pipeline will evolve its definition over time. If I wanted to run the pipeline as it existed a month ago, I have no good way to know what the spec was if it has since changed.
The ability to test a change to a pipeline's spec without affecting the pipeline runs of other users that share the same spec.
Improve the ability to share and reuse tasks so I can depend on a pinned version of a Task from a catalog while the actual spec changes.

Details

I could imagine this being addressed as part of the Tekton spec. For now, our workaround has been to publish tasks and pipelines with the "version" in the name of the task. This does introduce a discoverability problem as well as clutter.

A solution that is part of the Tekton spec would make a lot of sense. Eg:

apiVersion: tekton.dev/v1alpha1
kind: Task
metadata:
  name: my-unique-tests
spec:
  - tag: v1
     spec: 
       steps:
         - name: run-test
            image: ubuntu
            command: ["foo"]
---
apiVersion: tekton.dev/v1alpha1
kind: TaskRun
metadata:
  name: my-unique-tests-run
spec:
  taskRef:
    name: my-unique-task
    tag: v1

An easy addition to this is a default "latest" to keep the existing functionality, again, similar to docker.

The text was updated successfully, but these errors were encountered:

vdemeester · 2020-01-10T08:46:49Z

I think the main question is, is this in scope for tektoncd/pipeline or some higher level component (from Tekton or not) ? versionning could be done using different name and labels, … and managed by an higher level component, keeping the tekton api as simple (relatively speaking) as possible.

This is also where experiment like catalgos (expect some code in the next week 👼), from tektoncd/community#53) comes into place (on the discoverability side of things.

/area api
/kind question
/kind design

pierretasci · 2020-01-13T17:11:09Z

I completely agree that the main question comes down to whether this lives in tekton or externally. I think that both are viable. Since I brought up the issue, I will make a case in favor of making it a part of the Task spec and that is to aid in task reuse. If the goal is to make tasks that are generic and reusable with a well-defined interface, they will need to be versioned somehow to prevent the definition being changed while an old version is running.

A versioned catalog could work but acts as a coarse-grained lock in a sense. It might slow down changes if one of the tasks in the catalog is incompatible in the new version but you really need the changes introduced to another task.

bobcatfish · 2020-01-17T18:30:16Z

I agree - esp. when we start thinking about tektoncd/catalog. We'd want to be able to make sure we can make changes in there and folks can consume those changes as they want to.

@pierretasci I think we've also run into some complication around expressing the version of Tekton Pipelines that a Task is compatible with, do you see that as being a related problem or maybe totally independent? (or @vdemeester maybe this is going to be handled by v1beta1 v1beta2 etc. once we start actually incrementing those?)

kind: Task
metadata:
  name: my-unique-tests
spec:
  - tag: v1
     spec:

It took me a couple min to realize that in this model we would need to have all the versions within one Task instance because otherwise their names would have to be distinct.

One downside is that the verison would become almost mandatory 🤔 or at least not using the version would look something like:

kind: Task
metadata:
  name: my-unique-tests
spec:
  - spec: # two specs for no reason :(

I wonder if we could brainstorm a few more options? One would be to embrace using the Task name as you mentioned - maybe it's reasonable to introduce a convention into the name for versions?

Ugh I can't think of much more... maybe by default we don't have spec.spec and we have something like:

apiVersion: tekton.dev/v1alpha1
kind: Task
metadata:
  name: my-unique-tests
spec:
     steps: # this is the current version, so don't _have_ to use versions
       - name: run-test
          image: ubuntu
          command: ["foo"]
     versions: # you can optionally provide previous versions of the Task
    - tag: v1
       spec: 
         steps:
           - name: run-test
              image: ubuntu
              command: ["foo-old"]

I dunno tho, it seems like using latest causes a lot of problems, if only b/c you don't know what you ran... 😩

Any other ideas?

dlorenc · 2020-01-17T18:41:33Z

Big +1 on solving this somehow.

I think I prefer to put the version info on a label of each Task, then we could add support for TaskRuns/Pipelines to include selectors in their TaskRefs.

This would work for things like "latest" as well as pinning to specific versions, but would not allow full semver-style version comparisons.

pierretasci · 2020-01-17T19:02:53Z

I really like the labels idea because 1. it is built-in to Kubernetes (as are selectors), and 2. it is open enough to allow anyone to change how they want to define their versioning. I wonder how we would handle reconciliation at the controller level when registering two tasks with the same name. AFAIK, the reconcile loop checks namespace and name for uniqueness.

Just to exhaust all possibilities, the other (horrible) idea I have is to have an inheritance chain a la kustomize. In this way, a task or pipeline specifies a parent task/pipeline and that spec is merged all the way down to a root CRD. This would be a nightmare to handle though (what if one piece of the chain is missing?).

To @bobcatfish's question, I think versioning Tekton itself goes hand in hand with with versioning the work being done by a task/pipeline. I may want to take advantage of a new tekton feature which has a new field in the spec but that may blow up in someone's cluster if they don't have the latest Tekton controller.

bobcatfish · 2020-01-17T19:28:54Z

Discussed this with @imjasonh a bit and he has another idea that's pretty cool and very different from what we've discussed so far! This is something that he and a few other folks (@dlorenc @jonjohnsonjr ) have been throwing around as an idea as well, which has the potential to solve a few problems at once.

The preview is that we extend taskRef to support more stuff than just Tasks that live inside the cluster - and specifically we use OCI Artifacts to bundle and store Tasks, e.g.

apiVersion: tekton.dev/v1alpha1
kind: TaskRun
metadata:
  name: my-task-run
spec:
  taskRef:
    image:
      name: gcr.io/my/catalog:v1.2.3
      task: my-task

And we could even do cool stuff like:

apiVersion: tekton.dev/v1alpha1
kind: TaskRun
metadata:
  name: my-task-run
spec:
  taskRef:
    git:
      url: https://github.com/my/repo
      commit: deadbeef
      path: path/to/my/task.yaml

Jason's full proposal for Tekton Task References

vdemeester · 2020-01-20T09:57:52Z

I am definitely interested in re-using oci images/artifacts for this, a lot of tooling exists for it, and it handles versioning really well. I think we discussed it really early when talking about the catalog.

We could use labels for specifying stuff like "minimum version of tekton required", …

bobcatfish · 2020-01-28T14:44:15Z

@pierretasci do you have any thoughts/objections on this? no worries if you feel like you need more information first - I think this is pretty cool and am motivated to continue exploring this option

pierretasci · 2020-01-28T18:02:33Z

Sorry, I wrote my feedback in @imjasonh's doc and forgot to post back here. I think the OCI proposal makes a ton of sense and I think it more than solves the use case intended here. I'm good to explore that as the proposal here.

siamaksade · 2020-01-30T15:09:24Z

cc @sthaha

sthaha · 2020-01-31T01:42:50Z

Doesn't using oci images make the definition of tasks opaque to user? Currently we can't treat tasks as opaque (akin to calling a function without knowing its implementation details) as in order to use a task, you must know the input and output params expected. In the OCI based implementation how would someone find out how to use the task?

imjasonh · 2020-01-31T02:12:34Z

We can build tooling to describe a task defined in an OCI image, by cracking the image open and parsing the YAML. This could start as a standalone CLI in experimental then maybe eventually graduate to the tkn CLI. Something like tkn task describe my-task --image=gcr.io/my/image

sthaha · 2020-01-31T02:27:07Z

@bobcatfish I am a bit confused as to how using oci artifacts address the versioning issue? Hypothetically, say my pipeline refers to v1 and v2 of a task that run in parallel

how would the spec look like?
where would the tasks be created?
are these tasks transient (deleted soon after its use?)

As I see it, the issue is that taskRef refers to the metadata.name of a Task thus we need to somehow encode version and name of the task into metadata.name. Rather how about we use labels to refer to a task? e.g.

taskRef:
   name: catalog.upstream.foobar
   version: 0.1.1

would use a task (independent of metadata.name) that has the label name and version

sthaha · 2020-01-31T02:33:44Z

@imjasonh thanks for explanation. It does make sense to me and additionally addresses publishing a catalog issue. In the oci artifacts based proposal, would the controller even need to create tasks in cluster?

I am however not sure if this addresses versioning of tasks unless we can build a structure/naming convention into this oci artifact based catalog.

imjasonh · 2020-01-31T02:42:33Z

This addresses versioning because OCI images can be versioned and referred to by their versions registry.com/image:v1.2.3, or pinned to a specific immutable version by its content (registry.com/image@sha256:abcde...).

In this model the Task definition doesn’t have to be defined in the cluster, the image only needs to be readable and reachable by the Tekton controller running on the cluster.

imjasonh · 2020-01-31T03:00:07Z

The example from the doc shows how a taskRef might refer to an image, rather than a Task definition installed on the cluster:

apiVersion: tekton.dev/v1alpha1
kind: TaskRun
metadata:
  name: my-task-run
spec:
  taskRef:
    image:
      name: gcr.io/my/catalog:v1.2.3
      task: my-task

sthaha · 2020-01-31T03:07:42Z

This addresses versioning because OCI images can be versioned and referred to by their versions registry.com/image:v1.2.3

Neat! so here, it is the catalog that has a version which I guess all tasks inherit i.e. instead of specifying use task and version we specify task from catalog with version which is fine!

imjasonh · 2020-01-31T03:14:38Z

Yeah, the image can contain a bunch of co-versioned Tasks/Pipelines/whatever, or just one each if the image author wants that. An image containing a bunch of co-versioned things can be considered like a Catalog.

siamaksade · 2020-01-31T03:28:23Z

Interesting idea. +1 for the proposal

@imjasonh are there advantages to combing multiple tasks into a single image rather than enforcing a single task per image? If each image contains a single task and related step images, the syntax becomes simpler since the name is not needed anymore:

apiVersion: tekton.dev/v1alpha1
kind: TaskRun
metadata:
  name: my-task-run
spec:
  taskRef:
    image: gcr.io/maven:v1.2.3

This would be also somewhat similar to how GitHub Workflow references actions e.g. actions/setup-java@v1

imjasonh · 2020-01-31T03:51:33Z

The advantage to co-versioning things together is if you have a Pipeline that runs multiple Tasks, you can co-version all of them together if they're all bundled in one image. But if you as an operator or thing author wants them separate, that's your prerogative.

GitHub's actions/setup-java@v1 thing is a bit more like how we use images as steps, since that reference ends up describing the container image that gets run as part of that step. With GitHub, that reference can also be a GitHub repo that might get built just-in-time into an image that runs.

sthaha · 2020-01-31T05:19:49Z

@imjasonh One more question about the "opaqueness" of oci artifacts. Say we want to support a case where we would like to expose list of tasks in the catalog to a UI (dashboard). Would oci-image allow for that? Or would we have to rely on a Catalog CRD to expose that information?
e.g.

kind: Catalog
spec:
  imageRef:  gcr.io/my/catalog:v1.2.3 .  ## could be Refs instead
  
# filled by controller
status:
  tasks:
     - name: kaniko
     - name: buildah  
  clusterTasks:
     - name: ...

imjasonh · 2020-01-31T12:07:55Z

You could ask the registry over HTTP for image metadata and use it to list the objects it contains. The API is well documented and part of the OCI spec, we would only be defining the spec for the data types in the image. We could easily provide a Go package to parse it all.

Having it wrapped in a CRD means there are two sources of truth: the CRD could say it contains A, B and C while the image itself contains A, C and Z.

imjasonh · 2020-01-31T12:52:23Z

The OCI image also wouldn't have a concept of a ClusterTask since to the image there's no concept of a namespace or a cluster-scoped anything. If operators want to make sure some images aren't available to some users they can enforce that with OPA and/or auth.

siamaksade · 2020-01-31T13:14:40Z

The advantage to co-versioning things together is if you have a Pipeline that runs multiple Tasks, you can co-version all of them together if they're all bundled in one image. But if you as an operator or thing author wants them separate, that's your prerogative.

GitHub's actions/setup-java@v1 thing is a bit more like how we use images as steps, since that reference ends up describing the container image that gets run as part of that step. With GitHub, that reference can also be a GitHub repo that might get built just-in-time into an image that runs.

For the user authoring pipelines, GitHub Actions and Tekton Tasks are similar in that they are components that user can reuse to build a pipeline. step is not a reusable component.

The runtime model of GitHub Actions is similar to a step like you said.

pierretasci · 2020-01-31T17:30:49Z

One small thing I want to add to this discussion that I really like about the idea of using OCI, is that auth is a "solved" problem. End-users will be able to publish their task and pipeline definitions as artifacts and use imagePullSecrets to authorize exclusively themselves to access those definitions in their CI. Not reinventing the wheel here is a huge win.

imjasonh · 2020-02-01T18:37:03Z

Thought about this a bit more, this should be possible to prototype entirely in experimental, with a CLI that's able to bundle tasks/etc into an image, and a "run this task in this image" surface that resolves the Task spec from the image before telling Tekton about it.

Once this is iterated on and the surface settles, it should be easy to shift the image->Task resolution to the Tekton controller and move the bundling CLI into tkn.

This partially addresses the desire to fetch tasks from an OCI image artifact. Issue: #1839 Signed-off-by: Sunil Thaha <[email protected]>

bobcatfish · 2020-04-09T17:44:04Z

@pierretasci has expanded on the proposed OCI format at https://docs.google.com/document/d/1lXF_SvLwl6OqqGy8JbpSXRj4hWJ6CSImlxlIl4V9rnM/edit

pierretasci · 2020-04-16T00:15:49Z

First PR with the new design is up #2395

ghost · 2020-10-05T15:38:20Z

Moving to 0.18 release milestone because there are some remaining questions around fetching large images during reconcile.

dibyom · 2020-11-02T16:37:48Z

Closing since #2395 was merged.

/close

tekton-robot · 2020-11-02T16:37:50Z

@dibyom: Closing this issue.

In response to this:

Closing since #2395 was merged.

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

tekton-robot added area/api Indicates an issue or PR that deals with the API. kind/question Issues or PRs that are questions around the project or a particular feature kind/design Categorizes issue or PR as related to design. labels Jan 10, 2020

bobcatfish self-assigned this Jan 28, 2020

vdemeester mentioned this issue Mar 10, 2020

The newly created pipeline shows the status of the deleted pipeline if the name of the pipeline is same #2201

Closed

tekton-robot pushed a commit that referenced this issue Mar 10, 2020

Adds resolver to provide task resolution for images

9e8e752

This partially addresses the desire to fetch tasks from an OCI image artifact. Issue: #1839 Signed-off-by: Sunil Thaha <[email protected]>

bobcatfish mentioned this issue Mar 13, 2020

Deleting Pipeline should delete PipelineRuns #2223

Closed

vdemeester mentioned this issue Mar 16, 2020

Capture User Information with PipelineRun/TaskRun Creation #2045

Closed

sthaha mentioned this issue Mar 20, 2020

Fetch remote task refs from images in taskruns #2233

Closed

3 tasks

bobcatfish mentioned this issue Mar 26, 2020

Add support for referencing Tasks in git #2298

Closed

bobcatfish mentioned this issue May 8, 2020

Allow Tasks/Pipelines to indicate versions of Tekton Pipelines they work with #2588

Closed

bobcatfish mentioned this issue Jun 4, 2020

The pipeline resource is fetched via client vs. lister in resolver #2740

Closed

afrittoli mentioned this issue Jun 6, 2020

Emit events from the PipelineRun controller #2545

Closed

3 tasks

bobcatfish mentioned this issue Jul 6, 2020

Solve chicken and egg problem of Tekton config-as-code #859

Closed

bobcatfish mentioned this issue Jul 23, 2020

Design: Sharing Tasks and Pipelines without Copy Pasta tektoncd/catalog#45

Closed

afrittoli added this to the Pipelines v0.16 milestone Aug 10, 2020

bobcatfish mentioned this issue Aug 21, 2020

Implement When Expressions #3117

Closed

4 tasks

dibyom unassigned bobcatfish Aug 24, 2020

bobcatfish added the area/roadmap Issues that are part of the project (or organization) roadmap (usually an epic) label Aug 24, 2020

bobcatfish modified the milestones: Pipelines v0.16, Pipelines v0.17 Sep 11, 2020

ghost modified the milestones: Pipelines v0.17, Pipelines v0.18 Oct 5, 2020

bobcatfish mentioned this issue Oct 30, 2020

Add Cluster scope pipeline support #1876

Closed

tekton-robot closed this as completed Nov 2, 2020

bobcatfish mentioned this issue Nov 3, 2020

Support Tekton "Bundles" stored in Git repos #3490

Closed

lbernick mentioned this issue Oct 5, 2022

Move remote resolution out of alpha #5515

Merged

7 tasks

spirillen mentioned this issue Feb 8, 2025

model-registry.com mypdns/matrix#81550

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature: Versioning on Tasks/Pipelines #1839

Feature: Versioning on Tasks/Pipelines #1839

pierretasci commented Jan 9, 2020

vdemeester commented Jan 10, 2020

pierretasci commented Jan 13, 2020

bobcatfish commented Jan 17, 2020

dlorenc commented Jan 17, 2020

pierretasci commented Jan 17, 2020

bobcatfish commented Jan 17, 2020

vdemeester commented Jan 20, 2020

bobcatfish commented Jan 28, 2020

pierretasci commented Jan 28, 2020

siamaksade commented Jan 30, 2020

sthaha commented Jan 31, 2020

imjasonh commented Jan 31, 2020

sthaha commented Jan 31, 2020

sthaha commented Jan 31, 2020

imjasonh commented Jan 31, 2020

imjasonh commented Jan 31, 2020

sthaha commented Jan 31, 2020

imjasonh commented Jan 31, 2020

siamaksade commented Jan 31, 2020

imjasonh commented Jan 31, 2020

sthaha commented Jan 31, 2020

imjasonh commented Jan 31, 2020

imjasonh commented Jan 31, 2020

siamaksade commented Jan 31, 2020

pierretasci commented Jan 31, 2020

imjasonh commented Feb 1, 2020

bobcatfish commented Apr 9, 2020

pierretasci commented Apr 16, 2020

ghost commented Oct 5, 2020

dibyom commented Nov 2, 2020

tekton-robot commented Nov 2, 2020

Feature: Versioning on Tasks/Pipelines #1839

Feature: Versioning on Tasks/Pipelines #1839

Comments

pierretasci commented Jan 9, 2020

Abstract

Use Cases

Details

vdemeester commented Jan 10, 2020

pierretasci commented Jan 13, 2020

bobcatfish commented Jan 17, 2020

dlorenc commented Jan 17, 2020

pierretasci commented Jan 17, 2020

bobcatfish commented Jan 17, 2020

vdemeester commented Jan 20, 2020

bobcatfish commented Jan 28, 2020

pierretasci commented Jan 28, 2020

siamaksade commented Jan 30, 2020

sthaha commented Jan 31, 2020

imjasonh commented Jan 31, 2020

sthaha commented Jan 31, 2020

sthaha commented Jan 31, 2020

imjasonh commented Jan 31, 2020

imjasonh commented Jan 31, 2020

sthaha commented Jan 31, 2020

imjasonh commented Jan 31, 2020

siamaksade commented Jan 31, 2020

imjasonh commented Jan 31, 2020

sthaha commented Jan 31, 2020

imjasonh commented Jan 31, 2020

imjasonh commented Jan 31, 2020

siamaksade commented Jan 31, 2020

pierretasci commented Jan 31, 2020

imjasonh commented Feb 1, 2020

bobcatfish commented Apr 9, 2020

pierretasci commented Apr 16, 2020

ghost commented Oct 5, 2020

dibyom commented Nov 2, 2020

tekton-robot commented Nov 2, 2020