As a developer, I want to be notified when autoscaler kicks in during the workspaces startup #22598

ibuziuk · 2023-10-11T12:20:07Z

Is your enhancement related to a problem? Please describe

Currently in order to have the machine auto scaler support admin needs to create DWOC object on the cluster:

apiVersion: controller.devfile.io/v1alpha1
config:
  workspace:
    ignoredUnrecoverableEvents:
      - FailedScheduling
    progressTimeout: 600s

Basically, it means that FailedScheduling events would be ignored during the workspace startup + that workspace startup time would be longer (depending on infra it takes around 10 mins for a new node to be provisioned)

Describe the solution you'd like

Ideally, DWO should detect when autoscaler kicks in and update DWOC accordingly, on the workspace startup screen the notification banner should be shown informing user that workspace startup would take longer due to a new node being provisioned:

Describe alternatives you've considered

Properly document DWOC config for autoscaler support

Additional context

Initial implementation of the Machine Autoscaler support - https://issues.redhat.com/browse/CRW-4072

The text was updated successfully, but these errors were encountered:

amisevsk · 2023-10-13T21:34:49Z

A potentially easier alternative that we also discussed is enabling an option on the CheCluster that can be used to more easily configure the operators for working with auto-scaling.

This could be done more quickly, in case detecting the "in-autoscale" state is tricky.

cgruver · 2023-11-01T19:46:21Z

@ibuziuk I tried creating a DWOC as you mentioned above. It does not appear to work.

DWOC:

apiVersion: controller.devfile.io/v1alpha1
kind: DevWorkspaceOperatorConfig
metadata:
  name: scaling-workspace-config
  namespace: openshift-devspaces
config:
  workspace:
    ignoredUnrecoverableEvents:
      - FailedScheduling
    progressTimeout: 600s

devfile snippet:

schemaVersion: 2.2.0
attributes:
  controller.devfile.io/storage-type: per-workspace
  controller.devfile.io/devworkspace-config: {"name": "scaling-workspace-config", "namespace": "openshift-devspaces"}
metadata:
  name: che-workspace
components:
- name: dev-tools
  container: 
    image: quay.io/cgruver0/che/che-dev-image:latest
    etc...

The resulting DevWorkspace object inherits the attributes as expected, but still fails to start immediately rather than waiting for node scaling.

kind: DevWorkspace
spec:
  contributions:
    - kubernetes:
        name: che-code-che-workspace
      name: editor
  routingClass: che
  started: true
  template:
    attributes:
      controller.devfile.io/devworkspace-config:
        name: devworkspace-config
        namespace: openshift-devspaces
      controller.devfile.io/scc: container-build
      controller.devfile.io/storage-type: per-workspace
    projects:
      - git:
          remotes:
            origin: https://github.com/cgruver/my-che-workspace.git
        name: my-che-workspace
        etc...

Error:

Error creating DevWorkspace deployment: Detected unrecoverable event FailedScheduling: 0/9 nodes are available: 2 Insufficient memory, 3 Insufficient cpu, 3 node(s) had untolerated taint {node-role.kubernetes.io/infra: }, 3 node(s) had untolerated taint {node-role.kubernetes.io/master: }. preemption: 0/9 nodes are available: 3 No preemption victims found for incoming pod, 6 Preemption is not helpful for scheduling...

amisevsk · 2023-11-01T20:26:44Z

@cgruver Che (and Dev Spaces) have their own custom DevWorkspaceOperatorConfigs that are used in place of the one you created:

      # In your DevWorkspace object
      controller.devfile.io/devworkspace-config:
        name: devworkspace-config
        namespace: openshift-devspaces

You could try to edit that DWOC, or, alternatively, configure the DevWorkspace Operator itself by creating your dwoc in DWO's install namespace (openshift-operators) with name devworkspace-operator-config. As far as I know, Che will ignore/overwrite any controller.devfile.io/devworkspace-config attribute in a devfile when converting it into a DevWorkspace.

cgruver · 2023-11-02T14:22:50Z

I think adding this as a parameter in the CheCluster CRD is a great idea.

monaka · 2024-02-23T07:22:39Z

ping?

I created a PoC. Is it acceptable?
monami-ya/che-operator@2681812

tolusha · 2024-02-28T09:37:15Z

@monaka
Yes, It works for me

monaka · 2024-02-29T14:38:58Z

@tolusha Thanks for your check.
I suppose that some additional docs are required. I'll set them up within a couple of weeks.

monaka · 2024-08-04T04:30:55Z

I suppose this can be closed as eclipse-che/che-operator#1864 was merged. @ibuziuk

ibuziuk · 2024-08-21T14:16:34Z

@monaka thanks, but I do not think we can close this issue since there is no update on the user dashboard with appropriate notification.
@dkwon17 @mkuznyetsov @AObuchow @svor it would be nice to prioritize this issue for the next sprint.

vinokurig · 2024-09-24T10:50:45Z

@ibuziuk @dkwon17 I have tried to set up cluster autoscaler but unfortunately I did not manage to cause new node provision. However I noticed that scaling up a machine set causes a new node provision. It means that we can intercept the Created Machine ... event from the openshift-machine-api namespace events and that would mean that a new node, binded to the new machine, will be provisioned soon.
So the plan is to add an event listener to the dashboard side and catch the Created Machine ... event and show a notification about workspace start delay. Any objections, concerns?

ibuziuk · 2024-10-01T13:55:59Z

@vinokurig yes, I think it is a good idea to use Created Machine event as a marker for auto scaler.

ibuziuk added the kind/enhancement A feature request - must adhere to the feature request template. label Oct 11, 2023

che-bot added the status/need-triage An issue that needs to be prioritized by the curator responsible for the triage. See https://github. label Oct 11, 2023

ibuziuk added area/dashboard area/devworkspace-operator area/doc Issues related to documentation labels Oct 11, 2023

ibuziuk changed the title ~~As a developer I want to be notified when autoscelaer kick in during the workspaces startup~~ As a developer I want to be notified when autoscaler kicks in during the workspaces startup Oct 11, 2023

ibuziuk changed the title ~~As a developer I want to be notified when autoscaler kicks in during the workspaces startup~~ As a developer, I want to be notified when autoscaler kicks in during the workspaces startup Oct 11, 2023

amisevsk added severity/P2 Has a minor but important impact to the usage or development of the system. and removed status/need-triage An issue that needs to be prioritized by the curator responsible for the triage. See https://github. labels Oct 13, 2023

monaka mentioned this issue Jun 27, 2024

Support devWorkspace.ignoredUnrecoverableEvents. eclipse-che/che-operator#1864

Merged

10 tasks

ibuziuk added this to Eclipse Che Team A Backlog Aug 21, 2024

ibuziuk moved this to 📅 Planned in Eclipse Che Team A Backlog Aug 21, 2024

ibuziuk moved this from 📅 Planned to 📋 Backlog in Eclipse Che Team A Backlog Aug 21, 2024

ibuziuk added this to Red Hat OpenShift Dev Spaces and Web Terminal Priorities Aug 22, 2024

ibuziuk moved this to Todo in Red Hat OpenShift Dev Spaces and Web Terminal Priorities Aug 22, 2024

ibuziuk moved this from Todo to Analyzing in Red Hat OpenShift Dev Spaces and Web Terminal Priorities Aug 22, 2024

svor assigned olexii4 Aug 28, 2024

svor moved this from 📋 Backlog to 📅 Planned in Eclipse Che Team A Backlog Aug 28, 2024

olexii4 moved this from 📅 Planned to 🚧 In Progress in Eclipse Che Team A Backlog Sep 3, 2024

ibuziuk moved this from Analyzing to Todo in Red Hat OpenShift Dev Spaces and Web Terminal Priorities Sep 13, 2024

vinokurig self-assigned this Sep 13, 2024

vinokurig unassigned olexii4 Sep 13, 2024

ibuziuk moved this from Todo to In Progress in Red Hat OpenShift Dev Spaces and Web Terminal Priorities Sep 18, 2024

vinokurig moved this from 🚧 In Progress to 📅 Planned in Eclipse Che Team A Backlog Oct 1, 2024

vinokurig moved this from 📅 Planned to 🚧 In Progress in Eclipse Che Team A Backlog Oct 2, 2024

vinokurig mentioned this issue Oct 4, 2024

Show warning banner on user namespace FailedScheduling event eclipse-che/che-dashboard#1211

Merged

ibuziuk moved this from 🚧 In Progress to Ready for Review in Eclipse Che Team A Backlog Oct 9, 2024

vinokurig closed this as completed in eclipse-che/che-dashboard#1211 Oct 15, 2024

github-project-automation bot moved this from In Progress to Done in Red Hat OpenShift Dev Spaces and Web Terminal Priorities Oct 15, 2024

vinokurig moved this from Ready for Review to ✅ Done in Eclipse Che Team A Backlog Oct 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

As a developer, I want to be notified when autoscaler kicks in during the workspaces startup #22598

As a developer, I want to be notified when autoscaler kicks in during the workspaces startup #22598

ibuziuk commented Oct 11, 2023 •

edited

Loading

amisevsk commented Oct 13, 2023

cgruver commented Nov 1, 2023

amisevsk commented Nov 1, 2023

cgruver commented Nov 2, 2023

monaka commented Feb 23, 2024

tolusha commented Feb 28, 2024

monaka commented Feb 29, 2024

monaka commented Aug 4, 2024 •

edited

Loading

ibuziuk commented Aug 21, 2024

vinokurig commented Sep 24, 2024

ibuziuk commented Oct 1, 2024

As a developer, I want to be notified when autoscaler kicks in during the workspaces startup #22598

As a developer, I want to be notified when autoscaler kicks in during the workspaces startup #22598

Comments

ibuziuk commented Oct 11, 2023 • edited Loading

Is your enhancement related to a problem? Please describe

Describe the solution you'd like

Describe alternatives you've considered

Additional context

amisevsk commented Oct 13, 2023

cgruver commented Nov 1, 2023

amisevsk commented Nov 1, 2023

cgruver commented Nov 2, 2023

monaka commented Feb 23, 2024

tolusha commented Feb 28, 2024

monaka commented Feb 29, 2024

monaka commented Aug 4, 2024 • edited Loading

ibuziuk commented Aug 21, 2024

vinokurig commented Sep 24, 2024

ibuziuk commented Oct 1, 2024

ibuziuk commented Oct 11, 2023 •

edited

Loading

monaka commented Aug 4, 2024 •

edited

Loading