Semester End Clean Up Tasks! #4102

Open
11 tasks
balajialg opened this issue Jan 12, 2023 · 4 comments
Labels: maintenance window (Tasks prioritized for maintenance window), priority: high (High priority tasks), process (Processes that need to be developed)

Comments

@balajialg
Contributor

balajialg commented Jan 12, 2023

Summary

At the end of every semester, we need to perform the following housekeeping tasks. I am collating them here so that we can prioritize them once each semester ends.

  • Remove packages that were not used (for Python packages, the Python popularity dashboard is a useful data point)
  • Remove auto-scaler calendar events that were added during the semester
  • Remove all the compute increase requests received during the semester
  • [Optional] Remove course admins for that specific semester
  • Run the archival process for all hub home directories
  • Reduce the number of nodes allocated for each node pool
  • Resolve dependabot alerts during the maintenance window
  • Send an email via the datahub-announce mailing list with announcements and a call to action (CTA) inviting requests such as package additions, RAM increases, calendar updates, and admin access
  • Migrate Ubuntu: Migrating Ubuntu 18.04 LTS to 22.04 LTS #4395
  • Version unversioned packages: audit datahub environment.yml files for correctness #4167
  • Cull users from our config files related to memory allocation

Important information

Spring 23 semester ends May 12th!

Any other activity I am missing?

balajialg added the process (Processes that need to be developed) label Jan 12, 2023
balajialg self-assigned this Jan 12, 2023
@ryanlovett
Collaborator

R libraries may be tagged with comments mentioning the course and term for which they are requested. Should we remove them during maintenance windows after every term and require instructors to request them the next time they’re needed? This would help reduce the size of the image, but could lead to more CI builds at the beginning of the term if people don’t prep in advance. I’m in favor of removal but it should be discussed and perhaps raised with users (instructors).
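
To illustrate how such tags could drive removal, here is a minimal sketch that lists install-script lines tagged with a term earlier than the current one. The tag format (`# COURSE, Season YYYY` as a trailing comment) and the function name `stale_lines` are assumptions made for illustration only; the real comments in the image files may look different and the pattern would need adjusting.

```python
import re

# Hypothetical tag format: a trailing comment such as "# STAT 20, Fall 2022".
_TAG = re.compile(
    r"#\s*(?P<course>[^,#]+),\s*(?P<season>Spring|Summer|Fall)\s+(?P<year>\d{4})\s*$"
)
# Order of terms within a calendar year, used to compare terms.
_SEASON_ORDER = {"Spring": 0, "Summer": 1, "Fall": 2}


def stale_lines(script_text, current_season, current_year):
    """Return (entry, course, term) for lines tagged with a term before the current one.

    Lines without a recognizable tag are skipped, so untagged (core)
    libraries are never reported.
    """
    now = (current_year, _SEASON_ORDER[current_season])
    stale = []
    for line in script_text.splitlines():
        match = _TAG.search(line)
        if match is None:
            continue
        tagged = (int(match["year"]), _SEASON_ORDER[match["season"]])
        if tagged < now:
            entry = line.split("#", 1)[0].strip()  # the part before the comment
            term = f'{match["season"]} {match["year"]}'
            stale.append((entry, match["course"].strip(), term))
    return stale
```

The output is only a candidate list for discussion with instructors; nothing here removes a library automatically.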

We do need an R popularity dashboard.

@balajialg
Contributor Author

balajialg commented Jan 17, 2023

@ryanlovett I am thinking of adding a question to the package request template, such as "Does the requested package have an end date for removal from the image?" What do you think?

I am all in favor of building an R popularity dashboard, as highlighted in issue #2942. We should plan some dev cycles for it in the next few months if possible.

@ryanlovett
Collaborator

@balajialg That is logical, but my guess is that instructors would want to specify no end date more often than not. Other infra devs may feel differently, but I think libraries should be opt-in every term, at least for the non-core courses. Smaller images mean faster node startup, which means faster scaling.

And yes, an R popularity dashboard is crucial. If we have that, we can feel better about removing libraries.

@balajialg
Contributor Author

@ryanlovett Sounds good. It would be a good idea to identify the hubs whose images we want to prune (I am assuming the generic hubs). I plan to send an email at the end of the semester highlighting the image pruning activity and asking instructors to raise GitHub issues for the packages they need (added this to the to-do list).

balajialg added the maintenance window (Tasks prioritized for maintenance window) and priority: high (High priority tasks) labels Feb 3, 2023
2 participants