[dotnet][rb][java][js][py] remove disk cleanup, add monitoring#17001
[dotnet][rb][java][js][py] remove disk cleanup, add monitoring#17001titusfortner merged 2 commits intotrunkfrom
Conversation
PR Compliance Guide 🔍Below is a summary of compliance checks for this PR:
Compliance status legend🟢 - Fully Compliant🟡 - Partial Compliant 🔴 - Not Compliant ⚪ - Requires Further Human Verification 🏷️ - Compliance label |
||||||||||||||||||||||||
PR Code Suggestions ✨Latest suggestions up to 6dd40a4
Previous suggestions✅ Suggestions up to commit ab1bc64
|
|||||||||||||||||||||||||||||
There was a problem hiding this comment.
Pull request overview
This PR removes the disk cleanup script (free-disk-space.sh) that was taking 1-2 minutes per workflow run and replaces it with monitoring that reports disk space usage to a Slack channel. The change is motivated by GitHub-hosted runners now having significantly more disk space available (92GB free vs 17GB previously).
Changes:
- Deleted the
free-disk-space.shscript that cleaned up packages and directories - Replaced the cleanup step with disk space monitoring that checks available space after each job
- Added Slack notifications to the
#ci-disk-alertschannel with color-coded alerts based on thresholds
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| scripts/github-actions/free-disk-space.sh | Complete removal of the disk cleanup script (55 lines deleted) |
| .github/workflows/bazel.yml | Removed "Free space" step and added "Check disk space" and "Report disk space" steps with Slack integration |
ab1bc64 to
5ccbf3a
Compare
40ce465 to
689b10f
Compare
User description
free-disk-space runs on every Bazel run, and can take 1 -2 minutes.
Quick look shows we don't need it.Looks like GitHub is transitioning.
If you get the
Azure Region: eastus2, you end up withRemaining disk space: 88GBIf you get
Azure Region: westus, you end up with:Remaining disk space: 16GBWe're only using 5-10GB, and everything I'm seeing shows that the lowest remaining space on any of our CI jobs is 10GB.
💥 What does this PR do?
Remove the free space step from workflow
Temporarily adding a slack alert to get a feel for whether anything is a legit problem.Update: Added a disk space check at end of each job that fails if < 5GB remaining.
I know that's not ideal because it's not that commit's fault, but seems like the easiest way to alert us to a problem.
AllSome Runners now have ~145GB total / ~92GB free (According to the gh command that Claude ran)🔧 Implementation Notes
Instead of proactive cleanup, we now report remaining disk space to#ci-disk-alertsafter every bazel workflow job.Running all tests to exercise it.Update: Honestly, I just didn't want to deal with passing Slack permissions to everything.
💡 Additional Considerations
#ci-disk-alertsSlack channel createdI'll remove the slack alert if we're not remotely close to an issue.🔄 Types of changes
PR Type
Enhancement, Other
Description
Remove disk cleanup script that was taking 1-2 minutes per run
Add disk space monitoring with Slack alerts to
#ci-disk-alertschannelImplement color-coded thresholds (red <10GB, yellow <30GB, green ≥30GB)
Leverage improved runner disk availability (92GB free vs 17GB previously)
Diagram Walkthrough
File Walkthrough
free-disk-space.sh
Remove disk cleanup script entirelyscripts/github-actions/free-disk-space.sh
directories
17GB)
bazel.yml
Replace cleanup with disk space monitoring.github/workflows/bazel.yml
Free spacestep that calledfree-disk-space.shscriptCheck disk spacestep to calculate available disk space in GBReport disk spacestep using Slack webhook for monitoringgood (≥30GB)