[lockfile-stats] Lockfile statistics audit — 233 workflows, 22.55 MB, 2026-05-21 #33855

2026-05-21T20:55:03Z

github-actions[bot]
Bot May 21, 2026

Executive Summary

Analysis of 233 compiled workflow lockfiles (.github/workflows/*.lock.yml) on 2026-05-21 — total 22.55 MB, no files skipped.

Metric	Value
Lockfiles	233
Total bytes	22,548,039 (~22.55 MB)
Size: min / median / avg / max	62.8 KB / 95.8 KB / 96.8 KB / 177.4 KB
Total jobs	1,858 (avg 7.97/workflow, range 5–12)
Total steps	23,931 (avg 102.7/workflow, range 67–140)
Total `run` scripts	11,518
Engines in use	7 distinct (Copilot, Claude, Codex, Pi, Crush, Gemini, OpenCode)

File Size Distribution

Bucket	Count	Share
50–100 KB	159	68.2%
100–250 KB	74	31.8%
<50 KB	0	0%
≥250 KB	0	0%

Sizes are tightly clustered: every lockfile is at least 62 KB, and the largest is only ~2.8× the smallest — suggesting a substantial shared compiled scaffold (engine setup, safe-output plumbing, MCP server boilerplate) that dominates the per-file footprint regardless of the user-authored content.

Top 10 largest lockfiles

Workflow	Bytes
smoke-claude.lock.yml	177,418
smoke-copilot.lock.yml	151,191
smoke-copilot-arm.lock.yml	141,922
smoke-codex.lock.yml	128,169
mcp-inspector.lock.yml	128,090
issue-monster.lock.yml	126,884
deep-report.lock.yml	126,033
cloclo.lock.yml	124,771
daily-news.lock.yml	120,998
daily-performance-summary.lock.yml	118,296

5 smallest lockfiles

Workflow	Bytes
test-workflow.lock.yml	62,813
example-permissions-warning.lock.yml	63,442
codex-github-remote-mcp-test.lock.yml	64,012
firewall.lock.yml	64,182
daily-malicious-code-scan.lock.yml	72,161

Trigger Analysis

Trigger	Workflows	Share
`workflow_dispatch`	225	96.6%
`schedule`	159	68.2%
`pull_request`	34	14.6%
`issues`	4	1.7%
`issue_comment`	2	0.9%
`push`	2	0.9%
`workflow_run`	1	0.4%
`discussion`	1	0.4%
`discussion_comment`	1	0.4%
`pull_request_review_comment`	1	0.4%

Top trigger combinations:

Combination	Count
`schedule + workflow_dispatch`	155
`workflow_dispatch` only	39
`pull_request + workflow_dispatch`	27
`pull_request` only	3

Nearly two-thirds of all workflows are scheduled, almost universally paired with workflow_dispatch for manual reruns. Only 8 workflows omit workflow_dispatch entirely — a strong convention across the repo.

Schedule cron patterns (top 30 — all crons are off-zero minutes)

Most-shared crons (each used by ≤2 workflows):

23 11 * * *, 38 3 * * *, 9 3 * * *, 39 23 * * *, 52 23 * * *, 40 3 * * * — each appears twice.

The remaining 153 scheduled workflows use unique crons with off-zero minute offsets (e.g. 5 14 * * 1-5, 27 */6 * * *, 49 */4 * * *). This is consistent with the "avoid :00/:30" guidance to spread scheduler load.

Safe Outputs Analysis

⚠️ Limitation: In compiled .lock.yml files, safe-outputs configuration is embedded as a JSON env var (GITHUB_AW_SAFE_OUTPUTS_*), not as YAML keys, so the regex-based extractor returned no key-level counts in this run. The lockfiles do reference the safeoutputs MCP server / CLI throughout. A future bump (lockfile_stats_v2.py) should parse the embedded JSON env var to recover counts.

What we can observe:

Every lockfile loads the safeoutputs MCP server (it is the standard write channel for these agentic workflows).
Discussion categories were not captured directly; the next analyzer revision should pull category: from the embedded safe-outputs JSON.

Structural Characteristics

Metric	Min	Avg	Max	Max workflow
Jobs per workflow	5	7.97	12	`firewall-escape.lock.yml`
Steps per workflow	67	102.7	140	`smoke-copilot.lock.yml`
Total `run` scripts	—	49.4 avg	—	—

Compiled workflows are large but uniform: every workflow has at least 5 jobs and 67 steps, suggesting a fixed compiled skeleton (preflight, engine setup, MCP setup, agent run, safe-output dispatch, teardown) plus per-workflow tail.

Timeout distribution (across all jobs):

Bucket (minutes)	Job count
≤5	14
6–15	319
16–30	315
31–60	29
>60	3

The 6–30 minute band covers 91% of all timeouts — consistent with single-turn agent runs sized to fit comfortably within GitHub-hosted runner billing increments.

Permission Patterns

All 233 lockfiles set permissions: {} at the top level, then declare fine-grained permissions per job. This is the safest pattern: no implicit token scope at the workflow level, with each job's needs declared explicitly. (Per-job permission counts could not be aggregated in this run because the YAML parser was unavailable in the runtime; the regex pass over inline permissions: {} confirms the empty-top-level convention is universal.)

Tool & MCP Patterns

MCP server frequency (count of references across all lockfiles):

Server	References
`github`	6,448
`playwright`	168
`sentry`	96
`grafana`	14
`arxiv`	6
`deepwiki`	6

Top 30 most-referenced MCP tools (all 124 occurrences each → loaded by 124 of 233 workflows)

Each of these tools shows exactly 124 references, which strongly suggests 124 workflows enable the full GitHub MCP read-toolset together rather than picking individual tools:

get_commit, get_file_contents, get_pull_request, get_pull_request_diff, get_pull_request_files, get_pull_request_review_comments, get_pull_request_reviews, get_pull_request_status, get_pull_request_comments, get_workflow_run, get_workflow_run_logs, get_workflow_run_usage, get_job_logs, download_workflow_run_artifact, list_branches, list_commits, get_tag, get_release_by_tag, get_latest_release, issue_read, get_discussion, get_discussion_comments, get_label, get_me, get_notification_details, get_code_scanning_alert, list_code_scanning_alerts, get_dependabot_alert, list_dependabot_alerts, get_secret_scanning_alert.

Engine distribution:

Engine	Workflows	Share
Copilot	153	65.7%
Claude	62	26.6%
Codex	13	5.6%
Pi	2	0.9%
Crush	1	0.4%
Gemini	1	0.4%
OpenCode	1	0.4%

Interesting Findings

Tight size band. Min 62.8 KB, max 177.4 KB — a 2.8× spread across 233 workflows. The compiled scaffold is the dominant contributor to lockfile size, not workflow-author content.
schedule + workflow_dispatch is the dominant pattern (66.5% of all workflows). Only 3 workflows are pull_request-only; the repo is overwhelmingly oriented toward autonomous scheduled agents, not PR gatekeepers.
All 5 smoke-*-claude/copilot/codex lockfiles cluster at the top of the size list (4 of the top 5 largest), reflecting the broader test/setup matrix smoke tests carry.
The github MCP server is referenced 6,448 times across lockfiles — roughly 27.7 references per workflow on average, dwarfing every other MCP server combined (≈300). Reducing this footprint (selective tool enabling instead of full toolset opt-in) would meaningfully shrink lockfiles.
124-of-233 workflows enable the full GitHub MCP read-toolset together (every top-30 GitHub tool shows the same count of 124). This is the single biggest target for tool-narrowing across the repo.
All 233 lockfiles use permissions: {} at the top level — a strict, opt-in security posture is universal.
No cron at :00 or :30 outside a handful of cases. The scheduler-spreading guidance is being followed broadly: most off-zero minutes (e.g. :23, :38, :49) are in active use.

Historical Trends

No reliable prior-day comparison: the only history entry (2026-05-20.json) was generated by an earlier broken analyzer version that reported 0 lockfiles. Today's run (schema v1) writes a clean baseline at /tmp/gh-aw/cache-memory/history/2026-05-21.json — future runs will be able to diff against it.

Recommendations

Bump analyzer to lockfile_stats_v2.py to parse the embedded GITHUB_AW_SAFE_OUTPUTS_* JSON env vars so safe-output type counts and discussion categories can be recovered. Today's report has a gap there.
Audit the 124 workflows pulling the full GitHub MCP read-toolset. If those workflows only use 2–3 GitHub tools each, narrowing the enabled set would noticeably shrink compiled lockfiles (top contributor to the 22.5 MB total).
Investigate the smoke- engine workflows topping the size chart.* If smoke matrices include unused engine setup blocks, trimming them would reduce repo-wide compiled bytes.
Install PyYAML in the analyzer runtime (or vendor a small parser) so per-job permission and trigger details can be aggregated precisely instead of via regex fallback.
Continue the :00/:30 cron-avoidance convention — it is already nearly universal, worth codifying in a linter check.

Methodology

Approach: single-script compact JSON analysis. One bash invocation wrote and executed /tmp/gh-aw/cache-memory/scripts/lockfile_stats_v1.py, which scanned all 233 .lock.yml files and emitted a single ≤50 KB JSON summary at /tmp/gh-aw/agent/lockfile-stats-summary.json. All discussion text below was derived from that summary, not from re-reading lockfiles.
YAML parser: unavailable in this runner — analyzer fell back to regex extractors. Counts for triggers, schedules, jobs/steps, timeouts, engines, and MCP usage remain reliable; per-key safe-output and per-job permission breakdowns are partial (see Limitation note above).
Caching: an earlier cached version of lockfile_stats_v1.py was discovered to be broken (skipped all 233 files); it was overwritten in-place and rerun. Today's run is the new baseline.
Skipped files: 0.

References:

§26252187909

Generated by 📊 Lockfile Statistics Analysis Agent · ● 5.7M · ◷

expires on May 22, 2026, 8:55 PM UTC

2026-05-22T20:49:58Z

github-actions[bot]
Bot May 22, 2026
Author

This discussion has been marked as outdated by Lockfile Statistics Analysis Agent.

A newer discussion is available at Discussion #34101.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[lockfile-stats] Lockfile statistics audit — 233 workflows, 22.55 MB, 2026-05-21 #33855

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

[lockfile-stats] Lockfile statistics audit — 233 workflows, 22.55 MB, 2026-05-21 #33855

Uh oh!

github-actions[bot] Bot May 21, 2026

Executive Summary

File Size Distribution

Trigger Analysis

Safe Outputs Analysis

Structural Characteristics

Permission Patterns

Tool & MCP Patterns

Interesting Findings

Historical Trends

Recommendations

Methodology

Replies: 1 comment

Uh oh!

github-actions[bot] Bot May 22, 2026 Author

github-actions[bot]
Bot May 21, 2026

github-actions[bot]
Bot May 22, 2026
Author